

Search and Ranking Systems

Staff Engineer @ BloomReach — Oct 2013 to Mar 2018
Location: Bengaluru, India

Patents and Blog

Overview
Led the design and implementation of an automated synonym generation pipeline for BloomReach Discovery, processing 100 million+ product descriptions and 30 million+ queries weekly to extract high-quality synonym pairs for e-commerce search. As deep learning emerged, designed deep neural networks to match user queries against document text and images.

E-commerce search faces messy, short queries (“nj devils tees”), shifting language (“jorts”), and ambiguity (“frozen” the temperature vs. Frozen the movie). Synonyms aren’t just dictionary swaps; they’re a controlled expansion of user intent.

Goal. Bridge query–catalog mismatch without flooding results. Boost recall while preserving precision and rank quality.
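To make "controlled expansion" concrete, here is a minimal sketch of query-time synonym expansion with a confidence floor and a per-term cap. It is an illustration only, not the BloomReach implementation: the `SYNONYMS` table, its confidence scores, the `expand_query` function, and the `min_conf` and `max_per_term` parameters are all hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical mined synonym table: term -> [(synonym, confidence)].
# The entries and scores are invented for illustration.
SYNONYMS: Dict[str, List[Tuple[str, float]]] = {
    "tees": [("t-shirts", 0.95), ("shirts", 0.70), ("golf tees", 0.20)],
    "nj": [("new jersey", 0.90)],
}


def expand_query(
    query: str,
    synonyms: Dict[str, List[Tuple[str, float]]] = SYNONYMS,
    min_conf: float = 0.6,
    max_per_term: int = 2,
) -> List[Tuple[str, List[Tuple[str, float]]]]:
    """Expand each query term with its highest-confidence synonyms.

    The original term is always kept with weight 1.0. Synonyms below
    min_conf are dropped and at most max_per_term are added per term,
    which is what keeps the recall gain from flooding the result set.
    """
    expanded = []
    for term in query.lower().split():
        candidates = [(s, c) for s, c in synonyms.get(term, []) if c >= min_conf]
        candidates.sort(key=lambda pair: pair[1], reverse=True)
        expanded.append((term, [(term, 1.0)] + candidates[:max_per_term]))
    return expanded


if __name__ == "__main__":
    for term, alternatives in expand_query("nj devils tees"):
        print(term, "->", alternatives)
```

In this sketch the low-confidence sense ("golf tees") is filtered out, so the query gains recall on "t-shirts" and "shirts" without pulling in the unrelated sense. The weights could then be used by a ranker to score synonym matches below exact matches.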

Synonym Types.

Data Scale. Weekly mining over ~100M product descriptions, billions of web lines, and ~30M queries to capture new vocabulary, trends, and seasonality.

Pipeline (Mining, not per-query).

Application.

Outcomes.
