Glossary
Plain-language definitions for the ecommerce search, product discovery, and AI relevance concepts we work with every day.
A
B
C
Click-Through Rate (CTR)
The percentage of search-result impressions that result in a click — the primary engagement signal for measuring search ranking quality.
Conversion Rate
The percentage of searches that result in a purchase — the dollar-value endpoint of search quality, downstream of CTR and add-to-cart.
Cosine Similarity
A distance metric that measures the angle between two vectors, ignoring their magnitude — the default similarity score for embedding-based search.
Cross-Encoder
A transformer that scores query and document together in a single pass — slower than bi-encoders but much more accurate, used as a re-ranker on the top-N candidates.
D
E
Embedding
A dense numerical vector that represents a piece of text, image, or other input in a learned space where geometric distance reflects semantic similarity.
Entity Recognition
The NLP step that finds and classifies named things in a query — brands, sizes, colors, materials, locations — so they can be routed to facets instead of full-text search.
F
G
H
I
K
L
Learning to Rank
A family of supervised ML techniques that learn a ranking function from labeled query-result pairs, replacing hand-tuned scoring formulas with a model trained on relevance data.
Long-tail Query
A search query that appears rarely or only once in your search logs — individually low-traffic but collectively dominant, and historically the hardest queries to rank well.
M
Mean Reciprocal Rank (MRR)
A ranking metric that averages 1/(position of first relevant result) across queries — rewarding rankers that put the right answer first, indifferent to deeper ranking quality.
Merchandising
Manual or rule-based overrides that pin, boost, bury, or replace specific products in search results — the human layer on top of algorithmic ranking.
N
O
P
Personalization
Tailoring search results to the individual shopper based on their history, behavior, or declared preferences — a per-user re-ranking layer on top of relevance.
Precision
Of the results returned, what fraction are relevant — the “quality, not quantity” IR metric, complementary to recall.
Q
R
Re-ranking
A second-stage scoring pass that takes the top-N results from a fast retriever (BM25 or vector) and re-orders them with a more expensive, higher-quality model.
Recall
Of all the relevant items in the catalog, what fraction did the search return — the coverage metric, complementary to precision.
S
Semantic Search
Vector-based retrieval that matches by meaning rather than exact keywords, using embeddings to find products even when query and title share no words.
Synonyms
Equivalent terms a search engine treats as interchangeable so a query for one returns results containing the other (couch ↔ sofa, sneaker ↔ trainer).
T
V
Vector Search
Approximate nearest-neighbor (ANN) retrieval over high-dimensional embeddings — the engine behind semantic search, recommendation, and visual similarity.
Visual Search
Image-to-product search — upload or paste a photo, get matching catalog items based on visual similarity rather than text.