Search Result Diversity

When and how to diversify search results — ensuring a ranked list covers multiple aspects of user intent rather than converging on one interpretation.

Why Diversity Matters

Diversity serves two distinct purposes:

1. Ambiguity Hedging

The query has multiple valid interpretations:

“jaguar” → car, animal, OS
“apple” → fruit, company, color
“necklace” → gold, silver, minimalist, statement, budget, luxury

When intent is uncertain, a homogeneous result list serves only one segment of users. Diversity covers the intent space.

2. Redundancy Reduction

Even for clear queries, near-duplicate results waste rank positions. Showing ten barely-different black t-shirts is worse than showing three variations.

The Goldilocks Problem

Too little diversity: results all look the same → boring, wastes real estate Too much diversity: results are incoherent → users can’t find what they need

The Wundt Curve describes this: user satisfaction increases with diversity up to a point, then drops as the result set becomes incoherent. Finding the optimal λ is empirical.

Methods

MMR (Maximal Marginal Relevance)

Active diversification during reranking:

score(d) = λ × rel(d, q) − (1−λ) × max_sim(d, already_selected)

Greedy selection: pick the next item that maximizes the MMR score. λ=1 = pure relevance, λ=0 = pure diversity.

See MMR for full formula and implementation.

Entropy-Based (Etsy’s Approach)

Measure Shannon entropy over category/attribute distributions in the result set. Set an entropy target per query type. Re-rank or inject items to meet the target.

Works well for marketplace search where items have structured categories.

Category/Subtopic Caps (Hard Diversity)

Limit N results per category: “max 2 results per domain”, “max 3 results per price tier.”

Simple and interpretable but requires explicit taxonomy.

KL-Divergence

Measure divergence between the result list’s topic distribution and the corpus’s topic distribution. Penalize result sets that over-concentrate on one subtopic.

When to Apply Diversity

Query type	Apply diversity?	Why
Broad head queries (“shoes”, “necklace”)	Yes — strongly	Huge intent space; homogeneity wastes real estate
Ambiguous queries (“python”, “apple”)	Yes — intent coverage	Multiple distinct interpretations
Category queries (“winter jackets”)	Yes — moderate	Covers different styles, price points
Navigational (“Nike Air Max 270”)	No	User intent is specific; diversity adds noise
Exact product (“B08N5LNQCV”)	No	Single correct answer

Detecting Broad/Ambiguous Queries

Classifier features:

Query length (shorter = more likely broad)
Query frequency (head queries tend to be broader)
Click entropy on historical results (high click entropy → ambiguous)
Number of distinct categories in past click sessions for this query
Absence of specific attributes (no color, size, brand mentioned)

Diversity vs. Precision Trade-off

Standard precision metrics (NDCG@k, P@k) reward near-duplicate top hits — they don’t penalize redundancy. If you optimize purely for NDCG, diversity suffers.

Diversity-aware metrics:

α-nDCG: penalizes redundant relevant documents at lower ranks
ERR-IA: intent-aware Expected Reciprocal Rank
APD: Average Pairwise Distance — passive diversity measurement

Track APD alongside NDCG to catch diversity regressions.

Diversity and faceted search are complementary:

Facets: user explicitly constrains intent — diversity becomes less important
No facets selected: diversity hedges the ambiguity the user didn’t resolve

Broad-query diversification is “pre-emptive faceting” — the system makes the intent-coverage decision the user didn’t.

Key People

Daniel Tunkelang — Searching for Goldilocks (Wundt Curve, λ-MMR), Thoughts on Search Result Diversity (KL-divergence, greedy reranking)
Andreas Wagner — Three Pillars framework (Discovery pillar); MICES talk on diverse result sets showing how result positioning shapes user basket composition

MMR — primary active diversification algorithm
APD — passive diversity measurement
E-commerce Search — diversity is especially critical for head queries
Faceted Search — complementary user-controlled disambiguation

Awesome Search KG

Explorer

Search Result Diversity

Search Result Diversity

Why Diversity Matters

1. Ambiguity Hedging

2. Redundancy Reduction

The Goldilocks Problem

Methods

MMR (Maximal Marginal Relevance)

Entropy-Based (Etsy’s Approach)

Category/Subtopic Caps (Hard Diversity)

KL-Divergence

When to Apply Diversity

Detecting Broad/Ambiguous Queries

Diversity vs. Precision Trade-off

Relationship to Facets

Key People

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

Search Result Diversity

Search Result Diversity

Why Diversity Matters

1. Ambiguity Hedging

2. Redundancy Reduction

The Goldilocks Problem

Methods

MMR (Maximal Marginal Relevance)

Entropy-Based (Etsy’s Approach)

Category/Subtopic Caps (Hard Diversity)

KL-Divergence

When to Apply Diversity

Detecting Broad/Ambiguous Queries

Diversity vs. Precision Trade-off

Relationship to Facets

Key People

Related

Graph View

Table of Contents

Backlinks