Evaluating Good Search Part I: Measure It

Source: https://medium.com/@dtunkelang/evaluating-good-search-part-i-measure-it-5507b2dbf4f6 Author: Daniel Tunkelang Series: Evaluating Search (Part 1 of 4)

Summary

Opening of Tunkelang’s four-part series on search evaluation. Anchored in Lord Kelvin’s principle: “If you cannot measure it, you cannot improve it.” Covers the full taxonomy of supervised and unsupervised search metrics.

Supervised Metrics (require judgment labels)

Metric	What it measures
Precision	Fraction of returned results that are relevant
Recall	Fraction of all relevant results that were returned
Precision@k	Precision for top k results only
Average Precision@k	Weighted avg giving more weight to top-ranked results
NDCG	Accounts for relevance gradations + position discounting

Unsupervised Metrics (from behavior)

Metric	What it measures
CTR	Fraction of searches that receive clicks
MRR	Weighted click signal favoring earlier positions
Conversions	Stronger signal than clicks (purchase, signup, etc.)

Key Insight

Conversions are sparse but strong; clicks are plentiful but noisy. Individual components (spelling correction, autocomplete) need their own targeted metrics.

Series

Measure It (this article)
Measuring Searcher Behavior
Evaluating Search - Using Human Judgments (already processed)
When There’s No Conversion Rate

People

Daniel Tunkelang

Awesome Search KG

Explorer

Evaluating Good Search Part I: Measure It

Evaluating Good Search Part I: Measure It

Summary

Supervised Metrics (require judgment labels)

Unsupervised Metrics (from behavior)

Key Insight

Series

People

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

Evaluating Good Search Part I: Measure It

Evaluating Good Search Part I: Measure It

Summary

Supervised Metrics (require judgment labels)

Unsupervised Metrics (from behavior)

Key Insight

Series

Related Concepts

People

Graph View

Table of Contents

Backlinks