A/B Testing Search: Thinking Like a Scientist

Source: https://medium.com/@jamesrubinstein/a-b-testing-search-thinking-like-a-scientist-1cc34b88392e Author: James Rubinstein

Summary

Applies scientific thinking to search A/B testing — how to form proper hypotheses, design tests, and interpret results without p-hacking or confirmation bias.

Core Principles

Pre-registration of Hypotheses

Define success criteria before running the experiment. Post-hoc interpretation (“we expected this result”) leads to confirmatory bias and false positives.

Statistical Rigor

Set minimum detectable effect size based on business significance, not just statistical significance
Account for multiple comparisons (testing 10 metrics means false positive rate multiplies)
Run experiments for a full week cycle to avoid day-of-week effects

The Scientist’s Mindset

Treat each experiment as a chance to learn, not just to confirm a belief. A “failed” experiment that reveals a hidden interaction or explains user behavior has value.

Search-Specific Complications

Query volume varies enormously — tail queries have low power
Session context means one query’s result affects the next
Different user segments may respond oppositely to the same change

Key Complement to Offline Evaluation

A/B tests confirm or refute what Search Evaluation and Judgment Lists predicted offline. The two methods together catch both relevance regressions AND user experience issues that don’t show up in static judgment sets.

People

James Rubinstein

Awesome Search KG

Explorer

A/B Testing Search: Thinking Like a Scientist

A/B Testing Search: Thinking Like a Scientist

Summary

Core Principles

Pre-registration of Hypotheses

Statistical Rigor

The Scientist’s Mindset

Search-Specific Complications

Key Complement to Offline Evaluation

People

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

A/B Testing Search: Thinking Like a Scientist

A/B Testing Search: Thinking Like a Scientist

Summary

Core Principles

Pre-registration of Hypotheses

Statistical Rigor

The Scientist’s Mindset

Search-Specific Complications

Key Complement to Offline Evaluation

Related Concepts

People

Related Articles

Graph View

Table of Contents

Backlinks