Pointwise vs. Pairwise vs. Listwise Learning to Rank

Source: https://medium.com/@nikhilbd/pointwise-vs-pairwise-vs-listwise-learning-to-rank-80a8fe8fadfd Author: Nikhil Dandekar

Summary

A concise taxonomy of the three families of Learning to Rank approaches, distinguished by how many documents the loss function considers at once. This is the canonical framing for Ranking Objectives.

The Three Families

Pointwise

Looks at a single document at a time. Trains a standard regressor/classifier to predict a relevance score per document; the final ranking is produced by sorting on those scores. Each document’s score is independent of the others in the result list. Any standard regression/classification algorithm applies directly.

Pairwise

Looks at a pair of documents at a time. The model learns the optimal ordering for each pair and is penalized for inversions (pairs ordered wrongly relative to ground truth). Pairwise works better in practice than pointwise because predicting relative order is closer to the nature of ranking than predicting an absolute label. RankNet, LambdaRank and LambdaMART are pairwise approaches.

Listwise

Looks at the entire list and optimizes its ordering directly. Two sub-techniques:

Direct optimization of IR measures such as NDCG — e.g. SoftRank, AdaRank.
Custom listwise loss capturing properties of the target ranking — e.g. ListNet, ListMLE.

Listwise approaches can get considerably more complex than pointwise or pairwise.

Why It Matters

The choice of family is the choice of ranking objective — what the model is actually told to optimize. Predicting relative order (pairwise) or list quality (listwise) aligns the training signal with how ranking quality is measured, which is why the pairwise/listwise lineage (RankNet → LambdaRank → LambdaMART) dominates production LTR.

How LambdaMART Works — the pairwise-swap trick that approximates a listwise metric

Primary Sources (from the article’s footnotes)

From RankNet to LambdaRank to LambdaMART: An Overview — Burges, Microsoft Research (the definitive lineage overview)
SoftRank: Optimising Non-Smooth Rank Metrics — listwise, metric smoothing
AdaRank on LETOR — boosting for IR measures
Learning to Rank: From Pairwise Approach to Listwise Approach (ListNet)
Position-Aware ListMLE: A Sequential Learning Process for Ranking
Tutorials: Learning to Rank for IR (WWW2009) · A Short Introduction to Learning to Rank

People

Nikhil Dandekar — author

Awesome Search KG

Explorer

Pointwise vs. Pairwise vs. Listwise Learning to Rank

Pointwise vs. Pairwise vs. Listwise Learning to Rank

Summary

The Three Families

Pointwise

Pairwise

Listwise

Why It Matters

Primary Sources (from the article’s footnotes)

People

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

Pointwise vs. Pairwise vs. Listwise Learning to Rank

Pointwise vs. Pairwise vs. Listwise Learning to Rank

Summary

The Three Families

Pointwise

Pairwise

Listwise

Why It Matters

Related Concepts

Related Articles

Primary Sources (from the article’s footnotes)

People

Graph View

Table of Contents

Backlinks