SPLADE

Definition

SPLADE (Sparse Lexical and Expansion / SParse Lexical AnD Expansion) is a neural sparse retrieval model that creates learned sparse embeddings via BERT’s Masked Language Model (MLM) head. It combines semantic understanding from transformers with the efficiency of inverted-index retrieval.

Developed by Thibault Formal, Stéphane Clinchant, and Benjamin Piwowarski at NAVER LABS Europe.

Core Mechanism

Input text → BERT → MLM head → 30,522-dim vocabulary distribution per token
→ Log-ReLU activation
→ Max-pool across tokens
→ Sparse vector (≈100-200 non-zero entries out of 30,522)

For each token in the input, the MLM head predicts probability over the entire vocabulary — enabling term expansion (predicting relevant terms not in the original text) and compression (suppressing uninformative terms).

Example: For document about binary numbers, SPLADE:

  • Expands: adds “computing”, “digit” (semantically related)
  • Compresses: removes conjunctions and articles
  • Result: 23-term sparse vector from a 60-term passage

Key Technical Components

Log saturation: Prevents single terms from dominating scores.

FLOPS regularizer: Penalizes computation cost to encourage sparsity, acts as implicit stop-word removal.

Versions

VersionInnovation
SPLADE v1Original: both query & document expansion
SPLADE v2Max pooling; document-only expansion (faster queries)
SPLADE-V3 (2024)Updated models; Hugging Face release

vs. BM25 and Dense Retrieval

BM25SPLADEDense (Bi-Encoder)
VocabularyFixed (term frequency)Expanded (learned)None (continuous)
SemanticsNoneGood (via BERT)Excellent
SpeedVery fastFast (inverted index)Fast (ANN)
InterpretabilityHighHigh (vocabulary terms)Low
Domain adaptationManualLearnedLearned

Advantages Over Dense Retrieval

  • No Vector Search infrastructure needed — works with standard inverted indexes
  • Interpretable representations (vocabulary-dimension vectors)
  • Strong zero-shot performance
  • Easier integration into existing Hybrid Search pipelines

Articles

People