Awesome Search KG

❯

❯

Improving information retrieval in the Elastic Stack: Introducing Elastic Learned Sparse Encoder (ELSER)

Improving information retrieval in the Elastic Stack: Introducing Elastic Learned Sparse Encoder (ELSER)

May 15, 20262 min read

clippings
company-blog

Elastic Learned Sparse Encoder (ELSER)

ELSER is a 100M-parameter pre-trained language model for zero-shot retrieval in Elasticsearch, built on the SPLADE architecture.

Performance

vs. BM25: 10 wins, 1 draw, 1 loss across 12 BEIR datasets; average +17% NDCG@10
Outperforms SPLADEv2; integrates as a text_expansion query clause

Why SPLADE architecture?

Storage efficient: Documents expand to ~100 tokens on average — approximate size parity with normal text indices
Inverted index compatible: Leverages mature Lucene with superior memory efficiency vs. ANN
Controllable trade-offs: FLOPS regularizer balances quality vs. latency
Interpretable: Highlights matching words naturally

SPLADE uses token logits from masked word prediction (not simple synonym expansion).

Training

Distillation: cross-encoder teacher → sparse student, using MSE loss on score margins
Teacher ensemble: weighted combination of MiniLM and monot5-3b with score distribution smoothing
FLOPS regularization key findings:
- 99% of token pruning happens in first 50K training batches
- Reducing regularization or substituting sparser alternatives hurt benchmark performance
- Larger, diverse batches help more than clustered in-batch negatives
- Functions analogously to stop-word removal

Integration

Available via Elasticsearch Relevance Engine. Positioned as complementary to dense vector search, especially for cross-modal retrieval use cases.

Related Concepts

Embeddings — parent concept
Sparse Embeddings — ELSER is a learned sparse embedding model
ELSER — the model described in this article
SPLADE — the architecture ELSER is based on
BM25 — classical baseline; ELSER achieves +17% NDCG@10 over BM25
Hybrid Search — ELSER positioned as complement to dense vector search
Sparse Vector Retrieval — inverted index compatibility
NDCG — benchmark metric used throughout

People

Thomas Veasey — Elastic; ELSER co-author
Quentin Herreros — Elastic; ELSER co-author

Graph View

Elastic Learned Sparse Encoder (ELSER)
Performance
Why SPLADE architecture?
Training
Integration
Related Concepts
People

Backlinks

Map of Content: Agentic Search & Embeddings
Thomas Veasey

Created with Quartz v4.5.2 © 2026

GitHub
Discord Community