SIRA (Superintelligent Retrieval Agent)

Definition

SIRA is a retrieval architecture proposed by Meta that replaces complex multi-step agentic retrieval pipelines with a single, highly intelligent BM25 query generated by a strong LLM. The central thesis: retrieval quality is limited by query quality, not architecture complexity.

Core Idea

Most modern RAG systems assume that better retrieval requires more sophisticated machinery:

Vector databases and dense embeddings
Multi-step search loops (search → read → reformulate → search again)
Ensemble of dense retrievers + sparse + reranking
Agentic retrieval with tool use and reasoning chains

SIRA argues: what if one really smart BM25 query is enough?

Rather than building complex retrieval architectures, SIRA invests in making the retrieval agent reason like a domain expert:

Which exact keywords distinguish relevant from irrelevant documents
What terminology would appear in the answer
Which concepts are likely co-located in useful documents

Architecture

Query → LLM (domain expert reasoning) → Optimized BM25 query → BM25 retrieval → Results

The LLM reasons about:

Query intent and domain
Vocabulary that expert authors use (not user vocabulary)
Discriminating terms that separate relevant from irrelevant documents
Likely answer structure and where it would appear

Results

On BEIR benchmarks, SIRA wins average Recall@10 and NDCG@10 against:

BM25 (baseline)
E5, SPLADE (dense/sparse embedding models)
HyDE (Hypothetical Document Embeddings)
Search-R1, GrepRAG, ShellAgent (complex agentic retrieval systems)
Multiple dense retrievers in ensemble

Grep-style agentic systems performed poorly. The paper conclusion:

All the complexity, tool use, and multi-step reasoning may be doing less than one well-formulated query.

Implications

Retrieval is a query problem, not primarily an architecture problem
Expert knowledge of the domain matters more than retrieval algorithm sophistication
Complex agentic pipelines may add latency and failure modes without retrieval gains
The bottleneck is semantic translation: turning user intent into the vocabulary of relevant documents
Challenges the prevailing assumption that embeddings always beat lexical search

Connection to Search Theory

SIRA relates to the gap between user vocabulary and document vocabulary — a core problem in IR since the 1990s. Where HyDE generates a hypothetical document to bridge this gap, SIRA uses LLM reasoning to directly generate the right query vocabulary.

BM25 — the retrieval algorithm SIRA uses, but with optimized queries
Hypothetical Document Embeddings — alternative approach to the vocabulary gap
Agentic Search — the paradigm SIRA pushes back against
Query Understanding — what SIRA automates with LLM reasoning
Retrieval Pipeline — SIRA simplifies multi-stage pipelines to one stage

SRA - Superintelligent Retrieval Agent (raw article)
facebookresearchsira Superintelligent Retrieval Agent (SIRA) (raw article)
Superintelligent Retrieval Agent SIRA (processed article)

Awesome Search KG

Explorer

SIRA (Superintelligent Retrieval Agent)

SIRA (Superintelligent Retrieval Agent)

Definition

Core Idea

Architecture

Results

Implications

Connection to Search Theory

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

SIRA (Superintelligent Retrieval Agent)

SIRA (Superintelligent Retrieval Agent)

Definition

Core Idea

Architecture

Results

Implications

Connection to Search Theory

Related Concepts

Related Articles

Graph View

Table of Contents

Backlinks