Map of Content: Agentic Search & Embeddings

Entry point for the semantic knowledge graph covering the Agentic Search and Embeddings sections of the Awesome Search collection. Navigate by concept, person, or article.


Core Paradigms


Embedding Architectures

ArchitectureKey PropertyNote
Bi-EncoderIndependent encoding, fastTwo-tower, dual encoder
Cross-EncoderJoint encoding, accurateReranker
ColBERTPer-token MaxSimLate interaction
Late InteractionToken-level alignmentColBERT’s mechanism

Vector Types

Dense

Sparse

Constructed Query Vectors

Quantization

  • Vector Quantization — compressing embeddings: scalar (SQ8), binary (BBQ), product (PQ), K-Quants/I-Quants
  • BBQ — Elasticsearch’s Better Binary Quantization + OSQ; 32× compression, 10–40× speed
  • GGUF — quantized LLM weight format for local deployment (rerankers, query expansion)

Retrieval Strategies

ANN Index Structures

  • HNSW — graph-based; best recall/speed; dominant in Elasticsearch, Qdrant, Weaviate
  • IVF — cluster-based; lower memory; IVF-PQ for billion-scale

Quantization Methods


Text Preparation

  • Text Chunking — fixed, recursive, semantic, contextual methods

Key People

PersonAffiliationKey Contributions
Daniel TunkelangQueryUnderstanding.comAgentic Search, Bag-of-Documents, Pipeline Distillation
Omar KhattabStanfordColBERT creator
Matei ZahariaStanford/DatabricksColBERT co-creator
Jo Kristian BergumVespaColBERT 32x compression
Han XiaoJina AIjina-colbert-v1-en (8192 tokens)
Trey GraingerWormhole Vectors, “AI-Powered Search”
Stéphane ClinchantNAVER LABSSPLADE co-inventor
Thibault FormalNAVER LABSSPLADE co-inventor
James BriggsPineconeSPLADE explainer, Vector Filtering
Shaw TalebiFine-tuning text & multimodal embeddings
Asif MakhaniInfino AICo-authored Agentic Search with Tunkelang
Dima KanAivenWormhole Vectors implementation
Lester Solbakkenhornet.devMAD framework, defensive retrieval
Thomas VeaseyElasticBBQ/OSQ quantization benchmarks
Ivan PleshkovQdrantTurboQuant + RaBitQ implementation in Qdrant 1.18
Piotr Mazurektensoreconomics.comEmbeddings economics, FLOPS/dollar analysis
Quynh NguyenElasticMultilingual embedding hybrid search

Articles by Topic

ColBERT / Late Interaction

Matryoshka Embeddings

Text Chunking

Context-Aware / Task-Aware Embeddings

SPLADE & Sparse Retrieval

Constructed Query Vectors

Fine-tuning

Vector Infrastructure

Quantization

Case Studies


Concept Relationship Map

Agentic Search
  └── uses → Retrieval Pipeline
               ├── Stage 1: Bi-Encoder / Sparse Vector Retrieval / Hybrid Search
               └── Stage 2: Cross-Encoder / ColBERT (Late Interaction)

RAG
  └── uses → Dense Vector Retrieval
               ├── input: Text Chunking → Bi-Encoder embeddings
               └── query: Asymmetric Semantic Search / HyDE / Task-Aware Embeddings

Hybrid Search
  ├── sparse leg: SPLADE / ELSER / BM25
  └── dense leg: Bi-Encoder
       └── advanced: Wormhole Vectors / Bag-of-Documents Model

Embedding Quality
  ├── Matryoshka Embeddings (flexible dimensions)
  ├── Embedding Fine-tuning (domain adaptation)
  ├── Multimodal Embeddings (image + text)
  └── Vector Filtering (metadata + ANN)

New: Direct Corpus Interaction

New: ColBERT Training