Vector Filtering

Definition

Vector filtering refers to combining ANN (approximate nearest neighbor) similarity search with metadata filters — e.g., finding the most similar vectors that also match category="shoes" AND price<100.

This is harder than it sounds: standard ANN indexes are built for pure similarity search and don’t natively handle arbitrary metadata predicates.

The Filter Problem

Three naive approaches, each with serious flaws:

Pre-filtering

Filter to matching docs first
Run ANN on the filtered subset

Problem: If the filter is selective (1% of corpus matches), the filtered subset may be too small for ANN to work well, or the subset doesn’t have a usable ANN index.

Post-filtering

Run ANN to get top-K candidates
Filter candidates by metadata

Problem: If 90% of top-K candidates are filtered out, effective recall collapses. Need to over-retrieve K by 10–100x, killing latency.

Re-indexing per filter

Pre-build separate ANN indexes for each filter combination.

Problem: Combinatorial explosion. n_categories × n_price_buckets × ... indexes is infeasible.

Single-Stage Solution (Pinecone’s Approach)

James Briggs of Pinecone explains their single-stage approach:

Metadata is stored alongside vectors at index time
During search: metadata filter and ANN search happen simultaneously
The index is structured so metadata predicates can prune the search space during graph traversal (HNSW)
No separate filter step, no over-retrieval

This avoids the pre-filter/post-filter tradeoff by making filtering a native operation.

HNSW with Filters

In HNSW (Hierarchical Navigable Small World) graphs:

Traditional traversal: follow edges to nearest neighbors
Filtered traversal: follow edges only to nodes matching the predicate
Challenge: if predicate is highly selective, graph becomes poorly connected in filtered subspace

Solutions:

Build separate graph layers per filter value (for high-cardinality fields)
Hybrid: coarse filter first, then graph search in filtered subspace

Elasticsearch Dense Vector Filtering

GET /products/_knn_search
{
  "knn": {
    "field": "embedding",
    "query_vector": [...],
    "k": 10,
    "num_candidates": 100,
    "filter": {
      "term": {"category": "shoes"}
    }
  }
}

Elasticsearch’s kNN search applies the filter during candidate selection.

Selective Filtering Impact

Filter Selectivity	Pre-filter Risk	Post-filter Risk
50% of corpus	Low	Low
10% of corpus	Medium	Medium
1% of corpus	High (ANN degrades)	High (recall collapse)
0.1% of corpus	Critical	Critical

Dense Vector Retrieval — base ANN search infrastructure
Hybrid Search — combines metadata filters with both sparse and dense
RAG — often needs filtered retrieval (e.g., retrieve only from specific docs)
Sparse Vector Retrieval — sparse indexes handle filters more naturally
ACORN-1 — HNSW extension that uses multi-hop neighborhoods to maintain recall under aggressive filtering

People

James Briggs — “The Missing WHERE Clause in Vector Search” (Pinecone)

Awesome Search KG

Explorer

Vector Filtering

Vector Filtering

Definition

The Filter Problem

Pre-filtering

Post-filtering

Re-indexing per filter

Single-Stage Solution (Pinecone’s Approach)

HNSW with Filters

Elasticsearch Dense Vector Filtering

Selective Filtering Impact

People

Graph View

Table of Contents

Backlinks

Awesome Search KG

Explorer

Vector Filtering

Vector Filtering

Definition

The Filter Problem

Pre-filtering

Post-filtering

Re-indexing per filter

Single-Stage Solution (Pinecone’s Approach)

HNSW with Filters

Elasticsearch Dense Vector Filtering

Selective Filtering Impact

Related Concepts

People

Graph View

Table of Contents

Backlinks