RAG Engineering Mastery4 / 10

Hybrid Retrieval — Keyword + Vector

Vector search understands meaning but fumbles exact terms, IDs, and rare words. Keyword search nails those and misses paraphrase. Use both.

Published May 9, 20261 min readHaythem Rehouma · Claude Mastery

Vector search is great at "what does this mean" and bad at "find the chunk that literally says ERR_CONN_4032." Keyword search is the opposite. Production RAG uses both.

Where each one wins

Vector — paraphrase, concepts, "how do I cancel" matching "subscription termination."
Keyword (BM25) — exact terms, error codes, product names, acronyms, rare jargon the embedding smooths over.

Run both for every query; you get two ranked lists.

Fusing the lists with RRF

Reciprocal Rank Fusion combines ranked lists without needing comparable scores: each document gets 1 / (k + rank) from each list, summed. Documents that rank well in either list rise; documents strong in both dominate.

score(doc) = Σ  1 / (k + rank_in_list_i)     # k ≈ 60

It is a few lines of code, needs no score calibration, and reliably beats either retriever alone.

Where each one wins

Fusing the lists with RRF

Share this article

Series — RAG Engineering Mastery

Keep learning

The Claude Mastery course