Hybrid Search Explained

Question

What is hybrid search and why use it instead of just vector search?

Hybrid search = running BM25 (keyword) and vector (semantic) search at the same time, then merging the results.

You send a query (text + vector)
Weaviate runs two searches in parallel: BM25 scores chunks by keyword matching, vector scores chunks by how close their meaning is to the query (cosine similarity)
Results are merged using a parameter called alpha

Controls the balance between the two:

Exact term queries ("BM25 algorithm") - BM25 finds it precisely, vector might return vaguely related results
Vague queries ("how to find relevant text") - vector understands the meaning, BM25 misses because no exact match
Acronyms ("RAG") - BM25 finds the acronym, vector also finds "retrieval-augmented generation"

With alpha=0.5, you get the best of both worlds.

results = store.similarity_search(
    query,
    k=20,          # return top 20 candidates
    alpha=0.5,     # balanced hybrid
)

LangChain + Weaviate handles the merge automatically.