Semantic Search: Definition & Meaning — AI Wiki

Search that finds results based on meaning rather than exact keyword matches. Instead of looking for documents containing the word "fix," semantic search finds documents about "repair," "resolve," "patch," and "debug" because they mean similar things. It works by converting text into embeddings (numerical vectors) and finding the closest matches in vector space.

Why it matters

Semantic search is why modern search feels magical compared to keyword search. It powers RAG systems, documentation search, e-commerce product discovery, and support ticket routing. If you're building any application that needs to find relevant information, semantic search is probably the right approach.

Deep Dive

The pipeline: (1) encode your documents into embeddings using a model like BGE, E5, or Voyage, (2) store these embeddings in a vector database (Pinecone, Qdrant, Weaviate, pgvector), (3) when a query arrives, encode it with the same model, (4) find the nearest embeddings using similarity metrics like cosine similarity or dot product. The query "how to fix a memory leak" matches a document titled "debugging RAM consumption in Node.js" because their embeddings are close in vector space.

Hybrid Search

Pure semantic search has a weakness: it can miss exact matches that keyword search catches easily. If someone searches for error code "ERR_SSL_PROTOCOL_ERROR," semantic search might return general SSL troubleshooting instead of the exact error. Hybrid search combines both: keyword matching (BM25) for precision and semantic search for recall, then merges the results. Most production search systems use hybrid approaches.

Embedding Model Choice Matters

The quality of semantic search depends entirely on the embedding model. General-purpose models (OpenAI's text-embedding-3, Cohere Embed) work well for most text. Domain-specific models (trained on medical, legal, or code data) outperform general models in their domain. Multilingual models enable cross-language search. The MTEB leaderboard benchmarks embedding models across many tasks — it's the best resource for choosing one.

Semantic Search

Why it matters

Deep Dive

Hybrid Search

Embedding Model Choice Matters

Related Concepts