D
Doc and Tell
Glossary/ai/ml
ai/ml

BM25

A probabilistic keyword-ranking algorithm that scores documents by term frequency and inverse document frequency.

BM25 (Best Matching 25) is a bag-of-words retrieval function that ranks documents based on how often query terms appear, adjusted for document length. It excels at matching exact terminology, acronyms, and proper nouns that semantic search sometimes misses.

In hybrid retrieval pipelines, BM25 complements vector search. Combining both with reciprocal rank fusion yields significantly better recall than either method alone, which is critical for compliance workflows where missing a relevant clause could have legal consequences.

Analyze Documents Related to BM25

Upload any document and get AI-powered analysis with verifiable citations.

Start Free