sigridjineth / bb25
Publicbb25 is a fast, self-contained BM25 + Bayesian calibration implementation with a minimal Python API.
bb25 is a Python tool for ranking documents in search systems using an advanced Bayesian version of the BM25 algorithm, including hybrid fusion with vector similarities and validation benchmarks.
How It Works
You learn about bb25, a helpful tool that makes finding the right information in a bunch of texts smarter and more accurate.
You quickly set up bb25 on your computer so it's all prepared to use.
You play with the ready-made examples of texts and questions to see how bb25 ranks matches perfectly right away.
You build your own collection by adding documents and their special number patterns that capture meaning.
You give bb25 your search words and watch it score each document to show the best matches first.
You combine word matching with meaning similarities for even better hybrid search results.
You test everything with built-in experiments and benchmarks to confirm it works great.
Your searches now deliver the most relevant findings quickly, outperforming usual methods.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.