chunk-norris is a Python tool that evaluates multiple document chunking strategies for RAG systems by scoring retrieval performance against user-provided questions and answers, then returns the best chunker for immediate use.
How It Works
You learn about chunk-norris, a helpful friend that tests different ways to break your long document into bite-sized pieces so AI can find answers faster and better.
You collect your full document text and write down 15-30 real questions people might ask, along with their exact answers from the text.
You create a simple tester using the built-in thinking tool that understands meaning without needing extra services.
You hand over your document, questions, and a few splitting styles like by size, sentences, or paragraphs, and it automatically tests each one to see which grabs the right info best.
A clear table appears showing scores for completeness and focus, highlighting the winning splitter that works best for your document.
You instantly get the top splitter ready to use, feeling confident you've picked the perfect method without guessing.
Your document is now split ideally, so your AI question-answering setup retrieves spot-on info every time, making searches smooth and reliable.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.