LEMUR reduces multi-vector retrieval for late interaction models such as ColBERT into regular single-vector retrieval.
LEMUR is a Python library implementing a fast learned method for multi-vector retrieval to approximate and rerank maximum similarity scores between queries and large corpora of token embeddings.
How It Works
You learn about Lemur, a speedy tool that helps find the best matches in huge collections of documents or items super fast.
You add Lemur to your computer with a simple command, and it's ready to go in moments.
You gather your big list of items, like documents, each described by numbers from their key parts, and note how many parts each has.
You teach Lemur about your collection by letting it learn patterns – it practices on samples to get really good at spotting similarities.
You describe what you're looking for with similar number descriptions, and Lemur scans the whole collection lightning-fast.
Lemur quickly narrows down thousands of possibilities to your best handful of matches, perfectly ranked.
Now you can zip through massive libraries or datasets, finding exactly what you need every time, feeling like a pro.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.