CL-bench is a high-quality benchmark dataset and evaluation tools for testing AI language models' ability to learn novel knowledge from context in realistic tasks across various domains.
How It Works
You stumble upon this smart set of tests designed to check how well AI chatbots pick up and use brand new information given right in the conversation.
You download the bundle of ready-to-use challenges, each with questions, new facts to learn, and clear scoring rules.
You connect your chosen AI service so it can chat and respond to these fresh learning tests.
You let your AI tackle all the challenges one by one, gathering its answers as it tries to learn and solve them.
A reliable checker goes through each response, matching it strictly against the rules to decide if it's spot on or not.
You see a simple score showing how often your AI fully nailed the challenges by learning from the new info provided.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.