Tweety is a command-line toolkit that evaluates AI language models on 14 structured tasks covering text comprehension, reasoning, vision, structured outputs, safety, and performance metrics with detailed reports.
How It Works
You hear about Tweety, a friendly bird that helps test how smart AI helpers are at understanding text, thinking, seeing pictures, staying safe, and running fast.
Download and install Tweety easily so it's ready to use right away.
Link Tweety to a thinking service like GPT so it can fairly score the AI's answers.
Tweety gathers stories, puzzles, pictures, and challenges once, ready for any AI.
Pick your AI and watch Tweety run it through fun challenges, measuring smarts and speed.
Open colorful charts and summaries showing strengths, weaknesses, and scores.
You now know exactly how well your AI performs and where to make it even better!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.