Calibre-Labs / reforge-ai-evals
PublicMarket Map agent eval suite for the Reforge AI Evaluation course
This repository offers prompts, test datasets, and scoring methods to evaluate an AI agent that ranks top companies in markets based on user queries, as demonstrated in a Reforge AI evaluation course.
How It Works
You stumble upon this handy collection from a course that helps test and improve an AI assistant for ranking top companies in any market.
You create free accounts for a testing area and your AI service so everything is ready to experiment safely.
You run a one-time setup to unlock special commands in your AI chat that make building tests super easy.
You copy ready lists of real-world questions like 'team chat apps' into your playground to challenge the AI.
You paste a smart instruction set and watch the AI generate ranked lists of top companies with reasons.
You apply simple checks to see if rankings are accurate, backed by facts, and handle tricky questions well.
Your AI now confidently maps any market with top picks and proof, ready for real use without surprises.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.