A testing tool for Agent Skills that runs AI model prompts with and without a skill, uses a judge model to score outputs, and generates benchmark reports.
How It Works
While building smart AI agent skills, you find this handy tester that proves if they really help AI think better.
Put your skill guide and a few test prompts with expected results into a simple folder.
Pick an AI brain like GPT and let the tool connect so it can run smart tests.
Tell the tool to check your folder, and it quietly runs tests twice—once with your skill, once plain.
Sit back as a smart judge reviews each test, grading success with clear reasons.
Open the beautiful webpage report showing pass rates, side-by-side proofs, and hard evidence your skill shines!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.