A lab for reproducibly evaluating AI agent safety through mock and real traces, tool policies, and classification benchmarks on public datasets.
How It Works
You find this handy lab online that helps check how safe AI helpers are when they use tools and chat with people.
You make a cozy spot on your computer by following simple steps to prepare everything for testing.
You launch a demo test on pretend AI behaviors and instantly see which ones pass or raise red flags.
Colorful charts and summaries pop up, showing safe actions, blocked risks, and what needs a closer look.
You try tests with actual conversation examples to spot hidden dangers in AI responses.
Explore lists of failures, risk scores, and patterns to understand exactly what's going wrong.
You gain clear insights to make your AI helpers smarter and safer, ready for real use.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.