ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis
ATBench provides benchmark datasets of AI agent interaction trajectories for evaluating safety in long-horizon tool-using scenarios.
How It Works
You hear about ATBench, a collection of real-life test stories for checking if AI helpers stay safe while using tools over many steps.
You visit the main page to read stories, see example tests, and learn how it helps make AI agents safer.
You choose between the big new set of 1,000 tests or the original 500 for your safety checks – both balanced with safe and risky examples.
You easily download the complete interaction stories, each with user requests, AI replies, tool uses, and outcomes.
You run your AI agent through these multi-turn scenarios to see if it spots dangers and stays on the safe path.
You examine results for risks, failure spots, and potential real harms to understand what went wrong or right.
With clear insights from the tests, you improve your AI helper to handle tools and long tasks more safely.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.