Harikishanth / Incident-Triage-Environment
An OpenEnv benchmark testing the ability of AI agents to act as Site Reliability Engineers (SREs) by diagnosing and filtering raw production failure logs.
An interactive benchmark for testing AI agents on diagnosing root causes and planning fixes for simulated production system outages across easy, medium, and hard scenarios.
How It Works
You hear about a tool that tests how well AI can spot and fix computer system emergencies, like when websites crash.
Head straight to the online playground, where everything is already configured, and start playing.
Choose an easy, medium, or hard scenario to challenge your AI agent, from simple fixes to full-blown disasters.
Get a bundle of clues like error messages, user complaints, and charts showing what's breaking.
Type out what you think went wrong and the step-by-step plan to make it right again.
Submit your answer and see your score plus helpful notes on what was spot-on or needs tweaking.
Watch your scores climb as your AI gets better at triaging crises, ready for real-world incidents.
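The loop described above (pick a scenario, read the clues, submit a root cause and a fix plan, get a score with notes) can be sketched as a toy example. This is only an illustration under stated assumptions: the names `Incident`, `Submission`, and `grade` are hypothetical and are not taken from the repository or from OpenEnv, and the keyword-matching scorer is a stand-in, since the source does not say how the benchmark actually grades answers.

```python
from dataclasses import dataclass


@dataclass
class Incident:
    """Hypothetical scenario: a difficulty level, clues, and grading keywords."""
    difficulty: str
    clues: list[str]
    expected_keywords: set[str]


@dataclass
class Submission:
    """Hypothetical answer: a root-cause statement plus an ordered fix plan."""
    root_cause: str
    fix_plan: list[str]


def grade(incident: Incident, submission: Submission) -> tuple[float, dict[str, list[str]]]:
    # Assumed scoring rule: fraction of expected keywords found in the answer.
    text = " ".join([submission.root_cause, *submission.fix_plan]).lower()
    matched = sorted(k for k in incident.expected_keywords if k in text)
    missed = sorted(incident.expected_keywords - set(matched))
    score = len(matched) / len(incident.expected_keywords)
    return score, {"matched": matched, "missed": missed}


# Made-up clues for illustration only; not real benchmark data.
incident = Incident(
    difficulty="easy",
    clues=[
        "ERROR db: connection pool exhausted (50/50 in use)",
        "User report: checkout page times out",
    ],
    expected_keywords={"connection pool", "restart"},
)
submission = Submission(
    root_cause="The database connection pool is exhausted",
    fix_plan=["Raise the pool size limit", "Add an alert on pool usage"],
)
score, notes = grade(incident, submission)
print(score, notes)
```

The notes dictionary plays the role of the feedback step: it separates what the answer got right from what it left out, which is the kind of signal an agent would use to improve on the next attempt.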