🚀 An AI-native RCA platform for incident ingestion, evidence governance, runtime orchestration, and async diagnosis delivery. Built to help teams close the loop from alerts to diagnosis.
API server for a root cause analysis platform that ingests alerts from monitoring systems, collects evidence from Kubernetes/Prometheus/Elasticsearch, runs multi-agent AI diagnosis jobs, and manages operator workflows with audit trails.
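To make the ingestion step concrete, here is a minimal sketch of how an incoming alert might be normalized before diagnosis. It assumes the alert arrives in the Prometheus Alertmanager webhook shape (a top-level `alerts` list whose items carry `status`, `labels`, and `startsAt`). The `NormalizedAlert` type and the `normalize_alertmanager` function are hypothetical names for illustration, not part of this project's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NormalizedAlert:
    """Hypothetical internal alert record; field names are assumptions."""
    name: str
    severity: str
    started_at: str  # kept as the raw timestamp string from the payload
    labels: dict


def normalize_alertmanager(payload: dict) -> list:
    """Convert an Alertmanager-style webhook payload into NormalizedAlert items.

    Only alerts whose status is "firing" are kept; resolved alerts are skipped.
    """
    alerts = []
    for raw in payload.get("alerts", []):
        if raw.get("status") != "firing":
            continue
        labels = raw.get("labels", {})
        alerts.append(
            NormalizedAlert(
                name=labels.get("alertname", "unknown"),
                severity=labels.get("severity", "unknown"),
                started_at=raw.get("startsAt", ""),
                labels=labels,
            )
        )
    return alerts


# Illustrative payload (made-up values).
sample_payload = {
    "status": "firing",
    "alerts": [
        {
            "status": "firing",
            "labels": {"alertname": "HighErrorRate", "severity": "critical"},
            "startsAt": "2024-01-01T00:00:00Z",
        },
        {
            "status": "resolved",
            "labels": {"alertname": "DiskFull", "severity": "warning"},
            "startsAt": "2024-01-01T00:00:00Z",
        },
    ],
}
normalized = normalize_alertmanager(sample_payload)
```

Normalizing to one internal shape up front means the evidence collectors and diagnosis jobs do not need to know which monitoring system sent the alert.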
How It Works
You get a notification that something's wrong with your app or service.
Forward the alert to the platform, which starts investigating automatically.
The system gathers clues like metrics and logs, then uses AI to find the root cause.
Check the AI's diagnosis, evidence, and suggested fixes in a clear dashboard.
Approve the suggested fix and watch it resolve the issue.
If the diagnosis needs more work, request a deeper check with a new time range or extra hints.
Your service is back to normal, with a full audit trail of what happened.
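The steps above can be read as a small state machine with an audit log. The sketch below is one way to model that flow; the state names, the `Incident` class, and the allowed transitions are assumptions inferred from the steps, not the platform's actual data model.

```python
from dataclasses import dataclass, field
from enum import Enum


class State(Enum):
    RECEIVED = "received"
    INVESTIGATING = "investigating"
    DIAGNOSED = "diagnosed"
    APPROVED = "approved"
    RESOLVED = "resolved"


# Allowed transitions (hypothetical, inferred from the steps above).
TRANSITIONS = {
    State.RECEIVED: {State.INVESTIGATING},
    State.INVESTIGATING: {State.DIAGNOSED},
    # After a diagnosis the operator either approves it or requests a re-run.
    State.DIAGNOSED: {State.APPROVED, State.INVESTIGATING},
    State.APPROVED: {State.RESOLVED},
    State.RESOLVED: set(),
}


@dataclass
class Incident:
    alert_name: str
    state: State = State.RECEIVED
    audit_trail: list = field(default_factory=list)

    def advance(self, new_state: State, actor: str, note: str = "") -> None:
        """Move to new_state if allowed and record the change in the audit trail."""
        if new_state not in TRANSITIONS[self.state]:
            raise ValueError(
                f"cannot go from {self.state.value} to {new_state.value}"
            )
        self.audit_trail.append((self.state.value, new_state.value, actor, note))
        self.state = new_state


# Walk one incident through the flow, including a re-run request.
incident = Incident("HighErrorRate")
incident.advance(State.INVESTIGATING, "system")
incident.advance(State.DIAGNOSED, "system")
incident.advance(State.INVESTIGATING, "operator", "widen the time range")
incident.advance(State.DIAGNOSED, "system")
incident.advance(State.APPROVED, "operator")
incident.advance(State.RESOLVED, "system")
```

Recording every transition with its actor is what makes the final audit trail possible: each entry shows who moved the incident forward and why.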