TaimoorKhan10 / replayd
PublicTurn failed AI agent runs into replayable regression tests. Catch regressions before you ship.
Replayd is an open-source testing tool that turns failures from AI agents into reusable regression tests, helping teams catch the same bugs returning after prompt or model changes before shipping to production.
How It Works
Your AI agent makes a mistake in production — maybe it approved something it shouldn't have, or gave wrong advice to a customer.
You wrap your agent's run in a special recording block that saves everything: what the user asked, what the agent did, and every tool it called.
You write a short note explaining what went wrong — like 'agent approved a refund over the policy limit' — and save it as a regression test.
Weeks later, your team updates the prompt, switches to a new AI model, or fixes some other part of the agent.
Before your changes go live, you run all your saved tests against the new version to check if old bugs have returned.
The same mistake happened again — your release is blocked until you fix it
No old bugs returned — your changes are safe to ship
Your AI agent is now protected by a safety net that catches regressions before they reach your users.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.