OBLITERATUS is an open-source toolkit for surgically removing content refusal behaviors from large language models using interpretability techniques, with a user-friendly web interface and community benchmarks.
How It Works
You find this free tool on Hugging Face Spaces while looking for ways to understand AI models better.
Click the link to launch the web interface—no setup needed, it runs instantly with free GPU time.
Choose from popular models like Llama or Mistral that fit your computer's power.
Select a method and hit 'Obliterate'—watch as it maps and removes the model's built-in restrictions in minutes.
Talk to your updated model right there, seeing how it responds without old limits while keeping its smarts.
Side-by-side view shows exactly what changed, with charts proving capabilities stayed strong.
Save your liberated model or push it online, now part of a community advancing AI understanding together.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.