SarahXC / codex-autoresearch-harness
PublicBash harness to run Karpathy's autoresearch with Codex CLI in a loop. Includes A/B testing framework for comparing models.
Scripts that wrap an AI coding tool in a loop to run autonomous experiments improving a neural network training script, with support for comparing different AI models on a GPU machine.
How It Works
You find a handy kit that lets AI brains compete to smarten up a learning program all by themselves.
You pick a cloud service and launch a powerful virtual machine with a top graphics card for big calculations.
You follow simple steps to prepare the machine, downloading sample data and tools so everything is ready.
You connect your AI account so the smart models can get to work thinking and creating.
You pick two AI models, set a time like 6 hours each, and launch them to take turns improving the program.
You peek at the logs anytime to see experiments running, scores dropping, and which changes stick.
You get a clear comparison table showing the best scores and how much better each AI made the program.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.