RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.
DeepGym offers sandboxed coding environments with automatic scoring for training AI coding agents with reinforcement learning.
How It Works
You discover DeepGym, a tool for training AI models to write better code using verifiable, automatically computed scores.
With one simple command, you add DeepGym to your setup and it's ready to go.
Choose from ready-made coding tasks like making change with coins or sorting lists.
Your AI model creates a solution for the challenge.
DeepGym runs the code in a sandbox and returns a score reflecting how well the solution solves the task.
Use the scores to teach your AI model to improve step by step.
Over repeated training rounds, your model should solve these coding problems more reliably and score higher on them.
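The loop described above (pick a task, generate a solution, run it, score it, learn from the score) can be sketched in plain Python. This is an illustration of the general idea of a verifiable reward, not DeepGym's actual API: the names `TEST_CASES`, `score_solution`, `group_advantages`, and the `min_coins` task signature are all hypothetical, and a subprocess with a timeout stands in for a real sandbox, which it is not. The group-relative normalization at the end is one common way RL trainers turn raw scores into a learning signal; whether a given trainer uses it is an assumption here.

```python
import json
import statistics
import subprocess
import sys
import tempfile
from pathlib import Path

# Hypothetical coin-change task: fewest coins that sum to `amount`, or -1.
# Each case is ((coins, amount), expected_result).
TEST_CASES = [
    (([1, 2, 5], 11), 3),
    (([2], 3), -1),
    (([1], 0), 0),
    (([1, 3, 4], 6), 2),
]

# Small harness executed in a child process; it prints how many cases passed.
HARNESS = """
import json, sys
from solution import min_coins

cases = json.loads(sys.argv[1])
passed = 0
for args, expected in cases:
    try:
        if min_coins(*args) == expected:
            passed += 1
    except Exception:
        pass  # a crashing case simply counts as failed
print(passed)
"""


def score_solution(source: str, timeout: float = 5.0) -> float:
    """Return the fraction of test cases the candidate source passes (0.0 to 1.0).

    NOTE: a subprocess with a timeout is only a stand-in for real isolation.
    """
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "solution.py").write_text(source)
        Path(tmp, "harness.py").write_text(HARNESS)
        try:
            result = subprocess.run(
                [sys.executable, "harness.py", json.dumps(TEST_CASES)],
                cwd=tmp, capture_output=True, text=True, timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return 0.0  # runaway code earns no reward
    if result.returncode != 0 or not result.stdout.strip():
        return 0.0  # import errors, syntax errors, missing function
    try:
        passed = int(result.stdout.strip().splitlines()[-1])
    except ValueError:
        return 0.0
    return passed / len(TEST_CASES)


def group_advantages(scores: list[float]) -> list[float]:
    """Normalize scores within a group of samples: (score - mean) / std."""
    mean = statistics.fmean(scores)
    std = statistics.pstdev(scores)
    if std == 0:
        return [0.0] * len(scores)  # identical scores carry no signal
    return [(s - mean) / std for s in scores]


# A reference solution standing in for model output (dynamic programming).
REFERENCE_SOLUTION = '''
def min_coins(coins, amount):
    best = [0] + [float("inf")] * amount
    for total in range(1, amount + 1):
        for coin in coins:
            if coin <= total and best[total - coin] + 1 < best[total]:
                best[total] = best[total - coin] + 1
    return best[amount] if best[amount] != float("inf") else -1
'''

if __name__ == "__main__":
    candidates = [REFERENCE_SOLUTION, "def min_coins(coins, amount):\n    return 0\n"]
    scores = [score_solution(src) for src in candidates]
    print(scores, group_advantages(scores))
```

Because the score comes from executing tests rather than from a learned judge, the same solution always receives the same reward, which is what makes it verifiable. In a real setup the subprocess would be replaced by an isolated sandbox, and the scores would be passed to the trainer as rewards.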