geocodebench / GeoCodeBench
[CVPR'26] Benchmarking PhD-Level Coding in 3D Geometric Computer Vision
GeoCodeBench benchmarks large language models on implementing complex 3D geometric computer vision algorithms from research papers using fill-in-the-blank coding tasks and unit tests.
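To make the task format concrete, here is a minimal sketch of what a fill-in-the-blank task paired with a unit test might look like. The function `project_points`, its pinhole-camera parameters, and the test cases are hypothetical illustrations and are not taken from the benchmark itself.

```python
from typing import List, Sequence, Tuple


def project_points(
    points_cam: Sequence[Tuple[float, float, float]],
    fx: float, fy: float, cx: float, cy: float,
) -> List[Tuple[float, float]]:
    """Project camera-frame 3D points to pixel coordinates (pinhole model)."""
    # ---- hypothetical blank: a model would be asked to write this body ----
    return [(fx * x / z + cx, fy * y / z + cy) for x, y, z in points_cam]
    # ---- end of blank ----


def test_project_points() -> None:
    # A point on the optical axis must land on the principal point.
    assert project_points([(0.0, 0.0, 2.0)], 500.0, 500.0, 320.0, 240.0) == [(320.0, 240.0)]
    # An off-axis point shifts by fx * x / z pixels horizontally.
    assert project_points([(1.0, 0.0, 2.0)], 500.0, 500.0, 320.0, 240.0) == [(570.0, 240.0)]
    # Edge case: empty input yields empty output.
    assert project_points([], 500.0, 500.0, 320.0, 240.0) == []


test_project_points()
```

In this sketch a candidate completion counts as correct only if every assertion holds, including the empty-input edge case.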
How It Works
You find this benchmark on arXiv or the project page while researching AI coding abilities in 3D vision.
Set up an isolated environment so the benchmark's dependencies run reliably on your computer.
Connect AI services such as chat-model APIs so they can generate code solutions.
Watch as different AIs read papers and fill in missing code for tough 3D vision problems.
Extract the AI-generated code into runnable files ready for testing.
Run unit tests automatically to check how well each AI's code handles edge cases.
Get clear summaries showing which AIs excel at PhD-level 3D coding tasks.
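The extract, test, and summarize steps above can be sketched roughly as follows. This is an illustrative outline only: the helper names `extract_code` and `pass_rate`, the sample response, and the `triangle_area` task are assumptions made for the example and do not reflect the repository's actual scripts.

```python
import re
from typing import Callable, Dict, List

FENCE = "`" * 3  # built programmatically so the example contains no literal fence
CODE_BLOCK = re.compile(FENCE + r"(?:python)?\s*\n(.*?)" + FENCE, re.DOTALL)


def extract_code(response: str) -> str:
    """Return the first fenced code block in a model response, or the raw text."""
    match = CODE_BLOCK.search(response)
    return match.group(1) if match else response


def pass_rate(code: str, func_name: str, tests: List[Callable[[Callable], bool]]) -> float:
    """Execute candidate code and return the fraction of tests that pass."""
    namespace: Dict[str, object] = {}
    try:
        # NOTE: exec on untrusted code is unsafe; a real harness should sandbox it.
        exec(code, namespace)
        func = namespace[func_name]
    except Exception:
        return 0.0  # code that fails to load scores zero
    passed = 0
    for test in tests:
        try:
            passed += bool(test(func))
        except Exception:
            pass  # a crashing test counts as a failure
    return passed / len(tests)


# Hypothetical model response for a "triangle area in 3D" task.
response = (
    "Here is my solution:\n" + FENCE + "python\n"
    "def triangle_area(a, b, c):\n"
    "    u = [b[i] - a[i] for i in range(3)]\n"
    "    v = [c[i] - a[i] for i in range(3)]\n"
    "    n = [u[1]*v[2] - u[2]*v[1], u[2]*v[0] - u[0]*v[2], u[0]*v[1] - u[1]*v[0]]\n"
    "    return 0.5 * sum(x * x for x in n) ** 0.5\n" + FENCE
)

tests = [
    # Unit right triangle in the xy-plane has area 0.5.
    lambda f: abs(f((0, 0, 0), (1, 0, 0), (0, 1, 0)) - 0.5) < 1e-9,
    # Edge case: collinear points give a degenerate triangle with zero area.
    lambda f: abs(f((0, 0, 0), (1, 1, 1), (2, 2, 2))) < 1e-9,
]

score = pass_rate(extract_code(response), "triangle_area", tests)
print(f"pass rate: {score:.2f}")
```

Aggregating such per-task pass rates across models would give the kind of summary the final step describes.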