huggingface

The home of Carbon Genomic Foundation Model 🧬

48
6
100% credibility
Found May 20, 2026 at 63 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

Carbon is a family of genomic foundation models (causal language models) trained on DNA/RNA sequences, providing eval code, inference scripts, and fine-tuning capabilities for zero-shot DNA tasks like sequence recovery and variant effect prediction.

Star Growth

See how this repo grew from 63 to 48 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is Carbon?

Carbon is a genomic foundation model from Hugging Face -- a family of causal language models trained on 1 trillion tokens of DNA sequences. It solves the problem of scattered, hard-to-reproduce zero-shot DNA evaluations by bundling seven benchmarks into one repo: sequence recovery, variant effect prediction on BRCA2 and ClinVar, and long-context retrieval tasks up to 786,000 base pairs. The flagship 3B model matches or beats Evo2 7B despite being smaller. Python-based, it runs on standard HuggingFace Transformers with optional vLLM acceleration.

Why is it gaining traction?

The hook is the evaluation suite. Instead of hunting through different repos for VEP benchmarks, you get BRCA2, TraitGym Mendelian, ClinVar, and custom perturbation tasks all in one place with reproducible run commands. The same scripts work across model families -- flip a flag and you're comparing Carbon against GENERator or Evo2 directly. The hybrid tokenizer that handles both English and DNA with a `` tag is clever, letting one model serve both genomics and text workloads.

Who should use this?

Computational biologists benchmarking genomic foundation models for variant effect prediction or regulatory variant scoring. ML engineers comparing DNA language model performance across benchmarks. Researchers fine-tuning on promoter detection or other downstream genomic tasks. If you need to evaluate whether a genomic model understands codon usage, motif disruption, or long-range context -- this is the toolbox.

Verdict

The 48 stars and 1.0% credibility score signal a young, low-visibility project -- solid research code but unproven in production. The documentation is thorough and the eval suite is genuinely useful, but the community footprint is minimal. Worth evaluating for benchmarking purposes, but treat it as research infrastructure rather than a production-grade tool until adoption grows.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.