shiyi-zh0408

[CVPR 2026] Official code of the paper "Meta-CoT: Enhancing Granularity and Generalization in Image Editing"

43 stars · 100% credibility
Found Apr 28, 2026 at 43 stars
AI Analysis
Python
AI Summary

Meta-CoT is a research tool for editing images using natural language instructions with step-by-step reasoning to handle tasks like adding or removing objects.

How It Works

1. 🔍 Discover smart photo editing

You hear about Meta-CoT, a fun tool that edits pictures just by describing changes like 'add sunglasses' or 'change the sky'.

2. 📥 Get it ready

Clone the repository and run the one-command environment setup; the model checkpoints download automatically.

3. 🖼️ Pick your photo

Choose any picture from your phone or computer to start editing.

4. 💭 Describe your idea

Type what you want, like 'make the dog wear a hat', and watch it think through the steps.

5. See the magic happen

It reasons out loud, plans the changes, and creates your new edited image.

6. 🎉 Share your creation

Enjoy your finished edit, ready to post or print.


AI-Generated Review

What is Meta-CoT?

Meta-CoT is a Python framework for text-driven image editing that breaks instructions into task-target-understanding triplets and five core meta-tasks such as addition and replacement. It powers a BAGEL-based model that handles 21+ operations with strong generalization, delivering edited images via a simple CLI, e.g. `python inference/edit_single.py --image input.jpg --instruction "add a cat"`. Among CVPR 2026 accepted-paper repos, it stands out by releasing HF models, benchmarks, and SFT training scripts for quick experimentation.
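
The triplet idea can be sketched with a toy rule-based parser. This is purely illustrative: the actual repo uses a trained BAGEL-based model, the exact five meta-task names come from the paper, and every name in this snippet (the `META_TASKS` table, the `decompose` function) is a hypothetical stand-in, not the repo's API.

```python
# Toy sketch of Meta-CoT's task-target-understanding decomposition.
# Meta-task names beyond addition/replacement are placeholders.
META_TASKS = {
    "add": "addition",
    "remove": "removal",
    "replace": "replacement",
    "change": "modification",
    "move": "relocation",
}

def decompose(instruction: str) -> dict:
    """Split an edit instruction into a (task, target, understanding) triplet."""
    words = instruction.lower().split()
    # Pick the first word that matches a known meta-task verb.
    task = next((META_TASKS[w] for w in words if w in META_TASKS), "modification")
    verb_idx = next((i for i, w in enumerate(words) if w in META_TASKS), 0)
    # Naive target guess: everything after the task verb.
    target = " ".join(words[verb_idx + 1:]) or instruction
    understanding = f"{task} edit targeting '{target}'"
    return {"task": task, "target": target, "understanding": understanding}

print(decompose("add a cat"))
```

In the real system the decomposition is produced by the model's chain-of-thought, not keyword matching; the sketch only shows the shape of the intermediate representation.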

Why is it gaining traction?

It beats baselines by 15.8% on a 21-task benchmark and by 13% on ImgEdit, thanks to CoT consistency rewards that align reasoning with outputs. Devs grab it for the one-command env setup, auto-downloaded checkpoints, and eval scripts that batch-process benchmarks like GEdit or RiseBench. In CVPR 2026 paper discussions on Reddit and GitHub, the meta-task approach hooks those chasing zero-shot editing without retraining.
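
The consistency-reward idea can be illustrated as a similarity score between the reasoning plan and a description of the produced edit. A minimal sketch, assuming a bag-of-words cosine similarity as a stand-in for the repo's actual (learned) reward, with all function names here hypothetical:

```python
import math
from collections import Counter

def bow_cosine(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words vectors of two strings."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[w] * cb[w] for w in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0

def cot_consistency_reward(reasoning: str, output_caption: str) -> float:
    """Toy stand-in for a reward that checks the edit matches the plan."""
    return bow_cosine(reasoning, output_caption)

score = cot_consistency_reward("plan: add a red hat on the dog",
                               "a dog wearing a red hat")
print(score)
```

A reward like this is high when the edit reflects the plan and near zero when reasoning and output diverge, which is the alignment signal the review describes; the actual implementation compares model reasoning against the edited image, not captions.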

Who should use this?

Vision-language researchers prototyping diffusion-LLM hybrids, AI product devs needing robust editors for apps like photo enhancers, or ML engineers fine-tuning on custom edits via provided YAML dataset configs and multi-node training scripts.
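
A custom-edit dataset config in the spirit of the repo's YAML files might look like the dict below. Every field name is an assumption for illustration; check the actual configs shipped with the repo before fine-tuning.

```python
# Hypothetical dataset config mirroring the repo's YAML-style configs.
# Field names are assumptions, not the actual schema.
dataset_config = {
    "name": "custom_edits",
    "meta_task": "addition",
    "train_pairs": "data/custom/train.jsonl",
    "image_root": "data/custom/images",
    "max_samples": 10000,
}

# Print in a YAML-like key: value layout.
for key, value in dataset_config.items():
    print(f"{key}: {value}")
```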

Verdict

Grab it if you're into CVPR 2026 paper-list gems: a solid README, HF integration, and runnable inference make it dev-ready, even though the star count and credibility score signal early maturity. Test on your own edits before production; the RL code is still TODO.


