[CVPR 2026 Workshop] Official code and models for Plain Mask Transformer (PMT).
PMT is a research tool for training models to segment objects in images and videos while keeping the core image understanding part unchanged for reuse across tasks.
How It Works
You find this clever tool for outlining objects in photos and videos without messing with the main image analyzer.
You prepare your computer by installing the needed software so everything runs smoothly.
You collect pictures along with labels showing where objects like people or cars are located.
You choose a ready recipe and launch the learning process, watching your model get smarter with each batch of images.
You check colorful charts and logs to see how much better it's getting at spotting and outlining things.
You try it out on fresh images to measure how accurately it draws boundaries around objects.
Your model now flawlessly segments everyday scenes or videos, ready for your projects or research.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.