inclusionAI / Zooming-without-Zooming
ZwZ model family: SOTA fine-grained perception performance; ZoomBench: a new, challenging perception benchmark
This repository provides code, models, and a benchmark for training efficient multimodal AI models that perform state-of-the-art fine-grained visual perception in a single pass.
How It Works
This tool helps AI models see tiny details in pictures without zooming in.
Collect a folder of high-resolution images to teach your AI about fine details.
The tool automatically zooms into small areas of your photos and generates question-answer pairs about them.
Run the training to build a powerful AI that understands details in one quick look.
Test on benchmark challenges to see how well the model handles counts, text, colors, and more.
Your AI now excels at precise vision tasks, ready for real-world use without extra steps.
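The data-building step above (zoom into small areas, then write questions and answers about them) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the names `Box`, `sample_crop_boxes`, and `make_record`, the crop-size fraction, and the record layout are all hypothetical. It also assumes that each question-answer pair written from a crop is stored alongside the full image, so that the model learns to answer in a single look.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """A crop region in pixel coordinates (hypothetical helper type)."""
    left: int
    top: int
    right: int
    bottom: int


def sample_crop_boxes(width, height, n=4, frac=0.2, seed=0):
    """Sample n crop boxes, each spanning `frac` of the image's width and height.

    Random placement is an assumption made for illustration; the real tool
    may choose regions differently.
    """
    rng = random.Random(seed)
    crop_w = max(1, int(width * frac))
    crop_h = max(1, int(height * frac))
    boxes = []
    for _ in range(n):
        # randint is inclusive, so the box always stays inside the image.
        left = rng.randint(0, width - crop_w)
        top = rng.randint(0, height - crop_h)
        boxes.append(Box(left, top, left + crop_w, top + crop_h))
    return boxes


def make_record(image_path, box, question, answer):
    """Pair the FULL image with a question and answer written from the crop."""
    return {
        "image": image_path,
        "region": (box.left, box.top, box.right, box.bottom),
        "question": question,
        "answer": answer,
    }
```

The `region` tuple uses the (left, top, right, bottom) order, which matches the box argument of Pillow's `Image.crop` if you want to extract the zoomed-in view. The question and answer strings would come from whatever generator you run on the crop; they are left as plain inputs here.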