facebookresearch / tuna-2
PublicOfficial implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
TUNA-2 is a research project from Meta providing code for training and evaluating unified multimodal AI models that handle image understanding and generation using pixel embeddings.
How It Works
You stumble upon Tuna-2, a clever AI from researchers that blends picture smarts with creative image making.
Download the ready-to-use package to your computer in moments.
Run a quick setup script to prepare everything for fun experiments.
Describe a scene in words and watch the AI bring it to life with stunning visuals.
Tweak existing photos or ask the AI to explain what's in pictures.
Use built-in checks to see how well your AI handles real challenges.
Now you create, edit, and understand images effortlessly with your new AI companion!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.