FudanCVL / OcclusionFormer
Public[ICML2026] OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation
OcclusionFormer is an AI image generation system from Fudan University that creates pictures from layout descriptions. Unlike standard tools, it handles the tricky problem of objects blocking each other correctlyโso if you say 'a person standing in front of a tree,' the person appears in front of the tree, not tangled with it. You can either draw your scene using a visual canvas or describe it in a layout file, and the system generates realistic images with proper depth ordering. It's designed for scenes with lots of overlapping objects where traditional tools get confused.
How It Works
You find OcclusionFormer while researching how to generate images from layout descriptions with realistic object layering.
You install the required tools and download the trained model weights so the system can understand your layouts.
Open the web demo where you can draw boxes directly on a canvas, type in what each object should look like, and specify which objects block others.
Prepare a layout file describing your scene with boxes, captions, and occlusion relationships, then run the generator.
The system creates your image, carefully placing objects in front of or behind each other exactly as you specified.
You receive a beautiful image where overlapping objects look natural and realistic, with proper depth ordering throughout.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.