InstructSAM is an AI system that lets you find and highlight specific objects in photos by simply describing what you want in plain language. Instead of clicking on objects manually, you type instructions like 'segment the person on the left' or 'find all the cats' and the AI automatically draws outlines around the matching objects. The system can handle multiple objects at once and works with different types of descriptions—from simple category names to complex referring expressions. It's designed for researchers and developers working on image understanding, visual AI assistants, and image editing tools.
How It Works
You come across InstructSAM, an AI that can highlight specific objects in images just by following your written instructions.
You grab the trained model from HuggingFace so your computer can understand and follow instructions about images.
You point to any photo, type something like 'the red car on the left' or 'all the people wearing hats', and the AI prepares to find those things.
The model analyzes your image and creates clear outlines around every object that matches your description.
The results show colorful overlays on your original image, letting you see precisely which objects the AI found based on your instructions.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.