shawn0728 / OpenSearch-VL
OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diverse visual/search tools, and fatal-aware agentic reinforcement learning.
OpenSearch-VL is an open-source toolkit for creating AI agents that analyze images, use visual tools like cropping and sharpening, and search the web to provide accurate answers to visual questions.
How It Works
This project builds agents that inspect images, zoom in on details, and search the web to answer questions about them.
Download the released models and example image data to get started.
Give it a hard image, such as a faded sign, and it can sharpen, crop, and search online to recover the relevant facts.
Run evaluations on difficult visual questions to check how accurately it answers.
Add your own examples to improve how it handles real-world visual tasks.
The result is an agent that combines visual tools and web search to answer questions about images.
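To make the tool steps above concrete, here is a minimal, standard-library sketch of what crop and sharpen tools plus a search call might look like when chained by an agent. It is not OpenSearch-VL's actual code or API: the names `crop`, `sharpen`, `web_search`, `TOOLS`, and `run_plan` are all hypothetical, the image is a plain nested list of grayscale values, and the search function is an offline stub.

```python
from typing import Any, Callable, Dict, List, Tuple

# Hypothetical image type: grayscale pixels (0-255) in row-major order.
Image = List[List[int]]


def crop(img: Image, left: int, top: int, right: int, bottom: int) -> Image:
    """Return rows top..bottom-1 and columns left..right-1 of the image."""
    return [row[left:right] for row in img[top:bottom]]


def sharpen(img: Image) -> Image:
    """Apply a 3x3 sharpening kernel (center 5, four neighbours -1).

    Border pixels are copied unchanged; results are clamped to 0-255.
    Assumes a non-empty, rectangular image.
    """
    height, width = len(img), len(img[0])
    out = [row[:] for row in img]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            value = (
                5 * img[y][x]
                - img[y - 1][x]
                - img[y + 1][x]
                - img[y][x - 1]
                - img[y][x + 1]
            )
            out[y][x] = max(0, min(255, value))
    return out


def web_search(query: str) -> List[str]:
    """Offline stand-in for a real search backend (assumption, not a real API)."""
    return [f"(stub result for: {query})"]


# Hypothetical registry mapping tool names to image-to-image functions.
TOOLS: Dict[str, Callable[..., Image]] = {"crop": crop, "sharpen": sharpen}


def run_plan(img: Image, plan: List[Tuple[str, Dict[str, Any]]]) -> Image:
    """Apply each (tool_name, kwargs) step in order and return the final image."""
    for name, kwargs in plan:
        img = TOOLS[name](img, **kwargs)
    return img


if __name__ == "__main__":
    # A flat 5x5 image with one brighter pixel in the middle.
    image = [[50] * 5 for _ in range(5)]
    image[2][2] = 100
    plan = [
        ("crop", {"left": 1, "top": 1, "right": 4, "bottom": 4}),
        ("sharpen", {}),
    ]
    print(run_plan(image, plan))
    print(web_search("text on a faded street sign"))
```

In a real agent, the plan would be produced step by step by the model rather than fixed in advance, and the search stub would be replaced by an actual search service; this sketch only shows how the tools compose.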