korale77 / mlx-vlm-falcon
PublicGrounded reasoning agent: Falcon Perception + Gemma 4 VLM on Apple Silicon
This is a local Apple Silicon tool that analyzes images by detecting and visualizing specific objects from user questions, then reasons over the annotations to provide accurate answers.
How It Works
You stumble upon a handy tool that lets you ask questions about objects in your photos, like 'How many cars?' and get highlighted answers right on your Mac.
Check you have a recent Apple Mac with plenty of memory, then grab the files and ready the simple pieces needed.
In one window, start the core analyzer – it grabs the clever thinking brains on first go, ready to examine pictures.
In another spot, pick any image from your computer and whisper a question about what's inside it.
It pulls out the key item from your question, hunts them down, and paints colorful glows and numbers around each one.
The tool shows the marked-up picture to its reasoning partner, which crafts a spot-on answer, even zooming close if needed.
Celebrate with saved highlighted images, detailed crops, and trustworthy insights into your photo's secrets!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.