edmicho / mm-probe-kit
PublicA small, hackable toolkit for probing multimodal LLMs — attention, hidden states, alignment, and causal tracing.
A research toolkit that helps people examine how AI models that understand both images and text actually work, by visualizing attention patterns, hidden states, and text-image alignment.
How It Works
You discover a free toolkit that lets you peek inside AI models that can see pictures and read text.
You install the toolkit with one simple command and everything is ready to go.
You choose a popular AI model that understands both images and text to study.
You show the AI a picture and ask it a question, like asking a friend to describe what they see.
The toolkit reveals exactly which parts of the image the AI focused on while thinking about its answer.
You see colorful heat maps and charts showing how text and images connect in the AI's thinking.
You've gained real insight into how vision-language AI works, and you can share what you discovered.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.