TrustEval-MM is a toolkit that helps you understand how trustworthy an AI model is at understanding images. Instead of giving you one confusing score, it tests the AI across five important areas: whether it tells the truth about what it sees, whether it stays consistent when inputs change slightly, whether it treats different groups of people fairly, whether it knows when it's wrong, and whether it accidentally shares private information. The tool runs automated tests, then creates a clear 'trust card' showing strengths and weaknesses across all areas so you can make smart choices about which AI to use.
How It Works
You've been using AI models that look at images, but you want to know which one you can really trust with important decisions.
With a single command, you install TrustEval-MM on your computer and it is ready to use.
The tool helps you set up a small collection of test images and questions that will be used to probe the AI's responses.
You point the tool at any AI model that understands images, and it automatically asks hundreds of questions across five different areas of trustworthiness.
The tool produces a colorful markdown trust card with bar charts and scores so you can quickly see where the AI shines and where it struggles.
A JSON file contains all the raw numbers so you can compare models, track changes over time, or build your own reports.
Instead of a single aggregate number, you now see exactly where the AI might make mistakes, treat people unfairly, or leak private information.
With a clear picture of your AI's trustworthiness, you can choose the right model for your project or identify areas that need improvement.
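As a rough illustration of how the raw numbers in the JSON file could be used to compare two models, here is a minimal Python sketch. The layout of TrustEval-MM's JSON output is not described here, so the five key names (`truthfulness`, `consistency`, `fairness`, `calibration`, `privacy`), the 0-to-1 score scale, the sample values, and both helper functions are assumptions made purely for illustration.

```python
import json

# Hypothetical dimension names: the real keys in the tool's JSON output may differ.
DIMENSIONS = ["truthfulness", "consistency", "fairness", "calibration", "privacy"]


def compare_scores(model_a: dict, model_b: dict) -> dict:
    """Return the per-dimension difference (model_b minus model_a)."""
    return {d: round(model_b[d] - model_a[d], 4) for d in DIMENSIONS}


def weakest_dimension(scores: dict) -> str:
    """Return the dimension with the lowest score."""
    return min(DIMENSIONS, key=lambda d: scores[d])


# Made-up example data standing in for two exported result files.
card_a = json.loads(
    '{"truthfulness": 0.8, "consistency": 0.7, "fairness": 0.9,'
    ' "calibration": 0.6, "privacy": 0.95}'
)
card_b = json.loads(
    '{"truthfulness": 0.85, "consistency": 0.65, "fairness": 0.9,'
    ' "calibration": 0.7, "privacy": 0.9}'
)

if __name__ == "__main__":
    for dimension, delta in compare_scores(card_a, card_b).items():
        print(f"{dimension:<13} {delta:+.2f}")
    print("Weakest area for model A:", weakest_dimension(card_a))
    print("Weakest area for model B:", weakest_dimension(card_b))
```

Keeping the scores separate per dimension, rather than averaging them, preserves the point of the trust card: a model that improves in one area may still regress in another.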