zhanghaotian0225 / Accumulative-Decoding
PublicMitigating Hallucinations in Large Vision-Language Models via Accumulative Decoding
This project offers a lightweight, no-training-needed technique to reduce false details in AI-generated image descriptions using vision-language models like LLaVA, including scripts to evaluate performance on key benchmarks.
How It Works
You hear about Accumulative Decoding, a simple way to make AI assistants describe images more accurately without making up details.
Download the tool and prepare a ready-to-use AI assistant specialized in understanding pictures.
Choose a photo or picture you want the AI to analyze, like a scene or object.
Activate the special feature that keeps the AI focused on what's really in the image throughout its entire response.
Give the AI a question or request to describe the picture in detail.
Run quick checks on standard image-understanding tests to see the improvements.
Your AI now gives spot-on descriptions without hallucinations, beating benchmarks and feeling trustworthy.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.