deepseek-vision is an open-source proxy service that enhances DeepSeek's text-only AI models with image understanding and web capabilities. It works by intercepting requests that need vision or web features, processing them through additional AI services, and passing the results back to DeepSeek. The project includes a user-friendly web dashboard for configuration, supports both Anthropic-style and OpenAI-style API connections, and can be deployed via Docker or run directly. It includes security features like SSRF protection for web fetching and rate limiting for the admin interface. The project is MIT-licensed and maintained by Proma, a general-purpose AI agent project.
How It Works
You want to use DeepSeek for your AI projects, but realize it can't see images or search the web like other AI assistants can.
deepseek-vision is a free tool that acts like a bridge, adding image understanding and web access to your DeepSeek setup.
With one simple command, you start the proxy server on your computer. It runs quietly in the background, ready to help.
You open a friendly web page where you enter your DeepSeek key and optionally keys for image recognition and web search. Everything has clear labels and helpful hints.
Point your AI coding assistant to the proxy and start asking about images or current topics
Use any OpenAI-compatible app with your DeepSeek setup and enjoy vision and search features
Send an image and ask questions about it. Ask about news or recent events. The proxy handles all the complexity behind the scenes.
You've transformed DeepSeek into a full-featured assistant that can see, search, and fetch — all running privately on your own setup.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.