jmerelnyc / Photo-agents
PublicAutonomous self-evolving agents. Vision-grounded layered memory and self-written skills for LLM agents that operate your computer.
Photo Agents is a local AI agent framework that uses screenshots to perceive your screen, reasons with LLMs, and automates computer tasks via tools like code execution and browser control.
How It Works
You hear about a smart helper that watches your screen and does tasks for you, like a friendly robot assistant.
Download and set it up with a simple command, just like installing any helpful app.
Sign up on their website to get a special code that unlocks your assistant.
Tell it which smart service (like Claude or GPT) to use for making decisions.
Click launch and watch your new screen-seeing helper come alive on your desktop or phone.
A floating button pops up; click to chat anytime while using your computer.
Connect to Telegram or similar for messages on your phone.
It handles boring tasks on its own, learning and improving over time so you relax.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.