Prism is a windows desktop ai agent built with electron and a fastapi backend, powered by the google-gemini-3-api. it can understand what is on your screen, plan multi step actions.
Prism is a Windows desktop AI agent powered by Google Gemini that observes your screen and automates tasks like opening apps, navigating browsers, summarizing content, and handling files through natural language commands.
How It Works
You hear about Prism, a helpful desktop assistant that understands your screen and automates everyday tasks on Windows.
Download the app, double-click to launch it, and it appears as a small floating helper in your taskbar.
Enter a simple key from Google's AI studio so Prism can think and understand your screen.
Press Alt+Space anywhere on your desktop to bring up the chat box – it's always on top and super quick.
Type a natural command like 'open Chrome and go to YouTube' or 'summarize this screen', and attach files if needed.
Prism shows glowing borders around what it's doing, plans steps, and automates clicks, typing, and navigation right before your eyes.
Your apps open, files get summarized, workflows complete – saving you time while you relax and watch.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.