yeahhe365 / WebDroid-Agent
PublicBrowser-based Android phone agent using WebADB/WebUSB and OpenAI-compatible vision models
WebDroid Agent is a browser-based tool that lets you control an Android phone using AI. You connect your phone to Chrome with a USB cable, then describe a task in plain language—like 'open Settings and find Wi-Fi.' The app shows your phone's screen to an AI, which decides what to tap, swipe, or type. It executes each action on your phone, takes a new screenshot, and repeats until the task is done. The app includes safety features like requiring confirmation for sensitive actions, stopping after a set number of steps, and letting you stop the run at any time. It's designed for experimenting with AI-powered phone automation in a local, controlled environment—not for handling payments, logins, or other sensitive tasks.
How It Works
A friend tells you about a tool that can control an Android phone right from Chrome, using AI to understand what it sees on the screen.
Using a USB cable, you connect your Android phone to your computer and enable debugging mode on the phone. Chrome asks for permission to talk to your device.
You enter the address of your AI service and your personal key. The app remembers these settings for next time.
In plain English, you type something like 'Open Settings and find the Wi-Fi page.' The app takes a screenshot and shows it to the AI.
The AI studies the screenshot, understands the layout, and decides what action to take next—like tapping a button or opening an app.
Safe actions run automatically while the app shows you each step
You review and approve every action before it runs on your phone
After each action, the app takes a new screenshot and asks the AI what to do next. This repeats until your task is complete or you stop it.
The AI finishes what you asked, or asks you to take over if it hits something it can't handle—like entering a password. You can export a log of everything that happened.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.