GiggleWang / MobileAgent-Android
PublicAn Android-native autonomous agent that uses vision-language models to see and operate your phone — no PC or ADB required. Inspired by X-PLUG/MobileAgent.
MobileAgent is an Android app that acts as an AI assistant for your phone. You install it, grant it permission to see your screen and perform taps and swipes, connect your AI service account, then type any task in plain language—like 'open my photos and share the last one.' The assistant starts working, shows you its progress in a small floating window, and completes the task by tapping buttons, swiping screens, and typing text just like you would. It's based on academic research from Alibaba and is fully open source with a friendly, bilingual interface.
How It Works
You hear about an app that uses AI to operate your phone automatically—just tell it what you want done and it figures out the steps.
You download the app and open it on your Android phone. The welcome screen shows three main sections: Home, Settings, and Permissions.
The app explains it needs to see your screen and tap buttons on your behalf—you grant these permissions yourself, with clear explanations of why each is needed.
You enter the address and password for your AI account (like OpenAI or Claude) so the assistant has the intelligence to understand and complete tasks.
Moves quickly, good for simple tasks you just want done
Good balance of speed and checking, the recommended setting
Double-checks every action, best for important tasks
You type something like 'Open Settings and turn on Dark Mode' and press Start. A floating bubble appears showing the assistant is working.
The assistant shows you what's happening through a small window—you see it analyzing each screen, deciding on the next tap, and moving toward your goal step by step.
The assistant finishes your request and tells you what it did. You can read a detailed log of every step it took, or start a new task whenever you like.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.