njucckevin

The model, data and code for OpenMobile

17
4
100% credibility
Found Apr 23, 2026 at 17 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

OpenMobile is an open-source toolkit for creating training data and evaluating AI agents that automate tasks on Android devices via the AndroidWorld benchmark.

How It Works

1
📖 Discover OpenMobile

You find this open project through a research paper or homepage, excited to train smart helpers for phone tasks.

2
📱 Set up phone simulator

You prepare a virtual phone screen to test and explore apps safely on your computer.

3
🔍 Explore apps freely

The tool wanders through apps, capturing real movements and screens to build a map of how phones work.

4
⚙️ Shape the data

You refine the captured journeys into clean examples of successful phone actions.

5
🤖 Test smart helpers

Connect your AI brain and watch it tackle phone challenges, seeing scores improve with your data.

6
📊 Train better agents

Use the examples to teach AI models new skills for automating everyday phone tasks.

🎉 Agents master phones

Your trained helpers now handle complex phone jobs reliably, ready for real-world use.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 17 to 17 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is OpenMobile-Code?

OpenMobile-Code delivers Python scripts, trajectory data, and fine-tuned vision-language models for training mobile agents on Android tasks. It tackles the black-box training data of top closed-source agents by synthesizing grounded instructions from exploration and generating diverse trajectories via policy-switching rollouts. Developers get ready-to-run eval code for the AndroidWorld benchmark, HF-hosted model data driven datasets, and tools to convert trajectories into ShareGPT format for LlamaFactory training.

Why is it gaining traction?

Unlike proprietary setups, it offers transparent pipelines for task synthesis and error-recovery data, hitting 64.7% success on AndroidWorld with Qwen3-VL—beating prior open efforts. The HF model registry, playground, and data viewer make experimentation fast, while eval scripts support vLLM-deployed models like OpenMobile-8B. Its focus on broad functionality coverage avoids benchmark overfitting, appealing to those tracking github model usage metrics.

Who should use this?

ML engineers fine-tuning VLMs for GUI agents, researchers benchmarking mobile AI on AndroidWorld, or teams needing model data vault for trajectory-based training. Ideal for devs integrating github model copilot-like agents into android apps, especially with tabular model data views for analysis.

Verdict

Grab it if you're prototyping open mobile agents—solid paper, HF assets, and eval code make it usable now despite 17 stars and 1.0% credibility score signaling early maturity. Docs are README-focused with setup guides; expect rough edges in unreleased synthesis code, but trajectories and checkpoints deliver immediate value.

(198 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.