lgy0404 / MemGUI-Bench
Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"
MemGUI-Bench is a benchmark for evaluating AI agents' long-term memory in dynamic Android app interactions across 128 tasks in 26 apps.
How It Works
You find the benchmark through its research paper or website; it tests how well mobile GUI agents remember what they have seen in apps.
You set up an emulated phone, either from a prepackaged environment or on your own machine, so tests can run smoothly.
Choose an AI agent and connect it so it can see and act on the phone screens during tests.
Start the run: your agent navigates real apps such as Contacts or Camera and has to remember information across screens.
See live updates on success rates, memory strength, and recovery from mistakes as tests finish one by one.
Check detailed reports on how well your agent remembers and how it compares with others, then submit to the public rankings.
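
To make the reporting step concrete, here is a minimal sketch of how per-task outcomes could be rolled up into the kind of success rates described above. It is not taken from the MemGUI-Bench code: the `summarize` function, the record fields (`task_id`, `app`, `success`), and the sample data are all hypothetical, and the repo's real result format and metrics may differ.

```python
from collections import defaultdict


def summarize(results):
    """Aggregate per-task outcomes into overall and per-app success rates.

    `results` is a list of dicts with hypothetical keys "app" and "success";
    this layout is an assumption, not the benchmark's actual output format.
    """
    if not results:
        return {"overall": 0.0, "per_app": {}}

    # Group the boolean outcomes by the app each task ran in.
    per_app = defaultdict(list)
    for record in results:
        per_app[record["app"]].append(bool(record["success"]))

    overall = sum(bool(r["success"]) for r in results) / len(results)
    return {
        "overall": overall,
        "per_app": {app: sum(flags) / len(flags) for app, flags in per_app.items()},
    }


# Made-up records for illustration only.
results = [
    {"task_id": "t1", "app": "Contacts", "success": True},
    {"task_id": "t2", "app": "Contacts", "success": False},
    {"task_id": "t3", "app": "Camera", "success": True},
]
summary = summarize(results)
print(summary)
```

Grouping by app keeps a single weak app from being hidden behind a good overall average, which matters when tasks are spread unevenly across apps.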