JetAstra / MacAgentBench
MacAgentBench: Benchmark agents where they actually work, on macOS.
MacAgentBench provides a Dockerized macOS environment to benchmark AI agents on 110 realistic desktop tasks across apps like Notes, Reminders, and Keynote.
How It Works
You find this project while looking for ways to test AI helpers on everyday Mac apps like Notes or Keynote.
Download a complete Mac environment with AI tools already installed—no setup hassle.
Run a simple command to launch a full Mac desktop in a window on your computer.
Use a screen viewer to watch the Mac desktop come alive, with the AI assistant ready to go.
Pick AI models and let them tackle 110 real Mac chores like reminders or slides.
Chat with the AI to make it edit notes or check weather right on the Mac screen.
See the AI click, type, and complete tasks just like a human would on your Mac apps.
Review pass rates, video recordings, and compare your AI on the live leaderboard.
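As a rough illustration of the pass-rate review in the last step, the sketch below computes an overall and a per-app pass rate from a list of task outcomes. The record layout, the task IDs, and the helper names `pass_rate` and `pass_rate_by_app` are assumptions made for this example; they are not MacAgentBench's actual output format or API.

```python
from collections import defaultdict

# Hypothetical per-task outcomes as (app, task_id, passed).
# This layout is an assumption for illustration, not the benchmark's real format.
results = [
    ("Notes", "notes-001", True),
    ("Notes", "notes-002", False),
    ("Reminders", "reminders-001", True),
    ("Keynote", "keynote-001", True),
]


def pass_rate(rows):
    """Return the fraction of tasks that passed, or 0.0 for an empty list."""
    rows = list(rows)
    if not rows:
        return 0.0
    passed_count = sum(1 for _, _, passed in rows if passed)
    return passed_count / len(rows)


def pass_rate_by_app(rows):
    """Group outcomes by app name and compute a pass rate for each group."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[0]].append(row)
    return {app: pass_rate(app_rows) for app, app_rows in grouped.items()}


overall = pass_rate(results)
per_app = pass_rate_by_app(results)

print(f"overall pass rate: {overall:.0%}")
for app, rate in sorted(per_app.items()):
    print(f"{app}: {rate:.0%}")
```

Breaking the rate down by app can show whether an agent struggles with one application in particular rather than across the board.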