AmrDab

AmrDab / clawd-cursor

Public

AI desktop agent — sees your screen, controls your cursor, completes tasks autonomously.

110
17
69% credibility
Found Feb 23, 2026 at 19 stars 6x -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
TypeScript
AI Summary

Clawd Cursor is an open-source AI desktop agent that automates computer tasks like opening apps and typing using a layered approach with screen reading, accessibility tools, and optional AI vision.

How It Works

1
🔍 Discover Clawd Cursor

You hear about Clawd Cursor, a helpful AI friend that can use your computer to open apps and do simple tasks for you.

2
📥 Get it ready

You download it easily and run a friendly setup check that tests your screen and finds the best way to work.

3
🩺 Smart setup finishes

The setup doctor automatically checks everything, picks the perfect helpers, and gets your AI assistant ready to go with one click.

4
🚀 Turn it on

You start your assistant, and it waits quietly for your instructions over a simple web link.

5
Tell it what to do
🆓
Use free local helper

It works instantly with your own free AI brain on your computer for basic tasks.

Use powerful online AI

Connect a smart online service for tougher jobs, and it handles everything smoothly.

6
👀 Watch the magic

Your AI sees your screen, clicks buttons, types words, and completes the task right before your eyes.

Task done perfectly

Everything works as asked, saving you time, and you can send more tasks anytime.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 19 to 110 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is clawd-cursor?

Clawd-cursor is a TypeScript desktop agent AI that sees your screen, moves your cursor, and completes tasks autonomously—like opening apps, typing text, or navigating browsers—via a simple HTTP API. It solves brittle desktop automation by layering instant regex routing, accessibility tree parsing, and vision LLM fallback, running free on local models like Ollama or powered by Anthropic/OpenAI. Developers hit `npx clawd-cursor doctor` to auto-configure providers, then POST tasks like "Open Notepad and type hello" to localhost:3847/task.

Why is it gaining traction?

This desktop agent stands out with a 3-layer pipeline that skips expensive vision for 80% of tasks, hitting 2s benchmarks on local Qwen vs 40s+ for pure screenshot agents—95% cheaper overall. Multi-provider support (Anthropic Computer Use, ChatGPT-compatible, Ollama) plus self-healing retries and safety tiers (auto/preview/confirm) make it reliable for desktop agent windows/macOS without VNC hacks. Benchmarks and curl-ready API hook devs prototyping autonomous agents fast.

Who should use this?

DevOps engineers automating SAP build process or Postman Chrome flows on Windows desktops. QA testers scripting GitHub desktop client tasks like cloning repos on Ubuntu or macOS. Researchers building desktop agents pets for UI experiments, skipping manual nut.js boilerplate.

Verdict

Grab it for desktop agent experiments—solid docs, CLI doctor, and perf tests make setup painless despite 16 stars and 0.7% credibility score signaling early maturity. Test locally before workflows; prod needs more battle scars.

(198 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.