yu20103983

中文语音助手 | 唤醒词 + ASR + OpenClaw Agent + TTS | 离线唤醒、流式语音交互、工具调用、Skills 扩展

18
4
100% credibility
Found Apr 10, 2026 at 18 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

A hands-free voice assistant that connects to an AI agent to handle coding, file edits, searches, and music playback through speech commands and audio replies.

How It Works

1
🔍 Discover Xiaolong

You find this fun voice helper that lets you boss around your coding tools just by talking, no more staring at the screen.

2
📥 Grab It Easily

Download the ready-to-go package to your computer with a simple click.

3
🛠️ Quick Setup Magic

Hit the easy setup button and it grabs everything needed, like voice smarts and sound tools, in just seconds.

4
🔗 Link Your AI Buddy

Tell it which smart thinker to use so it can understand and do your tasks.

5
🚀 Turn It On

Launch the helper, pop on headphones, and it's listening for your voice.

6
🎤 Talk and Relax

Say 'Xiaolong Xiaolong' to wake it, give commands like 'refactor that code' or 'play music', and hear back while you stretch or sip coffee.

😎 Hands-Free Wins

Your projects get done in the background with voice updates, freeing you to multitask without touching keyboard or mouse.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 18 to 18 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is xiaolong-openclaw?

Xiaolong-openclaw is a Python voice assistant that bolts offline wake-word detection, ASR, and TTS onto OpenClaw agents for hands-free coding and tasks. Wake it with "Xiaolong Xiaolong," speak commands like "refactor that project," and hear results via streaming speech—no screen required. It targets Chinese users with local SenseVoice ASR, tool-calling via OpenClaw, and skills like music playback, solving desk-bound agent interactions.

Why is it gaining traction?

Full offline wake words via SenseVoice and Silero VAD beat cloud-dependent github whisper asr or qwen asr github setups, plus Bluetooth auto-detection and duplex audio adapt to real workflows. Streaming TTS with Edge fallback to local Matcha-TTS keeps responses snappy, and OpenClaw skills extend it to weather or GitHub queries without recoding. Devs hook on the natural flow: tolerant wake variants, input merging, and interruptible playback.

Who should use this?

OpenClaw users building agent-driven terminals who want voice control for file edits, command execution, or searches. Chinese developers eyeing omnilingual asr github alternatives like openclaw asr tts for offline reliability. Multitaskers with Bluetooth headsets coding on the go—say, refactoring while pacing.

Verdict

Grab it for OpenClaw experiments: bat-file setup and detailed README make prototyping fast. Low 1.0% credibility and 18 stars signal early maturity—solid for tinkering, but stabilize with your own tests before daily driver status.

(198 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.