pguso / voice-agents-from-scratch
PublicFrom-scratch voice agents in Python: end-to-end speech pipelines, runnable chapters, and a small shared library. Local models, explicit streaming behavior.
Hands-on tutorial repository teaching how to build fully local real-time voice agents that listen via microphone, think with language models, and speak back using open-source tools.
How It Works
You stumble upon this friendly tutorial promising to teach you how to make a computer that listens to your voice and chats back like a real friend.
You follow easy steps to prepare your computer, making sure your microphone and speakers are ready for fun conversations.
You grab the special sound files and thinking brains so your agent can understand words and speak naturally.
You speak into the mic, and moments later, it talks back to you, sparking excitement as the conversation begins.
You dive into short lessons on listening, thinking up answers, and speaking smoothly, building confidence step by step.
You mix the skills to build your own helpers, like a tutor or interviewer, tailored just how you like.
Your personal voice companion is alive and local, ready for natural talks without needing the internet.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.