jingyaogong / minimind-o
🎙️ A 0.1B omni-modal model trained from scratch that can listen, speak, and see!
MiniMind-O is a lightweight open-source AI system that takes text, voice, and image inputs to produce thoughtful text responses and natural-sounding streaming speech.
How It Works
You stumble upon this fun project online: a tiny AI buddy that listens to your voice, looks at pictures, reads text, and chats back with spoken words.
Download the main model files, along with the helper components for speech and images, so everything is ready to play with.
Start the web page with one simple command, and your AI assistant wakes up, ready to talk.
Type a question, speak into your mic, or upload a photo, then watch it understand and reply in a natural voice, much like chatting with a friend.
Use the mini training data to teach it new tricks in just a couple of hours on your home computer, making it truly yours.
Now you have a personal companion that sees, hears, thinks, and speaks, well suited to fun experiments or everyday help.