MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
MOSS-TTS Family provides open-source AI models to generate high-fidelity speech from text, including voice cloning, multi-speaker dialogues, sound effects, and real-time streaming audio.
How It Works
You stumble upon MOSS-TTS, a collection of tools that turns everyday text into amazingly realistic speech and sounds.
Create a quiet corner on your computer with a fresh notebook for playing with voices.
Bring in the simple pieces needed to start making speech, like grabbing a few helpful apps.
Type in words, add a short voice sample if you want to copy a style, and watch as smooth talking audio appears.
Experiment with copying voices, chatting dialogues, or even creating fun sound effects from descriptions.
Listen to your perfect audio clips and save them for videos, stories, or real-time chats.
Now you craft natural-sounding speech anytime, making your projects feel alive and professional.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.