FireRedTeam / FireRedASR2S
PublicFireRedASR2S is a SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and singing lyrics recognition. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects.
FireRedASR2S is a complete speech-to-text toolkit combining recognition, voice detection, language identification, and punctuation addition with top accuracy for Chinese dialects, English, and more.
How It Works
You hear about a super-smart tool that listens to audio and turns spoken words into accurate text, even handling tricky accents and dialects perfectly.
Download the simple package and set up a clean space for it on your computer with a few easy steps.
Fetch the pre-trained helpers that understand speech, silence, languages, and add proper punctuation automatically.
Convert any sound clips to the right simple format if needed, so everything works smoothly.
Feed your audio files into the full system, and watch it detect speech parts, identify languages, transcribe words, and add punctuation in one go.
Get back neatly formatted text with timings, confidence scores, and separated sentences ready to use.
Enjoy spot-on text from your audio, saving hours of manual work, whether for meetings, songs, or multilingual chats.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.