ASLP-lab / Speaker-Reasoner
PublicSpeaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR
Speaker-Reasoner is a research model that transcribes audio conversations with timestamps, speaker identities, and other details using step-by-step reasoning.
How It Works
You hear about a clever tool that turns meeting audio into detailed transcripts showing who spoke when.
You read how it smartly breaks down conversations by first getting the big picture, then zooming into each speaker's part.
You see charts proving it does better than big-name AI tools at spotting speakers and timing words accurately.
You set up a quiet corner on your computer just for handling audio wonders like this.
You wait a bit and grab the ready-made thinking pieces to power the tool.
You share your recording of a chat or meeting with the tool.
It thinks step by step: overview of voices, guessing change points, then detailed notes on each part.
You receive a clear write-up with every speaker named, timed, and transcribed spot-on, even for long talks.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.