Saganaki22 / ComfyUI-LongCat-AudioDIT-TTS
PublicComfyUI custom nodes for LongCat-AudioDiT \ Diffusion-based Zero-Shot Text-to-Speech
Custom nodes for the ComfyUI interface that provide zero-shot text-to-speech synthesis, voice cloning from reference audio, and multi-speaker dialogue generation using diffusion-based audio models.
How It Works
You find this fun add-on in your ComfyUI toolbox that turns words into realistic speech and copies voices perfectly.
Open the manager, search for it, and install – everything sets up automatically without any hassle.
Drag the text-to-speech piece into your canvas, type a message, and connect it up.
Upload a short audio clip of someone talking, add your new words, and watch it recreate their voice like magic.
Add more voices for different people, tag their lines in your script, and build a full conversation.
Press play, wait a moment, and hear your custom audio come to life right in the player.
Download the crystal-clear speech or conversation, perfect for videos, stories, or fun projects – it sounds just like real people talking!
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.