omni2sound / Omni2Sound
Omni2Sound: Your Multimodal Audio Generation Codebase (CVPR 2026 Highlight)
Omni2Sound is a unified open-source tool for generating temporally aligned audio from video inputs, text descriptions, or both, achieving top performance on audio synthesis benchmarks.
How It Works
Omni2Sound is a free tool for adding realistic sound to videos or creating audio from short descriptions.
Download the ready-to-use package and launch it on your computer.
Upload a video to generate matching sounds, such as footsteps or music.
Type a description such as 'rain on window' to generate the corresponding audio.
Upload a video and add a text prompt to steer the generated sound more closely.
Hit the button to generate synchronized, high-quality audio in seconds.
Play back the result, with the sounds timed to your video or idea.
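The steps above describe three input modes: video only, text only, or video plus text. The following is a minimal sketch of how a caller might tell those modes apart before generation. It is an illustration only: `GenerationRequest`, `select_mode`, and the mode labels are hypothetical names and do not come from the Omni2Sound codebase.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GenerationRequest:
    """Hypothetical request container; not part of the Omni2Sound codebase."""
    video_path: Optional[str] = None
    text_prompt: Optional[str] = None


def select_mode(request: GenerationRequest) -> str:
    """Return which of the three input modes a request falls under."""
    has_video = bool(request.video_path)
    # Treat a whitespace-only prompt as if no text was given.
    has_text = bool(request.text_prompt and request.text_prompt.strip())
    if has_video and has_text:
        return "video+text-to-audio"
    if has_video:
        return "video-to-audio"
    if has_text:
        return "text-to-audio"
    raise ValueError("Provide a video, a text prompt, or both.")
```

For example, `select_mode(GenerationRequest(text_prompt="rain on window"))` falls under the text-only mode, while supplying both fields selects the combined mode.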