mathiaschu / watch
PublicGive Claude a video input. /watch downloads from YouTube/Instagram/X/Vimeo/any yt-dlp site, extracts frames, and transcribes locally with mlx-whisper โ no API key. Fork of bradautomates/claude-video.
/watch is a skill for AI assistants that lets you ask questions about any video and get answers based on what the video actually contains. You paste a video URL (YouTube, Instagram, X, Vimeo, TikTok, and thousands more) or point to a local file, ask your question, and the tool downloads the video, extracts key frames, pulls or generates a transcript, and hands everything to the AI. The standout feature is privacy: transcription runs entirely on your machine using local speech recognition, so audio never leaves your computer. No account, no API key, no subscription โ just you and your videos.
How It Works
You hear about /watch โ a way to ask an AI assistant questions about any video, and it actually sees and hears everything in it.
You add it to your AI assistant with a simple command, and run a quick setup that prepares everything on your computer.
You share a YouTube, Instagram, or any video link along with a question like 'what happens at the 30-second mark?'
The video downloads directly and captions are pulled automatically when available.
If a video needs login (like private Instagram reels), you can share your browser cookies just for that download.
If a video has no captions, the audio is converted to text right on your computer โ nothing ever leaves your machine.
It reads each frame as an image and aligns everything with the transcript, understanding both what's shown and what's said.
No guessing โ your assistant answers based on what it actually saw and heard in the video, with timestamps to back it up.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.