MrCarlsama

把单条抖音/小红书链接整理成本地视频、音频、字幕和逐字稿的 Codex / Claude Code Skill。

16
1
85% credibility
Found May 29, 2026 at 17 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

A tool that takes a Douyin or Xiaohongshu video link and transforms it into local text transcripts, downloading the video, extracting audio, and using AI to create word-by-word transcriptions with timestamps in multiple formats.

How It Works

1
🔗 You have a video link you want to transcribe

You found an interesting video on Douyin or Xiaohongshu and want to save it as text.

2
📋 You paste the link into the tool

The tool automatically recognizes which platform the video comes from and prepares to download it.

3
🎬 Your video is downloaded automatically

The tool fetches the video from the platform and saves it to your computer, handling any login requirements behind the scenes.

4
🎵 The audio is extracted from the video

The sound is separated from the video and converted into a clean audio file ready for transcription.

5
✍️ Your video is transcribed into text

An AI reads through the audio and creates a word-by-word transcript with timestamps.

6
Choose your preferred transcript format
📝
Plain text transcript

Simple text file with just the words spoken, easy to read or paste anywhere

⏱️
Subtitles with timing

Subtitle file that shows words at the right moment, perfect for video editing

📖
Full document with timestamps

Markdown file showing every word with when it was said, plus all the video details

You have everything saved locally

Your video, audio, and transcripts are organized in one folder on your computer, ready for you to read, edit, or use however you want.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 17 to 16 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is mrcarlsama-social-transcriber-skill?

This is a Python skill for Codex and Claude Code that takes a single Douyin or Xiaohongshu URL and converts it into a complete local content package. You get the original video, extracted audio, SRT subtitles, and a word-level timestamped transcript. It runs locally using faster-whisper for speech recognition, yt-dlp for downloading, and Playwright to handle platform authentication when cookies are needed.

Why is it gaining traction?

The hook is the transcript-first workflow. Researchers and content creators working with Chinese short-video platforms have few tools that produce clean, timestamped text without cloud services. This runs entirely on your machine with a configurable Whisper model size, and it handles the messy reality of platform-specific cookie requirements automatically. The resume capability means interrupted downloads pick up where they left off, which matters when processing longer content.

Who should use this?

Market researchers analyzing Chinese social media, developers building training datasets for speech models, and content teams that need searchable transcripts of short-video content. If you are evaluating Codex or Claude Code as an alternative to GitHub Copilot and want practical skills that solve real workflow gaps, this demonstrates the skill ecosystem in action.

Verdict

At 16 stars with a 0.85% credibility score, this is a niche tool from an individual developer that works but lacks the polish of a mature project. The code is functional and the architecture is sound, but test coverage and documentation are minimal. Worth trying if you actually need Chinese short-video transcription, but do not adopt it as a production dependency without careful validation.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.