rav4nn / youtube-rag-scraper
PublicScrape YouTube videos, extract transcripts, and build a semantic search AI knowledge base using RAG and FAISS.
This repository provides a Python-based command-line tool for scraping metadata and transcripts from YouTube channels, playlists, or videos, generating chunked datasets for retrieval-augmented generation, building vector search indexes, and performing semantic queries.
How It Works
You hear about a handy program that turns YouTube channels into your own searchable library of videos.
You follow easy steps to set up the program on your computer so it can connect to YouTube.
Gather everything from a creator's videos to build a full knowledge collection.
Collect from a curated list of videos on a topic.
Pull details from one specific video you care about.
Tell the program your choice, and it quietly gathers video titles, details, and full spoken words.
It builds a clever index that understands the content, ready for your questions.
Type natural questions like 'How to make great coffee?' and see relevant answers pulled from the videos.
Now you have a personal search engine for any YouTube channel, finding exactly what you need instantly.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.