A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and Chrome TLS fingerprinting evades bot detection. Ships with an MCP server so any LLM can drive a full crawl through conversation.
Anansi is an adaptive web scraping framework that automatically extracts data from websites and learns to handle site changes over time. When websites block automated access or require JavaScript rendering, Anansi intelligently switches to browser-based fetching and mimics real browser fingerprints to continue collecting data. The tool includes a self-healing parser that remembers successful extraction methods and adapts when sites change their layout. It also ships with an MCP server that allows AI assistants to control web scraping through conversational commands, enabling research agents to autonomously gather data from the web.
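To make the self-healing idea concrete, here is a minimal sketch, not Anansi's actual implementation. It assumes a field can be located by several independent strategies, and that the extractor remembers whichever one last succeeded so it is tried first next time. The names HealingExtractor, by_class, by_itemprop, and by_currency_pattern are hypothetical, and the regex-based matching is a stand-in for a real HTML parser.

```python
import re
from typing import Callable, Dict, Optional

# Hypothetical strategy type: takes raw HTML, returns the value or None.
Strategy = Callable[[str], Optional[str]]


def by_class(html: str) -> Optional[str]:
    # Look for an element carrying class="price".
    m = re.search(r'class="price"[^>]*>([^<]+)<', html)
    return m.group(1).strip() if m else None


def by_itemprop(html: str) -> Optional[str]:
    # Look for an element carrying itemprop="price".
    m = re.search(r'itemprop="price"[^>]*>([^<]+)<', html)
    return m.group(1).strip() if m else None


def by_currency_pattern(html: str) -> Optional[str]:
    # Last resort: anything that looks like a currency amount.
    m = re.search(r'[$€£]\s?\d+(?:\.\d{2})?', html)
    return m.group(0) if m else None


STRATEGIES: Dict[str, Strategy] = {
    "class": by_class,
    "itemprop": by_itemprop,
    "currency": by_currency_pattern,
}


class HealingExtractor:
    """Tries strategies in order, starting with the one that last worked."""

    def __init__(self, strategies: Dict[str, Strategy]) -> None:
        self.strategies = strategies
        self.last_good: Optional[str] = None  # remembered across calls

    def extract(self, html: str) -> Optional[str]:
        names = list(self.strategies)
        if self.last_good in self.strategies:
            # Move the remembered strategy to the front of the list.
            names.remove(self.last_good)
            names.insert(0, self.last_good)
        for name in names:
            value = self.strategies[name](html)
            if value is not None:
                self.last_good = name
                return value
        return None  # every strategy failed; last_good is left unchanged


extractor = HealingExtractor(STRATEGIES)
old_layout = '<span class="price">$19.99</span>'
new_layout = '<div itemprop="price" data-x="1">$24.50</div>'
print(extractor.extract(old_layout), extractor.last_good)
print(extractor.extract(new_layout), extractor.last_good)
```

In this sketch a redesign that drops the class attribute does not break extraction: the next strategy picks up the value and becomes the new first choice.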
How It Works
You reach for Anansi when you need to regularly extract product prices, articles, or listings from websites that keep changing their layout.
Instead of breaking when a site redesigns, Anansi automatically finds your data using multiple strategies and remembers what worked for next time.
When a site blocks automated access, Anansi switches to a real browser, waits out security checks, and mimics real browser fingerprints—all without you lifting a finger.
Write a simple Python script that extracts exactly the fields you need from any page.
Or connect an AI assistant through a chat interface and ask it to research topics, gather data, and report back.
Running a large crawl? You can pause it mid-way and pick up exactly where you left off—even days later after a restart.
Your extracted data is automatically validated, deduplicated, and exported in formats ready for spreadsheets or databases.
You end up with structured, reliable data from websites that would normally block automated access, and Anansi gets better at handling those sites along the way.
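The pause-and-resume behavior described above can be sketched as a crawl queue that is checkpointed to disk after every page. This is an illustrative sketch under stated assumptions, not Anansi's own mechanism: ResumableCrawl, the JSON state file, and the injected fetch_links callable are all hypothetical names chosen for the example.

```python
import json
import os
from collections import deque
from typing import Callable, Iterable, Optional


class ResumableCrawl:
    """Breadth-first crawl whose queue and seen-set survive a restart."""

    def __init__(self, state_path: str, seeds: Iterable[str]) -> None:
        self.state_path = state_path
        if os.path.exists(state_path):
            # A checkpoint exists: resume from it and ignore the seeds.
            with open(state_path, encoding="utf-8") as f:
                state = json.load(f)
            self.queue = deque(state["queue"])
            self.seen = set(state["seen"])
        else:
            seed_list = list(seeds)
            self.queue = deque(seed_list)
            self.seen = set(seed_list)

    def _save(self) -> None:
        # Write to a temp file, then rename, so a crash never leaves
        # a half-written checkpoint behind.
        tmp = self.state_path + ".tmp"
        with open(tmp, "w", encoding="utf-8") as f:
            json.dump({"queue": list(self.queue), "seen": sorted(self.seen)}, f)
        os.replace(tmp, self.state_path)

    def run(
        self,
        fetch_links: Callable[[str], Iterable[str]],
        max_pages: Optional[int] = None,
    ) -> int:
        """Process up to max_pages URLs; returns how many were processed."""
        done = 0
        while self.queue and (max_pages is None or done < max_pages):
            url = self.queue.popleft()
            for link in fetch_links(url):
                if link not in self.seen:  # skip already-discovered URLs
                    self.seen.add(link)
                    self.queue.append(link)
            done += 1
            self._save()  # checkpoint after each fully processed page
        return done
```

Because the checkpoint is written only after a page is fully processed, a crash in the middle of a page means that page is fetched again on resume rather than silently skipped. Calling run with max_pages set, then constructing a new ResumableCrawl on the same state file later, continues from the remaining queue.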