Nuclear-Marmalade

Free open-source business data enrichment engine. The open-source alternative to Apollo, ZoomInfo, and Clearbit.

16
5
100% credibility
Found Apr 08, 2026 at 19 stars
AI Analysis
Python
AI Summary

FORGE is an open-source tool that enriches business contact lists from spreadsheets or public sources with emails, technologies, government data, and AI-generated insights, all running locally for free.

How It Works

1. 🔍 Discover FORGE

You hear about a free tool that supercharges your business leads by finding emails, tech details, and smart insights without expensive subscriptions.

2. 📦 Set it up quickly

Install with a simple command and prepare your list of companies from a spreadsheet or by searching a ZIP code.

3. 📤 Add your companies

Upload your customer spreadsheet or discover new businesses nearby to get started.

4. 🚀 Launch enrichment

Click to start and watch it pull in emails, check websites, and add helpful summaries automatically.

5. 📊 Track the magic

Use the dashboard to see progress as your list fills up with valuable details in real time.

Get your super list

Download your enriched spreadsheet packed with contacts and insights, ready for outreach and saving you thousands.

AI-Generated Review

What is dataforge?

Dataforge is a Python-based business data enrichment engine that turns basic company lists into rich profiles with emails, tech stacks, industry tags, and AI-generated insights such as health scores and pain points. Feed it a CSV of leads or a PostgreSQL table, and it scrapes websites, verifies emails via SMTP, pulls free government data from FCC, NPI, and SAM.gov, then runs local AI models via Ollama for summaries, all without API keys or subscriptions. CLI commands like `forge enrich --file leads.csv` or `forge discover --zip 33602` enrich an existing list or find businesses by location.
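
To make the "scrape a website, pull out emails, write them back to the lead list" step concrete, here is a minimal sketch in Python. It is not the project's code: `extract_emails`, `enrich_rows`, the `website` and `emails` column names, and the sample data are all hypothetical, and page HTML is passed in pre-fetched so the example runs offline. SMTP verification, government data lookups, and the Ollama step are left out.

```python
import csv
import io
import re

# Deliberately simple pattern for illustration; real-world email matching is messier.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(html: str) -> list[str]:
    """Return unique, lowercased addresses found in the text, in first-seen order."""
    seen: dict[str, None] = {}
    for match in EMAIL_RE.findall(html):
        seen.setdefault(match.lower(), None)
    return list(seen)


def enrich_rows(csv_text: str, pages: dict[str, str]) -> list[dict[str, str]]:
    """Add an 'emails' column to each CSV row (hypothetical column names).

    `pages` maps a row's 'website' value to already-fetched HTML.
    """
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        html = pages.get(row.get("website", ""), "")
        row["emails"] = ";".join(extract_emails(html))
        rows.append(row)
    return rows


# Made-up sample input: one lead with a site, one without.
leads = "name,website\nAcme Roofing,acme.example\nNo Site LLC,\n"
pages = {
    "acme.example": (
        "<a href='mailto:Info@acme.example'>Info@acme.example</a> "
        "sales@acme.example"
    )
}
enriched = enrich_rows(leads, pages)
for row in enriched:
    print(row["name"], "->", row["emails"] or "(none found)")
```

Rows without a known page simply get an empty `emails` value, so the output keeps one row per input lead.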

Why is it gaining traction?

It undercuts pricey services like Apollo or ZoomInfo by using public sources and running fully self-hosted on your machine, dodging $10K+ annual fees. Developers love the zero-config CSV mode, resumable pipelines with scraping speeds of 16K records/hour, and optional local AI that avoids cloud costs, plus a dashboard for monitoring and MCP tools for AI assistants. Free GitHub Actions minutes stretch further here than with paid alternatives, making it a practical free open-source CRM booster.
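
One common way to get a resumable pipeline is to checkpoint the IDs of finished records, then skip them on the next run. The sketch below illustrates that idea only. It is an assumption about the general technique, not a description of how the project does it, and `load_done`, `run_resumable`, and the JSON checkpoint file are hypothetical names.

```python
import json
import tempfile
from pathlib import Path


def load_done(checkpoint: Path) -> set[str]:
    """Read the set of already-processed record IDs, if a checkpoint exists."""
    if not checkpoint.exists():
        return set()
    return set(json.loads(checkpoint.read_text()))


def run_resumable(record_ids, checkpoint: Path, process) -> list[str]:
    """Process records not yet in the checkpoint; save progress after each one."""
    done = load_done(checkpoint)
    processed = []
    for record_id in record_ids:
        if record_id in done:
            continue  # finished in an earlier run
        process(record_id)  # may raise; the checkpoint then still holds prior work
        done.add(record_id)
        checkpoint.write_text(json.dumps(sorted(done)))
        processed.append(record_id)
    return processed


def flaky(record_id: str) -> None:
    """Stand-in for an enrichment step that crashes on record 'c'."""
    if record_id == "c":
        raise RuntimeError("simulated crash")


with tempfile.TemporaryDirectory() as tmp:
    checkpoint = Path(tmp) / "done.json"
    ids = ["a", "b", "c", "d"]

    try:
        run_resumable(ids, checkpoint, flaky)
    except RuntimeError:
        pass
    first_pass = sorted(load_done(checkpoint))

    # The second run picks up where the first one stopped.
    second_pass = run_resumable(ids, checkpoint, lambda record_id: None)

print(first_pass, second_pass)
```

For scale, the quoted 16K records/hour works out to roughly 4.4 records per second, so writing the checkpoint after every record, as this sketch does, would likely be batched in a real pipeline.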

Who should use this?

Sales engineers building lead gen pipelines from scratch, indie hackers enriching lead lists without a budget, and marketing ops folks verifying emails for outreach. Also a fit for startups and small IT services firms that need quick tech stack intel on prospects.

Verdict

Promising beta for small-scale enrichment (46% branch test coverage, solid CLI/docs), but a 1.0% credibility score and 16 stars signal early days. Star it if self-hosting appeals. Try it for personal projects, and scale cautiously until it has seen more production use.
