whatsthisaithing

An AI-powered image dataset captioning tool

21
3
100% credibility
Found Feb 02, 2026 at 16 stars
AI Analysis
Python
AI Summary

CaptionFoundry is a desktop app that organizes image folders into datasets, generates AI-powered captions locally, and exports them ready for training AI image models.

How It Works

1
🖥️ Download and launch

Download CaptionFoundry and open it on your computer – it sets itself up automatically.

2
🤖 Connect your AI helper

Link to a free local AI tool on your machine so it can automatically describe your pictures.

3
📁 Add picture folders

Drag or pick folders full of images – the app scans them safely without touching your originals.

4
Group and auto-caption

Select images for a dataset, pick a caption style, and watch AI create smart descriptions for hundreds at once.

5
✏️ Review and edit captions

Browse thumbnails, tweak any description, use bulk fixes or undo changes with easy history.

6
Export ready dataset

Download perfectly numbered images with matching captions, all set for your AI training project.
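
The export step above can be sketched in a few lines of Python. This is a minimal illustration, not CaptionFoundry's actual code: the function name `export_dataset`, the four-digit zero padding, and the one-`.txt`-file-per-image layout are all assumptions chosen because they match a common convention for training datasets. Images are copied, so the originals stay untouched.

```python
import shutil
from pathlib import Path


def export_dataset(captions: dict[Path, str], out_dir: Path, pad: int = 4) -> list[Path]:
    """Copy images into out_dir as 0001.ext, 0002.ext, ... with matching .txt captions.

    `captions` maps each source image path to its caption text.
    Hypothetical helper for illustration only.
    """
    out_dir.mkdir(parents=True, exist_ok=True)
    exported = []
    # Sort the source paths so numbering is stable between runs.
    for index, src in enumerate(sorted(captions), start=1):
        stem = f"{index:0{pad}d}"
        dst = out_dir / f"{stem}{src.suffix.lower()}"
        shutil.copy2(src, dst)  # copy, never move: the original file is left in place
        caption_file = out_dir / f"{stem}.txt"
        caption_file.write_text(captions[src].strip() + "\n", encoding="utf-8")
        exported.append(dst)
    return exported
```

Calling `export_dataset({Path("in/dog.jpg"): "a dog"}, Path("out"))` would write `out/0001.jpg` and `out/0001.txt` under these assumptions.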

AI-Generated Review

What is caption-foundry?

CaptionFoundry is a desktop app for preparing image datasets for AI training, such as LoRA fine-tuning, using local vision models to analyze the images. Drag folders into it to organize images into datasets, auto-generate captions via Ollama or LM Studio, review them with quality scores, and bulk-edit them with regex find-and-replace or prepend/append. It exports numbered datasets with captions as folders or ZIPs. Everything runs offline, built in Python with an Electron UI, and the originals stay untouched.
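
To make the local captioning step concrete, here is a rough sketch of asking a locally running Ollama server to caption one image through its `/api/generate` endpoint. It is not taken from CaptionFoundry: the model name `llava`, the prompt text, and both function names are assumptions, and the app's real request may look quite different.

```python
import base64
import json
import urllib.request

# Ollama's default local address; adjust if your server listens elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_caption_request(image_bytes: bytes, model: str = "llava",
                          prompt: str = "Describe this image in one sentence.") -> dict:
    """Build the JSON body for a single-image caption request (hypothetical helper)."""
    return {
        "model": model,  # assumed: any vision-capable model you have pulled locally
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one complete JSON reply instead of a stream
    }


def request_caption(image_bytes: bytes, **kwargs) -> str:
    """POST the request to the local server and return the generated caption."""
    body = json.dumps(build_caption_request(image_bytes, **kwargs)).encode("utf-8")
    req = urllib.request.Request(OLLAMA_URL, data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req, timeout=120) as resp:
        return json.loads(resp.read())["response"].strip()
```

`request_caption` only works with a local Ollama server running; nothing leaves the machine, which is the point of the offline workflow described above.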

Why is it gaining traction?

It skips cloud APIs, so captioning is free and private, unlike paid services, and far less tedious than captioning by hand. Version history and rollback make experimentation safe. Bulk edits can be previewed before they are applied, and exports can resize images and strip metadata, which streamlines the tedious parts of dataset prep. Among AI-powered GitHub projects, its non-destructive, local-first focus appeals to people who prepare datasets and are tired of scattered scripts.
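
The preview-then-apply and rollback ideas can be illustrated with a small in-memory sketch. Every name here (`CaptionHistory`, `preview_replace`, `apply_changes`) is hypothetical, and how CaptionFoundry actually stores versions is not described in this review; the sketch only shows why previewing a regex edit and keeping prior versions makes bulk changes safe.

```python
import re


class CaptionHistory:
    """Keeps every version of one caption so edits can be rolled back."""

    def __init__(self, initial: str) -> None:
        self._versions = [initial]

    @property
    def current(self) -> str:
        return self._versions[-1]

    def commit(self, text: str) -> None:
        # Only record a new version when the text actually changed.
        if text != self.current:
            self._versions.append(text)

    def rollback(self) -> str:
        # Drop the newest version, but never the original one.
        if len(self._versions) > 1:
            self._versions.pop()
        return self.current


def preview_replace(captions: dict[str, CaptionHistory], pattern: str,
                    replacement: str) -> dict[str, tuple[str, str]]:
    """Return {name: (before, after)} for captions the regex would change; modifies nothing."""
    regex = re.compile(pattern)
    changes = {}
    for name, history in captions.items():
        after = regex.sub(replacement, history.current)
        if after != history.current:
            changes[name] = (history.current, after)
    return changes


def apply_changes(captions: dict[str, CaptionHistory],
                  changes: dict[str, tuple[str, str]]) -> None:
    """Commit the previewed edits as new versions."""
    for name, (_, after) in changes.items():
        captions[name].commit(after)
```

A typical flow would be to call `preview_replace`, inspect the before/after pairs, then call `apply_changes`, with `rollback` available on any caption that turned out worse.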

Who should use this?

ML engineers building vision model datasets, Stable Diffusion artists training LoRAs, or researchers needing captioned image sets without manual labor. Ideal for anyone with local Ollama/LM Studio setups handling hundreds of images for fine-tuning.

Verdict

Promising for local AI-powered image dataset prep, with solid docs, one-click installers, and an intuitive workflow. It is worth trying if you train LoRAs. But 18 stars and 1.0% credibility signal an early-stage project; test it on a small dataset before committing large ones.
