riddleling

riddleling / docOCR

Public

macOS CLI and HTTP OCR tool for converting document images to Markdown.

16
0
89% credibility
Found May 28, 2026 at 16 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Swift
AI Summary

docOCR is a macOS tool that converts document images into editable Markdown text, either through direct command-line conversion or via a local web interface for browser-based uploads.

How It Works

1
📄 You have images of documents

You have photos of documents, scanned pages, or images of text that you need converted to editable text.

2
💻 You install the tool on your Mac

You install the tool on your Mac by building it from the source code.

3
🔄 You choose how to process your images

You choose between converting files directly or starting a web page to upload images.

4
Two paths to get your text
⌨️
Convert files directly

Run a simple command to convert your images into text files saved next to the originals.

🌐
Use the web upload page

Start a local web page where you can upload images and see results in your browser.

5
📝 Your text files are created

The tool reads each image and saves the extracted text as a Markdown file right next to your original image.

6
🖱️ You upload and view results

You open the web page, pick an image from your computer, and click upload to see the extracted text.

Your text is ready to use

Your document images have been converted into clean, readable text that you can edit, search, or save anywhere.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 16 to 16 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is docOCR?

docOCR is a macOS command-line tool that converts document images into Markdown text. Built in Swift, it uses Apple's Vision framework to perform OCR locally on your Mac without sending data to external services. It works two ways: as a batch CLI tool that processes image files and outputs .md files, or as a local HTTP server with a web UI and JSON API for programmatic access.

Why is it gaining traction?

The local-first approach is the main draw -- your documents never leave your machine. The Markdown output preserves document structure including paragraphs, lists, and tables. The dual interface (CLI and HTTP server) covers both quick one-off conversions and integration into larger workflows. Using Apple's native document recognition API means solid results without third-party dependencies or API keys.

Who should use this?

Researchers digitizing paper documents, developers building document processing pipelines, and anyone who needs to extract text from images without cloud services. The CLI is useful for batch processing folders of scans. The HTTP server suits teams wanting a simple internal OCR endpoint. Not for Windows or Linux users -- this is macOS-only.

Verdict

docOCR is a focused tool with a clear purpose and solid execution. At version 0.1.1 with only 16 stars, it's early-stage and the test coverage is minimal. The 0.9% credibility score reflects this nascent status. Worth trying if you need local OCR on macOS, but monitor the project for maturity before betting on it for production workflows.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.