openclaw

openclaw / clawpdf

Public

Zero-dependency PDFium WebAssembly bindings for Node and browsers.

44
2
100% credibility
Found May 30, 2026 at 44 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
TypeScript
AI Summary

clawpdf is a zero-dependency PDF processing library that provides WebAssembly bindings for PDFium, allowing text extraction, page rendering, and PNG output in Node.js and browsers without native addons.

Star Growth

See how this repo grew from 44 to 44 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is clawpdf?

clawpdf is a TypeScript library that wraps PDFium in WebAssembly, letting you load PDFs, extract text, and render pages to PNG images. It runs in both Node and browsers without native addons, postinstall scripts, or external dependencies. The package gives you a clean API: open a PDF, grab its metadata, read text, or render pages at any DPI. It also ships a CLI tool for quick extraction and rendering directly from the terminal.

Why is it gaining traction?

The zero-dependency pitch is the main draw. You get PDFium's rendering power without pulling in a heavy transitive dependency tree. The performance benchmarks are compelling--the README shows memory usage dropping by roughly 40% compared to previous stacks, with render times cut in half on some documents. The text-first extraction mode is clever: it tries to pull text from pages first and only falls back to rendering images when text is sparse. That matters for AI pipelines where you want machine-readable text before falling back to vision. The browser entry point is straightforward too, with WASM handling automatically wired up.

Who should use this?

Backend developers building PDF processing pipelines will get the most value. If you're feeding PDFs to LLMs or extracting data for search indexing, the text-plus-image fallback pattern handles scanned documents gracefully. Frontend teams needing client-side PDF rendering without server roundtrips will appreciate the browser build. The CLI makes this useful for quick one-off tasks like converting a PDF page to an image or extracting text from a password-protected file.

Verdict

clawpdf delivers on its promise of simple, dependency-free PDF processing in TypeScript. The API is ergonomic and the performance numbers back up the claims. At 44 stars with a 1.0% credibility score, this is an early-stage project from a small team, so production use warrants some caution around maintenance and support. The documentation is solid and the codebase appears well-structured, but the limited community footprint means you should test it against your specific PDFs before committing. Worth evaluating for any Node or browser PDF workflow where you want PDFium without the overhead.

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.