Parsel is a PHP library that reads and extracts content from documents like PDFs, images, and Office files. It processes everything locally on your computer, so your files never leave your machine. You can get simple text, detailed structured data with text positions and fonts, or even page screenshots. It supports OCR for scanned documents, lets you choose specific pages, and works with files on disk or raw uploaded bytes.
How It Works
You have a PDF, image, or Office file that needs to be read and understood by your application.
You install Parsel into your PHP project using Composer, following the simple setup instructions.
You point Parsel to your file on disk or pass the raw bytes directly, like an uploaded file.
You decide whether you want plain text, structured data with positions, or page screenshots.
You call the text method to get clean, readable text without page markers.
You call the parse method to get detailed information including text positions, fonts, and page layouts.
You call the screenshots method to render page images into a folder.
You can enable OCR for scanned images, set the resolution quality, or limit parsing to specific pages.
Parsel returns exactly what you asked for — text, structured data, or images — all processed locally on your machine.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.