chad-loder

chad-loder / pyhaul

Public

Resumable HTTP downloads with any Python requests library. Pure Python, zero dependencies, crash-safe.

17
1
100% credibility
Found May 03, 2026 at 17 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

pyhaul is a small Python library that adds resumable downloading to existing HTTP clients by managing byte ranges, checkpoints, and integrity checks.

How It Works

1
📥 Need reliable big downloads

You want to grab huge files from the web without starting over if your internet drops or your computer crashes.

2
🛠️ Add pyhaul to your setup

You simply include pyhaul in your project so it can make downloads super safe and resumable.

3
🔗 Link your web tool

You connect pyhaul to the web connector you already use, like your favorite way to fetch online stuff.

4
🚀 Start downloading

You point pyhaul at the big file URL and your save spot, and it begins pulling the data steadily.

5
🔄 Resumes automatically

If something interrupts like a power outage, just restart – pyhaul smartly continues from the exact spot it stopped, keeping everything safe.

Perfect file ready

Your full file lands safely in place, checked for perfection, ready to use without a worry.

Sign up to see the full architecture

4 more

Sign Up Free

Star Growth

See how this repo grew from 17 to 17 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is pyhaul?

pyhaul is a pure Python library for resumable HTTP downloads that integrates with any requests-compatible client like httpx, requests, niquests, or aiohttp. It manages byte-range negotiation, crash-safe checkpointing to disk, and ETag validation, ensuring interrupted downloads resume from the exact byte without corruption. Users get atomic file completion—data stays in a .part file until fully validated and moved.

Why is it gaining traction?

Unlike curl or aria2c, pyhaul has zero dependencies and borrows your existing HTTP session, avoiding new stacks for sync/async apps. Crash-safe persistence survives kills or network flakes, with a simple CLI for one-offs and retry loops you control. It shines for github resumable google drive download or any large-file http resumable download over flaky connections.

Who should use this?

Python devs building download tools for datasets, ML models, or backups where network drops are common. Data engineers handling bulk resumable http downloads from CDNs or S3-like services. CLI users wanting a lightweight alternative to wget for crash-safe, github resumable stream transfers.

Verdict

Solid pick for reliable Python downloads—excellent docs, full test coverage, and MIT license—but at 15 stars and 1.0% credibility, it's alpha-stage; test thoroughly before production. Pair with httpx for async resumable streams.

(178 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.