ltczding-gif

Batch-download reference PDFs from a DOI or paper PDF using Crossref and your institutional Edge session.

17
1
100% credibility
Found May 12, 2026 at 17 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

A Python script that automates downloading PDF references from a scientific paper's DOI list using the user's institutional browser access via publisher-specific paths.

How It Works

1
📚 Tired of chasing references

You discover this helper tool because manually hunting down dozens of PDF files from a paper's reference list wastes your whole afternoon.

2
💻 Get it ready on your computer

Download the tool to your computer, prepare it once by adding your email and pointing to your reference collection if you have one, and make sure your web browser is closed.

3
🔍 Feed it a paper ID or file

Type in the unique ID (DOI) of your main paper or drag in the PDF file, and watch it spring to life.

4
🌐 It gathers the list and downloads

The tool peeks at the paper's reference list online, checks each one, and uses your university login in the browser to grab the PDFs your school already pays for.

5
📁 See what worked and what didn't

It sorts everything into a folder with all the successful downloads, plus a clear list noting any that need your manual touch due to login redirects or other hiccups.

🎉 All set with your PDFs

Now you have a complete collection of reference files right at hand, saving hours and letting you focus on your research.

Sign up to see the full architecture

4 more

Sign Up Free

Star Growth

See how this repo grew from 17 to 17 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is ref-downloader?

ref-downloader is a Python batch-download tool that takes a DOI or paper PDF and pulls every reference PDF using Crossref metadata and your institutional Edge browser session. It automates chasing paywalled refs you already have access to, landing clean PDFs in organized folders with a detailed report on successes, failures, and manual follow-ups. No API keys needed—just your existing login cookies carry through via Playwright.

Why is it gaining traction?

Unlike Zotero's one-by-one PDF hunts or SciHub's license-ignoring grabs, it leverages your real institutional session for legit access, handles 17+ publisher quirks like Wiley PDFDirect or Elsevier clicks, and resumes after VPN drops or crashes. Batch-download fans on GitHub appreciate the CLI simplicity (`python run_ref_downloader.py `), clear failure reasons in CSV/JSON, and config for SSO detection—far beyond generic scrapers.

Who should use this?

Academic researchers in chemistry, physics, or materials science grinding through paper refs their university subscribes to. PhD students batch-processing literature for reviews, or devs prototyping research agents needing DOI-to-PDF pipelines. Skip if you're on macOS/Linux or lack Edge.

Verdict

Solid beta for Windows+Edge users tired of manual PDF hunts (17 stars, strong README), but 1.0% credibility score flags low adoption—test on your subs first. PRs could expand publishers and platforms; worth forking if it clicks.

(178 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.