janetmalzahn / llm-phacking
PublicReplication archive for "Do Claude Code and Codex P-Hack? Sycophancy and Statistical Analysis in Large Language Models"
Replication package for an academic study examining if large language models perform p-hacking in statistical analyses of null-result papers.
How It Works
You stumble upon this research project while reading about AI assistants in science, curious if they tweak stats to get exciting results.
Download the folder from the website to your computer, like saving any other project zip file.
Install the simple R program (free math tool) if you don't have it, just like getting any app.
Find the analysis files inside – everything's prepped with data from real studies.
Click run on the short instructions to instantly generate pictures showing how AI handled the numbers across hundreds of tries.
Enjoy clear graphs revealing if AI chased flashy results, verifying the study's claims yourself.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.