This repository contains the analysis pipeline for a published medical research study that predicts gastric cancer recurrence after surgery. The project trains six machine learning models on patient data, handles missing laboratory values, and validates predictions on independent patient groups. Researchers can use this code to reproduce the study's findings or apply the same methods to their own patient datasets. The pipeline produces performance charts, calibration curves, and statistical metrics while following strict medical reporting guidelines (TRIPOD+AI and PROBAST). Patient data is not included due to privacy protections.
How It Works
A researcher shares a study about predicting gastric cancer return after surgery using machine learning, and you want to explore it further.
The study trains six different prediction models on patient data to guess whether cancer will come back, then tests them on separate patient groups.
The best model achieved 83% accuracy in predicting recurrence, with clear charts showing calibration and feature importance.
You organize your hospital's gastric cancer patient records into a spreadsheet with the same format, ready for analysis.
Launch the prediction pipeline with recommended settings to generate all charts and metrics automatically.
Adjust how missing values are handled or enable hyperparameter tuning for your specific dataset.
The pipeline creates prediction charts, performance metrics, and comparison tables saved to your computer.
Your analysis is complete with reproducible results following medical research standards, ready for clinical review or publication.
Star Growth
Repurpose is a Pro feature
Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.
Unlock RepurposeSimilar repos coming soon.