Cancer Screening Outreach Guided by Machine Learning: The Benefits of Proactive Care
Minje Park et al.
Abstract
Problem definition: With the advance of data analytics, many disease prediction models have been developed with the intent of detecting diseases earlier and improving patient outcomes through earlier treatment. The operationalization of interventions and care based on these predictive models is critical to attaining these goals. We study the real-world effects of a machine learning-guided colorectal cancer screening outreach program deployed at a health system in Pennsylvania. Methodology/results: Using a regression discontinuity design based on the predicted risk score for having cancer, we find that the program increases the likelihood of colonoscopy uptake in three and six months by 6.0 percentage points (214% increase relative to the control sample within the bandwidth) and 6.9 percentage points (117% increase), respectively. Importantly, we also find significant effects on mortality. We estimate that the program decreases two-year mortality by 6.2 percentage points (43% decrease). Managerial implications: Our finding suggests that a proactive cancer screening outreach program where individuals are selected for intervention based on a machine learning algorithm could significantly improve patient outcomes in addition to achieving higher disease detection rates. Our analysis demonstrates an analytical framework for rigorously evaluating machine learning-aided outreach programs for other cancers and diseases. Establishing unbiased estimates of the impact of machine learning-aided screening outreach is critical for capacity planning of screening resources, such as colonoscopies. History: This paper has been accepted as part of the 2025 Manufacturing & Service Operations Management Practice-Based Research Competition. Supplemental Material: The online appendix is available at https://doi.org/10.1287/msom.2024.1353 .
Evidence weight
Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40
| F · citation impact | 0.50 × 0.4 = 0.20 |
| M · momentum | 0.50 × 0.15 = 0.07 |
| V · venue signal | 0.50 × 0.05 = 0.03 |
| R · text relevance † | 0.50 × 0.4 = 0.20 |
† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.