Leveraging Machine Learning to Predict High-Risk Opioid Overdose Cases in Massachusetts: A Jurisdiction-Level Analysis

machine learning

Authors

  • Awele Okolie School of Computing and Data Science, Wentworth Institute of Technology, United States
  • Dumebi Okolie Department of Finance and Economics, Faculty of Business and Law, Manchester Metropolitan University, United Kingdom
  • Callistus Obunadike Department of Computer Science and Quantitative Methods, Austin Peay State University, Tennessee, United States
  • Darlington Ekweli Department of Health Care Administration, University of the Potomac, Washington, DC, United States
  • Bello Abdul-Waliyyu Department of Computer Science and Quantitative Methods, Austin Peay State University, Tennessee, USA, United States
  • Paschal Alumona Booth School of Business, University of Chicago, United States
Vol. 13 No. 10 (2025)
Engineering and Computer Science
October 21, 2025

Downloads

The opioid crisis remains a significant public health challenge in the United States, characterized by evolving drug supply dynamics and significant geographic and demographic disparities. This study leverages machine learning to analyze jurisdiction-level data from Massachusetts to identify the primary drivers of opioid overdose deaths. Using a descriptive quantitative design, this research employs a supervised machine learning framework, comparing Random Forest, Gradient Boosting, and Ridge Regression models to predict state-level opioid overdose rates. The analysis focuses on model interpretability to determine the relative importance of various substance use rates and demographic factors. The Gradient Boosting Regressor demonstrated exceptional predictive accuracy, achieving a Mean Absolute Error (MAE) of 0.81 and a coefficient of determination (R²) of 0.9930. The feature importance analysis revealed a singular, dominant predictor: the stimulant rate. This finding indicates that the co-use of stimulants is the most critical factor driving opioid-related fatalities, overshadowing the predictive power of other variables, including the fentanyl dominance ratio, heroin rate, and demographic characteristics. The results strongly suggest that the opioid epidemic has transitioned into a polysubstance crisis. Consequently, effective public health interventions must shift from an opioid-centric focus to integrated strategies that address the concurrent use of opioids and stimulants. This study underscores the power of machine learning to provide clear, actionable insights for tackling complex public health crises and recommends the reallocation of resources toward programs that target polysubstance use.