1
|
Liu YS, Kiyang L, Hayward J, Zhang Y, Metes D, Wang M, Svenson LW, Talarico F, Chue P, Li XM, Greiner R, Greenshaw AJ, Cao B. Individualized Prospective Prediction of Opioid Use Disorder. CANADIAN JOURNAL OF PSYCHIATRY. REVUE CANADIENNE DE PSYCHIATRIE 2023; 68:54-63. [PMID: 35892186 PMCID: PMC9720482 DOI: 10.1177/07067437221114094] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
OBJECTIVE Opioid use disorder (OUD) is a chronic relapsing disorder with a problematic pattern of opioid use, affecting nearly 27 million people worldwide. Machine learning (ML)-based prediction of OUD may lead to early detection and intervention. However, most ML prediction studies were not based on representative data sources and prospective validations, limiting their potential to predict future new cases. In the current study, we aimed to develop and prospectively validate an ML model that could predict individual OUD cases based on representative large-scale health data. METHOD We present an ensemble machine-learning model trained on a cross-linked Canadian administrative health data set from 2014 to 2018 (n = 699,164), with validation of model-predicted OUD cases on a hold-out sample from 2014 to 2018 (n = 174,791) and prospective prediction of OUD cases on a non-overlapping sample from 2019 (n = 316,039). We used administrative records of OUD diagnosis for each subject based on International Classification of Diseases (ICD) codes. RESULTS With 6409 OUD cases in 2019 (mean [SD], 45.34 [14.28], 3400 males), our model prospectively predicted OUD cases at a high accuracy (balanced accuracy, 86%, sensitivity, 93%; specificity 79%). In accord with prior findings, the top risk factors for OUD in this model were opioid use indicators and a history of other substance use disorders. CONCLUSION Our study presents an individualized prospective prediction of OUD cases by applying ML to large administrative health datasets. Such prospective predictions based on ML would be essential for potential future clinical applications in the early detection of OUD.
Collapse
Affiliation(s)
- Yang S Liu
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada.,Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada
| | - Lawrence Kiyang
- Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada
| | - Jake Hayward
- Department of Emergency Medicine, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Yanbo Zhang
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Dan Metes
- Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada
| | - Mengzhe Wang
- Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada
| | - Lawrence W Svenson
- Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada.,School of Public Health, 3158University of Alberta, Edmonton, Alberta, Canada.,Division of Preventive Medicine, 3158University of Alberta, Edmonton, Alberta, Canada.,Department of Community Health Sciences, University of Calgary, Calgary, Alberta, Canada
| | - Fernanda Talarico
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Pierre Chue
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Xin-Min Li
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Russell Greiner
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada.,Department of Computing Science, 3158University of Alberta, Edmonton, Alberta, Canada.,Alberta Machine Intelligence Institute (Amii), Edmonton, Alberta, Canada
| | - Andrew J Greenshaw
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada
| | - Bo Cao
- Department of Psychiatry, 3158University of Alberta, Edmonton, Alberta, Canada.,Analytics and Performance Reporting Branch, Ministry of Health, 151965Government of Alberta, Edmonton, Alberta, Canada
| |
Collapse
|
2
|
To D, Joyce C, Kulshrestha S, Sharma B, Dligach D, Churpek M, Afshar M. The Addition of United States Census-Tract Data Does Not Improve the Prediction of Substance Misuse. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2021:1149-1158. [PMID: 35308901 PMCID: PMC8861711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Predictors from the structured data in the electronic health record (EHR) have previously been used for case-identification in substance misuse. We aim to examine the added benefit from census-tract data, a proxy for socioeconomic status, to improve identification. A cohort of 186,611 hospitalizations was derived between 2007 and 2017. Reference labels included alcohol misuse only, opioid misuse only, and both alcohol and opioid misuse. Baseline models were created using 24 EHR variables, and enhanced models were created with the addition of 48 census-tract variables from the United States American Community Survey. The absolute net reclassification index (NRI) was applied to measure the benefit in adding census-tract variables to baseline models. The baseline models already had good calibration and discrimination. Adding census-tract variables provided negligible improvement to sensitivity and specificity and NRI was less than 1% across substance groups. Our results show the census-tract added minimal value to prediction models.
Collapse
Affiliation(s)
- Daniel To
- Stritch School of Medicine, Loyola University Chicago, Maywood, IL
| | - Cara Joyce
- Department of Public Health, Stritch School of Medicine, Loyola University Chicago, Maywood, IL
| | - Sujay Kulshrestha
- Department of Surgery, Loyola University Medical Center, Maywood, IL
| | - Brihat Sharma
- Department of Psychiatry and Behavioral Sciences, Rush University Medical Center, Chicago, IL
| | - Dmitry Dligach
- Department of Public Health, Stritch School of Medicine, Loyola University Chicago, Maywood, IL
- Department of Computer Science, Loyola University Chicago, Chicago, IL
| | - Matthew Churpek
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin
| | - Majid Afshar
- Department of Medicine, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin
| |
Collapse
|
3
|
Sharma B, Dligach D, Swope K, Salisbury-Afshar E, Karnik NS, Joyce C, Afshar M. Publicly available machine learning models for identifying opioid misuse from the clinical notes of hospitalized patients. BMC Med Inform Decis Mak 2020; 20:79. [PMID: 32349766 PMCID: PMC7191715 DOI: 10.1186/s12911-020-1099-y] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 04/22/2020] [Indexed: 12/15/2022] Open
Abstract
Background Automated de-identification methods for removing protected health information (PHI) from the source notes of the electronic health record (EHR) rely on building systems to recognize mentions of PHI in text, but they remain inadequate at ensuring perfect PHI removal. As an alternative to relying on de-identification systems, we propose the following solutions: (1) Mapping the corpus of documents to standardized medical vocabulary (concept unique identifier [CUI] codes mapped from the Unified Medical Language System) thus eliminating PHI as inputs to a machine learning model; and (2) training character-based machine learning models that obviate the need for a dictionary containing input words/n-grams. We aim to test the performance of models with and without PHI in a use-case for an opioid misuse classifier. Methods An observational cohort sampled from adult hospital inpatient encounters at a health system between 2007 and 2017. A case-control stratified sampling (n = 1000) was performed to build an annotated dataset for a reference standard of cases and non-cases of opioid misuse. Models for training and testing included CUI codes, character-based, and n-gram features. Models applied were machine learning with neural network and logistic regression as well as expert consensus with a rule-based model for opioid misuse. The area under the receiver operating characteristic curves (AUROC) were compared between models for discrimination. The Hosmer-Lemeshow test and visual plots measured model fit and calibration. Results Machine learning models with CUI codes performed similarly to n-gram models with PHI. The top performing models with AUROCs > 0.90 included CUI codes as inputs to a convolutional neural network, max pooling network, and logistic regression model. The top calibrated models with the best model fit were the CUI-based convolutional neural network and max pooling network. The top weighted CUI codes in logistic regression has the related terms ‘Heroin’ and ‘Victim of abuse’. Conclusions We demonstrate good test characteristics for an opioid misuse computable phenotype that is void of any PHI and performs similarly to models that use PHI. Herein we share a PHI-free, trained opioid misuse classifier for other researchers and health systems to use and benchmark to overcome privacy and security concerns.
Collapse
Affiliation(s)
- Brihat Sharma
- Department of Computer Science, Loyola University Chicago, Chicago, IL, USA
| | - Dmitriy Dligach
- Department of Computer Science, Loyola University Chicago, Chicago, IL, USA.,Center for Health Outcomes and Informatics Research, Loyola University Chicago, 2160 S. First Avenue, Maywood, IL, 60156, USA
| | - Kristin Swope
- Stritch School of Medicine, Loyola University Chicago, Maywood, IL, USA
| | - Elizabeth Salisbury-Afshar
- Center for Multi-System Solutions to the Opioid Epidemic, American Institute for Research, Chicago, IL, USA
| | - Niranjan S Karnik
- Department of Psychiatry, Rush University Medical Center, Chicago, IL, USA
| | - Cara Joyce
- Center for Health Outcomes and Informatics Research, Loyola University Chicago, 2160 S. First Avenue, Maywood, IL, 60156, USA.,Stritch School of Medicine, Loyola University Chicago, Maywood, IL, USA
| | - Majid Afshar
- Center for Health Outcomes and Informatics Research, Loyola University Chicago, 2160 S. First Avenue, Maywood, IL, 60156, USA. .,Department of Health Informatics and Data Science, Loyola University Chicago, Maywood, IL, USA. .,Department of Medicine, Loyola University Medical Center, Maywood, IL, USA.
| |
Collapse
|
4
|
Hastings JS, Howison M, Inman SE. Predicting high-risk opioid prescriptions before they are given. Proc Natl Acad Sci U S A 2020; 117:1917-1923. [PMID: 31937665 PMCID: PMC6994994 DOI: 10.1073/pnas.1905355117] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Misuse of prescription opioids is a leading cause of premature death in the United States. We use state government administrative data and machine learning methods to examine whether the risk of future opioid dependence, abuse, or poisoning can be predicted in advance of an initial opioid prescription. Our models accurately predict these outcomes and identify particular prior nonopioid prescriptions, medical history, incarceration, and demographics as strong predictors. Using our estimates, we simulate a hypothetical policy which restricts new opioid prescriptions to only those with low predicted risk. The policy's potential benefits likely outweigh costs across demographic subgroups, even for lenient definitions of "high risk." Our findings suggest new avenues for prevention using state administrative data, which could aid providers in making better, data-informed decisions when weighing the medical benefits of opioid therapy against the risks.
Collapse
Affiliation(s)
- Justine S Hastings
- Research Improving People's Lives, Providence, RI 02903;
- Watson Institute for International and Public Affairs, Brown University, Providence, RI 02912
- Department of Economics, Brown University, Providence, RI 02912
- National Bureau of Economic Research, Cambridge, MA 02138
| | - Mark Howison
- Research Improving People's Lives, Providence, RI 02903
- Watson Institute for International and Public Affairs, Brown University, Providence, RI 02912
| | - Sarah E Inman
- Research Improving People's Lives, Providence, RI 02903
- School of International and Public Affairs, Columbia University, New York, NY 10027
| |
Collapse
|
5
|
Afshar M, Joyce C, Dligach D, Sharma B, Kania R, Xie M, Swope K, Salisbury-Afshar E, Karnik NS. Subtypes in patients with opioid misuse: A prognostic enrichment strategy using electronic health record data in hospitalized patients. PLoS One 2019; 14:e0219717. [PMID: 31310611 PMCID: PMC6634397 DOI: 10.1371/journal.pone.0219717] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 06/28/2019] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND Approaches are needed to better delineate the continuum of opioid misuse that occurs in hospitalized patients. A prognostic enrichment strategy with latent class analysis (LCA) may facilitate treatment strategies in subtypes of opioid misuse. We aim to identify subtypes of patients with opioid misuse and examine the distinctions between the subtypes by examining patient characteristics, topic models from clinical notes, and clinical outcomes. METHODS This was an observational study of inpatient hospitalizations at a tertiary care center between 2007 and 2017. Patients with opioid misuse were identified using an operational definition applied to all inpatient encounters. LCA with eight class-defining variables from the electronic health record (EHR) was applied to identify subtypes in the cohort of patients with opioid misuse. Comparisons between subtypes were made using the following approaches: (1) descriptive statistics on patient characteristics and healthcare utilization using EHR data and census-level data; (2) topic models with natural language processing (NLP) from clinical notes; (3) association with hospital outcomes. FINDINGS The analysis cohort was 6,224 (2.7% of all hospitalizations) patient encounters with opioid misuse with a data corpus of 422,147 clinical notes. LCA identified four subtypes with differing patient characteristics, topics from the clinical notes, and hospital outcomes. Class 1 was categorized by high hospital utilization with known opioid-related conditions (36.5%); Class 2 included patients with illicit use, low socioeconomic status, and psychoses (12.8%); Class 3 contained patients with alcohol use disorders with complications (39.2%); and class 4 consisted of those with low hospital utilization and incidental opioid misuse (11.5%). The following hospital outcomes were the highest for each subtype when compared against the other subtypes: readmission for class 1 (13.9% vs. 10.5%, p<0.01); discharge against medical advice for class 2 (12.3% vs. 5.3%, p<0.01); and in-hospital death for classes 3 and 4 (3.2% vs. 1.9%, p<0.01). CONCLUSIONS A 4-class latent model was the most parsimonious model that defined clinically interpretable and relevant subtypes for opioid misuse. Distinct subtypes were delineated after examining multiple domains of EHR data and applying methods in artificial intelligence. The approach with LCA and readily available class-defining substance use variables from the EHR may be applied as a prognostic enrichment strategy for targeted interventions.
Collapse
Affiliation(s)
- Majid Afshar
- Department of Public Health Sciences, Loyola University, Maywood, Illinois, United States of America
- Center for Health Outcomes and Informatics Research, Loyola University, Maywood, Illinois, United States of America
- Stritch School of Medicine, Loyola University, Maywood, Illinois, United States of America
| | - Cara Joyce
- Department of Public Health Sciences, Loyola University, Maywood, Illinois, United States of America
- Center for Health Outcomes and Informatics Research, Loyola University, Maywood, Illinois, United States of America
- Stritch School of Medicine, Loyola University, Maywood, Illinois, United States of America
| | - Dmitriy Dligach
- Department of Public Health Sciences, Loyola University, Maywood, Illinois, United States of America
- Center for Health Outcomes and Informatics Research, Loyola University, Maywood, Illinois, United States of America
- Department of Computer Science, Loyola University Medical Center, Maywood, Illinois, United States of America
| | - Brihat Sharma
- Department of Computer Science, Loyola University Medical Center, Maywood, Illinois, United States of America
| | - Robert Kania
- Department of Computer Science, Loyola University Medical Center, Maywood, Illinois, United States of America
| | - Meng Xie
- Department of Mathematics and Statistics, Loyola University, Chicago, Illinois, United States of America
| | - Kristin Swope
- Department of Public Health Sciences, Loyola University, Maywood, Illinois, United States of America
- Stritch School of Medicine, Loyola University, Maywood, Illinois, United States of America
| | - Elizabeth Salisbury-Afshar
- Center for Multi-System Solutions to the Opioid Epidemic, American Institute for Research, Chicago, Illinois, United States of America
| | - Niranjan S. Karnik
- Department of Psychiatry & Behavioral Sciences, Rush University Medical Center, Chicago, Illinois, United States of America
| |
Collapse
|