1
|
Park Y, Park S, Lee M. Effectiveness of artificial intelligence in detecting and managing depressive disorders: Systematic review. J Affect Disord 2024; 361:445-456. [PMID: 38889858 DOI: 10.1016/j.jad.2024.06.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 11/27/2023] [Accepted: 06/14/2024] [Indexed: 06/20/2024]
Abstract
OBJECTIVES This study underscores the importance of exploring AI's creative applications in treating depressive disorders to revolutionize mental health care. Through innovative integration of AI technologies, the research confirms their positive effects on preventing, diagnosing, and treating depression. The systematic review establishes an evidence base for AI in depression management, offering directions for effective interventions. METHODS This systematic literature review investigates the effectiveness of AI in depression management by analyzing studies from January 1, 2017, to May 31, 2022. Utilizing search engines like IEEE Xplore, PubMed, and Web of Science, the review focused on keywords such as Depression/Mental Health, Machine Learning/Artificial Intelligence, and Prediction/Diagnosis. The analysis of 95 documents involved classification based on use, data type, and algorithm type. RESULTS The study revealed that AI in depression management excelled in accuracy, particularly in monitoring and prediction. Biomarker-derived data demonstrated the highest accuracy, with the CNN algorithm proving most effective. The findings affirm the therapeutic benefits of AI, including treatment, detection, and disease prediction, highlighting its potential in analyzing monitored data for depression management. LIMITATIONS This study exclusively examined the application of AI in individuals with depressive disorders. Interpretation should be cautious due to the limited scope of subjects to this specific population. CONCLUSIONS To introduce digital healthcare and therapies for ongoing depression management, it's crucial to present empirical evidence on the medical fee payment system, safety, and efficacy. These findings support enhanced medical accessibility through digital healthcare, offering personalized disease management for patients seeking non-face-to-face treatment.
Collapse
Affiliation(s)
- Yoonseo Park
- Department of Convergence Healthcare Medicine, Ajou University, Suwon, South Korea
| | - Sewon Park
- Department of Medical Science, Ajou University School of Medicine, Suwon, South Korea
| | - Munjae Lee
- Department of Medical Science, Ajou University School of Medicine, Suwon, South Korea.
| |
Collapse
|
2
|
Tu JB, Liao WJ, Liu WC, Gao XH. Using machine learning techniques to predict the risk of osteoporosis based on nationwide chronic disease data. Sci Rep 2024; 14:5245. [PMID: 38438569 PMCID: PMC10912338 DOI: 10.1038/s41598-024-56114-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 03/01/2024] [Indexed: 03/06/2024] Open
Abstract
Osteoporosis is a major public health concern that significantly increases the risk of fractures. The aim of this study was to develop a Machine Learning based predictive model to screen individuals at high risk of osteoporosis based on chronic disease data, thus facilitating early detection and personalized management. A total of 10,000 complete patient records of primary healthcare data in the German Disease Analyzer database (IMS HEALTH) were included, of which 1293 diagnosed with osteoporosis and 8707 without the condition. The demographic characteristics and chronic disease data, including age, gender, lipid disorder, cancer, COPD, hypertension, heart failure, CHD, diabetes, chronic kidney disease, and stroke were collected from electronic health records. Ten different machine learning algorithms were employed to construct the predictive mode. The performance of the model was further validated and the relative importance of features in the model was analyzed. Out of the ten machine learning algorithms, the Stacker model based on Logistic Regression, AdaBoost Classifier, and Gradient Boosting Classifier demonstrated superior performance. The Stacker model demonstrated excellent performance through ten-fold cross-validation on the training set and ROC curve analysis on the test set. The confusion matrix, lift curve and calibration curves indicated that the Stacker model had optimal clinical utility. Further analysis on feature importance highlighted age, gender, lipid metabolism disorders, cancer, and COPD as the top five influential variables. In this study, a predictive model for osteoporosis based on chronic disease data was developed using machine learning. The model shows great potential in early detection and risk stratification of osteoporosis, ultimately facilitating personalized prevention and management strategies.
Collapse
Affiliation(s)
- Jun-Bo Tu
- Department of Orthopaedics, Xinfeng County People's Hospital, Jiangxi, 341600, Xinfeng, China
| | - Wei-Jie Liao
- Department of ICU, GanZhou People's Hospital, GanZhou, 341000, Jiangxi, China
| | - Wen-Cai Liu
- Department of Orthopaedics, Shanghai Sixth People's Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, 600 Yishan Road, Shanghai, 200233, China.
| | - Xing-Hua Gao
- Department of Orthopaedics, Guangzhou First People's Hospital, South China University of Technology, Guangzhou, 510180, China.
| |
Collapse
|
3
|
Huang Y, Guo J, Chen WH, Lin HY, Tang H, Wang F, Xu H, Bian J. A scoping review of fair machine learning techniques when using real-world data. J Biomed Inform 2024; 151:104622. [PMID: 38452862 PMCID: PMC11146346 DOI: 10.1016/j.jbi.2024.104622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 01/19/2024] [Accepted: 03/03/2024] [Indexed: 03/09/2024]
Abstract
OBJECTIVE The integration of artificial intelligence (AI) and machine learning (ML) in health care to aid clinical decisions is widespread. However, as AI and ML take important roles in health care, there are concerns about AI and ML associated fairness and bias. That is, an AI tool may have a disparate impact, with its benefits and drawbacks unevenly distributed across societal strata and subpopulations, potentially exacerbating existing health inequities. Thus, the objectives of this scoping review were to summarize existing literature and identify gaps in the topic of tackling algorithmic bias and optimizing fairness in AI/ML models using real-world data (RWD) in health care domains. METHODS We conducted a thorough review of techniques for assessing and optimizing AI/ML model fairness in health care when using RWD in health care domains. The focus lies on appraising different quantification metrics for accessing fairness, publicly accessible datasets for ML fairness research, and bias mitigation approaches. RESULTS We identified 11 papers that are focused on optimizing model fairness in health care applications. The current research on mitigating bias issues in RWD is limited, both in terms of disease variety and health care applications, as well as the accessibility of public datasets for ML fairness research. Existing studies often indicate positive outcomes when using pre-processing techniques to address algorithmic bias. There remain unresolved questions within the field that require further research, which includes pinpointing the root causes of bias in ML models, broadening fairness research in AI/ML with the use of RWD and exploring its implications in healthcare settings, and evaluating and addressing bias in multi-modal data. CONCLUSION This paper provides useful reference material and insights to researchers regarding AI/ML fairness in real-world health care data and reveals the gaps in the field. Fair AI/ML in health care is a burgeoning field that requires a heightened research focus to cover diverse applications and different types of RWD.
Collapse
Affiliation(s)
- Yu Huang
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, USA
| | - Jingchuan Guo
- Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA
| | - Wei-Han Chen
- Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA
| | - Hsin-Yueh Lin
- Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA
| | - Huilin Tang
- Pharmaceutical Outcomes & Policy, University of Florida, Gainesville, FL, USA
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA; Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, New York, NY, USA
| | - Hua Xu
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
4
|
Feng W, Wu H, Ma H, Tao Z, Xu M, Zhang X, Lu S, Wan C, Liu Y. Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study. J Am Med Inform Assoc 2024; 31:445-455. [PMID: 38062850 PMCID: PMC10797279 DOI: 10.1093/jamia/ocad228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 11/13/2023] [Accepted: 11/21/2023] [Indexed: 01/22/2024] Open
Abstract
OBJECTIVE Due to heterogeneity and limited medical data in primary healthcare services (PHS), assessing the psychological risk of type 2 diabetes mellitus (T2DM) patients in PHS is difficult. Using unsupervised contrastive pre-training, we proposed a deep learning framework named depression and anxiety prediction (DAP) to predict depression and anxiety in T2DM patients. MATERIALS AND METHODS The DAP model consists of two sub-models. Firstly, the pre-trained model of DAP used unlabeled discharge records of 85 085 T2DM patients from the First Affiliated Hospital of Nanjing Medical University for unsupervised contrastive learning on heterogeneous electronic health records (EHRs). Secondly, the fine-tuned model of DAP used case-control cohorts (17 491 patients) selected from 149 596 T2DM patients' EHRs in the Nanjing Health Information Platform (NHIP). The DAP model was validated in 1028 patients from PHS in NHIP. Evaluation included receiver operating characteristic area under the curve (ROC-AUC) and precision-recall area under the curve (PR-AUC), and decision curve analysis (DCA). RESULTS The pre-training step allowed the DAP model to converge at a faster rate. The fine-tuned DAP model significantly outperformed the baseline models (logistic regression, extreme gradient boosting, and random forest) with ROC-AUC of 0.91±0.028 and PR-AUC of 0.80±0.067 in 10-fold internal validation, and with ROC-AUC of 0.75 ± 0.045 and PR-AUC of 0.47 ± 0.081 in external validation. The DCA indicate the clinical potential of the DAP model. CONCLUSION The DAP model effectively predicted post-discharge depression and anxiety in T2DM patients from PHS, reducing data fragmentation and limitations. This study highlights the DAP model's potential for early detection and intervention in depression and anxiety, improving outcomes for diabetes patients.
Collapse
Affiliation(s)
- Wei Feng
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Honghan Wu
- Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom
- The Alan Turing Institute, London, NW1 2DB, United Kingdom
| | - Hui Ma
- Department of Medical Psychology, Nanjing Brain Hospital affiliated with Nanjing Medical University, Nanjing, Jiangsu, 210024, China
| | - Zhenhuan Tao
- Department of Planning, Nanjing Health Information Center, Nanjing, Jiangsu, 210003, China
| | - Mengdie Xu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Xin Zhang
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| | - Shan Lu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| | - Cheng Wan
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Yun Liu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| |
Collapse
|
5
|
Zhu Y, Bi D, Saunders M, Ji Y. Prediction of chronic kidney disease progression using recurrent neural network and electronic health records. Sci Rep 2023; 13:22091. [PMID: 38086905 PMCID: PMC10716428 DOI: 10.1038/s41598-023-49271-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 12/06/2023] [Indexed: 12/18/2023] Open
Abstract
Chronic kidney disease (CKD) is a progressive loss in kidney function. Early detection of patients who will progress to late-stage CKD is of paramount importance for patient care. To address this, we develop a pipeline to process longitudinal electronic heath records (EHRs) and construct recurrent neural network (RNN) models to predict CKD progression from stages II/III to stages IV/V. The RNN model generates predictions based on time-series records of patients, including repeated lab tests and other clinical variables. Our investigation reveals that using a single variable, the recorded estimated glomerular filtration rate (eGFR) over time, the RNN model achieves an average area under the receiver operating characteristic curve (AUROC) of 0.957 for predicting future CKD progression. When additional clinical variables, such as demographics, vital information, lab test results, and health behaviors, are incorporated, the average AUROC increases to 0.967. In both scenarios, the standard deviation of the AUROC across cross-validation trials is less than 0.01, indicating a stable and high prediction accuracy. Our analysis results demonstrate the proposed RNN model outperforms existing standard approaches, including static and dynamic Cox proportional hazards models, random forest, and LightGBM. The utilization of the RNN model and the time-series data of previous eGFR measurements underscores its potential as a straightforward and effective tool for assessing the clinical risk of CKD patients concerning their disease progression.
Collapse
Affiliation(s)
- Yitan Zhu
- Computing, Environment and Life Sciences, Argonne National Laboratory, 9700 S Cass Ave, Lemont, IL, 60439, USA.
| | - Dehua Bi
- Department of Public Health Sciences, The University of Chicago, 5841 South Maryland Ave, MC 2000, Chicago, IL, 60637, USA
| | - Milda Saunders
- Department of Medicine, The University of Chicago, 5841 South Maryland Ave, MC 2007, Chicago, IL, 60637, USA
| | - Yuan Ji
- Department of Public Health Sciences, The University of Chicago, 5841 South Maryland Ave, MC 2000, Chicago, IL, 60637, USA.
| |
Collapse
|
6
|
Nickson D, Meyer C, Walasek L, Toro C. Prediction and diagnosis of depression using machine learning with electronic health records data: a systematic review. BMC Med Inform Decis Mak 2023; 23:271. [PMID: 38012655 PMCID: PMC10680172 DOI: 10.1186/s12911-023-02341-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 10/15/2023] [Indexed: 11/29/2023] Open
Abstract
BACKGROUND Depression is one of the most significant health conditions in personal, social, and economic impact. The aim of this review is to summarize existing literature in which machine learning methods have been used in combination with Electronic Health Records for prediction of depression. METHODS Systematic literature searches were conducted within arXiv, PubMed, PsycINFO, Science Direct, SCOPUS and Web of Science electronic databases. Searches were restricted to information published after 2010 (from 1st January 2011 onwards) and were updated prior to the final synthesis of data (27th January 2022). RESULTS Following the PRISMA process, the initial 744 studies were reduced to 19 eligible for detailed evaluation. Data extraction identified machine learning methods used, types of predictors used, the definition of depression, classification performance achieved, sample size, and benchmarks used. Area Under the Curve (AUC) values more than 0.9 were claimed, though the average was around 0.8. Regression methods proved as effective as more developed machine learning techniques. LIMITATIONS The categorization, definition, and identification of the numbers of predictors used within models was sometimes difficult to establish, Studies were largely Western Educated Industrialised, Rich, Democratic (WEIRD) in demography. CONCLUSION This review supports the potential use of machine learning techniques with Electronic Health Records for the prediction of depression. All the selected studies used clinically based, though sometimes broad, definitions of depression as their classification criteria. The reported performance of the studies was comparable to or even better than that found in primary care. There are concerns with generalizability and interpretability.
Collapse
Affiliation(s)
| | - Caroline Meyer
- Warwick Medical School, University of Warwick, Coventry, UK
| | - Lukasz Walasek
- Department of Psychology, University of Warwick, Coventry, UK
| | - Carla Toro
- Warwick Medical School, University of Warwick, Coventry, UK
| |
Collapse
|
7
|
Qu Z, Wang Y, Guo D, He G, Sui C, Duan Y, Zhang X, Lan L, Meng H, Wang Y, Liu X. Identifying depression in the United States veterans using deep learning algorithms, NHANES 2005-2018. BMC Psychiatry 2023; 23:620. [PMID: 37612646 PMCID: PMC10463693 DOI: 10.1186/s12888-023-05109-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 08/13/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND Depression is a common mental health problem among veterans, with high mortality. Despite the numerous conducted investigations, the prediction and identification of risk factors for depression are still severely limited. This study used a deep learning algorithm to identify depression in veterans and its factors associated with clinical manifestations. METHODS Our data originated from the National Health and Nutrition Examination Survey (2005-2018). A dataset of 2,546 veterans was identified using deep learning and five traditional machine learning algorithms with 10-fold cross-validation. Model performance was assessed by examining the area under the subject operating characteristic curve (AUC), accuracy, recall, specificity, precision, and F1 score. RESULTS Deep learning had the highest AUC (0.891, 95%CI 0.869-0.914) and specificity (0.906) in identifying depression in veterans. Further study on depression among veterans of different ages showed that the AUC values for deep learning were 0.929 (95%CI 0.904-0.955) in the middle-aged group and 0.924(95%CI 0.900-0.948) in the older age group. In addition to general health conditions, sleep difficulties, memory impairment, work incapacity, income, BMI, and chronic diseases, factors such as vitamins E and C, and palmitic acid were also identified as important influencing factors. CONCLUSIONS Compared with traditional machine learning methods, deep learning algorithms achieved optimal performance, making it conducive for identifying depression and its risk factors among veterans.
Collapse
Affiliation(s)
- Zihan Qu
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Yashan Wang
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Dingjie Guo
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Guangliang He
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Chuanying Sui
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Yuqing Duan
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Xin Zhang
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Linwei Lan
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Hengyu Meng
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China
| | - Yajing Wang
- School of Computer Science, McGill University, Montreal, H3A 0G4, Canada
| | - Xin Liu
- Department of Epidemiology and Statistics, School of Public Health, Jilin University, Changchun, 130021, China.
| |
Collapse
|
8
|
Chen J, Engelhard M, Henao R, Berchuck S, Eichner B, Perrin EM, Sapiro G, Dawson G. Enhancing early autism prediction based on electronic records using clinical narratives. J Biomed Inform 2023; 144:104390. [PMID: 37182592 PMCID: PMC10526711 DOI: 10.1016/j.jbi.2023.104390] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 04/14/2023] [Accepted: 05/09/2023] [Indexed: 05/16/2023]
Abstract
Recent work has shown that predictive models can be applied to structured electronic health record (EHR) data to stratify autism likelihood from an early age (<1 year). Integrating clinical narratives (or notes) with structured data has been shown to improve prediction performance in other clinical applications, but the added predictive value of this information in early autism prediction has not yet been explored. In this study, we aimed to enhance the performance of early autism prediction by using both structured EHR data and clinical narratives. We built models based on structured data and clinical narratives separately, and then an ensemble model that integrated both sources of data. We assessed the predictive value of these models from Duke University Health System over a 14-year span to evaluate ensemble models predicting later autism diagnosis (by age 4 years) from data collected from ages 30 to 360 days. Our sample included 11,750 children above by age 3 years (385 meeting autism diagnostic criteria). The ensemble model for autism prediction showed superior performance and at age 30 days achieved 46.8% sensitivity (95% confidence interval, CI: 22.0%, 52.9%), 28.0% positive predictive value (PPV) at high (90%) specificity (CI: 2.0%, 33.1%), and AUC4 (with at least 4-year follow-up for controls) reaching 0.769 (CI: 0.715, 0.811). Prediction by 360 days achieved 44.5% sensitivity (CI: 23.6%, 62.9%), and 13.7% PPV at high (90%) specificity (CI: 9.6%, 18.9%), and AUC4 reaching 0.797 (CI: 0.746, 0.840). Results show that incorporating clinical narratives in early autism prediction achieved promising accuracy by age 30 days, outperforming models based on structured data only. Furthermore, findings suggest that additional features learned from clinician narratives might be hypothesis generating for understanding early development in autism.
Collapse
Affiliation(s)
- Junya Chen
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States.
| | - Matthew Engelhard
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Ricardo Henao
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Samuel Berchuck
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Brian Eichner
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Eliana M Perrin
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Guillermo Sapiro
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| | - Geraldine Dawson
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27705, United States
| |
Collapse
|
9
|
Amirahmadi A, Ohlsson M, Etminani K. Deep learning prediction models based on EHR trajectories: A systematic review. J Biomed Inform 2023; 144:104430. [PMID: 37380061 DOI: 10.1016/j.jbi.2023.104430] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 06/08/2023] [Accepted: 06/17/2023] [Indexed: 06/30/2023]
Abstract
BACKGROUND Electronic health records (EHRs) are generated at an ever-increasing rate. EHR trajectories, the temporal aspect of health records, facilitate predicting patients' future health-related risks. It enables healthcare systems to increase the quality of care through early identification and primary prevention. Deep learning techniques have shown great capacity for analyzing complex data and have been successful for prediction tasks using complex EHR trajectories. This systematic review aims to analyze recent studies to identify challenges, knowledge gaps, and ongoing research directions. METHODS For this systematic review, we searched Scopus, PubMed, IEEE Xplore, and ACM databases from Jan 2016 to April 2022 using search terms centered around EHR, deep learning, and trajectories. Then the selected papers were analyzed according to publication characteristics, objectives, and their solutions regarding existing challenges, such as the model's capacity to deal with intricate data dependencies, data insufficiency, and explainability. RESULTS After removing duplicates and out-of-scope papers, 63 papers were selected, which showed rapid growth in the number of research in recent years. Predicting all diseases in the next visit and the onset of cardiovascular diseases were the most common targets. Different contextual and non-contextual representation learning methods are employed to retrieve important information from the sequence of EHR trajectories. Recurrent neural networks and the time-aware attention mechanism for modeling long-term dependencies, self-attentions, convolutional neural networks, graphs for representing inner visit relations, and attention scores for explainability were frequently used among the reviewed publications. CONCLUSIONS This systematic review demonstrated how recent breakthroughs in deep learning methods have facilitated the modeling of EHR trajectories. Research on improving the ability of graph neural networks, attention mechanisms, and cross-modal learning to analyze intricate dependencies among EHRs has shown good progress. There is a need to increase the number of publicly available EHR trajectory datasets to allow for easier comparison among different models. Also, very few developed models can handle all aspects of EHR trajectory data.
Collapse
Affiliation(s)
- Ali Amirahmadi
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden.
| | - Mattias Ohlsson
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden; Computational Biology & Biological Physics, Department of Astronomy and Theoretical Physics, Lund University, Sweden
| | - Kobra Etminani
- Center for Applied Intelligent Systems Research, Halmstad University, Sweden
| |
Collapse
|
10
|
Kim M, Noh Y, Yamada A, Hong SH. Comparison of the Erectile Dysfunction Drugs Sildenafil and Tadalafil Using Patient Medication Reviews: Topic Modeling Study. JMIR Med Inform 2022; 10:e32689. [PMID: 35225813 PMCID: PMC8922152 DOI: 10.2196/32689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 10/22/2021] [Accepted: 11/17/2021] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Topic modeling of patient medication reviews of erectile dysfunction (ED) drugs can help identify patient preferences regarding ED treatment options. The identification of a set of topics important to the patient from social network service drug reviews would inform the design of patient-centered medication counseling. OBJECTIVE This study aimed to (1) identify the distinctive topics from patient medication reviews unique to tadalafil versus sildenafil; (2) determine if the primary topics are distributed differently for each drug and for each patient characteristic (age and time on ED drug therapy); and (3) test if the primary topics affect satisfaction with ED drug therapy controlling for patient characteristics. METHODS Data were collected from the patient medication reviews of sildenafil and tadalafil posted on WebMD and Ask a Patient. The latent Dirichlet allocation method of natural language processing was used to identify 5 distinctive topics from the patient medication reviews on each drug. Analysis of variance and a 2-sample t test were conducted to compare the topic distribution and assess whether patient satisfaction varies with the primary topics, age, and time on medication for each ED drug. Statistical significance was tested at an alpha of .05. RESULTS The patient medication reviews of sildenafil (N=463) had 2 topics on treatment benefit and 1 each on medication safety, marketing claim, and treatment comparison, while the patient medication reviews of tadalafil (N=919) had 2 topics on medication safety and 1 each on the remaining subjects. Sildenafil's reviewers quite frequently (94/463, 20.4%) mentioned erection sustainability as their primary topic, whereas tadalafil's reviewers were more concerned about severe medication safety. Those who mentioned erection sustainability as their primary topic were quite satisfied with their treatment as opposed to those who mentioned severe medication safety as their primary topic (score 3.85 vs 2.44). The discovered topics reflected the marketing claims of blue magic and amber romance for sildenafil and tadalafil, respectively. The topic of blue magic was preferred among younger patients, while the topic of amber romance was preferred among older patients. The topic alternative choices, which appeared for both the ED drugs, reflected patient interest in the comparative effectiveness and price outside the drug labeling information. CONCLUSIONS The patient medication reviews of ED drugs reflect patient preferences regarding drug labeling information, marketing claims, and alternative treatment choices. The patient preferences concerning ED treatment attributes inform the design of patient-centered communication for improved ED drug therapy.
Collapse
Affiliation(s)
- Maryanne Kim
- College of Pharmacy, Seoul National University, Seoul, Republic of Korea.,Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul, Republic of Korea
| | - Youran Noh
- College of Pharmacy, Seoul National University, Seoul, Republic of Korea.,Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul, Republic of Korea
| | - Akihiko Yamada
- College of Pharmacy, Seoul National University, Seoul, Republic of Korea
| | - Song Hee Hong
- College of Pharmacy, Seoul National University, Seoul, Republic of Korea.,Research Institute of Pharmaceutical Sciences, Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
11
|
Kondratieff KE, Brown JT, Barron M, Warner JL, Yin Z. Mining Medication Use Patterns from Clinical Notes for Breast Cancer Patients Through a Two-Stage Topic Modeling Approach. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2022; 2022:303-312. [PMID: 35854740 PMCID: PMC9285151] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/01/2023]
Abstract
Obtaining medication use and response information is essential for both care providers and researchers to understand patients' medication use and long-term treatment patterns. While unstructured clinical notes contain such information, they have rarely been analyzed for this purpose on a large scale due to the demands of expensive manual reviews. Here, we aimed to extract and analyze medication use patterns from clinical notes for a population of breast cancer patients at an academic medical center using unsupervised topic modeling techniques. Notably, we proposed a two-stage modeling process that was built upon correlated topic modeling (CTM) and structural topic modeling (STM) to capture nuanced information about medication behavior, including drug-disease relationships as well as medication schedules. The STM-derived topics show longitudinal prevalence patterns that may reflect changing patient needs and behaviors after the diagnosis of a severe disease. The patterns also show promise as a predictor for medication-taking behavior.
Collapse
Affiliation(s)
| | - J Thomas Brown
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
| | - Marily Barron
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
| | - Jeremy L Warner
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
- Department of Medicine, Division of Hematology/Oncology, Vanderbilt University Medical Center, Nashville, TN USA
| | - Zhijun Yin
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN USA
- Department of Computer Science, Vanderbilt University, Nashville, TN USA
| |
Collapse
|
12
|
Meng Y, Speier W, Ong MK, Arnold CW. Bidirectional Representation Learning From Transformers Using Multimodal Electronic Health Record Data to Predict Depression. IEEE J Biomed Health Inform 2021; 25:3121-3129. [PMID: 33661740 DOI: 10.1109/jbhi.2021.3063721] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Advancements in machine learning algorithms have had a beneficial impact on representation learning, classification, and prediction models built using electronic health record (EHR) data. Effort has been put both on increasing models' overall performance as well as improving their interpretability, particularly regarding the decision-making process. In this study, we present a temporal deep learning model to perform bidirectional representation learning on EHR sequences with a transformer architecture to predict future diagnosis of depression. This model is able to aggregate five heterogenous and high-dimensional data sources from the EHR and process them in a temporal manner for chronic disease prediction at various prediction windows. We applied the current trend of pretraining and fine-tuning on EHR data to outperform the current state-of-the-art in chronic disease prediction, and to demonstrate the underlying relation between EHR codes in the sequence. The model generated the highest increases of precision-recall area under the curve (PRAUC) from 0.70 to 0.76 in depression prediction compared to the best baseline model. Furthermore, the self-attention weights in each sequence quantitatively demonstrated the inner relationship between various codes, which improved the model's interpretability. These results demonstrate the model's ability to utilize heterogeneous EHR data to predict depression while achieving high accuracy and interpretability, which may facilitate constructing clinical decision support systems in the future for chronic disease screening and early detection.
Collapse
|