1
|
Song Y, Yuan Q, Liu H, Gu K, Liu Y. Machine learning algorithms to predict mild cognitive impairment in older adults in China: A cross-sectional study. J Affect Disord 2025; 368:117-126. [PMID: 39271065 DOI: 10.1016/j.jad.2024.09.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 08/29/2024] [Accepted: 09/10/2024] [Indexed: 09/15/2024]
Abstract
OBJECTIVE This study aimed to explore the predictive value of machine learning (ML) in mild cognitive impairment (MCI) among older adults in China and to identify important factors causing MCI. METHODS In this study, 6434 older adults were selected based on the data of the China Health and Elderly Care Longitudinal Survey (CHARLS) in 2020, and the dataset was subsequently divided into the training set and the test set, with a ratio of 6:4. To construct a prediction model for MCI in older adults, six ML algorithms were used, including logistic regression, KNN, SVM, decision tree (DT), LightGBM, and random forest (RF). The Delong test was used to compare the differences of ROC curves of different models, while decision curve analysis (DCA) was used to evaluate the model performance. The important contributions of the prediction results were then used to explain the model by the SHAP value.The Matthews correlation coefficient (MCC) was calculated to evaluate the performance of the models on imbalanced datasets. Additionally, causal analysis and counterfactual analysis were conducted to understand the feature importance and variable effects. RESULTS The area under the ROC curve of each model range from 0.71 to 0.77, indicating significant difference (P < 0.01). The DCA results show that the net benefits of LightGBM is the largest within various probability thresholds. Among all the models, the LightGBM model demonstrated the highest performance and stability. The five most important characteristics for predicting MCI were educational level, social events, gender, relationship with children, and age. Causal analysis revealed that these variables had a significant impact on MCI, with an average treatment effect of -0.144. Counterfactual analysis further validated these findings by simulating different scenarios, such as improving educational level, increasing age, and increasing social events. CONCLUSION The ML algorithm can effectively predict the MCI of older adults in China and identify the important factors causing MCI.
Collapse
Affiliation(s)
- Yanliqing Song
- College of Sports, Nanjing Tech University, Nanjing, China
| | - Quan Yuan
- College of Sports, Nanjing Tech University, Nanjing, China
| | - Haoqiang Liu
- College of Sports, Nanjing Tech University, Nanjing, China
| | - KeNan Gu
- College of Sports, Nanjing Tech University, Nanjing, China
| | - Yue Liu
- School of Athletic Performance, Shanghai University of Sport, Shanghai, China.
| |
Collapse
|
2
|
Qiu Y, Ma X. Using machine learning models to identify the risk of depression in middle-aged and older adults with frequent and infrequent nicotine use: A cross-sectional study. J Affect Disord 2024; 367:554-561. [PMID: 39222853 DOI: 10.1016/j.jad.2024.08.185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 08/04/2024] [Accepted: 08/27/2024] [Indexed: 09/04/2024]
Abstract
BACKGROUND Depression is very prevalent in middle-aged and older smokers. Therefore, we aimed to identify the risk of depression among middle-aged and older adults with frequent and infrequent nicotine use, as this is quite necessary for supporting their well-being. METHODS This study included a total of 10,821 participants, which were derived from the China Health and Retirement Longitudinal Study Wave 5, 2020 (CHARLS-5). Five machine learning (ML) algorithms were employed. Some metrics were used to evaluate the performance of models, including area under the receiver operating characteristic curve (AUC), positive predictive value (PPV), specificity, accuracy. RESULTS 10,821 participants (6472 males, 4349 females) had a mean age of 60.47 ± 8.98, with a score of 8.90 ± 6.53 on depression scale. For middle-aged and older adults with frequent nicotine use, random forest (RF) achieved the highest AUC value, PPV and specificity (0.75, 0.74 and 0.88, respectively). For the other group, support vector machines (SVM) showed the highest PPV (0.74), and relatively high accuracy and specificity (0.72 and 0.87, respectively). Feature importance analysis indicated that "dissatisfaction with life" was the most important variable of identifying the risk of depression in the SVM model, while "attitude towards expected life span" was the most important one in the RF model. LIMITATIONS CHARLS-5 was collected during the COVID-19, so our results may be influenced by the pandemic. CONCLUSIONS This study indicated that certain ML models can ideally identify the risk of depression in middle-aged and older adults, which holds significant value for their health management.
Collapse
Affiliation(s)
- Yuran Qiu
- Department of Psychology, Henan University, Kaifeng, China.
| | - Xu Ma
- Beijing Anding Hospital, Capital Medical University, Beijing, China
| |
Collapse
|
3
|
Kuang Y, Liao X, Jiang Z, Gu Y, Liu B, Tan C, Zhang W, Li K. Federated learning-based prediction of depression among adolescents across multiple districts in China. J Affect Disord 2024; 369:625-632. [PMID: 39389117 DOI: 10.1016/j.jad.2024.10.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 10/03/2024] [Accepted: 10/07/2024] [Indexed: 10/12/2024]
Abstract
Depression in adolescents is a serious mental health condition that can affect their emotional and social well-being. Detailed understanding of depression patterns and status of depressive symptoms in adolescents could help identify early intervention targets. Despite the growing use of artificial intelligence for diagnosis and prediction of mental health conditions, the traditional centralized machine learning methods require aggregating adolescents' data; this raises concerns about confidentiality and privacy, which hampers the clinical application of machine learning algorithms. In this study, we use federated learning to solve those problems. We included 583,405 middle and high school adolescents from 20 districts in Chengdu China, and collected from three aspects: individuals, families, and schools, containing 11 psychological phenomena to evaluate the status of depressive symptoms. We compared federated and local training frameworks; the results showed the area under the receiver operating characteristic curve for depression increased by up to 20 % (from 0.7544 with local training to 0.9064 with federated training). Moreover, based on the best-performing model, the XGBoost model, we explore the data heterogeneity in federated learning and found that stress, student burnout, and social connection were the three most important predictors of depression symptoms. We then assessed the impact of each subdimension of depression symptoms, results show that sleep was the most impact one which may provide clues to predict depression symptoms in early stage and improve control and prevention efforts.
Collapse
Affiliation(s)
- Yalan Kuang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; Med-X Center for Informatics, Sichuan University, Chengdu 610041, China
| | - Xiao Liao
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; Med-X Center for Informatics, Sichuan University, Chengdu 610041, China; Mental Health Center, West China Hospital, Sichuan University, Chengdu 610041, China
| | - Zekun Jiang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; College of Computer Science, Sichuan University, Chengdu 610041, China
| | - Yonghong Gu
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; Med-X Center for Informatics, Sichuan University, Chengdu 610041, China
| | - Bo Liu
- School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China
| | | | - Wei Zhang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; Med-X Center for Informatics, Sichuan University, Chengdu 610041, China; Mental Health Center, West China Hospital, Sichuan University, Chengdu 610041, China.
| | - Kang Li
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu 610041, China; Med-X Center for Informatics, Sichuan University, Chengdu 610041, China.
| |
Collapse
|
4
|
Zhao J, Li S, Zhang N, Cui C, Wang T, Fan M, Zeng J, Xie Y. Felt stigma and associated factors in children and adolescents with epilepsy: a multicenter cross-sectional study in China. Front Neurol 2024; 15:1459392. [PMID: 39206293 PMCID: PMC11349658 DOI: 10.3389/fneur.2024.1459392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024] Open
Abstract
Objective Epilepsy-related stigma is a global problem, yet there has been an inadequate focus on children and adolescents. The purpose of this study was to determine the status quo of stigma and its determinants among children and adolescents with epilepsy in China. Methods A multicenter cross-sectional study was conducted across nine hospitals in eight cities within six provinces in China from 10 October 2023 to 15 June 2024. Participants included patients aged 8 to 18 years with epilepsy and their caregivers. Felt stigma was assessed with the Kilifi Stigma Scale for Epilepsy (KSSE). Social support and self-efficacy were collected through the Social Support Rating Scale (SSRS) and the Generalized Self-Efficacy Scale (GSES). The data were analyzed using t-tests, analysis of variance (ANOVA), Spearman correlation analysis, and multiple linear regression analysis. Results The study enrolled 281 children and adolescents, with a mean age of 12.25 years (SD = 2.56), including 46.6% females. A total of 35.6% participants had self-reported felt stigma. The mean KSSE score is 9.58 (SD = 7.11). Meanwhile, stigma scores correlated strongly with reduced social support (r = -0.55, p < 0.01) and self-efficacy (r = -0.43, p < 0.01). Place of residence (rural vs. non-rural), academic performance (average and above vs. fair or poor), region (western region vs. non-western region), duration of epilepsy (≤5 years vs. >5 years), drug-resistant epilepsy (yes vs. no), comorbidities (yes vs. no), social support and self-efficacy are major influencing factors among the complex factors influencing the felt stigma among children and adolescents. Conclusion Medical staff should be more aware of stigma among children and adolescents with epilepsy, especially those who live in rural and western areas, have poor academic performance, have epilepsy duration of more than 5 years, have drug-resistant epilepsy, and have comorbidities, who are at higher risk of stigma. It is recommended that effective measures be taken to alleviate stigma by improving children and adolescents' self-efficacy and providing more social support for them and their families.
Collapse
Affiliation(s)
- Jing Zhao
- Department of Nursing, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| | - Shuangzi Li
- Department of Neurology, Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Ni Zhang
- Department of Traditional Chinese Medicine, Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Cui Cui
- Department of Nursing, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| | - Ting Wang
- Department of Neurology, Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Mingping Fan
- Department of Neurology, Children’s Hospital of Chongqing Medical University, Chongqing, China
| | - Junqi Zeng
- Department of Nursing, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| | - Yuan Xie
- Department of Nursing, Children’s Hospital of Chongqing Medical University, National Clinical Research Center for Child Health and Disorders, Ministry of Education Key Laboratory of Child Development and Disorders, Chongqing Key Laboratory of Pediatrics, Chongqing, China
| |
Collapse
|
5
|
Tabares Tabares M, Vélez Álvarez C, Bernal Salcedo J, Murillo Rendón S. Anxiety in young people: Analysis from a machine learning model. Acta Psychol (Amst) 2024; 248:104410. [PMID: 39032273 DOI: 10.1016/j.actpsy.2024.104410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 07/12/2024] [Accepted: 07/15/2024] [Indexed: 07/23/2024] Open
Abstract
The study addresses the detection of anxiety symptoms in young people using artificial intelligence models. Questionnaires such as the Patient Health Questionnaire-9 (PHQ-9) and Generalized Anxiety Disorder 7-item scale (GAD-7) are used to collect data, with a focus on early detection of anxiety. Three machine learning models are employed: Support Vector Machine (SVM), K Nearest Neighbors (KNN), and Random Forest (RF), with cross-validation to assess their effectiveness. Results show that the RF model is the most efficient, with an accuracy of 91 %, surpassing previous studies. Significant predictors of anxiety are identified, such as parental education level, alcohol consumption, and social security affiliation. A relationship is observed between anxiety and personal and family history of mental illness, as well as with characteristics external to the model, such as family and personal history of depression. The analysis of the results highlights the importance of considering not only clinical but also social and family aspects in mental health interventions. It is suggested that the sample size be expanded in future studies to improve the robustness of the model. In summary, the study demonstrates the usefulness of artificial intelligence in the early detection of anxiety in young people and highlights the relevance of addressing multidimensional factors in the assessment and treatment of this condition.
Collapse
Affiliation(s)
| | - Consuelo Vélez Álvarez
- Grupo Promoción de la Salud y Prevención de la Enfermedad, Universidad de Caldas, Colombia.
| | | | - Santiago Murillo Rendón
- Grupo Inteligencia Artificial, Universidad de Caldas, Colombia; Grupo Ingeniería de Software, Universidad Autónoma de Manizales, Colombia.
| |
Collapse
|
6
|
Wei Z, Wang X, Liu C, Feng Y, Gan Y, Shi Y, Wang X, Liu Y, Deng Y. Microstate-based brain network dynamics distinguishing temporal lobe epilepsy patients: A machine learning approach. Neuroimage 2024; 296:120683. [PMID: 38880308 DOI: 10.1016/j.neuroimage.2024.120683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 06/02/2024] [Accepted: 06/10/2024] [Indexed: 06/18/2024] Open
Abstract
Temporal lobe epilepsy (TLE) stands as the predominant adult focal epilepsy syndrome, characterized by dysfunctional intrinsic brain dynamics. However, the precise mechanisms underlying seizures in these patients remain elusive. Our study encompassed 116 TLE patients compared with 51 healthy controls. Employing microstate analysis, we assessed brain dynamic disparities between TLE patients and healthy controls, as well as between drug-resistant epilepsy (DRE) and drug-sensitive epilepsy (DSE) patients. We constructed dynamic functional connectivity networks based on microstates and quantified their spatial and temporal variability. Utilizing these brain network features, we developed machine learning models to discriminate between TLE patients and healthy controls, and between DRE and DSE patients. Temporal dynamics in TLE patients exhibited significant acceleration compared to healthy controls, along with heightened synchronization and instability in brain networks. Moreover, DRE patients displayed notably lower spatial variability in certain parts of microstate B, E and F dynamic functional connectivity networks, while temporal variability in certain parts of microstate E and G dynamic functional connectivity networks was markedly higher in DRE patients compared to DSE patients. The machine learning model based on these spatiotemporal metrics effectively differentiated TLE patients from healthy controls and discerned DRE from DSE patients. The accelerated microstate dynamics and disrupted microstate sequences observed in TLE patients mirror highly unstable intrinsic brain dynamics, potentially underlying abnormal discharges. Additionally, the presence of highly synchronized and unstable activities in brain networks of DRE patients signifies the establishment of stable epileptogenic networks, contributing to the poor responsiveness to antiseizure medications. The model based on spatiotemporal metrics demonstrated robust predictive performance, accurately distinguishing both TLE patients from healthy controls and DRE patients from DSE patients.
Collapse
Affiliation(s)
- Zihan Wei
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China
| | - Xinpei Wang
- School of Aerospace Medicine, Fourth Military Medical University, Xi'an, China
| | - Chao Liu
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China
| | - Yan Feng
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China; Xi'an Medical University, Xi'an 710021, PR China
| | - Yajing Gan
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China
| | - Yuqing Shi
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China; Xi'an Medical University, Xi'an 710021, PR China
| | - Xiaoli Wang
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China
| | - Yonghong Liu
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China
| | - Yanchun Deng
- Department of Neurology, Xijing Hospital, Fourth Military Medical University, 127 West Changle Road, Xi'an 710032, PR China.
| |
Collapse
|
7
|
Pan Y, Zhang X, Wen X, Yuan N, Guo L, Shi Y, Jia Y, Guo Y, Hao F, Qu S, Chen Z, Yang L, Wang X, Liu Y. Development and validation of a machine learning model for prediction of comorbid major depression disorder among narcolepsy type 1. Sleep Med 2024; 119:556-564. [PMID: 38810481 DOI: 10.1016/j.sleep.2024.05.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 05/04/2024] [Accepted: 05/22/2024] [Indexed: 05/31/2024]
Abstract
BACKGROUND Major depression disorder (MDD) forms a common psychiatric comorbidity among patients with narcolepsy type 1 (NT1), yet its impact on patients with NT1 is often overlooked by neurologists. Currently, there is a lack of effective methods for accurately predicting MDD in patients with NT1. OBJECTIVE This study utilized machine learning (ML) algorithms to identify critical variables and developed the prediction model for predicting MDD in patients with NT1. METHODS The study included 267 NT1 patients from four sleep centers. The diagnosis of comorbid MDD was based on Diagnostic and Statistical Manual of Mental Disorders fifth edition (DSM-5). ML models, including six full models and six compact models, were developed using a training set. The performance of these models was compared in the testing set, and the optimal model was evaluated in the testing set. Various evaluation metrics, such as Area under the receiver operating curve (AUC), precision-recall (PR) curve and calibration curve were employed to assess and compare the performance of the ML models. Model interpretability was demonstrated using SHAP. RESULT In the testing set, the logistic regression (LG) model demonstrated superior performance compared to other ML models based on evaluation metrics such as AUC, PR curve, and calibration curve. The top eight features used in the LG model, ranked by feature importance, included social impact scale (SIS) score, narcolepsy severity scale (NSS) score, total sleep time, body mass index (BMI), education years, age of onset, sleep efficiency, sleep latency. CONCLUSION The study yielded a straightforward and practical ML model for the early identification of MDD in patients with NT1. A web-based tool for clinical applications was developed, which deserves further verification in diverse clinical settings.
Collapse
Affiliation(s)
- Yuanhang Pan
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Xinbo Zhang
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Xinyu Wen
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Na Yuan
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Li Guo
- Department of Psychiatry, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Yifan Shi
- Department of Psychiatry, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Yuanyuan Jia
- Encerebropathy Department, No.2, Baoji Hospital of Traditional Chinese Medicine, Baoji, PR China.
| | - Yanzhao Guo
- Encerebropathy Department, No.10, Xi'an Hospital of Traditional Chinese Medicine, Xi'an, PR China.
| | - Fengli Hao
- Department of Neurology, Xi'an Daxing Hospital, Xi'an, PR China.
| | - Shuyi Qu
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Ze Chen
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Lei Yang
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Xiaoli Wang
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| | - Yonghong Liu
- Department of Neurology, Xijing Hospital, Air Force Medical University, Xi'an, PR China.
| |
Collapse
|
8
|
Yin X, Niu S, Yu Q, Xuan Y, Feng X. Fear of disease in patients with epilepsy - a network analysis. Front Neurol 2024; 15:1285744. [PMID: 38515450 PMCID: PMC10954812 DOI: 10.3389/fneur.2024.1285744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Accepted: 02/27/2024] [Indexed: 03/23/2024] Open
Abstract
Background Disease-related fear among patients with epilepsy has significantly impacted their quality of life. The Disease-Related Fear Scale (D-RFS), comprising three dimensions, serves as a relatively well-established tool for assessing fear in these patients. However, certain problems potentially exist within the D-RFS's attribution of items, and its internal structure is still unclear. To establish an appropriate dimensional structure and gain deeper comprehension of its internal structure-particularly its core variables-is vital for developing more effective interventions aimed at alleviating disease-related fear among patients with epilepsy. Methods This study employed a cross-sectional survey involving 609 patients with epilepsy. All participants underwent assessment using the Chinese version of the D-RFS. We used exploratory network analysis to discover a new structure and network analysis to investigate the interrelationships among fear symptom domains. In addition to the regularized partial correlation network, we also estimated the node and bridge centrality index to identify the importance of each item within the network. Finally, it was applied to analyze the differences in network analysis outcomes among epilepsy patients with different seizure frequencies. Results The research findings indicate that nodes within the network of disease-related fear symptoms are interconnected, and there are no isolated nodes. Nodes within groups 3 and 4 present the strongest centrality. Additionally, a tight interconnection exists among fear symptoms within each group. Moreover, the frequency of epileptic episodes does not significantly impact the network structure. Conclusion In this study, a new 5-dimension structure was constructed for D-RFS, and the fear of disease in patients with epilepsy has been conceptualized through a network perspective. The goal is to identify potential targets for relevant interventions and gain insights for future research.
Collapse
Affiliation(s)
| | | | | | | | - Xiuqin Feng
- Nursing Department, The Second Affiliated Hospital Zhejiang University School of Medicine (SAHZU), Hangzhou, China
| |
Collapse
|
9
|
Feng W, Wu H, Ma H, Tao Z, Xu M, Zhang X, Lu S, Wan C, Liu Y. Applying contrastive pre-training for depression and anxiety risk prediction in type 2 diabetes patients based on heterogeneous electronic health records: a primary healthcare case study. J Am Med Inform Assoc 2024; 31:445-455. [PMID: 38062850 PMCID: PMC10797279 DOI: 10.1093/jamia/ocad228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 11/13/2023] [Accepted: 11/21/2023] [Indexed: 01/22/2024] Open
Abstract
OBJECTIVE Due to heterogeneity and limited medical data in primary healthcare services (PHS), assessing the psychological risk of type 2 diabetes mellitus (T2DM) patients in PHS is difficult. Using unsupervised contrastive pre-training, we proposed a deep learning framework named depression and anxiety prediction (DAP) to predict depression and anxiety in T2DM patients. MATERIALS AND METHODS The DAP model consists of two sub-models. Firstly, the pre-trained model of DAP used unlabeled discharge records of 85 085 T2DM patients from the First Affiliated Hospital of Nanjing Medical University for unsupervised contrastive learning on heterogeneous electronic health records (EHRs). Secondly, the fine-tuned model of DAP used case-control cohorts (17 491 patients) selected from 149 596 T2DM patients' EHRs in the Nanjing Health Information Platform (NHIP). The DAP model was validated in 1028 patients from PHS in NHIP. Evaluation included receiver operating characteristic area under the curve (ROC-AUC) and precision-recall area under the curve (PR-AUC), and decision curve analysis (DCA). RESULTS The pre-training step allowed the DAP model to converge at a faster rate. The fine-tuned DAP model significantly outperformed the baseline models (logistic regression, extreme gradient boosting, and random forest) with ROC-AUC of 0.91±0.028 and PR-AUC of 0.80±0.067 in 10-fold internal validation, and with ROC-AUC of 0.75 ± 0.045 and PR-AUC of 0.47 ± 0.081 in external validation. The DCA indicate the clinical potential of the DAP model. CONCLUSION The DAP model effectively predicted post-discharge depression and anxiety in T2DM patients from PHS, reducing data fragmentation and limitations. This study highlights the DAP model's potential for early detection and intervention in depression and anxiety, improving outcomes for diabetes patients.
Collapse
Affiliation(s)
- Wei Feng
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Honghan Wu
- Institute of Health Informatics, University College London, London, WC1E 6BT, United Kingdom
- The Alan Turing Institute, London, NW1 2DB, United Kingdom
| | - Hui Ma
- Department of Medical Psychology, Nanjing Brain Hospital affiliated with Nanjing Medical University, Nanjing, Jiangsu, 210024, China
| | - Zhenhuan Tao
- Department of Planning, Nanjing Health Information Center, Nanjing, Jiangsu, 210003, China
| | - Mengdie Xu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Xin Zhang
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| | - Shan Lu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| | - Cheng Wan
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
| | - Yun Liu
- Department of Medical Informatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu, 210009, China
- Department of Information, The First Affiliated Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China
| |
Collapse
|