1
|
Tay JL, Ang YL, Tam WWS, Sim K. Accuracy of machine learning methods in predicting prognosis of patients with psychotic spectrum disorders: a systematic review. BMJ Open 2025; 15:e084463. [PMID: 40000074 DOI: 10.1136/bmjopen-2024-084463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/27/2025] Open
Abstract
OBJECTIVES We aimed to examine the predictive accuracy of functioning, relapse or remission among patients with psychotic disorders, using machine learning methods. We also identified specific features that were associated with these clinical outcomes. DESIGN The methodology of this review was guided by the Cochrane Handbook for Systematic Reviews of Diagnostic Test Accuracy. DATA SOURCES CINAHL, EMBASE, PubMed, PsycINFO, Scopus and ScienceDirect were searched for relevant articles from database inception until 21 November 2024. ELIGIBILITY CRITERIA Studies were included if they involved the use of machine learning methods to predict functioning, relapse and/or remission among individuals with psychotic spectrum disorders. DATA EXTRACTION AND SYNTHESIS Two independent reviewers screened the records from the database search. Risk of bias was evaluated using the Quality Assessment of Diagnostic Accuracy Studies tool from Cochrane. Synthesised findings were presented in tables. RESULTS 23 studies were included in the review, which were mostly conducted in the west (91%). Predictive summary area under the curve values for functioning, relapse and remission were 0.63-0.92 (poor to outstanding), 0.45-0.95 (poor to outstanding), 0.70-0.79 (acceptable), respectively. Logistic regression and random forest were the best performing algorithms. Factors influencing outcomes included demographic (age, ethnicity), illness (duration of untreated illness, types of symptoms), functioning (baseline functioning, interpersonal relationships and activity engagement), treatment variables (use of higher doses of antipsychotics, electroconvulsive therapy), data from passive sensor (call log, distance travelled, time spent in certain locations) and online activities (time of use, use of certain words, changes in search frequencies and length of queries). CONCLUSION Machine learning methods show promise in the prediction of prognosis (specifically functioning, relapse and remission) of mental disorders based on relevant collected variables. Future machine learning studies may want to focus on the inclusion of a broader swathe of variables including ecological momentary assessments, with a greater amount of good quality big data covering longer longitudinal illness courses and coupled with external validation of study findings. PROSPERO REGISTRATION NUMBER The review was registered on PROSPERO, ID: CRD42023441108.
Collapse
Affiliation(s)
| | - Yun Ling Ang
- Department of Nursing, Institute of Mental Health, Singapore
| | - Wilson W S Tam
- Alice Lee Centre for Nursing Studies, Yong Loo Lin School of Medicine, National University of Singapore, Singapore
| | - Kang Sim
- West Region, Institute of Mental Health, Singapore
- Yong Loo Lin School of Medicine, National University of Singapore, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore
| |
Collapse
|
2
|
Mardini MT, Khalil GE, Bai C, DivaKaran AM, Ray JM. Identifying Adolescent Depression and Anxiety Through Real-World Data and Social Determinants of Health: Machine Learning Model Development and Validation. JMIR Ment Health 2025; 12:e66665. [PMID: 39937988 PMCID: PMC11838812 DOI: 10.2196/66665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/19/2024] [Revised: 12/30/2024] [Accepted: 12/31/2024] [Indexed: 02/14/2025] Open
Abstract
Background The prevalence of adolescent mental health conditions such as depression and anxiety has significantly increased. Despite the potential of machine learning (ML), there is a shortage of models that use real-world data (RWD) to enhance early detection and intervention for these conditions. Objective This study aimed to identify depression and anxiety in adolescents using ML techniques on RWD and social determinants of health (SDoH). Methods We analyzed RWD of adolescents aged 10-17 years, considering various factors such as demographics, prior diagnoses, prescribed medications, medical procedures, and laboratory measurements recorded before the onset of anxiety or depression. Clinical data were linked with SDoH at the block-level. Three separate models were developed to predict anxiety, depression, and both conditions. Our ML model of choice was Extreme Gradient Boosting (XGBoost) and we evaluated its performance using the nested cross-validation technique. To interpret the model predictions, we used the Shapley additive explanation method. Results Our cohort included 52,054 adolescents, identifying 12,572 with anxiety, 7812 with depression, and 14,019 with either condition. The models achieved area under the curve values of 0.80 for anxiety, 0.81 for depression, and 0.78 for both combined. Excluding SDoH data had a minimal impact on model performance. Shapley additive explanation analysis identified gender, race, educational attainment, and various medical factors as key predictors of anxiety and depression. Conclusions This study highlights the potential of ML in early identification of depression and anxiety in adolescents using RWD. By leveraging RWD, health care providers may more precisely identify at-risk adolescents and intervene earlier, potentially leading to improved mental health outcomes.
Collapse
Affiliation(s)
- Mamoun T Mardini
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, 7th Floor, Suite 7000, 1889 Museum Rd, Gainesville, FL, 32611, United States, 1 7049045847
| | - Georges E Khalil
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, 7th Floor, Suite 7000, 1889 Museum Rd, Gainesville, FL, 32611, United States, 1 7049045847
| | - Chen Bai
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, 7th Floor, Suite 7000, 1889 Museum Rd, Gainesville, FL, 32611, United States, 1 7049045847
| | - Aparna Menon DivaKaran
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, 7th Floor, Suite 7000, 1889 Museum Rd, Gainesville, FL, 32611, United States, 1 7049045847
| | - Jessica M Ray
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, 7th Floor, Suite 7000, 1889 Museum Rd, Gainesville, FL, 32611, United States, 1 7049045847
| |
Collapse
|
3
|
Liang Y, Gu X, Shi Y, Fang Y, Wu Z, Li X. Electrophysiological biomarkers based on CISANET characterize illness severity and suicidal ideation among patients with major depressive disorder. Med Biol Eng Comput 2025:10.1007/s11517-024-03279-6. [PMID: 39849234 DOI: 10.1007/s11517-024-03279-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 12/25/2024] [Indexed: 01/25/2025]
Abstract
Major depressive disorder (MDD) is a significant neurological disorder that imposes a substantial burden on society, characterized by its high recurrence rate and associated suicide risk. Clinical diagnosis, which relies on interviews with psychiatrists and questionnaires used as auxiliary diagnostic tools, lacks precision and objectivity in diagnosing MDD. To address these challenges, this study proposes an assessment method based on EEG. It involves calculating the phase lag index (PLI) in alpha and gamma bands to construct functional brain connectivity. This method aims to find biomarkers to assess the severity of MDD and suicidal ideation. The convolutional inception with shuffled attention network (CISANET) was introduced for this purpose. The study included 61 patients with MDD, who were classified into mild, moderate, and severe levels based on depression scales, and the presence of suicidal ideation was evaluated. Two paradigms were designed for the study, with EEG analysis focusing on 32 selected electrodes to extract alpha and gamma bands. In the gamma band, the classification accuracy reached 77.37% in the visual paradigm and 80.12% in the auditory paradigm. The average accuracy in classifying suicidal ideation was 93.60%. The findings suggest that gamma bands can be used as potential biomarkers differentiating illness severity and identifying suicidal ideation of MDD, and that objective assessment methods can effectively assess MDD The objective assessment method can effectively assess the severity of MDD and identify suicidal ideation of MDD patients, which provides a valuable theoretical basis for understanding the biological characteristics of MDD.
Collapse
Affiliation(s)
- Yuchen Liang
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China
- College of Medical Instruments, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Xuelin Gu
- College of Medical Instruments, Shanghai University of Medicine and Health Sciences, Shanghai, China
| | - Yifan Shi
- Department of Psychiatry, Xijing Hospital, the Fourth Military Medical University, Xi'an, China
| | - Yiru Fang
- Department of Psychiatry & Affective Disorders Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
- Division of Mood Disorders, Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai, China.
- CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, China.
| | - Zhiguo Wu
- Shanghai Yangpu Mental Health Center, Shanghai, 200093, China.
- Department of Psychological Medicine, Renji Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China.
| | - Xiaoou Li
- School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai, China.
- College of Medical Instruments, Shanghai University of Medicine and Health Sciences, Shanghai, China.
- Shanghai Yangpu Mental Health Center, Shanghai, 200093, China.
| |
Collapse
|
4
|
Lee KS, Ham BJ. Graph Machine Learning With Systematic Hyper-Parameter Selection on Hidden Networks and Mental Health Conditions in the Middle-Aged and Old. Psychiatry Investig 2024; 21:1382-1390. [PMID: 39757816 PMCID: PMC11704800 DOI: 10.30773/pi.2024.0249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 09/20/2024] [Accepted: 10/02/2024] [Indexed: 01/07/2025] Open
Abstract
OBJECTIVE It takes significant time and energy to collect data on explicit networks. This study used graph machine learning to identify hidden networks and predict mental health conditions in the middle-aged and old. METHODS Data came from the Korean Longitudinal Study of Ageing (2016-2018), with 2,000 participants aged 56 or more. The dependent variable was mental disease (no vs. yes) in 2018. Twenty-eight predictors in 2016 were included. Graph machine learning with systematic hyper-parameter selection was conducted. RESULTS The area under the curve was similar across different models in different scenarios. However, sensitivity (93%) was highest for the graph random forest in the scenario of 2,000 participants and the centrality requirement of life satisfaction 90. Based on the graph random forest, top-10 determinants of mental disease were mental disease in previous period (2016), age, income, life satisfaction-health, life satisfaction-overall, subjective health, body mass index, life satisfaction-economic, children alive and health insurance. Especially, life satisfaction-overall was a top-5 determinant in the graph random forest, which considers life satisfaction as an emotional connection and a group interaction. CONCLUSION Improving an individual's life satisfaction as a personal condition is expected to strengthen the individual's emotional connection as a group interaction, which would reduce the risk of the individual's mental disease in the end. This would bring an important clinical implication for highlighting the importance of a patient's life satisfaction and emotional connection regarding the diagnosis and management of the patient's mental disease.
Collapse
Affiliation(s)
- Kwang-Sig Lee
- AI Center, Korea University College of Medicine & Anam Hospital, Seoul, Republic of Korea
| | - Byung-Joo Ham
- Department of Psychiatry, Korea University College of Medicine & Anam Hospital, Seoul, Republic of Korea
| |
Collapse
|
5
|
Park JS, Lee KS, Heo JS, Ahn KH. Clinical and dental predictors of preterm birth using machine learning methods: the MOHEPI study. Sci Rep 2024; 14:24664. [PMID: 39433922 PMCID: PMC11494142 DOI: 10.1038/s41598-024-75684-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 10/07/2024] [Indexed: 10/23/2024] Open
Abstract
Preterm birth (PTB) is one of the most common and serious complications of pregnancy, leading to mortality and severe morbidities that can impact lifelong health. PTB could be associated with various maternal medical condition and dental status including periodontitis. The purpose of this study was to identify major predictors of PTB among clinical and dental variables using machine learning methods. Prospective cohort data were obtained from 60 women who delivered singleton births via cesarean section (30 PTB, 30 full-term birth [FTB]). Dependent variables were PTB and spontaneous PTB (SPTB). 15 independent variables (10 clinical and 5 dental factors) were selected for inclusion in the machine learning analysis. Random forest (RF) variable importance was used to identify the major predictors of PTB and SPTB. Shapley additive explanation (SHAP) values were calculated to analyze the directions of the associations between the predictors and PTB/SPTB. Major predictors of PTB identified by RF variable importance included pre-pregnancy body mass index (BMI), modified gingival index (MGI), preeclampsia, decayed missing filled teeth (DMFT) index, and maternal age as in top five rankings. SHAP values revealed positive correlations between PTB/SPTB and its major predictors such as premature rupture of the membranes, pre-pregnancy BMI, maternal age, and MGI. The positive correlations between these predictors and PTB emphasize the need for integrated medical and dental care during pregnancy. Future research should focus on validating these predictors in larger populations and exploring interventions to mitigate these risk factors.
Collapse
Affiliation(s)
- Jung Soo Park
- Department of Periodontology, Korea University Anam Hospital, Seoul, Republic of Korea
| | - Kwang-Sig Lee
- Center for Artificial Intelligence, Korea University College of Medicine, Seoul, Republic of Korea
| | - Ju Sun Heo
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea.
- Department of Pediatrics, Seoul National University Children's Hospital, 101 Daehak-Ro, Jongno-Gu, Seoul, 110-769, South Korea.
- Department of Pediatrics, Korea University College of Medicine, Seoul, Republic of Korea.
| | - Ki Hoon Ahn
- Department of Obstetrics and Gynecology, Gynecology, Korea University College of Medicine, Korea University Anam Hospital, 73 Goryeodae-Ro, Seongbuk-Gu, Seoul, 02841, Korea.
| |
Collapse
|
6
|
Mumenin N, Kabir Hossain ABM, Hossain MA, Debnath PP, Nusrat Della M, Hasan Rashed MM, Hossen A, Basar MR, Hossain MS. Screening depression among university students utilizing GHQ-12 and machine learning. Heliyon 2024; 10:e37182. [PMID: 39296063 PMCID: PMC11409111 DOI: 10.1016/j.heliyon.2024.e37182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 08/22/2024] [Accepted: 08/28/2024] [Indexed: 09/21/2024] Open
Abstract
The escalating incidence of depression has brought attention to the increasing concern regarding the mental well-being of university students in the current academic environment. Given the increasing mental health challenges faced by students, there is a critical need for efficient, scalable, and accurate screening methods. This study aims to address the issue by using the General Health Questionnaire-12 (GHQ-12), a well recognized tool for evaluating psychological discomfort, in combination with machine learning (ML) techniques. Firstly, for effective screening of depression, a comprehensive questionnaire has been created with the help of an expert psychiatrist. The questionnaire includes the GHQ-12, socio-demographic, and job and career-related inquiries. A total of 804 responses has been collected from various public and private universities across Bangladesh. The data has been then analyzed and preprocessed. It has been found that around 60% of the study population are suffering from depression. Lastly, 16 different ML models, including both traditional algorithms and ensemble methods has been applied to examine the data to identify trends and predictors of depression in this demographic. The models' performance has been rigorously evaluated in order to ascertain their effectiveness in precisely identifying individuals who are at risk. Among the ML models, Extremely Randomized Tree (ET) has achieved the highest accuracy of 90.26%, showcasing its classification effectiveness. A thorough investigation of the performance of the models compared, therefore clarifying their possible relevance in the early detection of depression among university students, has been presented in this paper. The findings shed light on the complex interplay among socio-demographic variables, stressors associated with one's profession, and mental well-being, which offer an original viewpoint on utilizing ML in psychological research.
Collapse
Affiliation(s)
- Nasirul Mumenin
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| | - A B M Kabir Hossain
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| | - Md Arafat Hossain
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| | | | | | | | - Afzal Hossen
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| | - Md Rubel Basar
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| | - Md Sejan Hossain
- Bangladesh Army University of Engineering and Technology, Rajshahi, Bangladesh
| |
Collapse
|
7
|
Park H, Kim K, Moon E, Lim HJ, Suh H, Kim KE, Kang T. Exploration of a Machine Learning Model Using Self-rating Questionnaires for Detecting Depression in Patients with Breast Cancer. CLINICAL PSYCHOPHARMACOLOGY AND NEUROSCIENCE : THE OFFICIAL SCIENTIFIC JOURNAL OF THE KOREAN COLLEGE OF NEUROPSYCHOPHARMACOLOGY 2024; 22:466-472. [PMID: 39069686 PMCID: PMC11289607 DOI: 10.9758/cpn.23.1147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 02/07/2024] [Accepted: 02/26/2024] [Indexed: 07/30/2024]
Abstract
Objective Given the long-term and severe distress experienced during breast cancer treatment, detecting depression among breast cancer patients is clinically crucial. This study aimed to explore a machine-learning model using self-report questionnaires to screen for depression in patients with breast cancer. Methods A total of 327 patients who visited the breast cancer clinic were included in this study. Depressive symptoms were measured using the Patient Health Questionnaire-9 (PHQ-9), Beck Depression Inventory (BDI), and Hospital Anxiety and Depression Scale (HADS). The depression was evaluated according to the Diagnostic and Statistical Manual of Mental Disorders 5th edition. The prediction model's performance based on supervised machine learning was conducted using MATLAB2022. Results The BDI showed an area under the curve (AUC) of 0.785 when using the logistic regression (LR) classifier. The HADS and PHQ-9 showed an AUC of 0.784 and 0.756 when using the linear discriminant analysis, respectively. The combinations of BDI and HADS showed an AUC of 0.812 when using the LR. The combinations of PHQ-9, BDI, and HADS showed an AUC of 0.807 when using LR. Conclusion The combination model with BDI and HADS in breast cancer patients might be better than the method using a single scale. In future studies, it is necessary to explore strategies that can improve the performance of the model by integrating the method using questionnaires and other methods.
Collapse
Affiliation(s)
- Heeseung Park
- Breast Cancer Clinic of Busan Cancer Center, Pusan National University Hospital, Busan, Korea
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Surgery, Pusan National University School of Medicine, Busan, Korea
| | - Kyungwon Kim
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University School of Medicine, Yangsan, Korea
| | - Eunsoo Moon
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University School of Medicine, Yangsan, Korea
| | - Hyun Ju Lim
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University Hospital, Busan, Korea
- Department of Psychology, Gyeoungsang National University, Jinju, Korea
| | - Hwagyu Suh
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University Hospital, Busan, Korea
- Department of Psychiatry, Pusan National University School of Medicine, Yangsan, Korea
| | - Kyoung-Eun Kim
- Breast Cancer Clinic of Busan Cancer Center, Pusan National University Hospital, Busan, Korea
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
| | - Taewoo Kang
- Breast Cancer Clinic of Busan Cancer Center, Pusan National University Hospital, Busan, Korea
- Biomedical Research Institute, Pusan National University Hospital, Busan, Korea
- Department of Surgery, Pusan National University School of Medicine, Busan, Korea
| |
Collapse
|
8
|
Tang B, Ellis RJ, Vaida F, Umlauf A, Franklin DR, Dastgheyb R, Rubin LH, Riggs PK, Iudicello JE, Clifford DB, Moore DJ, Heaton RK, Letendre SL. Biopsychosocial phenotypes in people with HIV in the CHARTER cohort. Brain Commun 2024; 6:fcae224. [PMID: 39077377 PMCID: PMC11285184 DOI: 10.1093/braincomms/fcae224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 05/22/2024] [Accepted: 07/23/2024] [Indexed: 07/31/2024] Open
Abstract
Neuropsychiatric complications such as neurocognitive impairment and depression are common in people with HIV despite viral suppression on antiretroviral therapy, but these conditions are heterogeneous in their clinical presentations and associated disability. Identifying novel biopsychosocial phenotypes that account for neurocognitive performance and depressive and functional symptoms will better reflect the complexities encountered in clinical practice and may have pathological and therapeutic implications. We classified 1580 people with HIV based on 17 features, including 7 cognitive domains, 4 subscales of the Beck depression inventory-II, 5 components of the patient's assessment of own functioning inventory, and dependence in instrumental and basic activities of daily living. A two-stage clustering procedure consisting of dimension reduction with self-organizing maps and Mahalanobis distance-based k-means clustering algorithms was applied to cross-sectional data. Baseline demographic and clinical characteristics were compared between the phenotypes, and their prediction on the biopsychosocial phenotypes was evaluated using multinomial logistic regression. Four distinct phenotypes were identified. Participants in Phenotype 1 overall did well in all domains. Phenotype 2 had mild-to-moderate depressive symptoms and the most substance use disorders. Phenotype 3 had mild-to-moderate cognitive impairment, moderate depressive symptoms, and the worst daily functioning; they also had the highest proportion of females and non-HIV conditions that could affect cognition. Phenotype 4 had mild-to-moderate cognitive impairment but with relatively good mood, and daily functioning. Multivariable analysis showed that demographic characteristics, medical conditions, lifetime cocaine use disorder, triglycerides, and non-antiretroviral therapy medications were important variables associated with biopsychosocial phenotype. We found complex, multidimensional biopsychosocial profiles in people with HIV that were associated with different risk patterns. Future longitudinal work should determine the stability of these phenotypes, assess factors that influence transitions from one phenotype to another, and characterize their biological associations.
Collapse
Affiliation(s)
- Bin Tang
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Ronald J Ellis
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
- Department of Neurosciences, University of California San Diego, San Diego, CA 92093, USA
| | - Florin Vaida
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Anya Umlauf
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Donald R Franklin
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Raha Dastgheyb
- Department of Neurology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Leah H Rubin
- Department of Neurology, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Patricia K Riggs
- Department of Medicine, University of California San Diego, San Diego, CA 92093, USA
| | - Jennifer E Iudicello
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - David B Clifford
- Department of Neurology, Washington University at St. Louis, St. Louis, MO 63110, USA
| | - David J Moore
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Robert K Heaton
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
| | - Scott L Letendre
- Department of Psychiatry, University of California San Diego, San Diego, CA 92093, USA
- Department of Medicine, University of California San Diego, San Diego, CA 92093, USA
| |
Collapse
|
9
|
Kim ES, Lee KS. Artificial intelligence in colonoscopy: from detection to diagnosis. Korean J Intern Med 2024; 39:555-562. [PMID: 38695105 PMCID: PMC11236815 DOI: 10.3904/kjim.2023.332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 11/13/2023] [Indexed: 07/12/2024] Open
Abstract
This study reviews the recent progress of artificial intelligence for colonoscopy from detection to diagnosis. The source of data was 27 original studies in PubMed. The search terms were "colonoscopy" (title) and "deep learning" (abstract). The eligibility criteria were: (1) the dependent variable of gastrointestinal disease; (2) the interventions of deep learning for classification, detection and/or segmentation for colonoscopy; (3) the outcomes of accuracy, sensitivity, specificity, area under the curve (AUC), precision, F1, intersection of union (IOU), Dice and/or inference frames per second (FPS); (3) the publication year of 2021 or later; (4) the publication language of English. Based on the results of this study, different deep learning methods would be appropriate for different tasks for colonoscopy, e.g., Efficientnet with neural architecture search (AUC 99.8%) in the case of classification, You Only Look Once with the instance tracking head (F1 96.3%) in the case of detection, and Unet with dense-dilation-residual blocks (Dice 97.3%) in the case of segmentation. Their performance measures reported varied within 74.0-95.0% for accuracy, 60.0-93.0% for sensitivity, 60.0-100.0% for specificity, 71.0-99.8% for the AUC, 70.1-93.3% for precision, 81.0-96.3% for F1, 57.2-89.5% for the IOU, 75.1-97.3% for Dice and 66-182 for FPS. In conclusion, artificial intelligence provides an effective, non-invasive decision support system for colonoscopy from detection to diagnosis.
Collapse
Affiliation(s)
- Eun Sun Kim
- Department of Gastroenterology, Korea University Anam Hospital, Seoul, Korea
| | - Kwang-Sig Lee
- AI Center, Korea University Anam Hospital, Seoul, Korea
| |
Collapse
|
10
|
Sharma CM, Chariar VM. Diagnosis of mental disorders using machine learning: Literature review and bibliometric mapping from 2012 to 2023. Heliyon 2024; 10:e32548. [PMID: 38975193 PMCID: PMC11225745 DOI: 10.1016/j.heliyon.2024.e32548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 06/05/2024] [Accepted: 06/05/2024] [Indexed: 07/09/2024] Open
Abstract
Background Mental disorders (MDs) are becoming a leading burden in non-communicable diseases (NCDs). As per the World Health Organization's 2022 assessment report, there was a steep increase of 25 % in MDs during the COVID-19 pandemic. Early diagnosis of MDs can significantly improve treatment outcome and save disability-adjusted life years (DALYs). In recent times, the application of machine learning (ML) and deep learning (DL)) has shown promising results in the diagnosis of MDs, and the field has witnessed a huge research output in the form of research publications. Therefore, a bibliometric mapping along with a review of recent advancements is required. Methods This study presents a bibliometric analysis and review of the research, published over the last 10 years. Literature searches were conducted in the Scopus database for the period from January 1, 2012, to June 9, 2023. The data was filtered and screened to include only relevant and reliable publications. A total of 2811 journal articles were found. The data was exported to a comma-separated value (CSV) format for further analysis. Furthermore, a review of 40 selected studies was performed. Results The popularity of ML techniques in diagnosing MDs has been growing, with an annual research growth rate of 17.05 %. The Journal of Affective Disorders published the most documents (n = 97), while Wang Y. (n = 64) has published the most articles. Lotka's law is observed, with a minority of authors contributing the majority of publications. The top affiliating institutes are the West China Hospital of Sichuan University followed by the University of California, with China and the US dominating the top 10 institutes. While China has more publications, papers affiliated with the US receive more citations. Depression and schizophrenia are the primary focuses of ML and deep learning (DL) in mental disease detection. Co-occurrence network analysis reveals that ML is associated with depression, schizophrenia, autism, anxiety, ADHD, obsessive-compulsive disorder, and PTSD. Popular algorithms include support vector machine (SVM) classifier, decision tree classifier, and random forest classifier. Furthermore, DL is linked to neuroimaging techniques such as MRI, fMRI, and EEG, as well as bipolar disorder. Current research trends encompass DL, LSTM, generalized anxiety disorder, feature fusion, and convolutional neural networks.
Collapse
Affiliation(s)
- Chandra Mani Sharma
- CRDT, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, 110016, India
- School of Computer Science, UPES, Dehradun, Uttarakhand, India
| | | |
Collapse
|
11
|
Yang SW, Lee KS, Heo JS, Choi ES, Kim K, Lee S, Ahn KH. Machine learning analysis with population data for prepregnancy and perinatal risk factors for the neurodevelopmental delay of offspring. Sci Rep 2024; 14:13993. [PMID: 38886474 PMCID: PMC11183197 DOI: 10.1038/s41598-024-64590-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Accepted: 06/11/2024] [Indexed: 06/20/2024] Open
Abstract
Neurodevelopmental disorders (NDD) in offspring are associated with a complex combination of pre-and postnatal factors. This study uses machine learning and population data to evaluate the association between prepregnancy or perinatal risk factors and the NDD of offspring. Population-based retrospective cohort data were obtained from Korea National Health Insurance Service claims data for 209,424 singleton offspring and their mothers who gave birth for the first time in 2007. The dependent variables were motor development disorder (MDD), cognitive development disorder (CDD) and combined overall neurodevelopmental disorder (NDD) from offspring. Seventeen independent variables from 2002 to 2007 were included. Random forest variable importance and Shapley Additive Explanation (SHAP) values were calculated to analyze the directions of its associations with the predictors. The random forest with oversampling registered much higher areas under the receiver-operating-characteristic curves than the logistic regression of interaction and non-linearity terms, 79% versus 50% (MDD), 82% versus 52% (CDD) and 74% versus 50% (NDD). Based on random forest variable importance, low socioeconomic status and age at birth were highly ranked. In SHAP values, there was a positive association between NDD and pre- or perinatal outcomes, especially, fetal male sex with growth restriction associated the development of NDD in offspring.
Collapse
Affiliation(s)
- Seung-Woo Yang
- Research Institute of Medical Science, Konkuk University School of Medicine, Seoul, Republic of Korea
- School of Medicine, University of California, San Diego, USA
| | - Kwang-Sig Lee
- AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Ju Sun Heo
- Department of Pediatrics, Seoul National University Children's Hospital, Seoul, Republic of Korea
- Department of Pediatrics, Seoul National University College of Medicine, Seoul, Republic of Korea
| | - Eun-Saem Choi
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Kyumin Kim
- Graduate School of Artificial Intelligence, Pohang University of Science and Technology, Pohang, Korea
| | - Sohee Lee
- Department of Statistics, Korea University College of Political Science and Economics, Seoul, Korea
| | - Ki Hoon Ahn
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea.
| |
Collapse
|
12
|
Lee KS, Kim SJ, Kim DC, Park SH, Jang DH, Kim EH, Kang Y, Lee S, Lee SW. Machine learning-based prediction of cerebral oxygen saturation based on multi-modal cerebral oximetry data. Health Informatics J 2024; 30:14604582241259341. [PMID: 38847787 DOI: 10.1177/14604582241259341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
This study develops machine learning-based algorithms that facilitate accurate prediction of cerebral oxygen saturation using waveform data in the near-infrared range from a multi-modal oxygen saturation sensor. Data were obtained from 150,000 observations of a popular cerebral oximeter, Masimo O3™ regional oximetry (Co., United States) and a multi-modal cerebral oximeter, Votem (Inc., Korea). Among these observations, 112,500 (75%) and 37,500 (25%) were used for training and test sets, respectively. The dependent variable was the cerebral oxygen saturation value from the Masimo O3™ (0-100%). The independent variables were the time of measurement (0-300,000 ms) and the 16-bit decimal amplitudes values (infrared and red) from Votem (0-65,535). For the right part of the forehead, the root mean square error of the random forest (0.06) was much smaller than those of linear regression (1.22) and the artificial neural network with one, two or three hidden layers (2.58). The result was similar for the left part of forehead, that is, random forest (0.05) vs logistic regression (1.22) and the artificial neural network with one, two or three hidden layers (2.97). Machine learning aids in accurately predicting of cerebral oxygen saturation, employing the data from a multi-modal cerebral oximeter.
Collapse
Affiliation(s)
- Kwang-Sig Lee
- AI Center, Korea University Anam Hospital, Seoul, Korea
| | - Su Jin Kim
- Department of Emergency Medicine, Korea University Anam Hospital, Seoul, Korea
| | | | - Sang-Hyun Park
- Institute for Healthcare Service Innovation, Korea University College of Medicine, Seoul, Korea
| | - Dong-Hyun Jang
- Department of Public Healthcare Service, Seoul National University Bundang Hospital, Seongnam, Korea
| | - Eung Hwi Kim
- Institute for Healthcare Service Innovation, Korea University College of Medicine, Seoul, Korea
| | - YoungShin Kang
- Institute for Healthcare Service Innovation, Korea University College of Medicine, Seoul, Korea
| | - Sijin Lee
- Department of Emergency Medicine, Korea University Anam Hospital, Seoul, Korea
| | - Sung Woo Lee
- Department of Emergency Medicine, Korea University Anam Hospital, Seoul, Korea
| |
Collapse
|
13
|
Choi JH, Choi Y, Lee KS, Ahn KH, Jang WY. Explainable Model Using Shapley Additive Explanations Approach on Wound Infection after Wide Soft Tissue Sarcoma Resection: "Big Data" Analysis Based on Health Insurance Review and Assessment Service Hub. MEDICINA (KAUNAS, LITHUANIA) 2024; 60:327. [PMID: 38399614 PMCID: PMC10890019 DOI: 10.3390/medicina60020327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 02/04/2024] [Accepted: 02/12/2024] [Indexed: 02/25/2024]
Abstract
Background and Objectives: Soft tissue sarcomas represent a heterogeneous group of malignant mesenchymal tissues. Despite their low prevalence, soft tissue sarcomas present clinical challenges for orthopedic surgeons owing to their aggressive nature, and perioperative wound infections. However, the low prevalence of soft tissue sarcomas has hindered the availability of large-scale studies. This study aimed to analyze wound infections after wide resection in patients with soft tissue sarcomas by employing big data analytics from the Hub of the Health Insurance Review and Assessment Service (HIRA). Materials and Methods: Patients who underwent wide excision of soft tissue sarcomas between 2010 and 2021 were included. Data were collected from the HIRA database of approximately 50 million individuals' information in the Republic of Korea. The data collected included demographic information, diagnoses, prescribed medications, and surgical procedures. Random forest has been used to analyze the major associated determinants. A total of 10,906 observations with complete data were divided into training and validation sets in an 80:20 ratio (8773 vs. 2193 cases). Random forest permutation importance was employed to identify the major predictors of infection and Shapley Additive Explanations (SHAP) values were derived to analyze the directions of associations with predictors. Results: A total of 10,969 patients who underwent wide excision of soft tissue sarcomas were included. Among the study population, 886 (8.08%) patients had post-operative infections requiring surgery. The overall transfusion rate for wide excision was 20.67% (2267 patients). Risk factors among the comorbidities of each patient with wound infection were analyzed and dependence plots of individual features were visualized. The transfusion dependence plot reveals a distinctive pattern, with SHAP values displaying a negative trend for individuals without blood transfusions and a positive trend for those who received blood transfusions, emphasizing the substantial impact of blood transfusions on the likelihood of wound infection. Conclusions: Using the machine learning random forest model and the SHAP values, the perioperative transfusion, male sex, old age, and low SES were important features of wound infection in soft-tissue sarcoma patients.
Collapse
Affiliation(s)
- Ji-Hye Choi
- Department of Orthopedic Surgery, Anam Hospital, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea;
- Anam Hospital Bloodless Medicine Center, Korea University College of Medicine, Seoul 02841, Republic of Korea
| | - Yumin Choi
- School of Mechanical Engineering, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea;
| | - Kwang-Sig Lee
- AI Center, Anam Hospital, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea;
| | - Ki-Hoon Ahn
- Anam Hospital Bloodless Medicine Center, Korea University College of Medicine, Seoul 02841, Republic of Korea
- Department of Obstetrics and Gynecology, Anam Hospital, Korea University College of Medicine, Seoul 02841, Republic of Korea
| | - Woo Young Jang
- Department of Orthopedic Surgery, Anam Hospital, Korea University College of Medicine, 73 Goryeodae-ro, Seongbuk-gu, Seoul 02841, Republic of Korea;
- Anam Hospital Bloodless Medicine Center, Korea University College of Medicine, Seoul 02841, Republic of Korea
| |
Collapse
|
14
|
Lee KS, Song IS, Kim ES, Kim J, Jung S, Nam S, Ahn KH. Machine learning analysis with population data for the associations of preterm birth with temporomandibular disorder and gastrointestinal diseases. PLoS One 2024; 19:e0296329. [PMID: 38165877 PMCID: PMC10760735 DOI: 10.1371/journal.pone.0296329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2023] [Accepted: 12/11/2023] [Indexed: 01/04/2024] Open
Abstract
This study employs machine learning analysis with population data for the associations of preterm birth (PTB) with temporomandibular disorder (TMD) and gastrointestinal diseases. The source of the population-based retrospective cohort was Korea National Health Insurance claims for 489,893 primiparous women with delivery at the age of 25-40 in 2017. The dependent variable was PTB in 2017. Twenty-one predictors were included, i.e., demographic, socioeconomic, disease and medication information during 2002-2016. Random forest variable importance was derived for finding important predictors of PTB and evaluating its associations with the predictors including TMD and gastroesophageal reflux disease (GERD). Shapley Additive Explanation (SHAP) values were calculated to analyze the directions of these associations. The random forest with oversampling registered a much higher area under the receiver-operating-characteristic curve compared to logistic regression with oversampling, i.e., 79.3% vs. 53.1%. According to random forest variable importance values and rankings, PTB has strong associations with low socioeconomic status, GERD, age, infertility, irritable bowel syndrome, diabetes, TMD, salivary gland disease, hypertension, tricyclic antidepressant and benzodiazepine. In terms of max SHAP values, these associations were positive, e.g., low socioeconomic status (0.29), age (0.21), GERD (0.27) and TMD (0.23). The inclusion of low socioeconomic status, age, GERD or TMD into the random forest will increase the probability of PTB by 0.29, 0.21, 0.27 or 0.23. A cutting-edge approach of explainable artificial intelligence highlights the strong associations of preterm birth with temporomandibular disorder, gastrointestinal diseases and antidepressant medication. Close surveillance is needed for pregnant women regarding these multiple risks at the same time.
Collapse
Affiliation(s)
- Kwang-Sig Lee
- AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - In-Seok Song
- Department of Oral and Maxillofacial Surgery, Korea University Anam Hospital, Seoul, Korea
| | - Eun Sun Kim
- Department of Gastroenterology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Jisu Kim
- AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
- Department of Statistics, Korea University College of Political Science and Economics, Seoul, Korea
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Sohee Jung
- AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
- Department of Statistics, Korea University College of Political Science and Economics, Seoul, Korea
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Sunwoo Nam
- AI Center, Korea University College of Medicine, Anam Hospital, Seoul, Korea
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| | - Ki Hoon Ahn
- Department of Obstetrics and Gynecology, Korea University College of Medicine, Anam Hospital, Seoul, Korea
| |
Collapse
|
15
|
Shah J, Siddiquee MMR, Krell-Roesch J, Syrjanen JA, Kremers WK, Vassilaki M, Forzani E, Wu T, Geda YE. Neuropsychiatric Symptoms and Commonly Used Biomarkers of Alzheimer's Disease: A Literature Review from a Machine Learning Perspective. J Alzheimers Dis 2023; 92:1131-1146. [PMID: 36872783 PMCID: PMC11102734 DOI: 10.3233/jad-221261] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
There is a growing interest in the application of machine learning (ML) in Alzheimer's disease (AD) research. However, neuropsychiatric symptoms (NPS), frequent in subjects with AD, mild cognitive impairment (MCI), and other related dementias have not been analyzed sufficiently using ML methods. To portray the landscape and potential of ML research in AD and NPS studies, we present a comprehensive literature review of existing ML approaches and commonly studied AD biomarkers. We conducted PubMed searches with keywords related to NPS, AD biomarkers, machine learning, and cognition. We included a total of 38 articles in this review after excluding some irrelevant studies from the search results and including 6 articles based on a snowball search from the bibliography of the relevant studies. We found a limited number of studies focused on NPS with or without AD biomarkers. In contrast, multiple statistical machine learning and deep learning methods have been used to build predictive diagnostic models using commonly known AD biomarkers. These mainly included multiple imaging biomarkers, cognitive scores, and various omics biomarkers. Deep learning approaches that combine these biomarkers or multi-modality datasets typically outperform single-modality datasets. We conclude ML may be leveraged to untangle the complex relationships of NPS and AD biomarkers with cognition. This may potentially help to predict the progression of MCI or dementia and develop more targeted early intervention approaches based on NPS.
Collapse
Affiliation(s)
- Jay Shah
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, USA
- ASU-Mayo Center for Innovative Imaging, Tempe, AZ, USA
| | - Md Mahfuzur Rahman Siddiquee
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, USA
- ASU-Mayo Center for Innovative Imaging, Tempe, AZ, USA
| | - Janina Krell-Roesch
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
- Institute of Sports and Sports Science, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Jeremy A. Syrjanen
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Walter K. Kremers
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Maria Vassilaki
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, USA
| | - Erica Forzani
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Teresa Wu
- School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, USA
- ASU-Mayo Center for Innovative Imaging, Tempe, AZ, USA
| | - Yonas E. Geda
- Department of Neurology and the Franke Global Neuroscience Education Center, Barrow Neurological Institute, Phoenix, AZ, USA
| |
Collapse
|