1
|
Baser O, Samayoa G, Yapar N, Baser E. Artificial Intelligence in Identifying Patients With Undiagnosed Nonalcoholic Steatohepatitis. JOURNAL OF HEALTH ECONOMICS AND OUTCOMES RESEARCH 2024; 11:86-94. [PMID: 39351190 PMCID: PMC11441708 DOI: 10.36469/001c.123645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Accepted: 09/13/2024] [Indexed: 10/04/2024]
Abstract
Background: Although increasing in prevalence, nonalcoholic steatohepatitis (NASH) is often undiagnosed in clinical practice. Objective: This study identified patients in the Veterans Affairs (VA) health system who likely had undiagnosed NASH using a machine learning algorithm. Methods: From a VA data set of 25 million adult enrollees, the study population was divided into NASH-positive, non-NASH, and at-risk cohorts. We performed a claims data analysis using a machine learning algorithm. To build our model, the study population was randomly divided into an 80% training subset and a 20% testing subset and tested and trained using a cross-validation technique. In addition to the baseline model, a gradient-boosted classification tree, naïve Bayes, and random forest model were created and compared using receiver operator characteristics, area under the curve, and accuracy. The best performing model was retrained on the full 80% training subset and applied to the 20% testing subset to calculate the performance metrics. Results: In total, 4 223 443 patients met the study inclusion criteria, of whom 4903 were positive for NASH and 35 528 were non-NASH patients. The remainder was in the at-risk patient cohort, of which 514 997 patients (12%) were identified as likely to have NASH. Age, obesity, and abnormal liver function tests were the top determinants in assigning NASH probability. Conclusions: Utilization of machine learning to predict NASH allows for wider recognition, timely intervention, and targeted treatments to improve or mitigate disease progression and could be used as an initial screening tool.
Collapse
Affiliation(s)
- Onur Baser
- Graduate School of Public Health, City University of New York, New York, NY, USA
- University of Michigan Medical School, Ann Arbor, Michigan, USA
- John D. Dingell VA Center, Detroit, Michigan, USA
| | | | - Nehir Yapar
- Columbia Data Analytics, Ann Arbor, Michigan, USA
| | - Erdem Baser
- Columbia Data Analytics, Ann Arbor, Michigan, USA
| |
Collapse
|
2
|
Alizargar A, Chang YL, Alkhaleefah M, Tan TH. Precision Non-Alcoholic Fatty Liver Disease (NAFLD) Diagnosis: Leveraging Ensemble Machine Learning and Gender Insights for Cost-Effective Detection. Bioengineering (Basel) 2024; 11:600. [PMID: 38927836 PMCID: PMC11201081 DOI: 10.3390/bioengineering11060600] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 05/23/2024] [Accepted: 06/06/2024] [Indexed: 06/28/2024] Open
Abstract
Non-Alcoholic Fatty Liver Disease (NAFLD) is characterized by the accumulation of excess fat in the liver. If left undiagnosed and untreated during the early stages, NAFLD can progress to more severe conditions such as inflammation, liver fibrosis, cirrhosis, and even liver failure. In this study, machine learning techniques were employed to predict NAFLD using affordable and accessible laboratory test data, while the conventional technique hepatic steatosis index (HSI)was calculated for comparison. Six algorithms (random forest, K-nearest Neighbors, Logistic Regression, Support Vector Machine, extreme gradient boosting, decision tree), along with an ensemble model, were utilized for dataset analysis. The objective was to develop a cost-effective tool for enabling early diagnosis, leading to better management of the condition. The issue of imbalanced data was addressed using the Synthetic Minority Oversampling Technique Edited Nearest Neighbors (SMOTEENN). Various evaluation metrics including the F1 score, precision, accuracy, recall, confusion matrix, the mean absolute error (MAE), receiver operating characteristics (ROC), and area under the curve (AUC) were employed to assess the suitability of each technique for disease prediction. Experimental results using the National Health and Nutrition Examination Survey (NHANES) dataset demonstrated that the ensemble model achieved the highest accuracy (0.99) and AUC (1.00) compared to the machine learning techniques that we used and HSI. These findings indicate that the ensemble model holds potential as a beneficial tool for healthcare professionals to predict NAFLD, leveraging accessible and cost-effective laboratory test data.
Collapse
Affiliation(s)
- Azadeh Alizargar
- Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Taipei University of Technology, Taipei 10608, Taiwan; (A.A.); (Y.-L.C.); (M.A.)
| | - Yang-Lang Chang
- Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Taipei University of Technology, Taipei 10608, Taiwan; (A.A.); (Y.-L.C.); (M.A.)
| | - Mohammad Alkhaleefah
- Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Taipei University of Technology, Taipei 10608, Taiwan; (A.A.); (Y.-L.C.); (M.A.)
| | - Tan-Hsu Tan
- Department of Electrical Engineering, College of Electrical Engineering and Computer Science, National Taipei University of Technology, Taipei 10608, Taiwan; (A.A.); (Y.-L.C.); (M.A.)
- Innovation Frontier Institute of Research for Science and Technology, National Taipei University of Technology, Taipei 10608, Taiwan
| |
Collapse
|
3
|
Cao T, Zhu Q, Tong C, Halengbieke A, Ni X, Tang J, Han Y, Li Q, Yang X. Establishment of a machine learning predictive model for non-alcoholic fatty liver disease: A longitudinal cohort study. Nutr Metab Cardiovasc Dis 2024; 34:1456-1466. [PMID: 38508988 DOI: 10.1016/j.numecd.2024.02.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 01/25/2024] [Accepted: 02/10/2024] [Indexed: 03/22/2024]
Abstract
BACKGROUND AND AIMS Non-alcoholic fatty liver disease (NAFLD) is a common chronic liver disease, which lacks effective drug treatments. This study aimed to construct an eXtreme Gradient Boosting (XGBoost) prediction model to identify or evaluate potential NAFLD patients. METHODS AND RESULTS We conducted a longitudinal study of 22,140 individuals from the Beijing Health Management Cohort. Variable filtering was performed using the least absolute shrinkage and selection operator. Random Over Sampling Examples was used to address imbalanced data. Next, the XGBoost model and the other three machine learning (ML) models were built using balanced data. Finally, the variable importance of the XGBoost model was ranked. Among four ML algorithms, we got that the XGBoost model outperformed the other models with the following results: accuracy of 0.835, sensitivity of 0.835, specificity of 0.834, Youden index of 0.669, precision of 0.831, recall of 0.835, F-1 score of 0.833, and an area under the curve of 0.914. The top five variables with the greatest impact on the onset of NAFLD were aspartate aminotransferase, cardiometabolic index, body mass index, alanine aminotransferase, and triglyceride-glucose index. CONCLUSION The predictive model based on the XGBoost algorithm enables early prediction of the onset of NAFLD. Additionally, assessing variable importance provides valuable insights into the prevention and treatment of NAFLD.
Collapse
Affiliation(s)
- Tengrui Cao
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China.
| | - Qian Zhu
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Office for Cancer Registry, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China.
| | - Chao Tong
- Beijing Center for Disease Prevention and Control, Beijing 100013, China.
| | - Aheyeerke Halengbieke
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China.
| | - Xuetong Ni
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China.
| | - Jianmin Tang
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China.
| | - Yumei Han
- Science and Education Section, Beijing Physical Examination Center, No. 59, Beiwei Road, Xicheng District, Beijing 100050, China.
| | - Qiang Li
- Science and Education Section, Beijing Physical Examination Center, No. 59, Beiwei Road, Xicheng District, Beijing 100050, China.
| | - Xinghua Yang
- School of Public Health, Capital Medical University, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China; Beijing Municipal Key Laboratory of Clinical Epidemiology, NO. 10 Xitoutiao, Youanmenwai, Fengtai District, Beijing 100069, China.
| |
Collapse
|
4
|
Kokkorakis M, Muzurović E, Volčanšek Š, Chakhtoura M, Hill MA, Mikhailidis DP, Mantzoros CS. Steatotic Liver Disease: Pathophysiology and Emerging Pharmacotherapies. Pharmacol Rev 2024; 76:454-499. [PMID: 38697855 DOI: 10.1124/pharmrev.123.001087] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 12/22/2023] [Accepted: 01/25/2024] [Indexed: 05/05/2024] Open
Abstract
Steatotic liver disease (SLD) displays a dynamic and complex disease phenotype. Consequently, the metabolic dysfunction-associated steatotic liver disease (MASLD)/metabolic dysfunction-associated steatohepatitis (MASH) therapeutic pipeline is expanding rapidly and in multiple directions. In parallel, noninvasive tools for diagnosing and monitoring responses to therapeutic interventions are being studied, and clinically feasible findings are being explored as primary outcomes in interventional trials. The realization that distinct subgroups exist under the umbrella of SLD should guide more precise and personalized treatment recommendations and facilitate advancements in pharmacotherapeutics. This review summarizes recent updates of pathophysiology-based nomenclature and outlines both effective pharmacotherapeutics and those in the pipeline for MASLD/MASH, detailing their mode of action and the current status of phase 2 and 3 clinical trials. Of the extensive arsenal of pharmacotherapeutics in the MASLD/MASH pipeline, several have been rejected, whereas other, mainly monotherapy options, have shown only marginal benefits and are now being tested as part of combination therapies, yet others are still in development as monotherapies. Although the Food and Drug Administration (FDA) has recently approved resmetirom, additional therapeutic approaches in development will ideally target MASH and fibrosis while improving cardiometabolic risk factors. Due to the urgent need for the development of novel therapeutic strategies and the potential availability of safety and tolerability data, repurposing existing and approved drugs is an appealing option. Finally, it is essential to highlight that SLD and, by extension, MASLD should be recognized and approached as a systemic disease affecting multiple organs, with the vigorous implementation of interdisciplinary and coordinated action plans. SIGNIFICANCE STATEMENT: Steatotic liver disease (SLD), including metabolic dysfunction-associated steatotic liver disease and metabolic dysfunction-associated steatohepatitis, is the most prevalent chronic liver condition, affecting more than one-fourth of the global population. This review aims to provide the most recent information regarding SLD pathophysiology, diagnosis, and management according to the latest advancements in the guidelines and clinical trials. Collectively, it is hoped that the information provided furthers the understanding of the current state of SLD with direct clinical implications and stimulates research initiatives.
Collapse
Affiliation(s)
- Michail Kokkorakis
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Emir Muzurović
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Špela Volčanšek
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Marlene Chakhtoura
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Michael A Hill
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Dimitri P Mikhailidis
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| | - Christos S Mantzoros
- Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts (M.K., C.S.M.); Department of Clinical Pharmacy and Pharmacology, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands (M.K.); Endocrinology Section, Department of Internal Medicine, Clinical Center of Montenegro, Podgorica, Montenegro (E.M.); Faculty of Medicine, University of Montenegro, Podgorica, Montenegro (E.M.); Department of Endocrinology, Diabetes, and Metabolic Diseases, University Medical Center Ljubljana, Ljubljana, Slovenia (Š.V.); Medical Faculty Ljubljana, Ljubljana, Slovenia (Š.V.); Division of Endocrinology, Department of Internal Medicine, American University of Beirut Medical Center, Beirut, Lebanon (M.C.); Dalton Cardiovascular Research Center, University of Missouri, Columbia, Missouri (M.A.H.); Department of Medical Pharmacology and Physiology, School of Medicine, University of Missouri, Columbia, Missouri (M.A.H.); Department of Clinical Biochemistry, Royal Free Hospital Campus, University College London Medical School, University College London (UCL), London, United Kingdom (D.P.M.); Mohammed Bin Rashid University of Medicine and Health Sciences, Dubai, United Arab Emirates (D.P.M.); and Boston VA Healthcare System, Harvard Medical School, Boston, Massachusetts (C.S.M.)
| |
Collapse
|
5
|
Zamanian H, Shalbaf A, Zali MR, Khalaj AR, Dehghan P, Tabesh M, Hatami B, Alizadehsani R, Tan RS, Acharya UR. Application of artificial intelligence techniques for non-alcoholic fatty liver disease diagnosis: A systematic review (2005-2023). COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 244:107932. [PMID: 38008040 DOI: 10.1016/j.cmpb.2023.107932] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Revised: 11/13/2023] [Accepted: 11/15/2023] [Indexed: 11/28/2023]
Abstract
BACKGROUND AND OBJECTIVES Non-alcoholic fatty liver disease (NAFLD) is a common liver disease with a rapidly growing incidence worldwide. For prognostication and therapeutic decisions, it is important to distinguish the pathological stages of NAFLD: steatosis, steatohepatitis, and liver fibrosis, which are definitively diagnosed on invasive biopsy. Non-invasive ultrasound (US) imaging, including US elastography technique, and clinical parameters can be used to diagnose and grade NAFLD and its complications. Artificial intelligence (AI) is increasingly being harnessed for developing NAFLD diagnostic models based on clinical, biomarker, or imaging data. In this work, we systemically reviewed the literature for AI-enabled NAFLD diagnostic models based on US (including elastography) and clinical (including serological) data. METHODS We performed a comprehensive search on Google Scholar, Scopus, and PubMed search engines for articles published between January 2005 and June 2023 related to AI models for NAFLD diagnosis based on US and/or clinical parameters using the following search terms: "non-alcoholic fatty liver disease", "non-alcoholic steatohepatitis", "deep learning", "machine learning", "artificial intelligence", "ultrasound imaging", "sonography", "clinical information". RESULTS We reviewed 64 published models that used either US (including elastography) or clinical data input to detect the presence of NAFLD, non-alcoholic steatohepatitis, and/or fibrosis, and in some cases, the severity of steatosis, inflammation, and/or fibrosis as well. The performances of the published models were summarized, and stratified by data input and algorithms used, which could be broadly divided into machine and deep learning approaches. CONCLUSION AI models based on US imaging and clinical data can reliably detect NAFLD and its complications, thereby reducing diagnostic costs and the need for invasive liver biopsy. The models offer advantages of efficiency, accuracy, and accessibility, and serve as virtual assistants for specialists to accelerate disease diagnosis and reduce treatment costs for patients and healthcare systems.
Collapse
Affiliation(s)
- H Zamanian
- Department of Biomedical Engineering and Medical Physics, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - A Shalbaf
- Department of Biomedical Engineering and Medical Physics, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| | - M R Zali
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - A R Khalaj
- Tehran obesity treatment center, Department of Surgery, Faculty of Medicine, Shahed University, Tehran, Iran
| | - P Dehghan
- Department of Radiology, Imaging Department, Taleghani Hospital, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - M Tabesh
- Cardiac Primary Prevention Research Center, Cardiovascular Diseases Research, Tehran University of Medical Sciences, Tehran, Iran
| | - B Hatami
- Gastroenterology and Liver Diseases Research Center, Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - R Alizadehsani
- Institute for Intelligent Systems Research and Innovation (IISRI), Deakin University, Waurn Ponds, VIC, Australia
| | - Ru-San Tan
- National Heart Centre Singapore, Singapore 169609, Singapore; Duke-NUS Medical School, Singapore
| | - U Rajendra Acharya
- School of Mathematics, Physics and Computing, University of Southern Queensland, Toowoomba, QLD, Australia; Centre for Health Research, University of Southern Queensland, Australia
| |
Collapse
|
6
|
Liu L, Lin J, Liu L, Gao J, Xu G, Yin M, Liu X, Wu A, Zhu J. Automated machine learning models for nonalcoholic fatty liver disease assessed by controlled attenuation parameter from the NHANES 2017-2020. Digit Health 2024; 10:20552076241272535. [PMID: 39119551 PMCID: PMC11307367 DOI: 10.1177/20552076241272535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 07/09/2024] [Indexed: 08/10/2024] Open
Abstract
Background Nonalcoholic fatty liver disease (NAFLD) is recognized as one of the most common chronic liver diseases worldwide. This study aims to assess the efficacy of automated machine learning (AutoML) in the identification of NAFLD using a population-based cross-sectional database. Methods All data, including laboratory examinations, anthropometric measurements, and demographic variables, were obtained from the National Health and Nutrition Examination Survey (NHANES). NAFLD was defined by controlled attenuation parameter (CAP) in liver transient ultrasound elastography. The least absolute shrinkage and selection operator (LASSO) regression analysis was employed for feature selection. Six algorithms were utilized on the H2O-automated machine learning platform: Gradient Boosting Machine (GBM), Distributed Random Forest (DRF), Extremely Randomized Trees (XRT), Generalized Linear Model (GLM), eXtreme Gradient Boosting (XGBoost), and Deep Learning (DL). These algorithms were selected for their diverse strengths, including their ability to handle complex, non-linear relationships, provide high predictive accuracy, and ensure interpretability. The models were evaluated by area under receiver operating characteristic curves (AUC) and interpreted by the calibration curve, the decision curve analysis, variable importance plot, SHapley Additive exPlanation plot, partial dependence plots, and local interpretable model agnostic explanation plot. Results A total of 4177 participants (non-NAFLD 3167 vs NAFLD 1010) were included to develop and validate the AutoML models. The model developed by XGBoost performed better than other models in AutoML, achieving an AUC of 0.859, an accuracy of 0.795, a sensitivity of 0.773, and a specificity of 0.802 on the validation set. Conclusions We developed an XGBoost model to better evaluate the presence of NAFLD. Based on the XGBoost model, we created an R Shiny web-based application named Shiny NAFLD (http://39.101.122.171:3838/App2/). This application demonstrates the potential of AutoML in clinical research and practice, offering a promising tool for the real-world identification of NAFLD.
Collapse
Affiliation(s)
- Lihe Liu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Jiaxi Lin
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Lu Liu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Jingwen Gao
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Guoting Xu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Minyue Yin
- Department of Gastroenterology, Beijing Friendship Hospital, Capital Medical University, Beijing, China
| | - Xiaolin Liu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Airong Wu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| | - Jinzhou Zhu
- Department of Gastroenterology, The First Affiliated Hospital of Soochow University, Suzhou, China
| |
Collapse
|
7
|
Deng Y, Ma Y, Fu J, Wang X, Yu C, Lv J, Man S, Wang B, Li L. A dynamic machine learning model for prediction of NAFLD in a health checkup population: A longitudinal study. Heliyon 2023; 9:e18758. [PMID: 37576311 PMCID: PMC10412833 DOI: 10.1016/j.heliyon.2023.e18758] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 07/25/2023] [Accepted: 07/26/2023] [Indexed: 08/15/2023] Open
Abstract
Background Non-alcoholic fatty liver disease (NAFLD) is one of the most common liver diseases worldwide. Currently, most NAFLD prediction models are diagnostic models based on cross-sectional data, which failed to provide early identification or clarify causal relationships. We aimed to use time-series deep learning models with longitudinal health checkup records to predict the onset of NAFLD in the future, and update the model stepwise by incorporating new checkup records to achieve dynamic prediction. Methods 10,493 participants with over 6 health checkup records from Beijing MJ Health Screening Center were included to conduct a retrospective cohort study, in which the constantly updated initial 5 checkup data were incorporated stepwise to predict the risk of NAFLD at and after their sixth health checkups. A total of 33 variables were considered, consisting of demographic characteristics, medical history, lifestyle, physical examinations, and laboratory tests. L1-penalized logistic regression (LR) was used for feature selection. The long short-term memory (LSTM) algorithm was introduced for model development, and five-fold cross-validation was conducted to tune and choose optimal hyperparameters. Both internal validation and external validation were conducted, using the 20% randomly divided holdout test dataset and previously unseen data from Shanghai MJ Health Screening Center, respectively, to evaluate model performance. The evaluation metrics included area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, Brier score, and decision curve. Bootstrap sampling was implemented to generate 95% confidence intervals of all the metrics. Finally, the Shapley additive explanations (SHAP) algorithm was applied in the holdout test dataset for model interpretability to obtain time-specific and sample-specific contributions of each feature. Results Among the 10,493 participants, 1662 (15.84%) were diagnosed with NAFLD at and after their sixth health checkups. The predictive performance of the deep learning model in the internal validation dataset improved over the incorporation of the checkups, with AUROC increasing from 0.729 (95% CI: 0.698,0.760) at baseline to 0.818 (95% CI: 0.798,0.844) when consecutive 5 checkups were included. The external validation dataset, containing 1728 participants, was used to verify the results, in which AUROC increased from 0.700 (95% CI: 0.657,0.740) with only the first checkups to 0.792 (95% CI: 0.758,0.825) with all five. The results of feature significance showed that body fat percentage, alanine transaminase (ALT), and uric acid owned the greatest impact on the outcome, time-specific, individual-specific and dynamic feature contributions were also produced for model interpretability. Conclusion A dynamic prediction model was successfully established in our study, and the prediction capability kept improving with the renewal of the latest checkup records. In addition, we identified key features associated with the onset of NAFLD, making it possible to optimize the prevention and control strategies of the disease in the general population.
Collapse
Affiliation(s)
- Yuhan Deng
- Chongqing Research Institute of Big Data, Peking University, Chongqing, China
- Meinian Institute of Health, Beijing, China
| | - Yuan Ma
- School of Population Medicine and Public Health, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Jingzhu Fu
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
| | | | - Canqing Yu
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
- Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
| | - Jun Lv
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
- Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
| | - Sailimai Man
- Meinian Institute of Health, Beijing, China
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
| | - Bo Wang
- Meinian Institute of Health, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
| | - Liming Li
- Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China
- Peking University Health Science Center Meinian Public Health Institute, Beijing, China
- Key Laboratory of Epidemiology of Major Diseases (Peking University), Ministry of Education, Beijing, China
- Peking University Center for Public Health and Epidemic Preparedness & Response, Beijing, China
| |
Collapse
|
8
|
Machine-Learning Algorithm for Predicting Fatty Liver Disease in a Taiwanese Population. J Pers Med 2022; 12:jpm12071026. [PMID: 35887527 PMCID: PMC9317783 DOI: 10.3390/jpm12071026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Revised: 06/18/2022] [Accepted: 06/22/2022] [Indexed: 12/04/2022] Open
Abstract
The rising incidence of fatty liver disease (FLD) poses a health challenge, and is expected to be the leading global cause of liver-related morbidity and mortality in the near future. Early case identification is crucial for disease intervention. A retrospective cross-sectional study was performed on 31,930 Taiwanese subjects (25,544 training and 6386 testing sets) who had received health check-ups and abdominal ultrasounds in Changhua Christian Hospital from January 2009 to January 2019. Clinical and laboratory factors were included for analysis by different machine-learning algorithms. In addition, the performance of the machine-learning algorithms was compared with that of the fatty liver index (FLI). Totally, 6658/25,544 (26.1%) and 1647/6386 (25.8%) subjects had moderate-to-severe liver disease in the training and testing sets, respectively. Five machine-learning models were examined and demonstrated exemplary performance in predicting FLD. Among these models, the xgBoost model revealed the highest area under the receiver operating characteristic (AUROC) (0.882), accuracy (0.833), F1 score (0.829), sensitivity (0.833), and specificity (0.683) compared with those of neural network, logistic regression, random forest, and support vector machine-learning models. The xgBoost, neural network, and logistic regression models had a significantly higher AUROC than that of FLI. Body mass index was the most important feature to predict FLD according to the feature ranking scores. The xgBoost model had the best overall prediction ability for diagnosing FLD in our study. Machine-learning algorithms provide considerable benefits for screening candidates with FLD.
Collapse
|