1
|
Critelli B, Hassan A, Lahooti I, Noh L, Park JS, Tong K, Lahooti A, Matzko N, Adams JN, Liss L, Quion J, Restrepo D, Nikahd M, Culp S, Lacy-Hulbert A, Speake C, Buxbaum J, Bischof J, Yazici C, Evans-Phillips A, Terp S, Weissman A, Conwell D, Hart P, Ramsey M, Krishna S, Han S, Park E, Shah R, Akshintala V, Windsor JA, Mull NK, Papachristou G, Celi LA, Lee P. A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality. PLoS Med 2025; 22:e1004432. [PMID: 39992936 PMCID: PMC11870378 DOI: 10.1371/journal.pmed.1004432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Revised: 02/28/2025] [Accepted: 01/07/2025] [Indexed: 02/26/2025] Open
Abstract
BACKGROUND An accurate prognostic tool is essential to aid clinical decision-making (e.g., patient triage) and to advance personalized medicine. However, such a prognostic tool is lacking for acute pancreatitis (AP). Increasingly machine learning (ML) techniques are being used to develop high-performing prognostic models in AP. However, methodologic and reporting quality has received little attention. High-quality reporting and study methodology are critical for model validity, reproducibility, and clinical implementation. In collaboration with content experts in ML methodology, we performed a systematic review critically appraising the quality of methodology and reporting of recently published ML AP prognostic models. METHODS/FINDINGS Using a validated search strategy, we identified ML AP studies from the databases MEDLINE and EMBASE published between January 2021 and December 2023. We also searched pre-print servers medRxiv, bioRxiv, and arXiv for pre-prints registered between January 2021 and December 2023. Eligibility criteria included all retrospective or prospective studies that developed or validated new or existing ML models in patients with AP that predicted an outcome following an episode of AP. Meta-analysis was considered if there was homogeneity in the study design and in the type of outcome predicted. For risk of bias (ROB) assessment, we used the Prediction Model Risk of Bias Assessment Tool. Quality of reporting was assessed using the Transparent Reporting of a Multivariable Prediction Model of Individual Prognosis or Diagnosis-Artificial Intelligence (TRIPOD+AI) statement that defines standards for 27 items that should be reported in publications using ML prognostic models. The search strategy identified 6,480 publications of which 30 met the eligibility criteria. Studies originated from China (22), the United States (4), and other (4). All 30 studies developed a new ML model and none sought to validate an existing ML model, producing a total of 39 new ML models. AP severity (23/39) or mortality (6/39) were the most common outcomes predicted. The mean area under the curve for all models and endpoints was 0.91 (SD 0.08). The ROB was high for at least one domain in all 39 models, particularly for the analysis domain (37/39 models). Steps were not taken to minimize over-optimistic model performance in 27/39 models. Due to heterogeneity in the study design and in how the outcomes were defined and determined, meta-analysis was not performed. Studies reported on only 15/27 items from TRIPOD+AI standards, with only 7/30 justifying sample size and 13/30 assessing data quality. Other reporting deficiencies included omissions regarding human-AI interaction (28/30), handling low-quality or incomplete data in practice (27/30), sharing analytical codes (25/30), study protocols (25/30), and reporting source data (19/30). CONCLUSIONS There are significant deficiencies in the methodology and reporting of recently published ML based prognostic models in AP patients. These undermine the validity, reproducibility, and implementation of these prognostic models despite their promise of superior predictive accuracy. REGISTRATION Research Registry (reviewregistry1727).
Collapse
Affiliation(s)
- Brian Critelli
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Amier Hassan
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Ila Lahooti
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Lydia Noh
- Northeast Ohio Medical School, Rootstown, Ohio, United States of America
| | - Jun Sung Park
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Kathleen Tong
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Ali Lahooti
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Nathan Matzko
- Department of Gastroenterology and Hepatology, Weill Cornell Medical College, New York, New York, United States of America
| | - Jan Niklas Adams
- Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Lukas Liss
- Department of Process and Data Science, Rheinisch-Westfälische Technische Hochschule Aachen University, Aachen, Germany
| | - Justin Quion
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - David Restrepo
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Melica Nikahd
- Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Stacey Culp
- Department of Bioinformatics, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Adam Lacy-Hulbert
- Department of Systems Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America
| | - Cate Speake
- Department of Interventional Immunology, Benaroya Research Institute at Virginia Mason, Seattle, Washington, United States of America
| | - James Buxbaum
- Department of Gastroenterology, University of Southern California, Los Angeles, California, United States of America
| | - Jason Bischof
- Department of Emergency Medicine, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Cemal Yazici
- Department of Gastroenterology, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Anna Evans-Phillips
- Department of Gastroenterology, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America
| | - Sophie Terp
- Department of Emergency Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Alexandra Weissman
- Department of Emergency Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania, United States of America
| | - Darwin Conwell
- Department of Medicine, University of Kentucky, Lexington, Kentucky, United States of America
| | - Philip Hart
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Mitchell Ramsey
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Somashekar Krishna
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Samuel Han
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Erica Park
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Raj Shah
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Venkata Akshintala
- Department of Gastroenterology, Johns Hopkins Medical Center, Baltimore, Maryland, United States of America
| | - John A. Windsor
- Department of Surgical and Translational Research Centre, University of Auckland, Auckland, New Zealand
| | - Nikhil K. Mull
- Department of Hospital Medicine and Penn Medicine Center for Evidence-based Practice, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Georgios Papachristou
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| | - Leo Anthony Celi
- Department of Computational Physiology, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Department of Critical Care, Beth Israel Medical Center, Boston, Massachusetts, United States of America
| | - Peter Lee
- Department of Gastroenterology and Hepatology, Ohio State University Wexner Medical Center, Columbus, Ohio, United States of America
| |
Collapse
|
2
|
Maletz S, Balagurunathan Y, Murphy K, Folio L, Chima R, Zaheer A, Vadvala H. AI-powered innovations in pancreatitis imaging: a comprehensive literature synthesis. Abdom Radiol (NY) 2025; 50:438-452. [PMID: 39133362 DOI: 10.1007/s00261-024-04512-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 07/16/2024] [Accepted: 07/29/2024] [Indexed: 08/13/2024]
Abstract
Early identification of pancreatitis remains a significant clinical diagnostic challenge that impacts patient outcomes. The evolution of quantitative imaging followed by deep learning models has shown great promise in the non-invasive diagnosis of pancreatitis and its complications. We provide an overview of advancements in diagnostic imaging and quantitative imaging methods along with the evolution of artificial intelligence (AI). In this article, we review the current and future states of methodology and limitations of AI in improving clinical support in the context of early detection and management of pancreatitis.
Collapse
Affiliation(s)
- Sebastian Maletz
- University of South Florida Morsani College of Medicine, Tampa, USA
| | | | - Kade Murphy
- University of South Florida Morsani College of Medicine, Tampa, USA
| | - Les Folio
- University of South Florida Morsani College of Medicine, Tampa, USA
- Moffitt Cancer Center, Tampa, USA
| | - Ranjit Chima
- University of South Florida Morsani College of Medicine, Tampa, USA
- Moffitt Cancer Center, Tampa, USA
| | | | - Harshna Vadvala
- University of South Florida Morsani College of Medicine, Tampa, USA.
- Moffitt Cancer Center, Tampa, USA.
| |
Collapse
|
3
|
Ge P, Luo Y, Zhang G, Chen H. The role of proteomics in acute pancreatitis: new and old knowledge. Expert Rev Proteomics 2024; 21:115-123. [PMID: 38372668 DOI: 10.1080/14789450.2024.2320810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Accepted: 01/10/2024] [Indexed: 02/20/2024]
Abstract
INTRODUCTION Around 20% of individuals diagnosed with acute pancreatitis (AP) may develop severe acute pancreatitis (SAP), possibly resulting in a mortality rate ranging from 15% to 35%. There is an urgent need to thoroughly understand the molecular phenotypes of SAP resulting from diverse etiologies. The field of translational research on AP has seen the use of several innovative proteomic methodologies via the ongoing improvement of isolation, tagging, and quantification methods. AREAS COVERED This paper provides a comprehensive overview of differentially abundant proteins (DAPs) identified in AP by searching the PubMed/MEDLINE database (2003-2023) and adds significantly to the current theoretical framework. EXPERT OPINION DAPs for potentially diagnosing AP based on proteomic identification need to be confirmed by multi-center studies that include larger samples. The discovery of DAPs in various organs at different AP stages via proteomic technologies is essential better to understand the pathophysiology of AP-related multiple organ dysfunction syndrome. Regarding the translational research of AP, novel approaches like single-cell proteomics and imaging using mass spectrometry may be used as soon as they become available.
Collapse
Affiliation(s)
- Peng Ge
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, Liaoning, China
- Laboratory of Integrative Medicine, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Yalan Luo
- Laboratory of Integrative Medicine, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
- Department of Gastroenterology, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Guixin Zhang
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, Liaoning, China
- Laboratory of Integrative Medicine, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| | - Hailong Chen
- Department of General Surgery, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
- Institute (College) of Integrative Medicine, Dalian Medical University, Dalian, Liaoning, China
- Laboratory of Integrative Medicine, The First Affiliated Hospital of Dalian Medical University, Dalian, Liaoning, China
| |
Collapse
|
4
|
Liang H, Wang M, Wen Y, Du F, Jiang L, Geng X, Tang L, Yan H. Predicting acute pancreatitis severity with enhanced computed tomography scans using convolutional neural networks. Sci Rep 2023; 13:17514. [PMID: 37845380 PMCID: PMC10579320 DOI: 10.1038/s41598-023-44828-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Accepted: 10/12/2023] [Indexed: 10/18/2023] Open
Abstract
This study aimed to evaluate acute pancreatitis (AP) severity using convolutional neural network (CNN) models with enhanced computed tomography (CT) scans. Three-dimensional DenseNet CNN models were developed and trained using the enhanced CT scans labeled with two severity assessment methods: the computed tomography severity index (CTSI) and Atlanta classification. Each labeling method was used independently for model training and validation. Model performance was evaluated using confusion matrices, areas under the receiver operating characteristic curve (AUC-ROC), accuracy, precision, recall, F1 score, and respective macro-average metrics. A total of 1,798 enhanced CT scans met the inclusion criteria were included in this study. The dataset was randomly divided into a training dataset (n = 1618) and a test dataset (n = 180) with a ratio of 9:1. The DenseNet model demonstrated promising predictions for both CTSI and Atlanta classification-labeled CT scans, with accuracy greater than 0.7 and AUC-ROC greater than 0.8. Specifically, when trained with CT scans labeled using CTSI, the DenseNet model achieved good performance, with a macro-average F1 score of 0.835 and a macro-average AUC-ROC of 0.980. The findings of this study affirm the feasibility of employing CNN models to predict the severity of AP using enhanced CT scans.
Collapse
Affiliation(s)
- Hongyin Liang
- Department of General Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
- Sichuan Provincial Key Laboratory of Pancreatic Injury and Repair, Chengdu, 610083, China
| | - Meng Wang
- Department of Traditional Chinese Medicine, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
| | - Yi Wen
- Department of General Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
- Sichuan Provincial Key Laboratory of Pancreatic Injury and Repair, Chengdu, 610083, China
| | - Feizhou Du
- Department of Radiology, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
| | - Li Jiang
- Department of Cardiac Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
| | - Xuelong Geng
- Department of Radiology, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
| | - Lijun Tang
- Department of General Surgery, The General Hospital of Western Theater Command (Chengdu Military General Hospital), Chengdu, 610083, China
- Sichuan Provincial Key Laboratory of Pancreatic Injury and Repair, Chengdu, 610083, China
| | - Hongtao Yan
- Department of Liver Transplantation and Hepato-biliary-pancreatic Surgery, Sichuan Cancer Hospital & Institute, Sichuan Cancer Center, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610016, China.
| |
Collapse
|
5
|
Liu F, Yao J, Liu C, Shou S. Construction and validation of machine learning models for sepsis prediction in patients with acute pancreatitis. BMC Surg 2023; 23:267. [PMID: 37658375 PMCID: PMC10474758 DOI: 10.1186/s12893-023-02151-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 08/11/2023] [Indexed: 09/03/2023] Open
Abstract
BACKGROUND This study aimed to construct predictive models for the risk of sepsis in patients with Acute pancreatitis (AP) using machine learning methods and compared optimal one with the logistic regression (LR) model and scoring systems. METHODS In this retrospective cohort study, data were collected from the Medical Information Mart for Intensive Care III (MIMIC III) database between 2001 and 2012 and the MIMIC IV database between 2008 and 2019. Patients were randomly divided into training and test sets (8:2). The least absolute shrinkage and selection operator (LASSO) regression plus 5-fold cross-validation were used to screen and confirm the predictive factors. Based on the selected predictive factors, 6 machine learning models were constructed, including support vector machine (SVM), K-nearest neighbour (KNN), multi-layer perceptron (MLP), LR, gradient boosting decision tree (GBDT) and adaptive enhancement algorithm (AdaBoost). The models and scoring systems were evaluated and compared using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and the area under the curve (AUC). RESULTS A total of 1, 672 patients were eligible for participation. In the training set, 261 AP patients (19.51%) were diagnosed with sepsis. The predictive factors for the risk of sepsis in AP patients included age, insurance, vasopressors, mechanical ventilation, Glasgow Coma Scale (GCS), heart rate, respiratory rate, temperature, SpO2, platelet, red blood cell distribution width (RDW), International Normalized Ratio (INR), and blood urea nitrogen (BUN). The AUC of the GBDT model for sepsis prediction in the AP patients in the testing set was 0.985. The GBDT model showed better performance in sepsis prediction than the LR, systemic inflammatory response syndrome (SIRS) score, bedside index for severity in acute pancreatitis (BISAP) score, sequential organ failure assessment (SOFA) score, quick-SOFA (qSOFA), and simplified acute physiology score II (SAPS II). CONCLUSION The present findings suggest that compared to the classical LR model and SOFA, qSOFA, SAPS II, SIRS, and BISAP scores, the machine learning model-GBDT model had a better performance in predicting sepsis in the AP patients, which is a useful tool for early identification of high-risk patients and timely clinical interventions.
Collapse
Affiliation(s)
- Fei Liu
- Department of Emergency Medicine, Tianjin Medical University General Hospital, 154 Anshan Road, Heping District, Tianjin, 300052, P.R. China
| | - Jie Yao
- Department of Anesthesiology, The First Affiliated Hospital of Hebei North University, Zhangjiakou, Hebei, 075000, P.R. China
| | - Chunyan Liu
- Department of Intensive Care Unit, The First Affiliated Hospital of Hebei North University, Zhangjiakou, Hebei, 075000, P.R. China
| | - Songtao Shou
- Department of Emergency Medicine, Tianjin Medical University General Hospital, 154 Anshan Road, Heping District, Tianjin, 300052, P.R. China.
| |
Collapse
|