1
|
AlMuhaideb S, bin Shawyah A, Alhamid MF, Alabbad A, Alabbad M, Alsergani H, Alswailem O. Beyond the Bedside: Machine Learning-Guided Length of Stay (LOS) Prediction for Cardiac Patients in Tertiary Care. Healthcare (Basel) 2024; 12:1110. [PMID: 38891185 PMCID: PMC11171809 DOI: 10.3390/healthcare12111110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 05/19/2024] [Accepted: 05/28/2024] [Indexed: 06/21/2024] Open
Abstract
Efficient management of hospital resources is essential for providing high-quality healthcare while ensuring sustainability. Length of stay (LOS), measuring the duration from admission to discharge, directly impacts patient outcomes and resource utilization. Accurate LOS prediction offers numerous benefits, including reducing re-admissions, ensuring appropriate staffing, and facilitating informed discharge planning. While conventional methods rely on statistical models and clinical expertise, recent advances in machine learning (ML) present promising avenues for enhancing LOS prediction. This research focuses on developing an ML-based LOS prediction model trained on a comprehensive real-world dataset and discussing the important factors towards practical deployment of trained ML models in clinical settings. This research involves the development of a comprehensive adult cardiac patient dataset (SaudiCardioStay (SCS)) from the King Faisal Specialist Hospital & Research Centre (KFSH&RC) hospital in Saudi Arabia, comprising 4930 patient encounters for 3611 unique patients collected from 2019 to 2022 (excluding 2020). A diverse range of classical ML models (i.e., Random Forest (RF), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LGBM), artificial neural networks (ANNs), Average Voting Regression (AvgVotReg)) are implemented for the SCS dataset to explore the potential of existing ML models in LOS prediction. In addition, this study introduces a novel approach for LOS prediction by incorporating a dedicated LOS classifier within a sophisticated ensemble methodology (i.e., Two-Level Sequential Cascade Generalization (2LSCG), Three-Level Sequential Cascade Generalization (3LSCG), Parallel Cascade Generalization (PCG)), aiming to enhance prediction accuracy and capture nuanced patterns in healthcare data. The experimental results indicate the best mean absolute error (MAE) of 0.1700 for the 3LSCG model. Relatively comparable performance was observed for the AvgVotReg model, with a MAE of 0.1703. In the end, a detailed analysis of the practical implications, limitations, and recommendations concerning the deployment of ML approaches in actual clinical settings is presented.
Collapse
Affiliation(s)
- Sarab AlMuhaideb
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 266, Riyadh 11362, Saudi Arabia;
| | - Alanoud bin Shawyah
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 266, Riyadh 11362, Saudi Arabia;
| | - Mohammed F. Alhamid
- Healthcare Information Technology Affairs (HITA), King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh 11211, Saudi Arabia; (M.F.A.); (A.A.); (M.A.); (O.A.)
| | - Arwa Alabbad
- Healthcare Information Technology Affairs (HITA), King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh 11211, Saudi Arabia; (M.F.A.); (A.A.); (M.A.); (O.A.)
| | - Maram Alabbad
- Healthcare Information Technology Affairs (HITA), King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh 11211, Saudi Arabia; (M.F.A.); (A.A.); (M.A.); (O.A.)
| | - Hani Alsergani
- Heart Center, King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh 11211, Saudi Arabia;
| | - Osama Alswailem
- Healthcare Information Technology Affairs (HITA), King Faisal Specialist Hospital & Research Center, P.O. Box 3354, Riyadh 11211, Saudi Arabia; (M.F.A.); (A.A.); (M.A.); (O.A.)
| |
Collapse
|
2
|
Abdurrab I, Mahmood T, Sheikh S, Aijaz S, Kashif M, Memon A, Ali I, Peerwani G, Pathan A, Alkhodre AB, Siddiqui MS. Predicting the Length of Stay of Cardiac Patients Based on Pre-Operative Variables-Bayesian Models vs. Machine Learning Models. Healthcare (Basel) 2024; 12:249. [PMID: 38255136 PMCID: PMC10815919 DOI: 10.3390/healthcare12020249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2023] [Revised: 01/04/2024] [Accepted: 01/16/2024] [Indexed: 01/24/2024] Open
Abstract
Length of stay (LoS) prediction is deemed important for a medical institution's operational and logistical efficiency. Sound estimates of a patient's stay increase clinical preparedness and reduce aberrations. Various statistical methods and techniques are used to quantify and predict the LoS of a patient based on pre-operative clinical features. This study evaluates and compares the results of Bayesian (simple Bayesian regression and hierarchical Bayesian regression) models and machine learning (ML) regression models against multiple evaluation metrics for the problem of LoS prediction of cardiac patients admitted to Tabba Heart Institute, Karachi, Pakistan (THI) between 2015 and 2020. In addition, the study also presents the use of hierarchical Bayesian regression to account for data variability and skewness without homogenizing the data (by removing outliers). LoS estimates from the hierarchical Bayesian regression model resulted in a root mean squared error (RMSE) and mean absolute error (MAE) of 1.49 and 1.16, respectively. Simple Bayesian regression (without hierarchy) achieved an RMSE and MAE of 3.36 and 2.05, respectively. The average RMSE and MAE of ML models remained at 3.36 and 1.98, respectively.
Collapse
Affiliation(s)
- Ibrahim Abdurrab
- Department of Computer Science, Institute of Business Administration, Karachi 75270, Pakistan;
| | - Tariq Mahmood
- Department of Computer Science, Institute of Business Administration, Karachi 75270, Pakistan;
| | - Sana Sheikh
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Saba Aijaz
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Muhammad Kashif
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Ahson Memon
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Imran Ali
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Ghazal Peerwani
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Asad Pathan
- Department of Clinical Research Cardiology, Tabba Heart Institute, Karachi 75950, Pakistan; (S.S.); (S.A.); (M.K.); (A.M.); (I.A.); (G.P.); (A.P.)
| | - Ahmad B. Alkhodre
- Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah 42351, Saudi Arabia; (A.B.A.); (M.S.S.)
| | - Muhammad Shoaib Siddiqui
- Faculty of Computer and Information Systems, Islamic University of Madinah, Madinah 42351, Saudi Arabia; (A.B.A.); (M.S.S.)
| |
Collapse
|
3
|
Lee JJR, Srinivasan R, Ong CS, Alejo D, Schena S, Shpitser I, Sussman M, Whitman GJR, Malinsky D. Causal determinants of postoperative length of stay in cardiac surgery using causal graphical learning. J Thorac Cardiovasc Surg 2023; 166:e446-e462. [PMID: 36154975 PMCID: PMC9968823 DOI: 10.1016/j.jtcvs.2022.08.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 07/24/2022] [Accepted: 08/18/2022] [Indexed: 11/22/2022]
Abstract
OBJECTIVE We aimed to learn the causal determinants of postoperative length of stay in cardiac surgery patients undergoing isolated coronary artery bypass grafting or aortic valve replacement surgery. METHODS For patients undergoing isolated coronary artery bypass grafting or isolated aortic valve replacement surgeries between 2011 and 2016, we used causal graphical modeling on electronic health record data. The Fast Causal Inference (FCI) algorithm from the Tetrad software was used on data to estimate a Partial Ancestral Graph (PAG) depicting direct and indirect causes of postoperative length of stay, given background clinical knowledge. Then, we used the latent variable intervention-calculus when the directed acyclic graph is absent (LV-IDA) algorithm to estimate strengths of causal effects of interest. Finally, we ran a linear regression for postoperative length of stay to contrast statistical associations with what was learned by our causal analysis. RESULTS In our cohort of 2610 patients, the mean postoperative length of stay was 219 hours compared with the Society of Thoracic Surgeons 2016 national mean postoperative length of stay of approximately 168 hours. Most variables that clinicians believe to be predictors of postoperative length of stay were found to be causes, but some were direct (eg, age, diabetes, hematocrit, total operating time, and postoperative complications), and others were indirect (including gender, race, and operating surgeon). The strongest average causal effects on postoperative length of stay were exhibited by preoperative dialysis (209 hours); neuro-, pulmonary-, and infection-related postoperative complications (315 hours, 89 hours, and 131 hours, respectively); reintubation (61 hours); extubation in operating room (-47 hours); and total operating room duration (48 hours). Linear regression coefficients diverged from causal effects in magnitude (eg, dialysis) and direction (eg, crossclamp time). CONCLUSIONS By using retrospective electronic health record data and background clinical knowledge, causal graphical modeling retrieved direct and indirect causes of postoperative length of stay and their relative strengths. These insights will be useful in designing clinical protocols and targeting improvements in patient management.
Collapse
Affiliation(s)
- Jaron J R Lee
- Department of Computer Science, Johns Hopkins University, Baltimore, Md; Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Md.
| | - Ranjani Srinivasan
- Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Md; Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, Md
| | - Chin Siang Ong
- Division of Surgical Outcomes, Department of Surgery, Yale School of Medicine, New Haven, Conn
| | - Diane Alejo
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Stefano Schena
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Ilya Shpitser
- Department of Computer Science, Johns Hopkins University, Baltimore, Md; Malone Center for Engineering in Healthcare, Johns Hopkins University, Baltimore, Md
| | - Marc Sussman
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Glenn J R Whitman
- Division of Cardiac Surgery, Department of Surgery, Johns Hopkins Hospital, Baltimore, Md
| | - Daniel Malinsky
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY
| |
Collapse
|
4
|
Samy SS, Karthick S, Ghosal M, Singh S, Sudarsan JS, Nithiyanantham S. Adoption of machine learning algorithm for predicting the length of stay of patients (construction workers) during COVID pandemic. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY : AN OFFICIAL JOURNAL OF BHARATI VIDYAPEETH'S INSTITUTE OF COMPUTER APPLICATIONS AND MANAGEMENT 2023; 15:1-9. [PMID: 37360312 PMCID: PMC10250170 DOI: 10.1007/s41870-023-01296-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 05/15/2023] [Indexed: 06/28/2023]
Abstract
The construction sector in a rapidly developing country like India is a very unorganized sector. A large number of workers were affected and hospitalized during the pandemic. This situation is costing the sector heavily in several respects. This research study was conducted as part of using machine learning algorithms to improve construction company health and safety policies. LOS (length of stay) is used to predict how long a patient will stay in a hospital. Predicting LOS is very useful not only for hospitals, but also for construction companies to measure resources and reduce costs. Predicting LOS has become an important step in most hospitals before admitting patients. In this post, we used the Medical Information Mart for Intensive Care(MIMIC III) dataset and applied four different machine learning algorithms: decision tree classifier, random forest, Artificial Neural Network (ANN), and logistic regression. First, I performed data pre-processing to clean up the dataset. In the next step, we performed function selection using the Select Best algorithm with an evaluation function of chi2 to perform hot coding. We then performed a split between training and testing and applied a machine learning algorithm. The metric used for comparison was accuracy. After implementing the algorithms, the accuracy was compared. Random forest was found to perform best at 89%. Afterwards, we performed hyperparameter tuning using a grid search algorithm on a random forest to obtain higher accuracy. The final accuracy is 90%. This kind of research can help improve health security policies by introducing modern computational techniques, and can also help optimize resources.
Collapse
Affiliation(s)
- S. Selvakumara Samy
- Department of Computational Intelligence, SRM Institute of Science and Technology, Kattankulathur, Tamilnadu 603203 India
| | - S. Karthick
- Department of Computational Intelligence, SRM Institute of Science and Technology, Kattankulathur, Tamilnadu 603203 India
| | - Meghna Ghosal
- Department of Computational Intelligence, SRM Institute of Science and Technology, Kattankulathur, Tamilnadu 603203 India
| | - Sameer Singh
- Department of Computational Intelligence, SRM Institute of Science and Technology, Kattankulathur, Tamilnadu 603203 India
| | - J. S. Sudarsan
- School of Energy and Environment, NICMAR University, 25/1, Balewadi, Pune, 411045 India
| | - S. Nithiyanantham
- Department of Physics, (Ultrasonic/NDT and Bio-Physics Divisions), Thiru. Vi. Kalyanasundaram Government Arts and Science College (Affiliated to Bharathidasan University, Thiruchirapalli), Thiruvarur, Tamilnadu 610003 India
| |
Collapse
|
5
|
Weiss AJ, Yadaw AS, Meretzky DL, Levin MA, Adams DH, McCardle K, Pandey G, Iyengar R. Machine learning using institution-specific multi-modal electronic health records improves mortality risk prediction for cardiac surgery patients. JTCVS OPEN 2023; 14:214-251. [PMID: 37425442 PMCID: PMC10328834 DOI: 10.1016/j.xjon.2023.03.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 02/04/2023] [Accepted: 03/16/2023] [Indexed: 07/11/2023]
Abstract
Background The Society of Thoracic Surgeons risk scores are widely used to assess risk of morbidity and mortality in specific cardiac surgeries but may not perform optimally in all patients. In a cohort of patients undergoing cardiac surgery, we developed a data-driven, institution-specific machine learning-based model inferred from multi-modal electronic health records and compared the performance with the Society of Thoracic Surgeons models. Methods All adult patients undergoing cardiac surgery between 2011 and 2016 were included. Routine electronic health record administrative, demographic, clinical, hemodynamic, laboratory, pharmacological, and procedural data features were extracted. The outcome was postoperative mortality. The database was randomly split into training (development) and test (evaluation) cohorts. Models developed using 4 classification algorithms were compared using 6 evaluation metrics. The performance of the final model was compared with the Society of Thoracic Surgeons models for 7 index surgical procedures. Results A total of 6392 patients were included and described by 4016 features. Overall mortality was 3.0% (n = 193). The XGBoost algorithm using only features with no missing data (336 features) yielded the best-performing predictor. When applied to the test set, the predictor performed well (F-measure = 0.775; precision = 0.756; recall = 0.795; accuracy = 0.986; area under the receiver operating characteristic curve = 0.978; area under the precision-recall curve = 0.804). eXtreme Gradient Boosting consistently demonstrated improved performance over the Society of Thoracic Surgeons models when evaluated on index procedures within the test set. Conclusions Machine learning models using institution-specific multi-modal electronic health records may improve performance in predicting mortality for individual patients undergoing cardiac surgery compared with the standard-of-care, population-derived Society of Thoracic Surgeons models. Institution-specific models may provide insights complementary to population-derived risk predictions to aid patient-level decision making.
Collapse
Affiliation(s)
- Aaron J. Weiss
- Department of Cardiovascular Surgery, Icahn School of Medicine at Mount Sinai, New York, NY
- Department of Thoracic and Cardiovascular Surgery, Heart, Vascular & Thoracic Institute, Cleveland Clinic, Cleveland, Ohio
- Department of Pharmacological Sciences and Institute for Systems Biomedicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Arjun S. Yadaw
- Department of Pharmacological Sciences and Institute for Systems Biomedicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - David L. Meretzky
- Department of Pharmacological Sciences and Institute for Systems Biomedicine, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Matthew A. Levin
- Division of Cardiothoracic Anesthesia, Department of Anesthesiology and Critical Care, Icahn School of Medicine at Mount Sinai, New York, NY
| | - David H. Adams
- Department of Cardiovascular Surgery, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Ken McCardle
- Department of Clinical Operations, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY
| | - Ravi Iyengar
- Department of Pharmacological Sciences and Institute for Systems Biomedicine, Icahn School of Medicine at Mount Sinai, New York, NY
| |
Collapse
|
6
|
Babayoff O, Shehory O, Geller S, Shitrit-Niselbaum C, Weiss-Meilik A, Sprecher E. Improving Hospital Outpatient Clinics Appointment Schedules by Prediction Models. J Med Syst 2022; 47:5. [PMID: 36585996 DOI: 10.1007/s10916-022-01902-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 12/14/2022] [Indexed: 01/01/2023]
Abstract
Patient no-shows and suboptimal patient appointment length scheduling reduce clinical efficiency and impair the clinic's quality of service. The main objective of this study is to improve appointment scheduling in hospital outpatient clinics. We developed generic supervised machine learning models to predict patient no-shows and patient's length of appointment (LOA). We performed a retrospective study using more than 100,000 records of patient appointments in a hospital outpatient clinic. Several machine learning algorithms were used for the development of our prediction models. We trained our models on a dataset that contained patients', physicians', and appointments' characteristics. Our feature set combines both unstudied features and features adopted from previous studies. In addition, we identified the influential features for predicting LOA and no-show. Our LOA model's performance was 6.92 in terms of MAE, and our no-show model's performance was 92.1% in terms of F-score. We compared our models' performance to the performance of previous research models by applying their methods to our dataset; our models demonstrated better performance. We show that the major effector of such differences is the use of our novel features. To evaluate the effect of our prediction results on the quality of schedules produced by appointment systems (AS), we developed an interface layer between our prediction models and the AS, where prediction results comprise the AS input. Using our prediction models, there was an 80% improvement in the daily cumulative patient waiting time and a 33% reduction in the daily cumulative physician idle time.
Collapse
Affiliation(s)
| | - Onn Shehory
- Bar-Ilan University, 5290002, Ramat Gan, Israel
| | - Shamir Geller
- Division of Dermatology, Tel Aviv Sourasky Medical Center and Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Chen Shitrit-Niselbaum
- I-Medata AI Center, Tel-Aviv Sourasky Medical Center, Sackler Faculty of Medicine, Tel-Aviv University, Tel Aviv, Israel
| | - Ahuva Weiss-Meilik
- I-Medata AI Center, Tel-Aviv Sourasky Medical Center, Sackler Faculty of Medicine, Tel-Aviv University, Tel Aviv, Israel
| | - Eli Sprecher
- Division of Dermatology, Tel Aviv Sourasky Medical Center and Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
7
|
Stratification of Length of Stay Prediction following Surgical Cytoreduction in Advanced High-Grade Serous Ovarian Cancer Patients Using Artificial Intelligence; the Leeds L-AI-OS Score. Curr Oncol 2022; 29:9088-9104. [PMID: 36547125 PMCID: PMC9776955 DOI: 10.3390/curroncol29120711] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 11/11/2022] [Accepted: 11/21/2022] [Indexed: 11/25/2022] Open
Abstract
(1) Background: Length of stay (LOS) has been suggested as a marker of the effectiveness of short-term care. Artificial Intelligence (AI) technologies could help monitor hospital stays. We developed an AI-based novel predictive LOS score for advanced-stage high-grade serous ovarian cancer (HGSOC) patients following cytoreductive surgery and refined factors significantly affecting LOS. (2) Methods: Machine learning and deep learning methods using artificial neural networks (ANN) were used together with conventional logistic regression to predict continuous and binary LOS outcomes for HGSOC patients. The models were evaluated in a post-hoc internal validation set and a Graphical User Interface (GUI) was developed to demonstrate the clinical feasibility of sophisticated LOS predictions. (3) Results: For binary LOS predictions at differential time points, the accuracy ranged between 70-98%. Feature selection identified surgical complexity, pre-surgery albumin, blood loss, operative time, bowel resection with stoma formation, and severe postoperative complications (CD3-5) as independent LOS predictors. For the GUI numerical LOS score, the ANN model was a good estimator for the standard deviation of the LOS distribution by ± two days. (4) Conclusions: We demonstrated the development and application of both quantitative and qualitative AI models to predict LOS in advanced-stage EOC patients following their cytoreduction. Accurate identification of potentially modifiable factors delaying hospital discharge can further inform services performing root cause analysis of LOS.
Collapse
|
8
|
Multilayer dynamic ensemble model for intensive care unit mortality prediction of neonate patients. J Biomed Inform 2022; 135:104216. [DOI: 10.1016/j.jbi.2022.104216] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Revised: 09/25/2022] [Accepted: 09/28/2022] [Indexed: 12/26/2022]
|
9
|
Wen Y, Rahman MF, Zhuang Y, Pokojovy M, Xu H, McCaffrey P, Vo A, Walser E, Moen S, Tseng TLB. Time-to-event modeling for hospital length of stay prediction for COVID-19 patients. MACHINE LEARNING WITH APPLICATIONS 2022; 9:100365. [PMID: 35756359 PMCID: PMC9213016 DOI: 10.1016/j.mlwa.2022.100365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2022] [Revised: 05/30/2022] [Accepted: 06/14/2022] [Indexed: 11/19/2022] Open
Abstract
Providing timely patient care while maintaining optimal resource utilization is one of the central operational challenges hospitals have been facing throughout the pandemic. Hospital length of stay (LOS) is an important indicator of hospital efficiency, quality of patient care, and operational resilience. Numerous researchers have developed regression or classification models to predict LOS. However, conventional models suffer from the lack of capability to make use of typically censored clinical data. We propose to use time-to-event modeling techniques, also known as survival analysis, to predict the LOS for patients based on individualized information collected from multiple sources. The performance of six proposed survival models is evaluated and compared based on clinical data from COVID-19 patients.
Collapse
Affiliation(s)
- Yuxin Wen
- Dale E. and Sarah Ann Fowler School of Engineering, Chapman University, Orange, CA 92866, USA
| | - Md Fashiar Rahman
- Department of Industrial, Manufacturing and Systems Engineering, The University of Texas at El Paso, El Paso, TX 79968, USA
| | - Yan Zhuang
- Department of Biomedical Engineering, Sichuan University, Chengdu, Sichuan 610065, China
| | - Michael Pokojovy
- Department of Mathematical Sciences, The University of Texas at El Paso, El Paso, TX 79968, USA
| | - Honglun Xu
- Department of Industrial, Manufacturing and Systems Engineering, The University of Texas at El Paso, El Paso, TX 79968, USA
| | - Peter McCaffrey
- The University of Texas Medical Branch, Galveston, TX 77550, USA
| | - Alexander Vo
- The University of Texas Medical Branch, Galveston, TX 77550, USA
| | - Eric Walser
- The University of Texas Medical Branch, Galveston, TX 77550, USA
| | - Scott Moen
- The University of Texas Medical Branch, Galveston, TX 77550, USA
| | - Tzu-Liang Bill Tseng
- Department of Industrial, Manufacturing and Systems Engineering, The University of Texas at El Paso, El Paso, TX 79968, USA
| |
Collapse
|
10
|
Stone K, Zwiggelaar R, Jones P, Mac Parthaláin N. A systematic review of the prediction of hospital length of stay: Towards a unified framework. PLOS DIGITAL HEALTH 2022; 1:e0000017. [PMID: 36812502 PMCID: PMC9931263 DOI: 10.1371/journal.pdig.0000017] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 01/06/2022] [Indexed: 05/09/2023]
Abstract
Hospital length of stay of patients is a crucial factor for the effective planning and management of hospital resources. There is considerable interest in predicting the LoS of patients in order to improve patient care, control hospital costs and increase service efficiency. This paper presents an extensive review of the literature, examining the approaches employed for the prediction of LoS in terms of their merits and shortcomings. In order to address some of these problems, a unified framework is proposed to better generalise the approaches that are being used to predict length of stay. This includes the investigation of the types of routinely collected data used in the problem as well as recommendations to ensure robust and meaningful knowledge modelling. This unified common framework enables the direct comparison of results between length of stay prediction approaches and will ensure that such approaches can be used across several hospital environments. A literature search was conducted in PubMed, Google Scholar and Web of Science from 1970 until 2019 to identify LoS surveys which review the literature. 32 Surveys were identified, from these 32 surveys, 220 papers were manually identified to be relevant to LoS prediction. After removing duplicates, and exploring the reference list of studies included for review, 93 studies remained. Despite the continuing efforts to predict and reduce the LoS of patients, current research in this domain remains ad-hoc; as such, the model tuning and data preprocessing steps are too specific and result in a large proportion of the current prediction mechanisms being restricted to the hospital that they were employed in. Adopting a unified framework for the prediction of LoS could yield a more reliable estimate of the LoS as a unified framework enables the direct comparison of length of stay methods. Additional research is also required to explore novel methods such as fuzzy systems which could build upon the success of current models as well as further exploration of black-box approaches and model interpretability.
Collapse
Affiliation(s)
- Kieran Stone
- Department of Computer Science, Aberystwyth University, Ceredigion, SY23 3DB, Wales, United Kingdom
| | - Reyer Zwiggelaar
- Department of Computer Science, Aberystwyth University, Ceredigion, SY23 3DB, Wales, United Kingdom
| | - Phil Jones
- Bronglais District General Hospital, Aberystwyth, Ceredigion, SY23 1ER, Wales, United Kingdom
| | - Neil Mac Parthaláin
- Department of Computer Science, Aberystwyth University, Ceredigion, SY23 3DB, Wales, United Kingdom
| |
Collapse
|
11
|
Rellum SR, Schuurmans J, van der Ven WH, Eberl S, Driessen AHG, Vlaar APJ, Veelo DP. Machine learning methods for perioperative anesthetic management in cardiac surgery patients: a scoping review. J Thorac Dis 2022; 13:6976-6993. [PMID: 35070381 PMCID: PMC8743411 DOI: 10.21037/jtd-21-765] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 08/27/2021] [Indexed: 12/27/2022]
Abstract
Background Machine learning (ML) is developing fast with promising prospects within medicine and already has several applications in perioperative care. We conducted a scoping review to examine the extent and potential limitations of ML implementation in perioperative anesthetic care, specifically in cardiac surgery patients. Methods We mapped the current literature by searching three databases: MEDLINE (Ovid), EMBASE (Ovid), and Cochrane Library. Articles were eligible if they reported on perioperative ML use in the field of cardiac surgery with relevance to anesthetic practices. Data on the applicability of ML and comparability to conventional statistical methods were extracted. Results Forty-six articles on ML relevant to the work of the anesthesiologist in cardiac surgery were identified. Three main categories emerged: (I) event and risk prediction, (II) hemodynamic monitoring, and (III) automation of echocardiography. Prediction models based on ML tend to behave similarly to conventional statistical methods. Using dynamic hemodynamic or ultrasound data in ML models, however, shifts the potential to promising results. Conclusions ML in cardiac surgery is increasingly used in perioperative anesthetic management. The majority is used for prediction purposes similar to conventional clinical scores. Remarkable ML model performances are achieved when using real-time dynamic parameters. However, beneficial clinical outcomes of ML integration have yet to be determined. Nonetheless, the first steps introducing ML in perioperative anesthetic care for cardiac surgery have been taken.
Collapse
Affiliation(s)
- Santino R Rellum
- Department of Anesthesiology, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands.,Department of Intensive Care, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Jaap Schuurmans
- Department of Anesthesiology, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands.,Department of Intensive Care, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Ward H van der Ven
- Department of Anesthesiology, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands.,Department of Intensive Care, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Susanne Eberl
- Department of Anesthesiology, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Antoine H G Driessen
- Department of Cardiothoracic Surgery, Heart Center, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Alexander P J Vlaar
- Department of Intensive Care, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| | - Denise P Veelo
- Department of Anesthesiology, Amsterdam UMC, Location AMC, Amsterdam, The Netherlands
| |
Collapse
|
12
|
Machine Learning-Based Risk Prediction of Critical Care Unit Admission for Advanced Stage High Grade Serous Ovarian Cancer Patients Undergoing Cytoreductive Surgery: The Leeds-Natal Score. J Clin Med 2021; 11:jcm11010087. [PMID: 35011828 PMCID: PMC8745521 DOI: 10.3390/jcm11010087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Revised: 12/20/2021] [Accepted: 12/22/2021] [Indexed: 12/12/2022] Open
Abstract
Achieving complete surgical cytoreduction in advanced stage high grade serous ovarian cancer (HGSOC) patients warrants an availability of Critical Care Unit (CCU) beds. Machine Learning (ML) could be helpful in monitoring CCU admissions to improve standards of care. We aimed to improve the accuracy of predicting CCU admission in HGSOC patients by ML algorithms and developed an ML-based predictive score. A cohort of 291 advanced stage HGSOC patients with fully curated data was selected. Several linear and non-linear distances, and quadratic discriminant ML methods, were employed to derive prediction information for CCU admission. When all the variables were included in the model, the prediction accuracies were higher for linear discriminant (0.90) and quadratic discriminant (0.93) methods compared with conventional logistic regression (0.84). Feature selection identified pre-treatment albumin, surgical complexity score, estimated blood loss, operative time, and bowel resection with stoma as the most significant prediction features. The real-time prediction accuracy of the Graphical User Interface CCU calculator reached 95%. Limited, potentially modifiable, mostly intra-operative factors contributing to CCU admission were identified and suggest areas for targeted interventions. The accurate quantification of CCU admission patterns is critical information when counseling patients about peri-operative risks related to their cytoreductive surgery.
Collapse
|
13
|
Abd-Elrazek MA, Eltahawi AA, Abd Elaziz MH, Abd-Elwhab MN. Predicting length of stay in hospitals intensive care unit using general admission features. AIN SHAMS ENGINEERING JOURNAL 2021; 12:3691-3702. [DOI: 10.1016/j.asej.2021.02.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
14
|
Leveraging electronic health record data to inform hospital resource management : A systematic data mining approach. Health Care Manag Sci 2021; 24:716-741. [PMID: 34031792 DOI: 10.1007/s10729-021-09554-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 02/02/2021] [Indexed: 10/21/2022]
Abstract
Early identification of resource needs is instrumental in promoting efficient hospital resource management. Hospital information systems, and electronic health records (EHR) in particular, collect valuable demographic and clinical patient data from the moment patients are admitted, which can help predict expected resource needs in early stages of patient episodes. To this end, this article proposes a data mining methodology to systematically obtain predictions for relevant managerial variables by leveraging structured EHR data. Specifically, these managerial variables are: i) Diagnosis categories, ii) procedure codes, iii) diagnosis-related groups (DRGs), iv) outlier episodes and v) length of stay (LOS). The proposed methodology approaches the problem in four stages: Feature set construction, feature selection, prediction model development, and model performance evaluation. We tested this approach with an EHR dataset of 5,089 inpatient episodes and compared different classification and regression models (for categorical and continuous variables, respectively), performed temporal analysis of model performance, analyzed the impact of training set homogeneity on performance and assessed the contribution of different EHR data elements for model predictive power. Overall, our results indicate that inpatient EHR data can effectively be leveraged to inform resource management on multiple perspectives. Logistic regression (combined with minimal redundancy maximum relevance feature selection) and bagged decision trees yielded best results for predicting categorical and numerical managerial variables, respectively. Furthermore, our temporal analysis indicated that, while DRG classes are more difficult to predict, several diagnosis categories, procedure codes and LOS amongst shorter-stay patients can be predicted with higher confidence in early stages of patient stay. Lastly, value of information analysis indicated that diagnoses, medication and structured assessment forms were the most valuable EHR data elements in predicting managerial variables of interest through a data mining approach.
Collapse
|
15
|
Sen E, Seckiner SU. The Use of Artificial Neural Networks to Determine In-Hospital Mortality After Coronary Artery Bypass Surgery. J Cardiothorac Vasc Anesth 2021; 35:2432-2437. [PMID: 33934989 DOI: 10.1053/j.jvca.2021.03.049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/05/2021] [Accepted: 03/26/2021] [Indexed: 11/11/2022]
Abstract
OBJECTIVES The aim of this study was to present an artificial neural network (ANN) model for the accurate estimation of in-hospital mortality and to demonstrate the validity of the model with real data and a comparison with conventional multiple linear regression models. DESIGN Retrospective clinical study. SETTING University hospital. INTERVENTIONS None. MEASUREMENTS AND MAIN RESULTS Data were collected from the medical records of 88 patients who had undergone coronary artery bypass graft surgery with an extracorporeal cardiopulmonary pump between January 2018 and March 2020. An ANN approach was used to assess the association between in-hospital mortality and variables from preoperative, intraoperative, and postoperative data garnered retrospectively from patient files. The study examined the data of 88 patients with a mean age of 62.4 ± ten years, 60 (68.1%) of whom were men and 28 (31.8%) of whom were women. An examination of the average success of the training algorithms in the training, validation, and test sets revealed that the quick propagation algorithm ranked first with 97.397%. The algorithm that best matched the present study's dataset was the batch back propagation algorithm, with an average of 99.622 (in other words, this training set accurately estimated 99.622% of every 100 items of data). Furthermore, the rates continuously were greater than 90% when the probability of estimating the estimated output was examined. CONCLUSION The ANN model tended to outperform multiple linear regression models in predicting in-hospital mortality among patients who have undergone coronary artery bypass graft surgery. Physicians can make use of this information as an aid in performing treatments and ensuring that more accurate quality of surgical care is achieved.
Collapse
Affiliation(s)
- Elzem Sen
- School of Medicine, Department of Anesthesiology and Reanimation University of Gaziantep, Gaziantep, Turkey.
| | - Serap Ulusam Seckiner
- School of Engineering, Department of Industrial Engineering University of Gaziantep, Gaziantep, Turkey
| |
Collapse
|
16
|
Dogu E, Albayrak YE, Tuncay E. Length of hospital stay prediction with an integrated approach of statistical-based fuzzy cognitive maps and artificial neural networks. Med Biol Eng Comput 2021; 59:483-496. [PMID: 33544271 DOI: 10.1007/s11517-021-02327-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 01/24/2021] [Indexed: 10/22/2022]
Abstract
Chronic obstructive pulmonary disease (COPD) is a global burden, which is estimated to be the third leading cause of death worldwide by 2030. The economic burden of COPD grows continuously because it is not a curable disease. These conditions make COPD an important research field of artificial intelligence (AI) techniques in medicine. In this study, an integrated approach of the statistical-based fuzzy cognitive maps (SBFCM) and artificial neural networks (ANN) is proposed for predicting length of hospital stay of patients with COPD, who admitted to the hospital with an acute exacerbation. The SBFCM method is developed to determine the input variables of the ANN model. The SBFCM conducts statistical analysis to prepare preliminary information for the experts and then collects expert opinions accordingly, to define a conceptual map of the system. The integration of SBFCM and ANN methods provides both statistical data and expert opinion in the prediction model. In the numerical application, the proposed approach outperformed the conventional approach and other machine learning algorithms with 79.95% accuracy, revealing the power of expert opinion involvement in medical decisions. A medical decision support framework is constructed for better prediction of length of hospital stay and more effective hospital management.
Collapse
Affiliation(s)
- Elif Dogu
- Industrial Engineering Dept., Galatasaray University, Ciragan Cad. No.: 36, Ortakoy, 34349, Istanbul, Turkey.
| | - Y Esra Albayrak
- Industrial Engineering Dept., Galatasaray University, Ciragan Cad. No.: 36, Ortakoy, 34349, Istanbul, Turkey
| | - Esin Tuncay
- Yedikule Chest Diseases & Thoracic Surgery Training & Research Hospital, Belgrad Kapi Yolu Cad. No.: 1 34020 Zeytinburnu, Istanbul, Turkey
| |
Collapse
|
17
|
Kabir S, Farrokhvar L, Russell MW, Forman A, Kamali B. Regional socioeconomic factors and length of hospital stay: a case study in Appalachia. J Public Health (Oxf) 2021. [DOI: 10.1007/s10389-020-01418-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
18
|
Sarıyer G, Ataman MG. The likelihood of requiring a diagnostic test: Classifying emergency department patients with logistic regression. HEALTH INF MANAG J 2020; 51:13-22. [PMID: 32223440 DOI: 10.1177/1833358320908975] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
BACKGROUND Emergency departments (EDs) play an important role in health systems since they are the front line for patients with emergency medical conditions who frequently require diagnostic tests and timely treatment. OBJECTIVE To improve decision-making and accelerate processes in EDs, this study proposes predictive models for classifying patients according to whether or not they are likely to require a diagnostic test based on referral diagnosis, age, gender, triage category and type of arrival. METHOD Retrospective data were categorised into four output patient groups: not requiring any diagnostic test (group A); requiring a radiology test (group B); requiring a laboratory test (group C); requiring both tests (group D). Multivariable logistic regression models were used, with the outcome classifications represented as a series of binary variables: test (1) or no test (0); in the case of group A, no test (1) or test (0). RESULTS For all models, age, triage category, type of arrival and referral diagnosis were significant predictors whereas gender was not. The main referral diagnosis with high model coefficients varied by designed output groups (groups A, B, C and D). The overall accuracies of the logistic regression models for groups A, B, C and D were, respectively, 74.11%, 73.07%, 82.47% and 85.79%. Specificity metrics were higher than the sensitivities for groups B, C and D, meaning that these models were better able to predict negative outcomes. IMPLICATIONS These results provide guidance for ED triage staff, researchers and practitioners in making rapid decisions regarding patients' diagnostic test requirements based on specified variables in the predictive models. This is critical in ED operations planning as it potentially decreases waiting times, while increasing patient satisfaction and operational performance.
Collapse
|
19
|
Symum H, Zayas-Castro JL. Prediction of Chronic Disease-Related Inpatient Prolonged Length of Stay Using Machine Learning Algorithms. Healthc Inform Res 2020; 26:20-33. [PMID: 32082697 PMCID: PMC7010949 DOI: 10.4258/hir.2020.26.1.20] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 11/06/2019] [Accepted: 11/21/2019] [Indexed: 11/23/2022] Open
Abstract
Objectives The study aimed to develop and compare predictive models based on supervised machine learning algorithms for predicting the prolonged length of stay (LOS) of hospitalized patients diagnosed with five different chronic conditions. Methods An administrative claim dataset (2008-2012) of a regional network of nine hospitals in the Tampa Bay area, Florida, USA, was used to develop the prediction models. Features were extracted from the dataset using the International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) codes. Five learning algorithms, namely, decision tree C5.0, linear support vector machine (LSVM), k-nearest neighbors, random forest, and multi-layered artificial neural networks, were used to build the model with semi-supervised anomaly detection and two feature selection methods. Issues with the unbalanced nature of the dataset were resolved using the Synthetic Minority Over-sampling Technique (SMOTE). Results LSVM with wrapper feature selection performed moderately well for all patient cohorts. Using SMOTE to counter data imbalances triggered a tradeoff between the model's sensitivity and specificity, which can be masked under a similar area under the curve. The proposed aggregate rank selection approach resulted in a balanced performing model compared to other criteria. Finally, factors such as comorbidity conditions, source of admission, and payer types were associated with the increased risk of a prolonged LOS. Conclusions Prolonged LOS is mostly associated with pre-intraoperative clinical and patient socioeconomic factors. Accurate patient identification with the risk of prolonged LOS using the selected model can provide hospitals a better tool for planning early discharge and resource allocation, thus reducing avoidable hospitalization costs.
Collapse
Affiliation(s)
- Hasan Symum
- Department of Industrial and Management System Engineering, University of South Florida, Tampa, FL, USA
| | - José L Zayas-Castro
- Department of Industrial and Management System Engineering, University of South Florida, Tampa, FL, USA.,College of Engineering, University of South Florida, Tampa, FL, USA
| |
Collapse
|
20
|
Data analytics for the sustainable use of resources in hospitals: Predicting the length of stay for patients with chronic diseases. INFORMATION & MANAGEMENT 2020. [DOI: 10.1016/j.im.2020.103282] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
21
|
Shillan D, Sterne JAC, Champneys A, Gibbison B. Use of machine learning to analyse routinely collected intensive care unit data: a systematic review. CRITICAL CARE : THE OFFICIAL JOURNAL OF THE CRITICAL CARE FORUM 2019; 23:284. [PMID: 31439010 PMCID: PMC6704673 DOI: 10.1186/s13054-019-2564-9] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2019] [Accepted: 08/09/2019] [Indexed: 01/30/2023]
Abstract
BACKGROUND Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients' journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. METHODS Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. RESULTS Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108-4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000-10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). CONCLUSIONS The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice.
Collapse
Affiliation(s)
- Duncan Shillan
- NIHR Bristol Biomedical Research Centre, University of Bristol, Bristol, UK.,Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Jonathan A C Sterne
- NIHR Bristol Biomedical Research Centre, University of Bristol, Bristol, UK.,Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Alan Champneys
- Department of Engineering Mathematics, University of Bristol, Bristol, UK
| | - Ben Gibbison
- NIHR Bristol Biomedical Research Centre, University of Bristol, Bristol, UK. .,Translational Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK. .,Department of Anaesthesia, Bristol Royal Infirmary, Level 7 Queens Building, Upper Maudlin St, Bristol, BS2 8HW, UK.
| |
Collapse
|
22
|
Predicting Intracerebral Hemorrhage Patients' Length-of-Stay Probability Distribution Based on Demographic, Clinical, Admission Diagnosis, and Surgery Information. JOURNAL OF HEALTHCARE ENGINEERING 2019; 2019:4571636. [PMID: 30809336 PMCID: PMC6369489 DOI: 10.1155/2019/4571636] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 01/01/2019] [Indexed: 02/05/2023]
Abstract
The vast majority of patients with intracerebral hemorrhage (ICH) suffer from long and uncertain length of stay (LOS). The aim of our study was to provide decision support for discharge and admission plans by predicting ICH patients' LOS probability distribution. The demographics, clinical predictors, admission diagnosis, and surgery information from 3,600 ICH patients were used in this study. We used univariable Cox analysis, multivariable Cox analysis, Cox-variable of importance (Cox-VIMP) analysis, and an intersection analysis to select predictors and used random survival forests (RSF)—a method in survival analysis—to predict LOS probability distribution. The Cox-VIMP method constructed by us effectively selected significant correlation predictors. The Cox-VIMP RSF model can improve prediction performance and is significantly different from the other models. The Cox-VIMP can contribute to the screening of predictors, and the RSF model can be established through those predictors to predict the probability distribution of LOS in each patient.
Collapse
|
23
|
Cuadrado D, Riaño D, Gómez J, Bodí M, Sirgo G, Esteban F, García R, Rodríguez A. Pursuing Optimal Prediction of Discharge Time in ICUs with Machine Learning Methods. Artif Intell Med 2019. [DOI: 10.1007/978-3-030-21642-9_20] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
24
|
Length of Hospital Stay Prediction at the Admission Stage for Cardiology Patients Using Artificial Neural Network. JOURNAL OF HEALTHCARE ENGINEERING 2018; 2016:7035463. [PMID: 27195660 PMCID: PMC5058566 DOI: 10.1155/2016/7035463] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Accepted: 03/15/2016] [Indexed: 11/18/2022]
Abstract
For hospitals' admission management, the ability to predict length of stay (LOS) as early as in the preadmission stage might be helpful to monitor the quality of inpatient care. This study is to develop artificial neural network (ANN) models to predict LOS for inpatients with one of the three primary diagnoses: coronary atherosclerosis (CAS), heart failure (HF), and acute myocardial infarction (AMI) in a cardiovascular unit in a Christian hospital in Taipei, Taiwan. A total of 2,377 cardiology patients discharged between October 1, 2010, and December 31, 2011, were analyzed. Using ANN or linear regression model was able to predict correctly for 88.07% to 89.95% CAS patients at the predischarge stage and for 88.31% to 91.53% at the preadmission stage. For AMI or HF patients, the accuracy ranged from 64.12% to 66.78% at the predischarge stage and 63.69% to 67.47% at the preadmission stage when a tolerance of 2 days was allowed.
Collapse
|
25
|
Belderrar A, Hazzab A. Hierarchical Genetic Algorithm and Fuzzy Radial Basis Function Networks for Factors Influencing Hospital Length of Stay Outliers. Healthc Inform Res 2017; 23:226-232. [PMID: 28875058 PMCID: PMC5572527 DOI: 10.4258/hir.2017.23.3.226] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2017] [Revised: 06/10/2017] [Accepted: 07/02/2017] [Indexed: 11/23/2022] Open
Abstract
Objectives Controlling hospital high length of stay outliers can provide significant benefits to hospital management resources and lead to cost reduction. The strongest predictive factors influencing high length of stay outliers should be identified to build a high-performance prediction model for hospital outliers. Methods We highlight the application of the hierarchical genetic algorithm to provide the main predictive factors and to define the optimal structure of the prediction model fuzzy radial basis function neural network. To establish the prediction model, we used a data set of 26,897 admissions from five different intensive care units with discharges between 2001 and 2012. We selected and analyzed the high length of stay outliers using the trimming method geometric mean plus two standard deviations. A total of 28 predictive factors were extracted from the collected data set and investigated. Results High length of stay outliers comprised 5.07% of the collected data set. The results indicate that the prediction model can provide effective forecasting. We found 10 common predictive factors within the studied intensive care units. The obtained main predictive factors include patient demographic characteristics, hospital characteristics, medical events, and comorbidities. Conclusions The main initial predictive factors available at the time of admission are useful in evaluating high length of stay outliers. The proposed approach can provide a practical tool for healthcare providers, and its application can be extended to other hospital predictions, such as readmissions and cost.
Collapse
Affiliation(s)
- Ahmed Belderrar
- Laboratory of Control, Analysis and Optimization of Electro-Energetic Systems, University Tahri Mohamed Bechar, Bechar, Algeria
| | - Abdeldjebar Hazzab
- Laboratory of Control, Analysis and Optimization of Electro-Energetic Systems, University Tahri Mohamed Bechar, Bechar, Algeria
| |
Collapse
|
26
|
Khajehali N, Alizadeh S. Extract critical factors affecting the length of hospital stay of pneumonia patient by data mining (case study: an Iranian hospital). Artif Intell Med 2017; 83:2-13. [PMID: 28712673 DOI: 10.1016/j.artmed.2017.06.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Revised: 06/21/2017] [Accepted: 06/28/2017] [Indexed: 11/30/2022]
Abstract
MOTIVATION Pneumonia is a prevalent infection of lower respiratory tract caused by infected lungs. Length of stay (LOS) in hospital is one of the simplest and most important indicators in hospital activity that is used for different purposes. The aim of this study is to explore the important factors affecting the LOS of patients with pneumonia in hospitals. METHODS The clinical data set for the study were collected from 387 patients in a specialized hospital in Iran between 2009 and 2015. Patients discharge summary includes their demographic details, reasons for admission, prescribed medications for the patient, the result of laboratory tests, and length of treatment. RESULTS AND CONCLUSIONS The proposed model in the study demonstrates the way various scenarios of data processing impact on the scale efficiency model, which points to the significance of the pre-processing in data mining. In this article, some methods were utilized; it is noteworthy that Bayesian boosting method led to better results in identifying the factors affecting LOS (accuracy 95.17%). In addition, it was found that 58% of patients younger than 15 years old and 74% of the elderly within the age range of 74-88 were more vulnerable to pneumonia disease. Also, it was found that the Meropenem is a relatively more effective medicine compared to other antibiotics which are used to treat pneumonia in the majority of age groups. Regardless of the impact of various laboratory findings (including CRP, ESR, WBC, NA, K), the patients LOS decreased as a result of Meropenem.
Collapse
|
27
|
Awad A, Bader–El–Den M, McNicholas J. Patient length of stay and mortality prediction: A survey. Health Serv Manage Res 2017; 30:105-120. [DOI: 10.1177/0951484817696212] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
Over the past few years, there has been increased interest in data mining and machine learning methods to improve hospital performance, in particular hospitals want to improve their intensive care unit statistics by reducing the number of patients dying inside the intensive care unit. Research has focused on prediction of measurable outcomes, including risk of complications, mortality and length of hospital stay. The length of stay is an important metric both for healthcare providers and patients, influenced by numerous factors. In particular, the length of stay in critical care is of great significance, both to patient experience and the cost of care, and is influenced by factors specific to the highly complex environment of the intensive care unit. The length of stay is often used as a surrogate for other outcomes, where those outcomes cannot be measured; for example as a surrogate for hospital or intensive care unit mortality. The length of stay is also a parameter, which has been used to identify the severity of illnesses and healthcare resource utilisation. This paper examines a range of length of stay and mortality prediction applications in acute medicine and the critical care unit. It also focuses on the methods of analysing length of stay and mortality prediction. Moreover, the paper provides a classification and evaluation for the analytical methods of the length of stay and mortality prediction associated with a grouping of relevant research papers published in the years 1984 to 2016 related to the domain of survival analysis. In addition, the paper highlights some of the gaps and challenges of the domain.
Collapse
Affiliation(s)
- Aya Awad
- School of Computing, University of Portsmouth, UK
| | | | | |
Collapse
|
28
|
Ferrão JC, Oliveira MD, Janela F, Martins HMG. Preprocessing structured clinical data for predictive modeling and decision support. A roadmap to tackle the challenges. Appl Clin Inform 2016; 7:1135-1153. [PMID: 27924347 PMCID: PMC5228148 DOI: 10.4338/aci-2016-03-soa-0035] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2016] [Accepted: 10/01/2016] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND EHR systems have high potential to improve healthcare delivery and management. Although structured EHR data generates information in machine-readable formats, their use for decision support still poses technical challenges for researchers due to the need to preprocess and convert data into a matrix format. During our research, we observed that clinical informatics literature does not provide guidance for researchers on how to build this matrix while avoiding potential pitfalls. OBJECTIVES This article aims to provide researchers a roadmap of the main technical challenges of preprocessing structured EHR data and possible strategies to overcome them. METHODS Along standard data processing stages - extracting database entries, defining features, processing data, assessing feature values and integrating data elements, within an EDPAI framework -, we identified the main challenges faced by researchers and reflect on how to address those challenges based on lessons learned from our research experience and on best practices from related literature. We highlight the main potential sources of error, present strategies to approach those challenges and discuss implications of these strategies. RESULTS Following the EDPAI framework, researchers face five key challenges: (1) gathering and integrating data, (2) identifying and handling different feature types, (3) combining features to handle redundancy and granularity, (4) addressing data missingness, and (5) handling multiple feature values. Strategies to address these challenges include: cross-checking identifiers for robust data retrieval and integration; applying clinical knowledge in identifying feature types, in addressing redundancy and granularity, and in accommodating multiple feature values; and investigating missing patterns adequately. CONCLUSIONS This article contributes to literature by providing a roadmap to inform structured EHR data preprocessing. It may advise researchers on potential pitfalls and implications of methodological decisions in handling structured data, so as to avoid biases and help realize the benefits of the secondary use of EHR data.
Collapse
Affiliation(s)
- José Carlos Ferrão
- José Carlos Ferrão, Rua Irmãos Siemens 1, Ed. 3 Piso 3, 2720-093 Amadora, Portugal, Email address: , Telephone: (+351) 214 178 668, Fax: (+351) 214 178 030
| | | | | | | |
Collapse
|
29
|
Random Survival Forests for Predicting the Bed Occupancy in the Intensive Care Unit. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2016; 2016:7087053. [PMID: 27818706 PMCID: PMC5081505 DOI: 10.1155/2016/7087053] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 09/08/2016] [Accepted: 09/22/2016] [Indexed: 11/21/2022]
Abstract
Predicting the bed occupancy of an intensive care unit (ICU) is a daunting task. The uncertainty associated with the prognosis of critically ill patients and the random arrival of new patients can lead to capacity problems and the need for reactive measures. In this paper, we work towards a predictive model based on Random Survival Forests which can assist physicians in estimating the bed occupancy. As input data, we make use of the Sequential Organ Failure Assessment (SOFA) score collected and calculated from 4098 patients at two ICU units of Ghent University Hospital over a time period of four years. We compare the performance of our system with a baseline performance and a standard Random Forest regression approach. Our results indicate that Random Survival Forests can effectively be used to assist in the occupancy prediction problem. Furthermore, we show that a group based approach, such as Random Survival Forests, performs better compared to a setting in which the length of stay of a patient is individually assessed.
Collapse
|
30
|
Determining Factors Influencing Length of Stay and Predicting Length of Stay Using Data Mining in the General Surgery Department. HOSPITAL PRACTICES AND RESEARCH 2016. [DOI: 10.20286/hpr-010251] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
31
|
LaFaro RJ, Pothula S, Kubal KP, Inchiosa ME, Pothula VM, Yuan SC, Maerz DA, Montes L, Oleszkiewicz SM, Yusupov A, Perline R, Inchiosa MA. Neural Network Prediction of ICU Length of Stay Following Cardiac Surgery Based on Pre-Incision Variables. PLoS One 2015; 10:e0145395. [PMID: 26710254 PMCID: PMC4692524 DOI: 10.1371/journal.pone.0145395] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 12/03/2015] [Indexed: 11/29/2022] Open
Abstract
Background Advanced predictive analytical techniques are being increasingly applied to clinical risk assessment. This study compared a neural network model to several other models in predicting the length of stay (LOS) in the cardiac surgical intensive care unit (ICU) based on pre-incision patient characteristics. Methods Thirty six variables collected from 185 cardiac surgical patients were analyzed for contribution to ICU LOS. The Automatic Linear Modeling (ALM) module of IBM-SPSS software identified 8 factors with statistically significant associations with ICU LOS; these factors were also analyzed with the Artificial Neural Network (ANN) module of the same software. The weighted contributions of each factor (“trained” data) were then applied to data for a “new” patient to predict ICU LOS for that individual. Results Factors identified in the ALM model were: use of an intra-aortic balloon pump; O2 delivery index; age; use of positive cardiac inotropic agents; hematocrit; serum creatinine ≥ 1.3 mg/deciliter; gender; arterial pCO2. The r2 value for ALM prediction of ICU LOS in the initial (training) model was 0.356, p <0.0001. Cross validation in prediction of a “new” patient yielded r2 = 0.200, p <0.0001. The same 8 factors analyzed with ANN yielded a training prediction r2 of 0.535 (p <0.0001) and a cross validation prediction r2 of 0.410, p <0.0001. Two additional predictive algorithms were studied, but they had lower prediction accuracies. Our validated neural network model identified the upper quartile of ICU LOS with an odds ratio of 9.8(p <0.0001). Conclusions ANN demonstrated a 2-fold greater accuracy than ALM in prediction of observed ICU LOS. This greater accuracy would be presumed to result from the capacity of ANN to capture nonlinear effects and higher order interactions. Predictive modeling may be of value in early anticipation of risks of post-operative morbidity and utilization of ICU facilities.
Collapse
Affiliation(s)
- Rocco J. LaFaro
- Department of Surgery, New York Medical College, Valhalla, New York, United States of America
| | - Suryanarayana Pothula
- Department of Anesthesiology, New York Medical College, Valhalla, New York, United States of America
| | - Keshar Paul Kubal
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
| | - Mario Emil Inchiosa
- Revolution Analytics, Inc., Mountain View, California, United States of America
| | - Venu M. Pothula
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
| | - Stanley C. Yuan
- Department of Anesthesiology, New York Medical College, Valhalla, New York, United States of America
| | - David A. Maerz
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
| | - Lucresia Montes
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
| | - Stephen M. Oleszkiewicz
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
| | - Albert Yusupov
- Department of Anesthesiology, New York Medical College, Valhalla, New York, United States of America
| | - Richard Perline
- The SAS Institute, Cary, North Carolina, United States of America
| | - Mario Anthony Inchiosa
- Department of Pharmacology, New York Medical College, Valhalla, New York, United States of America
- * E-mail:
| |
Collapse
|
32
|
Economou GPK, Sourla E, Stamatopoulou KM, Syrimpeis V, Sioutas S, Tsakalidis A, Tzimas G. Exploiting expert systems in cardiology: a comparative study. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 820:79-89. [PMID: 25417018 DOI: 10.1007/978-3-319-09012-2_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
An improved Adaptive Neuro-Fuzzy Inference System (ANFIS) in the field of critical cardiovascular diseases is presented. The system stems from an earlier application based only on a Sugeno-type Fuzzy Expert System (FES) with the addition of an Artificial Neural Network (ANN) computational structure. Thus, inherent characteristics of ANNs, along with the human-like knowledge representation of fuzzy systems are integrated. The ANFIS has been utilized into building five different sub-systems, distinctly covering Coronary Disease, Hypertension, Atrial Fibrillation, Heart Failure, and Diabetes, hence aiding doctors of medicine (MDs), guide trainees, and encourage medical experts in their diagnoses centering a wide range of Cardiology. The Fuzzy Rules have been trimmed down and the ANNs have been optimized in order to focus into each particular disease and produce results ready-to-be applied to real-world patients.
Collapse
|
33
|
Hachesu PR, Ahmadi M, Alizadeh S, Sadoughi F. Use of data mining techniques to determine and predict length of stay of cardiac patients. Healthc Inform Res 2013; 19:121-9. [PMID: 23882417 PMCID: PMC3717435 DOI: 10.4258/hir.2013.19.2.121] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 03/28/2013] [Accepted: 04/01/2013] [Indexed: 11/23/2022] Open
Abstract
Objectives Predicting the length of stay (LOS) of patients in a hospital is important in providing them with better services and higher satisfaction, as well as helping the hospital management plan and managing hospital resources as meticulously as possible. We propose applying data mining techniques to extract useful knowledge and draw an accurate model to predict the LOS of heart patients. Methods Data were collected from patients with coronary artery disease (CAD). The patient records of 4,948 patients who had suffered CAD were included in the analysis. The techniques used are classification with three algorithms, namely, decision tree, support vector machines (SVM), and artificial neural network (ANN). LOS is the target variable, and 36 input variables are used for prediction. A confusion matrix was obtained to calculate sensitivity, specificity, and accuracy. Results The overall accuracy of SVM was 96.4% in the training set. Most single patients (64.3%) had an LOS ≤5 days, whereas 41.2% of married patients had an LOS >10 days. Moreover, the study showed that comorbidity states, such as lung disorders and hemorrhage with drug consumption have an impact on long LOS. The presence of comorbidities, an ejection fraction <2, being a current smoker, and having social security type insurance in coronary artery patients led to longer LOS than other subjects. Conclusions All three algorithms are able to predict LOS with various degrees of accuracy. The findings demonstrated that the SVM was the best fit. There was a significant tendency for LOS to be longer in patients with lung or respiratory disorders and high blood pressure.
Collapse
Affiliation(s)
- Peyman Rezaei Hachesu
- Department of Health Information Management, School of Health Management and Information Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | | | | | | |
Collapse
|
34
|
Abstract
Artificial intelligence has significantly contributed in the evolution of medical informatics and biomedicine, providing a variety of tools available to be exploited, from rule-based expert systems and fuzzy logic to neural networks and genetic algorithms. Moreover, familiarizing people with smartphones and the constantly growing use of medical-related mobile applications enables complete and systematic monitoring of a series of chronic diseases both by health professionals and patients. In this work, we propose an integrated system for monitoring and early notification for patients suffering from heart diseases. CardioSmart365 consists of web applications, smartphone native applications, decision support systems, and web services that allow interaction and communication among end users: cardiologists, patients, and general doctors. The key features of the proposed solution are (a) recording and management of patients' measurements of vital signs performed at home on regular basis (blood pressure, blood glucose, oxygen saturation, weight, and height), (b) management of patients' EMRs, (c) cardiologic patient modules for the most common heart diseases, (d) decision support systems based on fuzzy logic, (e) integrated message management module for optimal communication between end users and instant notifications, and (f) interconnection to Microsoft HealthVault platform. CardioSmart365 contributes to the effort for optimal patient monitoring at home and early response in cases of emergency.
Collapse
|
35
|
Predicting the impact of hospital health information technology adoption on patient satisfaction. Artif Intell Med 2012; 56:123-35. [DOI: 10.1016/j.artmed.2012.08.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2011] [Revised: 08/02/2012] [Accepted: 08/19/2012] [Indexed: 11/20/2022]
|
36
|
Berchialla P, Foltran F, Bigi R, Gregori D. Integrating stress-related ventricular functional and angiographic data in preventive cardiology: a unified approach implementing a Bayesian network. J Eval Clin Pract 2012; 18:637-43. [PMID: 21449973 DOI: 10.1111/j.1365-2753.2011.01651.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
BACKGROUND Identification of key factors associated with the risk of adverse cardiovascular events and quantification of this risk using multivariable prediction algorithms are among the major advances made in preventive cardiology and cardiovascular epidemiology. METHODS In the present paper, we examined clinical predictors of adverse cardiovascular events among 228 individuals with symptoms suggestive of coronary artery disease (CAD) undergoing functional (stress echocardiography) and anatomical (coronary angiography) assessment of CAD. Particularly, we evaluate the possibility to integrate simple measures that have known prognostic value and more recently discovered predictors of risk, such as stress-related ventricular function data and angiographic data, in a unique model implementing a Bayesian network (BN). Moreover, we compared the performance of BN and the covariates hierarchy with those obtained from logistic regression model and from a set of alternative tools becoming popular in various clinical settings, including random forest classification tree analysis, artificial neural networks and support vector machine. RESULTS Network graph and results coming from sensitivity analysis, where variables are ranked according to the gain they provided in variance reduction, seem have an easily intuitive lecture: variables that are measure of ventricular disfunction or of the extent of CAD show a greater impact in predicting event. On the other hand, anamnestic data such as diabetes, dyslipidaemia, hypertension, smoke habits, which are related to the outcome throughout a process of intermediate variables, per se have a small role in outcome prediction. BNs are able to explain a relevant part of variance (70%) and have discrimination ability superior or comparable with those to random forest classification tree analysis, artificial neural networks and support vector machine. DISCUSSION Despite the complexity of interactions, model obtained implementing a BN seems to be able to adequately describe the relationships existing among the analysed variables. BN, being able to predict scenarios in which new variables can be incorporated as health process evolves, can measure individual's risks for adverse cardiovascular events, providing a permanent second opinion to the medical practitioner and assisting diagnostic and therapeutic process.
Collapse
Affiliation(s)
- Paola Berchialla
- Department of Public Health and Microbiology, University of Torino, Torino, Italy
| | | | | | | |
Collapse
|
37
|
Hewett JN, Rodgers GW, Chase JG, Le Compte AJ, Pretty CG, Shaw GM. Assessment of SOFA Score as a Diagnostic Indicator in Intensive Care Medicine. ACTA ACUST UNITED AC 2012. [DOI: 10.3182/20120829-3-hu-2029.00035] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
|
38
|
|
39
|
Hsieh NC, Hung LP, Shih CC, Keh HC, Chan CH. Intelligent Postoperative Morbidity Prediction of Heart Disease Using Artificial Intelligence Techniques. J Med Syst 2010; 36:1809-20. [DOI: 10.1007/s10916-010-9640-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2010] [Accepted: 12/08/2010] [Indexed: 10/18/2022]
|
40
|
Foltran F, Berchialla P, Giunta F, Malacarne P, Merletti F, Gregori D. Using VLAD scores to have a look insight ICU performance: towards a modelling of the errors. J Eval Clin Pract 2010; 16:968-75. [PMID: 20722890 DOI: 10.1111/j.1365-2753.2009.01240.x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
RATIONALE, AIMS AND OBJECTIVES Mortality prediction models using logistic regression analysis play a pivotal role in intensive care quality evaluation, allowing a hospital's performance to be compared with a standard. However, when a difference between predicted and observed mortality exists, that is, the numerator of the Variable Life Adjusted Display (VLAD) score, the investigation for a possible explanation could be arduous. In this article we tested the ability of Bayesian Network (BN) to identify factors determining the negative discrepancy between expected and actual outcomes recorded in four Italian intensive care units (ICUs). METHODS A BN was implemented to predict the extent of the expected-observed distance quantified by the VLAD score. BN performance was compared with those of a set of tools including Linear Model, Random Forest Regression Tree analysis, Artificial Neural Networks and Support Vector Machine. RESULTS BN allows the identification of critical areas responsible for bad performance. Compared with other techniques, BN always explains a higher variance percentage and it shows similar or superior discrimination ability. CONCLUSIONS BN, being able to guide interpretation of covariates role by means of a graphic representation of relationships, confirms its utility particularly where many interactions between predictors exist and when a coherent set of theories regarding which variables are related and how is not available.
Collapse
|
41
|
|