1
|
Fan S, Abulizi A, You Y, Huang C, Yimit Y, Li Q, Zou X, Nijiati M. Predicting hospitalization costs for pulmonary tuberculosis patients based on machine learning. BMC Infect Dis 2024; 24:875. [PMID: 39198742 PMCID: PMC11360310 DOI: 10.1186/s12879-024-09771-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 08/20/2024] [Indexed: 09/01/2024] Open
Abstract
BACKGROUND Pulmonary tuberculosis (PTB) is a prevalent chronic disease associated with a significant economic burden on patients. Using machine learning to predict hospitalization costs can allocate medical resources effectively and optimize the cost structure rationally, so as to control the hospitalization costs of patients better. METHODS This research analyzed data (2020-2022) from a Kashgar pulmonary hospital's information system, involving 9570 eligible PTB patients. SPSS 26.0 was used for multiple regression analysis, while Python 3.7 was used for random forest regression (RFR) and MLP. The training set included data from 2020 and 2021, while the test set included data from 2022. The models predicted seven various costs related to PTB patients, including diagnostic cost, medical service cost, material cost, treatment cost, drug cost, other cost, and total hospitalization cost. The model's predictive performance was evaluated using R-square (R2), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE) metrics. RESULTS Among the 9570 PTB patients included in the study, the median and quartile of total hospitalization cost were 13,150.45 (9891.34, 19,648.48) yuan. Nine factors, including age, marital status, admission condition, length of hospital stay, initial treatment, presence of other diseases, transfer, drug resistance, and admission department, significantly influenced hospitalization costs for PTB patients. Overall, MLP demonstrated superior performance in most cost predictions, outperforming RFR and multiple regression; The performance of RFR is between MLP and multiple regression; The predictive performance of multiple regression is the lowest, but it shows the best results for Other costs. CONCLUSION The MLP can effectively leverage patient information and accurately predict various hospitalization costs, achieving a rationalized structure of hospitalization costs by adjusting higher-cost inpatient items and balancing different cost categories. The insights of this predictive model also hold relevance for research in other medical conditions.
Collapse
Affiliation(s)
- Shiyu Fan
- Department of Preventive Healthcare, Shihezi University, Shihezi, 832000, China
| | - Abudoukeyoumujiang Abulizi
- Department of Radiology, The First People's Hospital of Kashi (Kashgar) Prefecture, Kashgar, 844000, China
- Xinjiang Key Laboratory of Artificial Intelligence Assisted Imaging Diagnosis, Kashgar, 844000, China
| | - Yi You
- Department of Research Collaboration, Hangzhou Deepwise & League of PHD Technology Co., Ltd, R&D Center, Hangzhou, 311101, China
| | - Chencui Huang
- Department of Research Collaboration, Hangzhou Deepwise & League of PHD Technology Co., Ltd, R&D Center, Hangzhou, 311101, China
| | - Yasen Yimit
- Department of Radiology, The First People's Hospital of Kashi (Kashgar) Prefecture, Kashgar, 844000, China
- Xinjiang Key Laboratory of Artificial Intelligence Assisted Imaging Diagnosis, Kashgar, 844000, China
| | - Qiange Li
- Department of Preventive Healthcare, Shihezi University, Shihezi, 832000, China
| | - Xiaoguang Zou
- Xinjiang Key Laboratory of Artificial Intelligence Assisted Imaging Diagnosis, Kashgar, 844000, China.
- Xinnjiang Health Commission, Urumqi, 830000, China.
| | - Mayidili Nijiati
- Xinjiang Key Laboratory of Artificial Intelligence Assisted Imaging Diagnosis, Kashgar, 844000, China.
- The Fourth Affiliated Hospital of Xinjiang Medical University, Urumqi, 830000, China.
| |
Collapse
|
2
|
Morid MA, Sheng ORL, Dunbar J. Time Series Prediction Using Deep Learning Methods in Healthcare. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2022. [DOI: 10.1145/3531326] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Traditional Machine Learning (ML) methods face unique challenges when applied to healthcare predictive analytics. The high-dimensional nature of healthcare data necessitates labor-intensive and time-consuming processes when selecting an appropriate set of features for each new task. Furthermore, ML methods depend heavily on feature engineering to capture the sequential nature of patient data, oftentimes failing to adequately leverage the temporal patterns of medical events and their dependencies. In contrast, recent Deep Learning (DL) methods have shown promising performance for various healthcare prediction tasks by specifically addressing the high-dimensional and temporal challenges of medical data. DL techniques excel at learning useful representations of medical concepts and patient clinical data as well as their nonlinear interactions from high-dimensional raw or minimally-processed healthcare data.
In this paper we systematically reviewed research works that focused on advancing deep neural networks to leverage patient structured time series data for healthcare prediction tasks. To identify relevant studies, we searched MEDLINE, IEEE, Scopus, and ACM digital library for relevant publications through November 4
th
, 2021. Overall, we found that researchers have contributed to deep time series prediction literature in ten identifiable research streams: DL models, missing value handling, addressing temporal irregularity, patient representation, static data inclusion, attention mechanisms, interpretation, incorporation of medical ontologies, learning strategies, and scalability. This study summarizes research insights from these literature streams, identifies several critical research gaps, and suggests future research opportunities for DL applications using patient time series data.
Collapse
Affiliation(s)
- Mohammad Amin Morid
- Department of Information Systems and Analytics, Leavey School of Business, Santa Clara University, Santa Clara, CA, USA
| | - Olivia R. Liu Sheng
- Department of Operations and Information Systems, David Eccles School of Business, University of Utah, Salt Lake City, UT, USA
| | - Joseph Dunbar
- Department of Operations and Information Systems, David Eccles School of Business, University of Utah, Salt Lake City, UT, USA
| |
Collapse
|
3
|
Huang Y, Zheng Z, Ma M, Xin X, Liu H, Fei X, Wei L, Chen H. Improving Performance of Outcome Prediction for In-patients with Acute Myocardial Infarction Based on Embedding Representation Learned from Electronic Medical Records: Development and Validation Study (Preprint). J Med Internet Res 2022; 24:e37486. [PMID: 35921141 PMCID: PMC9386580 DOI: 10.2196/37486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 06/02/2022] [Accepted: 07/18/2022] [Indexed: 11/18/2022] Open
Abstract
Background The widespread secondary use of electronic medical records (EMRs) promotes health care quality improvement. Representation learning that can automatically extract hidden information from EMR data has gained increasing attention. Objective We aimed to propose a patient representation with more feature associations and task-specific feature importance to improve the outcome prediction performance for inpatients with acute myocardial infarction (AMI). Methods Medical concepts, including patients’ age, gender, disease diagnoses, laboratory tests, structured radiological features, procedures, and medications, were first embedded into real-value vectors using the improved skip-gram algorithm, where concepts in the context windows were selected by feature association strengths measured by association rule confidence. Then, each patient was represented as the sum of the feature embeddings weighted by the task-specific feature importance, which was applied to facilitate predictive model prediction from global and local perspectives. We finally applied the proposed patient representation into mortality risk prediction for 3010 and 1671 AMI inpatients from a public data set and a private data set, respectively, and compared it with several reference representation methods in terms of the area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), and F1-score. Results Compared with the reference methods, the proposed embedding-based representation showed consistently superior predictive performance on the 2 data sets, achieving mean AUROCs of 0.878 and 0.973, AUPRCs of 0.220 and 0.505, and F1-scores of 0.376 and 0.674 for the public and private data sets, respectively, while the greatest AUROCs, AUPRCs, and F1-scores among the reference methods were 0.847 and 0.939, 0.196 and 0.283, and 0.344 and 0.361 for the public and private data sets, respectively. Feature importance integrated in patient representation reflected features that were also critical in prediction tasks and clinical practice. Conclusions The introduction of feature associations and feature importance facilitated an effective patient representation and contributed to prediction performance improvement and model interpretation.
Collapse
Affiliation(s)
- Yanqun Huang
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Zhimin Zheng
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Moxuan Ma
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Xin Xin
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Honglei Liu
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| | - Xiaolu Fei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Lan Wei
- Information Center, Xuanwu Hospital, Capital Medical University, Beijing, China
| | - Hui Chen
- School of Biomedical Engineering, Capital Medical University, Beijing, China
- Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China
| |
Collapse
|
4
|
Kuo R, Zulvia FE. The application of gradient evolution algorithm to an intuitionistic fuzzy neural network for forecasting medical cost of acute hepatitis treatment in Taiwan. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107711] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
5
|
Kumar PM, Hong CS, Afghah F, Manogaran G, Yu K, Hua Q, Gao J. Clouds Proportionate Medical Data Stream Analytics for Internet of Things-based Healthcare Systems. IEEE J Biomed Health Inform 2021; 26:973-982. [PMID: 34415841 DOI: 10.1109/jbhi.2021.3106387] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Internet of Things (IoT) assisted healthcare systems are designed for providing ubiquitous access and recommendations for personal and distributed electronic health services. The heterogeneous IoT platform assists healthcare services with reliable data management through dedicated computing devices. Healthcare services' reliability depends upon the efficient handling of heterogeneous data streams due to variations and errors. A Proportionate Data Analytics (PDA) for heterogeneous healthcare data stream processing is introduced in this manuscript. This analytics method differentiates the data streams based on variations and errors for satisfying the service responses. The classification is streamlined using linear regression for segregating errors from the variations in different time intervals. The time intervals are differentiated recurrently after detecting errors in the stream's variation. This process of differentiation and classification retains a high response ratio for healthcare services through spontaneous regressions. The proposed method's performance is analyzed using the metrics accuracy, identification ratio, delivery, variation factor, and processing time.
Collapse
|
6
|
Forecasting Teleconsultation Demand Using an Ensemble CNN Attention-Based BILSTM Model with Additional Variables. Healthcare (Basel) 2021; 9:healthcare9080992. [PMID: 34442130 PMCID: PMC8391747 DOI: 10.3390/healthcare9080992] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 07/21/2021] [Accepted: 08/02/2021] [Indexed: 11/16/2022] Open
Abstract
To enhance the forecasting accuracy of daily teleconsultation demand, this study proposes an ensemble hybrid deep learning model. The proposed ensemble CNN attention-based BILSTM model (ECA-BILSTM) combines shallow convolutional neural networks (CNNs), attention mechanisms, and bidirectional long short-term memory (BILSTM). Moreover, additional variables are selected according to the characteristics of teleconsultation demand and added to the inputs of forecasting models. To verify the superiority of ECA-BILSTM and the effectiveness of additional variables, two actual teleconsultation datasets collected in the National Telemedicine Center of China (NTCC) are used as the experimental data. Results showed that ECA-BILSTMs can significantly outperform corresponding benchmark models. And two key additional variables were identified for teleconsultation demand prediction improvement. Overall, the proposed ECA-BILSTM model with effective additional variables is a feasible promising approach in teleconsultation demand forecasting.
Collapse
|
7
|
Liu J, Capurro D, Nguyen A, Verspoor K. Early prediction of diagnostic-related groups and estimation of hospital cost by processing clinical notes. NPJ Digit Med 2021; 4:103. [PMID: 34211109 PMCID: PMC8249417 DOI: 10.1038/s41746-021-00474-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 06/08/2021] [Indexed: 11/09/2022] Open
Abstract
As healthcare providers receive fixed amounts of reimbursement for given services under DRG (Diagnosis-Related Groups) payment, DRG codes are valuable for cost monitoring and resource allocation. However, coding is typically performed retrospectively post-discharge. We seek to predict DRGs and DRG-based case mix index (CMI) at early inpatient admission using routine clinical text to estimate hospital cost in an acute setting. We examined a deep learning-based natural language processing (NLP) model to automatically predict per-episode DRGs and corresponding cost-reflecting weights on two cohorts (paid under Medicare Severity (MS) DRG or All Patient Refined (APR) DRG), without human coding efforts. It achieved macro-averaged area under the receiver operating characteristic curve (AUC) scores of 0·871 (SD 0·011) on MS-DRG and 0·884 (0·003) on APR-DRG in fivefold cross-validation experiments on the first day of ICU admission. When extended to simulated patient populations to estimate average cost-reflecting weights, the model increased its accuracy over time and obtained absolute CMI error of 2·40 (1·07%) and 12·79% (2·31%), respectively on the first day. As the model could adapt to variations in admission time, cohort size, and requires no extra manual coding efforts, it shows potential to help estimating costs for active patients to support better operational decision-making in hospitals.
Collapse
Affiliation(s)
- Jinghui Liu
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
- Australian e-Health Research Centre, CSIRO, Brisbane, QLD, Australia
| | - Daniel Capurro
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia
- Centre for Digital Transformation of Health, Melbourne Medical School, The University of Melbourne, Melbourne, VIC, Australia
| | - Anthony Nguyen
- Australian e-Health Research Centre, CSIRO, Brisbane, QLD, Australia
| | - Karin Verspoor
- School of Computing and Information Systems, The University of Melbourne, Melbourne, VIC, Australia.
- Centre for Digital Transformation of Health, Melbourne Medical School, The University of Melbourne, Melbourne, VIC, Australia.
- School of Computing Technologies, RMIT University, Melbourne, VIC, Australia.
| |
Collapse
|
8
|
Nomura Y, Ishii Y, Chiba Y, Suzuki S, Suzuki A, Suzuki S, Morita K, Tanabe J, Yamakawa K, Ishiwata Y, Ishikawa M, Sogabe K, Kakuta E, Okada A, Otsuka R, Hanada N. Does Last Year's Cost Predict the Present Cost? An Application of Machine Leaning for the Japanese Area-Basis Public Health Insurance Database. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph18020565. [PMID: 33445431 PMCID: PMC7827468 DOI: 10.3390/ijerph18020565] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Revised: 12/15/2020] [Accepted: 01/07/2021] [Indexed: 12/21/2022]
Abstract
The increasing healthcare cost imposes a large economic burden for the Japanese government. Predicting the healthcare cost may be a useful tool for policy making. A database of the area-basis public health insurance of one city was analyzed to predict the medical healthcare cost by the dental healthcare cost with a machine learning strategy. The 30,340 subjects who had continued registration of the area-basis public health insurance of Ebina city during April 2017 to September 2018 were analyzed. The sum of the healthcare cost was JPY 13,548,831,930. The per capita healthcare cost was JPY 446,567. The proportion of medical healthcare cost, medication cost, and dental healthcare cost was 78%, 15%, and 7%, respectively. By the results of the neural network model, the medical healthcare cost proportionally depended on the medical healthcare cost of the previous year. The dental healthcare cost of the previous year had a reducing effect on the medical healthcare cost. However, the effect was very small. Oral health may be a risk for chronic diseases. However, when evaluated by the healthcare cost, its effect was very small during the observation period.
Collapse
Affiliation(s)
- Yoshiaki Nomura
- Department of Translational Research, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan; (M.I.); (K.S.); (R.O.); (N.H.)
- Correspondence:
| | - Yoshimasa Ishii
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Yota Chiba
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Shunsuke Suzuki
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Akira Suzuki
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Senichi Suzuki
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Kenji Morita
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Joji Tanabe
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Koji Yamakawa
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Yasuo Ishiwata
- Ebina Dental Association, Kanagawa 243-0421, Japan; (Y.I.); (Y.C.); (S.S.); (A.S.); (S.S.); (K.M.); (J.T.); (K.Y.); (Y.I.)
| | - Meu Ishikawa
- Department of Translational Research, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan; (M.I.); (K.S.); (R.O.); (N.H.)
| | - Kaoru Sogabe
- Department of Translational Research, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan; (M.I.); (K.S.); (R.O.); (N.H.)
| | - Erika Kakuta
- Department of Oral Microbiology, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan;
| | - Ayako Okada
- Department of Operative Dentistry, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan;
| | - Ryoko Otsuka
- Department of Translational Research, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan; (M.I.); (K.S.); (R.O.); (N.H.)
| | - Nobuhiro Hanada
- Department of Translational Research, Tsurumi University School of Dental Medicine, Yokohama 230-8501, Japan; (M.I.); (K.S.); (R.O.); (N.H.)
| |
Collapse
|
9
|
Morid MA, Borjali A, Del Fiol G. A scoping review of transfer learning research on medical image analysis using ImageNet. Comput Biol Med 2020; 128:104115. [PMID: 33227578 DOI: 10.1016/j.compbiomed.2020.104115] [Citation(s) in RCA: 135] [Impact Index Per Article: 33.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Revised: 10/19/2020] [Accepted: 11/09/2020] [Indexed: 02/06/2023]
Abstract
OBJECTIVE Employing transfer learning (TL) with convolutional neural networks (CNNs), well-trained on non-medical ImageNet dataset, has shown promising results for medical image analysis in recent years. We aimed to conduct a scoping review to identify these studies and summarize their characteristics in terms of the problem description, input, methodology, and outcome. MATERIALS AND METHODS To identify relevant studies, MEDLINE, IEEE, and ACM digital library were searched for studies published between June 1st, 2012 and January 2nd, 2020. Two investigators independently reviewed articles to determine eligibility and to extract data according to a study protocol defined a priori. RESULTS After screening of 8421 articles, 102 met the inclusion criteria. Of 22 anatomical areas, eye (18%), breast (14%), and brain (12%) were the most commonly studied. Data augmentation was performed in 72% of fine-tuning TL studies versus 15% of the feature-extracting TL studies. Inception models were the most commonly used in breast related studies (50%), while VGGNet was the common in eye (44%), skin (50%) and tooth (57%) studies. AlexNet for brain (42%) and DenseNet for lung studies (38%) were the most frequently used models. Inception models were the most frequently used for studies that analyzed ultrasound (55%), endoscopy (57%), and skeletal system X-rays (57%). VGGNet was the most common for fundus (42%) and optical coherence tomography images (50%). AlexNet was the most frequent model for brain MRIs (36%) and breast X-Rays (50%). 35% of the studies compared their model with other well-trained CNN models and 33% of them provided visualization for interpretation. DISCUSSION This study identified the most prevalent tracks of implementation in the literature for data preparation, methodology selection and output evaluation for various medical image analysis tasks. Also, we identified several critical research gaps existing in the TL studies on medical image analysis. The findings of this scoping review can be used in future TL studies to guide the selection of appropriate research approaches, as well as identify research gaps and opportunities for innovation.
Collapse
Affiliation(s)
- Mohammad Amin Morid
- Department of Information Systems and Analytics, Leavey School of Business, Santa Clara University, Santa Clara, CA, USA.
| | - Alireza Borjali
- Department of Orthopaedic Surgery, Harvard Medical School, Boston, MA, USA; Department of Orthopaedic Surgery, Harris Orthopaedics Laboratory, Massachusetts General Hospital, Boston, MA, USA
| | - Guilherme Del Fiol
- Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA
| |
Collapse
|