Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Kim YJ, Park H. Improving Prediction of High-Cost Health Care Users with Medical Check-Up Data. Big Data 2019;7:163-175. [PMID: 31246499 DOI: 10.1089/big.2018.0096] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]

Number

Cited by Other Article(s)

Krefting J, Sen P, David-Rus D, Güldener U, Hawe JS, Cassese S, von Scheidt M, Schunkert H. Use of big data from health insurance for assessment of cardiovascular outcomes. Front Artif Intell 2023;6:1155404. [PMID: 37207237 PMCID: PMC10188985 DOI: 10.3389/frai.2023.1155404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Accepted: 04/13/2023] [Indexed: 05/21/2023] Open

Choi Y, An J, Ryu S, Kim J. Development and Evaluation of Machine Learning-Based High-Cost Prediction Model Using Health Check-Up Data by the National Health Insurance Service of Korea. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022;19:13672. [PMID: 36294248 PMCID: PMC9603723 DOI: 10.3390/ijerph192013672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/10/2022] [Accepted: 10/14/2022] [Indexed: 06/16/2023]

Abstract

In this study, socioeconomic, medical treatment, and health check-up data from 2010 to 2017 of the National Health Insurance Service (NHIS) of Korea were analyzed. This year's socioeconomic, treatment, and health check-up data are used to develop a predictive model for high medical expenses in the next year. The characteristic of this study is to derive important variables related to the high cost of domestic medical expenses users by using data on health check-up items conducted by the country. In this study, we tried to classify data and evaluate its performance using classification supervised learning algorithms for high-cost medical expense prediction. Supervised learning for predicting high-cost medical expenses was performed using the logistic regression model, random forest, and XGBoost, which have been known to result the best performance and explanatory power among the machine learning algorithms used in previous studies. Our experimental results show that the XGBoost model had the best performance with 77.1% accuracy. The contribution of this study is to identify the variables that affect the prediction of high-cost medical expenses by analyzing the medical bills using the health check-up variables and the Korea Classification Disease (KCD) large group as input variables. Through this study, it was confirmed that musculoskeletal disorders (M) and respiratory diseases (J), which are the most frequently treated diseases, as important KCD disease groups for high-cost prediction in Korea, affect the future high cost prediction. In addition, it was confirmed that malignant neoplasia diseases (C) with high medical cost per treatment are a group of diseases related to high future medical cost prediction. Unlike previous studies, it is the result of analyzing all disease data, so it is expected that the study will be more meaningful when compared with the results of other national health check-up data.

Collapse

He X, Li D, Wang W, Liang H, Liang Y. Identifying patterns of clinical conditions among high-cost older adult health care users using claims data: a latent class approach. Int J Equity Health 2022;21:86. [PMID: 35725607 PMCID: PMC9210624 DOI: 10.1186/s12939-022-01688-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Accepted: 06/14/2022] [Indexed: 11/24/2022] Open

Vimont A, Leleu H, Durand-Zaleski I. Machine learning versus regression modelling in predicting individual healthcare costs from a representative sample of the nationwide claims database in France. THE EUROPEAN JOURNAL OF HEALTH ECONOMICS : HEPAC : HEALTH ECONOMICS IN PREVENTION AND CARE 2022;23:211-223. [PMID: 34373958 DOI: 10.1007/s10198-021-01363-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Accepted: 07/29/2021] [Indexed: 06/13/2023]

Abstract

BACKGROUND

Innovative provider payment methods that avoid adverse selection and reward performance require accurate prediction of healthcare costs based on individual risk adjustment. Our objective was to compare the performances of a simple neural network (NN) and random forest (RF) to a generalized linear model (GLM) for the prediction of medical cost at the individual level.

METHODS

A 1/97 representative sample of the French National Health Data Information System was used. Predictors selected were: demographic information; pre-existing conditions, Charlson comorbidity index; healthcare service use and costs. Predictive performances of each model were compared through individual-level (adjusted R-squared (adj-R²), mean absolute error (MAE) and hit ratio (HiR)), and distribution-level metrics on different sets of covariates in the general population and by pre-existing morbid condition, using a quasi-Monte Carlo design.

RESULTS

We included 510,182 subjects alive on 31st December, 2015. Mean annual costs were 1894€ (standard deviation 9326€) (median 393€, IQ range 95€; 1480€), including zero-claim subjects. All models performed similarly after adjustment on demographics. RF model had better performances on other sets of covariates (pre-existing conditions, resource counts and past year costs). On full model, RF reached an adj-R² of 47.5%, a MAE of 1338€ and a HiR of 67%, while GLM and NN had an adj-R² of 34.7% and 31.6%, a MAE of 1635€ and 1660€, and a HiR of 58% and 55 M, respectively. RF model outperformed GLM and NN for most conditions and for high-cost subjects.

CONCLUSIONS

RF should be preferred when the objective is to best predict medical costs. When the objective is to understand the contribution of predictors, GLM was well suited with demographics, conditions and base year cost.

Collapse

Huang H, Shih PC, Zhu Y, Gao W. An integrated model for medical expense system optimization during diagnosis process based on artificial intelligence algorithm. JOURNAL OF COMBINATORIAL OPTIMIZATION 2022;44:2515-2532. [PMID: 34220290 PMCID: PMC8235905 DOI: 10.1007/s10878-021-00761-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 05/18/2021] [Indexed: 05/11/2023]

Dziegielewski C, Talarico R, Imsirovic H, Qureshi D, Choudhri Y, Tanuseputro P, Thompson LH, Kyeremanteng K. Characteristics and resource utilization of high-cost users in the intensive care unit: a population-based cohort study. BMC Health Serv Res 2021;21:1312. [PMID: 34872546 PMCID: PMC8647444 DOI: 10.1186/s12913-021-07318-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 11/01/2021] [Indexed: 11/10/2022] Open

Abstract

Background

Healthcare expenditure within the intensive care unit (ICU) is costly. A cost reduction strategy may be to target patients accounting for a disproportionate amount of healthcare spending, or high-cost users. This study aims to describe high-cost users in the ICU, including health outcomes and cost patterns.

Methods

We conducted a population-based retrospective cohort study of patients with ICU admissions in Ontario from 2011 to 2018. Patients with total healthcare costs in the year following ICU admission (including the admission itself) in the upper 10th percentile were defined as high-cost users. We compared characteristics and outcomes including length of stay, mortality, disposition, and costs between groups.

Results

Among 370,061 patients included, 37,006 were high-cost users. High-cost users were 64.2 years old, 58.3% male, and had more comorbidities (41.2% had ≥3) when likened to non-high cost users (66.1 years old, 57.2% male, 27.9% had ≥3 comorbidities). ICU length of stay was four times greater for high-cost users compared to non-high cost users (22.4 days, 95% confidence interval [CI] 22.0–22.7 days vs. 5.56 days, 95% CI 5.54–5.57 days). High-cost users had lower in-hospital mortality (10.0% vs.14.2%), but increased dispositioning outside of home (77.4% vs. 42.2%) compared to non-high-cost users. Total healthcare costs were five-fold higher for high-cost users ($238,231, 95% CI $237,020–$239,442) compared to non-high-cost users ($45,155, 95% CI $45,046–$45,264). High-cost users accounted for 37.0% of total healthcare costs.

Conclusion

High-cost users have increased length of stay, lower in-hospital mortality, and higher total healthcare costs when compared to non-high-cost users. Further studies into cost patterns and predictors of high-cost users are necessary to identify methods of decreasing healthcare expenditure.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12913-021-07318-y.

Collapse

Ramachandran R, McShea MJ, Howson SN, Burkom HS, Chang HY, Weiner JP, Kharrazi H. Assessing the Value of Unsupervised Clustering in Predicting Persistent High Health Care Utilizers: Retrospective Analysis of Insurance Claims Data. JMIR Med Inform 2021;9:e31442. [PMID: 34592712 PMCID: PMC8663459 DOI: 10.2196/31442] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 07/26/2021] [Accepted: 09/30/2021] [Indexed: 01/10/2023] Open

Abstract

BACKGROUND

A high proportion of health care services are persistently utilized by a small subpopulation of patients. To improve clinical outcomes while reducing costs and utilization, population health management programs often provide targeted interventions to patients who may become persistent high users/utilizers (PHUs). Enhanced prediction and management of PHUs can improve health care system efficiencies and improve the overall quality of patient care.

OBJECTIVE

The aim of this study was to detect key classes of diseases and medications among the study population and to assess the predictive value of these classes in identifying PHUs.

METHODS

This study was a retrospective analysis of insurance claims data of patients from the Johns Hopkins Health Care system. We defined a PHU as a patient incurring health care costs in the top 20% of all patients' costs for 4 consecutive 6-month periods. We used 2013 claims data to predict PHU status in 2014-2015. We applied latent class analysis (LCA), an unsupervised clustering approach, to identify patient subgroups with similar diagnostic and medication patterns to differentiate variations in health care utilization across PHUs. Logistic regression models were then built to predict PHUs in the full population and in select subpopulations. Predictors included LCA membership probabilities, demographic covariates, and health utilization covariates. Predictive powers of the regression models were assessed and compared using standard metrics.

RESULTS

We identified 164,221 patients with continuous enrollment between 2013 and 2015. The mean study population age was 19.7 years, 55.9% were women, 3.3% had ≥1 hospitalization, and 19.1% had 10+ outpatient visits in 2013. A total of 8359 (5.09%) patients were identified as PHUs in both 2014 and 2015. The LCA performed optimally when assigning patients to four probability disease/medication classes. Given the feedback provided by clinical experts, we further divided the population into four diagnostic groups for sensitivity analysis: acute upper respiratory infection (URI) (n=53,232; 4.6% PHUs), mental health (n=34,456; 12.8% PHUs), otitis media (n=24,992; 4.5% PHUs), and musculoskeletal (n=24,799; 15.5% PHUs). For the regression models predicting PHUs in the full population, the F1-score classification metric was lower using a parsimonious model that included LCA categories (F1=38.62%) compared to that of a complex risk stratification model with a full set of predictors (F1=48.20%). However, the LCA-enabled simple models were comparable to the complex model when predicting PHUs in the mental health and musculoskeletal subpopulations (F1-scores of 48.69% and 48.15%, respectively). F1-scores were lower than that of the complex model when the LCA-enabled models were limited to the otitis media and acute URI subpopulations (45.77% and 43.05%, respectively).

CONCLUSIONS

Our study illustrates the value of LCA in identifying subgroups of patients with similar patterns of diagnoses and medications. Our results show that LCA-derived classes can simplify predictive models of PHUs without compromising predictive accuracy. Future studies should investigate the value of LCA-derived classes for predicting PHUs in other health care settings.

Collapse

Kuo R, Zulvia FE. The application of gradient evolution algorithm to an intuitionistic fuzzy neural network for forecasting medical cost of acute hepatitis treatment in Taiwan. Appl Soft Comput 2021. [DOI: 10.1016/j.asoc.2021.107711] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Tong LL, Gu JB, Li JJ, Liu GX, Jin SW, Yan AY. Application of Bayesian network and regression method in treatment cost prediction. BMC Med Inform Decis Mak 2021;21:284. [PMID: 34656109 PMCID: PMC8520647 DOI: 10.1186/s12911-021-01647-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 10/04/2021] [Indexed: 11/24/2022] Open

Predicting Future Service Use in Dutch Mental Healthcare: A Machine Learning Approach. ADMINISTRATION AND POLICY IN MENTAL HEALTH AND MENTAL HEALTH SERVICES RESEARCH 2021;49:116-124. [PMID: 34463857 PMCID: PMC8732820 DOI: 10.1007/s10488-021-01150-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/26/2021] [Indexed: 11/30/2022]

Zeng J, Lawrence WR, Yang J, Tian J, Li C, Lian W, He J, Qu H, Wang X, Liu H, Li G, Li G. Association between serum uric acid and obesity in Chinese adults: a 9-year longitudinal data analysis. BMJ Open 2021;11:e041919. [PMID: 33550245 PMCID: PMC7908911 DOI: 10.1136/bmjopen-2020-041919] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Abstract

OBJECTIVES

Hyperuricaemia has been reported to be significantly associated with risk of obesity. However, previous studies on the association between serum uric acid (SUA) and body mass index (BMI) yielded conflicting results. The present study examined the relationship between SUA and obesity among Chinese adults.

METHODS

Data were collected at Guangdong Second Provincial General Hospital in Guangzhou City, China, between January 2010 and December 2018. Participants with ≥2 medical check-up times were included in our analyses. Physical examinations and laboratory measurement variables were obtained from the medical check-up system. The high SUA level group was classified as participants with hyperuricaemia, and obesity was defined as BMI ≥28 kg/m². Logistic regression model was performed for data at baseline. For all participants, generalised estimation equation (GEE) model was used to assess the association between SUA and obesity, where the data were repeatedly measured over the 9-year study period. Subgroup analyses were performed by gender and age group. We calculated the cut-off values for SUA of obesity using the receiver operating characteristic curves (ROC) technique.

RESULTS

A total of 15 959 participants (10 023 men and 5936 women) were included in this study, with an average age of 37.38 years (SD: 13.27) and average SUA of 367.05 μmol/L (SD: 97.97) at baseline, respectively. Finally, 1078 participants developed obesity over the 9-year period. The prevalence of obesity was approximately 14.2% for high SUA level. In logistic regression analysis at baseline, we observed a positive association between SUA and risk of obesity: OR=1.84 (95% CI: 1.77 to 1.90) for per-SD increase in SUA. Considering repeated measures over 9 year for all participants in the GEE model, the per-SD OR was 1.85 (95% CI: 1.77 to 1.91) for SUA and the increased risk of obesity were greater for men (OR=1.45) and elderly participants (OR=1.01). In subgroup analyses by gender and age, we observed significant associations between SUA and obesity with higher risk in women (OR=2.35) and young participants (OR=1.87) when compared with men (OR=1.70) and elderly participants (OR=1.48). The SUA cut-off points for risk of obesity using ROC curves were approximately consistent with the international standard.

CONCLUSIONS

Our study observed higher SUA level was associated with increased risk of obesity. More high-quality research is needed to further support these findings.

Collapse

Affiliation(s)

Jie Zeng Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China Institute of Ultrasound in Musculoskeletal Sports Medicine, Guangdong Second Provincial General Hospital, Guangzhou, China
Wayne R Lawrence Department of Epidemiology and Biostatistics, University at Albany State University of New York, Albany, New York, USA
Jun Yang Institute for Environmental and Climate Research, Jinan University, Guangzhou, China
Junzhang Tian Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China
Cheng Li Guangdong Traditional Medical and Sports Injury Rehabilitation Research Institute, Guangdong Second Provincial General Hospital, Guangzhou, China
Wanmin Lian Center for Information, Guangdong Second Provincial General Hospital, Guangzhou, China
Jingjun He Center for Health Management and Examination, Guangdong Second Provincial General Hospital, Guangzhou, China
Hongying Qu Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China Center for Health Management and Examination, Guangdong Second Provincial General Hospital, Guangzhou, China
Xiaojie Wang Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China
Hongmei Liu Institute of Ultrasound in Musculoskeletal Sports Medicine, Guangdong Second Provincial General Hospital, Guangzhou, China Department of Ultrasound, Guangdong Second Provincial General Hospital, Guangzhou, China
Guanming Li Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China
Guowei Li Center for Clinical Epidemiology and Methodology, Guangdong Second Provincial General Hospital, Guangdong, China Department of Health Research Methods, Evidence, and Impact (HEI), McMaster University, Hamilton, Ontario, Canada

Collapse

Trading-Off Machine Learning Algorithms towards Data-Driven Administrative-Socio-Economic Population Health Management. COMPUTERS 2020. [DOI: 10.3390/computers10010004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]

Saide S, Sheng ML. Toward Business Process Innovation in the Big Data Era: A Mediating Roles of Big Data Knowledge Management. BIG DATA 2020;8:464-477. [PMID: 33216653 DOI: 10.1089/big.2020.0140] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Abstract

While recent debate recognizes the importance of big data (BD) and knowledge management (KM) in firm performance, there has been a paucity of literature regarding big data analytics technological (BDAT) and knowledge exploration-exploitation capabilities (KEEC) in the context of business process innovation (BPI). This study aims to identify whether BD and KM can be established in these emerging issues. We used a survey questionnaire to collect data from various firms and industries. We used structural equation modeling (SmartPLS and SPSS) to validate the research model with a sample of 155 companies in a developing country such as Indonesia. The result demonstrates a positive relationship between KEEC and BPI, followed by several significant findings such as BDAT with KEEC; KEEC on big data knowledge management (BDKM); BDKM and BPI; and BDAT on BDKM. In contrast, BDAT is nonsignificant for direct relationship on BPI, and interestingly, it becomes a significant result after mediated by BDKM. Similarly, BDKM has successfully mediated the relationship between KEEC and BPI. The management level ideally develops and increases such a knowledge creation/acquisition practices and BDAT in an organization to gain more meaningful benefits from these two capabilities. BDAT, KEEC, and BDKM simultaneously are a clear antecedent approach, which ultimately results in flexibility, effectiveness, and effectivity of BPI. The cases of this research are profit firms in a developing country such as Indonesia. A future study could be considered in different settings such as type of industries or more specific company's type, the economy level of countries (comparing between developed and developing countries), and environmental dynamical. A novel field of study is the inclusion of knowledge exploration-exploitation and BDAT that drives BPI.

Collapse

Osawa I, Goto T, Yamamoto Y, Tsugawa Y. Machine-learning-based prediction models for high-need high-cost patients using nationwide clinical and claims data. NPJ Digit Med 2020;3:148. [PMID: 33299137 PMCID: PMC7658979 DOI: 10.1038/s41746-020-00354-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Accepted: 10/09/2020] [Indexed: 12/23/2022] Open