Reference Citation Analysis

For: Rose S. Robust Machine Learning Variable Importance Analyses of Medical Conditions for Health Care Spending. Health Serv Res 2018. [PMID: 29527659 DOI: 10.1111/1475-6773.12848] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Indexed: 11/29/2022]



Cited by Other Article(s)

Jung HW, Jang JS. Constructing prediction models and analyzing factors in suicidal ideation using machine learning, focusing on the older population. PLoS One 2024;19:e0305777. [PMID: 39038039 PMCID: PMC11262681 DOI: 10.1371/journal.pone.0305777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2024] [Accepted: 06/04/2024] [Indexed: 07/24/2024]

Mputu Mputu P, Beauséjour M, Richard-Denis A, Fallah N, Noonan VK, Mac-Thiong JM. Classifying clinical phenotypes of functional recovery for acute traumatic spinal cord injury. An observational cohort study. Disabil Rehabil 2024:1-8. [PMID: 38390856 DOI: 10.1080/09638288.2024.2320267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2023] [Accepted: 02/14/2024] [Indexed: 02/24/2024]

Li H, Rosete S, Coyle J, Phillips RV, Hejazi NS, Malenica I, Arnold BF, Benjamin-Chung J, Mertens A, Colford JM, van der Laan MJ, Hubbard AE. Evaluating the robustness of targeted maximum likelihood estimators via realistic simulations in nutrition intervention trials. Stat Med 2022;41:2132-2165. [PMID: 35172378 PMCID: PMC10362909 DOI: 10.1002/sim.9348] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/09/2021] [Revised: 01/20/2022] [Accepted: 01/26/2022] [Indexed: 12/18/2022]

Wallace J, McWilliams JM, Lollo A, Eaton J, Ndumele CD. Residual Confounding in Health Plan Performance Assessments: Evidence From Randomization in Medicaid. Ann Intern Med 2022;175:314-324. [PMID: 34978862 DOI: 10.7326/m21-0881] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/22/2022]

Abstract

BACKGROUND

Risk adjustment is used widely in payment systems and performance assessments, but the extent to which it distinguishes plan or provider effects from confounding due to patient differences is typically unknown.

OBJECTIVE

To assess the degree to which risk-adjusted measures of health plan performance adequately adjust for the variation across plans that arises because of differences in patient characteristics (residual confounding).

DESIGN

Comparison between plan performance estimates based on enrollees who made plan choices (observational population) and estimates based on enrollees assigned to plans (randomized population).

SETTING

Natural experiment in which more than two thirds of a state's Medicaid population in 1 region was randomly assigned to 1 of 5 plans.

PARTICIPANTS

137 933 enrollees in 2013 to 2014, of whom 31.1% selected a plan and 68.9% were randomly assigned to 1 of the same 5 plans.

MEASUREMENTS

Annual total spending (that is, payments to providers), primary care use, dental care use, and avoidable emergency department visits, all scored as plan-specific deviations from the "average" plan performance within each population.

RESULTS

Enrollee characteristics were appreciably imbalanced across plans in the observational population, as expected, but were not in the randomized population. Annual total spending varied across plans more in the observational population (SD, $147 per enrollee) than in the randomized population (SD, $70 per enrollee) after accounting for baseline differences in the observational and randomized populations and for differences across plans. On average, a plan's spending score (its deviation from the "average" performance) in the observational population differed from its score in the randomized population by $67 per enrollee in absolute value (95% CI, $38 to $123), or 4.2% of mean spending per enrollee (P = 0.009, rejecting the null hypothesis that this difference would be expected from sampling error). The difference was reduced modestly by risk adjustment to $62 per enrollee (P = 0.012). Residual confounding was similarly substantial for most other performance measures. Further adjustment for social factors did not materially change estimates.
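The scoring described above can be illustrated with a minimal sketch. It assumes each plan's score is its mean spending minus the unweighted mean of plan means within a population, and it compares the two populations by the mean absolute difference in scores. The study itself works with adjusted estimates, so this is only an illustration of the arithmetic; the function names and all numbers are hypothetical toy values, not data from the article.

```python
from statistics import mean

def plan_scores(spending_by_plan):
    """Score each plan as its mean spending minus the average of plan means.

    Assumption: the "average" plan is the unweighted mean of plan means.
    """
    plan_means = {plan: mean(vals) for plan, vals in spending_by_plan.items()}
    average_plan = mean(plan_means.values())
    return {plan: m - average_plan for plan, m in plan_means.items()}

def mean_abs_difference(scores_a, scores_b):
    """Average absolute gap between a plan's score in two populations."""
    return mean(abs(scores_a[p] - scores_b[p]) for p in scores_a)

# Hypothetical per-enrollee spending, for illustration only.
observational = {"A": [100, 120], "B": [80, 60], "C": [150, 150]}
randomized = {"A": [100, 100], "B": [90, 90], "C": [110, 110]}

obs_scores = plan_scores(observational)
rand_scores = plan_scores(randomized)
gap = mean_abs_difference(obs_scores, rand_scores)
print(obs_scores, rand_scores, gap)
```

In this toy setup a large gap between the two sets of scores would point to confounding by enrollee characteristics in the observational population, which is the comparison the abstract reports.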

LIMITATION

Potential heterogeneity in plan effects between the 2 populations.

CONCLUSION

Residual confounding in risk-adjusted performance assessments can be substantial and should caution policymakers against assuming that risk adjustment isolates real differences in plan performance.

PRIMARY FUNDING SOURCE

Arnold Ventures.


Zink A, Rose S. Identifying undercompensated groups defined by multiple attributes in risk adjustment. BMJ Health Care Inform 2021;28:bmjhci-2021-100414. [PMID: 34535447 PMCID: PMC8451283 DOI: 10.1136/bmjhci-2021-100414] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/19/2021] [Accepted: 08/25/2021] [Indexed: 11/22/2022]

Wang Z, Chen X, Tan X, Yang L, Kannapur K, Vincent JL, Kessler GN, Ru B, Yang M. Using Deep Learning to Identify High-Risk Patients with Heart Failure with Reduced Ejection Fraction. Journal of Health Economics and Outcomes Research 2021;8:6-13. [PMID: 34414250 PMCID: PMC8322198 DOI: 10.36469/jheor.2021.25753] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 06/17/2021] [Accepted: 07/15/2021] [Indexed: 06/13/2023]

Abstract

Background: Deep Learning (DL) has not been well-established as a method to identify high-risk patients among patients with heart failure (HF).

Objectives: This study aimed to use DL models to predict hospitalizations, worsening HF events, and 30-day and 90-day readmissions in patients with heart failure with reduced ejection fraction (HFrEF).

Methods: We analyzed the data of adult HFrEF patients from the IBM® MarketScan® Commercial and Medicare Supplement databases between January 1, 2015 and December 31, 2017. A sequential model architecture based on bi-directional long short-term memory (Bi-LSTM) layers was utilized. For DL models to predict HF hospitalizations and worsening HF events, we utilized two study designs: with and without a buffer window. For comparison, we also tested multiple traditional machine learning models including logistic regression, random forest, and eXtreme Gradient Boosting (XGBoost). Model performance was assessed by area under the curve (AUC) values, precision, and recall on an independent testing dataset.

Results: A total of 47 498 HFrEF patients were included; 9427 with at least one HF hospitalization. The best AUCs of DL models without a buffer window in predicting HF hospitalizations and worsening HF events in the total patient cohort were 0.977 and 0.972; with a 7-day buffer window the best AUCs were 0.573 and 0.608, respectively. The best AUCs in predicting 30- and 90-day readmissions in all adult patients were 0.597 and 0.614, respectively. An AUC of 0.861 was attained for prediction of 90-day readmission in patients aged 18-64. For all outcomes assessed, the DL approach outperformed traditional machine learning models.

Discussion: The DL approach can automate feature engineering during the model learning, which can increase the clinical applicability and lead to comparable or better model performance. However, the lack of granular clinical data, and sample size and imbalance issues may have limited the model's performance.

Conclusions: A DL approach using Bi-LSTM was shown to be a feasible and useful tool to predict HF-related outcomes. This study can help inform the future development and deployment of predictive tools to identify high-risk HFrEF patients and ultimately facilitate targeted interventions in clinical practice.


Rose S. Intersections of machine learning and epidemiological methods for health services research. Int J Epidemiol 2021;49:1763-1770. [PMID: 32236476 PMCID: PMC7825941 DOI: 10.1093/ije/dyaa035] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Accepted: 02/17/2020] [Indexed: 12/15/2022]

McConnell KJ, Lindner S. Estimating treatment effects with machine learning. Health Serv Res 2019;54:1273-1282. [PMID: 31602641 DOI: 10.1111/1475-6773.13212] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Indexed: 12/29/2022]

Huber M, Kurz C, Leidl R. Predicting patient-reported outcomes following hip and knee replacement surgery using supervised machine learning. BMC Med Inform Decis Mak 2019;19:3. [PMID: 30621670 PMCID: PMC6325823 DOI: 10.1186/s12911-018-0731-6] [Citation(s) in RCA: 60] [Impact Index Per Article: 12.0] [Received: 07/27/2018] [Accepted: 12/27/2018] [Indexed: 12/28/2022]

Abstract

BACKGROUND

Machine-learning classifiers mostly offer good predictive performance and are increasingly used to support shared decision-making in clinical practice. Focusing on performance and practicability, this study evaluates prediction of patient-reported outcomes (PROs) by eight supervised classifiers including a linear model, following hip and knee replacement surgery.

METHODS

NHS PRO data (130,945 observations) from April 2015 to April 2017 were used to train and test eight classifiers to predict binary postoperative improvement based on minimal important differences. Area under the receiver operating characteristic, J-statistic and several other metrics were calculated. The dependent outcomes were generic and disease-specific improvement based on the EQ-5D-3L visual analogue scale (VAS) as well as the Oxford Hip and Knee Score (Q score).
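For readers unfamiliar with the two headline metrics, the following is a minimal sketch of how they can be computed for a binary outcome. It assumes the J-statistic refers to Youden's J (sensitivity + specificity - 1) at a chosen threshold and computes the area under the ROC curve by pairwise ranking. The function names, labels, scores, and threshold are hypothetical toy values, not the study's data or code.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a random positive case scores higher
    than a random negative case; ties count as one half.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def youden_j(labels, scores, threshold):
    """Youden's J = sensitivity + specificity - 1 at a given threshold."""
    preds = [s >= threshold for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and not p)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and not p)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p)
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity + specificity - 1

# Hypothetical outcomes (1 = improved) and predicted probabilities.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]

toy_auc = auc(labels, scores)
toy_j = youden_j(labels, scores, threshold=0.5)
print(round(toy_auc, 3), round(toy_j, 3))
```

The pairwise form is quadratic in the number of cases, so for data on the scale described here a library routine would be the practical choice; the sketch is meant only to show what the metrics measure.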

RESULTS

The area under the receiver operating characteristic of the best training models was around 0.87 (VAS) and 0.78 (Q score) for hip replacement, while it was around 0.86 (VAS) and 0.70 (Q score) for knee replacement surgery. Extreme gradient boosting, random forests, multistep elastic net and linear model provided the highest overall J-statistics. Based on variable importance, the most important predictors for post-operative outcomes were preoperative VAS, Q score and single Q score dimensions. Sensitivity analysis for hip replacement VAS evaluated the influence of minimal important difference, patient selection criteria as well as additional data years. Together with a small benchmark of the NHS prediction model, robustness of our results was confirmed.

CONCLUSIONS

Supervised machine-learning implementations, like extreme gradient boosting, can provide better performance than linear models and should be considered when high predictive performance is needed. Preoperative VAS, Q score and specific dimensions like limping are the most important predictors for postoperative hip and knee PROMs.
