Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Weberpals J, Becker T, Davies J, Schmich F, Rüttinger D, Theis FJ, Bauer-Mehren A. Deep Learning-based Propensity Scores for Confounding Control in Comparative Effectiveness Research: A Large-scale, Real-world Data Study. Epidemiology 2021;32:378-388. [PMID: 33591049 DOI: 10.1097/ede.0000000000001338] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

For:	Weberpals J, Becker T, Davies J, Schmich F, Rüttinger D, Theis FJ, Bauer-Mehren A. Deep Learning-based Propensity Scores for Confounding Control in Comparative Effectiveness Research: A Large-scale, Real-world Data Study. Epidemiology 2021;32:378-388. [PMID: 33591049 DOI: 10.1097/ede.0000000000001338] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]

Number

Cited by Other Article(s)

Chen F, Wang L, Hong J, Jiang J, Zhou L. Unmasking bias in artificial intelligence: a systematic review of bias detection and mitigation strategies in electronic health record-based models. J Am Med Inform Assoc 2024;31:1172-1183. [PMID: 38520723 PMCID: PMC11031231 DOI: 10.1093/jamia/ocae060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 02/26/2024] [Accepted: 03/05/2024] [Indexed: 03/25/2024] Open

Abstract

OBJECTIVES

Leveraging artificial intelligence (AI) in conjunction with electronic health records (EHRs) holds transformative potential to improve healthcare. However, addressing bias in AI, which risks worsening healthcare disparities, cannot be overlooked. This study reviews methods to handle various biases in AI models developed using EHR data.

MATERIALS AND METHODS

We conducted a systematic review following the Preferred Reporting Items for Systematic Reviews and Meta-analyses guidelines, analyzing articles from PubMed, Web of Science, and IEEE published between January 01, 2010 and December 17, 2023. The review identified key biases, outlined strategies for detecting and mitigating bias throughout the AI model development, and analyzed metrics for bias assessment.

RESULTS

Of the 450 articles retrieved, 20 met our criteria, revealing 6 major bias types: algorithmic, confounding, implicit, measurement, selection, and temporal. The AI models were primarily developed for predictive tasks, yet none have been deployed in real-world healthcare settings. Five studies concentrated on the detection of implicit and algorithmic biases employing fairness metrics like statistical parity, equal opportunity, and predictive equity. Fifteen studies proposed strategies for mitigating biases, especially targeting implicit and selection biases. These strategies, evaluated through both performance and fairness metrics, predominantly involved data collection and preprocessing techniques like resampling and reweighting.

DISCUSSION

This review highlights evolving strategies to mitigate bias in EHR-based AI models, emphasizing the urgent need for both standardized and detailed reporting of the methodologies and systematic real-world testing and evaluation. Such measures are essential for gauging models' practical impact and fostering ethical AI that ensures fairness and equity in healthcare.

Collapse

Loureiro H, Roller A, Schneider M, Talavera-López C, Becker T, Bauer-Mehren A. Matching by OS Prognostic Score to Construct External Controls in Lung Cancer Clinical Trials. Clin Pharmacol Ther 2024;115:333-341. [PMID: 37975320 DOI: 10.1002/cpt.3109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 11/08/2023] [Indexed: 11/19/2023]

Zang C, Zhang H, Xu J, Zhang H, Fouladvand S, Havaldar S, Cheng F, Chen K, Chen Y, Glicksberg BS, Chen J, Bian J, Wang F. High-throughput target trial emulation for Alzheimer's disease drug repurposing with real-world data. Nat Commun 2023;14:8180. [PMID: 38081829 PMCID: PMC10713627 DOI: 10.1038/s41467-023-43929-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Accepted: 11/24/2023] [Indexed: 12/18/2023] Open

Affiliation(s)

Chengxi Zang Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, New York, NY, USA
Hao Zhang Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA
Jie Xu Department of Health Outcomes & Biomedical Informatics, University of Florida, Gainesville, FL, USA
Hansi Zhang Department of Health Outcomes & Biomedical Informatics, University of Florida, Gainesville, FL, USA
Sajjad Fouladvand Institude for Biomedical Informatics (IBI) and Department of Computer Science, University of Kentucky, Lexington, KY, USA
Shreyas Havaldar Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Feixiong Cheng Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
Kun Chen Department of Statistics, University of Connecticut, Storrs, CT, USA
Yong Chen Department of Biostatistics, Epidemiology and Informatics (DBEI), the Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Benjamin S Glicksberg Hasso Plattner Institute for Digital Health at Mount Sinai, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Jin Chen Institude for Biomedical Informatics (IBI) and Department of Computer Science, University of Kentucky, Lexington, KY, USA
Jiang Bian Department of Health Outcomes & Biomedical Informatics, University of Florida, Gainesville, FL, USA
Fei Wang Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, USA. Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, New York, NY, USA.

Collapse

Weymann D, Chan B, Regier DA. Genetic matching for time-dependent treatments: a longitudinal extension and simulation study. BMC Med Res Methodol 2023;23:181. [PMID: 37559105 PMCID: PMC10413721 DOI: 10.1186/s12874-023-01995-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Accepted: 07/21/2023] [Indexed: 08/11/2023] Open

Abstract

BACKGROUND

Longitudinal matching can mitigate confounding in observational, real-world studies of time-dependent treatments. To date, these methods have required iterative, manual re-specifications to achieve covariate balance. We propose a longitudinal extension of genetic matching, a machine learning approach that automates balancing of covariate histories. We examine performance by comparing the proposed extension against baseline propensity score matching and time-dependent propensity score matching.

METHODS

To evaluate comparative performance, we developed a Monte Carlo simulation framework that reflects a static treatment assigned at multiple time points. Data generation considers a treatment assignment model, a continuous outcome model, and underlying covariates. In simulation, we generated 1,000 datasets, each consisting of 1,000 subjects, and applied: (1) nearest neighbour matching on time-invariant, baseline propensity scores; (2) sequential risk set matching on time-dependent propensity scores; and (3) longitudinal genetic matching on time-dependent covariates. To measure comparative performance, we estimated covariate balance, efficiency, bias, and root mean squared error (RMSE) of treatment effect estimates. In scenario analysis, we varied underlying assumptions for assumed covariate distributions, correlations, treatment assignment models, and outcome models.

RESULTS

In all scenarios, baseline propensity score matching resulted in biased effect estimation in the presence of time-dependent confounding, with mean bias ranging from 29.7% to 37.2%. In contrast, time-dependent propensity score matching and longitudinal genetic matching achieved stronger covariate balance and yielded less biased estimation, with mean bias ranging from 0.7% to 13.7%. Across scenarios, longitudinal genetic matching achieved similar or better performance than time-dependent propensity score matching without requiring manual re-specifications or normality of covariates.

CONCLUSIONS

While the most appropriate longitudinal method will depend on research questions and underlying data patterns, our study can help guide these decisions. Simulation results demonstrate the validity of our longitudinal genetic matching approach for supporting future real-world assessments of treatments accessible at multiple time points.

Collapse

MacDonald S, Foley H, Yap M, Johnston RL, Steven K, Koufariotis LT, Sharma S, Wood S, Addala V, Pearson JV, Roosta F, Waddell N, Kondrashova O, Trzaskowski M. Generalising uncertainty improves accuracy and safety of deep learning analytics applied to oncology. Sci Rep 2023;13:7395. [PMID: 37149669 PMCID: PMC10164181 DOI: 10.1038/s41598-023-31126-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2022] [Accepted: 03/07/2023] [Indexed: 05/08/2023] Open

Yang S, Du P, Feng X, He D, Chen Y, Zhong LLD, Yan X, Luo J. Propensity score analysis with missing data using a multi-task neural network. BMC Med Res Methodol 2023;23:41. [PMID: 36793016 PMCID: PMC9930709 DOI: 10.1186/s12874-023-01847-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Accepted: 01/20/2023] [Indexed: 02/17/2023] Open

Kwee SA, Wong LL, Ludema C, Deng CK, Taira D, Seto T, Landsittel D. Target Trial Emulation: A Design Tool for Cancer Clinical Trials. JCO Clin Cancer Inform 2023;7:e2200140. [PMID: 36608311 PMCID: PMC10166475 DOI: 10.1200/cci.22.00140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 11/11/2022] [Accepted: 11/23/2022] [Indexed: 01/09/2023] Open

Kasim S, Malek S, Song C, Wan Ahmad WA, Fong A, Ibrahim KS, Safiruz MS, Aziz F, Hiew JH, Ibrahim N. In-hospital mortality risk stratification of Asian ACS patients with artificial intelligence algorithm. PLoS One 2022;17:e0278944. [PMID: 36508425 PMCID: PMC9744311 DOI: 10.1371/journal.pone.0278944] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 11/25/2022] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

Conventional risk score for predicting in-hospital mortality following Acute Coronary Syndrome (ACS) is not catered for Asian patients and requires different types of scoring algorithms for STEMI and NSTEMI patients.

OBJECTIVE

To derive a single algorithm using deep learning and machine learning for the prediction and identification of factors associated with in-hospital mortality in Asian patients with ACS and to compare performance to a conventional risk score.

METHODS

The Malaysian National Cardiovascular Disease Database (NCVD) registry, is a multi-ethnic, heterogeneous database spanning from 2006-2017. It was used for in-hospital mortality model development with 54 variables considered for patients with STEMI and Non-STEMI (NSTEMI). Mortality prediction was analyzed using feature selection methods with machine learning algorithms. Deep learning algorithm using features selected from machine learning was compared to Thrombolysis in Myocardial Infarction (TIMI) score.

RESULTS

A total of 68528 patients were included in the analysis. Deep learning models constructed using all features and selected features from machine learning resulted in higher performance than machine learning and TIMI risk score (p < 0.0001 for all). The best model in this study is the combination of features selected from the SVM algorithm with a deep learning classifier. The DL (SVM selected var) algorithm demonstrated the highest predictive performance with the least number of predictors (14 predictors) for in-hospital prediction of STEMI patients (AUC = 0.96, 95% CI: 0.95-0.96). In NSTEMI in-hospital prediction, DL (RF selected var) (AUC = 0.96, 95% CI: 0.95-0.96, reported slightly higher AUC compared to DL (SVM selected var) (AUC = 0.95, 95% CI: 0.94-0.95). There was no significant difference between DL (SVM selected var) algorithm and DL (RF selected var) algorithm (p = 0.5). When compared to the DL (SVM selected var) model, the TIMI score underestimates patients' risk of mortality. TIMI risk score correctly identified 13.08% of the high-risk patient's non-survival vs 24.7% for the DL model and 4.65% vs 19.7% of the high-risk patient's non-survival for NSTEMI. Age, heart rate, Killip class, cardiac catheterization, oral hypoglycemia use and antiarrhythmic agent were found to be common predictors of in-hospital mortality across all ML feature selection models in this study. The final algorithm was converted into an online tool with a database for continuous data archiving for prospective validation.

CONCLUSIONS

ACS patients were better classified using a combination of machine learning and deep learning in a multi-ethnic Asian population when compared to TIMI scoring. Machine learning enables the identification of distinct factors in individual Asian populations to improve mortality prediction. Continuous testing and validation will allow for better risk stratification in the future, potentially altering management and outcomes.

Collapse

Affiliation(s)

Sazzli Kasim Cardiology Department, Faculty of Medicine, Universiti Teknologi MARA (UiTM), Shah Alam, Malaysia Cardiac Vascular and Lung Research Institute, Universiti Teknologi MARA (UiTM), Shah Alam, Malaysia National Heart Association of Malaysia, Heart House, Kuala Lumpur, Malaysia Faculty of Medicine, Universiti Teknologi MARA (UiTM), Sungai Buloh Campus, Sungai Buloh, Malaysia
Sorayya Malek Bioinformatics Division, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia * E-mail:
Cheen Song Bioinformatics Division, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
Wan Azman Wan Ahmad National Heart Association of Malaysia, Heart House, Kuala Lumpur, Malaysia Division of Cardiology, University Malaya Medical Centre, Kuala Lumpur, Malaysia
Alan Fong Sarawak Heart Centre, Kota Samarahan, Sarawak, Malaysia Clinical Research Centre, Sarawak General Hospital, Institute for Clinical Research, National Institutes of Health, Jalan Hospital, Kuching, Sarawak, Malaysia Swinburne University of Technology, Sarawak Campus, Kuching, Malaysia
Khairul Shafiq Ibrahim Cardiology Department, Faculty of Medicine, Universiti Teknologi MARA (UiTM), Shah Alam, Malaysia Cardiac Vascular and Lung Research Institute, Universiti Teknologi MARA (UiTM), Shah Alam, Malaysia National Heart Association of Malaysia, Heart House, Kuala Lumpur, Malaysia
Muhammad Shahreeza Safiruz Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
Firdaus Aziz Bioinformatics Division, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
Jia Hui Hiew Bioinformatics Division, Institute of Biological Sciences, Faculty of Science, University of Malaya, Kuala Lumpur, Malaysia
Nurulain Ibrahim Faculty of Medicine, Universiti Teknologi MARA (UiTM), Sungai Buloh Campus, Sungai Buloh, Malaysia

Collapse

Wyss R, Yanover C, El-Hay T, Bennett D, Platt RW, Zullo AR, Sari G, Wen X, Ye Y, Yuan H, Gokhale M, Patorno E, Lin KJ. Machine learning for improving high-dimensional proxy confounder adjustment in healthcare database studies: an overview of the current literature. Pharmacoepidemiol Drug Saf 2022;31:932-943. [PMID: 35729705 PMCID: PMC9541861 DOI: 10.1002/pds.5500] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Revised: 06/01/2022] [Accepted: 06/05/2022] [Indexed: 11/10/2022]