51
|
Burn E, Sena AG, Prats-Uribe A, Spotnitz M, DuVall S, Lynch KE, Matheny ME, Nyberg F, Ahmed WUR, Alser O, Alghoul H, Alshammari T, Zhang L, Casajust P, Areia C, Shah K, Reich C, Blacketer C, Andryc A, Fortin S, Natarajan K, Gong M, Golozar A, Morales D, Rijnbeek P, Subbian V, Roel E, Recalde M, Lane JCE, Vizcaya D, Posada JD, Shah NH, Jonnagaddala J, Lai LYH, Avilés-Jurado FX, Hripcsak G, Suchard MA, Ranzani OT, Ryan P, Prieto-Alhambra D, Kostka K, Duarte-Salles T. Use of dialysis, tracheostomy, and extracorporeal membrane oxygenation among 842,928 patients hospitalized with COVID-19 in the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021. [PMID: 33269356 PMCID: PMC7709172 DOI: 10.1101/2020.11.25.20229088] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Objective To estimate the proportion of patients hospitalized with COVID-19 who undergo dialysis, tracheostomy, and extracorporeal membrane oxygenation (ECMO). Design A network cohort study. Setting Seven databases from the United States containing routinely-collected patient data: HealthVerity, Premier, IQVIA Hospital CDM, IQVIA Open Claims, Optum EHR, Optum SES, and VA-OMOP. Patients Patients hospitalized with a clinical diagnosis or a positive test result for COVID-19. Interventions Dialysis, tracheostomy, and ECMO. Measurements and Main Results 842,928 patients hospitalized with COVID-19 were included (22,887 from HealthVerity, 77,853 from IQVIA Hospital CDM, 533,997 from IQVIA Open Claims, 36,717 from Optum EHR, 4,336 from OPTUM SES, 156,187 from Premier, and 10,951 from VA-OMOP). Across the six databases, 35,192 (4.17% [95% CI: 4.13% to 4.22%]) patients received dialysis, 6,950 (0.82% [0.81% to 0.84%]) had a tracheostomy, and 1,568 (0.19% [95% CI: 0.18% to 0.20%]) patients underwent ECMO over the 30 days following hospitalization. Use of ECMO was more common among patients who were younger, male, and with fewer comorbidities. Tracheostomy was broadly used for a similar proportion of patients regardless of age, sex, or comorbidity. While dialysis was generally used for a similar proportion among younger and older patients, it was more frequent among male patients and among those with chronic kidney disease. Conclusion Use of dialysis among those hospitalized with COVID-19 is high at around 4%. Although less than one percent of patients undergo tracheostomy and ECMO, the absolute numbers of patients who have undergone these interventions is substantial.
Collapse
|
52
|
Long DR, Gombar S, Hogan CA, Greninger AL, O’Reilly-Shah V, Bryson-Cahn C, Stevens B, Rustagi A, Jerome KR, Kong CS, Zehnder J, Shah NH, Weiss NS, Pinsky BA, Sunshine JE. Occurrence and Timing of Subsequent Severe Acute Respiratory Syndrome Coronavirus 2 Reverse-transcription Polymerase Chain Reaction Positivity Among Initially Negative Patients. Clin Infect Dis 2021; 72:323-326. [PMID: 33501950 PMCID: PMC7314163 DOI: 10.1093/cid/ciaa722] [Citation(s) in RCA: 68] [Impact Index Per Article: 22.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Accepted: 06/03/2020] [Indexed: 12/20/2022] Open
Abstract
Using data for 20 912 patients from 2 large academic health systems, we analyzed the frequency of severe acute respiratory syndrome coronavirus 2 reverse-transcription polymerase chain reaction test discordance among individuals initially testing negative by nasopharyngeal swab who were retested on clinical grounds within 7 days. The frequency of subsequent positivity within this window was 3.5% and was similar across institutions.
Collapse
|
53
|
Steinberg E, Jung K, Fries JA, Corbin CK, Pfohl SR, Shah NH. Language models are an effective representation learning technique for electronic health record data. J Biomed Inform 2021; 113:103637. [PMID: 33290879 PMCID: PMC7863633 DOI: 10.1016/j.jbi.2020.103637] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 10/10/2020] [Accepted: 11/26/2020] [Indexed: 11/17/2022]
Abstract
Widespread adoption of electronic health records (EHRs) has fueled the development of using machine learning to build prediction models for various clinical outcomes. However, this process is often constrained by having a relatively small number of patient records for training the model. We demonstrate that using patient representation schemes inspired from techniques in natural language processing can increase the accuracy of clinical prediction models by transferring information learned from the entire patient population to the task of training a specific model, where only a subset of the population is relevant. Such patient representation schemes enable a 3.5% mean improvement in AUROC on five prediction tasks compared to standard baselines, with the average improvement rising to 19% when only a small number of patient records are available for training the clinical prediction model.
Collapse
|
54
|
Jung K, Kashyap S, Avati A, Harman S, Shaw H, Li R, Smith M, Shum K, Javitz J, Vetteth Y, Seto T, Bagley SC, Shah NH. A framework for making predictive models useful in practice. J Am Med Inform Assoc 2020; 28:1149-1158. [PMID: 33355350 PMCID: PMC8200271 DOI: 10.1093/jamia/ocaa318] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2020] [Accepted: 11/27/2020] [Indexed: 01/03/2023] Open
Abstract
Objective To analyze the impact of factors in healthcare delivery on the net benefit of triggering an Advanced Care Planning (ACP) workflow based on predictions of 12-month mortality. Materials and Methods We built a predictive model of 12-month mortality using electronic health record data and evaluated the impact of healthcare delivery factors on the net benefit of triggering an ACP workflow based on the models’ predictions. Factors included nonclinical reasons that make ACP inappropriate: limited capacity for ACP, inability to follow up due to patient discharge, and availability of an outpatient workflow to follow up on missed cases. We also quantified the relative benefits of increasing capacity for inpatient ACP versus outpatient ACP. Results Work capacity constraints and discharge timing can significantly reduce the net benefit of triggering the ACP workflow based on a model’s predictions. However, the reduction can be mitigated by creating an outpatient ACP workflow. Given limited resources to either add capacity for inpatient ACP versus developing outpatient ACP capability, the latter is likely to provide more benefit to patient care. Discussion The benefit of using a predictive model for identifying patients for interventions is highly dependent on the capacity to execute the workflow triggered by the model. We provide a framework for quantifying the impact of healthcare delivery factors and work capacity constraints on achieved benefit. Conclusion An analysis of the sensitivity of the net benefit realized by a predictive model triggered clinical workflow to various healthcare delivery factors is necessary for making predictive models useful in practice.
Collapse
|
55
|
Röltgen K, Powell AE, Wirz OF, Stevens BA, Hogan CA, Najeeb J, Hunter M, Wang H, Sahoo MK, Huang C, Yamamoto F, Manohar M, Manalac J, Otrelo-Cardoso AR, Pham TD, Rustagi A, Rogers AJ, Shah NH, Blish CA, Cochran JR, Jardetzky TS, Zehnder JL, Wang TT, Narasimhan B, Gombar S, Tibshirani R, Nadeau KC, Kim PS, Pinsky BA, Boyd SD. Defining the features and duration of antibody responses to SARS-CoV-2 infection associated with disease severity and outcome. Sci Immunol 2020; 5:eabe0240. [PMID: 33288645 PMCID: PMC7857392 DOI: 10.1126/sciimmunol.abe0240] [Citation(s) in RCA: 325] [Impact Index Per Article: 81.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 10/05/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022]
Abstract
SARS-CoV-2-specific antibodies, particularly those preventing viral spike receptor binding domain (RBD) interaction with host angiotensin-converting enzyme 2 (ACE2) receptor, can neutralize the virus. It is, however, unknown which features of the serological response may affect clinical outcomes of COVID-19 patients. We analyzed 983 longitudinal plasma samples from 79 hospitalized COVID-19 patients and 175 SARS-CoV-2-infected outpatients and asymptomatic individuals. Within this cohort, 25 patients died of their illness. Higher ratios of IgG antibodies targeting S1 or RBD domains of spike compared to nucleocapsid antigen were seen in outpatients who had mild illness versus severely ill patients. Plasma antibody increases correlated with decreases in viral RNAemia, but antibody responses in acute illness were insufficient to predict inpatient outcomes. Pseudovirus neutralization assays and a scalable ELISA measuring antibodies blocking RBD-ACE2 interaction were well correlated with patient IgG titers to RBD. Outpatient and asymptomatic individuals' SARS-CoV-2 antibodies, including IgG, progressively decreased during observation up to five months post-infection.
Collapse
|
56
|
Tan EH, Sena AG, Prats-Uribe A, You SC, Ahmed WUR, Kostka K, Reich C, Duvall SL, Lynch KE, Matheny ME, Duarte-Salles T, Bertolin SF, Hripcsak G, Natarajan K, Falconer T, Spotnitz M, Ostropolets A, Blacketer C, Alshammari TM, Alghoul H, Alser O, Lane JC, Dawoud DM, Shah K, Yang Y, Zhang L, Areia C, Golozar A, Relcade M, Casajust P, Jonnagaddala J, Subbian V, Vizcaya D, Lai LYH, Nyberg F, Morales DR, Posada JD, Shah NH, Gong M, Vivekanantham A, Abend A, Minty EP, Suchard M, Rijnbeek P, Ryan PB, Prieto-Alhambra D. Characteristics, outcomes, and mortality amongst 133,589 patients with prevalent autoimmune diseases diagnosed with, and 48,418 hospitalised for COVID-19: a multinational distributed network cohort analysis. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.11.24.20236802. [PMID: 33269355 PMCID: PMC7709171 DOI: 10.1101/2020.11.24.20236802] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
OBJECTIVE Patients with autoimmune diseases were advised to shield to avoid COVID-19, but information on their prognosis is lacking. We characterised 30-day outcomes and mortality after hospitalisation with COVID-19 among patients with prevalent autoimmune diseases, and compared outcomes after hospital admissions among similar patients with seasonal influenza. DESIGN Multinational network cohort study. SETTING Electronic health records data from Columbia University Irving Medical Center (CUIMC) (NYC, United States [US]), Optum [US], Department of Veterans Affairs (VA) (US), Information System for Research in Primary Care-Hospitalisation Linked Data (SIDIAP-H) (Spain), and claims data from IQVIA Open Claims (US) and Health Insurance and Review Assessment (HIRA) (South Korea). PARTICIPANTS All patients with prevalent autoimmune diseases, diagnosed and/or hospitalised between January and June 2020 with COVID-19, and similar patients hospitalised with influenza in 2017-2018 were included. MAIN OUTCOME MEASURES 30-day complications during hospitalisation and death. RESULTS We studied 133,589 patients diagnosed and 48,418 hospitalised with COVID-19 with prevalent autoimmune diseases. The majority of participants were female (60.5% to 65.9%) and aged ≥50 years. The most prevalent autoimmune conditions were psoriasis (3.5 to 32.5%), rheumatoid arthritis (3.9 to 18.9%), and vasculitis (3.3 to 17.6%). Amongst hospitalised patients, Type 1 diabetes was the most common autoimmune condition (4.8% to 7.5%) in US databases, rheumatoid arthritis in HIRA (18.9%), and psoriasis in SIDIAP-H (26.4%).Compared to 70,660 hospitalised with influenza, those admitted with COVID-19 had more respiratory complications including pneumonia and acute respiratory distress syndrome, and higher 30-day mortality (2.2% to 4.3% versus 6.3% to 24.6%). CONCLUSIONS Patients with autoimmune diseases had high rates of respiratory complications and 30-day mortality following a hospitalization with COVID-19. Compared to influenza, COVID-19 is a more severe disease, leading to more complications and higher mortality. Future studies should investigate predictors of poor outcomes in COVID-19 patients with autoimmune diseases. WHAT IS ALREADY KNOWN ABOUT THIS TOPIC Patients with autoimmune conditions may be at increased risk of COVID-19 infection andcomplications.There is a paucity of evidence characterising the outcomes of hospitalised COVID-19 patients with prevalent autoimmune conditions. WHAT THIS STUDY ADDS Most people with autoimmune diseases who required hospitalisation for COVID-19 were women, aged 50 years or older, and had substantial previous comorbidities.Patients who were hospitalised with COVID-19 and had prevalent autoimmune diseases had higher prevalence of hypertension, chronic kidney disease, heart disease, and Type 2 diabetes as compared to those with prevalent autoimmune diseases who were diagnosed with COVID-19.A variable proportion of 6% to 25% across data sources died within one month of hospitalisation with COVID-19 and prevalent autoimmune diseases.For people with autoimmune diseases, COVID-19 hospitalisation was associated with worse outcomes and 30-day mortality compared to admission with influenza in the 2017-2018 season.
Collapse
|
57
|
Pfohl SR, Foryciarz A, Shah NH. An empirical characterization of fair machine learning for clinical risk prediction. J Biomed Inform 2020; 113:103621. [PMID: 33220494 DOI: 10.1016/j.jbi.2020.103621] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 10/06/2020] [Accepted: 11/05/2020] [Indexed: 11/19/2022]
Abstract
The use of machine learning to guide clinical decision making has the potential to worsen existing health disparities. Several recent works frame the problem as that of algorithmic fairness, a framework that has attracted considerable attention and criticism. However, the appropriateness of this framework is unclear due to both ethical as well as technical considerations, the latter of which include trade-offs between measures of fairness and model performance that are not well-understood for predictive models of clinical outcomes. To inform the ongoing debate, we conduct an empirical study to characterize the impact of penalizing group fairness violations on an array of measures of model performance and group fairness. We repeat the analysis across multiple observational healthcare databases, clinical outcomes, and sensitive attributes. We find that procedures that penalize differences between the distributions of predictions across groups induce nearly-universal degradation of multiple performance metrics within groups. On examining the secondary impact of these procedures, we observe heterogeneity of the effect of these procedures on measures of fairness in calibration and ranking across experimental conditions. Beyond the reported trade-offs, we emphasize that analyses of algorithmic fairness in healthcare lack the contextual grounding and causal awareness necessary to reason about the mechanisms that lead to health disparities, as well as about the potential of algorithmic fairness methods to counteract those mechanisms. In light of these limitations, we encourage researchers building predictive models for clinical use to step outside the algorithmic fairness frame and engage critically with the broader sociotechnical context surrounding the use of machine learning in healthcare.
Collapse
|
58
|
Chen R, Ryan P, Natarajan K, Falconer T, Crew KD, Reich CG, Vashisht R, Randhawa G, Shah NH, Hripcsak G. Treatment Patterns for Chronic Comorbid Conditions in Patients With Cancer Using a Large-Scale Observational Data Network. JCO Clin Cancer Inform 2020; 4:171-183. [PMID: 32134687 PMCID: PMC7113074 DOI: 10.1200/cci.19.00107] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
PURPOSE Patients with cancer are predisposed to developing chronic, comorbid conditions that affect prognosis, quality of life, and mortality. While treatment guidelines and care variations for these comorbidities have been described for the general noncancer population, less is known about real-world treatment patterns in patients with cancer. We sought to characterize the prevalence and distribution of initial treatment patterns across a large-scale data network for depression, hypertension, and type II diabetes mellitus (T2DM) among patients with cancer. METHODS We used the Observational Health Data Sciences and Informatics network, an international collaborative implementing the Observational Medical Outcomes Partnership Common Data Model to standardize more than 2 billion patient records. For this study, we used 8 databases across 3 countries—the United States, France, and Germany—with 295,529,655 patient records. We identified patients with cancer using SNOMED (Systematized Nomenclature of Medicine) codes validated via manual review. We then characterized the treatment patterns of these patients initiating treatment of depression, hypertension, or T2DM with persistent treatment and at least 365 days of observation. RESULTS Across databases, wide variations exist in treatment patterns for depression (n = 1,145,510), hypertension (n = 3,178,944), and T2DM (n = 886,766). When limited to 6-node (6-drug) sequences, we identified 61,052 unique sequences for depression, 346,067 sequences for hypertension, and 40,629 sequences for T2DM. These variations persisted across sites, databases, countries, and conditions, with the exception of metformin (73.8%) being the most common initial T2DM treatment. The most common initial medications were sertraline (17.5%) and escitalopram (17.5%) for depression and hydrochlorothiazide (20.5%) and lisinopril (19.6%) for hypertension. CONCLUSION We identified wide variations in the treatment of common comorbidities in patients with cancer, similar to the general population, and demonstrate the feasibility of conducting research on patients with cancer across a large-scale observational data network using a common data model.
Collapse
|
59
|
Duarte-Salles T, Vizcaya D, Pistillo A, Casajust P, Sena AG, Lai LYH, Prats-Uribe A, Ahmed WUR, Alshammari TM, Alghoul H, Alser O, Burn E, You SC, Areia C, Blacketer C, DuVall S, Falconer T, Fernandez-Bertolin S, Fortin S, Golozar A, Gong M, Tan EH, Huser V, Iveli P, Morales DR, Nyberg F, Posada JD, Recalde M, Roel E, Schilling LM, Shah NH, Shah K, Suchard MA, Zhang L, Zhang Y, Williams AE, Reich CG, Hripcsak G, Rijnbeek P, Ryan P, Kostka K, Prieto-Alhambra D. Baseline characteristics, management, and outcomes of 55,270 children and adolescents diagnosed with COVID-19 and 1,952,693 with influenza in France, Germany, Spain, South Korea and the United States: an international network cohort study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.10.29.20222083. [PMID: 33140074 PMCID: PMC7605587 DOI: 10.1101/2020.10.29.20222083] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Objectives To characterize the demographics, comorbidities, symptoms, in-hospital treatments, and health outcomes among children/adolescents diagnosed or hospitalized with COVID-19. Secondly, to describe health outcomes amongst children/adolescents diagnosed with previous seasonal influenza. Design International network cohort. Setting Real-world data from European primary care records (France/Germany/Spain), South Korean claims and US claims and hospital databases. Participants Diagnosed and/or hospitalized children/adolescents with COVID-19 at age <18 between January and June 2020; diagnosed with influenza in 2017-2018. Main outcome measures Baseline demographics and comorbidities, symptoms, 30-day in-hospital treatments and outcomes including hospitalization, pneumonia, acute respiratory distress syndrome (ARDS), multi-system inflammatory syndrome (MIS-C), and death. Results A total of 55,270 children/adolescents diagnosed and 3,693 hospitalized with COVID-19 and 1,952,693 diagnosed with influenza were studied. Comorbidities including neurodevelopmental disorders, heart disease, and cancer were all more common among those hospitalized vs diagnosed with COVID-19. The most common COVID-19 symptom was fever. Dyspnea, bronchiolitis, anosmia and gastrointestinal symptoms were more common in COVID-19 than influenza. In-hospital treatments for COVID-19 included repurposed medications (<10%), and adjunctive therapies: systemic corticosteroids (6.8% to 37.6%), famotidine (9.0% to 28.1%), and antithrombotics such as aspirin (2.0% to 21.4%), heparin (2.2% to 18.1%), and enoxaparin (2.8% to 14.8%). Hospitalization was observed in 0.3% to 1.3% of the COVID-19 diagnosed cohort, with undetectable (N<5 per database) 30-day fatality. Thirty-day outcomes including pneumonia, ARDS, and MIS-C were more frequent in COVID-19 than influenza. Conclusions Despite negligible fatality, complications including pneumonia, ARDS and MIS-C were more frequent in children/adolescents with COVID-19 than with influenza. Dyspnea, anosmia and gastrointestinal symptoms could help differential diagnosis. A wide range of medications were used for the inpatient management of pediatric COVID-19.
Collapse
|
60
|
Golozar A, Lai LYH, Sena AG, Vizcaya D, Schilling LM, Huser V, Nyberg F, Duvall SL, Morales DR, Alshammari TM, Abedtash H, Ahmed WUR, Alser O, Alghoul H, Zhang Y, Gong M, Guan Y, Areia C, Jonnagaddala J, Shah K, Lane JC, Prats-Uribe A, Posada JD, Shah NH, Subbian V, Zhang L, Abrahão MTF, Rijnbeek PR, You SC, Casajust P, Roel E, Recalde M, Fernández-Bertolín S, Andryc A, Thomas JA, Wilcox AB, Fortin S, Blacketer C, DeFalco F, Natarajan K, Falconer T, Spotnitz M, Ostropolets A, Hripcsak G, Suchard M, Lynch KE, Matheny ME, Williams A, Reich C, Duarte-Salles T, Kostka K, Ryan PB, Prieto-Alhambra D. Baseline phenotype and 30-day outcomes of people tested for COVID-19: an international network cohort including >3.32 million people tested with real-time PCR and >219,000 tested positive for SARS-CoV-2 in South Korea, Spain and the United States. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.10.25.20218875. [PMID: 33140068 PMCID: PMC7605581 DOI: 10.1101/2020.10.25.20218875] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Early identification of symptoms and comorbidities most predictive of COVID-19 is critical to identify infection, guide policies to effectively contain the pandemic, and improve health systems' response. Here, we characterised socio-demographics and comorbidity in 3,316,107persons tested and 219,072 persons tested positive for SARS-CoV-2 since January 2020, and their key health outcomes in the month following the first positive test. Routine care data from primary care electronic health records (EHR) from Spain, hospital EHR from the United States (US), and claims data from South Korea and the US were used. The majority of study participants were women aged 18-65 years old. Positive/tested ratio varied greatly geographically (2.2:100 to 31.2:100) and over time (from 50:100 in February-April to 6.8:100 in May-June). Fever, cough and dyspnoea were the most common symptoms at presentation. Between 4%-38% required admission and 1-10.5% died within a month from their first positive test. Observed disparity in testing practices led to variable baseline characteristics and outcomes, both nationally (US) and internationally. Our findings highlight the importance of large scale characterization of COVID-19 international cohorts to inform planning and resource allocation including testing as countries face a second wave.
Collapse
|
61
|
Burn E, You SC, Sena AG, Kostka K, Abedtash H, Abrahão MTF, Alberga A, Alghoul H, Alser O, Alshammari TM, Aragon M, Areia C, Banda JM, Cho J, Culhane AC, Davydov A, DeFalco FJ, Duarte-Salles T, DuVall S, Falconer T, Fernandez-Bertolin S, Gao W, Golozar A, Hardin J, Hripcsak G, Huser V, Jeon H, Jing Y, Jung CY, Kaas-Hansen BS, Kaduk D, Kent S, Kim Y, Kolovos S, Lane JCE, Lee H, Lynch KE, Makadia R, Matheny ME, Mehta PP, Morales DR, Natarajan K, Nyberg F, Ostropolets A, Park RW, Park J, Posada JD, Prats-Uribe A, Rao G, Reich C, Rho Y, Rijnbeek P, Schilling LM, Schuemie M, Shah NH, Shoaibi A, Song S, Spotnitz M, Suchard MA, Swerdel JN, Vizcaya D, Volpe S, Wen H, Williams AE, Yimer BB, Zhang L, Zhuk O, Prieto-Alhambra D, Ryan P. Deep phenotyping of 34,128 adult patients hospitalised with COVID-19 in an international network study. Nat Commun 2020; 11:5009. [PMID: 33024121 PMCID: PMC7538555 DOI: 10.1038/s41467-020-18849-z] [Citation(s) in RCA: 66] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/10/2020] [Indexed: 01/08/2023] Open
Abstract
Comorbid conditions appear to be common among individuals hospitalised with coronavirus disease 2019 (COVID-19) but estimates of prevalence vary and little is known about the prior medication use of patients. Here, we describe the characteristics of adults hospitalised with COVID-19 and compare them with influenza patients. We include 34,128 (US: 8362, South Korea: 7341, Spain: 18,425) COVID-19 patients, summarising between 4811 and 11,643 unique aggregate characteristics. COVID-19 patients have been majority male in the US and Spain, but predominantly female in South Korea. Age profiles vary across data sources. Compared to 84,585 individuals hospitalised with influenza in 2014-19, COVID-19 patients have more typically been male, younger, and with fewer comorbidities and lower medication use. While protecting groups vulnerable to influenza is likely a useful starting point in the response to COVID-19, strategies will likely need to be broadened to reflect the particular characteristics of individuals being hospitalised with COVID-19.
Collapse
|
62
|
Li RC, Asch SM, Shah NH. Developing a delivery science for artificial intelligence in healthcare. NPJ Digit Med 2020; 3:107. [PMID: 32885053 PMCID: PMC7443141 DOI: 10.1038/s41746-020-00318-y] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 07/06/2020] [Indexed: 11/09/2022] Open
Abstract
Artificial Intelligence (AI) has generated a large amount of excitement in healthcare, mostly driven by the emergence of increasingly accurate machine learning models. However, the promise of AI delivering scalable and sustained value for patient care in the real world setting has yet to be realized. In order to safely and effectively bring AI into use in healthcare, there needs to be a concerted effort around not just the creation, but also the delivery of AI. This AI "delivery science" will require a broader set of tools, such as design thinking, process improvement, and implementation science, as well as a broader definition of what AI will look like in practice, which includes not just machine learning models and their predictions, but also the new systems for care delivery that they enable. The careful design, implementation, and evaluation of these AI enabled systems will be important in the effort to understand how AI can improve healthcare.
Collapse
|
63
|
Röltgen K, Wirz OF, Stevens BA, Powell AE, Hogan CA, Najeeb J, Hunter M, Sahoo MK, Huang C, Yamamoto F, Manalac J, Otrelo-Cardoso AR, Pham TD, Rustagi A, Rogers AJ, Shah NH, Blish CA, Cochran JR, Nadeau KC, Jardetzky TS, Zehnder JL, Wang TT, Kim PS, Gombar S, Tibshirani R, Pinsky BA, Boyd SD. SARS-CoV-2 Antibody Responses Correlate with Resolution of RNAemia But Are Short-Lived in Patients with Mild Illness. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020:2020.08.15.20175794. [PMID: 32839786 PMCID: PMC7444305 DOI: 10.1101/2020.08.15.20175794] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
SARS-CoV-2-specific antibodies, particularly those preventing viral spike receptor binding domain (RBD) interaction with host angiotensin-converting enzyme 2 (ACE2) receptor, could offer protective immunity, and may affect clinical outcomes of COVID-19 patients. We analyzed 625 serial plasma samples from 40 hospitalized COVID-19 patients and 170 SARS-CoV-2-infected outpatients and asymptomatic individuals. Severely ill patients developed significantly higher SARS-CoV-2-specific antibody responses than outpatients and asymptomatic individuals. The development of plasma antibodies was correlated with decreases in viral RNAemia, consistent with potential humoral immune clearance of virus. Using a novel competition ELISA, we detected antibodies blocking RBD-ACE2 interactions in 68% of inpatients and 40% of outpatients tested. Cross-reactive antibodies recognizing SARS-CoV RBD were found almost exclusively in hospitalized patients. Outpatient and asymptomatic individuals' serological responses to SARS-CoV-2 decreased within 2 months, suggesting that humoral protection may be short-lived.
Collapse
|
64
|
Bernert RA, Hilberg AM, Melia R, Kim JP, Shah NH, Abnousi F. Artificial Intelligence and Suicide Prevention: A Systematic Review of Machine Learning Investigations. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E5929. [PMID: 32824149 PMCID: PMC7460360 DOI: 10.3390/ijerph17165929] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 07/28/2020] [Indexed: 12/12/2022]
Abstract
Suicide is a leading cause of death that defies prediction and challenges prevention efforts worldwide. Artificial intelligence (AI) and machine learning (ML) have emerged as a means of investigating large datasets to enhance risk detection. A systematic review of ML investigations evaluating suicidal behaviors was conducted using PubMed/MEDLINE, PsychInfo, Web-of-Science, and EMBASE, employing search strings and MeSH terms relevant to suicide and AI. Databases were supplemented by hand-search techniques and Google Scholar. Inclusion criteria: (1) journal article, available in English, (2) original investigation, (3) employment of AI/ML, (4) evaluation of a suicide risk outcome. N = 594 records were identified based on abstract search, and 25 hand-searched reports. N = 461 reports remained after duplicates were removed, n = 316 were excluded after abstract screening. Of n = 149 full-text articles assessed for eligibility, n = 87 were included for quantitative synthesis, grouped according to suicide behavior outcome. Reports varied widely in methodology and outcomes. Results suggest high levels of risk classification accuracy (>90%) and Area Under the Curve (AUC) in the prediction of suicidal behaviors. We report key findings and central limitations in the use of AI/ML frameworks to guide additional research, which hold the potential to impact suicide on broad scale.
Collapse
|
65
|
Fries JA, Steinberg E, Khattar S, Fleming SL, Posada J, Callahan A, Shah NH. Ontology-driven weak supervision for clinical entity classification in electronic health records. ARXIV 2020:arXiv:2008.01972v2. [PMID: 32793768 PMCID: PMC7418750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Figures] [Subscribe] [Scholar Register] [Revised: 04/06/2021] [Indexed: 12/24/2022]
Abstract
In the electronic health record, using clinical notes to identify entities such as disorders and their temporality (e.g. the order of an event relative to a time index) can inform many important analyses. However, creating training data for clinical entity tasks is time consuming and sharing labeled data is challenging due to privacy concerns. The information needs of the COVID-19 pandemic highlight the need for agile methods of training machine learning models for clinical notes. We present Trove, a framework for weakly supervised entity classification using medical ontologies and expert-generated rules. Our approach, unlike hand-labeled notes, is easy to share and modify, while offering performance comparable to learning from manually labeled training data. In this work, we validate our framework on six benchmark tasks and demonstrate Trove's ability to analyze the records of patients visiting the emergency department at Stanford Health Care for COVID-19 presenting symptoms and risk factors.
Collapse
|
66
|
Kashyap S, Gombar S, Yadlowsky S, Callahan A, Fries J, Pinsky BA, Shah NH. Measure what matters: Counts of hospitalized patients are a better metric for health system capacity planning for a reopening. J Am Med Inform Assoc 2020; 27:1026-1131. [PMID: 32548636 PMCID: PMC7337779 DOI: 10.1093/jamia/ocaa076] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 04/21/2020] [Accepted: 04/25/2020] [Indexed: 12/20/2022] Open
Abstract
Objective Responding to the COVID-19 pandemic requires accurate forecasting of health system capacity requirements using readily available inputs. We examined whether testing and hospitalization data could help quantify the anticipated burden on the health system given shelter-in-place (SIP) order. Materials and Methods 16,103 SARS-CoV-2 RT-PCR tests were performed on 15,807 patients at Stanford facilities between March 2 and April 11, 2020. We analyzed the fraction of tested patients that were confirmed positive for COVID-19, the fraction of those needing hospitalization, and the fraction requiring ICU admission over the 40 days between March 2nd and April 11th 2020. Results We find a marked slowdown in the hospitalization rate within ten days of SIP even as cases continued to rise. We also find a shift towards younger patients in the age distribution of those testing positive for COVID-19 over the four weeks of SIP. The impact of this shift is a divergence between increasing positive case confirmations and slowing new hospitalizations, both of which affects the demand on health systems. Conclusion Without using local hospitalization rates and the age distribution of positive patients, current models are likely to overestimate the resource burden of COVID-19. It is imperative that health systems start using these data to quantify effects of SIP and aid reopening planning.
Collapse
|
67
|
Gombar S, Chang M, Hogan CA, Zehnder J, Boyd S, Pinsky BA, Shah NH. Persistent detection of SARS-CoV-2 RNA in patients and healthcare workers with COVID-19. J Clin Virol 2020; 129:104477. [PMID: 32505778 PMCID: PMC7260561 DOI: 10.1016/j.jcv.2020.104477] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 05/26/2020] [Indexed: 12/20/2022]
Abstract
BACKGROUND Current guidelines for returning health care workers (HCW) to service after a positive SARS-CoV-2 RT-PCR test and ceasing of transmission precautions for patients is based on two general strategies. A test-based strategy that requires negative respiratory RT-PCR tests obtained after the resolution of symptoms. Alternatively, due to the limited availability of testing, many sites employ a symptom-based strategy that recommends excluding HCW from the workforce and keeping patients on contact precautions until a fixed period of time has elapsed from symptom recovery. The underlying assumption of the symptom-based strategy is that waiting for a fixed period of time is a surrogate for negative RT-PCR testing, which itself is a surrogate for the absence of shedding infectious virus. OBJECTIVES To better understand the appropriate length of symptom based return to work and contact precaution strategies. STUDY DESIGN We performed an observational analysis of 150 patients and HCW that transitioned from RT-PCR SARS-CoV-2 positive to negative over the course of 2 months at a US academic medical center. RESULTS We found that the average time to transition from RT-PCR positive to negative was 24 days after symptom onset and 10 % remained positive even 33 days after symptom onset. No difference was seen in HCW and patients. CONCLUSIONS These findings suggest until definitive evidence of the length of infective viral shedding is obtained that the fixed length of time before returning to work or ceasing contract precautions be revised to over one-month.
Collapse
|
68
|
Callahan A, Steinberg E, Fries JA, Gombar S, Patel B, Corbin CK, Shah NH. Estimating the efficacy of symptom-based screening for COVID-19. NPJ Digit Med 2020; 3:95. [PMID: 32695885 PMCID: PMC7359358 DOI: 10.1038/s41746-020-0300-0] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 06/16/2020] [Indexed: 11/28/2022] Open
Abstract
There is substantial interest in using presenting symptoms to prioritize testing for COVID-19 and establish symptom-based surveillance. However, little is currently known about the specificity of COVID-19 symptoms. To assess the feasibility of symptom-based screening for COVID-19, we used data from tests for common respiratory viruses and SARS-CoV-2 in our health system to measure the ability to correctly classify virus test results based on presenting symptoms. Based on these results, symptom-based screening may not be an effective strategy to identify individuals who should be tested for SARS-CoV-2 infection or to obtain a leading indicator of new COVID-19 cases.
Collapse
|
69
|
Burn E, You SC, Sena A, Kostka K, Abedtash H, Abrahao MTF, Alberga A, Alghoul H, Alser O, Alshammari TM, Aragon M, Areia C, Banda JM, Cho J, Culhane AC, Davydov A, DeFalco FJ, Duarte-Salles T, DuVall SL, Falconer T, Fernandez-Bertolin S, Gao W, Golozar A, Hardin J, Hripcsak G, Huser V, Jeon H, Jing Y, Jung CY, Kaas-Hansen BS, Kaduk D, Kent S, Kim Y, Kolovos S, Lane J, Lee H, Lynch KE, Makadia R, Matheny ME, Mehta P, Morales DR, Natarajan K, Nyberg F, Ostropolets A, Park RW, Park J, Posada JD, Prats-Uribe A, Rao GA, Reich C, Rho Y, Rijnbeek P, Schilling LM, Schuemie M, Shah NH, Shoaibi A, Song S, Spotnitz M, Suchard MA, Swerdel J, Vizcaya D, Volpe S, Wen H, Williams AE, Yimer BB, Zhang L, Zhuk O, Prieto-Alhambra D, Ryan P. Deep phenotyping of 34,128 patients hospitalised with COVID-19 and a comparison with 81,596 influenza patients in America, Europe and Asia: an international network study. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020. [PMID: 32511443 DOI: 10.1101/2020.04.22.20074336] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Background In this study we phenotyped individuals hospitalised with coronavirus disease 2019 (COVID-19) in depth, summarising entire medical histories, including medications, as captured in routinely collected data drawn from databases across three continents. We then compared individuals hospitalised with COVID-19 to those previously hospitalised with influenza. Methods We report demographics, previously recorded conditions and medication use of patients hospitalised with COVID-19 in the US (Columbia University Irving Medical Center [CUIMC], Premier Healthcare Database [PHD], UCHealth System Health Data Compass Database [UC HDC], and the Department of Veterans Affairs [VA OMOP]), in South Korea (Health Insurance Review & Assessment [HIRA]), and Spain (The Information System for Research in Primary Care [SIDIAP] and HM Hospitales [HM]). These patients were then compared with patients hospitalised with influenza in 2014-19. Results 34,128 (US: 8,362, South Korea: 7,341, Spain: 18,425) individuals hospitalised with COVID-19 were included. Between 4,811 (HM) and 11,643 (CUIMC) unique aggregate characteristics were extracted per patient, with all summarised in an accompanying interactive website (http://evidence.ohdsi.org/Covid19CharacterizationHospitalization/). Patients were majority male in the US (CUIMC: 52%, PHD: 52%, UC HDC: 54%, VA OMOP: 94%,) and Spain (SIDIAP: 54%, HM: 60%), but were predominantly female in South Korea (HIRA: 60%). Age profiles varied across data sources. Prevalence of asthma ranged from 4% to 15%, diabetes from 13% to 43%, and hypertensive disorder from 24% to 70% across data sources. Between 14% and 33% were taking drugs acting on the renin-angiotensin system in the 30 days prior to hospitalisation. Compared to 81,596 individuals hospitalised with influenza in 2014-19, patients admitted with COVID-19 were more typically male, younger, and healthier, with fewer comorbidities and lower medication use. Conclusions We provide a detailed characterisation of patients hospitalised with COVID-19. Protecting groups known to be vulnerable to influenza is a useful starting point to minimize the number of hospital admissions needed for COVID-19. However, such strategies will also likely need to be broadened so as to reflect the particular characteristics of individuals hospitalised with COVID-19.
Collapse
|
70
|
Miner AS, Haque A, Fries JA, Fleming SL, Wilfley DE, Terence Wilson G, Milstein A, Jurafsky D, Arnow BA, Stewart Agras W, Fei-Fei L, Shah NH. Assessing the accuracy of automatic speech recognition for psychotherapy. NPJ Digit Med 2020; 3:82. [PMID: 32550644 PMCID: PMC7270106 DOI: 10.1038/s41746-020-0285-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2019] [Accepted: 04/30/2020] [Indexed: 01/17/2023] Open
Abstract
Accurate transcription of audio recordings in psychotherapy would improve therapy effectiveness, clinician training, and safety monitoring. Although automatic speech recognition software is commercially available, its accuracy in mental health settings has not been well described. It is unclear which metrics and thresholds are appropriate for different clinical use cases, which may range from population descriptions to individual safety monitoring. Here we show that automatic speech recognition is feasible in psychotherapy, but further improvements in accuracy are needed before widespread use. Our HIPAA-compliant automatic speech recognition system demonstrated a transcription word error rate of 25%. For depression-related utterances, sensitivity was 80% and positive predictive value was 83%. For clinician-identified harm-related sentences, the word error rate was 34%. These results suggest that automatic speech recognition may support understanding of language patterns and subgroup variation in existing treatments but may not be ready for individual-level safety surveillance.
Collapse
|
71
|
Callahan A, Shah NH, Chen JH. Research and Reporting Considerations for Observational Studies Using Electronic Health Record Data. Ann Intern Med 2020; 172:S79-S84. [PMID: 32479175 PMCID: PMC7413106 DOI: 10.7326/m19-0873] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
Electronic health records (EHRs) are an increasingly important source of real-world health care data for observational research. Analyses of data collected for purposes other than research require careful consideration of data quality as well as the general research and reporting principles relevant to observational studies. The core principles for observational research in general also apply to observational research using EHR data, and these are well addressed in prior literature and guidelines. This article provides additional recommendations for EHR-based research. Considerations unique to EHR-based studies include assessment of the accuracy of computer-executable cohort definitions that can incorporate unstructured data from clinical notes and management of data challenges, such as irregular sampling, missingness, and variation across time and place. Principled application of existing research and reporting guidelines alongside these additional considerations will improve the quality of EHR-based observational studies.
Collapse
|
72
|
Abstract
This study describes the prevalence of SARS-CoV-2 co-infection with noncoronavirus respiratory pathogens in a sample of symptomatic patients undergoing PCR testing in March 2020.
Collapse
|
73
|
Long DR, Gombar S, Hogan CA, Greninger AL, OReilly Shah V, Bryson-Cahn C, Stevens B, Rustagi A, Jerome KR, Kong CS, Zehnder J, Shah NH, Weiss NS, Pinsky BA, Sunshine J. Occurrence and Timing of Subsequent SARS-CoV-2 RT-PCR Positivity Among Initially Negative Patients. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2020. [PMID: 32511542 DOI: 10.1101/2020.05.03.20089151] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
BACKGROUND SARS-CoV-2 reverse transcriptase polymerase chain reaction (RT-PCR) testing remains the cornerstone of laboratory-based identification of patients with COVID-19. As the availability and speed of SARS-CoV-2 testing platforms improve, results are increasingly relied upon to inform critical decisions related to therapy, use of personal protective equipment, and workforce readiness. However, early reports of RT-PCR test performance have left clinicians and the public with concerns regarding the reliability of this predominant testing modality and the interpretation of negative results. In this work, two independent research teams report the frequency of discordant SARS-CoV-2 test results among initially negative, repeatedly tested patients in regions of the United States with early community transmission and access to testing. METHODS All patients at the University of Washington (UW) and Stanford Health Care undergoing initial testing by nasopharyngeal (NP) swab between March 2nd and April 7th, 2020 were included. SARS-CoV-2 RT-PCR was performed targeting the N, RdRp, S, and E genes and ORF1ab, using a combination of Emergency Use Authorization laboratory-developed tests and commercial assays. Results through April 14th were extracted to allow for a complete 7-day observation period and an additional day for reporting. RESULTS A total of 23,126 SARS-CoV-2 RT-PCR tests (10,583 UW, 12,543 Stanford) were performed in 20,912 eligible patients (8,977 UW, 11,935 Stanford) undergoing initial testing by NP swab; 626 initially test-negative patients were re-tested within 7 days. Among this group, repeat testing within 7 days yielded a positive result in 3.5% (4.3% UW, 2.8% Stanford) of cases, suggesting an initial false negative RT-PCR result; the majority (96.5%) of patients with an initial negative result who warranted reevaluation for any reason remained negative on all subsequent tests performed within this window. CONCLUSIONS Two independent research teams report the similar finding that, among initially negative patients subjected to repeat SARS-CoV-2 RT-PCR testing, the occurrence of a newly positive result within 7 days is uncommon. These observations suggest that false negative results at the time of initial presentation do occur, but potentially at a lower frequency than is currently believed. Although it is not possible to infer the clinical sensitivity of NP SARS-CoV-2 RT-PCR testing using these data, they may be used in combination with other reports to guide the use and interpretation of this common testing modality.
Collapse
|
74
|
Weng C, Shah NH, Hripcsak G. Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability. J Biomed Inform 2020; 105:103433. [PMID: 32335224 PMCID: PMC7179504 DOI: 10.1016/j.jbi.2020.103433] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 04/20/2020] [Indexed: 01/07/2023]
|
75
|
Larson DB, Magnus DC, Lungren MP, Shah NH, Langlotz CP. Ethics of Using and Sharing Clinical Imaging Data for Artificial Intelligence: A Proposed Framework. Radiology 2020; 295:675-682. [PMID: 32208097 DOI: 10.1148/radiol.2020192536] [Citation(s) in RCA: 78] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
In this article, the authors propose an ethical framework for using and sharing clinical data for the development of artificial intelligence (AI) applications. The philosophical premise is as follows: when clinical data are used to provide care, the primary purpose for acquiring the data is fulfilled. At that point, clinical data should be treated as a form of public good, to be used for the benefit of future patients. In their 2013 article, Faden et al argued that all who participate in the health care system, including patients, have a moral obligation to contribute to improving that system. The authors extend that framework to questions surrounding the secondary use of clinical data for AI applications. Specifically, the authors propose that all individuals and entities with access to clinical data become data stewards, with fiduciary (or trust) responsibilities to patients to carefully safeguard patient privacy, and to the public to ensure that the data are made widely available for the development of knowledge and tools to benefit future patients. According to this framework, the authors maintain that it is unethical for providers to "sell" clinical data to other parties by granting access to clinical data, especially under exclusive arrangements, in exchange for monetary or in-kind payments that exceed costs. The authors also propose that patient consent is not required before the data are used for secondary purposes when obtaining such consent is prohibitively costly or burdensome, as long as mechanisms are in place to ensure that ethical standards are strictly followed. Rather than debate whether patients or provider organizations "own" the data, the authors propose that clinical data are not owned at all in the traditional sense, but rather that all who interact with or control the data have an obligation to ensure that the data are used for the benefit of future patients and society.
Collapse
|