1
|
Breger A, Selby I, Roberts M, Babar J, Gkrania-Klotsas E, Preller J, Escudero Sánchez L, Rudd JHF, Aston JAD, Weir-McCall JR, Sala E, Schönlieb CB. A pipeline to further enhance quality, integrity and reusability of the NCCID clinical data. Sci Data 2023; 10:493. [PMID: 37500661 PMCID: PMC10374610 DOI: 10.1038/s41597-023-02340-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 06/27/2023] [Indexed: 07/29/2023] Open
Abstract
The National COVID-19 Chest Imaging Database (NCCID) is a centralized UK database of thoracic imaging and corresponding clinical data. It is made available by the National Health Service Artificial Intelligence (NHS AI) Lab to support the development of machine learning tools focused on Coronavirus Disease 2019 (COVID-19). A bespoke cleaning pipeline for NCCID, developed by the NHSx, was introduced in 2021. We present an extension to the original cleaning pipeline for the clinical data of the database. It has been adjusted to correct additional systematic inconsistencies in the raw data such as patient sex, oxygen levels and date values. The most important changes will be discussed in this paper, whilst the code and further explanations are made publicly available on GitLab. The suggested cleaning will allow global users to work with more consistent data for the development of machine learning tools without being an expert. In addition, it highlights some of the challenges when working with clinical multi-center data and includes recommendations for similar future initiatives.
Collapse
Affiliation(s)
- Anna Breger
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK.
- Center of Medical Physics and Biomedical Engineering, Medical University of Vienna, Vienna, Austria.
| | - Ian Selby
- Department of Radiology, University of Cambridge, Cambridge, UK.
- Cambridge University Hospitals NHS Trust, Cambridge, UK.
| | - Michael Roberts
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Judith Babar
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Trust, Cambridge, UK
| | - Effrossyni Gkrania-Klotsas
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Jacobus Preller
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Lorena Escudero Sánchez
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cancer Research UK (CRUK) Cambridge Centre, Cambridge, UK
| | - James H F Rudd
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - John A D Aston
- Department of Pure Mathematics and Mathematical Statistics, University of Cambridge, Cambridge, UK
| | - Jonathan R Weir-McCall
- Department of Radiology, University of Cambridge, Cambridge, UK
- Cambridge University Hospitals NHS Trust, Cambridge, UK
- Department of Radiology, Royal Papworth Hospital, Cambridge, UK
| | - Evis Sala
- Advanced Radiodiagnostics Centre, Fondazione Policlinico Universitario Agostino Gemelli, Rome, Italy
- Università Cattolica del Sacro Cuore, Rome, Italy
| | - Carola-Bibiane Schönlieb
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|
2
|
Rangelov B, Young A, Lilaonitkul W, Aslani S, Taylor P, Guðmundsson E, Yang Q, Hu Y, Hurst JR, Hawkes DJ, Jacob J. Delineating COVID-19 subgroups using routine clinical data identifies distinct in-hospital outcomes. Sci Rep 2023; 13:9986. [PMID: 37339958 PMCID: PMC10282086 DOI: 10.1038/s41598-023-32469-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 03/28/2023] [Indexed: 06/22/2023] Open
Abstract
The COVID-19 pandemic has been a great challenge to healthcare systems worldwide. It highlighted the need for robust predictive models which can be readily deployed to uncover heterogeneities in disease course, aid decision-making and prioritise treatment. We adapted an unsupervised data-driven model-SuStaIn, to be utilised for short-term infectious disease like COVID-19, based on 11 commonly recorded clinical measures. We used 1344 patients from the National COVID-19 Chest Imaging Database (NCCID), hospitalised for RT-PCR confirmed COVID-19 disease, splitting them equally into a training and an independent validation cohort. We discovered three COVID-19 subtypes (General Haemodynamic, Renal and Immunological) and introduced disease severity stages, both of which were predictive of distinct risks of in-hospital mortality or escalation of treatment, when analysed using Cox Proportional Hazards models. A low-risk Normal-appearing subtype was also discovered. The model and our full pipeline are available online and can be adapted for future outbreaks of COVID-19 or other infectious disease.
Collapse
Affiliation(s)
- Bojidar Rangelov
- Satsuma Lab, Centre for Medical Image Computing (CMIC), University College London, London, UK.
| | - Alexandra Young
- Satsuma Lab, Centre for Medical Image Computing (CMIC), University College London, London, UK
- Department of Neuroimaging, King's College London, London, UK
| | | | - Shahab Aslani
- Satsuma Lab, Centre for Medical Image Computing (CMIC), University College London, London, UK
| | - Paul Taylor
- Institute of Health Informatics, University College London, London, UK
| | - Eyjólfur Guðmundsson
- Satsuma Lab, Centre for Medical Image Computing (CMIC), University College London, London, UK
| | - Qianye Yang
- Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
| | - Yipeng Hu
- Wellcome/EPSRC Centre for Interventional and Surgical Sciences, University College London, London, UK
- Centre for Medical Image Computing, University College London, London, UK
| | - John R Hurst
- UCL Respiratory, University College London, London, UK
| | - David J Hawkes
- Centre for Medical Image Computing, University College London, London, UK
| | - Joseph Jacob
- Satsuma Lab, Centre for Medical Image Computing (CMIC), University College London, London, UK
| |
Collapse
|
3
|
Al-Yousif N, Komanduri S, Qurashi H, Korzhuk A, Lawal HO, Abourizk N, Schaefer C, Mitchell KJ, Dietz CM, Hughes EK, Brandt CS, Fitzgerald GM, Joyce R, Chaudhry AS, Kotok D, Rivera JD, Kim AI, Shettigar S, Lavina A, Girard CE, Gillenwater SR, Hadeh A, Bain W, Shah FA, Bittner M, Lu M, Prendergast N, Evankovich J, Golubykh K, Ramesh N, Jacobs JJ, Kessinger C, Methe B, Lee JS, Morris A, McVerry BJ, Kitsios GD. Inter-rater reliability and prognostic value of baseline Radiographic Assessment of Lung Edema (RALE) scores in observational cohort studies of inpatients with COVID-19. BMJ Open 2023; 13:e066626. [PMID: 36635036 PMCID: PMC9842602 DOI: 10.1136/bmjopen-2022-066626] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Accepted: 12/16/2022] [Indexed: 01/14/2023] Open
Abstract
OBJECTIVES To reliably quantify the radiographic severity of COVID-19 pneumonia with the Radiographic Assessment of Lung Edema (RALE) score on clinical chest X-rays among inpatients and examine the prognostic value of baseline RALE scores on COVID-19 clinical outcomes. SETTING Hospitalised patients with COVID-19 in dedicated wards and intensive care units from two different hospital systems. PARTICIPANTS 425 patients with COVID-19 in a discovery data set and 415 patients in a validation data set. PRIMARY AND SECONDARY OUTCOMES We measured inter-rater reliability for RALE score annotations by different reviewers and examined for associations of consensus RALE scores with the level of respiratory support, demographics, physiologic variables, applied therapies, plasma host-response biomarkers, SARS-CoV-2 RNA load and clinical outcomes. RESULTS Inter-rater agreement for RALE scores improved from fair to excellent following reviewer training and feedback (intraclass correlation coefficient of 0.85 vs 0.93, respectively). In the discovery cohort, the required level of respiratory support at the time of CXR acquisition (supplemental oxygen or non-invasive ventilation (n=178); invasive-mechanical ventilation (n=234), extracorporeal membrane oxygenation (n=13)) was significantly associated with RALE scores (median (IQR): 20.0 (14.1-26.7), 26.0 (20.5-34.0) and 44.5 (34.5-48.0), respectively, p<0.0001). Among invasively ventilated patients, RALE scores were significantly associated with worse respiratory mechanics (plateau and driving pressure) and gas exchange metrics (PaO2/FiO2 and ventilatory ratio), as well as higher plasma levels of IL-6, soluble receptor of advanced glycation end-products and soluble tumour necrosis factor receptor 1 (p<0.05). RALE scores were independently associated with 90-day survival in a multivariate Cox proportional hazards model (adjusted HR 1.04 (1.02-1.07), p=0.002). We replicated the significant associations of RALE scores with baseline disease severity and mortality in the independent validation data set. CONCLUSIONS With a reproducible method to measure radiographic severity in COVID-19, we found significant associations with clinical and physiologic severity, host inflammation and clinical outcomes. The incorporation of radiographic severity assessments in clinical decision-making may provide important guidance for prognostication and treatment allocation in COVID-19.
Collapse
Affiliation(s)
- Nameer Al-Yousif
- Internal Medicine Residency Program, UPMC Mercy, Pittsburgh, Pennsylvania, USA
- Division of Pulmonary, Critical Care, and Sleep Medicine, MetroHealth Medical Center, Cleveland, Ohio, USA
| | - Saketram Komanduri
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Hafiz Qurashi
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Anatoliy Korzhuk
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Halimat O Lawal
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Nicholas Abourizk
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Caitlin Schaefer
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Kevin J Mitchell
- Computer Vision Group, Veytel LLC, Pittsburgh, Pennsylvania, USA
| | | | - Ellen K Hughes
- Computer Vision Group, Veytel LLC, Pittsburgh, Pennsylvania, USA
| | - Clara S Brandt
- Computer Vision Group, Veytel LLC, Pittsburgh, Pennsylvania, USA
| | | | - Robin Joyce
- Computer Vision Group, Veytel LLC, Pittsburgh, Pennsylvania, USA
| | - Asmaa S Chaudhry
- Computer Vision Group, Veytel LLC, Pittsburgh, Pennsylvania, USA
| | - Daniel Kotok
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Jose D Rivera
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Andrew I Kim
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Shruti Shettigar
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Allen Lavina
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Christine E Girard
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Samantha R Gillenwater
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - Anas Hadeh
- Department of Pulmonary and Critical Care, Cleveland Clinic Florida, Weston, Florida, USA
| | - William Bain
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Faraaz A Shah
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Matthew Bittner
- Internal Medicine Residency Program, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Michael Lu
- Internal Medicine Residency Program, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Niall Prendergast
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - John Evankovich
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Konstantin Golubykh
- Internal Medicine Residency Program, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Navitha Ramesh
- Department of Pulmonary and Critical Care, UPMC Pinnacle Harrisburg, Harrisburg, Pennsylvania, USA
| | - Jana J Jacobs
- Department of Medicine, Division of Infectious Diseases, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | - Cathy Kessinger
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Barbara Methe
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Center for Medicine and the Microbiome, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Janet S Lee
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Alison Morris
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Center for Medicine and the Microbiome, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Bryan J McVerry
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Center for Medicine and the Microbiome, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Georgios D Kitsios
- Division of Pulmonary, Allergy and Critical Care Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Acute Lung Injury Center of Excellence, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
- Center for Medicine and the Microbiome, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| |
Collapse
|
4
|
Bioinformatics, Computational Informatics, and Modeling Approaches to the Design of mRNA COVID-19 Vaccine Candidates. COMPUTATION 2022. [DOI: 10.3390/computation10070117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
This article is devoted to applying bioinformatics and immunoinformatics approaches for the development of a multi-epitope mRNA vaccine against the spike glycoproteins of circulating SARS-CoV-2 variants in selected African countries. The study’s relevance is dictated by the fact that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) began its global threat at the end of 2019 and since then has had a devastating impact on the whole world. Measures to reduce threats from the pandemic include social restrictions, restrictions on international travel, and vaccine development. In most cases, vaccine development depends on the spike glycoprotein, which serves as a medium for its entry into host cells. Although several variants of SARS-CoV-2 have emerged from mutations crossing continental boundaries, about 6000 delta variants have been reported along the coast of more than 20 countries in Africa, with South Africa accounting for the highest percentage. This also applies to the omicron variant of the SARS-CoV-2 virus in South Africa. The authors suggest that bioinformatics and immunoinformatics approaches be used to develop a multi-epitope mRNA vaccine against the spike glycoproteins of circulating SARS-CoV-2 variants in selected African countries. Various immunoinformatics tools have been used to predict T- and B-lymphocyte epitopes. The epitopes were further subjected to multiple evaluations to select epitopes that could elicit a sustained immunological response. The candidate vaccine consisted of seven epitopes, a highly immunogenic adjuvant, an MHC I-targeting domain (MITD), a signal peptide, and linkers. The molecular weight (MW) was predicted to be 223.1 kDa, well above the acceptable threshold of 110 kDa on an excellent vaccine candidate. In addition, the results showed that the candidate vaccine was antigenic, non-allergenic, non-toxic, thermostable, and hydrophilic. The vaccine candidate has good population coverage, with the highest range in East Africa (80.44%) followed by South Africa (77.23%). West Africa and North Africa have 76.65% and 76.13%, respectively, while Central Africa (75.64%) has minimal coverage. Among seven epitopes, no mutations were observed in 100 randomly selected SARS-CoV-2 spike glycoproteins in the study area. Evaluation of the secondary structure of the vaccine constructs revealed a stabilized structure showing 36.44% alpha-helices, 20.45% drawn filaments, and 33.38% random helices. Molecular docking of the TLR4 vaccine showed that the simulated vaccine has a high binding affinity for TLR-4, reflecting its ability to stimulate the innate and adaptive immune response.
Collapse
|
5
|
Al-Yousif N, Komanduri S, Qurashi H, Korzhuk A, Lawal HO, Abourizk N, Schaefer C, Mitchell KJ, Dietz CM, Hughes EK, Brandt CS, Fitzgerald GM, Joyce R, Chaudhry AS, Kotok D, Rivera JD, Kim AI, Shettigar S, Lavina A, Girard CE, Gillenwater SR, Hadeh A, Bain W, Shah FA, Bittner M, Lu M, Prendergast N, Evankovich J, Golubykh K, Ramesh N, Jacobs JJ, Kessinger C, Methé B, Lee JS, Morris A, McVerry BJ, Kitsios GD. Radiographic Assessment of Lung Edema (RALE) Scores are Highly Reproducible and Prognostic of Clinical Outcomes for Inpatients with COVID-19. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2022:2022.06.10.22276249. [PMID: 35734089 PMCID: PMC9216727 DOI: 10.1101/2022.06.10.22276249] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
INTRODUCTION Chest imaging is necessary for diagnosis of COVID-19 pneumonia, but current risk stratification tools do not consider radiographic severity. We quantified radiographic heterogeneity among inpatients with COVID-19 with the Radiographic Assessment of Lung Edema (RALE) score on Chest X-rays (CXRs). METHODS We performed independent RALE scoring by ≥2 reviewers on baseline CXRs from 425 inpatients with COVID-19 (discovery dataset), we recorded clinical variables and outcomes, and measured plasma host-response biomarkers and SARS-CoV-2 RNA load from subjects with available biospecimens. RESULTS We found excellent inter-rater agreement for RALE scores (intraclass correlation co-efficient=0.93). The required level of respiratory support at the time of baseline CXRs (supplemental oxygen or non-invasive ventilation [n=178]; invasive-mechanical ventilation [n=234], extracorporeal membrane oxygenation [n=13]) was significantly associated with RALE scores (median [interquartile range]: 20.0[14.1-26.7], 26.0[20.5-34.0] and 44.5[34.5-48.0], respectively, p<0.0001). Among invasively-ventilated patients, RALE scores were significantly associated with worse respiratory mechanics (plateau and driving pressure) and gas exchange metrics (PaO2/FiO2 and ventilatory ratio), as well as higher plasma levels of IL-6, sRAGE and TNFR1 levels (p<0.05). RALE scores were independently associated with 90-day survival in a multivariate Cox proportional hazards model (adjusted hazard ratio 1.04[1.02-1.07], p=0.002). We validated significant associations of RALE scores with baseline severity and mortality in an independent dataset of 415 COVID-19 inpatients. CONCLUSION Reproducible assessment of radiographic severity revealed significant associations with clinical and physiologic severity, host-response biomarkers and clinical outcome in COVID-19 pneumonia. Incorporation of radiographic severity assessments may provide prognostic and treatment allocation guidance in patients hospitalized with COVID-19.
Collapse
|