1
|
Calcagnile M, Damiano F, Lobreglio G, Siculella L, Bozzetti MP, Forgez P, Malgoyre A, Libert N, Bucci C, Alifano M, Alifano P. In silico evidence that substitution of glycine for valine (p.G8V) in a common variant of TMPRSS2 isoform 1 increases accessibility to an endocytic signal: Implication for SARS-cov-2 entry into host cells and susceptibility to COVID-19. Biochimie 2024; 225:89-98. [PMID: 38754620 DOI: 10.1016/j.biochi.2024.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 05/01/2024] [Accepted: 05/03/2024] [Indexed: 05/18/2024]
Abstract
The TMPRSS2 protease plays a key role in the entry of the SARS-CoV-2 into cells. The TMPRSS2 gene is highly polymorphic in humans, and some polymorphisms may affect the susceptibility to COVID-19 or disease severity. rs75603675 (c.23G > T) is a missense variant that causes the replacement of glycine with valine at position 8 (p.G8V) in the TMPRSS2 isoform 1. According to GnomAD v4.0.0 database, the allele frequency of the rs75603675 on a global scale is 38.10 %, and range from 0.92 % in East Asian to 40.77 % in non-Finnish European (NFE) population. We analyzed the occurrence of the rs75603675 in two cohorts of patients, the first with severe/critical COVID-19 enrolled in a French hospital (42 patients), and the second with predominantly asymptomatic/pauci-symptomatic/mild COVID-19 enrolled in an Italian hospital (69 patients). We found that the TMPRSS2-c.23T minor allele frequency was similar in the two cohorts, 46.43 % and 46.38 %, respectively, and higher than the frequency in the NFE population (40.77 %). Chi-square test provided significant results (p < 0.05) when the genotype data (TMPRSS2-c.23T/c.23T homozygotes + TMPRSS2-c.23G/c.23T heterozygotes vs. TMPRSS2-c.23G/c.23G homozygotes) of the two patient groups were pooled and compared to the expected data for the NFE population, suggesting a possible pathogenetic mechanism of the p.G8V substitution. We explored the possible effects of the p.G8V substitution and found that the N-terminal region of the TMPRSS2 isoform 1 contains a signal for clathrin/AP-2-dependent endocytosis. In silico analysis predicted that the p.G8V substitution may increase the accessibility to the endocytic signal, which could help SARS-CoV-2 enter cells.
Collapse
Affiliation(s)
- Matteo Calcagnile
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Fabrizio Damiano
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Giambattista Lobreglio
- Clinical Pathology and Microbiology Unit, Vito Fazzi General Hospital, 73100, Lecce, Italy
| | - Luisa Siculella
- Department of Experimental Medicine, University of Salento, Lecce, Italy
| | - Maria Pia Bozzetti
- Department of Biological and Environmental Sciences and Technologies, University of Salento, Lecce, Italy
| | - Patricia Forgez
- INSERM UMR-S 1124 T3S, Eq 5 CELLULAR HOMEOSTASIS, CANCER and THERAPY, University of Paris, Campus Saint Germain, Paris, France
| | - Alexandra Malgoyre
- Institut de Recherche Biomedicale des Armées, French Armed Forces Health Services, Brétigny sur Orge, France; Ecole Du Val de Grâce, French Armed Forces Health Service, France; Laboratoire de Biologie de L'Exercice pour La Performance et La Santé, Université Evry-Paris-Saclay, Evry, France
| | - Nicolas Libert
- Ecole Du Val de Grâce, French Armed Forces Health Service, France; Hopital D'Instruction des Armées, French Armed Forces Health Services, Clamart, France
| | - Cecilia Bucci
- Department of Experimental Medicine, University of Salento, Lecce, Italy
| | - Marco Alifano
- Thoracic Surgery Department, Cochin Hospital, APHP Centre, University of Paris, France; INSERM U1138 Team «Cancer, Immune Control, and Escape», Cordeliers Research Center, University of Paris, France.
| | - Pietro Alifano
- Department of Experimental Medicine, University of Salento, Lecce, Italy.
| |
Collapse
|
2
|
Zhang Z. The Initial COVID-19 Reliable Interactive DNA Methylation Markers and Biological Implications. BIOLOGY 2024; 13:245. [PMID: 38666857 PMCID: PMC11048280 DOI: 10.3390/biology13040245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/22/2024] [Accepted: 04/04/2024] [Indexed: 04/28/2024]
Abstract
Earlier research has established the existence of reliable interactive genomic biomarkers. However, reliable DNA methylation biomarkers, not to mention interactivity, have yet to be identified at the epigenetic level. This study, drawing from 865,859 methylation sites, discovered two miniature sets of Infinium MethylationEPIC sites, each having eight CpG sites (genes) to interact with each other and disease subtypes. They led to the nearly perfect (96.87-100% accuracy) prediction of COVID-19 patients from patients with other diseases or healthy controls. These CpG sites can jointly explain some post-COVID-19-related conditions. These CpG sites and the optimally performing genomic biomarkers reported in the literature become potential druggable targets. Among these CpG sites, cg16785077 (gene MX1), cg25932713 (gene PARP9), and cg22930808 (gene PARP9) at DNA methylation levels indicate that the initial SARS-CoV-2 virus may be better treated as a transcribed viral DNA into RNA virus, i.e., not as an RNA virus that has concerned scientists in the field. Such a discovery can significantly change the scientific thinking and knowledge of viruses.
Collapse
Affiliation(s)
- Zhengjun Zhang
- School of Computer, Data and Information Sciences, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
3
|
Asteris PG, Gandomi AH, Armaghani DJ, Tsoukalas MZ, Gavriilaki E, Gerber G, Konstantakatos G, Skentou AD, Triantafyllidis L, Kotsiou N, Braunstein E, Chen H, Brodsky R, Touloumenidou T, Sakellari I, Alkayem NF, Bardhan A, Cao M, Cavaleri L, Formisano A, Guney D, Hasanipanah M, Khandelwal M, Mohammed AS, Samui P, Zhou J, Terpos E, Dimopoulos MA. Genetic justification of COVID-19 patient outcomes using DERGA, a novel data ensemble refinement greedy algorithm. J Cell Mol Med 2024; 28:e18105. [PMID: 38339761 PMCID: PMC10863978 DOI: 10.1111/jcmm.18105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 11/14/2023] [Accepted: 11/22/2023] [Indexed: 02/12/2024] Open
Abstract
Complement inhibition has shown promise in various disorders, including COVID-19. A prediction tool including complement genetic variants is vital. This study aims to identify crucial complement-related variants and determine an optimal pattern for accurate disease outcome prediction. Genetic data from 204 COVID-19 patients hospitalized between April 2020 and April 2021 at three referral centres were analysed using an artificial intelligence-based algorithm to predict disease outcome (ICU vs. non-ICU admission). A recently introduced alpha-index identified the 30 most predictive genetic variants. DERGA algorithm, which employs multiple classification algorithms, determined the optimal pattern of these key variants, resulting in 97% accuracy for predicting disease outcome. Individual variations ranged from 40 to 161 variants per patient, with 977 total variants detected. This study demonstrates the utility of alpha-index in ranking a substantial number of genetic variants. This approach enables the implementation of well-established classification algorithms that effectively determine the relevance of genetic variants in predicting outcomes with high accuracy.
Collapse
Affiliation(s)
- Panagiotis G. Asteris
- Computational Mechanics Laboratory, School of Pedagogical and Technological EducationAthensGreece
| | - Amir H. Gandomi
- Faculty of Engineering & ITUniversity of Technology SydneySydneyNew South WalesAustralia
- University Research and Innovation Center (EKIK), Óbuda UniversityBudapestHungary
| | - Danial J. Armaghani
- School of Civil and Environmental EngineeringUniversity of Technology SydneySydneyNew South WalesAustralia
| | - Markos Z. Tsoukalas
- Computational Mechanics Laboratory, School of Pedagogical and Technological EducationAthensGreece
| | - Eleni Gavriilaki
- 2nd Propedeutic Department of Internal MedicineAristotle University of ThessalonikiThessalonikiGreece
| | - Gloria Gerber
- Hematology DivisionJohns Hopkins UniversityBaltimoreUSA
| | - Gerasimos Konstantakatos
- Computational Mechanics Laboratory, School of Pedagogical and Technological EducationAthensGreece
| | - Athanasia D. Skentou
- Computational Mechanics Laboratory, School of Pedagogical and Technological EducationAthensGreece
| | - Leonidas Triantafyllidis
- Computational Mechanics Laboratory, School of Pedagogical and Technological EducationAthensGreece
| | - Nikolaos Kotsiou
- 2nd Propedeutic Department of Internal MedicineAristotle University of ThessalonikiThessalonikiGreece
| | | | - Hang Chen
- Hematology DivisionJohns Hopkins UniversityBaltimoreUSA
| | | | | | - Ioanna Sakellari
- Hematology Department – BMT UnitG Papanicolaou HospitalThessalonikiGreece
| | | | - Abidhan Bardhan
- Civil Engineering DepartmentNational Institute of Technology PatnaPatnaIndia
| | - Maosen Cao
- Department of Engineering MechanicsHohai UniversityNanjingChina
| | - Liborio Cavaleri
- Department of Civil, Environmental, Aerospace and Materials EngineeringUniversity of PalermoPalermoItaly
| | - Antonio Formisano
- Department of Structures for Engineering and ArchitectureUniversity of Naples “Federico II”NaplesItaly
| | - Deniz Guney
- Engineering FacultySan Diego State UniversitySan DiegoCaliforniaUSA
| | - Mahdi Hasanipanah
- Department of Geotechnics and Transportation, Faculty of Civil EngineeringUniversiti Teknologi MalaysiaJohor BahruMalaysia
| | - Manoj Khandelwal
- Institute of Innovation, Science and SustainabilityFederation University AustraliaBallaratVictoriaAustralia
| | | | - Pijush Samui
- Civil Engineering DepartmentNational Institute of Technology PatnaPatnaIndia
| | - Jian Zhou
- School of Resources and Safety EngineeringCentral South UniversityChangshaChina
| | - Evangelos Terpos
- Department of Clinical Therapeutics, Medical School, Faculty of MedicineNational Kapodistrian University of AthensAthensGreece
| | - Meletios A. Dimopoulos
- Department of Clinical Therapeutics, Medical School, Faculty of MedicineNational Kapodistrian University of AthensAthensGreece
| |
Collapse
|
4
|
Farooqi R, Kooner JS, Zhang W. Associations between polygenic risk score and covid-19 susceptibility and severity across ethnic groups: UK Biobank analysis. BMC Med Genomics 2023; 16:150. [PMID: 37386504 PMCID: PMC10311902 DOI: 10.1186/s12920-023-01584-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 06/16/2023] [Indexed: 07/01/2023] Open
Abstract
BACKGROUND COVID-19 manifests with huge heterogeneity in susceptibility and severity outcomes. UK Black Asian and Minority Ethnic (BAME) groups have demonstrated disproportionate burdens. Some variability remains unexplained, suggesting potential genetic contribution. Polygenic Risk Scores (PRS) can determine genetic predisposition to disease based on Single Nucleotide Polymorphisms (SNPs) within the genome. COVID-19 PRS analyses within non-European samples are extremely limited. We applied a multi-ethnic PRS to a UK-based cohort to understand genetic contribution to COVID-19 variability. METHODS We constructed two PRS for susceptibility and severity outcomes based on leading risk-variants from the COVID-19 Host Genetics Initiative. Scores were applied to 447,382 participants from the UK-Biobank. Associations with COVID-19 outcomes were assessed using binary logistic regression and discriminative power was validated using incremental area under receiver operating curve (ΔAUC). Variance explained was compared between ethnic groups via incremental pseudo-R2 (ΔR2). RESULTS Compared to those at low genetic risk, those at high risk had a significantly greater risk of severe COVID-19 for White (odds ratio [OR] 1.57, 95% confidence interval [CI] 1.42-1.74), Asian (OR 2.88, 95% CI 1.63-5.09) and Black (OR 1.98, 95% CI 1.11-3.53) ethnic groups. Severity PRS performed best within Asian (ΔAUC 0.9%, ΔR2 0.98%) and Black (ΔAUC 0.6%, ΔR2 0.61%) cohorts. For susceptibility, higher genetic risk was significantly associated with COVID-19 infection risk for the White cohort (OR 1.31, 95% CI 1.26-1.36), but not for Black or Asian groups. CONCLUSIONS Significant associations between PRS and COVID-19 outcomes were elicited, establishing a genetic basis for variability in COVID-19. PRS showed utility in identifying high-risk individuals. The multi-ethnic approach allowed applicability of PRS to diverse populations, with the severity model performing well within Black and Asian cohorts. Further studies with larger sample sizes of non-White samples are required to increase statistical power and better assess impacts within BAME populations.
Collapse
Affiliation(s)
- Raabia Farooqi
- Department of Epidemiology and Biostatistics, Imperial College London, London, W2 1PG, UK.
| | - Jaspal S Kooner
- Department of Cardiology, Ealing Hospital, London North West University Healthcare NHS Trust, Middlesex, UB1 3HW, UK
- National Heart and Lung Institute, Imperial College London, London, W12 0NN, UK
- Imperial College Healthcare NHS Trust, London, W12 0HS, UK
- MRC-PHE Centre for Environment and Health, Imperial College London, London, W2 1PG, UK
| | - Weihua Zhang
- Department of Epidemiology and Biostatistics, Imperial College London, London, W2 1PG, UK
- Department of Cardiology, Ealing Hospital, London North West University Healthcare NHS Trust, Middlesex, UB1 3HW, UK
| |
Collapse
|
5
|
Sokhansanj BA, Zhao Z, Rosen GL. Interpretable and Predictive Deep Neural Network Modeling of the SARS-CoV-2 Spike Protein Sequence to Predict COVID-19 Disease Severity. BIOLOGY 2022; 11:1786. [PMID: 36552295 PMCID: PMC9774807 DOI: 10.3390/biology11121786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 11/28/2022] [Accepted: 12/05/2022] [Indexed: 12/13/2022]
Abstract
Through the COVID-19 pandemic, SARS-CoV-2 has gained and lost multiple mutations in novel or unexpected combinations. Predicting how complex mutations affect COVID-19 disease severity is critical in planning public health responses as the virus continues to evolve. This paper presents a novel computational framework to complement conventional lineage classification and applies it to predict the severe disease potential of viral genetic variation. The transformer-based neural network model architecture has additional layers that provide sample embeddings and sequence-wide attention for interpretation and visualization. First, training a model to predict SARS-CoV-2 taxonomy validates the architecture's interpretability. Second, an interpretable predictive model of disease severity is trained on spike protein sequence and patient metadata from GISAID. Confounding effects of changing patient demographics, increasing vaccination rates, and improving treatment over time are addressed by including demographics and case date as independent input to the neural network model. The resulting model can be interpreted to identify potentially significant virus mutations and proves to be a robust predctive tool. Although trained on sequence data obtained entirely before the availability of empirical data for Omicron, the model can predict the Omicron's reduced risk of severe disease, in accord with epidemiological and experimental data.
Collapse
Affiliation(s)
- Bahrad A. Sokhansanj
- Ecological and Evolutionary Signal-Processing and Informatics Laboratory, Department of Electrical & Computer Engineering, College of Engineering, Drexel University, Philadelphia, PA 19104, USA
| | | | | |
Collapse
|
6
|
Zhang Z. Genomic Transcriptome Benefits and Potential Harms of COVID-19 Vaccines Indicated from Optimized Genomic Biomarkers. Vaccines (Basel) 2022; 10:1774. [PMID: 36366282 PMCID: PMC9692407 DOI: 10.3390/vaccines10111774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 09/28/2022] [Accepted: 10/20/2022] [Indexed: 11/07/2023] Open
Abstract
COVID-19 vaccines can be the tugboats for preventing SARS-CoV-2 infections when they are practical and, more importantly, without adverse effects. However, the reality is that they may result in short-term or long-term impacts on COVID-19-related diseases and even trigger the formation of new variants of SARS-CoV-2. Using published data, we use a set of optimized-performance COVID-19 genomic biomarkers (MND1, CDC6, ZNF282) to study the benefits and adverse effects of the BNT162b2 vaccine. We found that the vaccine lowered the expression values of genes MND1 and CDC6 while heightening the expression values of ZNF282 in individuals who are SARS-CoV-2 naïve, which is expected and satisfies the biological equivalence between the COVID-19 disease and the genomic signature patterns established in the literature. However, we also found that COVID-19-convalescent octogenarians responded reversely. The vaccine heightened the expression values of MND1 and CDC6. In addition, it lowered the expression values of ZNF282. Such adverse effects raise outstanding concerns about whether or not COVID-19-convalescent individuals should take the current vaccine or when they can take it. These findings are new at the genomic level and can provide insights into developing next-generation vaccines, antiviral drugs, and pandemic management guidance.
Collapse
Affiliation(s)
- Zhengjun Zhang
- Department of Statistics, School of Computer, Data & Information Sciences, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
7
|
Zhang Z. Genomic Biomarker Heterogeneities between SARS-CoV-2 and COVID-19. Vaccines (Basel) 2022; 10:1657. [PMID: 36298522 PMCID: PMC9608907 DOI: 10.3390/vaccines10101657] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 09/27/2022] [Accepted: 09/29/2022] [Indexed: 11/06/2022] Open
Abstract
Genes functionally associated with SARS-CoV-2 infection and genes functionally related to the COVID-19 disease can be different, whose distinction will become the first essential step for successfully fighting against the COVID-19 pandemic. Unfortunately, this first step has not been completed in all biological and medical research. Using a newly developed max-competing logistic classifier, two genes, ATP6V1B2 and IFI27, stand out to be critical in the transcriptional response to SARS-CoV-2 infection with differential expressions derived from NP/OP swab PCR. This finding is evidenced by combining these two genes with another gene in predicting disease status to achieve better-indicating accuracy than existing classifiers with the same number of genes. In addition, combining these two genes with three other genes to form a five-gene classifier outperforms existing classifiers with ten or more genes. These two genes can be critical in fighting against the COVID-19 pandemic as a new focus and direction with their exceptional predicting accuracy. Comparing the functional effects of these genes with a five-gene classifier with 100% accuracy identified and tested from blood samples in our earlier work, the genes and their transcriptional response and functional effects on SARS-CoV-2 infection, and the genes and their functional signature patterns on COVID-19 antibodies, are significantly different. We will use a total of fourteen cohort studies (including breakthrough infections and omicron variants) with 1481 samples to justify our results. Such significant findings can help explore the causal and pathological links between SARS-CoV-2 infection and the COVID-19 disease, and fight against the disease with more targeted genes, vaccines, antiviral drugs, and therapies.
Collapse
Affiliation(s)
- Zhengjun Zhang
- Department of Statistics, School of Computer, Data & Information Sciences, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
8
|
Villapalos-García G, Zubiaur P, Rivas-Durán R, Campos-Norte P, Arévalo-Román C, Fernández-Rico M, García-Fraile Fraile L, Fernández-Campos P, Soria-Chacartegui P, Fernández de Córdoba-Oñate S, Delgado-Wicke P, Fernández-Ruiz E, González-Álvaro I, Sanz J, Abad-Santos F, de Los Santos I. Transmembrane protease serine 2 ( TMPRSS2) rs75603675, comorbidity, and sex are the primary predictors of COVID-19 severity. Life Sci Alliance 2022; 5:e202201396. [PMID: 35636966 PMCID: PMC9152129 DOI: 10.26508/lsa.202201396] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 05/12/2022] [Accepted: 05/13/2022] [Indexed: 01/08/2023] Open
Abstract
By the end of December 2021, coronavirus disease 2019 (COVID-19) produced more than 271 million cases and 5.3 million deaths. Although vaccination is an effective strategy for pandemic control, it is not yet equally available in all countries. Therefore, identification of prognostic biomarkers remains crucial to manage COVID-19 patients. The aim of this study was to evaluate predictors of COVID-19 severity previously proposed. Clinical and demographic characteristics and 120 single-nucleotide polymorphisms were analyzed from 817 patients with COVID-19, who attended the emergency department of the Hospital Universitario de La Princesa during March and April 2020. The main outcome was a modified version of the 7-point World Health Organization (WHO) COVID-19 severity scale (WHOCS); both in the moment of the first hospital examination (WHOCS-1) and of the severest WHOCS score (WHOCS-2). The TMPRSS2 rs75603675 genotype (OR = 0.586), dyslipidemia (OR = 2.289), sex (OR = 0.586), and the Charlson Comorbidity Index (OR = 1.126) were identified as the main predictors of disease severity. Consequently, these variables might influence COVID-19 severity and could be used as predictors of disease development.
Collapse
Affiliation(s)
- Gonzalo Villapalos-García
- Clinical Pharmacology Department, Hospital Universitario La Princesa, Instituto Teófilo Hernando, Universidad Autónoma de Madrid (UAM), Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Pablo Zubiaur
- Clinical Pharmacology Department, Hospital Universitario La Princesa, Instituto Teófilo Hernando, Universidad Autónoma de Madrid (UAM), Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain
| | - Rebeca Rivas-Durán
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Pilar Campos-Norte
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Cristina Arévalo-Román
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Marta Fernández-Rico
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Lucio García-Fraile Fraile
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Paula Fernández-Campos
- Clinical Pharmacology Department, Hospital Universitario La Princesa, Instituto Teófilo Hernando, Universidad Autónoma de Madrid (UAM), Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Paula Soria-Chacartegui
- Clinical Pharmacology Department, Hospital Universitario La Princesa, Instituto Teófilo Hernando, Universidad Autónoma de Madrid (UAM), Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Sara Fernández de Córdoba-Oñate
- Molecular Biology Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Pablo Delgado-Wicke
- Molecular Biology Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Elena Fernández-Ruiz
- Molecular Biology Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Isidoro González-Álvaro
- Rheumatology Service, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Jesús Sanz
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| | - Francisco Abad-Santos
- Clinical Pharmacology Department, Hospital Universitario La Princesa, Instituto Teófilo Hernando, Universidad Autónoma de Madrid (UAM), Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain
| | - Ignacio de Los Santos
- Infectious Diseases Unit, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria La Princesa (IIS-IP), Madrid, Spain
| |
Collapse
|
9
|
Sokhansanj BA, Rosen GL. Predicting COVID-19 disease severity from SARS-CoV-2 spike protein sequence by mixed effects machine learning. Comput Biol Med 2022; 149:105969. [PMID: 36041271 PMCID: PMC9384346 DOI: 10.1016/j.compbiomed.2022.105969] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/11/2022] [Accepted: 08/13/2022] [Indexed: 11/17/2022]
Abstract
Epidemiological studies show that COVID-19 variants-of-concern, like Delta and Omicron, pose different risks for severe disease, but they typically lack sequence-level information for the virus. Studies which do obtain viral genome sequences are generally limited in time, location, and population scope. Retrospective meta-analyses require time-consuming data extraction from heterogeneous formats and are limited to publicly available reports. Fortuitously, a subset of GISAID, the global SARS-CoV-2 sequence repository, includes "patient status" metadata that can indicate whether a sequence record is associated with mild or severe disease. While GISAID lacks data on comorbidities relevant to severity, such as obesity and chronic disease, it does include metadata for age and sex to use as additional attributes in modeling. With these caveats, previous efforts have demonstrated that genotype-patient status models can be fit to GISAID data, particularly when country-of-origin is used as an additional feature. But are these models robust and biologically meaningful? This paper shows that, in fact, temporal and geographic biases in sequences submitted to GISAID, as well as the evolving pandemic response, particularly reduction in severe disease due to vaccination, create complex issues for model development and interpretation. This paper poses a potential solution: efficient mixed effects machine learning using GPBoost, treating country as a random effect group. Training and validation using temporally split GISAID data and emerging Omicron variants demonstrates that GPBoost models are more predictive of the impact of spike protein mutations on patient outcomes than fixed effect XGBoost, LightGBM, random forests, and elastic net logistic regression models.
Collapse
Affiliation(s)
- Bahrad A Sokhansanj
- Ecological and Evolutionary Signal Processing & Informatics Laboratory, Drexel University, 3100 Chestnut St., Philadelphia, PA, 19104, United States of America.
| | - Gail L Rosen
- Ecological and Evolutionary Signal Processing & Informatics Laboratory, Drexel University, 3100 Chestnut St., Philadelphia, PA, 19104, United States of America.
| |
Collapse
|
10
|
Rüter J, Pallerla SR, Meyer CG, Casadei N, Sonnabend M, Peter S, Nurjadi D, Linh LTK, Fendel R, Göpel S, Riess O, Kremsner PG, Velavan TP. Host genetic loci LZTFL1 and CCL2 associated with SARS-CoV-2 infection and severity of COVID-19. Int J Infect Dis 2022; 122:427-436. [PMID: 35753602 PMCID: PMC9222649 DOI: 10.1016/j.ijid.2022.06.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 06/14/2022] [Accepted: 06/17/2022] [Indexed: 01/25/2023] Open
Abstract
OBJECTIVES Host genetic factors contribute to the variable severity of COVID-19. We examined genetic variants from genome-wide association studies and candidate gene association studies in a cohort of patients with COVID-19 and investigated the role of early SARS-CoV-2 strains in COVID-19 severity. METHODS This case-control study included 123 COVID-19 cases (hospitalized or ambulatory) and healthy controls from the state of Baden-Wuerttemberg, Germany. We genotyped 30 single nucleotide polymorphisms, using a custom-designed panel. Cases were also compared with the 1000 genomes project. Polygenic risk scores were constructed. SARS-CoV-2 genomes from 26 patients with COVID-19 were sequenced and compared between ambulatory and hospitalized cases, and phylogeny was reconstructed. RESULTS Eight variants reached nominal significance and two were significantly associated with at least one of the phenotypes "susceptibility to infection", "hospitalization", or "severity": rs73064425 in LZTFL1 (hospitalization and severity, P <0.001) and rs1024611 near CCL2 (susceptibility, including 1000 genomes project, P = 0.001). The polygenic risk score could predict hospitalization. Most (23/26, 89%) of the SARS-CoV-2 genomes were classified as B.1 lineage. No associations of SARS-CoV-2 mutations or lineages with severity were observed. CONCLUSION These host genetic markers provide insights into pathogenesis and enable risk classification. Variants which reached nominal significance should be included in larger studies.
Collapse
Affiliation(s)
- Jule Rüter
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany
| | - Srinivas Reddy Pallerla
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam
| | - Christian G Meyer
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam; Duy Tan University, Da Nang, Vietnam
| | - Nicolas Casadei
- Institute for Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany; NGS Competence Center Tübingen (NCCT), Tübingen, Germany
| | - Michael Sonnabend
- Institute for Medical Microbiology and Hygiene, University Hospital Tübingen, Tübingen, Germany
| | - Silke Peter
- Institute for Medical Microbiology and Hygiene, University Hospital Tübingen, Tübingen, Germany
| | - Dennis Nurjadi
- Department of Infectious Diseases, Medical Microbiology and Hygiene, Heidelberg University Hospital, Germany
| | - Le Thi Kieu Linh
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam
| | - Rolf Fendel
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany
| | - Siri Göpel
- Department of Internal Medicine I, Tübingen University Hospital, Tübingen, Germany
| | - Olaf Riess
- Institute for Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
| | - Peter G Kremsner
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany; Centre de Recherches Médicales de Lambaréné (CERMEL), Gabon
| | - Thirumalaisamy P Velavan
- Institute of Tropical Medicine, University Hospital Tübingen, Tübingen, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam.
| |
Collapse
|
11
|
Abstract
Human genetics can inform the biology and epidemiology of coronavirus disease 2019 (COVID-19) by pinpointing causal mechanisms that explain why some individuals become more severely affected by the disease upon infection by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus. Large-scale genetic association studies, encompassing both rare and common genetic variants, have used different study designs and multiple disease phenotype definitions to identify several genomic regions associated with COVID-19. Along with a multitude of follow-up studies, these findings have increased our understanding of disease aetiology and provided routes for management of COVID-19. Important emergent opportunities include the clinical translatability of genetic risk prediction, the repurposing of existing drugs, exploration of variable host effects of different viral strains, study of inter-individual variability in vaccination response and understanding the long-term consequences of SARS-CoV-2 infection. Beyond the current pandemic, these transferrable opportunities are likely to affect the study of many infectious diseases.
Collapse
Affiliation(s)
- Mari E K Niemi
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Mark J Daly
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Broad Institute, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Andrea Ganna
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.
- Broad Institute, Cambridge, MA, USA.
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
12
|
Zhang Z. The Existence of at Least Three Genomic Signature Patterns and at Least Seven Subtypes of COVID-19 and the End of the Disease. Vaccines (Basel) 2022; 10:761. [PMID: 35632517 PMCID: PMC9146581 DOI: 10.3390/vaccines10050761] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 12/15/2022] Open
Abstract
Hoping to find genomic clues linked to COVID-19 and end the pandemic has driven scientists' tremendous efforts to try all kinds of research. Signs of progress have been achieved but are still limited. This paper intends to prove the existence of at least three genomic signature patterns and at least seven subtypes of COVID-19 driven by five critical genes (the smallest subset of genes) using three blood-sampled datasets. These signatures and subtypes provide crucial genomic information in COVID-19 diagnosis (including ICU patients), research focuses, and treatment methods. Unlike existing approaches focused on gene fold-changes and pathways, gene-gene nonlinear and competing interactions are the driving forces in finding the signature patterns and subtypes. Furthermore, the method leads to high accuracy with hospitalized patients, showing biological and mathematical equivalences between COVID-19 status and the signature patterns and a methodological advantage over other methods that cannot lead to high accuracy. As a result, as new biomarkers, the new findings and genomic clues can be much more informative than other findings for interpreting biological mechanisms, developing the second (third) generation of vaccines, antiviral drugs, and treatment methods, and eventually bringing new hopes of an end to the pandemic.
Collapse
Affiliation(s)
- Zhengjun Zhang
- Department of Statistics, University of Wisconsin, Madison, WI 53706, USA
| |
Collapse
|
13
|
Dite GS, Murphy NM, Spaeth E, Allman R. Validation of a clinical and genetic model for predicting severe COVID-19. Epidemiol Infect 2022; 150:1-15. [PMID: 35465870 PMCID: PMC9096108 DOI: 10.1017/s0950268822000541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 02/17/2022] [Accepted: 03/15/2022] [Indexed: 11/07/2022] Open
Abstract
Using nested case–control data from the Lifelines COVID-19 cohort, we undertook a validation study of a clinical and genetic model to predict the risk of severe COVID-19 in people with confirmed COVID-19 and in people with confirmed or self-reported COVID-19. The model performed well in terms of discrimination of cases and controls for all ages (area under the receiver operating characteristic curve (AUC) = 0.680 for confirmed COVID-19 and AUC = 0.689 for confirmed and self-reported COVID-19) and in the age group in which the model was developed (50 years and older; AUC = 0.658 for confirmed COVID-19 and AUC = 0.651 for confirmed and self-reported COVID-19). There was no evidence of over- or under-dispersion of risk scores but there was evidence of overall over-estimation of risk in all analyses (all P < 0.0001). In the light of large numbers of people worldwide remaining unvaccinated and continuing uncertainty regarding vaccine efficacy over time and against variants of concern, identification of people at high risk of severe COVID-19 may encourage the uptake of vaccinations (including boosters) and the use of non-pharmaceutical inventions.
Collapse
Affiliation(s)
| | | | - Erika Spaeth
- Phenogen Sciences Inc, Charlotte, North Carolina, USA
| | - Richard Allman
- Genetic Technologies Limited, Fitzroy, Victoria, Australia
| | | |
Collapse
|
14
|
Velavan TP, Pallerla SR, Rüter J, Augustin Y, Kremsner PG, Krishna S, Meyer CG. Host genetic factors determining COVID-19 susceptibility and severity. EBioMedicine 2021; 72:103629. [PMID: 34655949 PMCID: PMC8512556 DOI: 10.1016/j.ebiom.2021.103629] [Citation(s) in RCA: 114] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 10/02/2021] [Accepted: 10/03/2021] [Indexed: 12/12/2022] Open
Abstract
The COVID-19 pandemic caused by the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) poses an unprecedented challenge to humanity. SARS-CoV-2 infections range from asymptomatic to severe courses of COVID-19 with acute respiratory distress syndrome (ARDS), multiorgan involvement and death. Risk factors for disease severity include older age, male sex, increased BMI and pre-existing comorbidities. Ethnicity is also relevant to COVID-19 susceptibility and severity. Host genetic predisposition to COVID-19 is now increasingly recognized and whole genome and candidate gene association studies regarding COVID-19 susceptibility have been performed. Several common and rare variants in genes related to inflammation or immune responses have been identified. We summarize research on COVID-19 host genetics and compile genetic variants associated with susceptibility to COVID-19 and disease severity. We discuss candidate genes that should be investigated further to understand such associations and provide insights relevant to pathogenesis, risk classification, therapy response, precision medicine, and drug repurposing.
Collapse
Affiliation(s)
- Thirumalaisamy P Velavan
- Institute of Tropical Medicine, Universitätsklinikum Tübingen, Wilhelmstrasse 27, Tübingen 72074, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam.
| | - Srinivas Reddy Pallerla
- Institute of Tropical Medicine, Universitätsklinikum Tübingen, Wilhelmstrasse 27, Tübingen 72074, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam
| | - Jule Rüter
- Institute of Tropical Medicine, Universitätsklinikum Tübingen, Wilhelmstrasse 27, Tübingen 72074, Germany
| | - Yolanda Augustin
- Institute of Infection and Immunity, St George's University of London, United Kingdom
| | - Peter G Kremsner
- Institute of Tropical Medicine, Universitätsklinikum Tübingen, Wilhelmstrasse 27, Tübingen 72074, Germany; Centre de Recherches Médicales de Lambaréné (CERMEL), Gabon
| | - Sanjeev Krishna
- Institute of Infection and Immunity, St George's University of London, United Kingdom; Centre de Recherches Médicales de Lambaréné (CERMEL), Gabon
| | - Christian G Meyer
- Institute of Tropical Medicine, Universitätsklinikum Tübingen, Wilhelmstrasse 27, Tübingen 72074, Germany; Vietnamese-German Center for Medical Research, VG-CARE, Hanoi, Vietnam; Duy Tan University, Da Nang, Vietnam
| |
Collapse
|
15
|
Colona VL, Vasiliou V, Watt J, Novelli G, Reichardt JKV. Update on human genetic susceptibility to COVID-19: susceptibility to virus and response. Hum Genomics 2021; 15:57. [PMID: 34429158 PMCID: PMC8384585 DOI: 10.1186/s40246-021-00356-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Affiliation(s)
- Vito Luigi Colona
- Department of Biomedicine and Prevention, "Tor Vergata" University of Rome, 00133, Rome, Italy
| | - Vasilis Vasiliou
- Department of Environmental Health Sciences, School of Public Health, Yale University, New Haven, USA
| | - Jessica Watt
- College of Public Health, Medical and Veterinary Sciences, James Cook University, Smithfield, QLD, Australia
| | - Giuseppe Novelli
- Department of Biomedicine and Prevention, "Tor Vergata" University of Rome, 00133, Rome, Italy
- IRCCS Neuromed, Pozzilli, IS, Italy
- Department of Pharmacology, School of Medicine, University of Nevada, Reno, NV, 89557, USA
| | - Juergen K V Reichardt
- Australian Institute of Tropical Health and Medicine, James Cook University, Smithfield, QLD, 4878, Australia.
| |
Collapse
|