1
Jacquat AG, Theumer MG, Dambolena JS. Selective and non-selective evolutionary signatures found in the simplest replicative biological entities. J Evol Biol 2024; 37:862-876. [PMID: 38822575 DOI: 10.1093/jeb/voae070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2023] [Accepted: 05/30/2024] [Indexed: 06/03/2024]
Abstract
Mitoviruses, which are considered evolutionary relics of extinct alpha-proteobacteria RNA phages, represent one of the simplest self-replicating biological systems. This study aims to quantitatively describe genomes and identify potential genomic signatures that support the protein phylogenetic-based classification criterion. Genomic variables, such as mononucleotide and dinucleotide composition, codon usage bias, and minimal free energy derived from optimized predicted RNA secondary structure, were analyzed. From the values obtained, the main evolutionary pressures were discussed, indicating that natural selection plays a significant role in shaping mitovirus genomes. However, neutral evolution also makes a significant contribution. This study reveals a significant discovery of structural divergence in Kvaramitovirus. The energy minimization approach employed to study 2D folding in this study reveals a distinct spatial organization of their genomes, providing evidence for the hypothesis of a single evolutionary event of circularization in the most recent common ancestor of the lineage. This hypothesis was discussed in light of recent discoveries by other researchers that partially support the existence of mitoviruses with circular genomes. Finally, this study represents a significant advancement in the understanding of mitoviruses, as it quantitatively describes the nucleotide sequence at the family and genus taxonomic levels. Additionally, we provide hypotheses that can be experimentally validated to inspire new research and address the gaps in knowledge of this fascinating, basally divergent RNA virus lineage.
Affiliation(s)
- Andrés Gustavo Jacquat
  - Facultad de Ciencias Exactas Físicas y Naturales (FCEFyN), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
  - Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
- Martín Gustavo Theumer
  - Departamento de Bioquímica Clínica, Facultad de Ciencias Químicas (FCQ), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
  - Centro de Investigaciones en Bioquímica Clínica e Inmunología (CIBICI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
- José Sebastián Dambolena
  - Facultad de Ciencias Exactas Físicas y Naturales (FCEFyN), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
  - Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
2
Castellano LA, McNamara RJ, Pallarés HM, Gamarnik AV, Alvarez DE, Bazzini AA. Dengue virus preferentially uses human and mosquito non-optimal codons. Mol Syst Biol 2024:10.1038/s44320-024-00052-7. [PMID: 39039212 DOI: 10.1038/s44320-024-00052-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/29/2024] [Revised: 06/25/2024] [Accepted: 06/26/2024] [Indexed: 07/24/2024] Open
Abstract
Codon optimality refers to the effect that codon composition has on messenger RNA (mRNA) stability and translation level and implies that synonymous codons are not silent from a regulatory point of view. Here, we investigated the adaptation of virus genomes to the host optimality code using mosquito-borne dengue virus (DENV) as a model. We demonstrated that codon optimality exists in mosquito cells and showed that DENV preferentially uses nonoptimal (destabilizing) codons and avoids codons that are defined as optimal (stabilizing) in either human or mosquito cells. Human genes enriched in the codons preferentially and frequently used by DENV are upregulated during infection, and so is the tRNA decoding the nonoptimal and DENV preferentially used codon for arginine. We found that adaptation during single-host passaging in human or mosquito cells results in the selection of synonymous mutations towards DENV's preferred nonoptimal codons that increase virus fitness. Finally, our analyses revealed that hundreds of viruses preferentially use nonoptimal codons, with those infecting a single host displaying an even stronger bias, suggesting that host-pathogen interaction shapes virus-synonymous codon choice.
Affiliation(s)
- Luciana A Castellano
  - Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
- Ryan J McNamara
  - Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
- Horacio M Pallarés
  - Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
  - Fundación Instituto Leloir, Instituto de Investigaciones Bioquímicas de Buenos Aires IIBBA-CONICET, Ciudad Autónoma de Buenos Aires, Argentina
- Andrea V Gamarnik
  - Fundación Instituto Leloir, Instituto de Investigaciones Bioquímicas de Buenos Aires IIBBA-CONICET, Ciudad Autónoma de Buenos Aires, Argentina
- Diego E Alvarez
  - Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín-CONICET, San Martín B1650, Argentina
- Ariel A Bazzini
  - Stowers Institute for Medical Research, 1000 E 50th Street, Kansas City, MO, 64110, USA
  - Department of Molecular and Integrative Physiology, University of Kansas Medical Center, 3901 Rainbow Blvd, Kansas City, KS, 66160, USA
3
Kaushik R, Kumar N, Yadav P, Sircar S, Shete-Aich A, Singh A, Tomar S, Launey T, Malik YS. Comprehensive Genomics Investigation of Neboviruses Reveals Distinct Codon Usage Patterns and Host Specificity. Microorganisms 2024; 12:696. [PMID: 38674640 PMCID: PMC11052288 DOI: 10.3390/microorganisms12040696] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2024] [Revised: 03/24/2024] [Accepted: 03/26/2024] [Indexed: 04/28/2024] Open
Abstract
Neboviruses (NeVs) from the Caliciviridae family have been linked to enteric diseases in bovines and have been detected worldwide. As viruses rely entirely on the cellular machinery of the host for replication, their ability to thrive in a specific host is greatly impacted by the specific codon usage preferences. Here, we systematically analyzed the codon usage bias in NeVs to explore the genetic and evolutionary patterns. Relative Synonymous Codon Usage and Effective Number of Codon analyses indicated a marginally lower codon usage bias in NeVs, predominantly influenced by the nucleotide compositional constraints. Nonetheless, NeVs showed a higher codon usage bias for codons containing G/C at the third codon position. The neutrality plot analysis revealed natural selection as the primary factor that shaped the codon usage bias in both the VP1 (82%) and VP2 (57%) genes of NeVs. Furthermore, the NeVs showed a highly comparable codon usage pattern to bovines, as reflected through Codon Adaptation Index and Relative Codon Deoptimization Index analyses. Notably, yak NeVs showed considerably different nucleotide compositional constraints and mutational pressure compared to bovine NeVs, which appear to be predominantly host-driven. This study sheds light on the genetic mechanism driving NeVs' adaptability, evolution, and fitness to their host species.
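As a rough illustration of the Relative Synonymous Codon Usage (RSCU) measure named in this abstract, the Python sketch below computes RSCU for one coding sequence under the standard genetic code. It is not the authors' pipeline: the function name `rscu` and the toy sequence are assumptions made purely for illustration, and a real analysis would pool many coding sequences first.

```python
from collections import Counter

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
TABLE = dict(zip(CODONS, AMINO))


def rscu(seq):
    """Return {codon: RSCU} for every amino acid observed in `seq`.

    RSCU = observed codon count divided by the count expected if all
    synonymous codons of that amino acid were used equally often.
    Stop codons and amino acids absent from `seq` are skipped.
    """
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    result = {}
    for aa in set(AMINO) - {"*"}:
        family = [c for c, a in TABLE.items() if a == aa]
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid not present in this sequence
        for codon in family:
            result[codon] = counts[codon] * len(family) / total
    return result


if __name__ == "__main__":
    # Toy alanine-only sequence (hypothetical): GCT x3, GCC x1.
    for codon, value in sorted(rscu("GCTGCTGCTGCC").items()):
        print(codon, round(value, 2))
```

An RSCU above 1 means a codon is used more often than expected under equal usage of its synonyms; a value below 1 means it is used less often.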
Affiliation(s)
- Rahul Kaushik
  - Biotechnology Research Center, Technology Innovation Institute, Masdar City, Abu Dhabi P.O. Box 9639, United Arab Emirates
- Naveen Kumar
  - Diagnostics and Vaccines Group, ICAR-National Institute of High Security Animal Diseases, Bhopal 462021, Madhya Pradesh, India
- Pragya Yadav
  - Maximum Containment Facility, ICMR-National Institute of Virology, Pune 411001, Maharashtra, India
- Shubhankar Sircar
  - Department of Animal Sciences, Washington State University, Pullman, WA 99163, USA
- Anita Shete-Aich
  - Maximum Containment Facility, ICMR-National Institute of Virology, Pune 411001, Maharashtra, India
- Ankur Singh
  - Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, Uttarakhand, India
- Shailly Tomar
  - Department of Biosciences and Bioengineering, Indian Institute of Technology Roorkee, Roorkee 247667, Uttarakhand, India
- Thomas Launey
  - Biotechnology Research Center, Technology Innovation Institute, Masdar City, Abu Dhabi P.O. Box 9639, United Arab Emirates
- Yashpal Singh Malik
  - College of Animal Biotechnology, Guru Angad Dev Veterinary and Animal Science University, Ludhiana 141004, Punjab, India
4
Gurjar P, Khan AA, Alanazi AM, Vasil'ev VG, Zouganelis G, Alexiou A. Molecular Dissection of Herpes Simplex Virus Type 1 to Elucidate Molecular Mechanisms Behind Latency and Comparison of Its Codon Usage Patterns with Genes Modulated During Alzheimer's Disease as a Part of Host-Pathogen Interaction. J Alzheimers Dis 2024; 97:1111-1123. [PMID: 38306057 DOI: 10.3233/jad-231083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/03/2024]
Abstract
BACKGROUND: Herpes simplex virus type 1 (HSV-1), which goes into a cycle of latency and reactivation, is associated with Alzheimer's disease. The present study was envisaged to understand the reasons for latency and specific molecular patterns present in HSV-1.
OBJECTIVE: The objective is the molecular dissection of herpes simplex virus type 1 to elucidate molecular mechanisms behind latency and compare its codon usage patterns with genes modulated during Alzheimer's disease as a part of host-pathogen interaction.
METHODS: In the present study, we investigated the potential reasons for the latency of HSV-1 bioinformatically by determining the CpG patterns. We also investigated the codon usage pattern, the presence of rare codons, codon context, and protein properties.
RESULTS: The top 222 codon pairs graded based on their frequency in the HSV-1 genome revealed that, with only one exception (CUG-UUU), all other codon pairs have codons ending with G/C. Considering it an extension of host-pathogen interaction, we compared HSV-1 codon usage with the codon usage of genes modulated during Alzheimer's disease, and we found that CGT and TTT are the only two codons that exhibited similar codon usage patterns; the other codons showed statistically highly significant differences in codon preference. Dinucleotide CpG tends to mutate to TpG, suggesting the presence of mutational forces and the imperative role of CpG methylation in HSV-1 latency.
CONCLUSIONS: Upon comparison of codon usage between HSV-1 and Alzheimer's disease genes, no similarities in codon usage were found as a part of host-pathogen interaction. CpG methylation plays an imperative role in HSV-1 latency.
Affiliation(s)
- Pankaj Gurjar
  - Centre for Global Health Research, Saveetha Medical College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India
  - Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia
- Azmat Ali Khan
  - Department of Pharmaceutical Chemistry, Pharmaceutical Biotechnology Laboratory, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- Amer M Alanazi
  - Department of Pharmaceutical Chemistry, Pharmaceutical Biotechnology Laboratory, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- George Zouganelis
  - School of Human Sciences, College of Life and Natural Sciences, University of Derby, Derby, UK
- Athanasios Alexiou
  - Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia
  - AFNP Med, Vienna, Austria
5
Yimyaem M, Jitobaom K, Auewarakul P. A small stretch of poor codon usage at the beginning of dengue virus open reading frame may act as a translational checkpoint. BMC Res Notes 2023; 16:359. [PMID: 38053139 DOI: 10.1186/s13104-023-06615-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/11/2023] [Accepted: 11/06/2023] [Indexed: 12/07/2023] Open
Abstract
OBJECTIVE: Rare codons were previously shown to be enriched at the beginning of the dengue virus (DENV) open reading frame. However, the role of rare codons in regulating translation efficiency and replication of DENV remains unclear. The present study aims to clarify the significance of rare codon usage at the beginning of DENV transcripts using the codon adaptation index (CAI).
METHODOLOGY: CAIs of the whole starting regions of DENV transcripts as well as 18-codon sliding windows of the regions were analyzed.
RESULTS: One of the intriguing findings is that those rare codons do not typically result in uniformly low CAI in the starting region with rare codons. However, it shows a notable local drop in CAI around the 50th codon in all dengue serotypes. This suggests that there may be a translational checkpoint at this site and that the rare codon usage upstream to this checkpoint may not be related to translational control.
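The sliding-window CAI analysis described in this abstract can be sketched in Python as follows. This is a minimal illustration, not the authors' code: CAI is taken as the geometric mean of per-codon relative adaptiveness weights, and the `weights` dictionary is a hypothetical input that would normally be derived from a reference set of host genes. The default window of 18 codons follows the abstract; the function names are assumptions.

```python
import math


def cai(codons, weights):
    """Geometric mean of the relative adaptiveness weights of `codons`.

    Codons missing from `weights` (or with weight 0) are skipped.
    Returns NaN when no codon in the window can be scored.
    """
    logs = [math.log(weights[c]) for c in codons if weights.get(c, 0) > 0]
    return math.exp(sum(logs) / len(logs)) if logs else float("nan")


def sliding_cai(seq, weights, window=18):
    """CAI for every window of `window` consecutive codons along `seq`."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    return [
        cai(codons[i:i + window], weights)
        for i in range(len(codons) - window + 1)
    ]


if __name__ == "__main__":
    # Hypothetical weights for two alanine codons; not real host data.
    toy_weights = {"GCT": 1.0, "GCC": 0.25}
    print(sliding_cai("GCTGCTGCCGCC", toy_weights, window=2))
```

Plotting the returned list against window start position would show a local drop of the kind the abstract reports; the position and depth of any drop depend entirely on the reference weights supplied.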
Affiliation(s)
- Maneenop Yimyaem
  - Graduate Program in Molecular Medicine, Faculty of Science, Mahidol University, Princess Srisavangavadhana College of Medicine, Chulabhorn Royal Academy, Bangkok, Thailand
- Kunlakanya Jitobaom
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Prasert Auewarakul
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
6
Guo F, Yang J, Abd El-Aty AM, Wang R, Ju X. Base composition, adaptation, and evolution of goose astroviruses: codon-based investigation. Poult Sci 2023; 102:103029. [PMID: 37713803 PMCID: PMC10511809 DOI: 10.1016/j.psj.2023.103029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2023] [Revised: 07/24/2023] [Accepted: 08/11/2023] [Indexed: 09/01/2023] Open
Abstract
Goose astroviruses (GoAstVs) are causative agents that account for fatal infection of goslings characterized by visceral urate deposition, resulting in severe economic losses in major goose-producing regions in China since 2017. In this study, we sought to unravel the intrinsic properties associated with adaptation and evolution in the host environment of GoAstVs. Consistent results from phylogenetic analysis and correspondence analysis performed on the codon usage patterns (CUPs) reveal 2 clusters of GoAstVs, namely, GoAstV-1 and GoAstV-2. However, multiple similar compositional characteristics were found, despite the high divergence between GoAstV-1 and GoAstV-2. Studies on the base composition of GoAstVs reveal an A/U bias, indicating a compositional constraint, while natural selection prevailed in determining the CUPs in the virus genome based on our neutrality plot analysis, reflecting high adaptive pressure to fit the host environment. Codon adaptation index (CAI) analysis revealed a higher degree of fitness to the CUPs of the corresponding host for GoAstVs than avian influenza virus and betacoronaviruses, which may be a favorable factor contributing to the high pathogenicity and wide distribution of GoAstVs in goslings. In addition, GoAstVs were less adapted to ducks and chickens, with significantly lower CAI values than to geese, which may be a reason for the different prevalence of GoAstVs among these species. Extensive investigations on dinucleotide distribution revealed a significant suppression of the CpG and UpA motifs in the virus genome, which may facilitate adaptation to the host's innate immune system by evading surveillance. In addition, our study reported the trends of increasing fitness to the host's microenvironment for GoAstVs through increasing adaptation to host CUPs and ongoing reduction of CpG motifs in the virus genome. 
The present analysis deepens our understanding of the basic biology, pathogenesis, adaptation and evolutionary pattern of GoAstVs, and contributes to the development of novel antiviral strategies.
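The suppression of CpG and UpA motifs mentioned in this abstract is usually assessed with a dinucleotide observed/expected ratio. The Python sketch below shows one common form of that ratio, the dinucleotide frequency divided by the product of the two mononucleotide frequencies. It is an assumption that this matches the authors' exact procedure, and the function name `dinucleotide_odds` is illustrative only.

```python
from collections import Counter


def dinucleotide_odds(seq):
    """Return {dinucleotide: observed/expected ratio} for `seq`.

    ratio(XY) = f(XY) / (f(X) * f(Y)), with frequencies taken from the
    sequence itself. RNA input is accepted (U is read as T). The ratio is
    NaN when either base is absent from the sequence.
    """
    seq = seq.upper().replace("U", "T")
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two bases")
    mono = Counter(seq)
    di = Counter(seq[i:i + 2] for i in range(n - 1))  # overlapping pairs
    ratios = {}
    for x in "ACGT":
        for y in "ACGT":
            fx, fy = mono[x] / n, mono[y] / n
            fxy = di[x + y] / (n - 1)
            ratios[x + y] = fxy / (fx * fy) if fx and fy else float("nan")
    return ratios


if __name__ == "__main__":
    # Toy sequence (hypothetical); real genomes are far longer.
    odds = dinucleotide_odds("AUGGCGUACGAUUAGCCGAU")
    print("CpG:", round(odds["CG"], 2), "UpA:", round(odds["TA"], 2))
```

A ratio well below 1 indicates that a dinucleotide is under-represented relative to base composition, which is how CpG or UpA suppression is read off.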
Affiliation(s)
- Fucheng Guo
  - Department of Veterinary Medicine, College of Coastal Agricultural Science, Guangdong Ocean University, Zhanjiang, 524088, China
- Jinjin Yang
  - Technology Center of Zhanjiang Customs District, Zhanjiang, 524000, Guangdong, China
- A M Abd El-Aty
  - Department of Pharmacology, Faculty of Veterinary Medicine, Cairo University, 12211 Giza, Egypt
  - Department of Medical Pharmacology, Medical Faculty, Ataturk University, Erzurum 25240, Turkey
- Ruichen Wang
  - Chinese Center for Disease Control and Prevention, National Institute for Viral Disease Control and Prevention, Beijing 102206, China
- Xianghong Ju
  - Department of Veterinary Medicine, College of Coastal Agricultural Science, Guangdong Ocean University, Zhanjiang, 524088, China
  - Marine Medical Research and Development Centre, Shenzhen Institute of Guangdong Ocean University, Shenzhen 518120, China
7
Jitobaom K, Sirihongthong T, Boonarkart C, Phakaratsakul S, Suptawiwat O, Auewarakul P. Human Schlafen 11 inhibits influenza A virus production. Virus Res 2023; 334:199162. [PMID: 37356582 PMCID: PMC10410578 DOI: 10.1016/j.virusres.2023.199162] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 05/03/2023] [Revised: 06/20/2023] [Accepted: 06/22/2023] [Indexed: 06/27/2023]
Abstract
Schlafen (SLFN) proteins are a subset of interferon-stimulated early response genes with antiviral properties. An antiviral mechanism of SLFN11 was previously demonstrated in human immunodeficiency virus type 1 (HIV-1)-infected cells, and it was shown that SLFN11 inhibited HIV-1 virus production in a codon usage-specific manner. The codon usage patterns of many viruses are vastly different from those of their hosts. The codon usage-specific inhibition of HIV-1 expression by SLFN11 suggests that SLFN11 may be able to inhibit other viruses with a suboptimal codon usage pattern. However, the effect of SLFN11 on the replication of influenza A virus (IAV) has never been reported. The induction of SLFN11 expression was observed upon IAV infection. The reduction of SLFN11 expression also promotes influenza virus replication. Moreover, we found that overexpression of SLFN11 could reduce the expression of a reporter gene with a viral codon usage pattern, and the inhibition of viral hemagglutinin (HA) gene was codon-specific as the expression of codon optimized HA was not affected. These results indicate that SLFN11 inhibits the influenza A virus in a codon-specific manner and that SLFN11 may contribute to innate defense against influenza A viruses.
Affiliation(s)
- Kunlakanya Jitobaom
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok 10700, Thailand
- Thanyaporn Sirihongthong
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok 10700, Thailand
- Chompunuch Boonarkart
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok 10700, Thailand
- Supinya Phakaratsakul
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok 10700, Thailand
- Ornpreya Suptawiwat
  - Princess Srisavangavadhana College of Medicine, Chulabhorn Royal Academy, Thailand
- Prasert Auewarakul
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, 2 Wanglang Road, Bangkoknoi, Bangkok 10700, Thailand
8
Wang L, Zhao H, Wang Z, Ding S, Qin L, Jiang R, Deng X, He Z, Li L. An Evolutionary Perspective of Codon Usage Pattern, Dinucleotide Composition and Codon Pair Bias in Prunus Necrotic Ringspot Virus. Genes (Basel) 2023; 14:1712. [PMID: 37761852 PMCID: PMC10530913 DOI: 10.3390/genes14091712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Revised: 08/24/2023] [Accepted: 08/25/2023] [Indexed: 09/29/2023] Open
Abstract
Prunus necrotic ringspot virus (PNRSV) is a significant virus of ornamental plants and fruit trees. It is essential to study this virus due to its impact on the horticultural industry. Several studies on PNRSV diversity and phytosanitary detection technology were reported, but the content on the codon usage bias (CUB), dinucleotide preference and codon pair bias (CPB) of PNRSV is still uncertain. We performed comprehensive analyses on a dataset consisting of 359 coat protein (CP) gene sequences in PNRSV to examine the characteristics of CUB, dinucleotide composition, and CPB. The CUB analysis of PNRSV CP sequences showed that it was not only affected by natural selection, but also affected by mutations, and natural selection played a more significant role compared to mutations as the driving force. The dinucleotide composition analysis showed an over-expression of the CpC/GpA dinucleotides and an under-expression of the UpA/GpC dinucleotides. The dinucleotide composition of the PNRSV CP gene showed a weak association with the viral lineages and hosts, but a strong association with viral codon positions. Furthermore, the CPB of PNRSV CP gene is low and is related to dinucleotide preference and codon usage patterns. This research provides reference for future research on PNRSV genetic diversity and gene evolution mechanism.
Affiliation(s)
- Lingqi Wang
  - College of Horticulture and Landscape Architecture, Yangzhou University, Yangzhou 225009, China
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Haiting Zhao
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Zhilei Wang
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Shiwen Ding
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Lang Qin
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Runzhou Jiang
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Xiaolong Deng
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
- Zhen He
  - College of Plant Protection, Yangzhou University, Yangzhou 225009, China
  - Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China
- Liangjun Li
  - College of Horticulture and Landscape Architecture, Yangzhou University, Yangzhou 225009, China
  - Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Yangzhou 225009, China
9
Perrotti E, L'Episcopia M, Menegon M, Soares IS, Rosas-Aguirre A, Speybroeck N, LLanos-Cuentas A, Menard D, Ferreira MU, Severini C. Reduced polymorphism of Plasmodium vivax early transcribed membrane protein (PvETRAMP) 11.2. Parasit Vectors 2023; 16:238. [PMID: 37461081 DOI: 10.1186/s13071-023-05851-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/20/2023] [Accepted: 06/27/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND: ETRAMP11.2 (PVX_003565) is a well-characterized protein with antigenic potential. It is considered to be a serological marker for diagnostic tools, and it has been suggested as a potential vaccine candidate. Despite its immunological relevance, the polymorphism of the P. vivax ETRAMP11.2 gene (pvetramp11.2) remains undefined. The genetic variability of an antigen may limit the effectiveness of its application as a serological surveillance tool and in vaccine development and, therefore, the aim of this study was to investigate the genetic diversity of pvetramp11.2 in parasite populations from Amazonian regions and worldwide. We also evaluated amino acid polymorphism on predicted B-cell epitopes. The low variability of the sequence encoding PvETRAMP11.2 protein suggests that it would be a suitable marker in prospective serodiagnostic assays for surveillance strategies or in vaccine design against P. vivax malaria.
METHODS: The pvetramp11.2 of P. vivax isolates collected from Brazil (n = 68) and Peru (n = 36) were sequenced and analyzed to assess nucleotide polymorphisms, allele distributions, population differentiation, genetic diversity and signature of selection. In addition, sequences (n = 104) of seven populations from different geographical regions were retrieved from the PlasmoDB database and included in the analysis to study the worldwide allele distribution. Potential linear B-cell epitopes and their polymorphisms were also explored.
RESULTS: The multiple alignments of 208 pvetramp11.2 sequences revealed a low polymorphism and a marked geographical variation in allele diversity. Seven polymorphic sites and 11 alleles were identified. All of the alleles were detected in isolates from the Latin American region and five alleles were detected in isolates from the Southeast Asia/Papua New Guinea (SEA/PNG) region. Three alleles were shared by all Latin American populations (H1, H6 and H7). The H1 allele (reference allele from Salvador-1 strain), which was absent in the SEA/PNG populations, was the most represented allele in populations from Brazil (54%) and was also detected at high frequencies in populations from all other Latin America countries (range: 13.0% to 33.3%). The H2 allele was the major allele in SEA/PNG populations, but was poorly represented in Latin America populations (only in Brazil: 7.3%). Plasmodium vivax populations from Latin America showed a marked inter-population genetic differentiation (fixation index [Fst]) in contrast to SEA/PNG populations. Codon bias measures (effective number of codons [ENC] and Codon bias index [CBI]) indicated preferential use of synonymous codons, suggesting selective pressure at the translation level. Only three amino acid substitutions, located in the C-terminus, were detected. Linear B-cell epitope mapping predicted two epitopes in the Sal-1 PvETRAMP11.2 protein, one of which was fully conserved in all of the parasite populations analyzed.
CONCLUSIONS: We provide an overview of the allele distribution and genetic differentiation of ETRAMP11.2 antigen in P. vivax populations from different endemic areas of the world. The reduced polymorphism and the high degree of protein conservation supports the application of PvETRAMP11.2 protein as a reliable antigen for application in serological assays or vaccine design. Our findings provide useful information that can be used to inform future study designs.
Affiliation(s)
- Edvige Perrotti
  - Department of Infectious Diseases, Istituto Superiore Di Sanità, Rome, Italy
- Michela Menegon
  - Department of Infectious Diseases, Istituto Superiore Di Sanità, Rome, Italy
- Irene S Soares
  - Department of Clinical and Toxicological Analyses, School of Pharmaceutical Sciences, University of São Paulo, São Paulo, Brazil
- Angel Rosas-Aguirre
  - Research Institute of Health and Society (IRSS), Université Catholique de Louvain, Brussels, Belgium
  - Instituto de Medicina Tropical "Alexander Von Humboldt", Universidad Peruana Cayetano Heredia, Lima, Peru
- Niko Speybroeck
  - Research Institute of Health and Society (IRSS), Université Catholique de Louvain, Brussels, Belgium
- Alejandro LLanos-Cuentas
  - Instituto de Medicina Tropical "Alexander Von Humboldt", Universidad Peruana Cayetano Heredia, Lima, Peru
- Didier Menard
  - Laboratoire de Parasitologie Et Mycologie Médicale, Les Hôpitaux Universitaires de Strasbourg, Strasbourg, France
  - Institut de Parasitologie Et Pathologie Tropicale, Université de Strasbourg, Strasbourg, France
  - Malaria Genetics and Resistance Unit-INSERM U1201, Institut Pasteur, Paris, France
- Marcelo Urbano Ferreira
  - Department of Parasitology, Institute of Biomedical Sciences, University of São Paulo, São Paulo, Brazil
  - Global Health and Tropical Medicine, Instituto de Higiene e Medicina Tropical, Universidade NOVA de Lisboa, Lisbon, Portugal
- Carlo Severini
  - Department of Infectious Diseases, Istituto Superiore Di Sanità, Rome, Italy
10
Noor F, Ashfaq UA, Bakar A, Qasim M, Masoud MS, Alshammari A, Alharbi M, Riaz MS. Identification and characterization of codon usage pattern and influencing factors in HFRS-causing hantaviruses. Front Immunol 2023; 14:1131647. [PMID: 37492567 PMCID: PMC10364125 DOI: 10.3389/fimmu.2023.1131647] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/25/2022] [Accepted: 06/22/2023] [Indexed: 07/27/2023] Open
Abstract
Hemorrhagic fever with renal syndrome (HFRS) is an acute viral zoonosis carried and transmitted by infected rodents through urine, droppings, or saliva. The etiology of HFRS is complex due to the involvement of viral factors and host immune and genetic factors which hinder the development of potential therapeutic solutions for HFRS. Hantaan virus (HTNV), Dobrava-Belgrade virus (DOBV), Seoul virus (SEOV), and Puumala virus (PUUV) are predominantly found in hantaviral species that cause HFRS in patients. Despite ongoing prevention and control efforts, HFRS remains a serious economic burden worldwide. Furthermore, recent studies reported that the hantavirus nucleocapsid protein is a multi-functional protein and plays a major role in the replication cycle of the hantavirus. However, the precise mechanism of the nucleoproteins in viral pathogenesis is not completely understood. In the framework of the current study, various in silico approaches were employed to identify the factors influencing the codon usage pattern of hantaviral nucleoproteins. Based on the relative synonymous codon usage (RSCU) values, a comparative analysis was performed between HFRS-causing hantavirus and their hosts, suggesting that HTNV, DOBV, SEOV, and PUUV, were inclined to evolve their codon usage patterns that were comparable to those of their hosts. The results indicated that most of the overrepresented codons had AU-endings, which revealed that mutational pressure is the major force shaping codon usage patterns. However, the influence of natural selection and geographical factors cannot be ignored on viral codon usage bias. Further analysis also demonstrated that HFRS causing hantaviruses adapted host-specific codon usage patterns to sustain successful replication and transmission chains within hosts. To our knowledge, no study to date reported the factors influencing the codon usage pattern within hantaviral nucleoproteins. 
Thus, the proposed computational scheme can help in understanding the underlying mechanism of codon usage patterns in HFRS-causing hantaviruses, which may aid the design of effective anti-HFRS treatments in the future. This study, although comprehensive, relies on in silico methods and thus requires experimental validation before firmer conclusions can be drawn. Beyond the identified factors influencing viral behavior, there could be other, as yet undiscovered, influences. These potential factors should be targets for further research to improve HFRS therapeutic strategies.
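The relative synonymous codon usage (RSCU) measure that this abstract relies on can be illustrated with a short sketch. It is not the authors' pipeline: the function name `rscu`, the use of the standard genetic code, and the toy input are assumptions made for illustration only. RSCU for a codon is its observed count divided by the mean count of all codons in the same synonymous family, so values above 1 indicate over-represented codons.

```python
from collections import Counter, defaultdict

# Standard genetic code in TCAG order, written for the DNA alphabet.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def rscu(cds: str) -> dict:
    """Return RSCU for each of the 61 sense codons of one in-frame CDS."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # drop a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))

    # Group sense codons by the amino acid they encode.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    values = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        for c in codons:
            # observed count / (family total / family size)
            values[c] = counts[c] * len(codons) / total if total else 0.0
    return values


if __name__ == "__main__":
    # Toy alanine-only sequence: GCT x2, GCC x1, GCA x1, GCG x0.
    print(rscu("GCTGCTGCCGCA")["GCT"])
```

In practice the counts would be pooled over many coding sequences before the ratio is taken; pooling is left out here to keep the sketch short.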
Affiliation(s)
- Fatima Noor
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
- Usman Ali Ashfaq
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
- Abu Bakar
- Centre of Agricultural Biochemistry and Biotechnology (CABB), University of Agriculture, Faisalabad, Pakistan
- Muhammad Qasim
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
- Muhammad Shareef Masoud
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
- Abdulrahman Alshammari
- Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- Metab Alharbi
- Department of Pharmacology and Toxicology, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
11
Qin L, Ding S, He Z. Compositional biases and evolution of the largest plant RNA virus order Patatavirales. Int J Biol Macromol 2023; 240:124403. [PMID: 37076075 DOI: 10.1016/j.ijbiomac.2023.124403] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 11/24/2022] [Revised: 03/13/2023] [Accepted: 03/25/2023] [Indexed: 04/21/2023]
Abstract
Patatavirales is the largest order of plant RNA viruses and exclusively contains the family Potyviridae, accounting for 30 % of all known plant viruses. The composition bias of animal RNA viruses and several plant RNA viruses has been determined. However, the comprehensive nucleic acid composition, codon pair usage patterns, dinucleotide preference and codon pair preference of plant RNA viruses have not been investigated to date. In this study, integrated analysis and discussion of the nucleic acid composition, codon usage patterns, dinucleotide composition and codon pair bias of potyvirids were performed using 3732 complete genome coding sequences. The nucleic acid composition of potyvirids was significantly enriched in A/U. Interestingly, the A/U-rich nucleotide composition of Patatavirales is essential for determining the preferred A-ended and U-ended codons and the overexpression of UpG and CpA dinucleotides. The codon usage patterns and codon pair bias of potyvirids were significantly correlated with their nucleic acid composition. Additionally, the codon usage pattern, dinucleotide composition and codon-pair bias of potyvirids are more dependent on the classification of the virus compared with their hosts. Our analysis provides a better understanding of future research on the origin and evolution patterns of the order Patatavirales.
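Dinucleotide preference, as discussed in this abstract, is commonly summarized as an observed/expected odds ratio. The sketch below is an illustration under stated assumptions, not the method of the cited study: the function name `dinucleotide_odds` is hypothetical, and the ratio is computed as the dinucleotide frequency divided by the product of the two mononucleotide frequencies on a single strand.

```python
from collections import Counter

RNA_BASES = "ACGU"


def dinucleotide_odds(seq: str) -> dict:
    """Return observed/expected ratios for all 16 dinucleotides of one sequence."""
    seq = seq.upper().replace("T", "U")
    mono = Counter(b for b in seq if b in RNA_BASES)
    # Count overlapping dinucleotides, skipping any that contain ambiguous bases.
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in RNA_BASES and seq[i + 1] in RNA_BASES
    )
    n_mono, n_di = sum(mono.values()), sum(di.values())
    if n_mono == 0 or n_di == 0:
        raise ValueError("sequence has no unambiguous dinucleotides")

    odds = {}
    for x in RNA_BASES:
        for y in RNA_BASES:
            expected = (mono[x] / n_mono) * (mono[y] / n_mono)
            observed = di[x + y] / n_di
            odds[x + y] = observed / expected if expected else 0.0
    return odds


if __name__ == "__main__":
    ratios = dinucleotide_odds("AUAUAUAU")
    print(round(ratios["AU"], 3), ratios["CG"])
```

A widely used rule of thumb treats ratios above roughly 1.23 as over-represented and below roughly 0.78 as under-represented; whether the cited study applied those cut-offs is not stated in the abstract.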
Affiliation(s)
- Lang Qin
- College of Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China
- Shiwen Ding
- College of Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China
- Zhen He
- College of Plant Protection, Yangzhou University, Wenhui East Road No.48, Yangzhou 225009, Jiangsu Province, PR China.
12
Fumagalli SE, Padhiar NH, Meyer D, Katneni U, Bar H, DiCuccio M, Komar AA, Kimchi-Sarfaty C. Analysis of 3.5 million SARS-CoV-2 sequences reveals unique mutational trends with consistent nucleotide and codon frequencies. Virol J 2023; 20:31. [PMID: 36812119 PMCID: PMC9936480 DOI: 10.1186/s12985-023-01982-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 12/13/2022] [Accepted: 02/02/2023] [Indexed: 02/19/2023] Open
Abstract
BACKGROUND Since the onset of the SARS-CoV-2 pandemic, bioinformatic analyses have been performed to understand the nucleotide and synonymous codon usage features and mutational patterns of the virus. However, comparatively few have attempted to perform such analyses on a considerably large cohort of viral genomes while organizing the plethora of available sequence data for a month-by-month analysis to observe changes over time. Here, we aimed to perform sequence composition and mutation analysis of SARS-CoV-2, separating sequences by gene, clade, and timepoints, and contrast the mutational profile of SARS-CoV-2 to other comparable RNA viruses. METHODS Using a cleaned, filtered, and pre-aligned dataset of over 3.5 million sequences downloaded from the GISAID database, we computed nucleotide and codon usage statistics, including calculation of relative synonymous codon usage values. We then calculated codon adaptation index (CAI) changes and a nonsynonymous/synonymous mutation ratio (dN/dS) over time for our dataset. Finally, we compiled information on the types of mutations occurring for SARS-CoV-2 and other comparable RNA viruses, and generated heatmaps showing codon and nucleotide composition at high entropy positions along the Spike sequence. RESULTS We show that nucleotide and codon usage metrics remain relatively consistent over the 32-month span, though there are significant differences between clades within each gene at various timepoints. CAI and dN/dS values vary substantially between different timepoints and different genes, with Spike gene on average showing both the highest CAI and dN/dS values. Mutational analysis showed that SARS-CoV-2 Spike has a higher proportion of nonsynonymous mutations than analogous genes in other RNA viruses, with nonsynonymous mutations outnumbering synonymous ones by up to 20:1. However, at several specific positions, synonymous mutations were overwhelmingly predominant. 
CONCLUSIONS Our multifaceted analysis covering both the composition and mutation signature of SARS-CoV-2 gives valuable insight into the nucleotide frequency and codon usage heterogeneity of SARS-CoV-2 over time, and its unique mutational profile compared to other RNA viruses.
Affiliation(s)
- Sarah E Fumagalli
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
- Nigam H Padhiar
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
- Douglas Meyer
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
- Upendra Katneni
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA
- Haim Bar
- Department of Statistics, University of Connecticut, Storrs, CT, USA
- Anton A Komar
- Department of Biological, Geological and Environmental Sciences, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, USA
- Chava Kimchi-Sarfaty
- Hemostasis Branch, Division of Plasma Protein Therapeutics, Office of Tissues and Advanced Therapies, Center for Biologics Evaluation and Research, US Food and Drug Administration, Silver Spring, MD, USA.
13
Alqahtani T, Khandia R, Puranik N, Alqahtani AM, Chidambaram K, Kamal MA. Codon Usage is Influenced by Compositional Constraints in Genes Associated with Dementia. Front Genet 2022; 13:884348. [PMID: 36017501 PMCID: PMC9395603 DOI: 10.3389/fgene.2022.884348] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 02/26/2022] [Accepted: 04/18/2022] [Indexed: 11/28/2022] Open
Abstract
Dementia is a clinical syndrome characterized by progressive cognitive decline, and the symptoms could be gradual, persistent, and progressive. In the present study, we investigated 47 genes that have been linked to dementia. Compositional, selectional, and mutational forces were seen to be involved. Nucleotide components that influenced A- and GC-affected codon usages bias at all three codon positions. The influence of these two compositional constraints on codon usage bias (CUB) was positive for nucleotide A and negative for GC. Nucleotide A also experienced the highest mutational force, and GC-ending codons were preferred over AT-ending codons. A high bias toward GC-ending codons enhances the gene expression level, evidenced by the positive association between CAI- and GC-ending codons. Unusual behavior of the TTG codon showing an inverse relationship with the GC-ending codon and negative influence of gene expression, behavior contrary to all other GC-ending codons, shows an operative selectional force. Furthermore, parity analysis, higher translational selection value, preference of GC-ending codons over AT-ending codons, and association of gene length with gene expression refer to the dominant role of selection pressure with compositional constraint and mutational force-shaping codon usage.
Affiliation(s)
- Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
- Nidhi Puranik
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
- Ali M Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
- Kumarappan Chidambaram
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha, Saudi Arabia
- Mohammad Amjad Kamal
- Institutes for Systems Genetics, Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China; King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi Arabia; Department of Pharmacy, Faculty of Allied Health Sciences, Daffodil International University, Dhaka, Bangladesh; Enzymoics, Novel Global Community Educational Foundation, Hebersham, NSW, Australia
14
Robins WP, Mekalanos JJ. Covariance predicts conserved protein residue interactions important for the emergence and continued evolution of SARS-CoV-2 as a human pathogen. PLoS One 2022; 17:e0270276. [PMID: 35895734 PMCID: PMC9328546 DOI: 10.1371/journal.pone.0270276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/19/2022] [Accepted: 06/07/2022] [Indexed: 12/03/2022] Open
Abstract
SARS-CoV-2 is one of three recognized coronaviruses (CoVs) that have caused epidemics or pandemics in the 21st century and that likely emerged from animal reservoirs. Differences in nucleotide and protein sequence composition within related β-coronaviruses are often used to better understand CoV evolution, host adaptation, and their emergence as human pathogens. Here we report the comprehensive analysis of amino acid residue changes that have occurred in lineage B β-coronaviruses that show covariance with each other. This analysis revealed patterns of covariance within conserved viral proteins that potentially define conserved interactions within and between core proteins encoded by SARS-CoV-2 related β-coronaviruses. We identified not only individual pairs but also networks of amino acid residues that exhibited statistically high frequencies of covariance with each other using an independent pair model followed by a tandem model approach. Using 149 different CoV genomes that vary in their relatedness, we identified networks of unique combinations of alleles that can be incrementally traced genome by genome within different phylogenic lineages. Remarkably, covariant residues and their respective regions most abundantly represented are implicated in the emergence of SARS-CoV-2 and are also enriched in dominant SARS-CoV-2 variants.
Affiliation(s)
- William P. Robins
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, United States of America
- John J. Mekalanos
- Department of Microbiology, Harvard Medical School, Boston, Massachusetts, United States of America
15
Wu H, Li B, Miao Z, Hu L, Zhou L, Lu Y. Codon usage of host-specific P genotypes (VP4) in group A rotavirus. BMC Genomics 2022; 23:518. [PMID: 35842571 PMCID: PMC9288207 DOI: 10.1186/s12864-022-08730-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2022] [Accepted: 06/30/2022] [Indexed: 11/26/2022] Open
Abstract
Background Group A rotavirus (RVA) is a common causative agent of acute gastroenteritis in infants and young children worldwide. RVA P genotypes, determined by VP4 sequences, have been confirmed to infect humans and animals. However, their codon usage patterns that are essential to obtain insights into the viral evolution, host adaptability, and genetic characterization remained unclear, especially across animal hosts. Results We performed a comprehensive codon usage analysis of eight host-specific RVA P genotypes, including human RVA (P[4] and P[8]), porcine RVA (P[13] and P[23]), and zoonotic RVA (P[1], P[6], P[7] and P[19]), based on 233 VP4 complete coding sequences. Nucleotide composition, relative synonymous codon usage (RSCU), and effective number of codons (ENC) were calculated. Principal component analysis (PCA) based on RSCU values was used to explore the codon usage patterns of different RVA P genotypes. In addition, mutation pressure and natural selection were identified by using ENC-plot, parity rule 2 plot, and neutrality plot analyses. All VP4 sequences preferred using A/U nucleotides (A: 0.354-0.377, U: 0.267-0.314) than G/C nucleotides across genotypes. Similarly, majority of commonly used synonymous codons were likely to end with A/U nucleotides (A: 9/18-12/18, U: 6/18-9/18). In PCA, human, porcine, and zoonotic genotypes clustered separately in terms of RSCU values, indicating the host-specific codon usage patterns; however, porcine and zoonotic genotypes were partly overlapped. Human genotypes, P[4] and P[8], had stronger codon usage bias, as indicated by more over-represented codons and lower ENC, compared to porcine and zoonotic genotypes. Moreover, natural selection was determined to be a predominant driver in shaping the codon usage bias across the eight P genotypes. In addition, mutation pressure contributed to the codon usage bias of human genotypes. 
Conclusions Our study identified a strong codon usage bias of human RVA P genotypes attributable to both natural selection and mutation pressure, whereas the similar codon usage bias of porcine and zoonotic genotypes was predominantly attributable to natural selection. It further suggests possible cross-species transmission and therefore warrants further surveillance of RVA P genotypes for early identification of zoonotic infection.
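The ENC-plot mentioned in this abstract compares each gene's observed effective number of codons with the value expected if GC content at synonymous third positions (GC3s) were the only constraint. A minimal sketch of that expected curve follows, using Wright's approximation ENC = 2 + s + 29 / (s^2 + (1 - s)^2), where s is GC3s. The function name `expected_enc` is hypothetical, and the sketch does not compute the observed ENC itself.

```python
def expected_enc(gc3s: float) -> float:
    """Expected ENC under GC3s-only constraint (Wright's approximation)."""
    if not 0.0 <= gc3s <= 1.0:
        raise ValueError("gc3s must be a fraction between 0 and 1")
    return 2.0 + gc3s + 29.0 / (gc3s ** 2 + (1.0 - gc3s) ** 2)


if __name__ == "__main__":
    # Tabulate the reference curve that observed ENC values are plotted against.
    for s in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(f"GC3s={s:.2f}  expected ENC={expected_enc(s):.2f}")
```

Genes whose observed ENC falls well below this curve are usually read as being shaped by forces other than mutational bias alone, which is the interpretation the abstract applies to natural selection.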
Affiliation(s)
- Han Wu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
- Bingzhe Li
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
- Ziping Miao
- Institute of Communicable Diseases Prevention and Control, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, 310052, Zhejiang, China
- Linjie Hu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
- Lu Zhou
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China
- Yihan Lu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety (Fudan University), School of Public Health, Fudan University, Shanghai, 200032, China.
16
Zeng L, Chen M, Wang M, Zhu L, Yan J, Zhang X, Xu J, Zhang S. Enterovirus A Shows Unique Patterns of Codon Usage Bias in Conventional Versus Unconventional Clade. Front Cell Infect Microbiol 2022; 12:941325. [PMID: 35909978 PMCID: PMC9329520 DOI: 10.3389/fcimb.2022.941325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/11/2022] [Accepted: 06/21/2022] [Indexed: 11/13/2022] Open
Abstract
Enterovirus A (EV-A) species cause hand, foot and mouth disease (HFMD), threatening the health of young children. Understanding the mutual codon usage pattern of the virus and its host(s) has fundamental and applied values. Here, through examining multiple codon usage parameters, we found that the codon usage bias among EV-A strains varies and is clade-specific. EVA76, EVA89, EVA90, EVA91 and EVA92, the unconventional clade of EV-A strains, show unique codon usage pattern relative to the two conventional clades, including EVA71, CVA16, CVA6 and CVA10, etc. Analyses of Effective Number of Codon (ENC), Correspondence Analysis (COA) and Parity Rule 2 (PR2), etc., revealed that the codon usage patterns of EV-A strains are shaped by mutation pressure and natural selection. Based on the neutrality analysis, we determined the dominant role of natural selection in the formation of the codon usage bias of EV-A. In addition, we have determined the codon usage compatibility of potential hosts for EV-A strains using codon adaptation index (CAI), relative codon deoptimization index (RCDI) and similarity index (SiD) analyses, and found that EV-A showed host-specific codon adaptation patterns in different clades. Finally, we confirmed that the unique codon usage pattern of the unconventional clade affected protein expression level in human cell lines. In conclusion, we identified novel characteristics of codon usage bias in distinct EV-A clades associated with their host range, transmission and pathogenicity.
Affiliation(s)
- Liyan Zeng
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Ming Chen
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Min Wang
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Liuyao Zhu
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Jingjing Yan
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- Xiaoyan Zhang
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- *Correspondence: Shuye Zhang; Xiaoyan Zhang; Jianqing Xu
- Jianqing Xu
- Shanghai Public Health Clinical Center and Institutes of Biomedical Sciences, Fudan University, Shanghai, China
- *Correspondence: Shuye Zhang; Xiaoyan Zhang; Jianqing Xu
- Shuye Zhang
- Clinical Center for BioTherapy & Institutes of Biomedical Sciences, Zhongshan Hospital, Fudan University, Shanghai, China
- *Correspondence: Shuye Zhang; Xiaoyan Zhang; Jianqing Xu
17
Tyagi N, Sardar R, Gupta D. Natural selection plays a significant role in governing the codon usage bias in the novel SARS-CoV-2 variants of concern (VOC). PeerJ 2022; 10:e13562. [PMID: 35765592 PMCID: PMC9233899 DOI: 10.7717/peerj.13562] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 02/11/2022] [Accepted: 05/19/2022] [Indexed: 01/17/2023] Open
Abstract
The ongoing prevailing COVID-19 pandemic caused by SARS-CoV-2 is becoming one of the major global health concerns worldwide. The SARS-CoV-2 genome encodes spike (S) glycoprotein that plays a very crucial role in viral entry into the host cell via binding of its receptor binding domain (RBD) to the host angiotensin converting enzyme 2 (ACE2) receptor. The continuously evolving SARS-CoV-2 genome results in more severe and transmissible variants characterized by the emergence of novel mutations called 'variants of concern' (VOC). The currently designated alpha, beta, gamma, delta and omicron VOC are the focus of this study due to their high transmissibility, increased virulence, and concerns for decreased effectiveness of the available vaccines. In VOC, the spike (S) gene and other non-structural protein mutations may affect the efficacies of the approved COVID-19 vaccines. To understand the diversity of SARS-CoV-2, several studies have been performed on a limited number of sequences. However, only a few studies have focused on codon usage bias (CUBs) pattern analysis of all the VOC strains. Therefore, to evaluate the evolutionary divergence of all VOC S-genes, we performed CUBs analysis on 300,354 sequences to understand the evolutionary relationship with its adaptation in different hosts, i.e., humans, bats, and pangolins. Base composition and RSCU analysis revealed the presence of 20 preferred AU-ended and 10 under-preferred GC-ended codons. In addition, CpG was found to be depleted, which may be attributable to the adaptive response by viruses to escape from the host defense process. Moreover, the ENC values revealed a higher bias in codon usage in the VOC S-gene. Further, the neutrality plot analysis demonstrated that S-genes analyzed in this study are under 83.93% influence of natural selection, suggesting its pivotal role in shaping the CUBs. The CUBs pattern of S-genes was found to be very similar among all the VOC strains. 
Interestingly, we observed that VOC strains followed a trend of antagonistic codon usage with respect to the human host. The identified CUBs divergence would help to understand the virus evolution and its host adaptation, thus help design novel vaccine strategies against the emerging VOC strains. To the best of our knowledge, this is the first report for identifying the evolution of CUBs pattern in all the currently identified VOC.
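The percentage attributed to natural selection in this abstract comes from a neutrality plot, in which GC content at codon positions 1 and 2 (GC12) is regressed on GC content at position 3 (GC3). The sketch below shows that calculation under simplifying assumptions: the names `gc12_gc3` and `neutrality_slope` are hypothetical, GC3 is taken over all third positions rather than synonymous ones only, and the fit is a plain least-squares fit. The slope is conventionally read as the relative contribution of mutation pressure and one minus the slope as that of selection, so the reported 83.93% would correspond to a slope near 0.16.

```python
def gc12_gc3(cds: str) -> tuple:
    """Return (GC12, GC3) for one in-frame coding sequence."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    if usable < 3:
        raise ValueError("sequence must contain at least one codon")
    # Bases at codon positions 1, 2 and 3 respectively.
    by_pos = [seq[i:usable:3] for i in range(3)]
    gc = [(p.count("G") + p.count("C")) / len(p) for p in by_pos]
    return (gc[0] + gc[1]) / 2, gc[2]


def neutrality_slope(cds_list) -> float:
    """Least-squares slope of GC12 (y) regressed on GC3 (x)."""
    points = [gc12_gc3(c) for c in cds_list]
    ys = [p[0] for p in points]
    xs = [p[1] for p in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("GC3 does not vary across sequences")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    slope = neutrality_slope(["GCGGCG", "ATAATA", "ATGATG"])
    print(f"mutation pressure ~ {slope:.2f}, selection ~ {1 - slope:.2f}")
```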
Affiliation(s)
- Neetu Tyagi
- Translational Bioinformatics Group, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India; Regional Centre for Biotechnology, Faridabad, Haryana, India
- Rahila Sardar
- Translational Bioinformatics Group, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India; Biochemistry, Jamia Hamdard University, New Delhi, India
- Dinesh Gupta
- Translational Bioinformatics Group, International Centre for Genetic Engineering and Biotechnology (ICGEB), New Delhi, India
18
Li B, Wu H, Miao Z, Hu L, Zhou L, Lu Y. Codon Usage of Hepatitis E Viruses: A Comprehensive Analysis. Front Microbiol 2022; 13:938651. [PMID: 35801104 PMCID: PMC9253588 DOI: 10.3389/fmicb.2022.938651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/07/2022] [Accepted: 05/30/2022] [Indexed: 11/25/2022] Open
Abstract
Hepatitis E virus (HEV) is an emerging zoonotic pathogen with multiple species and genotypes, which may be classified into human, animal, and zoonotic HEV. The codon usage bias of HEV has remained unclear. This study aims to characterize the codon usage of HEV and elucidate the main drivers influencing the codon usage bias. A total of seven HEV genotypes, HEV-1 (human HEV), HEV-3 and HEV-4 (zoonotic HEV), HEV-8, HEV-B, HEV-C1, and HEV-C2 (emerging animal HEV), were included in the study. Complete coding sequences, ORF1, ORF2, and ORF3, were accordingly obtained from GenBank. Except for HEV-8, the other six genotypes tended to use codons ending in G/C. Based on the analysis of relative synonymous codon usage (RSCU) and principal component analysis (PCA), codon usage bias was determined for HEV genotypes. Codon usage bias differed widely across human, zoonotic, and animal HEV genotypes; furthermore, it varied within certain genotypes such as HEV-4, HEV-8, and HEV-C1. In addition, dinucleotide abundance revealed that HEV was affected by translation selection to form a unique dinucleotide usage pattern. Moreover, parity rule 2 analysis (PR2), effective codon number (ENC)-plot, and neutrality analysis were jointly performed. Natural selection played a leading role in forming HEV codon usage bias; it was predominant in HEV-1, HEV-3, HEV-B and HEV-C1, whereas it acted in combination with mutation pressure in HEV-4, HEV-8, and HEV-C2. Our findings may provide insights into HEV evolution and codon usage bias.
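The parity rule 2 (PR2) analysis named in this abstract can be sketched as follows. This is an illustration rather than the authors' procedure: the function name `pr2_bias` is hypothetical, and the calculation is restricted to third positions of the eight fourfold-degenerate codon boxes of the standard genetic code, which is the usual convention. A point at (0.5, 0.5) indicates no strand bias (A = U and G = C at these positions); departures from it are generally taken as evidence that mutation alone does not explain codon usage.

```python
from collections import Counter

# Two-letter prefixes of the eight fourfold-degenerate codon boxes
# (Ala, Gly, Pro, Thr, Val and the fourfold parts of Arg, Leu, Ser).
FOURFOLD_PREFIXES = {"GC", "GG", "CC", "AC", "GT", "CG", "CT", "TC"}


def pr2_bias(cds: str) -> tuple:
    """Return (G3/(G3+C3), A3/(A3+T3)) over fourfold-degenerate codons."""
    seq = cds.upper().replace("U", "T")
    third = Counter(
        seq[i + 2]
        for i in range(0, len(seq) - 2, 3)
        if seq[i:i + 2] in FOURFOLD_PREFIXES
    )
    g, c, a, t = (third[b] for b in "GCAT")
    if g + c == 0 or a + t == 0:
        raise ValueError("not enough fourfold-degenerate codons to compute PR2")
    return g / (g + c), a / (a + t)


if __name__ == "__main__":
    gc_bias, au_bias = pr2_bias("GCAGCAGCTGGGGGC")
    print(f"G3/(G3+C3)={gc_bias:.3f}  A3/(A3+U3)={au_bias:.3f}")
```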
Affiliation(s)
- Bingzhe Li
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
- Han Wu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
- Ziping Miao
- Institute of Communicable Diseases Prevention and Control, Zhejiang Provincial Center for Disease Control and Prevention, Hangzhou, China
- Linjie Hu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
- Lu Zhou
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
- Yihan Lu
- Department of Epidemiology, Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
- *Correspondence: Yihan Lu
19
Peng Q, Zhang X, Li J, He W, Fan B, Ni Y, Liu M, Li B. Comprehensive analysis of codon usage pattern of porcine deltacoronavirus and its host adaptability. Transbound Emerg Dis 2022; 69:e2443-e2455. [DOI: 10.1111/tbed.14588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2022] [Revised: 05/01/2022] [Accepted: 05/03/2022] [Indexed: 11/30/2022]
Affiliation(s)
- Qi Peng
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Jiangsu Co‐Innovation Center for the Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Jiangsu Key Laboratory of Zoonoses Yangzhou University Yangzhou 225009 China
- Xue Zhang
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Key Laboratory of Animal Disease Diagnosis and Immunology, College of Veterinary Medicine Nanjing Agricultural University Nanjing 210095 China
- Jizong Li
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Jiangsu Co‐Innovation Center for the Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Jiangsu Key Laboratory of Zoonoses Yangzhou University Yangzhou 225009 China
- Wenlong He
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- College of Veterinary Medicine Hebei Agricultural University Baoding 071001 China
- Baochao Fan
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Jiangsu Co‐Innovation Center for the Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Jiangsu Key Laboratory of Zoonoses Yangzhou University Yangzhou 225009 China
- Yanxiu Ni
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Maojun Liu
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Bin Li
- Institute of Veterinary Medicine, Jiangsu Academy of Agricultural Sciences, Key Laboratory of Veterinary Biological Engineering and Technology Ministry of Agriculture Nanjing 210014 China
- Jiangsu Key Laboratory for Food Quality and Safety‐State Key Laboratory Cultivation Base Ministry of Science and Technology Nanjing 210014 China
- Jiangsu Co‐Innovation Center for the Prevention and Control of Important Animal Infectious Diseases and Zoonoses, Jiangsu Key Laboratory of Zoonoses Yangzhou University Yangzhou 225009 China
- Key Laboratory of Animal Disease Diagnosis and Immunology, College of Veterinary Medicine Nanjing Agricultural University Nanjing 210095 China
- College of Veterinary Medicine Hebei Agricultural University Baoding 071001 China
20
He Z, Qin L, Xu X, Ding S. Evolution and host adaptability of plant RNA viruses: Research insights on compositional biases. Comput Struct Biotechnol J 2022; 20:2600-2610. [PMID: 35685354 PMCID: PMC9160401 DOI: 10.1016/j.csbj.2022.05.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/12/2022] [Revised: 05/10/2022] [Accepted: 05/12/2022] [Indexed: 01/23/2023] Open
Abstract
During recent decades, many new emerging or re-emerging RNA viruses have been found in plants through the development of deep-sequencing technology and big data analysis. These findings largely changed our understanding of the origin, evolution and host range of plant RNA viruses. There is evidence that their genetic composition originates from viruses, and host populations play a key role in the evolution and host adaptability of plant RNA viruses. In this mini-review, we describe the state of our understanding of the evolution of plant RNA viruses in view of compositional biases and explore how they adapt to the host. It appears that adenine rich (A-rich) coding sequences, low CpG and UpA dinucleotide frequencies and lower codon usage patterns were found in the vast majority of plant RNA viruses. The codon usage pattern of plant RNA viruses was influenced by both natural selection and mutation pressure, and natural selection mostly from hosts was the dominant factor. The codon adaptation analyses support that plant RNA viruses probably evolved a dynamic balance between codon adaptation and deoptimization to maintain efficient replication cycles in multiple hosts with various codon usage patterns. In the future, additional combinations of computational and experimental analyses of the nucleotide composition and codon usage of plant RNA viruses should be addressed.
Affiliation(s)
- Zhen He
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No. 48, Yangzhou 225009, Jiangsu Province, PR China
- Joint International Research Laboratory of Agriculture and Agri-Product Safety of Ministry of Education of China, Yangzhou University, Wenhui East Road No. 48, Yangzhou 225009, Jiangsu Province, PR China
- Corresponding author.
| | - Lang Qin
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No. 48, Yangzhou 225009, Jiangsu Province, PR China
| | - Xiaowei Xu
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No. 48, Yangzhou 225009, Jiangsu Province, PR China
| | - Shiwen Ding
- School of Horticulture and Plant Protection, Yangzhou University, Wenhui East Road No. 48, Yangzhou 225009, Jiangsu Province, PR China
| |
|
21
|
Wang Y, Yao L, Fan J, Zhao X, Zhang Q, Chen Y, Guo C. The Codon Usage Bias Analysis of Free-Living Ciliates' Macronuclear Genomes and Clustered Regularly Interspaced Short Palindromic Repeats/Cas9 Vector Construction of Stylonychia lemnae. Front Microbiol 2022; 13:785889. [PMID: 35308388 PMCID: PMC8927777 DOI: 10.3389/fmicb.2022.785889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/29/2021] [Accepted: 01/24/2022] [Indexed: 11/13/2022] Open
Abstract
Ciliates represent higher unicellular animals, and several species are also important model organisms for molecular biology research. Analyses of codon usage bias (CUB) of the macronuclear (MAC) genome in ciliates can not only promote a better understanding of the genetic mode and evolution history of these organisms but also help optimize codons to improve the gene editing efficiency of model ciliates. In this study, macronuclear genome sequences of nine free-living ciliates were analyzed with CodonW software to calculate the following indices: the guanine-cytosine content (GC); the frequency of the nucleotides U, C, A, and G at the third position of codons (U3s, C3s, A3s, G3s); the effective number of codons (ENC); the correlation between GC at the first and second positions (GC12); the frequency of the nucleotides G + C at the third position of synonymous codons (GC3s); the relative synonymous codon usage (RSCU). Parity rule 2 plot analysis, neutrality plot analysis, and correlation analysis were performed to explore the factors that influence codon preference. The results showed that the GC contents in nine ciliates' MAC genomes were lower than 50% and appeared AT-rich. The base compositions of GC12 and GC3s are markedly distinct and the codon usage pattern and evolution of ciliates are affected by genetic mutation and natural selection. According to the synonymous codon analysis, the codons of most ciliates ended with A or U and eight codons were the general optimal codons of nine ciliates. A clustered regularly interspaced short palindromic repeats/Cas9 (CRISPR/Cas9) expression vector of Stylonychia lemnae was constructed by optimizing the macronuclear genome codon and was successfully used to knock out the Adss gene. This is the first such extensive investigation of the MAC genome CUB of ciliates and the initial successful application of the CRISPR/Cas9 technique in free-living ciliates.
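Two of the indices named in this abstract, RSCU and GC3s, are simple enough to illustrate. The following is a minimal sketch, not the CodonW implementation: it assumes the standard genetic code (the ciliates studied use variant codes, which CodonW can be configured for), and the function names `rscu` and `gc3s` are hypothetical. RSCU is taken here as the observed count of a codon divided by the mean count over its synonymous family; GC3s is taken as the G + C fraction at the third position of codons that have synonymous alternatives (Met, Trp, and stop codons excluded).

```python
from collections import Counter, defaultdict

# Standard genetic code (NCBI translation table 1), bases in TCAG order.
# Assumption: the standard code is used purely for illustration.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codons(seq):
    """Split a coding sequence into codons, dropping any trailing partial codon."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def rscu(seq):
    """Relative synonymous codon usage: count * family_size / family_total."""
    counts = Counter(c for c in codons(seq) if c in CODON_TABLE)
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)
    values = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from the sequence, RSCU undefined
        for c in family:
            values[c] = counts[c] * len(family) / total
    return values


def gc3s(seq):
    """G + C fraction at the third position of synonymous codons."""
    thirds = [
        c[2] for c in codons(seq)
        if CODON_TABLE.get(c) not in (None, "*", "M", "W")
    ]
    if not thirds:
        return float("nan")
    return sum(b in "GC" for b in thirds) / len(thirds)


# Four alanine codons: GCT x3, GCC x1.
print(rscu("GCTGCTGCTGCC")["GCT"])  # 3.0
print(gc3s("GCTGCTGCTGCC"))         # 0.25
```

An RSCU of 1 means a codon is used as often as expected under uniform synonymous usage; values above 1 mark preferred codons.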
Affiliation(s)
- Ying Wang
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Lin Yao
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China; Key Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, Harbin, China
| | - Jinfeng Fan
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Xue Zhao
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Qing Zhang
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China
| | - Ying Chen
- Key Laboratory of Biodiversity of Aquatic Organisms, Harbin Normal University, Harbin, China; School of Civil and Environmental Engineering, Harbin Institute of Technology (Shenzhen), Shenzhen, China
| | - Changhong Guo
- Key Laboratory of Molecular Cytogenetics and Genetic Breeding of Heilongjiang Province, Harbin, China
| |
|
22
|
Gaunt ER, Digard P. Compositional biases in RNA viruses: Causes, consequences and applications. Wiley Interdiscip Rev RNA 2022; 13:e1679. [PMID: 34155814 PMCID: PMC8420353 DOI: 10.1002/wrna.1679] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 04/08/2021] [Revised: 05/29/2021] [Accepted: 05/31/2021] [Indexed: 01/05/2023]
Abstract
If each of the four nucleotides were represented equally in the genomes of viruses and the hosts they infect, each base would occur at a frequency of 25%. However, this is not observed in nature. Similarly, the order of nucleotides is not random (e.g., in the human genome, guanine follows cytosine at a frequency of ~0.0125, or a quarter the number of times predicted by random representation). Codon usage and codon order are also nonrandom. Furthermore, nucleotide and codon biases vary between species. Such biases have various drivers, including cellular proteins that recognize specific patterns in nucleic acids, that once triggered, induce mutations or invoke intrinsic or innate immune responses. In this review we examine the types of compositional biases identified in viral genomes and current understanding of the evolutionary mechanisms underpinning these trends. Finally, we consider the potential for large scale synonymous recoding strategies to engineer RNA virus vaccines, including those with pandemic potential, such as influenza A virus and Severe Acute Respiratory Syndrome Coronavirus Virus 2. This article is categorized under: RNA in Disease and Development > RNA in Disease RNA Evolution and Genomics > Computational Analyses of RNA RNA Interactions with Proteins and Other Molecules > Protein-RNA Recognition.
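The comparison this abstract draws between an observed dinucleotide frequency and its expectation from base composition is commonly expressed as an odds ratio, the observed frequency of the pair divided by the product of the two single-base frequencies. The sketch below is one plain way to compute it for a single strand; the function name `dinucleotide_odds_ratio` is hypothetical, and the authors' own calculations may differ (for example, by using symmetrized strand counts).

```python
import math
from collections import Counter


def dinucleotide_odds_ratio(seq, dinuc="CG"):
    """Observed/expected ratio f_xy / (f_x * f_y) for one dinucleotide.

    Values well below 1 indicate depletion (as for CpG in many RNA viruses),
    values well above 1 indicate over-representation.
    """
    seq = seq.upper().replace("U", "T")
    dinuc = dinuc.upper().replace("U", "T")
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two bases")

    mono = Counter(seq)
    # Overlapping dinucleotides: positions (0,1), (1,2), ..., (n-2, n-1).
    pairs = Counter(seq[i:i + 2] for i in range(n - 1))

    f_xy = pairs[dinuc] / (n - 1)
    f_x = mono[dinuc[0]] / n
    f_y = mono[dinuc[1]] / n
    if f_x == 0 or f_y == 0:
        return math.nan  # expectation is zero, ratio undefined
    return f_xy / (f_x * f_y)


# With equal base frequencies the expected frequency of any pair is
# 0.25 * 0.25 = 0.0625, so the ratio is the observed frequency / 0.0625.
print(dinucleotide_odds_ratio("ACGTACGT", "CG"))
```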
Affiliation(s)
- Eleanor R. Gaunt
- Department of Infection and Immunity, The Roslin Institute, The University of Edinburgh, Edinburgh, UK
| | - Paul Digard
- Department of Infection and Immunity, The Roslin Institute, The University of Edinburgh, Edinburgh, UK
| |
|
23
|
Khandia R, Ali Khan A, Alexiou A, Povetkin SN, Nikolaevna VM. Codon Usage Analysis of Pro-Apoptotic Bim Gene Isoforms. J Alzheimers Dis 2022; 86:1711-1725. [DOI: 10.3233/jad-215691] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 12/23/2022]
Abstract
Background: Bim is one of the Bcl-2 homology 3 (BH3)-only proteins, a group of pro-apoptotic proteins involved in physiological and pathological conditions. Both the overexpression and under-expression of Bim protein are associated with the diseased condition, and various isoforms of Bim protein are present with differential apoptotic potential. Objective: The present study attempted to envisage the association of various molecular signatures with the codon choices of Bim isoforms. Methods: Molecular signatures like composition, codon usage, nucleotide skews, the free energy of mRNA transcript, physical properties of proteins, codon adaptation index, relative synonymous codon usage, and dinucleotide odds ratio were determined and analyzed for their associations with codon choices of Bim gene. Results: Skew analysis of the Bim gene indicated the preference of C nucleotide over G, A, and T and preference of G over T and A nucleotides was observed. An increase in C content at the first and third codon position increased gene expression while it decreased at the second codon position. Compositional constraints on nucleotide C at all three codon positions affected gene expression. The analysis revealed an exceptionally high usage of CpC dinucleotide in all the envisaged 31 isoforms of Bim. We correlated it with the requirement of rapid demethylation machinery to fine-tune the Bim gene expression. Also, mutational pressure played a dominant role in shaping codon usage bias in Bim isoforms. Conclusion: An exceptionally high usage of CpC dinucleotide in all the envisaged 31 isoforms of Bim indicates a high order selectional force to fine tune Bim gene expression.
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal, India
| | - Azmat Ali Khan
- Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
| | - Athanasios Alexiou
- Novel Global Community Educational Foundation, Australia & AFNP Med, Austria
| | | | | |
|
24
|
Robins WP, Mekalanos JJ. Covariance predicts conserved protein residue interactions important to the emergence and continued evolution of SARS-CoV-2 as a human pathogen. bioRxiv 2022:2022.01.13.476204. [PMID: 35169805 PMCID: PMC8845505 DOI: 10.1101/2022.01.13.476204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
Abstract
SARS-CoV-2 is one of three recognized coronaviruses (CoVs) that have caused epidemics or pandemics in the 21st century and that likely emerged from animal reservoirs. Differences in nucleotide and protein sequence composition within related β-coronaviruses are often used to better understand CoV evolution, host adaptation, and their emergence as human pathogens. Here we report the comprehensive analysis of amino acid residue changes that have occurred in lineage B β-coronaviruses that show covariance with each other. This analysis revealed patterns of covariance within conserved viral proteins that potentially define conserved interactions within and between core proteins encoded by SARS-CoV-2 related β-coronaviruses. We identified not only individual pairs but also networks of amino acid residues that exhibited statistically high frequencies of covariance with each other using an independent pair model followed by a tandem model approach. Using 149 different CoV genomes that vary in their relatedness, we identified networks of unique combinations of alleles that can be incrementally traced genome by genome within different phylogenic lineages. Remarkably, the covariant residues and respective regions that are most abundantly represented and implicated in the emergence of SARS-CoV-2 are also enriched in dominant SARS-CoV-2 variants.
Affiliation(s)
- William P Robins
- Department of Microbiology, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115
| | - John J Mekalanos
- Department of Microbiology, Harvard Medical School, 77 Avenue Louis Pasteur, Boston, MA 02115
| |
|
25
|
Kumar A, Goyal N, Saranathan N, Dhamija S, Saraswat S, Menon MB, Vivekanandan P. The slowing rate of CpG depletion in SARS-CoV-2 genomes is consistent with adaptations to the human host. Mol Biol Evol 2022; 39:6521032. [PMID: 35134218 PMCID: PMC8892944 DOI: 10.1093/molbev/msac029] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Indexed: 11/14/2022] Open
Abstract
Depletion of CpG dinucleotides in SARS-CoV-2 genomes has been linked to virus evolution, host-switching, virus replication, and innate immune responses. Temporal variations, if any, in the rate of CpG depletion during virus evolution in the host remain poorly understood. Here, we analysed the CpG content of over 1.4 million full-length SARS-CoV-2 genomes representing over 170 million documented infections during the first 17 months of the pandemic. Our findings suggest that the extent of CpG depletion in SARS-CoV-2 genomes is modest. Interestingly, the rate of CpG depletion is highest during early evolution in humans and it gradually tapers off almost reaching an equilibrium; this is consistent with adaptations to the human host. Furthermore, within the coding regions, CpG depletion occurs predominantly at codon positions 2-3 and 3-1. Loss of ZAP-binding motifs in SARS-CoV-2 genomes is primarily driven by the loss of the terminal CpG in the motifs. Nonetheless, majority of the CpG depletion in SARS-CoV-2 genomes occurs outside ZAP-binding motifs. SARS-CoV-2 genomes selectively lose CpGs-motifs from a U-rich context; this may help avoid immune recognition by TLR7. SARS-CoV-2 alpha-, beta- and delta-variants of concern have reduced CpG content compared to sequences from the beginning of the pandemic. In sum, we provide evidence that the rate of CpG depletion in virus genomes is not uniform and it greatly varies over time and during adaptations to the host. This work highlights how temporal variations in selection pressures during virus adaption may impact the rate and the extent of CpG depletion in virus genomes.
Affiliation(s)
- Akhil Kumar
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Nishank Goyal
- Department of Chemical Engineering, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Nandhini Saranathan
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Sonam Dhamija
- CSIR-Institute of Genomics and Integrative Biology, New Delhi-110025, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad-201002, India
| | - Saurabh Saraswat
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Manoj B Menon
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| | - Perumal Vivekanandan
- Kusuma School of Biological Sciences, Indian Institute of Technology Delhi, New Delhi-110016, India
| |
|
26
|
Begum NS, Chakraborty S. Influencing elements of codon usage bias in Birnaviridae and its evolutionary analysis. Virus Res 2022; 310:198672. [PMID: 34986367 DOI: 10.1016/j.virusres.2021.198672] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 10/09/2021] [Revised: 11/25/2021] [Accepted: 12/30/2021] [Indexed: 11/25/2022]
Abstract
Birnaviridae is a family of double stranded (ds) RNA virus with non-enveloped virions and 2-segmented genomes. These viruses are known to cause diseases in many hosts. Virus of this family has affected the fish and poultry economy in a wide sector. Unevenness in the use of synonymous codons for a particular amino acid in the coding strand of DNA is known as codon usage bias (CUB). Codons that code the same amino acid are used with variable frequency in a variety of life forms. To understand the pattern of CUB in Birnaviridae, we carried out bioinformatics study to understand the properties of coding sequences of proteins. ENC value of Birnaviridae suggested low CUB. Nucleotide analysis revealed high GC content. Parameters such as RSCU values, nucleotide skewness, translational selection, parity plot and neutrality plot were studied to investigate the pattern of codon use and it was clear that both mutational pressure and natural selection contributed to the designing of CUB in Birnaviridae family. The neutrality plot revealed natural selection to dominate the structuring of CUB and hence remained the major CUB determinant in Birnaviridae. Outcome of our study exemplified the pattern of codon use in the Birnaviridae genomes and contributed the basic primary data for fundamental evolutionary research on them.
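The neutrality plot mentioned in this abstract is usually built by regressing each gene's mean GC content at codon positions 1 and 2 (GC12) on its GC content at position 3 (GC3). A slope near 1 is conventionally read as mutation pressure dominating, and a slope near 0 as selection dominating. The sketch below illustrates that convention with an ordinary least-squares fit; the function names are hypothetical and this is not the authors' pipeline.

```python
def gc_by_position(cds):
    """Return (GC12, GC3) for one coding sequence."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3
    if usable == 0:
        raise ValueError("coding sequence must contain at least one codon")
    # Bases at codon positions 1, 2 and 3 respectively.
    positions = [cds[i:usable:3] for i in range(3)]
    gc = [sum(b in "GC" for b in p) / len(p) for p in positions]
    return (gc[0] + gc[1]) / 2, gc[2]


def neutrality_slope(cds_list):
    """Least-squares fit GC12 = slope * GC3 + intercept across genes."""
    points = [gc_by_position(c) for c in cds_list]
    ys = [p[0] for p in points]  # GC12 on the y axis
    xs = [p[1] for p in points]  # GC3 on the x axis
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("GC3 is constant across genes; slope undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Toy input only; real analyses use every coding sequence in the genome set.
slope, intercept = neutrality_slope(["GCAATG", "GCA", "ATG"])
print(slope, intercept)
```

In practice the fit is reported together with its correlation coefficient and significance, which this sketch omits.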
Affiliation(s)
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
|
27
|
Host adaptation of codon usage in SARS-CoV-2 from mammals indicates potential natural selection and viral fitness. Arch Virol 2022; 167:2677-2688. [PMID: 36166106 PMCID: PMC9514192 DOI: 10.1007/s00705-022-05612-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2021] [Accepted: 08/19/2022] [Indexed: 12/14/2022]
Abstract
SARS-CoV-2 infection, which is the cause of the COVID-19 pandemic, has expanded across various animal hosts, and the virus can be transmitted particularly efficiently in minks. It is still not clear how SARS-CoV-2 is selected and evolves in its hosts, or how mutations affect viral fitness. In this report, sequences of SARS-CoV-2 isolated from human and animal hosts were analyzed, and the binding energy and capacity of the spike protein to bind human ACE2 and the mink receptor were compared. Codon adaptation index (CAI) analysis indicated the optimization of viral codons in some animals such as bats and minks, and a neutrality plot demonstrated that natural selection had a greater influence on some SARS-CoV-2 sequences than mutational pressure. Molecular dynamics simulation results showed that the mutations Y453F and N501T in mink SARS-CoV-2 could enhance the binding of the viral spike to the mink receptor, indicating the involvement of these mutations in natural selection and viral fitness. Receptor binding analysis revealed that the mink SARS-CoV-2 spike interacted more strongly with the mink receptor than the human receptor. Tracking the variations and codon bias of SARS-CoV-2 is helpful for understanding the fitness of the virus in virus transmission, pathogenesis, and immune evasion.
|
28
|
Uğurel OM, Ata O, Turgut-Balik D. Genomic chronicle of SARS-CoV-2: a mutational analysis with over 1 million genome sequences. Turk J Biol 2021; 45:425-435. [PMID: 34803444 PMCID: PMC8573839 DOI: 10.3906/biy-2106-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/03/2021] [Accepted: 07/30/2021] [Indexed: 11/26/2022] Open
Abstract
Use of information technologies to analyse big data on SARS-CoV-2 genome provides an insight for tracking variations and examining the evolution of the virus. Nevertheless, storing, processing, alignment and analyses of these numerous genomes are still a challenge. In this study, over 1 million SARS-CoV-2 genomes have been analysed to show distribution and relationship of variations that could enlighten development and evolution of the virus. In all genomes analysed in this study, a total of over 215M SNVs have been detected and average number of SNV per isolate was found to be 21.83. Single nucleotide variant (SNV) average is observed to reach 31.25 just in March 2021. The average variation number of isolates is increasing and compromising with total case numbers around the world. Remarkably, cytosine deamination, which is one of the most important biochemical processes in the evolutionary development of coronaviruses, accounts for 46% of all SNVs seen in SARS-CoV-2 genomes within 16 months. This study is one of the most comprehensive SARS-CoV-2 genomic analysis study in terms of number of genomes analysed in an academic publication so far, and reported results could be useful in monitoring the development of SARS-CoV-2.
Affiliation(s)
- Osman Mutluhan Uğurel
- Department of Bioengineering, Faculty of Chemical and Metallurgical Engineering, Yıldız Technical University, İstanbul, Turkey
- Department of Basic Sciences, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey
| | - Oğuz Ata
- Department of Software Engineering, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey
| | - Dilek Turgut-Balik
- Department of Bioengineering, Faculty of Chemical and Metallurgical Engineering, Yıldız Technical University, İstanbul, Turkey
| |
|
29
|
Yi K, Kim SY, Bleazard T, Kim T, Youk J, Ju YS. Mutational spectrum of SARS-CoV-2 during the global pandemic. Exp Mol Med 2021; 53:1229-1237. [PMID: 34453107 PMCID: PMC8393781 DOI: 10.1038/s12276-021-00658-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 11/30/2020] [Revised: 04/29/2021] [Accepted: 05/11/2021] [Indexed: 02/07/2023] Open
Abstract
Viruses accumulate mutations under the influence of natural selection and host-virus interactions. Through a systematic comparison of 351,525 full viral genome sequences collected during the recent COVID-19 pandemic, we reveal the spectrum of SARS-CoV-2 mutations. Unlike those of other viruses, the mutational spectrum of SARS-CoV-2 exhibits extreme asymmetry, with a much higher rate of C>U than U>C substitutions, as well as a higher rate of G>U than U>G substitutions. This suggests directional genome sequence evolution during transmission. The substantial asymmetry and directionality of the mutational spectrum enable pseudotemporal tracing of SARS-CoV-2 without prior information about the root sequence, collection time, and sampling region. This shows that the viral genome sequences collected in Asia are similar to the original genome sequence. Adjusted estimation of the dN/dS ratio accounting for the asymmetrical mutational spectrum also shows evidence of negative selection on viral genes, consistent with previous reports. Our findings provide deep insights into the mutational processes in SARS-CoV-2 viral infection and advance the understanding of the history and future evolution of the virus.
Affiliation(s)
- Kijong Yi
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
| | - Su Yeon Kim
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
| | - Thomas Bleazard
- National Institute for Biological Standards and Control, Blanche Lane, South Mimms, Potters Bar, Hertfordshire, EN6 3QG UK
| | - Taewoo Kim
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
| | - Jeonghwan Youk
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea; GENOME INSIGHT Inc, Daejeon, 34051 Korea
| | - Young Seok Ju
- Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea; GENOME INSIGHT Inc, Daejeon, 34051 Korea
| |
|
30
|
Khandia R, Alqahtani T, Alqahtani AM. Genes Common in Primary Immunodeficiencies and Cancer Display Overrepresentation of Codon CTG and Dominant Role of Selection Pressure in Shaping Codon Usage. Biomedicines 2021; 9:1001. [PMID: 34440205 PMCID: PMC8391990 DOI: 10.3390/biomedicines9081001] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 07/17/2021] [Revised: 08/06/2021] [Accepted: 08/09/2021] [Indexed: 11/20/2022] Open
Abstract
Primary immunodeficiencies (PIDs) are disorders of the immune system that involve faulty cellular, humoral, or both cellular and humoral functions. PIDs are present at the crossroad between infections, immune dysregulation, and cancers. A panel encompassing 42 genes involved in both PIDs and cancer has been investigated for the genes’ compositional properties, codon usage patterns, various forces affecting codon choice, protein properties, and gene expression profiles. In the present study, the codon choice of genes was found to be dependent upon the richness of the nucleotide; the viz AT nucleotide rich genome preferred AT ending codons. The dinucleotide TpA adversely affected protein expression, while CpG did not. The CTG codon was the most overrepresented codon in 80.95% of genes. Analysis of various protein properties, including GRAVY, AROMA, isoelectric point, aliphatic index, hydrophobicity, instability index, and numbers of acidic, basic, and neutral amino acid residues revealed that the hydrophobicity index, instability index, and numbers of acidic and basic amino acid residues are the factors affecting gene expression. Based on neutrality analysis, parity analysis, ENc-GC3 analysis, and regression analysis of nucleotides present at the first and third positions of the codon, it was determined that selection pressure, mutation pressure, and compositional constraints all participated in shaping codon usage. The study will help determine the various evolutionary forces acting on genes common to both PIDs and cancer. Codon usage analysis might be helpful in the future to augment both diseases simultaneously. The research also indicates a peculiar pattern adapted by a set of genes involved in any disease.
Affiliation(s)
- Rekha Khandia
- Department of Biochemistry and Genetics, Barkatullah University, Bhopal 462026, India
| | - Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia
| | - Ali M. Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia
| |
|
31
|
Ata G, Wang H, Bai H, Yao X, Tao S. Edging on Mutational Bias, Induced Natural Selection From Host and Natural Reservoirs Predominates Codon Usage Evolution in Hantaan Virus. Front Microbiol 2021; 12:699788. [PMID: 34276633 PMCID: PMC8283416 DOI: 10.3389/fmicb.2021.699788] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 04/24/2021] [Accepted: 06/07/2021] [Indexed: 12/14/2022] Open
Abstract
The molecular evolutionary dynamics that shape hantaviruses’ evolution are poorly understood even now, besides the contribution of virus-host interaction to their evolution remains an open question. Our study aimed to investigate these two aspects in Hantaan virus (HTNV)—the prototype of hantaviruses and an emerging zoonotic pathogen that infects humans, causing hemorrhagic fever with renal syndrome (HFRS): endemic in Far East Russia, China, and South Korea—via a comprehensive, phylogenetic-dependent codon usage analysis. We found that host- and natural reservoir-induced natural selection is the primary determinant of its biased codon choices, exceeding the mutational bias effect. The phylogenetic analysis of HTNV strains resulted in three distinct clades: South Korean, Russian, and Chinese. An effective number of codon (ENC) analysis showed a slightly biased codon usage in HTNV genomes. Nucleotide composition and RSCU analyses revealed a significant bias toward A/U nucleotides and A/U-ended codons, indicating the potential influence of mutational bias on the codon usage patterns of HTNV. Via ENC-plot, Parity Rule 2 (PR2), and neutrality plot analyses, we would conclude the presence of both mutation pressure and natural selection effect in shaping the codon usage patterns of HTNV; however, natural selection is the dominant factor influencing its codon usage bias. Codon adaptation index (CAI), Relative codon deoptimization index (RCDI), and Similarity Index (SiD) analyses uncovered the intense selection pressure from the host (Human) and natural reservoirs (Striped field mouse and Chinese white-bellied rat) in shaping HTNV biased codon choices. Our study clearly revealed the evolutionary processes in HTNV and the role of virus-host interaction in its evolution. Moreover, it opens the door for a more comprehensive codon usage analysis for all hantaviruses species to determine their molecular evolutionary dynamics and adaptability to several hosts and environments. 
We believe that our research will help in a better and deep understanding of HTNV evolution that will serve its future basic research and aid live attenuated vaccines design.
Affiliation(s)
- Galal Ata
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Xianyang, China
| | - Hao Wang
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Xianyang, China
| | - Haoxiang Bai
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Xianyang, China
| | - Xiaoting Yao
- College of Veterinary Medicine, Northwest A&F University, Xianyang, China
| | - Shiheng Tao
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Xianyang, China
| |
|
32
|
Das JK, Roy S. Comparative analysis of human coronaviruses focusing on nucleotide variability and synonymous codon usage patterns. Genomics 2021; 113:2177-2188. [PMID: 34019999 PMCID: PMC8131179 DOI: 10.1016/j.ygeno.2021.05.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 07/19/2020] [Revised: 05/09/2021] [Accepted: 05/14/2021] [Indexed: 01/04/2023]
Abstract
The prevailing COVID-19 pandemic has drawn the attention of the scientific community to study the evolutionary origin of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2). This study is a comprehensive quantitative analysis of the protein-coding sequences of seven human coronaviruses (HCoVs) to decipher the nucleotide sequence variability and codon usage patterns. It is essential to understand the survival ability of the viruses, their adaptation to hosts, and their evolution. The current analysis revealed a high abundance of the relative dinucleotide (odds ratio), GC and CT pairs in the first and last two codon positions, respectively, as well as a low abundance of the CG pair in the last two positions of the codon, which might be related to the evolution of the viruses. A remarkable level of variability of GC content in the third position of the codon among the seven coronaviruses was observed. Codons with high RSCU values are primarily from the aliphatic and hydroxyl amino acid groups, and codons with low RSCU values belong to the aliphatic, cyclic, positively charged, and sulfur-containing amino acid groups. In order to elucidate the evolutionary processes of the seven coronaviruses, a phylogenetic tree (dendrogram) was constructed based on the RSCU scores of the codons. The severe and mild categories CoVs were positioned in different clades. A comparative phylogenetic study with other coronaviruses depicted that SARS-CoV-2 is close to the CoV isolated from pangolins (Manis javanica, Pangolin-CoV) and cats (Felis catus, SARS(r)-CoV). Further analysis of the effective number of codon (ENC) usage bias showed a relatively higher bias for SARS-CoV and MERS-CoV compared to SARS-CoV-2. The ENC plot against GC3 suggested that the mutational bias might have a role in determining the codon usage variation among candidate viruses. 
A codon adaptability study on a few human host parasites (from different kingdoms), including CoVs, showed a diverse adaptability pattern. SARS-CoV-2 and SARS-CoV exhibit relatively lower but similar codon adaptability compared to MERS-CoV.
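The relative synonymous codon usage (RSCU) scores mentioned in this abstract can be illustrated with a minimal sketch. It assumes the usual definition (observed codon count divided by the mean count over that amino acid's synonymous codons) and the standard genetic code; the function name `rscu` and the toy sequence are hypothetical and are not taken from the cited study.

```python
from collections import Counter

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Group codons into synonymous families, skipping stop codons.
FAMILIES = {}
for codon, aa in CODON_TABLE.items():
    if aa != "*":
        FAMILIES.setdefault(aa, []).append(codon)


def rscu(cds):
    """Return {codon: RSCU} for an in-frame coding sequence (hypothetical helper)."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    values = {}
    for family in FAMILIES.values():
        if len(family) < 2:
            continue  # Met and Trp have a single codon, so RSCU is uninformative
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent from this sequence
        for codon in family:
            # observed count relative to the count expected under uniform usage
            values[codon] = counts[codon] * len(family) / total
    return values


if __name__ == "__main__":
    # Toy example: four alanine codons, three of them GCT.
    print(rscu("GCTGCTGCTGCC"))
```

Values above 1 indicate a codon used more often than expected under uniform synonymous usage, and values below 1 indicate the opposite.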
Affiliation(s)
- Jayanta Kumar Das
  - Department of Pediatrics, Johns Hopkins University School of Medicine, MD, USA
- Swarup Roy
  - Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India

33
Pereira-Gómez M, Carrau L, Fajardo Á, Moreno P, Moratorio G. Altering Compositional Properties of Viral Genomes to Design Live-Attenuated Vaccines. Front Microbiol 2021; 12:676582. [PMID: 34276608 PMCID: PMC8278477 DOI: 10.3389/fmicb.2021.676582] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/05/2021] [Accepted: 06/01/2021] [Indexed: 12/11/2022] Open
Abstract
Live-attenuated vaccines have been historically used to successfully prevent numerous diseases caused by a broad variety of RNA viruses due to their ability to elicit strong and perdurable immune-protective responses. In recent years, various strategies have been explored to achieve viral attenuation by rational genetic design rather than using classic and empirical approaches, based on successive passages in cell culture. A deeper understanding of evolutionary implications of distinct viral genomic compositional aspects, as well as substantial advances in synthetic biology technologies, have provided a framework to achieve new viral attenuation strategies. Herein, we will discuss different approaches that are currently applied to modify compositional features of viruses in order to develop novel live-attenuated vaccines.
Affiliation(s)
- Marianoel Pereira-Gómez
  - Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
  - Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Lucía Carrau
  - Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- Álvaro Fajardo
  - Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
  - Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Pilar Moreno
  - Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
  - Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
- Gonzalo Moratorio
  - Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
  - Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay

34
Mordstein C, Cano L, Morales AC, Young B, Ho AT, Rice AM, Liss M, Hurst LD, Kudla G. Transcription, mRNA export and immune evasion shape the codon usage of viruses. Genome Biol Evol 2021; 13:6275682. [PMID: 33988683 PMCID: PMC8410142 DOI: 10.1093/gbe/evab106] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Accepted: 05/10/2021] [Indexed: 12/15/2022] Open
Abstract
The nucleotide composition, dinucleotide composition, and codon usage of many viruses differs from their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analysed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host - instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.
Affiliation(s)
- Christine Mordstein
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Laura Cano
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
- Atahualpa Castillo Morales
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Bethan Young
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alexander T Ho
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alan M Rice
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Michael Liss
  - Thermo Fisher Scientific, GENEART GmbH, Regensburg, Germany
- Laurence D Hurst
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Grzegorz Kudla
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK

35
Malik YS, Ansari MI, Kattoor JJ, Kaushik R, Sircar S, Subbaiyan A, Tiwari R, Dhama K, Ghosh S, Tomar S, Zhang KYJ. Evolutionary and codon usage preference insights into spike glycoprotein of SARS-CoV-2. Brief Bioinform 2021; 22:1006-1022. [PMID: 33377145 PMCID: PMC7953982 DOI: 10.1093/bib/bbaa383] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 09/28/2020] [Revised: 10/31/2020] [Accepted: 11/26/2020] [Indexed: 12/13/2022] Open
Abstract
Interaction of SARS-CoV-2 spike glycoprotein with the ACE2 cell receptor is very crucial for virus attachment to human cells. Selected mutations in SARS-CoV-2 S-protein are reported to strengthen its binding affinity to mammalian ACE2. The N501T mutation in SARS-CoV-2-CTD furnishes better support to hotspot 353 in comparison with SARS-CoV and shows higher affinity for receptor binding. Recombination analysis exhibited higher recombination events in SARS-CoV-2 strains, irrespective of their geographical origin or hosts. Investigation further supports a common origin among SARS-CoV-2 and its predecessors, SARS-CoV and bat-SARS-like-CoV. The recombination events suggest a constant exchange of genetic material among the co-infecting viruses in possible reservoirs and human hosts before SARS-CoV-2 emerged. Furthermore, a comprehensive analysis of codon usage bias (CUB) in SARS-CoV-2 revealed significant CUB among the S-genes of different beta-coronaviruses governed majorly by natural selection and mutation pressure. Various indices of codon usage of S-genes helped in quantifying its adaptability in other animal hosts. These findings might help in identifying potential experimental animal models for investigating pathogenicity for drugs and vaccine development experiments.
Affiliation(s)
- Rahul Kaushik
  - Laboratory for Structural Bioinformatics, RIKEN Center for Biosystems Dynamics Research, Japan
- Ruchi Tiwari
  - Department of Veterinary Microbiology, DUVASU, Mathura, India
- Kuldeep Dhama
  - Division of Pathology, ICAR-Indian Veterinary Research Institute, Izatnagar, Uttar Pradesh, India
- Souvik Ghosh
  - Health Center for Zoonoses and Tropical Veterinary Medicine, Basseterre, St. Kitts, West Indies
- Kam Y J Zhang
  - Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Japan

36
Franzo G, Tucciarone CM, Legnardi M, Cecchinato M. Effect of genome composition and codon bias on infectious bronchitis virus evolution and adaptation to target tissues. BMC Genomics 2021; 22:244. [PMID: 33827429 PMCID: PMC8025453 DOI: 10.1186/s12864-021-07559-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 08/05/2020] [Accepted: 03/26/2021] [Indexed: 11/10/2022] Open
Abstract
Background: Infectious bronchitis virus (IBV) is one of the most relevant viruses affecting the poultry industry, and several studies have investigated the factors involved in its biological cycle and evolution. However, very few of those studies focused on the effect of genome composition and the codon bias of different IBV proteins, despite the remarkable increase in available complete genomes. In the present study, all IBV complete genomes were downloaded (n = 383), and several statistics representative of genome composition and codon bias were calculated for each protein-coding sequence, including but not limited to, the nucleotide odds ratio, relative synonymous codon usage and effective number of codons. Additionally, viral codon usage was compared to host codon usage based on a collection of highly expressed genes in IBV target and nontarget tissues.
Results: The results obtained demonstrated a significant difference among structural, non-structural and accessory proteins, especially regarding dinucleotide composition, which appears under strong selective forces. In particular, some dinucleotide pairs, such as CpG, a probable target of the host innate immune response, are underrepresented in genes coding for pp1a, pp1ab, S and N. Although genome composition and dinucleotide bias appear to affect codon usage, additional selective forces may act directly on codon bias. Variability in relative synonymous codon usage and effective number of codons was found for different proteins, with structural proteins and polyproteins being more adapted to the codon bias of host target tissues. In contrast, accessory proteins had a more biased codon usage (i.e., lower number of preferred codons), which might contribute to the regulation of their expression level and timing throughout the cell cycle.
Conclusions: The present study confirms the existence of selective forces acting directly on the genome and not only indirectly through phenotype selection. This evidence might help understanding IBV biology and in developing attenuated strains without affecting the protein phenotype and therefore immunogenicity.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-021-07559-5.
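The nucleotide (dinucleotide) odds ratio named in this abstract is commonly computed as the observed dinucleotide frequency divided by the product of its two mononucleotide frequencies. The sketch below assumes that definition and overlapping dinucleotide windows; the function name `dinucleotide_odds_ratios` and the toy input are hypothetical, and the cited study may have used a different implementation.

```python
from collections import Counter
from itertools import product


def dinucleotide_odds_ratios(sequence):
    """Return {dinucleotide: f_xy / (f_x * f_y)} for a nucleotide sequence."""
    seq = sequence.upper().replace("U", "T")
    n = len(seq)
    if n < 2:
        raise ValueError("sequence must contain at least two nucleotides")
    mono = Counter(seq)
    # Overlapping windows: positions (0,1), (1,2), ..., (n-2, n-1).
    di = Counter(seq[i:i + 2] for i in range(n - 1))
    ratios = {}
    for x, y in product("ACGT", repeat=2):
        fx, fy = mono[x] / n, mono[y] / n
        if fx == 0 or fy == 0:
            continue  # ratio is undefined when either base is absent
        fxy = di[x + y] / (n - 1)
        ratios[x + y] = fxy / (fx * fy)
    return ratios


if __name__ == "__main__":
    # Toy example; a ratio well below 1 would suggest underrepresentation,
    # as reported for CpG in the abstract above.
    for pair, rho in sorted(dinucleotide_odds_ratios("ATGCGCGATATTA").items()):
        print(pair, round(rho, 3))
```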
Affiliation(s)
- Giovanni Franzo
  - Microbiology and Infectious Diseases, Department of Animal Medicine, Production and Health (MAPS), University of Padua, Viale dell'Università 16 - 35020 Legnaro, Padua, Italy
- Claudia Maria Tucciarone
  - Microbiology and Infectious Diseases, Department of Animal Medicine, Production and Health (MAPS), University of Padua, Viale dell'Università 16 - 35020 Legnaro, Padua, Italy
- Matteo Legnardi
  - Microbiology and Infectious Diseases, Department of Animal Medicine, Production and Health (MAPS), University of Padua, Viale dell'Università 16 - 35020 Legnaro, Padua, Italy
- Mattia Cecchinato
  - Microbiology and Infectious Diseases, Department of Animal Medicine, Production and Health (MAPS), University of Padua, Viale dell'Università 16 - 35020 Legnaro, Padua, Italy

37
Hernandez-Alias X, Benisty H, Schaefer MH, Serrano L. Translational adaptation of human viruses to the tissues they infect. Cell Rep 2021; 34:108872. [PMID: 33730572 PMCID: PMC7962955 DOI: 10.1016/j.celrep.2021.108872] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 10/01/2020] [Revised: 12/15/2020] [Accepted: 02/23/2021] [Indexed: 12/22/2022] Open
Abstract
Viruses need to hijack the translational machinery of the host cell for a productive infection to happen. However, given the dynamic landscape of tRNA pools among tissues, it is unclear whether different viruses infecting different tissues have adapted their codon usage toward their tropism. Here, we collect the coding sequences of 502 human-infecting viruses and determine that tropism explains changes in codon usage. Using the tRNA abundances across 23 human tissues from The Cancer Genome Atlas (TCGA), we build an in silico model of translational efficiency that validates the correspondence of the viral codon usage with the translational machinery of their tropism. For instance, we detect that severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is specifically adapted to the upper respiratory tract and alveoli. Furthermore, this correspondence is specifically defined in early viral proteins. The observed tissue-specific translational efficiency could be useful for the development of antiviral therapies and vaccines.
Affiliation(s)
- Xavier Hernandez-Alias
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Hannah Benisty
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Martin H Schaefer
  - IEO European Institute of Oncology IRCCS, Department of Experimental Oncology, Via Adamello 16, Milan 20139, Italy
- Luis Serrano
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
  - Universitat Pompeu Fabra (UPF), Barcelona 08002, Spain
  - ICREA, Pg. Lluís Companys 23, Barcelona 08010, Spain

38
Saha J, Bhattacharjee S, Pal Sarkar M, Saha BK, Basak HK, Adhikary S, Roy V, Mandal P, Chatterjee A, Pal A. A comparative genomics-based study of positive strand RNA viruses emphasizing on SARS-CoV-2 utilizing dinucleotide signature, codon usage and codon context analyses. Gene Rep 2021; 23:101055. [PMID: 33615042 PMCID: PMC7887452 DOI: 10.1016/j.genrep.2021.101055] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 12/09/2020] [Revised: 01/20/2021] [Accepted: 02/09/2021] [Indexed: 12/12/2022]
Abstract
The novel corona virus disease or COVID-19 caused by a positive strand RNA virus (PRV) called SARS-CoV-2 is plaguing the entire planet as we conduct this study. In this study a multifaceted analysis was carried out employing dinucleotide signature, codon usage and codon context to compare and unravel the genomic as well as genic characteristics of the SARS-CoV-2 isolates and how they compare to other PRVs which represents some of the most pathogenic human viruses. The main emphasis of this study was to comprehend the codon biology of the SARS-CoV-2 in the backdrop of the other PRVs like Poliovirus, Japanese encephalitis virus, Hepatitis C virus, Norovirus, Rubella virus, Semliki Forest virus, Zika virus, Dengue virus, Human rhinoviruses and the Betacoronaviruses since codon usage pattern along with the nucleotide composition prevalent within the viral genome helps to understand the biology and evolution of viruses. Our results suggest discrete genomic dinucleotide signature within the PRVs. Some of the genes from the different SARS-CoV-2 isolates were also found to demonstrate heterogeneity in terms of their dinucleotide signature. The SARS-CoV-2 isolates also demonstrated a codon context trend characteristically dissimilar to the other PRVs. The findings of this study are expected to contribute to the developing global knowledge base in countering COVID-19.
Key Words
- CAI, Codon Adaptation Index
- CNS, Central Nervous System
- COVID-19
- CRS, Congenital Rubella Syndrome
- CUB, Codon Usage Bias
- Codon context
- Codon usage bias
- Coronaviruses
- Fop, Frequency of optimal codons
- GC1, Guanine and Cytosine content on the first position of the codon
- GC2, Guanine and Cytosine content on the second position of the codon
- GC3, Guanine and Cytosine content on the third position of the codon
- HCV, Hepatitis C Virus
- MERS, Middle East Respiratory Syndrome
- MFE, Minimum Free Energy
- Nc, Effective Number of Codons
- PCA, Principal Component Analysis
- PRV, Positive strand RNA Virus
- Positive strand RNA virus
- RCDI, Relative Codon De-Optimization Index
- RSCU, Relative Synonymous Codon Usage
- SARS, Severe Acute Respiratory Syndrome
- SARS-CoV-2
- SARS-CoV-2, Severe Acute Respiratory Syndrome Coronavirus 2
- SCUO, Synonymous Codon Usage Order
- SiD, Similarity Index
Affiliation(s)
- Jayanti Saha
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Sukanya Bhattacharjee
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Monalisha Pal Sarkar
  - Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Barnan Kumar Saha
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Hriday Kumar Basak
  - Department of Chemistry, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Samarpita Adhikary
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Vivek Roy
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Parimal Mandal
  - Mycology & Plant Pathology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Abhik Chatterjee
  - Department of Chemistry, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India
- Ayon Pal
  - Microbiology & Computational Biology Laboratory, Department of Botany, Raiganj University, Raiganj PIN-733 134, Uttar Dinajpur, West Bengal, India

39
Abstract
RNA viruses are responsible for some of the worst pandemics known to mankind, including outbreaks of Influenza, Ebola, and COVID-19. One major challenge in tackling RNA viruses is the fact they are extremely genetically diverse. Nevertheless, they share common features that include their dependence on host cells for replication, and high mutation rates. We set out to search for shared evolutionary characteristics that may aid in gaining a broader understanding of RNA virus evolution, and constructed a phylogeny-based data set spanning thousands of sequences from diverse single-stranded RNA viruses of animals. Strikingly, we found that the vast majority of these viruses have a skewed nucleotide composition, manifested as adenine rich (A-rich) coding sequences. In order to test whether A-richness is driven by selection or by biased mutation processes, we harnessed the effects of incomplete purifying selection at the tips of virus phylogenies. Our results revealed consistent mutational biases toward U rather than A in genomes of all viruses. In +ssRNA viruses, we found that this bias is compensated by selection against U and selection for A, which leads to A-rich genomes. In -ssRNA viruses, the genomic mutational bias toward U on the negative strand manifests as A-rich coding sequences, on the positive strand. We investigated possible reasons for the advantage of A-rich sequences including weakened RNA secondary structures, codon usage bias, and selection for a particular amino acid composition, and conclude that host immune pressures may have led to similar biases in coding sequence composition across very divergent RNA viruses.
Affiliation(s)
- Talia Kustin
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv, Israel
- Adi Stern
  - The Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Tel Aviv, Israel
  - Edmond J. Safra Center for Bioinformatics, Tel Aviv University, Tel Aviv, Israel

40
Huang W, Guo Y, Li N, Feng Y, Xiao L. Codon usage analysis of zoonotic coronaviruses reveals lower adaptation to humans by SARS-CoV-2. Infect Genet Evol 2021; 89:104736. [PMID: 33516969 PMCID: PMC7843097 DOI: 10.1016/j.meegid.2021.104736] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 07/14/2020] [Revised: 12/28/2020] [Accepted: 01/22/2021] [Indexed: 12/25/2022]
Abstract
Since 2002, the world has witnessed major outbreaks of acute respiratory illness by three zoonotic coronaviruses (CoVs), which differ from each other in pathogenicity. Reasons for the lower pathogenicity of SARS-CoV-2 than the other two zoonotic coronaviruses, SARS-CoV and MERS-CoV, are not well understood. We herein compared the codon usage patterns of the three zoonotic CoVs causing severe acute respiratory syndromes and four human-specific CoVs (NL63, 229E, OC43, and HKU1) causing mild diseases. We found that the seven viruses have different codon usages, with SARS-CoV-2 having the lowest effective number of codons (ENC) among the zoonotic CoVs. Human codon adaptation index (CAI) analysis revealed that the CAI value of SARS-CoV-2 is the lowest among the zoonotic CoVs. The ENC and CAI values of SARS-CoV-2 were more similar to those of the less-pathogenic human-specific CoVs. To further investigate adaptive evolution within SARS-CoV-2, we examined codon usage patterns in 3573 genomes of SARS-CoV-2 collected over the initial 4 months of the pandemic. We showed that the ENC values and the CAI values of SARS-CoV-2 were decreasing over the period. The low ENC and CAI values could be responsible for the lower pathogenicity of SARS-CoV-2. While mutational pressure appears to shape codon adaptation in the overall genomes of SARS-CoV-2 and other zoonotic CoVs, the E gene of SARS-CoV-2, which has the highest codon usage bias, appears to be under strong natural selection. Data from the study contribute to our understanding of the pathogenicity and evolution of SARS-CoV-2 in humans.
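To make the codon adaptation index (CAI) in this abstract concrete, the following is a minimal sketch assuming the classic formulation: each codon gets a relative adaptiveness weight (its count in a reference set divided by the count of the most used synonymous codon), and CAI is the geometric mean of those weights over the query sequence. The helper names, the pseudocount of 0.5 for codons absent from the reference, and the toy sequences are all assumptions for illustration. A real analysis would derive the weights from a human codon usage table, which is not reproduced here.

```python
import math
from collections import Counter

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

# Synonymous families with at least two codons; stops, Met and Trp are excluded.
_by_aa = {}
for _codon, _aa in zip(CODONS, AMINO_ACIDS):
    if _aa != "*":
        _by_aa.setdefault(_aa, []).append(_codon)
FAMILIES = [family for family in _by_aa.values() if len(family) > 1]


def codon_counts(cds):
    """Count in-frame codons, ignoring a trailing partial codon."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return Counter(seq[i:i + 3] for i in range(0, usable, 3))


def relative_adaptiveness(reference_cds, pseudocount=0.5):
    """Weights w = count / max synonymous count in the reference (assumed pseudocount)."""
    counts = codon_counts(reference_cds)
    weights = {}
    for family in FAMILIES:
        family_counts = {c: counts[c] or pseudocount for c in family}
        top = max(family_counts.values())
        for codon in family:
            weights[codon] = family_counts[codon] / top
    return weights


def cai(cds, weights):
    """Geometric mean of the weights of all scorable codons in the query."""
    log_sum, scored = 0.0, 0
    for codon, k in codon_counts(cds).items():
        if codon in weights:
            log_sum += k * math.log(weights[codon])
            scored += k
    if scored == 0:
        raise ValueError("no scorable codons in sequence")
    return math.exp(log_sum / scored)


if __name__ == "__main__":
    w = relative_adaptiveness("GCTGCTGCTGCC")  # toy reference, not real host data
    print(cai("GCTGCC", w))
```

A CAI near 1 means the query uses mostly the reference set's preferred codons, and lower values mean weaker adaptation to that reference.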
Affiliation(s)
- Wanyi Huang
  - Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou 510642, China
- Yaqiong Guo
  - Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou 510642, China
- Na Li
  - Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou 510642, China
- Yaoyu Feng
  - Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou 510642, China
- Lihua Xiao
  - Center for Emerging and Zoonotic Diseases, College of Veterinary Medicine, South China Agricultural University, Guangzhou 510642, China
  - Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou 510642, China

41
Yin C. Latent periodicity-2 in coronavirus SARS-CoV-2 genome: Evolutionary implications. J Theor Biol 2021; 515:110604. [PMID: 33508323 PMCID: PMC7835100 DOI: 10.1016/j.jtbi.2021.110604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/28/2020] [Revised: 01/02/2021] [Accepted: 01/21/2021] [Indexed: 11/25/2022]
Abstract
The ongoing global pandemic of infection disease COVID-19 caused by the 2019 novel coronavirus (SARS-COV-2, formerly 2019-nCoV) presents critical threats to public health and the economy. The genome of SARS-CoV-2 had been sequenced and structurally annotated, yet little is known of the intrinsic organization and evolution of the genome. To this end, we present a mathematical method for the genomic spectrum, a kind of barcode, of SARS-CoV-2 and common human coronaviruses. The genomic spectrum is constructed according to the periodic distributions of nucleotides and therefore reflects the unique characteristics of the genome. The results demonstrate that coronavirus SARS-CoV-2 exhibits predominant latent periodicity-2 regions of non-structural proteins 3, 4, 5, and 6. Further analysis of the latent periodicity-2 regions suggests that the dinucleotide imbalances are increased during evolution and may confer the evolutionary fitness of the virus. Especially, SARS-CoV-2 isolates have increased latent periodicity-2 and periodicity-3 during COVID-19 pandemic. The special strong periodicity-2 regions and the intensity of periodicity-2 in the SARS-CoV-2 whole genome may become diagnostic and pharmaceutical targets in monitoring and curing the COVID-19 disease.
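As a rough illustration of what a periodicity-2 signal in a nucleotide sequence means, the sketch below measures Fourier power at a given period using one binary indicator sequence per nucleotide. This is a generic signal-processing stand-in and is not the distribution-based genomic spectrum method described in the abstract; the function name `periodicity_power` and the toy sequence are hypothetical.

```python
import cmath


def periodicity_power(sequence, period):
    """Summed Fourier power of the four nucleotide indicator sequences at `period`."""
    seq = sequence.upper().replace("U", "T")
    total = 0.0
    for base in "ACGT":
        # Fourier component of the indicator sequence (1 where seq[n] == base).
        component = sum(
            cmath.exp(-2j * cmath.pi * n / period)
            for n, symbol in enumerate(seq)
            if symbol == base
        )
        total += abs(component) ** 2
    return total


if __name__ == "__main__":
    toy = "ACACACAC"  # perfectly alternating, so period 2 should dominate
    print(periodicity_power(toy, 2), periodicity_power(toy, 3))
```

In practice the power would be normalized by sequence length and computed in sliding windows to locate regions where a given period is strongest.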
Affiliation(s)
- Changchuan Yin
  - Department of Mathematics, Statistics, and Computer Science, The University of Illinois at Chicago, Chicago, IL 60607-7045, USA

42
Ou Z, Ouzounis C, Wang D, Sun W, Li J, Chen W, Marlière P, Danchin A. A Path toward SARS-CoV-2 Attenuation: Metabolic Pressure on CTP Synthesis Rules the Virus Evolution. Genome Biol Evol 2020; 12:2467-2485. [PMID: 33125064 PMCID: PMC7665462 DOI: 10.1093/gbe/evaa229] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Accepted: 10/23/2020] [Indexed: 02/06/2023] Open
Abstract
In the context of the COVID-19 pandemic, we describe here the singular metabolic background that constrains enveloped RNA viruses to evolve toward likely attenuation in the long term, possibly after a step of increased pathogenicity. Cytidine triphosphate (CTP) is at the crossroad of the processes allowing SARS-CoV-2 to multiply, because CTP is in demand for four essential metabolic steps. It is a building block of the virus genome, it is required for synthesis of the cytosine-based liponucleotide precursors of the viral envelope, it is a critical building block of the host transfer RNAs synthesis and it is required for synthesis of dolichol-phosphate, a precursor of viral protein glycosylation. The CCA 3'-end of all the transfer RNAs required to translate the RNA genome and further transcripts into the proteins used to build active virus copies is not coded in the human genome. It must be synthesized de novo from CTP and ATP. Furthermore, intermediary metabolism is built on compulsory steps of synthesis and salvage of cytosine-based metabolites via uridine triphosphate that keep limiting CTP availability. As a consequence, accidental replication errors tend to replace cytosine by uracil in the genome, unless recombination events allow the sequence to return to its ancestral sequences. We document some of the consequences of this situation in the function of viral proteins. This unique metabolic setup allowed us to highlight and provide a raison d'être to viperin, an enzyme of innate antiviral immunity, which synthesizes 3'-deoxy-3',4'-didehydro-CTP as an extremely efficient antiviral nucleotide.
Affiliation(s)
- Zhihua Ou
  - BGI-Shenzhen, Shenzhen, China
  - Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen, China
- Christos Ouzounis
  - Biological Computation and Process Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessalonica, Greece
- Daxi Wang
  - BGI-Shenzhen, Shenzhen, China
  - Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen, China
- Wanying Sun
  - BGI-Shenzhen, Shenzhen, China
  - Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen, China
  - BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, China
- Junhua Li
  - BGI-Shenzhen, Shenzhen, China
  - Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen, China
- Weijun Chen
  - Shenzhen Key Laboratory of Unknown Pathogen Identification, BGI-Shenzhen, Shenzhen, China
  - BGI PathoGenesis Pharmaceutical Technology, BGI-Shenzhen, Shenzhen, China
- Philippe Marlière
  - TESSSI, The European Syndicate of Synthetic Scientists and Industrialists, Paris, France
- Antoine Danchin
  - Kodikos Labs, Institut Cochin, Paris, France
  - School of Biomedical Sciences, Li KaShing Faculty of Medicine, Hong Kong University, Pokfulam, Hong Kong

43
Characterization of a Novel Mitovirus of the Sand Fly Lutzomyia longipalpis Using Genomic and Virus-Host Interaction Signatures. Viruses 2020; 13:v13010009. [PMID: 33374584 PMCID: PMC7822452 DOI: 10.3390/v13010009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 11/30/2020] [Revised: 12/17/2020] [Accepted: 12/21/2020] [Indexed: 02/06/2023] Open
Abstract
Hematophagous insects act as the major reservoirs of infectious agents due to their intimate contact with a large variety of vertebrate hosts. Lutzomyia longipalpis is the main vector of Leishmania chagasi in the New World, but its role as a host of viruses is poorly understood. In this work, Lu. longipalpis RNA libraries were subjected to progressive assembly using viral profile HMMs as seeds. A sequence phylogenetically related to fungal viruses of the genus Mitovirus was identified and this novel virus was named Lul-MV-1. The 2697-base genome presents a single gene coding for an RNA-directed RNA polymerase with an organellar genetic code. To determine the possible host of Lul-MV-1, we analyzed the molecular characteristics of the viral genome. Dinucleotide composition and codon usage showed profiles similar to mitochondrial DNA of invertebrate hosts. Also, the virus-derived small RNA profile was consistent with the activation of the siRNA pathway, with size distribution and 5′ base enrichment analogous to those observed in viruses of sand flies, reinforcing Lu. longipalpis as a putative host. Finally, RT-PCR of different insect pools and sequences of public Lu. longipalpis RNA libraries confirmed the high prevalence of Lul-MV-1. This is the first report of a mitovirus infecting an insect host.
|
44
|
Analysis of Virus Population Profiles within Pigs Infected with Virulent Classical Swine Fever Viruses: Evidence for Bottlenecks in Transmission but Absence of Tissue-Specific Virus Variants. J Virol 2020; 94:JVI.01119-20. [PMID: 32699086 DOI: 10.1128/jvi.01119-20] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 06/04/2020] [Accepted: 07/14/2020] [Indexed: 11/20/2022] Open
Abstract
Classical swine fever virus (CSFV) contains a specific motif within the E2 glycoprotein that differs between strains of different virulence. In the highly virulent CSFV strain Koslov, this motif comprises residues S763/L764 in the polyprotein. However, L763/P764 represent the predominant alleles in published CSFV genomes. In this study, changes were introduced into the CSFV strain Koslov (here called vKos_SL) to generate modified CSFVs with substitutions at residues 763 and/or 764 (vKos_LL, vKos_SP, and vKos_LP). The properties of these mutant viruses, in comparison to those of vKos_SL, were determined in pigs. Each of the viruses was virulent and induced typical clinical signs of CSF, but the vKos_LP strain produced them significantly earlier. Full-length CSFV cDNA amplicons (12.3 kb) derived from sera of infected pigs were deep sequenced and cloned to reveal the individual haplotypes that contributed to the single-nucleotide polymorphism (SNP) profiles observed in the virus population. The SNP profiles for vKos_SL and vKos_LL displayed low-level heterogeneity across the entire genome, whereas vKos_SP and vKos_LP displayed limited diversity with a few high-frequency SNPs. This indicated that vKos_SL and vKos_LL exhibited a higher level of fitness in the host and more stability at the consensus level, whereas several consensus changes were observed in the vKos_SP and vKos_LP sequences, pointing to adaptation. For each virus, only a subset of the variants present within the virus inoculums were maintained in the infected pigs. No clear tissue-dependent quasispecies differentiation occurred within inoculated pigs; however, clear evidence for transmission bottlenecks to contact animals was observed, with subsequent loss of sequence diversity.
IMPORTANCE The surface-exposed E2 protein of classical swine fever virus is required for its interaction with host cells. A short motif within this protein varies between strains of different virulence. The importance of two particular amino acid residues in determining the properties of a highly virulent strain of the virus has been analyzed. Each of the different viruses tested proved highly virulent, but one of them produced earlier, but not more severe, disease. By analyzing the virus genomes present within infected pigs, it was found that the viruses which replicated within inoculated animals were only a subset of those within the virus inoculum. Furthermore, following contact transmission, it was shown that a very restricted set of viruses had transferred between animals. There were no significant differences in the virus populations present in various tissues of the infected animals. These results indicate mechanisms of virus population change during transmission between animals.
|
45
|
Khodary SM, Anwar AM. Insights into The Codon Usage Bias of 13 Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) Isolates from Different Geo-locations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.04.01.019463. [PMID: 34013275 PMCID: PMC8132235 DOI: 10.1101/2020.04.01.019463] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 06/12/2023]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the causative agent of coronavirus disease 2019 (COVID-19), an infectious disease that spread throughout the world and was declared a pandemic by the World Health Organization (WHO). In this study, we performed a genome-wide analysis of the codon usage bias (CUB) of 13 SARS-CoV-2 isolates from different geo-locations (countries) in an attempt to characterize it, unravel the main force shaping its pattern, and understand its adaptation to Homo sapiens. Overall results revealed that SARS-CoV-2 codon usage is slightly biased, similar to that of other RNA viruses. Nucleotide and dinucleotide compositions displayed a bias toward A/U content in all codon positions and a preference for CpU-ended codons, respectively. Eight common putative preferred codons were identified, and all of them were A/U-ended (U-ended: 7, A-ended: 1). In addition, natural selection was found to be the main force structuring the codon usage pattern of SARS-CoV-2. However, mutation pressure and other factors such as compositional constraints and hydrophobicity had an undeniable contribution. Two adaptation indices were utilized and indicated that SARS-CoV-2 is moderately adapted to Homo sapiens compared to other human viruses. The outcome of this study may help in understanding the underlying factors involved in the evolution of SARS-CoV-2 and may aid in vaccine design strategies.
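The dinucleotide composition mentioned in this abstract is commonly summarized as an observed/expected odds ratio, rho(XY) = f(XY) / (f(X) f(Y)). The sketch below is only an illustration of that general formula, not the authors' pipeline; the function name `dinucleotide_odds_ratios` and the choice to skip ambiguous bases are assumptions made here.

```python
from collections import Counter

RNA_BASES = "ACGU"


def dinucleotide_odds_ratios(sequence):
    """Return rho(XY) = f(XY) / (f(X) * f(Y)) for each dinucleotide XY.

    Hypothetical helper for illustration: DNA input is converted to RNA
    letters, and any dinucleotide containing a non-ACGU symbol is skipped.
    """
    seq = sequence.upper().replace("T", "U")
    mono = Counter(b for b in seq if b in RNA_BASES)
    di = Counter(
        seq[i:i + 2]
        for i in range(len(seq) - 1)
        if seq[i] in RNA_BASES and seq[i + 1] in RNA_BASES
    )
    n_mono = sum(mono.values())
    n_di = sum(di.values())
    if n_mono == 0 or n_di == 0:
        return {}
    ratios = {}
    for x in RNA_BASES:
        for y in RNA_BASES:
            expected = (mono[x] / n_mono) * (mono[y] / n_mono)
            if expected > 0:  # undefined when either base is absent
                ratios[x + y] = (di[x + y] / n_di) / expected
    return ratios


if __name__ == "__main__":
    example = dinucleotide_odds_ratios("AUGGCUAUUCGAUUAGCU")
    for pair in sorted(example):
        print(pair, round(example[pair], 3))
```

Values well below 1 (often reported for CpG in RNA viruses) indicate under-representation relative to base composition, and values above 1 indicate over-representation.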
|
46
|
Dutta R, Buragohain L, Borah P. Analysis of codon usage of severe acute respiratory syndrome corona virus 2 (SARS-CoV-2) and its adaptability in dog. Virus Res 2020; 288:198113. [PMID: 32771430 PMCID: PMC7410794 DOI: 10.1016/j.virusres.2020.198113] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 05/02/2020] [Revised: 08/02/2020] [Accepted: 08/03/2020] [Indexed: 01/10/2023]
Abstract
Codon analysis reveals that natural selection and other undefined factors, rather than mutational pressure, dominate the overall codon usage bias in SARS-CoV-2. The host adaptation potential of SARS-CoV-2 is greater in human than in dog.
Severe acute respiratory syndrome corona virus 2 (SARS-CoV-2) is recognized as one of the life-threatening viruses causing the most destructive pandemic in this century. The genesis of this virus is still unknown. To elucidate its molecular evolution and regulation of gene expression, knowledge of codon usage is a prerequisite. In this study, an attempt was made to document the genome-wide codon usage profile and the various factors influencing the codon usage patterns of SARS-CoV-2 in human and dog. The SARS-CoV-2 genome showed a relative abundance of A and U nucleotides, and relative synonymous codon usage analysis revealed that the preferred synonymous codons mostly end with A/U. The analysis of ENc-GC3s, neutrality and parity rule 2 plots indicated that natural selection and other undefined factors dominate the overall codon usage bias in SARS-CoV-2, whereas the impact of mutation pressure is comparatively minor. The codon adaptation index and relative codon deoptimization index of SARS-CoV-2 indicated that human is a more favoured host for adaptation than dog. These results enhance our understanding of the factors involved in the evolution of the novel human SARS-CoV-2 and its adaptability in dog.
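Relative synonymous codon usage (RSCU), used in this abstract, is the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. A minimal sketch follows, assuming the standard genetic code (an organellar code, as used by mitoviruses, would need a different table); the function name `rscu` is hypothetical and this is not the authors' code.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def rscu(coding_sequence):
    """Return RSCU per codon: count * family_size / family_total.

    Illustrative helper: stop codons and single-codon amino acids
    (Met, Trp) are skipped, as are amino acids absent from the sequence.
    """
    seq = coding_sequence.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    counts = Counter(c for c in codons if c in CODON_TABLE)

    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values = {}
    for aa, family in families.items():
        if aa == "*" or len(family) == 1:
            continue
        total = sum(counts[c] for c in family)
        if total == 0:
            continue
        for codon in family:
            values[codon] = counts[codon] * len(family) / total
    return values


if __name__ == "__main__":
    for codon, value in sorted(rscu("GCUGCUGCUGCAAAAAAG").items()):
        print(codon, round(value, 2))
```

An RSCU above 1 marks a codon used more often than expected under equal synonymous usage; the A/U-ending preference reported above would appear as values above 1 for codons ending in A or U.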
Affiliation(s)
- Rupam Dutta
  - Department of Animal Biotechnology, College of Veterinary Science, Assam Agricultural University, Khanapara, Guwahati 22, Assam, India
- Lukumoni Buragohain
  - Department of Animal Biotechnology, College of Veterinary Science, Assam Agricultural University, Khanapara, Guwahati 22, Assam, India
- Probodh Borah
  - Department of Animal Biotechnology, College of Veterinary Science, Assam Agricultural University, Khanapara, Guwahati 22, Assam, India
|
47
|
Abstract
A novel pathogen, named SARS-CoV-2, caused an unprecedented worldwide pandemic in the first half of 2020. As SARS-CoV-2 genome sequences have become available, tracking variations in the viral genome has become an important focus for scientists. In this study, 30366 SARS-CoV-2 isolate genomes were aligned using software developed by our group (ODOTool), and 11 variations present in over 10% of all isolates were discussed. The results indicated that the frequencies of these 11 variations range from 3.56% to 88.44% and differ greatly depending on the continent in which they were reported. Although some variations occur at low frequency in some continents, the C14408T and A23403G variations, on Nsp12 and the S protein respectively, were the most prominent variations worldwide, and both cause missense mutations. It is also notable that most isolates carry the C14408T and A23403G variations simultaneously, and that nearly all isolates carrying the G25563T variation on ORF3a also carry the C14408T and A23403G variations, although their location distributions are not similar. All these data should be considered in the development of vaccine and antiviral treatment strategies, as well as in tracing the diversity of the virus around the world.
Affiliation(s)
- Osman Mutluhan Ugurel
  - Department of Bioengineering, Faculty of Chemical and Metallurgical Engineering, Yıldız Technical University, İstanbul, Turkey
  - Department of Basic Sciences, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey
- Oguz Ata
  - Department of Software Engineering, School of Engineering and Natural Sciences, Altınbaş University, İstanbul, Turkey
- Dilek Turgut-Balik
  - Department of Bioengineering, Faculty of Chemical and Metallurgical Engineering, Yıldız Technical University, İstanbul, Turkey
|
48
|
Jitobaom K, Phakaratsakul S, Sirihongthong T, Chotewutmontri S, Suriyaphol P, Suptawiwat O, Auewarakul P. Codon usage similarity between viral and some host genes suggests a codon-specific translational regulation. Heliyon 2020; 6:e03915. [PMID: 32395662 PMCID: PMC7205639 DOI: 10.1016/j.heliyon.2020.e03915] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 02/03/2020] [Revised: 03/02/2020] [Accepted: 04/30/2020] [Indexed: 02/03/2023] Open
Abstract
The codon usage pattern is a specific characteristic of each species; however, the codon usage of all of the genes in a genome is not uniform. Intriguingly, most viruses have codon usage patterns that are vastly different from the optimal codon usage of their hosts. How viral genes with different codon usage patterns are efficiently expressed during a viral infection is unclear. An analysis of the similarity between viral codon usage and the codon usage of the individual genes of a host genome has never been performed. In this study, we demonstrated that the codon usage of human RNA viruses is similar to that of some human genes, especially those involved in the cell cycle. This finding was substantiated by its concordance with previous reports of an upregulation at the protein level of some of these biological processes. It therefore suggests that some suboptimal viral codon usage patterns may actually be compatible with cellular translational machineries in infected conditions.
Affiliation(s)
- Kunlakanya Jitobaom
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
- Supinya Phakaratsakul
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
- Sasithorn Chotewutmontri
  - Faculty of Medicine and Public Health, HRH Princess Chulabhorn College of Medical Science, Chulabhorn Royal Academy, Bangkok, Thailand
- Prapat Suriyaphol
  - Division of Bioinformatics and Data Management for Research, Department of Research and Development, Faculty of Medicine, Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Center of Excellence in Bioinformatics and Clinical Data Management, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Ornpreya Suptawiwat
  - Faculty of Medicine and Public Health, HRH Princess Chulabhorn College of Medical Science, Chulabhorn Royal Academy, Bangkok, Thailand
- Prasert Auewarakul
  - Department of Microbiology, Faculty of Medicine Siriraj Hospital, Mahidol University, Thailand
|
49
|
Hussain S, Shinu P, Islam MM, Chohan MS, Rasool ST. Analysis of Codon Usage and Nucleotide Bias in Middle East Respiratory Syndrome Coronavirus Genes. Evol Bioinform Online 2020; 16:1176934320918861. [PMID: 32425493 PMCID: PMC7218340 DOI: 10.1177/1176934320918861] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 03/13/2020] [Accepted: 03/23/2020] [Indexed: 12/31/2022] Open
Abstract
The Middle East Respiratory Syndrome (MERS) is an emerging disease caused by a recently identified human coronavirus (CoV). Over 2494 laboratory-confirmed cases and 858 MERS-related deaths have been reported from 27 countries. MERS-CoV has been associated with a high case fatality rate, especially in patients with pre-existing conditions. Despite the fatal nature of MERS-CoV infection, a comprehensive study to explore its evolution and adaptation in different hosts is lacking. We performed codon usage analyses on 4751 MERS-CoV genes and determined underlying forces that affect the codon usage bias in the MERS-CoV genome. The current analyses revealed a low but highly conserved, gene-specific codon usage bias in the MERS-CoV genome. The codon usage bias is mainly shaped by natural selection, while mutational pressure emerged as a minor factor affecting codon usage in some genes. Other contributory factors included CpG dinucleotide bias, physical and chemical properties of encoded proteins and gene length. Results reported in this study provide considerable insights into the molecular evaluation of MERS-CoV and could serve as a theoretical basis for optimizing MERS-CoV gene expression to study the functional relevance of various MERS-CoV proteins. Alternatively, an attenuated vaccine strain containing hundreds of silent mutations could be engineered. Codon de-optimization will not affect the amino acid sequence or antigenicity of a vaccine strain, but the sheer number of mutations would make viral reversion to a virulent phenotype extremely unlikely.
Affiliation(s)
- Snawar Hussain
  - Department of Biomedical Science, College of Clinical Pharmacy, King Faisal University, Al-Ahsa, Kingdom of Saudi Arabia
- Pottathil Shinu
  - Department of Biomedical Science, College of Clinical Pharmacy, King Faisal University, Al-Ahsa, Kingdom of Saudi Arabia
- Mohammed Monirul Islam
  - Department of Biomedical Science, College of Clinical Pharmacy, King Faisal University, Al-Ahsa, Kingdom of Saudi Arabia
- Muhammad Shahzad Chohan
  - Department of Biomedical Science, College of Clinical Pharmacy, King Faisal University, Al-Ahsa, Kingdom of Saudi Arabia
- Sahibzada Tasleem Rasool
  - Department of Biomedical Science, College of Clinical Pharmacy, King Faisal University, Al-Ahsa, Kingdom of Saudi Arabia
|
50
|
Dilucca M, Forcelloni S, Georgakilas AG, Giansanti A, Pavlopoulou A. Codon Usage and Phenotypic Divergences of SARS-CoV-2 Genes. Viruses 2020; 12:E498. [PMID: 32366025 PMCID: PMC7290700 DOI: 10.3390/v12050498] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Received: 03/28/2020] [Revised: 04/18/2020] [Accepted: 04/27/2020] [Indexed: 12/11/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which first occurred in Wuhan (China) in December of 2019, causes a severe acute respiratory illness with a high mortality rate, and has spread around the world. To gain an understanding of the evolution of the newly emerging SARS-CoV-2, we herein analyzed the codon usage pattern of SARS-CoV-2. For this purpose, we compared the codon usage of SARS-CoV-2 with that of other viruses belonging to the subfamily Orthocoronavirinae. We found that SARS-CoV-2 has a high AU content that strongly influences its codon usage, which appears to be better adapted to the human host. We also studied the evolutionary pressures that influence the codon usage of five conserved coronavirus genes encoding the viral replicase, spike, envelope, membrane and nucleocapsid proteins. We found different patterns of both mutational bias and natural selection that affect the codon usage of these genes. Moreover, we show here that the two integral membrane proteins (matrix and envelope) tend to evolve slowly by accumulating nucleotide mutations on their corresponding genes. Conversely, genes encoding nucleocapsid (N), viral replicase and spike proteins (S), although they are regarded as important targets for the development of vaccines and antiviral drugs, tend to evolve faster in comparison to the two genes mentioned above. Overall, our results suggest that the higher divergence observed for the latter three genes could represent a significant barrier in the development of antiviral therapeutics against SARS-CoV-2.
Affiliation(s)
- Maddalena Dilucca
  - Physics Department, Sapienza University of Rome, 00185 Rome, Italy
  - Liceo Scientifico Statale Augusto Righi, 00187 Rome, Italy
- Sergio Forcelloni
  - Physics Department, Sapienza University of Rome, 00185 Rome, Italy
- Alexandros G. Georgakilas
  - DNA Damage Laboratory, Physics Department, School of Applied Mathematical and Physical Sciences, National Technical University of Athens (NTUA), Zografou Campous, 15780 Athens, Greece
- Andrea Giansanti
  - Physics Department, Sapienza University of Rome, 00185 Rome, Italy
  - INFN Roma1 Unit, 00185 Rome, Italy
- Athanasia Pavlopoulou
  - Izmir Biomedicine and Genome Center (IBG), 35340 Balcova, Izmir, Turkey
  - Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, 35340 Balcova, Izmir, Turkey
|