1. Jing S, Jie W, Yongping M, Yan S, Zhi L. Genealogical Diversity of Endogenous Retrovirus in the Jawless Fish Genome. J Microbiol Biotechnol 2023; 33:1412-1419. [PMID: 37583082 DOI: 10.4014/jmb.2306.06028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/12/2023] [Revised: 07/18/2023] [Accepted: 07/19/2023] [Indexed: 08/17/2023]
Abstract
Retroviral integration into ancient vertebrate genomes left traces that can shed light on the early history of viruses. In this study, we explored the early evolution of retroviruses by isolating nine Spuma endogenous retroviruses (ERVs) and one Epsilon ERV from the genomes of Agnatha and Chondrichthyes. Phylogenetic analysis of protein sequences revealed a striking pattern of co-evolution between jawless fish ERV and their host, while shark ERV underwent ancient cross-class viral transmission with jawless fish, ray-finned fish, and amphibians. Nucleotide sequence analysis showed that jawless fish ERV emerged in the Palaeozoic period, relatively later than ray-finned fish ERV. Moreover, codon analysis suggested that the jawless fish ERV employed an infection strategy that mimics the host codon. The genealogical diversity of ERVs in the jawless fish genome highlights the importance of studying different viral species. Overall, our findings provide valuable insights into the evolution of retroviruses and their interactions with their hosts.
Affiliation(s)
- Song Jing
  - College of Life Sciences, Shaanxi Normal University, Xi'an 710062, P.R. China
  - College of Chemistry and Biological Engineering, Hechi University, Hechi 546300, P.R. China
- Wei Jie
  - College of Environment and Life Sciences, Weinan Normal University, Weinan 714099, P.R. China
- Ma Yongping
  - College of Biological Sciences and Engineering, North Minzu University, Yinchuan 750021, P.R. China
- Sun Yan
  - College of Life Sciences, Shaanxi Normal University, Xi'an 710062, P.R. China
- Li Zhi
  - College of Life Sciences, Shaanxi Normal University, Xi'an 710062, P.R. China
2. Pawlak K, Błażej P, Mackiewicz D, Mackiewicz P. The Influence of the Selection at the Amino Acid Level on Synonymous Codon Usage from the Viewpoint of Alternative Genetic Codes. Int J Mol Sci 2023; 24:ijms24021185. [PMID: 36674703 PMCID: PMC9866869 DOI: 10.3390/ijms24021185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/10/2022] [Revised: 12/19/2022] [Accepted: 12/30/2022] [Indexed: 01/11/2023] Open
Abstract
Synonymous codon usage can be influenced by mutations and/or selection, e.g., for speed of protein translation and correct folding. However, this codon bias can also be affected by a general selection at the amino acid level due to differences in the acceptance of the loss and generation of these codons. To assess the importance of this effect, we constructed a mutation-selection model, in which we generated almost 90,000 stationary nucleotide distributions produced by mutational processes and applied a selection based on differences in physicochemical properties of amino acids. Under these conditions, we calculated the usage of fourfold degenerated (4FD) codons and compared it with the usage characteristic of the pure mutations. We considered both the standard genetic code (SGC) and alternative genetic codes (AGCs). The analyses showed that a majority of AGCs produced a greater 4FD codon bias than the SGC. The mutations producing more thymine or adenine than guanine and cytosine increased the differences in usage. On the other hand, the mutational pressures generating a lot of cytosine or guanine with a low content of adenine and thymine decreased this bias because the nucleotide content of most 4FD codons stayed in the compositional equilibrium with these pressures. The comparison of the theoretical results with those for real protein coding sequences showed that the influence of selection at the amino acid level on the synonymous codon usage cannot be neglected. The analyses indicate that the effect of amino acid selection cannot be disregarded and that it can interfere with other selection factors influencing codon usage, especially in AT-rich genomes, in which AGCs are usually used.
3. Fan Y, Guo D, Zhao S, Wei Q, Li Y, Lin T. Human genes with relative synonymous codon usage analogous to that of polyomaviruses are involved in the mechanism of polyomavirus nephropathy. Front Cell Infect Microbiol 2022; 12:992201. [PMID: 36159639 PMCID: PMC9492876 DOI: 10.3389/fcimb.2022.992201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/12/2022] [Accepted: 08/12/2022] [Indexed: 11/28/2022] Open
Abstract
Human polyomaviruses (HPyVs) can cause serious and deleterious infections in humans. Yet the molecular mechanism underlying these infections, particularly in polyomavirus nephropathy (PVAN), is not well defined. In the present study, we aimed to identify human genes with codon usage bias (CUB) similar to that of HPyV genes and explore their potential involvement in the pathogenesis of PVAN. The relative synonymous codon usage (RSCU) values of genes of HPyVs and those of human genes were computed and used for Pearson correlation analysis. The involvement of the identified correlated genes in PVAN was analyzed by validating their differential expression in publicly available transcriptomics data. Functional enrichment analysis was performed to uncover the roles of the gene sets. The RSCU analysis indicated that the A- and T-ending codons are preferentially used in HPyV genes. In total, 5400 human genes were correlated with the HPyV genes. The protein-protein interaction (PPI) network indicated strong interactions between these proteins. Gene expression analysis indicated that 229 of these genes were consistently and differentially expressed between normal kidney tissues and kidney tissues from PVAN patients. Functional enrichment analysis indicated that these genes were involved in biological processes related to transcription and in pathways related to protein ubiquitination, apoptosis, cellular response to stress, inflammation and the immune system. The identified genes may serve as diagnostic biomarkers and potential therapeutic targets for HPyV-associated diseases, especially PVAN.
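The RSCU-plus-Pearson comparison described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: the standard genetic code, the usual RSCU definition (observed codon count times family size, divided by the total count for that amino acid), and hypothetical helper names `rscu` and `pearson`. The abstract does not specify the authors' software, thresholds, or handling of unused amino acids, so none of that is reproduced here.

```python
from collections import Counter
from itertools import product
from math import sqrt

# Standard genetic code in TCAG order (assumption: no alternative codes).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(seq):
    """Return RSCU values for the 59 codons that have synonymous alternatives."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)
    values = {}
    for aa, codons in families.items():
        if aa == "*" or len(codons) == 1:  # skip stops, Met and Trp
            continue
        total = sum(counts[c] for c in codons)
        for c in codons:
            # Illustrative choice: an unused amino acid gets RSCU 0 for all codons.
            values[c] = counts[c] * len(codons) / total if total else 0.0
    return values


def pearson(x, y):
    """Pearson correlation of two equal-length vectors (nan if a variance is 0)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else float("nan")


# Hypothetical toy sequences standing in for a viral and a human coding sequence.
viral = rscu("GCTGCTGCTGCCAAAAAGTTTTTC")
human = rscu("GCCGCCGCTGCGAAGAAGTTCTTC")
codons = sorted(viral)
r = pearson([viral[c] for c in codons], [human[c] for c in codons])
```

In practice the vectors would be built per gene and the correlation repeated across all human genes, with whatever significance filter the study applied.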
Affiliation(s)
- Yu Fan
  - Department of Urology, National Clinical Research Center for Geriatrics and Organ Transplantation Center, West China Hospital of Sichuan University, Chengdu, China
- Duan Guo
  - Department of Palliative Medicine, West China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, China
  - Palliative Medicine Research Center, West China−Peking Union Medical College, Chen Zhiqian (PUMC C.C). Chen Institute of Health, Sichuan University, Chengdu, China
- Shangping Zhao
  - Department of Urology, West China School of Nursing and Organ Transplantation Center, West China Hospital of Sichuan University, Chengdu, China
- Qiang Wei
  - Department of Urology, National Clinical Research Center for Geriatrics and Organ Transplantation Center, West China Hospital of Sichuan University, Chengdu, China
- Yi Li
  - Department of Laboratory Medicine, West China Hospital, Sichuan University, Chengdu, China
  - *Correspondence: Tao Lin; Yi Li
- Tao Lin
  - Department of Urology, National Clinical Research Center for Geriatrics and Organ Transplantation Center, West China Hospital of Sichuan University, Chengdu, China
  - *Correspondence: Tao Lin; Yi Li
4. Analysis of the Contribution of Intrinsic Disorder in Shaping Potyvirus Genetic Diversity. Viruses 2022; 14:v14091959. [PMID: 36146764 PMCID: PMC9504506 DOI: 10.3390/v14091959] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 07/11/2022] [Revised: 08/30/2022] [Accepted: 08/31/2022] [Indexed: 12/30/2022] Open
Abstract
Intrinsically disordered regions (IDRs) are abundant in the proteome of RNA viruses. The multifunctional properties of these regions are widely documented and their structural flexibility is associated with the low constraint in their amino acid positions. Therefore, from an evolutionary standpoint, these regions could have a greater propensity to accumulate non-synonymous mutations (NS) than highly structured regions (ORs, or 'ordered regions'). To address this hypothesis, we compared the distribution of NS, which we relate here to mutational robustness, in IDRs and ORs in the genome of potyviruses, a major genus of plant viruses. For this purpose, a simulation model was built and used to distinguish a possible selection phenomenon in the biological datasets from randomly generated mutations. We analyzed several short-term experimental evolution datasets. An analysis was also performed on the natural diversity of three different species of potyviruses reflecting their long-term evolution. We observed that the mutational robustness of IDRs is significantly higher than that of ORs. Moreover, the substitutions in the ORs are strongly constrained by the conservation of the physico-chemical properties of the amino acids. This feature is not found in the IDRs, where the substitutions tend to be more random. This reflects the weak structural constraints in these regions, wherein an amino acid polymorphism is naturally conserved. In the course of evolution, potyvirus IDRs and ORs follow different evolutionary paths with respect to their mutational robustness. These results lead the authors to consider the hypothesis that IDRs and their associated amino acid polymorphism could constitute a potential adaptive reservoir.
5. Wang Q, Lyu X, Cheng J, Fu Y, Lin Y, Abdoulaye AH, Jiang D, Xie J. Codon Usage Provides Insights into the Adaptive Evolution of Mycoviruses in Their Associated Fungi Host. Int J Mol Sci 2022; 23:ijms23137441. [PMID: 35806445 PMCID: PMC9267111 DOI: 10.3390/ijms23137441] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 05/18/2022] [Revised: 06/29/2022] [Accepted: 06/30/2022] [Indexed: 11/16/2022] Open
Abstract
Codon usage bias (CUB) can reflect co-evolutionary changes between viruses and hosts. In contrast to plant and animal viruses, systematic analysis of codon usage among the mycoviruses that infect plant pathogenic fungi is limited. We performed an extensive analysis of codon usage patterns among 98 characterized RNA mycoviruses from eight phytopathogenic fungi. The GC and GC3s contents of mycoviruses vary widely, from 29.35% to 64.62% and from 24.32% to 97.13%, respectively. Mycoviral CUB is weak, and natural selection plays a major role in the formation of the mycoviral codon usage pattern. In this study, we demonstrated that the codon usage of mycoviruses is similar to that of some host genes, especially those involved in RNA biosynthetic process and transcription, suggesting that CUB is a potential evolutionary mechanism by which mycoviruses adapt to their hosts.
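The GC and GC3s figures quoted in this abstract can be illustrated with a small sketch. It assumes the common definitions: GC is the fraction of G or C over all bases, and GC3s is the fraction of G or C at the third position of codons that have synonymous alternatives (so Met, Trp and stop codons are excluded under the standard genetic code). The function names are hypothetical and the abstract does not say which tool the authors used.

```python
# Codons without synonymous alternatives in the standard code, plus stops.
NON_SYNONYMOUS = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def gc_content(seq):
    """Fraction of G or C over the whole sequence."""
    seq = seq.upper().replace("U", "T")
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def gc3s(seq):
    """Fraction of G or C at the third position of synonymous codons."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    thirds = [c[2] for c in codons if c not in NON_SYNONYMOUS]
    if not thirds:
        raise ValueError("no synonymous codons in sequence")
    return sum(base in "GC" for base in thirds) / len(thirds)


# Hypothetical toy coding sequence: ATG GCC GCT AAA TGA
example = "ATGGCCGCTAAATGA"
gc = gc_content(example)
gc3 = gc3s(example)
```

Reported as percentages, these two values per genome are what the ranges in the abstract summarize.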
Affiliation(s)
- Qianqian Wang
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
  - Hubei Hongshan Laboratory, Wuhan 430070, China
- Xueliang Lyu
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Jiasen Cheng
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Yanping Fu
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Yang Lin
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Assane Hamidou Abdoulaye
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Daohong Jiang
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
  - Hubei Hongshan Laboratory, Wuhan 430070, China
- Jiatao Xie
  - State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China
  - The Hubei Key Lab of Plant Pathology, College of Plant Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
  - Hubei Hongshan Laboratory, Wuhan 430070, China
  - Correspondence: Tel.: +86-185-027-36960
6. Nambou K, Anakpa M, Tong YS. Human genes with codon usage bias similar to that of the nonstructural protein 1 gene of influenza A viruses are conjointly involved in the infectious pathogenesis of influenza A viruses. Genetica 2022; 150:97-115. [PMID: 35396627 PMCID: PMC8992787 DOI: 10.1007/s10709-022-00155-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 08/02/2021] [Accepted: 03/24/2022] [Indexed: 11/27/2022]
Abstract
The molecular mechanisms of the non-structural protein 1 (NS1) in influenza A-induced pathological changes remain ambiguous. This study explored the pathogenesis of human infection by influenza A viruses (IAVs) by identifying human genes with codon usage bias (CUB) similar to that of the NS1 gene of these viruses, based on the relative synonymous codon usage (RSCU). The CUB of the IAV subtypes H1N1, H3N2, H3N8, H5N1, H5N2, H5N8, H7N9 and H9N2 was analyzed and the correlation of RSCU values of NS1 sequences with those of the human genes was calculated. The CUB of NS1 was uneven and codons ending with A/U were preferred. The ENC-GC3 and neutrality plots suggested natural selection as the main determinant of CUB. The RCDI, CAI and SiD values showed that the viruses had a high degree of adaptability to humans. A total of 2155 human genes showed significant RSCU-based correlation (p < 0.05 and r > 0.5) with NS1 coding sequences and were considered human genes with CUB similar to that of the NS1 gene of IAV subtypes. Differences and similarities in the subtype-specific human protein–protein interaction (PPI) networks and their functions were recorded among IAV subtypes, indicating that NS1 of each IAV subtype has a specific pathogenic mechanism. Processes and pathways involved in influenza, transcription, immune response and cell cycle were enriched in human gene sets retrieved based on the CUB of the NS1 gene of IAV subtypes. The present work may advance our understanding of the mechanism of NS1 in human infections with IAV subtypes and shed light on therapeutic options.
Affiliation(s)
- Komi Nambou
  - Shenzhen Nambou1 Biotech Company Limited, 998 Wisdom Valley, No. 38-56 Zhenming Road, Guangming District, Shenzhen, 518106, China
- Manawa Anakpa
  - Centre d'Informatique et de Calcul, Université de Lomé, Boulevard Gnassingbé Eyadema, 01 B.P. 1515, Lomé, Togo
- Yin Selina Tong
  - Shenzhen Nambou1 Biotech Company Limited, 998 Wisdom Valley, No. 38-56 Zhenming Road, Guangming District, Shenzhen, 518106, China
7. Bias at the third nucleotide of codon pairs in virus and host genomes. Sci Rep 2022; 12:4522. [PMID: 35296743 PMCID: PMC8927144 DOI: 10.1038/s41598-022-08570-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2021] [Accepted: 03/09/2022] [Indexed: 11/29/2022] Open
Abstract
Genomes of different sizes and complexity can be compared using common features. Most genomes contain open reading frames, and most genomes use the same genetic code. Redundancy in the genetic code means that different biases in the third nucleotide position of a codon exist in different genomes. However, the nucleotide composition of viruses can be quite different from host nucleotide composition making it difficult to assess the relevance of these biases. Here we show that grouping codons of a codon-pair according to the GC content of the first two nucleotide positions of each codon reveals patterns in nucleotide usage at the third position of the 1st codon. Differences between the observed and expected biases occur predominantly when the first two nucleotides of the 2nd codon are both S (strong, G or C) or both W (weak, A or T), not a mixture of strong and weak. The data indicates that some codon pairs are preferred because of the strength of the interactions between the codon and anticodon, the adjacent tRNAs and the ribosome. Using base-pairing strength and third position bias facilitates the comparison of genomes of different size and nucleotide composition and reveals patterns not previously described.
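The grouping described in this abstract can be sketched as below. This is an interpretation rather than the authors' pipeline: each codon in an adjacent pair is labelled by whether its first two nucleotides are both strong (G or C), both weak (A or T), or a mixture, and the third nucleotide of the 1st codon is tallied per label pair. The names `dinucleotide_class` and `third_position_counts` are hypothetical, and the sketch assumes unambiguous bases (any non-G/C character would be treated as weak).

```python
from collections import Counter, defaultdict

STRONG = set("GC")


def dinucleotide_class(codon):
    """Label a codon by the strength of its first two nucleotides."""
    strong = sum(base in STRONG for base in codon[:2])
    if strong == 2:
        return "SS"
    if strong == 0:
        return "WW"
    return "mixed"


def third_position_counts(seq):
    """Tally the 3rd nucleotide of the 1st codon for each class of codon pair."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    tallies = defaultdict(Counter)
    for first, second in zip(codons, codons[1:]):
        key = (dinucleotide_class(first), dinucleotide_class(second))
        tallies[key][first[2]] += 1
    return tallies


# Hypothetical toy open reading frame: GCA GGT AAT TTC
tallies = third_position_counts("GCAGGTAATTTC")
```

Comparing these observed tallies with an expectation derived from overall nucleotide composition is the step the abstract refers to as observed versus expected bias; that expectation model is not specified there and is not attempted here.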
8. Bartas M, Volná A, Beaudoin CA, Poulsen ET, Červeň J, Brázda V, Špunda V, Blundell TL, Pečinka P. Unheeded SARS-CoV-2 proteins? A deep look into negative-sense RNA. Brief Bioinform 2022; 23:6539840. [PMID: 35229157 PMCID: PMC9116216 DOI: 10.1093/bib/bbac045] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 10/27/2021] [Revised: 01/13/2022] [Accepted: 01/29/2022] [Indexed: 01/27/2023] Open
Abstract
SARS-CoV-2 is a novel positive-sense single-stranded RNA virus from the Coronaviridae family (genus Betacoronavirus), which has been established as causing the COVID-19 pandemic. The genome of SARS-CoV-2 is one of the largest among known RNA viruses, comprising at least 26 known protein-coding loci. Studies thus far have outlined the coding capacity of the positive-sense strand of the SARS-CoV-2 genome, which can be used directly for protein translation. However, it has been recently shown that transcribed negative-sense viral RNA intermediates that arise during viral genome replication from positive-sense viruses can also code for proteins. No studies have yet explored the potential for negative-sense SARS-CoV-2 RNA intermediates to contain protein-coding loci. Thus, using sequence and structure-based bioinformatics methodologies, we have investigated the presence and validity of putative negative-sense ORFs (nsORFs) in the SARS-CoV-2 genome. Nine nsORFs were discovered to contain strong eukaryotic translation initiation signals and high codon adaptability scores, and several of the nsORFs were predicted to interact with RNA-binding proteins. Evolutionary conservation analyses indicated that some of the nsORFs are deeply conserved among related coronaviruses. Three-dimensional protein modeling revealed the presence of higher order folding among all putative SARS-CoV-2 nsORFs, and subsequent structural mimicry analyses suggest similarity of the nsORFs to DNA/RNA-binding proteins and proteins involved in immune signaling pathways. Altogether, these results suggest the potential existence of still undescribed SARS-CoV-2 proteins, which may play an important role in the viral lifecycle and COVID-19 pathogenesis.
Affiliation(s)
- Martin Bartas
  - Department of Biology and Ecology, University of Ostrava, Ostrava 710 00, Czech Republic
- Adriana Volná
  - Department of Physics, University of Ostrava, Ostrava 710 00, Czech Republic
- Christopher A Beaudoin
  - Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, UK
- Jiří Červeň
  - Department of Biology and Ecology, University of Ostrava, Ostrava 710 00, Czech Republic
- Václav Brázda
  - Institute of Biophysics, Czech Academy of Sciences, Brno, 612 65, Czech Republic
- Vladimír Špunda
  - Department of Physics, University of Ostrava, Ostrava 710 00, Czech Republic
  - Global Change Research Institute, Czech Academy of Sciences, Brno, 603 00, Czech Republic
- Tom L Blundell
  - Department of Biochemistry, Sanger Building, University of Cambridge, Tennis Court Rd, Cambridge CB2 1GA, UK
- Petr Pečinka
  - Department of Biology and Ecology, University of Ostrava, Ostrava 710 00, Czech Republic
9. Yi K, Kim SY, Bleazard T, Kim T, Youk J, Ju YS. Mutational spectrum of SARS-CoV-2 during the global pandemic. Exp Mol Med 2021; 53:1229-1237. [PMID: 34453107 PMCID: PMC8393781 DOI: 10.1038/s12276-021-00658-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 11/30/2020] [Revised: 04/29/2021] [Accepted: 05/11/2021] [Indexed: 02/07/2023] Open
Abstract
Viruses accumulate mutations under the influence of natural selection and host-virus interactions. Through a systematic comparison of 351,525 full viral genome sequences collected during the recent COVID-19 pandemic, we reveal the spectrum of SARS-CoV-2 mutations. Unlike those of other viruses, the mutational spectrum of SARS-CoV-2 exhibits extreme asymmetry, with a much higher rate of C>U than U>C substitutions, as well as a higher rate of G>U than U>G substitutions. This suggests directional genome sequence evolution during transmission. The substantial asymmetry and directionality of the mutational spectrum enable pseudotemporal tracing of SARS-CoV-2 without prior information about the root sequence, collection time, and sampling region. This shows that the viral genome sequences collected in Asia are similar to the original genome sequence. Adjusted estimation of the dN/dS ratio accounting for the asymmetrical mutational spectrum also shows evidence of negative selection on viral genes, consistent with previous reports. Our findings provide deep insights into the mutational processes in SARS-CoV-2 viral infection and advance the understanding of the history and future evolution of the virus.
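The asymmetry this abstract reports (for example, more C>U than U>C substitutions) can be illustrated by tallying directed substitution types. The sketch below makes simplifying assumptions that are not taken from the study: sequences are already aligned to a reference of equal length, every difference from the reference is counted once per sample, and gaps or ambiguous bases are skipped. The function name `substitution_spectrum` is hypothetical, and the study's normalization and adjusted dN/dS estimation are not reproduced.

```python
from collections import Counter

RNA_BASES = set("ACGU")


def substitution_spectrum(reference, samples):
    """Count directed substitutions (e.g. 'C>U') of samples against a reference."""
    reference = reference.upper().replace("T", "U")
    spectrum = Counter()
    for sample in samples:
        sample = sample.upper().replace("T", "U")
        if len(sample) != len(reference):
            raise ValueError("sample is not aligned to the reference length")
        for ref_base, alt_base in zip(reference, sample):
            # Skip matches, gaps and ambiguous characters such as '-' or 'N'.
            if ref_base == alt_base:
                continue
            if ref_base in RNA_BASES and alt_base in RNA_BASES:
                spectrum[f"{ref_base}>{alt_base}"] += 1
    return spectrum


# Hypothetical toy alignment, not real SARS-CoV-2 data.
spectrum = substitution_spectrum("ACGUC", ["AUGUC", "ACUUC", "ACGUU"])
forward, reverse = spectrum["C>U"], spectrum["U>C"]
```

Comparing `forward` with `reverse` for each complementary pair of substitution types gives a crude view of the directionality the abstract describes.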
Affiliation(s)
- Kijong Yi
  - Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
- Su Yeon Kim
  - Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
- Thomas Bleazard
  - National Institute for Biological Standards and Control, Blanche Lane, South Mimms, Potters Bar, Hertfordshire, EN6 3QG UK
- Taewoo Kim
  - Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
- Jeonghwan Youk
  - Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
  - GENOME INSIGHT Inc, Daejeon, 34051 Korea
- Young Seok Ju
  - Graduate School of Medical Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141 Korea
  - GENOME INSIGHT Inc, Daejeon, 34051 Korea
10. Hoxie I, Dennehy JJ. Rotavirus A Genome Segments Show Distinct Segregation and Codon Usage Patterns. Viruses 2021; 13:v13081460. [PMID: 34452326 PMCID: PMC8402926 DOI: 10.3390/v13081460] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 05/06/2021] [Revised: 07/05/2021] [Accepted: 07/07/2021] [Indexed: 12/29/2022] Open
Abstract
Reassortment of the Rotavirus A (RVA) 11-segment dsRNA genome may generate new genome constellations that allow RVA to expand its host range or evade immune responses. Reassortment may also produce phylogenetic incongruities and weakly linked evolutionary histories across the 11 segments, obscuring reassortment-specific epistasis and changes in substitution rates. To determine the co-segregation patterns of RVA segments, we generated time-scaled phylogenetic trees for each of the 11 segments of 789 complete RVA genomes isolated from mammalian hosts and compared the segments’ geodesic distances. We found that segments 4 (VP4) and 9 (VP7) occupied significantly different tree spaces from each other and from the rest of the genome. By contrast, segments 10 and 11 (NSP4 and NSP5/6) occupied nearly indistinguishable tree spaces, suggesting strong co-segregation. Host-species barriers appeared to vary by segment, with segment 9 (VP7) presenting the weakest association with host species. Bayesian Skyride plots were generated for each segment to compare relative genetic diversity among segments over time. All segments showed a dramatic decrease in diversity around 2007 coinciding with the introduction of RVA vaccines. To assess selection pressures, codon adaptation indices and relative codon deoptimization indices were calculated with respect to different host genomes. Codon usage varied by segment with segment 11 (NSP5) exhibiting significantly higher adaptation to host genomes. Furthermore, RVA codon usage patterns appeared optimized for expression in humans and birds relative to the other hosts examined, suggesting that translational efficiency is not a barrier in RVA zoonosis.
Affiliation(s)
- Irene Hoxie
  - Biology Department, The Graduate Center, The City University of New York, New York, NY 10016, USA
  - Biology Department, Queens College, The City University of New York, Flushing, New York, NY 11367, USA
- John J. Dennehy
  - Biology Department, The Graduate Center, The City University of New York, New York, NY 10016, USA
  - Biology Department, Queens College, The City University of New York, Flushing, New York, NY 11367, USA
11. Mordstein C, Cano L, Morales AC, Young B, Ho AT, Rice AM, Liss M, Hurst LD, Kudla G. Transcription, mRNA export and immune evasion shape the codon usage of viruses. Genome Biol Evol 2021; 13:6275682. [PMID: 33988683 PMCID: PMC8410142 DOI: 10.1093/gbe/evab106] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Accepted: 05/10/2021] [Indexed: 12/15/2022] Open
Abstract
The nucleotide composition, dinucleotide composition, and codon usage of many viruses differ from those of their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analysed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host; instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.
Affiliation(s)
- Christine Mordstein
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Laura Cano
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
- Atahualpa Castillo Morales
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Bethan Young
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alexander T Ho
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alan M Rice
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Michael Liss
  - Thermo Fisher Scientific, GENEART GmbH, Regensburg, Germany
- Laurence D Hurst
  - The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Grzegorz Kudla
  - MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
12. Das JK, Chakraborty S, Roy S. A scheme for inferring viral-host associations based on codon usage patterns identifies the most affected signaling pathways during COVID-19. J Biomed Inform 2021; 118:103801. [PMID: 33965637 PMCID: PMC8102073 DOI: 10.1016/j.jbi.2021.103801] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/24/2020] [Revised: 05/02/2021] [Accepted: 05/03/2021] [Indexed: 12/16/2022]
Abstract
Understanding the molecular mechanism of COVID-19 pathogenesis helps in rapid therapeutic target identification. Usually, viral proteins target host proteins in an organized fashion. The expression of any viral gene depends mostly on the host translational machinery. Recent studies report the great significance of codon usage biases in establishing host-viral protein–protein interactions (PPI). Exploring the codon usage patterns between a pair of co-evolved host and viral proteins may present novel insight into the host-viral protein interactomes during disease pathogenesis. Leveraging the similarity in codon usage patterns, we propose a computational scheme to recreate the host-viral protein–protein interaction network. We use host proteins from seventeen (17) essential signaling pathways for our current work towards understanding the possible targeting mechanism of SARS-CoV-2 proteins. We infer both negatively and positively interacting edges in the network. Further, extensive analysis is performed to understand the host PPI network topologically and the attacking behavior of the viral proteins. Our study reveals that viral proteins mostly utilize codons that are rare in the targeted host proteins (negatively correlated interaction). Among them, the non-structural protein NSP3 and the structural protein Spike (S) are the most influential in interacting with multiple host proteins. In the ranking of the most affected pathways, MAPK pathways are observed to be the worst affected during SARS-CoV-2 infection. Several proteins participating in multiple pathways are highly central in the host PPI network and are mostly targeted by multiple viral proteins. We observe many potential targets (host proteins) from the affected pathways associated with various drug molecules, including Arsenic trioxide, Dexamethasone, Hydroxychloroquine, Ritonavir, and Interferon beta, which are either under clinical trial or in use during COVID-19.
Affiliation(s)
- Jayanta Kumar Das
  - Department of Pediatrics, Johns Hopkins University, School of Medicine, MD, USA
- Swarup Roy
  - Network Reconstruction & Analysis (NetRA) Lab, Department of Computer Applications, Sikkim University, Gangtok, India
13. Dimonaco NJ, Salavati M, Shih BB. Computational Analysis of SARS-CoV-2 and SARS-Like Coronavirus Diversity in Human, Bat and Pangolin Populations. Viruses 2020; 13:E49. [PMID: 33396801 PMCID: PMC7823979 DOI: 10.3390/v13010049] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Received: 12/03/2020] [Revised: 12/21/2020] [Accepted: 12/22/2020] [Indexed: 12/14/2022] Open
Abstract
In 2019, a novel coronavirus, SARS-CoV-2/nCoV-19, emerged in Wuhan, China, and has been responsible for the current COVID-19 pandemic. The evolutionary origins of the virus remain elusive and understanding its complex mutational signatures could guide vaccine design and development. As part of the international "CoronaHack" in April 2020, we employed a collection of contemporary methodologies to compare the genomic sequences of coronaviruses isolated from human (SARS-CoV-2; n = 163), bat (bat-CoV; n = 215) and pangolin (pangolin-CoV; n = 7) available in public repositories. We have also noted that the pangolin-CoV isolate MP789 bears a stronger resemblance to SARS-CoV-2 than other pangolin-CoV. Following de novo gene annotation prediction, analyses of gene-gene similarity network, codon usage bias and variant discovery were undertaken. Strong host-associated divergences were noted in ORF3a, ORF6, ORF7a, ORF8 and S, and in codon usage bias profiles. Last, we have characterised several high impact variants (in-frame insertion/deletion or stop gain) in bat-CoV and pangolin-CoV populations, some of which are found in the same amino acid position and may highlight loci of potential functional relevance.
Affiliation(s)
- Nicholas J. Dimonaco
  - Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Wales SY3 3FL, UK
- Mazdak Salavati
  - The Roslin Institute, Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK
- Barbara B. Shih
  - The Roslin Institute, Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Midlothian EH25 9RG, UK