1
|
Nambou K, Anakpa M, Tong YS. Human genes with codon usage bias similar to that of the nonstructural protein 1 gene of influenza A viruses are conjointly involved in the infectious pathogenesis of influenza A viruses. Genetica 2022; 150:97-115. [PMID: 35396627 PMCID: PMC8992787 DOI: 10.1007/s10709-022-00155-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Accepted: 03/24/2022] [Indexed: 11/27/2022]
Abstract
Molecular mechanisms of the non-structural protein 1 (NS1) in influenza A-induced pathological changes remain ambiguous. This study explored the pathogenesis of human infection by influenza A viruses (IAVs) through identifying human genes with codon usage bias (CUB) similar to NS1 gene of these viruses based on the relative synonymous codon usage (RSCU). CUB of the IAV subtypes H1N1, H3N2, H3N8, H5N1, H5N2, H5N8, H7N9 and H9N2 was analyzed and the correlation of RSCU values of NS1 sequences with those of the human genes was calculated. The CUB of NS1 was uneven and codons ending with A/U were preferred. The ENC-GC3 and neutrality plots suggested natural selection as the main determinant for CUB. The RCDI, CAI and SiD values showed that the viruses had a high degree of adaptability to human. A total of 2155 human genes showed significant RSCU-based correlation (p < 0.05 and r > 0.5) with NS1 coding sequences and was considered as human genes with CUB similar to NS1 gene of IAV subtypes. Differences and similarities in the subtype-specific human protein–protein interaction (PPI) networks and their functions were recorded among IAVs subtypes, indicating that NS1 of each IAV subtype has a specific pathogenic mechanism. Processes and pathways involved in influenza, transcription, immune response and cell cycle were enriched in human gene sets retrieved based on the CUB of NS1 gene of IAV subtypes. The present work may advance our understanding on the mechanism of NS1 in human infections of IAV subtypes and shed light on the therapeutic options.
Collapse
Affiliation(s)
- Komi Nambou
- Shenzhen Nambou1 Biotech Company Limited, 998 Wisdom Valley, No. 38-56 Zhenming Road, Guangming District, Shenzhen, 518106, China.
| | - Manawa Anakpa
- Centre d'Informatique et de Calcul, Université de Lomé, Boulevard Gnassingbé Eyadema, 01 B.P. 1515, Lomé, Togo
| | - Yin Selina Tong
- Shenzhen Nambou1 Biotech Company Limited, 998 Wisdom Valley, No. 38-56 Zhenming Road, Guangming District, Shenzhen, 518106, China
| |
Collapse
|
2
|
Codon Usage Bias in Phytoplankton. JOURNAL OF MARINE SCIENCE AND ENGINEERING 2022. [DOI: 10.3390/jmse10020168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Non-random usage of synonymous codons, known as “codon bias”, has been described in many organisms, from bacteria to Drosophila, but little is known about it in phytoplankton. This phenomenon is thought to be driven by selection for translational efficiency. As the efficacy of selection is proportional to the effective population size, species with large population sizes, such as phytoplankton, are expected to have strong codon bias. To test this, we measured codon bias in 215 strains from Haptophyta, Chlorophyta, Ochrophyta (except diatoms that were studied previously), Dinophyta, Cryptophyta, Ciliophora, unicellular Rhodophyta and Chlorarachniophyta. Codon bias is modest in most groups, despite the astronomically large population sizes of marine phytoplankton. The strength of the codon bias, measured with the effective number of codons, is the strongest in Haptophyta and the weakest in Chlorarachniophyta. The optimal codons are GC-ending in most cases, but several shifts to AT-ending codons were observed (mainly in Ochrophyta and Ciliophora). As it takes a long time to reach a new equilibrium after such shifts, species having AT-ending codons show a lower frequency of optimal codons compared to other species. Genetic diversity, calculated for species with more than three strains sequenced, is modest, indicating that the effective population sizes are many orders of magnitude lower than the astronomically large census population sizes, which helps to explain the modest codon bias in marine phytoplankton. This study represents the first comparative analysis of codon bias across multiple major phytoplankton groups.
Collapse
|
3
|
Simón D, Cristina J, Musto H. Nucleotide Composition and Codon Usage Across Viruses and Their Respective Hosts. Front Microbiol 2021; 12:646300. [PMID: 34262534 PMCID: PMC8274242 DOI: 10.3389/fmicb.2021.646300] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Accepted: 06/04/2021] [Indexed: 11/13/2022] Open
Abstract
The genetic material of the three domains of life (Bacteria, Archaea, and Eukaryota) is always double-stranded DNA, and their GC content (molar content of guanine plus cytosine) varies between ≈ 13% and ≈ 75%. Nucleotide composition is the simplest way of characterizing genomes. Despite this simplicity, it has several implications. Indeed, it is the main factor that determines, among other features, dinucleotide frequencies, repeated short DNA sequences, and codon and amino acid usage. Which forces drive this strong variation is still a matter of controversy. For rather obvious reasons, most of the studies concerning this huge variation and its consequences, have been done in free-living organisms. However, no recent comprehensive study of all known viruses has been done (that is, concerning all available sequences). Viruses, by far the most abundant biological entities on Earth, are the causative agents of many diseases. An overview of these entities is important also because their genetic material is not always double-stranded DNA: indeed, certain viruses have as genetic material single-stranded DNA, double-stranded RNA, single-stranded RNA, and/or retro-transcribing. Therefore, one may wonder if what we have learned about the evolution of GC content and its implications in prokaryotes and eukaryotes also applies to viruses. In this contribution, we attempt to describe compositional properties of ∼ 10,000 viral species: base composition (globally and according to Baltimore classification), correlations among non-coding regions and the three codon positions, and the relationship of the nucleotide frequencies and codon usage of viruses with the same feature of their hosts. This allowed us to determine how the base composition of phages strongly correlate with the value of their respective hosts, while eukaryotic viruses do not (with fungi and protists as exceptions). Finally, we discuss some of these results concerning codon usage: reinforcing previous results, we found that phages and hosts exhibit moderate to high correlations, while for eukaryotes and their viruses the correlations are weak or do not exist.
Collapse
Affiliation(s)
- Diego Simón
- Laboratorio de Genómica Evolutiva, Departamento de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay.,Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la Republica, Montevideo, Uruguay.,Laboratorio de Evolución Experimental de Virus, Institut Pasteur de Montevideo, Montevideo, Uruguay
| | - Juan Cristina
- Laboratorio de Virología Molecular, Centro de Investigaciones Nucleares, Facultad de Ciencias, Universidad de la Republica, Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Genómica Evolutiva, Departamento de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| |
Collapse
|
4
|
Bousquet L, Hemon C, Malburet P, Bucchini F, Vandepoele K, Grimsley N, Moreau H, Echeverria M. The medium-size noncoding RNA transcriptome of Ostreococcus tauri, the smallest living eukaryote, reveals a large family of small nucleolar RNAs displaying multiple genomic expression strategies. NAR Genom Bioinform 2020; 2:lqaa080. [PMID: 33575626 PMCID: PMC7671301 DOI: 10.1093/nargab/lqaa080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 08/31/2020] [Accepted: 09/17/2020] [Indexed: 11/14/2022] Open
Abstract
The small nucleolar RNAs (snoRNAs), essential for ribosome biogenesis, constitute a major family of medium-size noncoding RNAs (mncRNAs) in all eukaryotes. We present here, for the first time in a marine unicellular alga, the characterization of the snoRNAs family in Ostreococcus tauri, the smallest photosynthetic eukaryote. Using a transcriptomic approach, we identified 131 O. tauri snoRNAs (Ot–snoRNA) distributed in three classes: the C/D snoRNAs, the H/ACA snoRNAs and the MRP RNA. Their genomic organization revealed a unique combination of both the intronic organization of animals and the polycistronic organization of plants. Remarkably, clustered genes produced Ot–snoRNAs with unusual structures never previously described in plants. Their abundances, based on quantification of reads and northern blots, showed extreme differences in Ot–snoRNA accumulation, mainly determined by their differential stability. Most of these Ot–snoRNAs were predicted to target rRNAs or snRNAs. Seventeen others were orphan Ot–snoRNAs that would not target rRNA. These were specific to O. tauri or Mamiellophyceae and could have functions unrelated to ribosome biogenesis. Overall, these data reveal an ‘evolutionary response’ adapted to the extreme compactness of the O. tauri genome that accommodates the essential Ot–snoRNAs, developing multiple strategies to optimize their coordinated expression with a minimal cost on regulatory circuits.
Collapse
Affiliation(s)
- Laurie Bousquet
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
| | - Claire Hemon
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
| | - Paul Malburet
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
| | - François Bucchini
- Department of Plant Systems Biology,VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
| | - Klaas Vandepoele
- Department of Plant Systems Biology,VIB, 9052 Ghent, Belgium
- Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052 Ghent, Belgium
- Bioinformatic Institute Ghent, Ghent University, 9052 Ghent, Belgium
| | - Nigel Grimsley
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
| | - Hervé Moreau
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
| | - Manuel Echeverria
- Sorbonne Université, CNRS, Laboratoire de Biologie Intégrative des Organismes Marins , UMR7232, F-66650 Banyuls sur Mer, France
- Département de Biologie, Université de Perpignan via Domitia, 66860 Perpignan Cedex, France
| |
Collapse
|
5
|
Duncan GA, Dunigan DD, Van Etten JL. Diversity of tRNA Clusters in the Chloroviruses. Viruses 2020; 12:v12101173. [PMID: 33081353 PMCID: PMC7589089 DOI: 10.3390/v12101173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 10/12/2020] [Accepted: 10/12/2020] [Indexed: 11/25/2022] Open
Abstract
Viruses rely on their host’s translation machinery for the synthesis of their own proteins. Problems belie viral translation when the host has a codon usage bias (CUB) that is different from an infecting virus due to differences in the GC content between the host and virus genomes. Here, we examine the hypothesis that chloroviruses adapted to host CUB by acquisition and selection of tRNAs that at least partially favor their own CUB. The genomes of 41 chloroviruses comprising three clades, each infecting a different algal host, have been sequenced, assembled and annotated. All 41 viruses not only encode tRNAs, but their tRNA genes are located in clusters. While differences were observed between clades and even within clades, seven tRNA genes were common to all three clades of chloroviruses, including the tRNAArg gene, which was found in all 41 chloroviruses. By comparing the codon usage of one chlorovirus algal host, in which the genome has been sequenced and annotated (67% GC content), to that of two of its viruses (40% GC content), we found that the viruses were able to at least partially overcome the host’s CUB by encoding tRNAs that recognize AU-rich codons. Evidence presented herein supports the hypothesis that a chlorovirus tRNA cluster was present in the most recent common ancestor (MRCA) prior to divergence into three clades. In addition, the MRCA encoded a putative isoleucine lysidine synthase (TilS) that remains in 39/41 chloroviruses examined herein, suggesting a strong evolutionary pressure to retain the gene. TilS alters the anticodon of tRNAMet that normally recognizes AUG to then recognize AUA, a codon for isoleucine. This is advantageous to the chloroviruses because the AUA codon is 12–13 times more common in the chloroviruses than their host, further helping the chloroviruses to overcome CUB. Among large DNA viruses infecting eukaryotes, the presence of tRNA genes and tRNA clusters appear to be most common in the Phycodnaviridae and, to a lesser extent, in the Mimiviridae.
Collapse
Affiliation(s)
- Garry A. Duncan
- Nebraska Center for Virology, University of Nebraska-Lincoln, Lincoln, NE 68583-0900, USA; (G.A.D.); (D.D.D.)
| | - David D. Dunigan
- Nebraska Center for Virology, University of Nebraska-Lincoln, Lincoln, NE 68583-0900, USA; (G.A.D.); (D.D.D.)
- Department of Plant Pathology, University of Nebraska-Lincoln, Lincoln, NE 68583-0833, USA
| | - James L. Van Etten
- Nebraska Center for Virology, University of Nebraska-Lincoln, Lincoln, NE 68583-0900, USA; (G.A.D.); (D.D.D.)
- Department of Plant Pathology, University of Nebraska-Lincoln, Lincoln, NE 68583-0833, USA
- Correspondence: ; Tel.: +1-402-472-3168
| |
Collapse
|
6
|
Ge Z, Li X, Cao X, Wang R, Hu W, Gen L, Han S, Shang Y, Liu Y, Zhou JH. Viral adaption of staphylococcal phage: A genome-based analysis of the selective preference based on codon usage Bias. Genomics 2020; 112:4657-4665. [PMID: 32818632 DOI: 10.1016/j.ygeno.2020.08.012] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 07/19/2020] [Accepted: 08/11/2020] [Indexed: 12/09/2022]
Abstract
Given the high therapeutic value of the staphylococcal phage, the genome co-evolution of the phage and the host has gained great attention. Though the genome-wide AT richness in staphylococcal phages has been well-studied with nucleotide usage bias, here we proved that host factor, lifestyle and taxonomy are also important factors in understanding the phage nucleotide usages bias using information entropy formula. Such correlation is especially prominent when it comes to the synonymous codon usages of staphylococcal phages, despite the overall scattered codon usage pattern represented by principal component analysis. This strong relationship is explained by nucleotide skew which testified that the usage biases of nucleotide at different codon positions are acting on synonymous codons. Therefore, our study reveals a hidden relationship of genome evolution with host limitation and phagic phenotype, providing new insight into phage genome evolution at genetic level.
Collapse
Affiliation(s)
- Zhiyi Ge
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Xuerui Li
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Xiaoan Cao
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Rui Wang
- Viterbi School of Engineering, University of Southern California, Los Angeles, CA 90089, United States of America
| | - Wen Hu
- Gansu Police Vocational College, Lanzhou 730046, Gansu, PR China
| | - Ling Gen
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Shengyi Han
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China; The College of Veterinary Medicine, Gansu Agricultural University, Lanzhou 730070, Gansu Province, PR China
| | - Youjun Shang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Yongsheng Liu
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China
| | - Jian-Hua Zhou
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu, PR China.
| |
Collapse
|
7
|
Zinoviev A, Kuroha K, Pestova TV, Hellen CUT. Two classes of EF1-family translational GTPases encoded by giant viruses. Nucleic Acids Res 2019; 47:5761-5776. [PMID: 31216040 PMCID: PMC6582330 DOI: 10.1093/nar/gkz296] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2019] [Revised: 04/10/2019] [Accepted: 04/12/2019] [Indexed: 01/31/2023] Open
Abstract
Giant viruses have extraordinarily large dsDNA genomes, and exceptionally, they encode various components of the translation apparatus, including tRNAs, aminoacyl-tRNA synthetases and translation factors. Here, we focused on the elongation factor 1 (EF1) family of viral translational GTPases (trGTPases), using computational and functional approaches to shed light on their functions. Multiple sequence alignment indicated that these trGTPases clustered into two groups epitomized by members of Mimiviridae and Marseilleviridae, respectively. trGTPases in the first group were more closely related to GTP-binding protein 1 (GTPBP1), whereas trGTPases in the second group were closer to eEF1A, eRF3 and Hbs1. Functional characterization of representative GTPBP1-like trGTPases (encoded by Hirudovirus, Catovirus and Moumouvirus) using in vitro reconstitution revealed that they possess eEF1A-like activity and can deliver cognate aa-tRNAs to the ribosomal A site during translation elongation. By contrast, representative eEF1A/eRF3/Hbs1-like viral trGTPases, encoded by Marseillevirus and Lausannevirus, have eRF3-like termination activity and stimulate peptide release by eRF1. Our analysis identified specific aspects of the functioning of these viral trGTPases with eRF1 of human, amoebal and Marseillevirus origin.
Collapse
Affiliation(s)
- Alexandra Zinoviev
- Department of Cell Biology, SUNY Downstate Medical Center, 450 Clarkson Avenue, MSC 44, Brooklyn, NY 11203, USA
| | - Kazushige Kuroha
- Department of Cell Biology, SUNY Downstate Medical Center, 450 Clarkson Avenue, MSC 44, Brooklyn, NY 11203, USA
| | - Tatyana V Pestova
- Department of Cell Biology, SUNY Downstate Medical Center, 450 Clarkson Avenue, MSC 44, Brooklyn, NY 11203, USA
| | - Christopher U T Hellen
- Department of Cell Biology, SUNY Downstate Medical Center, 450 Clarkson Avenue, MSC 44, Brooklyn, NY 11203, USA
| |
Collapse
|
8
|
Sanchez F, Geffroy S, Norest M, Yau S, Moreau H, Grimsley N. Simplified Transformation of Ostreococcus tauri Using Polyethylene Glycol. Genes (Basel) 2019; 10:E399. [PMID: 31130696 PMCID: PMC6562926 DOI: 10.3390/genes10050399] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Revised: 05/16/2019] [Accepted: 05/21/2019] [Indexed: 12/21/2022] Open
Abstract
Ostreococcustauri is an easily cultured representative of unicellular algae (class Mamiellophyceae) that abound in oceans worldwide. Eight complete 13-22 Mb genomes of phylogenetically divergent species within this class are available, and their DNA sequences are nearly always present in metagenomic data produced from marine samples. Here we describe a simplified and robust transformation protocol for the smallest of these algae (O. tauri). Polyethylene glycol (PEG) treatment was much more efficient than the previously described electroporation protocol. Short (2 min or less) incubation times in PEG gave >104 transformants per microgram DNA. The time of cell recovery after transformation could be reduced to a few hours, permitting the experiment to be done in a day rather than overnight as used in previous protocols. DNA was randomly inserted in the O. tauri genome. In our hands PEG was 20-40-fold more efficient than electroporation for the transformation of O. tauri, and this improvement will facilitate mutagenesis of all of the dispensable genes present in the tiny O. tauri genome.
Collapse
Affiliation(s)
- Frédéric Sanchez
- CNRS UMR7232 BIOM (Biologie Intégrative des Organismes Marin) Sorbonne University, 66650 Banyuls sur Mer, France.
| | - Solène Geffroy
- IFREMER, Centre Atlantique, 44331 Nantes CEDEX 03, France.
| | - Manon Norest
- CNRS UMR7232 BIOM (Biologie Intégrative des Organismes Marin) Sorbonne University, 66650 Banyuls sur Mer, France.
| | - Sheree Yau
- CNRS UMR7232 BIOM (Biologie Intégrative des Organismes Marin) Sorbonne University, 66650 Banyuls sur Mer, France.
| | - Hervé Moreau
- CNRS UMR7232 BIOM (Biologie Intégrative des Organismes Marin) Sorbonne University, 66650 Banyuls sur Mer, France.
| | - Nigel Grimsley
- CNRS UMR7232 BIOM (Biologie Intégrative des Organismes Marin) Sorbonne University, 66650 Banyuls sur Mer, France.
| |
Collapse
|
9
|
Erives AJ. Phylogenetic analysis of the core histone doublet and DNA topo II genes of Marseilleviridae: evidence of proto-eukaryotic provenance. Epigenetics Chromatin 2017; 10:55. [PMID: 29179736 PMCID: PMC5704553 DOI: 10.1186/s13072-017-0162-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 11/15/2017] [Indexed: 11/15/2022] Open
Abstract
Background While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like “doublets” in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Results Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. Conclusions The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of the four core histone clades. We thus suggest MV histone doublet genes and their DNA topo II gene possibly were acquired from an organism with a chromatinized replisome that diverged prior to the origin of eukaryotic core histone variants for H2/H2A.Z and H3/cenH3. These results also imply that core histones were utilized ancestrally in viral DNA compaction and/or protection from host endonucleases. Electronic supplementary material The online version of this article (10.1186/s13072-017-0162-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Albert J Erives
- Department of Biology, University of Iowa, Iowa City, IA, 52242-1324, USA.
| |
Collapse
|
10
|
Mioduser O, Goz E, Tuller T. Significant differences in terms of codon usage bias between bacteriophage early and late genes: a comparative genomics analysis. BMC Genomics 2017; 18:866. [PMID: 29132309 PMCID: PMC5683454 DOI: 10.1186/s12864-017-4248-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 10/31/2017] [Indexed: 11/13/2022] Open
Abstract
Background Viruses undergo extensive evolutionary selection for efficient replication which effects, among others, their codon distribution. In the current study, we aimed at understanding the way evolution shapes the codon distribution in early vs. late viral genes in terms of their expression during different stages in the viral replication cycle. To this end we analyzed 14 bacteriophages and 11 human viruses with available information about the expression phases of their genes. Results We demonstrated evidence of selection for distinct composition of synonymous codons in early and late viral genes in 50% of the analyzed bacteriophages. Among others, this phenomenon may be related to the time specific adaptation of the viral genes to the translation efficiency factors involved at different bacteriophage developmental stages. Specifically, we showed that the differences in codon composition in different temporal gene groups cannot be explained only by phylogenetic proximities between the analyzed bacteriophages, and can be partially explained by differences in the adaptation to the host tRNA pool, nucleotide bias, GC content and more. In contrast, no difference in temporal regulation of synonymous codon usage was observed in human viruses, possibly because of a stronger selection pressure due to a larger effective population size in bacteriophages and their bacterial hosts. Conclusions The codon distribution in large fractions of bacteriophage genomes tend to be different in early and late genes. This phenomenon seems to be related to various aspects of the viral life cycle, and to various intracellular processes. We believe that the reported results should contribute towards better understanding of viral evolution and may promote the development of relevant procedures in synthetic virology. Electronic supplementary material The online version of this article (10.1186/s12864-017-4248-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Oriah Mioduser
- Department of Biomedical Engineering, Tel-Aviv University, Ramat Aviv, Israel
| | - Eli Goz
- Department of Biomedical Engineering, Tel-Aviv University, Ramat Aviv, Israel.,SynVaccineLtd. Ramat Hachayal, Tel Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Ramat Aviv, Israel. .,SynVaccineLtd. Ramat Hachayal, Tel Aviv, Israel. .,Sagol School of Neuroscience, Tel-Aviv University, Ramat Aviv, Israel.
| |
Collapse
|
11
|
Esposito LA, Gupta S, Streiter F, Prasad A, Dennehy JJ. Evolutionary interpretations of mycobacteriophage biodiversity and host-range through the analysis of codon usage bias. Microb Genom 2016; 2:e000079. [PMID: 28348827 PMCID: PMC5359403 DOI: 10.1099/mgen.0.000079] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Accepted: 07/18/2016] [Indexed: 12/31/2022] Open
Abstract
In an genomics course sponsored by the Howard Hughes Medical Institute (HHMI), undergraduate students have isolated and sequenced the genomes of more than 1,150 mycobacteriophages, creating the largest database of sequenced bacteriophages able to infect a single host, Mycobacterium smegmatis, a soil bacterium. Genomic analysis indicates that these mycobacteriophages can be grouped into 26 clusters based on genetic similarity. These clusters span a continuum of genetic diversity, with extensive genomic mosaicism among phages in different clusters. However, little is known regarding the primary hosts of these mycobacteriophages in their natural habitats, nor of their broader host ranges. As such, it is possible that the primary host of many newly isolated mycobacteriophages is not M. smegmatis, but instead a range of closely related bacterial species. However, determining mycobacteriophage host range presents difficulties associated with mycobacterial cultivability, pathogenicity and growth. Another way to gain insight into mycobacteriophage host range and ecology is through bioinformatic analysis of their genomic sequences. To this end, we examined the correlations between the codon usage biases of 199 different mycobacteriophages and those of several fully sequenced mycobacterial species in order to gain insight into the natural host range of these mycobacteriophages. We find that UPGMA clustering tends to match, but not consistently, clustering by shared nucleotide sequence identify. In addition, analysis of GC content, tRNA usage and correlations between mycobacteriophage and mycobacterial codon usage bias suggests that the preferred host of many clustered mycobacteriophages is not M. smegmatis but other, as yet unknown, members of the mycobacteria complex or closely allied bacterial species.
Collapse
Affiliation(s)
| | - Swati Gupta
- Biology Department, Queens College, Queens, NY 11367, USA
| | | | - Ashley Prasad
- Biology Department, Queens College, Queens, NY 11367, USA
| | - John J. Dennehy
- Biology Department, Queens College, Queens, NY 11367, USA
- Biology PhD Program, The Graduate Center of the City University of New York, New York, NY 10016, USA
- Correspondence John J. Dennehy ()
| |
Collapse
|
12
|
Delesalle VA, Tanke NT, Vill AC, Krukonis GP. Testing hypotheses for the presence of tRNA genes in mycobacteriophage genomes. BACTERIOPHAGE 2016; 6:e1219441. [PMID: 27738556 DOI: 10.1080/21597081.2016.1219441] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Revised: 07/27/2016] [Accepted: 07/28/2016] [Indexed: 10/21/2022]
Abstract
The presence of tRNA genes in bacteriophages has been explained on the basis of codon usage (tRNA genes are retained in the phage genome if they correspond to codons more common in the phage than in its host) or amino acid usage (independent of codon, the amino acid corresponding to the retained tRNA gene is more common in the phage genome than in the bacterial host). The existence of a large database of sequenced mycobacteriophages, isolated on the common host Mycobacterium smegmatis, allows us to test the above hypotheses as well as explore other hypotheses for the presence of tRNA genes. Our analyses suggest that amino acid rather than codon usage better explains the presence of tRNA genes in mycobacteriophages. However, closely related phages that differ in the presence of tRNA genes in their genomes are capable of lysing the common bacterial host and do not differ in codon or amino acid usage. This suggests that the benefits of having tRNA genes may be associated with either growth in the host or the ability to infect more hosts (i.e., host range) rather than simply infecting a particular host.
Collapse
Affiliation(s)
| | - Natalie T Tanke
- Department of Biology, Gettysburg College , Gettysburg, PA, USA
| | - Albert C Vill
- Department of Biology, Gettysburg College , Gettysburg, PA, USA
| | - Greg P Krukonis
- Department of Biology, Bucknell University , Lewisburg, PA, USA
| |
Collapse
|
13
|
Virocell Metabolism: Metabolic Innovations During Host-Virus Interactions in the Ocean. Trends Microbiol 2016; 24:821-832. [PMID: 27395772 DOI: 10.1016/j.tim.2016.06.006] [Citation(s) in RCA: 110] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Revised: 06/06/2016] [Accepted: 06/13/2016] [Indexed: 11/24/2022]
Abstract
Marine viruses are considered to be major ecological, evolutionary, and biogeochemical drivers of the marine environment, responsible for nutrient recycling and determining species composition. Viruses can re-shape their host's metabolic network during infection, generating the virocell-a unique metabolic state that supports their specific requirement. Here we discuss the concept of 'virocell metabolism' and its formation by rewiring of host-encoded metabolic networks, or by introducing virus-encoded auxiliary metabolic genes which provide the virocell with novel metabolic capabilities. The ecological role of marine viruses is commonly assessed by their relative abundance and phylogenetic diversity, lacking the ability to assess the dynamics of active viral infection. The new ability to define a unique metabolic state of the virocell will expand the current virion-centric approaches in order to quantify the impact of marine viruses on microbial food webs.
Collapse
|
14
|
Barthélémy RM, Seligmann H. Cryptic tRNAs in chaetognath mitochondrial genomes. Comput Biol Chem 2016; 62:119-32. [DOI: 10.1016/j.compbiolchem.2016.04.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2015] [Revised: 04/11/2016] [Accepted: 04/14/2016] [Indexed: 12/14/2022]
|
15
|
Li N, Li Y, Zheng C, Huang J, Zhang S. Genome-wide comparative analysis of the codon usage patterns in plants. Genes Genomics 2016. [DOI: 10.1007/s13258-016-0417-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
16
|
Kumar S, Kumari R, Sharma V. Coevolution mechanisms that adapt viruses to genetic code variations implemented in their hosts. J Genet 2016; 95:3-12. [PMID: 27019427 DOI: 10.1007/s12041-016-0612-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Sushil Kumar
- SKA Institution for Research, Education and Development, 4/11 SarvPriya Vihar, New Delhi 110016, India.
| | | | | |
Collapse
|
17
|
Blanc-Mathieu R, Verhelst B, Derelle E, Rombauts S, Bouget FY, Carré I, Château A, Eyre-Walker A, Grimsley N, Moreau H, Piégu B, Rivals E, Schackwitz W, Van de Peer Y, Piganeau G. An improved genome of the model marine alga Ostreococcus tauri unfolds by assessing Illumina de novo assemblies. BMC Genomics 2014; 15:1103. [PMID: 25494611 PMCID: PMC4378021 DOI: 10.1186/1471-2164-15-1103] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 11/19/2014] [Indexed: 12/17/2022] Open
Abstract
Background Cost effective next generation sequencing technologies now enable the production of genomic datasets for many novel planktonic eukaryotes, representing an understudied reservoir of genetic diversity. O. tauri is the smallest free-living photosynthetic eukaryote known to date, a coccoid green alga that was first isolated in 1995 in a lagoon by the Mediterranean sea. Its simple features, ease of culture and the sequencing of its 13 Mb haploid nuclear genome have promoted this microalga as a new model organism for cell biology. Here, we investigated the quality of genome assemblies of Illumina GAIIx 75 bp paired-end reads from Ostreococcus tauri, thereby also improving the existing assembly and showing the genome to be stably maintained in culture. Results The 3 assemblers used, ABySS, CLCBio and Velvet, produced 95% complete genomes in 1402 to 2080 scaffolds with a very low rate of misassembly. Reciprocally, these assemblies improved the original genome assembly by filling in 930 gaps. Combined with additional analysis of raw reads and PCR sequencing effort, 1194 gaps have been solved in total adding up to 460 kb of sequence. Mapping of RNAseq Illumina data on this updated genome led to a twofold reduction in the proportion of multi-exon protein coding genes, representing 19% of the total 7699 protein coding genes. The comparison of the DNA extracted in 2001 and 2009 revealed the fixation of 8 single nucleotide substitutions and 2 deletions during the approximately 6000 generations in the lab. The deletions either knocked out or truncated two predicted transmembrane proteins, including a glutamate-receptor like gene. Conclusion High coverage (>80 fold) paired-end Illumina sequencing enables a high quality 95% complete genome assembly of a compact ~13 Mb haploid eukaryote. This genome sequence has remained stable for 6000 generations of lab culture. Electronic supplementary material The online version of this article (doi:10.1186/1471-2164-15-1103) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Gwenaël Piganeau
- CNRS, UMR 7232, Observatoire Océanologique, Avenue du Fontaulé, BP44, 66650 Banyuls-sur-Mer, France.
| |
Collapse
|
18
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022] Open
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Collapse
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| |
Collapse
|
19
|
Codon usage bias of the phosphoprotein gene of spring viraemia of carp virus and high codon adaptation to the host. Arch Virol 2014; 159:1841-7. [PMID: 24519460 DOI: 10.1007/s00705-014-2000-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2013] [Accepted: 10/05/2013] [Indexed: 10/25/2022]
Abstract
In this study, we calculated the relative synonymous codon usage (RSCU) value and the effective number of codons (ENC) value to carry out principal component analysis (PCA) and correlation analysis of the codon usage pattern of the phosphoprotein gene (P gene) of spring viraemia of carp virus (SVCV). The synonymous codon usage pattern in P genes is geography-specific, based on PCA analysis. The high correlation between (G + C)1,2 % and (G + C)3 % suggests that mutational pressure rather than natural selection is the main factor that determines the codon usage and base components in P genes. At least 40 out of 59 synonymous codons are similarly selected in all functional genes within five complete SVCV genomes, and the hosts based on the RSCU data. These results not only provide insight into variations in the codon usage pattern of SVCV but also may help in understanding the processes governing the evolution of SVCV.
Collapse
|
20
|
Complex codon usage pattern and compositional features of retroviruses. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013; 2013:848123. [PMID: 24288576 PMCID: PMC3833384 DOI: 10.1155/2013/848123] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Revised: 09/05/2013] [Accepted: 09/07/2013] [Indexed: 11/26/2022]
Abstract
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Collapse
|