1
|
Fan K, Li Y, Chen Z, Fan L. GenRCA: a user-friendly rare codon analysis tool for comprehensive evaluation of codon usage preferences based on coding sequences in genomes. BMC Bioinformatics 2024; 25:309. [PMID: 39333857 PMCID: PMC11438159 DOI: 10.1186/s12859-024-05934-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 09/17/2024] [Indexed: 09/30/2024] Open
Abstract
BACKGROUND The study of codon usage bias is important for understanding gene expression, evolution and gene design, providing critical insights into the molecular processes that govern the function and regulation of genes. Codon Usage Bias (CUB) indices are valuable metrics for understanding codon usage patterns across different organisms without extensive experiments. Considering that there is no one-fits-all index for all species, a comprehensive platform supporting the calculation and analysis of multiple CUB indices for codon optimization is greatly needed. RESULTS Here, we release GenRCA, an updated version of our previous Rare Codon Analysis Tool, as a free and user-friendly website for all-inclusive evaluation of codon usage preferences of coding sequences. In this study, we manually reviewed and implemented up to 31 codon preference indices, with 65 expression host organisms covered and batch processing of multiple gene sequences supported, aiming to improve the user experience and provide more comprehensive and efficient analysis. CONCLUSIONS Our website fills a gap in the availability of comprehensive tools for species-specific CUB calculations, enabling researchers to thoroughly assess the protein expression level based on a comprehensive list of 31 indices and further guide the codon optimization.
Collapse
Affiliation(s)
- Kunjie Fan
- Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai, China
| | - Yuanyuan Li
- Production and R&D Center I of LSS, GenScript Biotech Corporation, Nanjing, China
| | - Zhiwei Chen
- Production and R&D Center I of LSS, GenScript Biotech Corporation, Nanjing, China
| | - Long Fan
- Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai, China.
| |
Collapse
|
2
|
Villanueva E, Smith T, Pizzinga M, Elzek M, Queiroz RML, Harvey RF, Breckels LM, Crook OM, Monti M, Dezi V, Willis AE, Lilley KS. System-wide analysis of RNA and protein subcellular localization dynamics. Nat Methods 2024; 21:60-71. [PMID: 38036857 PMCID: PMC10776395 DOI: 10.1038/s41592-023-02101-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 10/24/2023] [Indexed: 12/02/2023]
Abstract
Although the subcellular dynamics of RNA and proteins are key determinants of cell homeostasis, their characterization is still challenging. Here we present an integrative framework to simultaneously interrogate the dynamics of the transcriptome and proteome at subcellular resolution by combining two methods: localization of RNA (LoRNA) and a streamlined density-based localization of proteins by isotope tagging (dLOPIT) to map RNA and protein to organelles (nucleus, endoplasmic reticulum and mitochondria) and membraneless compartments (cytosol, nucleolus and cytosolic granules). Interrogating all RNA subcellular locations at once enables system-wide quantification of the proportional distribution of RNA. We obtain a cell-wide overview of localization dynamics for 31,839 transcripts and 5,314 proteins during the unfolded protein response, revealing that endoplasmic reticulum-localized transcripts are more efficiently recruited to cytosolic granules than cytosolic RNAs, and that the translation initiation factor eIF3d is key to sustaining cytoskeletal function. Overall, we provide the most comprehensive overview so far of RNA and protein subcellular localization dynamics.
Collapse
Affiliation(s)
- Eneko Villanueva
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Smith
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Mariavittoria Pizzinga
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
- Structural Biology Research Centre, Human Technopole, Milan, Italy
| | - Mohamed Elzek
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Rayner M L Queiroz
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Lisa M Breckels
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Oliver M Crook
- Department of Statistics, University of Oxford, Oxford, UK
| | - Mie Monti
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Veronica Dezi
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Anne E Willis
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK.
| | - Kathryn S Lilley
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
3
|
Hribovšek P, Olesin Denny E, Dahle H, Mall A, Øfstegaard Viflot T, Boonnawa C, Reeves EP, Steen IH, Stokke R. Putative novel hydrogen- and iron-oxidizing sheath-producing Zetaproteobacteria thrive at the Fåvne deep-sea hydrothermal vent field. mSystems 2023; 8:e0054323. [PMID: 37921472 PMCID: PMC10734525 DOI: 10.1128/msystems.00543-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 10/02/2023] [Indexed: 11/04/2023] Open
Abstract
IMPORTANCE Knowledge on microbial iron oxidation is important for understanding the cycling of iron, carbon, nitrogen, nutrients, and metals. The current study yields important insights into the niche sharing, diversification, and Fe(III) oxyhydroxide morphology of Ghiorsea, an iron- and hydrogen-oxidizing Zetaproteobacteria representative belonging to Zetaproteobacteria operational taxonomic unit 9. The study proposes that Ghiorsea exhibits a more extensive morphology of Fe(III) oxyhydroxide than previously observed. Overall, the results increase our knowledge on potential drivers of Zetaproteobacteria diversity in iron microbial mats and can eventually be used to develop strategies for the cultivation of sheath-forming Zetaproteobacteria.
Collapse
Affiliation(s)
- Petra Hribovšek
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Earth Science, University of Bergen, Bergen, Norway
| | - Emily Olesin Denny
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Biological Sciences, University of Bergen, Bergen, Norway
- Computational Biology Unit, University of Berge, Bergen, Norway
| | - Håkon Dahle
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Biological Sciences, University of Bergen, Bergen, Norway
- Computational Biology Unit, University of Berge, Bergen, Norway
| | - Achim Mall
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Thomas Øfstegaard Viflot
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Earth Science, University of Bergen, Bergen, Norway
| | - Chanakan Boonnawa
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Earth Science, University of Bergen, Bergen, Norway
| | - Eoghan P. Reeves
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Earth Science, University of Bergen, Bergen, Norway
| | - Ida Helene Steen
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Runar Stokke
- Centre for Deep Sea Research, University of Bergen, Bergen, Norway
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| |
Collapse
|
4
|
Kurt S, Kaymaz Y, Ateş D, Tanyolaç MB. Complete chloroplast genome of Lens lamottei reveals intraspecies variation among with Lens culinaris. Sci Rep 2023; 13:14959. [PMID: 37696838 PMCID: PMC10495401 DOI: 10.1038/s41598-023-41287-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 08/24/2023] [Indexed: 09/13/2023] Open
Abstract
Lens lamottei is a member of the Fabaceae family and the second gene pool of the genus Lens. The environmental factors that drove the divergence among wild and cultivated species have been studied extensively. Recent research has focused on genomic signatures associated with various phenotypes with the acceleration of next-generation techniques in molecular profiling. Therefore, in this study, we provide the complete sequence of the chloroplast genome sequence in the wild Lens species L. lamottei with a deep coverage of 713 × next-generation sequencing (NGS) data for the first time. Compared to the cultivated species, Lens culinaris, we identified synonymous, and nonsynonymous changes in the protein-coding regions of the genes ndhB, ndhF, ndhH, petA, rpoA, rpoC2, rps3, and ycf2 in L. lamottei. Phylogenetic analysis of chloroplast genomes of various plants under Leguminosae revealed that L. lamottei and L. culinaris are closest to one another than to other species. The complete chloroplast genome of L. lamottei also allowed us to reanalyze previously published transcriptomic data, which showed high levels of gene expression for ATP-synthase, rubisco, and photosystem genes. Overall, this study provides a deeper insight into the diversity of Lens species and the agricultural importance of these plants through their chloroplast genomes.
Collapse
Affiliation(s)
- Selda Kurt
- Faculty of Engineering, Department of Bioengineering, Ege University, Izmir, Turkey
| | - Yasin Kaymaz
- Faculty of Engineering, Department of Bioengineering, Ege University, Izmir, Turkey
| | - Duygu Ateş
- Faculty of Engineering, Department of Bioengineering, Ege University, Izmir, Turkey
| | | |
Collapse
|
5
|
Alonso AM, Diambra L. Dicodon-based measures for modeling gene expression. Bioinformatics 2023; 39:btad380. [PMID: 37307098 PMCID: PMC10287933 DOI: 10.1093/bioinformatics/btad380] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 05/20/2023] [Accepted: 06/09/2023] [Indexed: 06/14/2023] Open
Abstract
MOTIVATION Codon usage preference patterns have been associated with modulation of translation efficiency, protein folding, and mRNA decay. However, new studies support that codon pair usage has also a remarkable effect at the gene expression level. Here, we expand the concept of CAI to answer if codon pair usage patterns can be understood in terms of codon usage bias, or if they offer new information regarding coding translation efficiency. RESULTS Through the implementation of a weighting strategy to consider the dicodon contributions, we observe that the dicodon-based measure has greater correlations with gene expression level than CAI. Interestingly, we have noted that dicodons associated with a low value of adaptiveness are related to dicodons which mediate strong translational inhibition in yeast. We have also noticed that some codon-pairs have a smaller dicodon contribution than estimated by the product of the respective codon contributions. AVAILABILITY AND IMPLEMENTATION Scripts, implemented in Python, are freely available for download at https://zenodo.org/record/7738276#.ZBIDBtLMIdU.
Collapse
Affiliation(s)
- Andres M Alonso
- Instituto Tecnológico Chascomús (INTECH), CONICET-UNSAM, Intendente Marino km 8.2, Chascomús, 7130 Provincia de Buenos Aires, Argentina
- CCT-La Plata, CONICET, Calle 8 Nº 1467, La Plata, B1904CMC Provincia de Buenos Aires, Argentina
| | - Luis Diambra
- CCT-La Plata, CONICET, Calle 8 Nº 1467, La Plata, B1904CMC Provincia de Buenos Aires, Argentina
- Centro Regional de Estudios Genómicos, FCE-UNLP, Blvd 120 N∘ 1461, La Plata, 1900 Provincia de Buenos Aires, Argentina
| |
Collapse
|
6
|
Paukszto Ł, Górski P, Krawczyk K, Maździarz M, Szczecińska M, Ślipiko M, Sawicki J. The organellar genomes of Pellidae (Marchantiophyta): the evidence of cryptic speciation, conflicting phylogenies and extraordinary reduction of mitogenomes in simple thalloid liverwort lineage. Sci Rep 2023; 13:8303. [PMID: 37221210 DOI: 10.1038/s41598-023-35269-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 05/15/2023] [Indexed: 05/25/2023] Open
Abstract
Organellar genomes of liverworts are considered as one of the most stable among plants, with rare events of gene loss and structural rearrangements. However, not all lineages of liverworts are equally explored in the field of organellar genomics, and subclass Pellidae is one of the less known. Hybrid assembly, using both short- and long-read technologies enabled the assembly of repeat-rich mitogenomes of Pellia and Apopellia revealing extraordinary reduction of length in the latter which impacts only intergenic spacers. The mitogenomes of Apopellia were revealed to be the smallest among all known liverworts-109 k bp, despite retaining all introns. The study also showed the loss of one tRNA gene in Apopellia mitogenome, although it had no impact on the codon usage pattern of mitochondrial protein coding genes. Moreover, it was revealed that Apopellia and Pellia differ in codon usage by plastome CDSs, despite identical tRNA gene content. Molecular identification of species is especially important where traditional taxonomic methods fail, especially within Pellidae where cryptic speciation is well recognized. The simple morphology of these species and a tendency towards environmental plasticity make them complicated in identification. Application of super-barcodes, based on complete mitochondrial or plastid genomes sequences enable identification of all cryptic lineages within Apopellia and Pellia genera, however in some particular cases, mitogenomes were more efficient in species delimitation than plastomes.
Collapse
Affiliation(s)
- Łukasz Paukszto
- Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, 10-727, Olsztyn, Poland.
| | - Piotr Górski
- Department of Botany, Poznań University of Life Sciences, ul. Wojska Polskiego 71C, 60-625, Poznań, Poland
| | - Katarzyna Krawczyk
- Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, 10-727, Olsztyn, Poland
| | - Mateusz Maździarz
- Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, 10-727, Olsztyn, Poland
| | - Monika Szczecińska
- Department of Ecology and Environmental Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 3, 10-727, Olsztyn, Poland
| | - Monika Ślipiko
- Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, 10-727, Olsztyn, Poland
| | - Jakub Sawicki
- Department of Botany and Nature Protection, University of Warmia and Mazury in Olsztyn, Plac Łódzki 1, 10-727, Olsztyn, Poland
| |
Collapse
|
7
|
Sahoo S, Rakshit R. The pattern of coding sequences in the chloroplast genome of Atropa belladonna and a comparative analysis with other related genomes in the nightshade family. Genomics Inform 2022; 20:e43. [PMID: 36617650 PMCID: PMC9847383 DOI: 10.5808/gi.22045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/12/2022] [Indexed: 12/31/2022] Open
Abstract
Atropa belladonna is a valuable medicinal plant and a commercial source of tropane alkaloids, which are frequently utilized in therapeutic practice. In this study, bioinformaticmethodologies were used to examine the pattern of coding sequences and the factors thatmight influence codon usage bias in the chloroplast genome of Atropa belladonna andother nightshade genomes. The chloroplast engineering being a promising field in modernbiotechnology, the characterization of chloroplast genome is very important. The resultsrevealed that the chloroplast genomes of Nicotiana tabacum, Solanum lycopersicum, Capsicum frutescens, Datura stramonium, Lyciumbarbarum, Solanum melongena, and Solanumtuberosum exhibited comparable codon usage patterns. In these chloroplast genomes, weobserved a weak codon usage bias. According to the correspondence analysis, the genesisof the codon use bias in these chloroplast genes might be explained by natural selection,directed mutational pressure, and other factors. GC12 and GC3S were shown to have nomeaningful relationship. Further research revealed that natural selection primarily shapedthe codon usage in A. belladonna and other nightshade genomes for translational efficiency. The sequencing properties of these chloroplast genomes were also investigated by investing the occurrences of palindromes and inverted repeats, which would be useful forfuture research on medicinal plants.
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat 743372, India,*Corresponding author E-mail:
| | - Ria Rakshit
- Department of Botany, Baruipur College, Baruipur 743610, India
| |
Collapse
|
8
|
Benchmarking Community-Wide Estimates of Growth Potential from Metagenomes Using Codon Usage Statistics. mSystems 2022; 7:e0074522. [PMID: 36190138 PMCID: PMC9600850 DOI: 10.1128/msystems.00745-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Trait inference from mixed-species assemblages is a central problem in microbial ecology. Frequently, sequencing information from an environment is available, but phenotypic measurements from individual community members are not. With the increasing availability of molecular data for microbial communities, bioinformatic approaches that map metagenome to (meta)phenotype are needed. Recently, we developed a tool, gRodon, that enables the prediction of the maximum growth rate of an organism from genomic data on the basis of codon usage patterns. Our work and that of other groups suggest that such predictors can be applied to mixed-species communities in order to derive estimates of the average community-wide maximum growth rate. Here, we present an improved maximum growth rate predictor designed for metagenomes that corrects a persistent GC bias in the original gRodon model for metagenomic prediction. We benchmark this predictor with simulated metagenomic data sets to show that it has superior performance on mixed-species communities relative to earlier models. We go on to provide guidance on data preprocessing and show that calling genes from assembled contigs rather than directly from reads dramatically improves performance. Finally, we apply our predictor to large-scale metagenomic data sets from marine and human microbiomes to illustrate how community-wide growth prediction can be a powerful approach for hypothesis generation. Altogether, we provide an updated tool with clear guidelines for users about the uses and pitfalls of metagenomic prediction of the average community-wide maximal growth rate. IMPORTANCE Microbes dominate nearly every known habitat, and therefore tools to survey the structure and function of natural microbial communities are much needed. Metagenomics, in which the DNA content of an entire community of organisms is sequenced all at once, allows us to probe the genetic diversity contained in a habitat. Yet, mapping metagenomic information to the actual traits of community members is a difficult and largely unsolved problem. Here, we present and validate a tool that allows users to predict the average maximum growth rate of a microbial community directly from metagenomic data. Maximum growth rate is a fundamental characteristic of microbial species that can give us a great deal of insight into their ecological role, and by applying our community-level predictor to large-scale metagenomic data sets from marine and human-associated microbiomes, we show how community-wide growth prediction can be a powerful approach for hypothesis generation.
Collapse
|
9
|
Tayşi N, Kaymaz Y, Ateş D, Sari H, Toker C, Tanyolaç MB. Complete chloroplast genome sequence of Lens ervoides and comparison to Lens culinaris. Sci Rep 2022; 12:15068. [PMID: 36064865 PMCID: PMC9445179 DOI: 10.1038/s41598-022-17877-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 08/02/2022] [Indexed: 12/05/2022] Open
Abstract
Lens is a member of the Papilionoideae subfamily of Fabaceae and is generally used as a source of vegetable protein as part of human diets in many regions worldwide. Chloroplast (cp) genomes are highly active genetic components of plants and can be utilized as molecular markers for various purposes. As one of the wild lentil species, the Lens ervoides cp genome has been sequenced for the first time in this study using next-generation sequencing. The de novo assembly of the cp genome resulted in a single 122,722 bp sequence as two separate coexisting structural haplotypes with similar lengths. Results indicated that the cp genome of L. ervoides belongs to the inverted repeat lacking clade. Several noteworthy divergences within the coding regions were observed in ndhB, ndhF, rbcL, rpoC2, and ycf2 genes. Analysis of relative synonymous codon usage showed that certain genes, psbN, psaI, psbI, psbE, psbK, petD, and ndhC, preferred using biased codons more often and therefore might have elevated expression and translation efficiencies. Overall, this study exhibited the divergence level between the wild-type and cultured lentil cp genomes and pointed to certain regions that can be utilized as distinction markers for various goals.
Collapse
Affiliation(s)
- Nurbanu Tayşi
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Yasin Kaymaz
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Duygu Ateş
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey
| | - Hatice Sari
- Department of Field Crops, Faculty of Agriculture, Akdeniz University, Antalya, Turkey
| | - Cengiz Toker
- Department of Field Crops, Faculty of Agriculture, Akdeniz University, Antalya, Turkey
| | - M Bahattin Tanyolaç
- Bioengineering Department, Faculty of Engineering, Ege University, Izmir, Turkey.
| |
Collapse
|
10
|
Steffens L, Pettinato E, Steiner TM, Eisenreich W, Berg IA. Tracking the Reversed Oxidative Tricarboxylic Acid Cycle in Bacteria. Bio Protoc 2022; 12:e4364. [PMID: 35434198 PMCID: PMC8983159 DOI: 10.21769/bioprotoc.4364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 11/22/2021] [Accepted: 02/10/2022] [Indexed: 12/29/2022] Open
Abstract
Different pathways for autotrophic CO2 fixation can be recognized by the presence of genes for their specific key enzymes. On this basis, (meta)genomic, (meta)transcriptomic, or (meta)proteomic analysis enables the identification of the role of an organism or a distinct pathway in primary production. However, the recently discovered variant of the reductive tricarboxylic acid (rTCA) cycle, the reverse oxidative tricarboxylic acid (roTCA) cycle, lacks unique enzymes, a feature that makes it cryptic for bioinformatics analysis. This pathway is a reversal of the widespread tricarboxylic acid (TCA) cycle. The functioning of the roTCA cycle requires unusually high activity of citrate synthase, the enzyme responsible for citrate cleavage, as well as elevated CO2 partial pressures. Here, we present a detailed description of the protocol we used for the identification of the roTCA cycle in members of Desulfurellaceae. First, we describe the anaerobic cultivation of Desulfurellaceae at different CO2 concentrations with a method that can be adapted to the cultivation of other anaerobes. Then, we explain how to measure activities of enzymes responsible for citrate cleavage, malate dehydrogenase reaction, and the crucial carboxylation step of the cycle catalyzed by pyruvate synthase in cell extracts. In conclusion, we describe stable isotope experiments that allow tracking of the roTCA cycle in vivo, through the position-specific incorporation of carbon-13 into amino acids. The label is provided to the organism as 13CO2 or [1-13C]glutamate. The same key methodology can be used for the reliable evaluation of the functioning of the roTCA cycle in any organism under study. This pathway is likely to participate, completely unseen, in the metabolism of various microorganisms. Graphic abstract.
Collapse
Affiliation(s)
- Lydia Steffens
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany
| | - Eugenio Pettinato
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany
| | - Thomas M. Steiner
- Bavarian NMR Center–Structural Membrane Biochemistry, Department of Chemistry, Technische Universität München, Garching, Germany
| | - Wolfgang Eisenreich
- Bavarian NMR Center–Structural Membrane Biochemistry, Department of Chemistry, Technische Universität München, Garching, Germany
,
*For correspondence: ;
| | - Ivan A. Berg
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany
,
*For correspondence: ;
| |
Collapse
|
11
|
Wang Z, Cai Q, Wang Y, Li M, Wang C, Wang Z, Jiao C, Xu C, Wang H, Zhang Z. Comparative Analysis of Codon Bias in the Chloroplast Genomes of Theaceae Species. Front Genet 2022; 13:824610. [PMID: 35360853 PMCID: PMC8961065 DOI: 10.3389/fgene.2022.824610] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 01/31/2022] [Indexed: 11/13/2022] Open
Abstract
Theaceae species are dicotyledonous angiosperms with extremely high ornamental and economic value. The chloroplast genome is traditionally used to study species evolution, expression of chloroplast genes and chloroplast transformation. Codon usage bias (CUB) analysis is beneficial for investigations of evolutionary relationships and can be used to improve gene expression efficiency in genetic transformation research. However, there are relatively few systematic studies of the CUB in the chloroplast genomes of Theaceae species. In this study, CUB and nucleotide compositions parameters were determined by the scripts written in the Perl language, CodonW 1.4.2, CU.Win2000, RStudio and SPSS 23.0. The chloroplast genome data of 40 Theaceae species were obtained to analyse the codon usage (CU) characteristics of the coding regions and the influence of the source of variation on CUB. To explore the relationship between the CUB and gene expression levels in these 40 Theaceae plastomes, the synonymous codon usage order (SCUO) and measure independent of length and composition (MILC) values were determined. Finally, phylogenetic analysis revealed the genetic evolutionary relationships among these Theaceae species. Our results showed that based on the chloroplast genomes of these 40 Theaceae species, the CUB was for codons containing A/T bases and those that ended with A/T bases. Moreover, there was great commonality in the CUB of the Theaceae species according to comparative analysis of relative synonymous codon usage (RSCU) and relative frequency of synonymous codon (RFSC): these species had 29 identical codons with bias (RSCU > 1), and there were 19 identical high-frequency codons. The CUB of Theaceae species is mainly affected by natural selection. The SCUO value of the 40 Theaceae species was 0.23 or 0.24, and the chloroplast gene expression level was moderate, according to MILC values. Additionally, we observed a positive correlation between the SCUO and MILC values, which indicated that CUB might affect gene expression. Furthermore, the phylogenetic analysis showed that the evolutionary relationships in these 40 Theaceae species were relatively conserved. A systematic study on the CUB and expression of Theaceae species provides further evidence for their evolution and phylogeny.
Collapse
Affiliation(s)
- Zhanjun Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
| | - Qianwen Cai
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Yue Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Minhui Li
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chenchen Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoxia Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Chunyan Jiao
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Congcong Xu
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Hongyan Wang
- College of Life Sciences, Hefei Normal University, Hefei, China
| | - Zhaoliang Zhang
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, China
- *Correspondence: Zhaoliang Zhang,
| |
Collapse
|
12
|
Suzuki S, Matsuzaki R, Yamaguchi H, Kawachi M. What happened before losses of photosynthesis in cryptophyte algae? Mol Biol Evol 2022; 39:6513384. [PMID: 35079797 PMCID: PMC8829904 DOI: 10.1093/molbev/msac001] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
In many lineages of algae and land plants, photosynthesis was lost multiple times independently. Comparative analyses of photosynthetic and secondary nonphotosynthetic relatives have revealed the essential functions of plastids, beyond photosynthesis. However, evolutionary triggers and processes that drive the loss of photosynthesis remain unknown. Cryptophytes are microalgae with complex plastids derived from a red alga. They include several secondary nonphotosynthetic species with closely related photosynthetic taxa. In this study, we found that a cryptophyte, Cryptomonas borealis, is in a stage just prior to the loss of photosynthesis. Cryptomonas borealis was mixotrophic, possessed photosynthetic activity, and grew independent of light. The plastid genome of C. borealis had distinct features, including increases of group II introns with mobility, frequent genome rearrangements, incomplete loss of inverted repeats, and abundant small/medium/large-sized structural variants. These features provide insight into the evolutionary process leading to the loss of photosynthesis.
Collapse
Affiliation(s)
- Shigekatsu Suzuki
- Biodiversity Division, National Institute for Environmental Studies, Tsukuba, Ibaraki, 305-8506, Japan
| | - Ryo Matsuzaki
- Biodiversity Division, National Institute for Environmental Studies, Tsukuba, Ibaraki, 305-8506, Japan
- Faculty of Life and Environmental Sciences, University of Tsukuba, Tsukuba, Ibaraki, 305-8577, Japan
| | - Haruyo Yamaguchi
- Biodiversity Division, National Institute for Environmental Studies, Tsukuba, Ibaraki, 305-8506, Japan
| | - Masanobu Kawachi
- Biodiversity Division, National Institute for Environmental Studies, Tsukuba, Ibaraki, 305-8506, Japan
| |
Collapse
|
13
|
Zeng Z, Aptekmann AA, Bromberg Y. Decoding the effects of synonymous variants. Nucleic Acids Res 2021; 49:12673-12691. [PMID: 34850938 PMCID: PMC8682775 DOI: 10.1093/nar/gkab1159] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 11/02/2021] [Accepted: 11/08/2021] [Indexed: 12/12/2022] Open
Abstract
Synonymous single nucleotide variants (sSNVs) are common in the human genome but are often overlooked. However, sSNVs can have significant biological impact and may lead to disease. Existing computational methods for evaluating the effect of sSNVs suffer from the lack of gold-standard training/evaluation data and exhibit over-reliance on sequence conservation signals. We developed synVep (synonymous Variant effect predictor), a machine learning-based method that overcomes both of these limitations. Our training data was a combination of variants reported by gnomAD (observed) and those unreported, but possible in the human genome (generated). We used positive-unlabeled learning to purify the generated variant set of any likely unobservable variants. We then trained two sequential extreme gradient boosting models to identify subsets of the remaining variants putatively enriched and depleted in effect. Our method attained 90% precision/recall on a previously unseen set of variants. Furthermore, although synVep does not explicitly use conservation, its scores correlated with evolutionary distances between orthologs in cross-species variation analysis. synVep was also able to differentiate pathogenic vs. benign variants, as well as splice-site disrupting variants (SDV) vs. non-SDVs. Thus, synVep provides an important improvement in annotation of sSNVs, allowing users to focus on variants that most likely harbor effects.
Collapse
Affiliation(s)
- Zishuo Zeng
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
| | - Ariel A Aptekmann
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
| | - Yana Bromberg
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ 08873, USA
- Department of Genetics, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
14
|
Pecka-Kiełb E, Kowalewska-Łuczak I, Czerniawska-Piątkowska E, Króliczewska B. FASN, SCD1 and ANXA9 gene polymorphism as genetic predictors of the fatty acid profile of sheep milk. Sci Rep 2021; 11:23761. [PMID: 34887487 PMCID: PMC8660767 DOI: 10.1038/s41598-021-03186-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Accepted: 11/23/2021] [Indexed: 01/05/2023] Open
Abstract
In this study, single nucleotide polymorphisms (SNPs) in the ANXA9 (annexin 9), FASN (fatty acid synthase) and SCD1 (stearoyl-CoA desaturase 1) genes were analyzed as factors influencing fatty acid profiles in milk from Zošľachtená valaška sheep. SNP in selected genes was identified using polymerase chain reaction (PCR) and restriction fragment length polymorphism (PCR–RFLP). The long-chain fatty acids profile in sheep milk was identified by gas chromatography. Statistical analysis of the SCD1/Cfr13I polymorphism showed that the milk of the homozygous AA animals was characterized by a lower (P < 0.05) share of C4:0, C6:0, C8:0, C10:0, C12:0, C14:0 in comparison to the homozygous CC sheep. The milk of heterozygous sheep was characterized by a higher (P < 0.05) proportion of C13:0 acid compared to the milk of sheep with the homozygous AA type. A higher (P < 0.05) level of saturated fatty acids (SFA) was found in the milk of CC genotype sheep compared to the AA genotype. Our results lead to the conclusion that the greatest changes were observed for the SCD1/Cfr13I polymorphism and the least significant ones for FASN/AciI. Moreover, it is the first evidence that milk from sheep with SCD1/Cfr13I polymorphism and the homozygous AA genotype showed the most desirable fatty acids profile.
Collapse
Affiliation(s)
- Ewa Pecka-Kiełb
- Department of Biostructure and Animal Physiology, Faculty of Veterinary Medicine, Wroclaw University of Environmental and Life Sciences, Norwida 31, 50-375, Wrocław, Poland.
| | - Inga Kowalewska-Łuczak
- Department of Genetics, Faculty of Biotechnology and Animal Husbandry, West Pomeranian University of Technology in Szczecin, Piastów Avenue 45, 79-311, Szczecin, Poland
| | - Ewa Czerniawska-Piątkowska
- Department of Ruminant Science, Faculty of Biotechnology and Animal Husbandry, West Pomeranian University of Technology in Szczecin, Klemensa Janickiego 29, 71-270, Szczecin, Poland
| | - Bożena Króliczewska
- Department of Biostructure and Animal Physiology, Faculty of Veterinary Medicine, Wroclaw University of Environmental and Life Sciences, Norwida 31, 50-375, Wrocław, Poland
| |
Collapse
|
15
|
Mazumder TH, Alqahtani AM, Alqahtani T, Emran TB, A. Aldahish A, Uddin A. Analysis of Codon Usage of Speech Gene FoxP2 among Animals. BIOLOGY 2021; 10:biology10111078. [PMID: 34827071 PMCID: PMC8614651 DOI: 10.3390/biology10111078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 10/12/2021] [Accepted: 10/16/2021] [Indexed: 12/03/2022]
Abstract
Simple Summary We evaluated codon usage bias in the FoxP2 gene in fishes, birds, reptiles, and mammals. Fishes use C or G—ending codons, while birds, reptiles, and mammals employ T or A—ending codons. Apart from the nucleotide composition, natural selection and mutation pressure might influence the CUB. The ENC observed/ENC expected ratio demonstrated that mutation pressure influences FoxP2 codon usage patterns. Natural selection may have had a key influence in shaping the CUB, although mutation pressure may have played a minor role. FoxP2 gene codon usage is affected by the base composition under mutation bias. Abstract The protein-coding gene FoxP2 (fork head box protein P2) plays a major role in communication and evolutionary changes. The present study carried out a comprehensive codon usage bias analysis in the FoxP2 gene among a diverse group of animals including fishes, birds, reptiles, and mammals. We observed that in the genome of fishes for the FoxP2 gene, codons ending with C or G were most frequently used, while in birds, reptiles, and mammals, codons ending with T or A were most frequently used. A higher ENC value was observed for the FoxP2 gene indicating a lower CUB. Parity role two-bias plots suggested that apart from mutation pressure, other factors such as natural selection might have influenced the CUB. The frequency distribution of the ENC observed and ENC expected ratio revealed that mutation pressure plays a key role in the patterns of codon usage of FoxP2. Besides, correspondence analysis exposed the composition of the nucleobase under mutation bias affects the codon usage of the FoxP2 gene. However, neutrality plots revealed the major role of natural selection over mutation pressure in the CUB of FoxP2. In addition, the codon usage patterns for FoxP2 among the selected genomes suggested that nature has favored nearly all the synonymous codons for encoding the corresponding amino acid. The uniform usage of 12 synonymous codons for FoxP2 was observed among the species of birds. The amino acid usage frequency for FoxP2 revealed that the amino acids Leucine, Glutamine, and Serine were predominant over other amino acids among all the species of fishes, birds, reptiles, and mammals.
Collapse
Affiliation(s)
| | - Ali M. Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Taha Alqahtani
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Talha Bin Emran
- Department of Pharmacy, BGC Trust University Bangladesh, Chittagong 4381, Bangladesh;
| | - Afaf A. Aldahish
- Department of Pharmacology, College of Pharmacy, King Khalid University, Abha 62529, Saudi Arabia; (A.M.A.); (T.A.); (A.A.A.)
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial College, Hailakandi 788150, Assam, India
- Correspondence:
| |
Collapse
|
16
|
Estimating maximal microbial growth rates from cultures, metagenomes, and single cells via codon usage patterns. Proc Natl Acad Sci U S A 2021; 118:2016810118. [PMID: 33723043 PMCID: PMC8000110 DOI: 10.1073/pnas.2016810118] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Despite the wide perception that microbes have rapid growth rates, many environments like seawater and soil are often dominated by microorganisms that can only grow very slowly. Our knowledge about growth is necessarily biased toward easily culturable organisms, which tend to be those that grow fast, because microbial growth rates have traditionally been measured using laboratory growth experiments. However, how are potential growth rates distributed in nature? Using genomic data, we predicted the growth rates of over 200,000 organisms, including many as yet uncultivated species. These data reveal how current culture collections are strongly biased toward fast-growing organisms. Finally, we noticed a bimodal distribution of maximal growth rates, suggesting a natural division of microbial growth strategies into two classes. Maximal growth rate is a basic parameter of microbial lifestyle that varies over several orders of magnitude, with doubling times ranging from a matter of minutes to multiple days. Growth rates are typically measured using laboratory culture experiments. Yet, we lack sufficient understanding of the physiology of most microbes to design appropriate culture conditions for them, severely limiting our ability to assess the global diversity of microbial growth rates. Genomic estimators of maximal growth rate provide a practical solution to survey the distribution of microbial growth potential, regardless of cultivation status. We developed an improved maximal growth rate estimator and predicted maximal growth rates from over 200,000 genomes, metagenome-assembled genomes, and single-cell amplified genomes to survey growth potential across the range of prokaryotic diversity; extensions allow estimates from 16S rRNA sequences alone as well as weighted community estimates from metagenomes. We compared the growth rates of cultivated and uncultivated organisms to illustrate how culture collections are strongly biased toward organisms capable of rapid growth. Finally, we found that organisms naturally group into two growth classes and observed a bias in growth predictions for extremely slow-growing organisms. These observations ultimately led us to suggest evolutionary definitions of oligotrophy and copiotrophy based on the selective regime an organism occupies. We found that these growth classes are associated with distinct selective regimes and genomic functional potentials.
Collapse
|
17
|
Bahiri-Elitzur S, Tuller T. Codon-based indices for modeling gene expression and transcript evolution. Comput Struct Biotechnol J 2021; 19:2646-2663. [PMID: 34025951 PMCID: PMC8122159 DOI: 10.1016/j.csbj.2021.04.042] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Revised: 04/17/2021] [Accepted: 04/18/2021] [Indexed: 11/21/2022] Open
Abstract
Codon usage bias (CUB) refers to the phenomena that synonymous codons are used in different frequencies in most genes and organisms. The general assumption is that codon biases reflect a balance between mutational biases and natural selection. Today we understand that the codon content is related and can affect all gene expression steps. Starting from the 1980s, codon-based indices have been used for answering different questions in all biomedical fields, including systems biology, agriculture, medicine, and biotechnology. In general, codon usage bias indices weigh each codon or a small set of codons to estimate the fitting of a certain coding sequence to a certain phenomenon (e.g., bias in codons, adaptation to the tRNA pool, frequencies of certain codons, transcription elongation speed, etc.) and are usually easy to implement. Today there are dozens of such indices; thus, this paper aims to review and compare the different codon usage bias indices, their applications, and advantages. In addition, we perform analysis that demonstrates that most indices tend to correlate even though they aim to capture different aspects. Due to the centrality of codon usage bias on different gene expression steps, it is important to keep developing new indices that can capture additional aspects that are not modeled with the current indices.
Collapse
Affiliation(s)
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv, Israel
| |
Collapse
|
18
|
Steffens L, Pettinato E, Steiner TM, Mall A, König S, Eisenreich W, Berg IA. High CO 2 levels drive the TCA cycle backwards towards autotrophy. Nature 2021; 592:784-788. [PMID: 33883741 DOI: 10.1038/s41586-021-03456-9] [Citation(s) in RCA: 53] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 03/15/2021] [Indexed: 02/02/2023]
Abstract
It has recently been shown that in anaerobic microorganisms the tricarboxylic acid (TCA) cycle, including the seemingly irreversible citrate synthase reaction, can be reversed and used for autotrophic fixation of carbon1,2. This reversed oxidative TCA cycle requires ferredoxin-dependent 2-oxoglutarate synthase instead of the NAD-dependent dehydrogenase as well as extremely high levels of citrate synthase (more than 7% of the proteins in the cell). In this pathway, citrate synthase replaces ATP-citrate lyase of the reductive TCA cycle, which leads to the spending of one ATP-equivalent less per one turn of the cycle. Here we show, using the thermophilic sulfur-reducing deltaproteobacterium Hippea maritima, that this route is driven by high partial pressures of CO2. These high partial pressures are especially important for the removal of the product acetyl coenzyme A (acetyl-CoA) through reductive carboxylation to pyruvate, which is catalysed by pyruvate synthase. The reversed oxidative TCA cycle may have been functioning in autotrophic CO2 fixation in a primordial atmosphere that is assumed to have been rich in CO2.
Collapse
Affiliation(s)
- Lydia Steffens
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany
| | - Eugenio Pettinato
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany
| | - Thomas M Steiner
- Bavarian NMR Center-Structural Membrane Biochemistry, Department of Chemistry, Technische Universität München, Garching, Germany
| | - Achim Mall
- K. G. Jebsen Centre for Deep Sea Research, University of Bergen, Bergen, Norway.,Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Simone König
- Core Unit Proteomics, Interdisciplinary Center for Clinical Research, Medical Faculty, University of Münster, Münster, Germany
| | - Wolfgang Eisenreich
- Bavarian NMR Center-Structural Membrane Biochemistry, Department of Chemistry, Technische Universität München, Garching, Germany.
| | - Ivan A Berg
- Institute for Molecular Microbiology and Biotechnology, University of Münster, Münster, Germany.
| |
Collapse
|
19
|
Cai L, Arnold BJ, Xi Z, Khost DE, Patel N, Hartmann CB, Manickam S, Sasirat S, Nikolov LA, Mathews S, Sackton TB, Davis CC. Deeply Altered Genome Architecture in the Endoparasitic Flowering Plant Sapria himalayana Griff. (Rafflesiaceae). Curr Biol 2021; 31:1002-1011.e9. [DOI: 10.1016/j.cub.2020.12.045] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 12/11/2020] [Accepted: 12/23/2020] [Indexed: 12/18/2022]
|
20
|
Futo M, Opašić L, Koska S, Čorak N, Široki T, Ravikumar V, Thorsell A, Lenuzzi M, Kifer D, Domazet-Lošo M, Vlahoviček K, Mijakovic I, Domazet-Lošo T. Embryo-Like Features in Developing Bacillus subtilis Biofilms. Mol Biol Evol 2021; 38:31-47. [PMID: 32871001 PMCID: PMC7783165 DOI: 10.1093/molbev/msaa217] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Correspondence between evolution and development has been discussed for more than two centuries. Recent work reveals that phylogeny-ontogeny correlations are indeed present in developmental transcriptomes of eukaryotic clades with complex multicellularity. Nevertheless, it has been largely ignored that the pervasive presence of phylogeny-ontogeny correlations is a hallmark of development in eukaryotes. This perspective opens a possibility to look for similar parallelisms in biological settings where developmental logic and multicellular complexity are more obscure. For instance, it has been increasingly recognized that multicellular behavior underlies biofilm formation in bacteria. However, it remains unclear whether bacterial biofilm growth shares some basic principles with development in complex eukaryotes. Here we show that the ontogeny of growing Bacillus subtilis biofilms recapitulates phylogeny at the expression level. Using time-resolved transcriptome and proteome profiles, we found that biofilm ontogeny correlates with the evolutionary measures, in a way that evolutionary younger and more diverged genes were increasingly expressed toward later timepoints of biofilm growth. Molecular and morphological signatures also revealed that biofilm growth is highly regulated and organized into discrete ontogenetic stages, analogous to those of eukaryotic embryos. Together, this suggests that biofilm formation in Bacillus is a bona fide developmental process comparable to organismal development in animals, plants, and fungi. Given that most cells on Earth reside in the form of biofilms and that biofilms represent the oldest known fossils, we anticipate that the widely adopted vision of the first life as a single-cell and free-living organism needs rethinking.
Collapse
Affiliation(s)
- Momir Futo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Luka Opašić
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
- Department for Evolutionary Theory, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Sara Koska
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Nina Čorak
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Tin Široki
- Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
| | - Vaishnavi Ravikumar
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
| | - Annika Thorsell
- Proteomics Core Facility, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Maša Lenuzzi
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
- Department of Evolutionary Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Domagoj Kifer
- Faculty of Pharmacy and Biochemistry, University of Zagreb, Zagreb, Croatia
| | - Mirjana Domazet-Lošo
- Faculty of Electrical Engineering and Computing, University of Zagreb, Zagreb, Croatia
| | - Kristian Vlahoviček
- Bioinformatics Group, Division of Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
- School of Biosciences, University of Skövde, Skövde, Sweden
| | - Ivan Mijakovic
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kgs. Lyngby, Denmark
- Systems and Synthetic Biology Division, Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Tomislav Domazet-Lošo
- Laboratory of Evolutionary Genetics, Division of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
- Catholic University of Croatia, Zagreb, Croatia
| |
Collapse
|
21
|
Chamakura KR, Tran JS, O'Leary C, Lisciandro HG, Antillon SF, Garza KD, Tran E, Min L, Young R. Rapid de novo evolution of lysis genes in single-stranded RNA phages. Nat Commun 2020; 11:6009. [PMID: 33243984 PMCID: PMC7693330 DOI: 10.1038/s41467-020-19860-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 10/30/2020] [Indexed: 12/27/2022] Open
Abstract
Leviviruses are bacteriophages with small single-stranded RNA genomes consisting of 3-4 genes, one of which (sgl) encodes a protein that induces the host to undergo autolysis and liberate progeny virions. Recent meta-transcriptomic studies have uncovered thousands of leviviral genomes, but most of these lack an annotated sgl, mainly due to the small size, lack of sequence similarity, and embedded nature of these genes. Here, we identify sgl genes in 244 leviviral genomes and functionally characterize them in Escherichia coli. We show that leviviruses readily evolve sgl genes and sometimes have more than one per genome. Moreover, these genes share little to no similarity with each other or to previously known sgl genes, thus representing a rich source for potential protein antibiotics.
Collapse
Affiliation(s)
- Karthik R Chamakura
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Jennifer S Tran
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Pharmaceutical Sciences Division, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Chandler O'Leary
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- University of North Texas Health Science Center, Fort Worth, TX, 43210, USA
| | - Hannah G Lisciandro
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Sophia F Antillon
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Kameron D Garza
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
| | - Elizabeth Tran
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- College of Pharmacy, University of North Texas Health Science Center, Fort Worth, TX, 43210, USA
| | - Lorna Min
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA
- Baylor College of Medicine, Houston, TX, 77030, USA
| | - Ry Young
- Center for Phage Technology and Texas A&M AgriLife, Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX, 77843-2128, USA.
| |
Collapse
|
22
|
Narakusumo RP, Riedel A, Pons J. Mitochondrial genomes of twelve species of hyperdiverse Trigonopterus weevils. PeerJ 2020; 8:e10017. [PMID: 33083123 PMCID: PMC7566755 DOI: 10.7717/peerj.10017] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Accepted: 09/01/2020] [Indexed: 11/20/2022] Open
Abstract
Mitochondrial genomes of twelve species of Trigonopterus weevils are presented, ten of them complete. We describe their gene order and molecular features and test their potential for reconstructing the phylogeny of this hyperdiverse genus comprising > 1,000 species. The complete mitochondrial genomes examined herein ranged from 16,501 bp to 21,007 bp in length, with an average AT content of 64.2% to 69.7%. Composition frequencies and skews were generally lower across species for atp6, cox1-3, and cob genes, while atp8 and genes coded on the minus strand showed much higher divergence at both nucleotide and amino acid levels. Most variation within genes was found at the codon level with high variation at third codon sites across species, and with lesser degree at the coding strand level. Two large non-coding regions were found, CR1 (between rrnS and trnI genes) and CR2 (between trnI and trnQ), but both with large variability in length; this peculiar structure of the non-coding region may be a derived character of Curculionoidea. The nad1 and cob genes exhibited an unusually high interspecific length variation of up to 24 bp near the 3' end. This pattern was probably caused by a single evolutionary event since both genes are only separated by trnS2 and length variation is extremely rare in mitochondrial protein coding genes. We inferred phylogenetic trees using protein coding gene sequences implementing both maximum likelihood and Bayesian approaches, each for both nucleotide and amino acid sequences. While some clades could be retrieved from all reconstructions with high confidence, there were also a number of differences and relatively low support for some basal nodes. The best partition scheme of the 13 protein coding sequences obtained by IQTREE suggested that phylogenetic signal is more accurate by splitting sequence variation at the codon site level as well as coding strand, rather than at the gene level. This result corroborated the different patterns found in Trigonopterus regarding to A+T frequencies and AT and GC skews that also greatly diverge at the codon site and coding strand levels.
Collapse
Affiliation(s)
- Raden Pramesa Narakusumo
- State Museum of Natural History Karlsruhe, Karlsruhe, Germany.,Museum Zoologicum Bogoriense, Research Center for Biology, Indonesian Institute of Sciences (LIPI), Cibinong, Indonesia
| | | | - Joan Pons
- Diversidad Animal y Microbiana, Instituto Mediterráneo de Estudios Avanzados IMEDEA (CSIC-UIB), Esporles, Balearic Islands, Spain
| |
Collapse
|
23
|
Chakraborty S, Yengkhom S, Uddin A. Analysis of codon usage bias of chloroplast genes in Oryza species : Codon usage of chloroplast genes in Oryza species. PLANTA 2020; 252:67. [PMID: 32989601 DOI: 10.1007/s00425-020-03470-7] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Accepted: 09/15/2020] [Indexed: 05/11/2023]
Abstract
The codon usage bias in chloroplast genes of Oryza species was low and AT rich. The pattern of codon usage was different among Oryza species and mainly influenced by mutation pressure and natural selection. Codon usage bias (CUB) is the unequal usage of synonymous codons in which some codons are more preferred to others in the coding sequences of genes. It shows a species-specific property. We studied the patterns of codon usage and the factors that influenced the CUB of protein-coding chloroplast (cp) genes in 18 Oryza species as no work was yet reported. The nucleotide composition analysis revealed that the overall GC content of cp genes in different species of Oryza was lower than 50%, i.e., Oryza cp genes were AT rich. Synonymous codon usage order (SCUO) suggested that CUB was weak in the cp genes of different Oryza species. A highly significant correlation was observed between overall nucleotides and its constituents at the third codon position suggesting that both, mutation pressure and natural selection, might influence the CUB. Correspondence analysis (COA) revealed that codon usage pattern differed across Oryza species. In the neutrality plot, a narrow range of GC3 distribution was recorded and some points were diagonally distributed in all the plots, suggesting that natural selection and mutation pressure might have influenced the CUB. The slope of the regression line was < 0.5, augmenting our inference that natural selection might have played a major role, while mutation pressure had a minor role in shaping the CUB of cp genes. The magnitudes of mutation pressure and natural selection on cp genes varied across Oryza species.
Collapse
Affiliation(s)
- Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar, 788011, Assam, India.
| | - Sophiarani Yengkhom
- Department of Biotechnology, Assam University, Silchar, 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi, 788150, Assam, India
| |
Collapse
|
24
|
Codon Usage Optimization in the Prokaryotic Tree of Life: How Synonymous Codons Are Differentially Selected in Sequence Domains with Different Expression Levels and Degrees of Conservation. mBio 2020; 11:mBio.00766-20. [PMID: 32694138 PMCID: PMC7374057 DOI: 10.1128/mbio.00766-20] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The prokaryotic genomes—the current heritage of the most ancient life forms on earth—are comprised of diverse gene sets, all characterized by varied origins, ancestries, and spatial-temporal expression patterns. Such genetic diversity has for a long time raised the question of how cells shape their coding strategies to optimize protein demands (i.e., product abundance) and accuracy (i.e., translation fidelity) through the use of the same genetic code in genomes with GC contents that range from less than 20 to more than 80%. Here, we present evidence on how codon usage is adjusted in the prokaryotic tree of life and on how specific biases have operated to improve translation. Through the use of proteome data, we characterized conserved and variable sequence domains in genes of either high or low expression level and quantitated the relative weight of efficiency and accuracy—as well as their interaction—in shaping codon usage in prokaryotes. Prokaryote genomes exhibit a wide range of GC contents and codon usages, both resulting from an interaction between mutational bias and natural selection. In order to investigate the basis underlying specific codon changes, we performed a comprehensive analysis of 29 different prokaryote families. The analysis of core gene sets with increasing ancestries in each family lineage revealed that the codon usages became progressively more adapted to the tRNA pools. While, as previously reported, highly expressed genes presented the most optimized codon usage, the singletons contained the less selectively favored codons. The results showed that usually codons with the highest translational adaptation were preferentially enriched. In agreement with previous reports, a C bias in 2- to 3-fold pyrimidine-ending codons, and a U bias in 4-fold codons occurred in all families, irrespective of the global genomic GC content. Furthermore, the U biases suggested that U3-mRNA–U34-tRNA interactions were responsible for a prominent codon optimization in both the most ancestral core and the highly expressed genes. A comparative analysis of sequences that encode conserved (cr) or variable (vr) translated products, with each one being under high (HEP) and low (LEP) expression levels, demonstrated that the efficiency was more relevant (by a factor of 2) than accuracy to modeling codon usage. Finally, analysis of the third position of codons (GC3) revealed that in genomes with global GC contents higher than 35 to 40%, selection favored a GC3 increase, whereas in genomes with very low GC contents, a decrease in GC3 occurred. A comprehensive final model is presented in which all patterns of codon usage variations are condensed in four distinct behavioral groups.
Collapse
|
25
|
Genetic evolution and codon usage analysis of NKX-2.5 gene governing heart development in some mammals. Genomics 2019; 112:1319-1329. [PMID: 31377427 DOI: 10.1016/j.ygeno.2019.07.023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 07/26/2019] [Accepted: 07/31/2019] [Indexed: 11/21/2022]
Abstract
NKX-2.5 gene is responsible for cardiac development and its targeted disruption apprehends cardiac development at the linear heart tube stage. Bioinformatic analysis was employed to investigate the codon usage pattern and dN/dS of mammalian NKX-2.5 gene. The relative synonymous codon usage analysis revealed variation in codon usage and two synonymous codons namely ATA (Ile) and GTA (Val) were absent in NKX-2.5 gene across selected mammalian species suggesting that these two codons were possibly selected against during evolution. Parity rule 2 analysis of two and four fold amino acids showed CT bias whereas six-fold amino acids revealed GA bias. Neutrality analysis suggests that selection played a prominent role while mutation had a minor role. The dN/dS analysis suggests synonymous substitution played a significant role and it negatively correlated with p-distance of the gene. Purifying natural selection played a dominant role in the genetic evolution of NKX-2.5 gene in mammals.
Collapse
|
26
|
Abstract
The universal triple-nucleotide genetic code is often viewed as a given, randomly selected through evolution. However, as summarized in this article, many observations and deductions within structural and thermodynamic frameworks help to explain the forces that must have shaped the code during the early evolution of life on Earth.
Collapse
|
27
|
Liu SS, Hockenberry AJ, Jewett MC, Amaral LAN. A novel framework for evaluating the performance of codon usage bias metrics. J R Soc Interface 2019; 15:rsif.2017.0667. [PMID: 29386398 DOI: 10.1098/rsif.2017.0667] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2017] [Accepted: 01/04/2018] [Indexed: 11/12/2022] Open
Abstract
The unequal utilization of synonymous codons affects numerous cellular processes including translation rates, protein folding and mRNA degradation. In order to understand the biological impact of variable codon usage bias (CUB) between genes and genomes, it is crucial to be able to accurately measure CUB for a given sequence. A large number of metrics have been developed for this purpose, but there is currently no way of systematically testing the accuracy of individual metrics or knowing whether metrics provide consistent results. This lack of standardization can result in false-positive and false-negative findings if underpowered or inaccurate metrics are applied as tools for discovery. Here, we show that the choice of CUB metric impacts both the significance and measured effect sizes in numerous empirical datasets, raising questions about the generality of findings in published research. To bring about standardization, we developed a novel method to create synthetic protein-coding DNA sequences according to different models of codon usage. We use these benchmark sequences to identify the most accurate and robust metrics with regard to sequence length, GC content and amino acid heterogeneity. Finally, we show how our benchmark can aid the development of new metrics by providing feedback on its performance compared to the state of the art.
Collapse
Affiliation(s)
- Sophia S Liu
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Adam J Hockenberry
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA.,Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL, USA
| | - Michael C Jewett
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA .,Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL, USA.,Center for Synthetic Biology, Northwestern University, Evanston, IL, USA.,Simpson Querrey BioNanotechnology Institute, Northwestern University, Evanston, IL, USA.,Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
| | - Luís A N Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA .,Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA.,Department of Physics and Astronomy, Northwestern University, Evanston, IL, USA
| |
Collapse
|
28
|
Conserved motifs in nuclear genes encoding predicted mitochondrial proteins in Trypanosoma cruzi. PLoS One 2019; 14:e0215160. [PMID: 30964924 PMCID: PMC6456187 DOI: 10.1371/journal.pone.0215160] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 03/27/2019] [Indexed: 11/19/2022] Open
Abstract
Trypanosoma cruzi, the protozoan parasite that causes Chagas’ disease, exhibits peculiar biological features. Among them, the presence of a unique mitochondrion is remarkable. Even though the mitochondrial DNA constitutes up to 25% of total cellular DNA, the structure and functionality of the mitochondrion are dependent on the expression of the nuclear genome. As in other eukaryotes, specific peptide signals have been proposed to drive the mitochondrial localization of a subset of trypanosomatid proteins. However, there are mitochondrial proteins encoded in the nuclear genome that lack of a peptide signal. In other eukaryotes, alternative protein targeting to subcellular organelles via mRNA localization has also been recognized and specific mRNA localization towards the mitochondria has been described. With the aim of seeking for mitochondrial localization signals in T. cruzi, we developed a strategy to build a comprehensive database of nuclear genes encoding predicted mitochondrial proteins (MiNT) in the TriTryps (T. cruzi, T. brucei and L. major). We found that approximately 15% of their nuclear genome encodes mitochondrial products. In T. cruzi the MiNT database reaches 1438 genes and a conserved peptide signal, M(L/F) R (R/S) SS, named TryM-TaPe is found in 60% of these genes, suggesting that the canonical mRNA guidance mechanism is present. In addition, the search for compositional signals in the transcripts of T. cruzi MiNT genes produce a list, being worth to note a conserved non-translated element represented by the consensus sequence DARRVSG. Taking into account its reported interaction with the T. brucei TRRM3 protein which is enriched in the mitochondrial membrane fraction, we here suggest a putative zip code role for this element. Globally, here we provide an inventory of the mitochondrial proteins in T. cruzi and give evidence for the existence of both peptide and mRNA signals specific to nuclear encoded mitochondrial proteins.
Collapse
|
29
|
Sahoo S, Das SS, Rakshit R. Codon usage pattern and predicted gene expression in Arabidopsis thaliana. Gene 2019; 721S:100012. [PMID: 32550546 PMCID: PMC7286098 DOI: 10.1016/j.gene.2019.100012] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2018] [Revised: 01/30/2019] [Accepted: 02/21/2019] [Indexed: 01/20/2023]
Abstract
The extensive research for predicting highly expressed genes in plant genome sequences has been going on for decades. The codon usage pattern of genes in Arabidopsis thaliana genome is a classical topic for plant biologists for its significance in the understanding of molecular plant biology. Here we have used a gene expression profiling methodology based on the score of modified relative codon bias (MRCBS) to elucidate expression pattern of genes in Arabidopsis thaliana. MRCBS relies exclusively on sequence features for identifying the highly expressed genes. In this study, a critical analysis of predicted highly expressed (PHE) genes in Arabidopsis thaliana has been performed using MRCBS as a numerical estimator of gene expression level. Consistent with previous other results, our study indicates that codon composition plays an important role in the regulation of gene expression. We found a systematic strong correlation between MRCBS and CAI (codon adaptation index) or other expression-measures. Additionally, MRCBS correlates well with experimental gene expression data. Our study highlights the relationship between gene expression and compositional signature in relation to codon usage bias and sets the ground for the further investigation of the evolution of the protein-coding genes in the plant genome.
Collapse
Key Words
- Arabidopsis thaliana
- CAI
- CAI, Codon adaptation index
- CP, Chloroplast Pltd CP
- Codon usage bias
- GC content
- GEO, Gene Expression Omnibus
- Gene expression
- MADS, Minichromosome maintenance1, Agamous, Deficiens and Serum response factor
- MBP, Megabase pair
- MRCBS, Score of Modified relative codon bias
- MT, Mitochondrion
- PHE genes
- PHE, Predicted Highly Expressed
- RCA, Relative Codon Adaptation
- RCB, Relative codon bias
- RCBS, Relative Codon Bias Strength
- RMA, Relative Molecular Abundance
- RP, Ribosomal protein
- SAGE, Serial Analysis of Gene Expression
- TAIR, The Arabidopsis Information Resourses
Collapse
Affiliation(s)
- Satyabrata Sahoo
- Department of Physics, Dhruba Chand Halder College, Dakshin Barasat, South 24 Parganas, W.B., India
| | - Shib Sankar Das
- Department of Mathematics, Uluberia College, Uluberia, Howrah, W.B., India
| | - Ria Rakshit
- Department of Botany, Baruipur College, South 24 Parganas, W.B., India
| |
Collapse
|
30
|
Venev SV, Zeldovich KB. Thermophilic Adaptation in Prokaryotes Is Constrained by Metabolic Costs of Proteostasis. Mol Biol Evol 2019; 35:211-224. [PMID: 29106597 PMCID: PMC5850847 DOI: 10.1093/molbev/msx282] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Prokaryotes evolved to thrive in an extremely diverse set of habitats, and their proteomes bear signatures of environmental conditions. Although correlations between amino acid usage and environmental temperature are well-documented, understanding of the mechanisms of thermal adaptation remains incomplete. Here, we couple the energetic costs of protein folding and protein homeostasis to build a microscopic model explaining both the overall amino acid composition and its temperature trends. Low biosynthesis costs lead to low diversity of physical interactions between amino acid residues, which in turn makes proteins less stable and drives up chaperone activity to maintain appropriate levels of folded, functional proteins. Assuming that the cost of chaperone activity is proportional to the fraction of unfolded client proteins, we simulated thermal adaptation of model proteins subject to minimization of the total cost of amino acid synthesis and chaperone activity. For the first time, we predicted both the proteome-average amino acid abundances and their temperature trends simultaneously, and found strong correlations between model predictions and 402 genomes of bacteria and archaea. The energetic constraint on protein evolution is more apparent in highly expressed proteins, selected by codon adaptation index. We found that in bacteria, highly expressed proteins are similar in composition to thermophilic ones, whereas in archaea no correlation between predicted expression level and thermostability was observed. At the same time, thermal adaptations of highly expressed proteins in bacteria and archaea are nearly identical, suggesting that universal energetic constraints prevail over the phylogenetic differences between these domains of life.
Collapse
Affiliation(s)
- Sergey V Venev
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation St, Worcester, MA
| | - Konstantin B Zeldovich
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, 368 Plantation St, Worcester, MA
| |
Collapse
|
31
|
Deka H, Nath D, Uddin A, Chakraborty S. DNA compositional dynamics and codon usage patterns of M1 and M2 matrix protein genes in influenza A virus. INFECTION GENETICS AND EVOLUTION 2018; 67:7-16. [PMID: 30367980 DOI: 10.1016/j.meegid.2018.10.015] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Revised: 10/11/2018] [Accepted: 10/23/2018] [Indexed: 11/30/2022]
Abstract
Influenza A virus subtype H3N2 has been a serious health issue across the globe with approximately 36 thousand annual casualties in the United States of America only. Co-circulation in avian, swine and human hosts has led to frequent mutations in the virus genome, due to which development of successful antivirals against the virus has become a formidable challenge. Recently, focussed research is being carried out targeting the matrix proteins of this strain as vaccine candidates. This study is carried out to unravel the key features of the genes encoding the matrix proteins that manoeuvre the codon usage profile in the H3N2 strains. The findings reveal differential codon choice for both matrix protein 1 and matrix protein 2. The overall codon usage bias is less pronounced in both the datasets which is evident from higher value of effective number of codons (>55). Comparison of the codon usage for both the genes under study with that of humans revealed that the viral codon usage is not fully optimized for the human host conditions. Both the genes enrolled in the study showed variation which was reflected in almost all the indices used for codon usage studies. Neutrality analysis revealed a weak role of mutation pressure while selection was the major contributor towards codon usage.
Collapse
Affiliation(s)
- Himangshu Deka
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Durbba Nath
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India
| | - Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Hailakandi 788150, Assam, India.
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar 788011, Assam, India.
| |
Collapse
|
32
|
Gene expression, nucleotide composition and codon usage bias of genes associated with human Y chromosome. Genetica 2017; 145:295-305. [PMID: 28421323 DOI: 10.1007/s10709-017-9965-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2016] [Accepted: 04/08/2017] [Indexed: 10/19/2022]
Abstract
Analysis of codon usage pattern is important to understand the genetic and evolutionary characteristics of genomes. We have used bioinformatic approaches to analyze the codon usage bias (CUB) of the genes located in human Y chromosome. Codon bias index (CBI) indicated that the overall extent of codon usage bias was low. The relative synonymous codon usage (RSCU) analysis suggested that approximately half of the codons out of 59 synonymous codons were most frequently used, and possessed a T or G at the third codon position. The codon usage pattern was different in different genes as revealed from correspondence analysis (COA). A significant correlation between effective number of codons (ENC) and various GC contents suggests that both mutation pressure and natural selection affect the codon usage pattern of genes located in human Y chromosome. In addition, Y-linked genes have significant difference in GC contents at the second and third codon positions, expression level, and codon usage pattern of some codons like the SPANX genes in X chromosome.
Collapse
|
33
|
Das S, Chottopadhyay B, Sahoo S. Comparative Analysis of Predicted Gene Expression among Crenarchaeal Genomes. Genomics Inform 2017; 15:38-47. [PMID: 28416948 PMCID: PMC5389947 DOI: 10.5808/gi.2017.15.1.38] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Revised: 11/28/2016] [Accepted: 01/26/2017] [Indexed: 12/13/2022] Open
Abstract
Research into new methods for identifying highly expressed genes in anonymous genome sequences has been going on for more than 15 years. We presented here an alternative approach based on modified score of relative codon usage bias to identify highly expressed genes in crenarchaeal genomes. The proposed algorithm relies exclusively on sequence features for identifying the highly expressed genes. In this study, a comparative analysis of predicted highly expressed genes in five crenarchaeal genomes was performed using the score of Modified Relative Codon Bias Strength (MRCBS) as a numerical estimator of gene expression level. We found a systematic strong correlation between Codon Adaptation Index and MRCBS. Additionally, MRCBS correlated well with other expression measures. Our study indicates that MRCBS can consistently capture the highly expressed genes.
Collapse
Affiliation(s)
- Shibsankar Das
- Department of Mathematics, Uluberia College, Uluberia 711315, India
| | | | | |
Collapse
|
34
|
Survey of (Meta)genomic Approaches for Understanding Microbial Community Dynamics. Indian J Microbiol 2016; 57:23-38. [PMID: 28148977 DOI: 10.1007/s12088-016-0629-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 10/27/2016] [Indexed: 01/06/2023] Open
Abstract
Advancement in the next generation sequencing technologies has led to evolution of the field of genomics and metagenomics in a slim duration with nominal cost at precipitous higher rate. While metagenomics and genomics can be separately used to reveal the culture-independent and culture-based microbial evolution, respectively, (meta)genomics together can be used to demonstrate results at population level revealing in-depth complex community interactions for specific ecotypes. The field of metagenomics which started with answering "who is out there?" based on 16S rRNA gene has evolved immensely with the precise organismal reconstruction at species/strain level from the deeply covered metagenome data outweighing the need to isolate bacteria of which 99% are de facto non-cultivable. In this review we have underlined the appeal of metagenomic-derived genomes in providing insights into the evolutionary patterns, growth dynamics, genome/gene-specific sweeps, and durability of environmental pressures. We have demonstrated the use of culture-based genomics and environmental shotgun metagenome data together to elucidate environment specific genome modulations via metagenomic recruitments in terms of gene loss/gain, accessory and core-genome extent. We further illustrated the benefit of (meta)genomics in the understanding of infectious diseases by deducing the relationship between human microbiota and clinical microbiology. This review summarizes the technological advances in the (meta)genomic strategies using the genome and metagenome datasets together to increase the resolution of microbial population studies.
Collapse
|
35
|
Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016; 44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open
Abstract
Bacteria and Archaea display a variety of phenotypic traits and can adapt to diverse ecological niches. However, systematic annotation of prokaryotic phenotypes is lacking. We have therefore developed ProTraits, a resource containing ∼545 000 novel phenotype inferences, spanning 424 traits assigned to 3046 bacterial and archaeal species. These annotations were assigned by a computational pipeline that associates microbes with phenotypes by text-mining the scientific literature and the broader World Wide Web, while also being able to define novel concepts from unstructured text. Moreover, the ProTraits pipeline assigns phenotypes by drawing extensively on comparative genomics, capturing patterns in gene repertoires, codon usage biases, proteome composition and co-occurrence in metagenomes. Notably, we find that gene synteny is highly predictive of many phenotypes, and highlight examples of gene neighborhoods associated with spore-forming ability. A global analysis of trait interrelatedness outlined clusters in the microbial phenotype network, suggesting common genetic underpinnings. Our extended set of phenotype annotations allows detection of 57 088 high confidence gene-trait links, which recover many known associations involving sporulation, flagella, catalase activity, aerobicity, photosynthesis and other traits. Over 99% of the commonly occurring gene families are involved in genetic interactions conditional on at least one phenotype, suggesting that epistasis has a major role in shaping microbial gene content.
Collapse
Affiliation(s)
- Maria Brbić
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Matija Piškorec
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Vedrana Vidulin
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Anita Kriško
- Mediterranean Institute of Life Sciences, 21000 Split, Croatia
| | - Tomislav Šmuc
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Fran Supek
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia .,EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain.,Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain
| |
Collapse
|
36
|
Whittle CA, Extavour CG. Expression-Linked Patterns of Codon Usage, Amino Acid Frequency, and Protein Length in the Basally Branching Arthropod Parasteatoda tepidariorum. Genome Biol Evol 2016; 8:2722-36. [PMID: 27017527 PMCID: PMC5630913 DOI: 10.1093/gbe/evw068] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Spiders belong to the Chelicerata, the most basally branching arthropod subphylum. The common house spider, Parasteatoda tepidariorum, is an emerging model and provides a valuable system to address key questions in molecular evolution in an arthropod system that is distinct from traditionally studied insects. Here, we provide evidence suggesting that codon usage, amino acid frequency, and protein lengths are each influenced by expression-mediated selection in P. tepidariorum. First, highly expressed genes exhibited preferential usage of T3 codons in this spider, suggestive of selection. Second, genes with elevated transcription favored amino acids with low or intermediate size/complexity (S/C) scores (glycine and alanine) and disfavored those with large S/C scores (such as cysteine), consistent with the minimization of biosynthesis costs of abundant proteins. Third, we observed a negative correlation between expression level and coding sequence length. Together, we conclude that protein-coding genes exhibit signals of expression-related selection in this emerging, noninsect, arthropod model.
Collapse
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University
| | - Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University Department of Molecular and Cellular Biology, Harvard University
| |
Collapse
|
37
|
Fabijanić M, Vlahoviček K. Big Data, Evolution, and Metagenomes: Predicting Disease from Gut Microbiota Codon Usage Profiles. Methods Mol Biol 2016; 1415:509-531. [PMID: 27115650 DOI: 10.1007/978-1-4939-3572-7_26] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Metagenomics projects use next-generation sequencing to unravel genetic potential in microbial communities from a wealth of environmental niches, including those associated with human body and relevant to human health. In order to understand large datasets collected in metagenomics surveys and interpret them in context of how a community metabolism as a whole adapts and interacts with the environment, it is necessary to extend beyond the conventional approaches of decomposing metagenomes into microbial species' constituents and performing analysis on separate components. By applying concepts of translational optimization through codon usage adaptation on entire metagenomic datasets, we demonstrate that a bias in codon usage present throughout the entire microbial community can be used as a powerful analytical tool to predict for community lifestyle-specific metabolism. Here we demonstrate this approach combined with machine learning, to classify human gut microbiome samples according to the pathological condition diagnosed in the human host.
Collapse
Affiliation(s)
- Maja Fabijanić
- Bioinformatics Group, Division of Biology, Department of Molecular Biology, Faculty of Science, University of Zagreb, Rooseveltov trg 6, Zagreb, Croatia
| | - Kristian Vlahoviček
- Bioinformatics Group, Division of Biology, Department of Molecular Biology, Faculty of Science, University of Zagreb, Rooseveltov trg 6, Zagreb, Croatia.
| |
Collapse
|
38
|
Supek F. The Code of Silence: Widespread Associations Between Synonymous Codon Biases and Gene Function. J Mol Evol 2015; 82:65-73. [PMID: 26538122 DOI: 10.1007/s00239-015-9714-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 10/30/2015] [Indexed: 02/07/2023]
Abstract
Some mutations in gene coding regions exchange one synonymous codon for another, and thus do not alter the amino acid sequence of the encoded protein. Even though they are often called 'silent,' these mutations may exhibit a plethora of effects on the living cell. Therefore, they are often selected during evolution, causing synonymous codon usage biases in genomes. Comparative analyses of bacterial, archaeal, fungal, and human cancer genomes have found many links between a gene's biological role and the accrual of synonymous mutations during evolution. In particular, highly expressed genes in certain functional categories are enriched with optimal codons, which are decoded by the abundant tRNAs, thus enhancing the speed and accuracy of the translating ribosome. The set of genes exhibiting codon adaptation differs between genomes, and these differences show robust associations to organismal phenotypes. In addition to selection for translation efficiency, other distinct codon bias patterns have been found in: amino acid starvation genes, cyclically expressed genes, tissue-specific genes in animals and plants, oxidative stress response genes, cellular differentiation genes, and oncogenes. In addition, genomes of organisms harboring tRNA modifications exhibit particular codon preferences. The evolutionary trace of codon bias patterns across orthologous genes may be examined to learn about a gene's relevance to various phenotypes, or, more generally, its function in the cell.
Collapse
Affiliation(s)
- Fran Supek
- Division of electronics, Rudjer Boskovic Institute, 10000, Zagreb, Croatia.
- EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), 08003, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain.
| |
Collapse
|
39
|
Chakraborti P, Banerjee R, Roy A, Mandal S, Mukhopadhyay S. Molecular characterization influencing metal resistance in the Cupriavidus/Ralstonia genomes. J Biomol Struct Dyn 2015; 33:2330-46. [PMID: 26156561 DOI: 10.1080/07391102.2015.1069214] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Our environment is stressed with a load of heavy and toxic metals. Microbes, abundant in our environment, are found to adapt well to this metal-stressed condition. A comparative study among five Cupriavidus/Ralstonia genomes can offer a better perception of their evolutionary mechanisms to adapt to these conditions. We have studied codon usage among 1051 genes common to all these organisms and identified 15 optimal codons frequently used in highly expressed genes present within 1051 genes. We found the core genes of Cupriavidus metallidurans CH34 have a different optimal codon choice for arginine, glycine and alanine in comparison with the other four bacteria. We also found that the synonymous codon usage bias within these 1051 core genes is highly correlated with their gene expression. This supports that translational selection drives synonymous codon usage in the core genes of these genomes. Synonymous codon usage is highly conserved in the core genes of these five genomes. The only exception among them is C. metallidurans CH34. This genomewide shift in synonymous codon choice in C. metallidurans CH34 may have taken place due to the insertion of new genes in its genomes facilitating them to survive in heavy metal containing environment and the co-evolution of the other genes in its genome to achieve a balance in gene expression. Structural studies indicated the presence of a longer N-terminal region containing a copper-binding domain in the cupC proteins of C. metallidurans CH3 that helps it to attain higher binding efficacy with copper in comparison with its orthologs.
Collapse
Affiliation(s)
- Pratim Chakraborti
- a Apt Software Avenues Pvt. Ltd, Unit G 301, Block DC , City Centre , Sector I, Salt Lake, Kolkata 700064 , India
| | - Rachana Banerjee
- b Department of Biophysics, Molecular Biology and Bioinformatics , University of Calcutta , 92, A.P.C. Road, Kolkata 700009 , India
| | - Ayan Roy
- c NBU Bioinformatics Facility, Department of Botany , University of North Bengal , Raja Rammohanpur, Siliguri 734013 , India
| | - Sunanda Mandal
- b Department of Biophysics, Molecular Biology and Bioinformatics , University of Calcutta , 92, A.P.C. Road, Kolkata 700009 , India
| | - Subhasish Mukhopadhyay
- b Department of Biophysics, Molecular Biology and Bioinformatics , University of Calcutta , 92, A.P.C. Road, Kolkata 700009 , India
| |
Collapse
|
40
|
Cheng D, Wang R, Prather KJ, Chow KL, Hsing IM. Tackling codon usage bias for heterologous expression in Rhodobacter sphaeroides by supplementation of rare tRNAs. Enzyme Microb Technol 2015; 72:25-34. [DOI: 10.1016/j.enzmictec.2015.02.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2014] [Revised: 02/05/2015] [Accepted: 02/07/2015] [Indexed: 10/24/2022]
|
41
|
Mazumder TH, Chakraborty S. Gaining insights into the codon usage patterns of TP53 gene across eight mammalian species. PLoS One 2015; 10:e0121709. [PMID: 25807269 PMCID: PMC4373688 DOI: 10.1371/journal.pone.0121709] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2014] [Accepted: 02/14/2015] [Indexed: 02/06/2023] Open
Abstract
TP53 gene is known as the “guardian of the genome” as it plays a vital role in regulating cell cycle, cell proliferation, DNA damage repair, initiation of programmed cell death and suppressing tumor growth. Non uniform usage of synonymous codons for a specific amino acid during translation of protein known as codon usage bias (CUB) is a unique property of the genome and shows species specific deviation. Analysis of codon usage bias with compositional dynamics of coding sequences has contributed to the better understanding of the molecular mechanism and the evolution of a particular gene. In this study, the complete nucleotide coding sequences of TP53 gene from eight different mammalian species were used for CUB analysis. Our results showed that the codon usage patterns in TP53 gene across different mammalian species has been influenced by GC bias particularly GC3 and a moderate bias exists in the codon usage of TP53 gene. Moreover, we observed that nature has highly favored the most over represented codon CTG for leucine amino acid but selected against the ATA codon for isoleucine in TP53 gene across all mammalian species during the course of evolution.
Collapse
Affiliation(s)
| | - Supriyo Chakraborty
- Department of Biotechnology, Assam University, Silchar-788011, Assam, India
- * E-mail:
| |
Collapse
|
42
|
Sun WY, Sun SC. A description of the complete mitochondrial genomes of Amphiporus formidabilis, Prosadenoporus spectaculum and Nipponnemertes punctatula (Nemertea: Hoplonemertea: Monostilifera). Mol Biol Rep 2014; 41:5681-92. [PMID: 24939507 DOI: 10.1007/s11033-014-3438-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2013] [Accepted: 05/27/2014] [Indexed: 11/30/2022]
Abstract
We sequenced the complete mitochondrial genomes (mitogenomes) of three Hoplonemertea species, Amphiporus formidabilis, Prosadenoporus spectaculum and Nipponnemertes punctatula, which are 14,616, 14,655 and 15,354 bp in length, respectively. Each of the three circular mitogenomes consists of 37 typical genes and some non-coding regions. The nucleotide composition of the coding strand is biased toward T, almost a half of total nucleotides in these mitogenomes. There are many poly-T tracts across these mitogenomes, which exhibit T-number variation within different clones of protein-coding genes, mainly resulting from false PCR amplification. The major non-coding regions have tandem repeat motifs and hairpin-like structures that may be associated with the initiation of replication or transcription. Data published to date for nemerteans show that Palaeonemertea species usually bear the largest mitogenomes, while representatives in the more recently derived Distromatonemertea clade bear the smallest ones; and that the gene arrangement of mitogenomes seems to be variable within the phylum Nemertea, but stable within either of Heteronemertea and Hoplonemertea.
Collapse
Affiliation(s)
- Wen-Yan Sun
- Institute of Evolution & Marine Biodiversity, Ocean University of China, 5 Yushan Road, Qingdao, 266003, China
| | | |
Collapse
|
43
|
Hu C, Chen J, Ye L, Chen R, Zhang L, Xue X. Codon usage bias in human cytomegalovirus and its biological implication. Gene 2014; 545:5-14. [PMID: 24814188 DOI: 10.1016/j.gene.2014.05.018] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2013] [Revised: 05/02/2014] [Accepted: 05/06/2014] [Indexed: 10/25/2022]
Abstract
Human cytomegalovirus (HCMV) infection, a worldwide contagion, causes a serious disorder in infected individuals. Analysis of codon usage can reveal much molecular information about this virus. The effective number of codon (ENC) values, relative synonymous codon usage (RSCU) values, codon adaptation index (CAI), and nucleotide contents was investigated in approximately 160 coding sequences (CDS) among 17 human cytomegalovirus genomes using the software CodonW. Linear regression analysis and logistic regression were performed to explore the preliminary data. The results showed that, overall, HCMV genomes had low codon usage bias (mean ENC=47.619). However, the ENC of individual CDS varied widely and was distributed unevenly between host-related genes and viral-self-function genes (P=0.002, odds ratio (OR)=3.194), as did the GC content (P=0.016, OR=2.178). The ENC values correlated with CAI, GC content, and the nucleotide composing at the 3rd codon position (GC3s) (P<0.001). There was a significant variation in the codon preference that depended on the RSCU data. The predicted ENC curve suggested that mutational pressure, rather than natural selection, was one of the main factors that determined the codon usage bias in HCMV. Among 123 genes with known function, the genes related to viral self-replication and viral-host interaction showed different ENC and CAI values, and GC and GC3s contents. In conclusion, the detailed codon usage bias theoretically revealed information concerning HCMV evolution and could be a valuable additional parameter for HCMV gene function research.
Collapse
Affiliation(s)
- Changyuan Hu
- Department of General Surgery, The First Affiliated Hospital of Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China
| | - Jing Chen
- Department of Rheumatism and Immunology, The First Affiliated Hospital of Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China
| | - Lulu Ye
- Department of Microbiology and Immunology, Institute of Molecular Virology and Immunology, Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China
| | - Renpin Chen
- Department of Gastroenterology and Hepatology, The First Affiliated Hospital of Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China
| | - Lifang Zhang
- Department of Microbiology and Immunology, Institute of Molecular Virology and Immunology, Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China
| | - Xiangyang Xue
- Department of Microbiology and Immunology, Institute of Molecular Virology and Immunology, Wenzhou Medical University, Ouhai District 325035, Wenzhou City, Zhejiang Province, China.
| |
Collapse
|
44
|
Krisko A, Copic T, Gabaldón T, Lehner B, Supek F. Inferring gene function from evolutionary change in signatures of translation efficiency. Genome Biol 2014; 15:R44. [PMID: 24580753 PMCID: PMC4054840 DOI: 10.1186/gb-2014-15-3-r44] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Accepted: 03/03/2014] [Indexed: 11/13/2022] Open
Abstract
Background The genetic code is redundant, meaning that most amino acids can be encoded by more than one codon. Highly expressed genes tend to use optimal codons to increase the accuracy and speed of translation. Thus, codon usage biases provide a signature of the relative expression levels of genes, which can, uniquely, be quantified across the domains of life. Results Here we describe a general statistical framework to exploit this phenomenon and to systematically associate genes with environments and phenotypic traits through changes in codon adaptation. By inferring evolutionary signatures of translation efficiency in 911 bacterial and archaeal genomes while controlling for confounding effects of phylogeny and inter-correlated phenotypes, we linked 187 gene families to 24 diverse phenotypic traits. A series of experiments in Escherichia coli revealed that 13 of 15, 19 of 23, and 3 of 6 gene families with changes in codon adaptation in aerotolerant, thermophilic, or halophilic microbes. Respectively, confer specific resistance to, respectively, hydrogen peroxide, heat, and high salinity. Further, we demonstrate experimentally that changes in codon optimality alone are sufficient to enhance stress resistance. Finally, we present evidence that multiple genes with altered codon optimality in aerobes confer oxidative stress resistance by controlling the levels of iron and NAD(P)H. Conclusions Taken together, these results provide experimental evidence for a widespread connection between changes in translation efficiency and phenotypic adaptation. As the number of sequenced genomes increases, this novel genomic context method for linking genes to phenotypes based on sequence alone will become increasingly useful.
Collapse
|
45
|
Iriarte A, Baraibar JD, Diana L, Castro-Sowinski S, Romero H, Musto H. Trends in amino acid usage across the class Mollicutes. J Biomol Struct Dyn 2014; 32:65-74. [DOI: 10.1080/07391102.2012.748636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
46
|
O'Neill PK, Or M, Erill I. scnRCA: a novel method to detect consistent patterns of translational selection in mutationally-biased genomes. PLoS One 2013; 8:e76177. [PMID: 24116094 PMCID: PMC3792112 DOI: 10.1371/journal.pone.0076177] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2013] [Accepted: 08/23/2013] [Indexed: 12/04/2022] Open
Abstract
Codon usage bias (CUB) results from the complex interplay between translational selection and mutational biases. Current methods for CUB analysis apply heuristics to integrate both components, limiting the depth and scope of CUB analysis as a technique to probe into the evolution and optimization of protein-coding genes. Here we introduce a self-consistent CUB index (scnRCA) that incorporates implicit correction for mutational biases, facilitating exploration of the translational selection component of CUB. We validate this technique using gene expression data and we apply it to a detailed analysis of CUB in the Pseudomonadales. Our results illustrate how the selective enrichment of specific codons among highly expressed genes is preserved in the context of genome-wide shifts in codon frequencies, and how the balance between mutational and translational biases leads to varying definitions of codon optimality. We extend this analysis to other moderate and fast growing bacteria and we provide unified support for the hypothesis that C- and A-ending codons of two-box amino acids, and the U-ending codons of four-box amino acids, are systematically enriched among highly expressed genes across bacteria. The use of an unbiased estimator of CUB allows us to report for the first time that the signature of translational selection is strongly conserved in the Pseudomonadales in spite of drastic changes in genome composition, and extends well beyond the core set of highly optimized genes in each genome. We generalize these results to other moderate and fast growing bacteria, hinting at selection for a universal pattern of gene expression that is conserved and detectable in conserved patterns of codon usage bias.
Collapse
Affiliation(s)
- Patrick K. O'Neill
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), Baltimore, Maryland, United States of America
| | - Mindy Or
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), Baltimore, Maryland, United States of America
| | - Ivan Erill
- Department of Biological Sciences, University of Maryland Baltimore County (UMBC), Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|
47
|
Taylor RC, Webb Robertson BJM, Markillie LM, Serres MH, Linggi BE, Aldrich JT, Hill EA, Romine MF, Lipton MS, Wiley HS. Changes in translational efficiency is a dominant regulatory mechanism in the environmental response of bacteria. Integr Biol (Camb) 2013; 5:1393-406. [PMID: 24081429 DOI: 10.1039/c3ib40120k] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
To understand how cell physiological state affects mRNA translation, we used Shewanella oneidensis MR-1 grown under steady state conditions at either 20% or 8.5% O2. Using a combination of quantitative proteomics and RNA-Seq, we generated high-confidence data on >1000 mRNA and protein pairs. By using a steady state model, we found that differences in protein-mRNA ratios were primarily due to differences in the translational efficiency of specific genes. When oxygen levels were lowered, 28% of the proteins showed at least a 2-fold change in expression. Transcription levels were sp. significantly altered for 26% of the protein changes; translational efficiency was significantly altered for 46% and a combination of both was responsible for the remaining 28%. Changes in translational efficiency were significantly correlated with the codon usage pattern of the genes and measurable tRNA pools changed in response to altered O2 levels. Our results suggest that changes in the translational efficiency of proteins, in part due to altered tRNA pools, is a major determinant of regulated alterations in protein expression levels in bacteria.
Collapse
Affiliation(s)
- Ronald C Taylor
- Computational Biosciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Roller M, Lucić V, Nagy I, Perica T, Vlahovicek K. Environmental shaping of codon usage and functional adaptation across microbial communities. Nucleic Acids Res 2013; 41:8842-52. [PMID: 23921637 PMCID: PMC3799439 DOI: 10.1093/nar/gkt673] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Microbial communities represent the largest portion of the Earth's biomass. Metagenomics projects use high-throughput sequencing to survey these communities and shed light on genetic capabilities that enable microbes to inhabit every corner of the biosphere. Metagenome studies are generally based on (i) classifying and ranking functions of identified genes; and (ii) estimating the phyletic distribution of constituent microbial species. To understand microbial communities at the systems level, it is necessary to extend these studies beyond the species' boundaries and capture higher levels of metabolic complexity. We evaluated 11 metagenome samples and demonstrated that microbes inhabiting the same ecological niche share common preferences for synonymous codons, regardless of their phylogeny. By exploring concepts of translational optimization through codon usage adaptation, we demonstrated that community-wide bias in codon usage can be used as a prediction tool for lifestyle-specific genes across the entire microbial community, effectively considering microbial communities as meta-genomes. These findings set up a 'functional metagenomics' platform for the identification of genes relevant for adaptations of entire microbial communities to environments. Our results provide valuable arguments in defining the concept of microbial species through the context of their interactions within the community.
Collapse
Affiliation(s)
- Masa Roller
- Bioinformatics Group, Department of Molecular Biology, Faculty of Science, University of Zagreb, Horvatovac 102a, 10000 Zagreb, Croatia, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Temesvári körút 62, H-6726 Szeged, Hungary, MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH, UK and Department of Informatics, University of Oslo, PO Box 1080 Blindern, NO-0316 Oslo, Norway
| | | | | | | | | |
Collapse
|
49
|
Iriarte A, Baraibar JD, Romero H, Castro-Sowinski S, Musto H. Evolution of optimal codon choices in the family Enterobacteriaceae. MICROBIOLOGY-SGM 2013; 159:555-564. [PMID: 23288542 DOI: 10.1099/mic.0.061952-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
The Enterobacteriaceae are a large family of Proteobacteria that include many well-known prokaryotic genera, such as Escherichia, Yersinia and Salmonella. The main ideas of synonymous codon usage (CU) evolution and translational selection have been deeply influenced by studies with these bacterial groups. In this work we report the analysis of the CU pattern of completely sequenced bacterial genomes that belong to the Enterobacteriaceae. The effect of selection in translation acting at the levels of speed and accuracy, and phylogenetic trends within this group are described. Preferred (optimal) codons were identified. The evolutionary dynamics of these codons were studied and following a Bayesian approach these preferences were traced back to the common ancestor of the family. We found that there is some level of variation in selection among the analysed micro-organisms that is probably associated with lineage-specific trends. The codon bias was largely conserved across the evolutionary time of the family in highly expressed genes and protein conserved regions, suggesting a major role of negative selection. In this sense, the results support the idea that the extant CU bias is finely tuned over the ancestral well-conserved pool of tRNAs.
Collapse
Affiliation(s)
- Andrés Iriarte
- Área Genética, Depto. de Genética y Mejora Animal, Facultad de Veterinaria (UDELAR), Av. A. Lasplaces 1550, CP 11600, Montevideo, Uruguay.,Laboratorio de Evolución, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay.,Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Juan Diego Baraibar
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Héctor Romero
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Susana Castro-Sowinski
- Sección Bioquímica y Biología Molecular, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias (UDELAR), Iguá 4225, 11400 Montevideo, Uruguay
| |
Collapse
|
50
|
Iriarte A, Sanguinetti M, Fernández-Calero T, Naya H, Ramón A, Musto H. Translational selection on codon usage in the genus Aspergillus. Gene 2012; 506:98-105. [DOI: 10.1016/j.gene.2012.06.027] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2011] [Revised: 05/09/2012] [Accepted: 06/15/2012] [Indexed: 10/28/2022]
|