1
Xu D, Tang L, Kapranov P. Complexities of mammalian transcriptome revealed by targeted RNA enrichment techniques. Trends Genet 2023; 39:320-333. [PMID: 36681580 DOI: 10.1016/j.tig.2022.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/29/2022] [Revised: 12/27/2022] [Accepted: 12/30/2022] [Indexed: 01/21/2023]
Abstract
Studies using highly sensitive targeted RNA enrichment methods have shown that a large portion of the human transcriptome remains to be discovered and that most of the genome is transcribed in a complex, interleaved fashion characterized by a complex web of transcripts emanating from protein coding and noncoding loci. These results resonate with those from single-cell transcriptome profiling endeavors that reveal the existence of multiple novel, cell type-specific transcripts and clearly demonstrate that our understanding of the complexities of the human transcriptome is far from being complete. Here, we review the current status of the targeted RNA enrichment techniques, their application to the discovery of novel cell type-specific transcripts, and their impact on our understanding of the human genome and transcriptome.
Affiliation(s)
- Dongyang Xu
  - Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen 361021, China
- Lu Tang
  - Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen 361021, China
- Philipp Kapranov
  - Institute of Genomics, School of Medicine, Huaqiao University, 668 Jimei Road, Xiamen 361021, China
2
Farkas C, Recabal A, Mella A, Candia-Herrera D, Olivero MG, Haigh JJ, Tarifeño-Saldivia E, Caprile T. annotate_my_genomes: an easy-to-use pipeline to improve genome annotation and uncover neglected genes by hybrid RNA sequencing. Gigascience 2022; 11:6874526. [PMID: 36472574 PMCID: PMC9724561 DOI: 10.1093/gigascience/giac099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 03/22/2022] [Revised: 07/22/2022] [Accepted: 09/28/2022] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND: The advancement of hybrid sequencing technologies is increasingly expanding genome assemblies that are often annotated using hybrid sequencing transcriptomics, leading to improved genome characterization and the identification of novel genes and isoforms in a wide variety of organisms. RESULTS: We developed an easy-to-use genome-guided transcriptome annotation pipeline that uses assembled transcripts from hybrid sequencing data as input and distinguishes between coding and long non-coding RNAs by integration of several bioinformatic approaches, including gene reconciliation with previous annotations in GTF format. We demonstrated the efficiency of this approach by correctly assembling and annotating all exons from the chicken SCO-spondin gene (containing more than 105 exons), including the identification of missing genes in the chicken reference annotations by homology assignments. CONCLUSIONS: Our method helps to improve the current transcriptome annotation of the chicken brain. Our pipeline, implemented on Anaconda/Nextflow and Docker, is an easy-to-use package that can be applied to a broad range of species, tissues, and research areas, helping to improve and reconcile current annotations. The code and datasets are publicly available at https://github.com/cfarkas/annotate_my_genomes.
Affiliation(s)
- Antonia Recabal
  - Departamento de Biología Celular, Facultad de Ciencias Biológicas, Universidad de Concepción, Chile
- Andy Mella
  - Instituto de Ciencias Naturales, Universidad de las Américas, Chile
  - Centro Integrativo de Biología y Química Aplicada (CIBQA), Universidad Bernardo O'Higgins, Santiago 8370854, Chile
- Daniel Candia-Herrera
  - Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Biológicas, Universidad de Concepción, Chile
- Maryori González Olivero
  - Departamento de Biología Celular, Facultad de Ciencias Biológicas, Universidad de Concepción, Chile
- Jody Jonathan Haigh
  - CancerCare Manitoba Research Institute, Winnipeg, MB, Canada
  - Department of Pharmacology and Therapeutics, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada
3
Zhang J, Xu C. Gene product diversity: adaptive or not? Trends Genet 2022; 38:1112-1122. [PMID: 35641344 PMCID: PMC9560964 DOI: 10.1016/j.tig.2022.05.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 02/03/2022] [Revised: 04/30/2022] [Accepted: 05/03/2022] [Indexed: 01/24/2023]
Abstract
One gene does not equal one RNA or protein. The genomic revolution has revealed numerous different RNA and protein molecules that can be produced from one gene, such as circular RNAs generated by back-splicing, proteins with residues mismatching the genomic encoding because of RNA editing, and proteins extended in the C terminus via stop codon readthrough in translation. Are these diverse products results of exquisite gene regulations or imprecise biological processes? While there are cases where the gene product diversity appears beneficial, genome-scale patterns suggest that much of this diversity arises from nonadaptive, molecular errors. This finding has important implications for studying the functions of diverse gene products and for understanding the fundamental properties and evolution of cellular life.
Affiliation(s)
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Chuan Xu
  - Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
4
Palazzo AF, Kejiou NS. Non-Darwinian Molecular Biology. Front Genet 2022; 13:831068. [PMID: 35251134 PMCID: PMC8888898 DOI: 10.3389/fgene.2022.831068] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 12/07/2021] [Accepted: 01/24/2022] [Indexed: 12/14/2022] Open
Abstract
With the discovery of the double helical structure of DNA, a shift occurred in how biologists investigated questions surrounding cellular processes, such as protein synthesis. Instead of viewing biological activity through the lens of chemical reactions, this new field used biological information to gain a new profound view of how biological systems work. Molecular biologists asked new types of questions that would have been inconceivable to the older generation of researchers, such as how cellular machineries convert inherited biological information into functional molecules like proteins. This new focus on biological information also gave molecular biologists a way to link their findings to concepts developed by genetics and the modern synthesis. However, by the late 1960s this all changed. Elevated rates of mutation, unsustainable genetic loads, and high levels of variation in populations challenged Darwinian evolution, a central tenet of the modern synthesis, where adaptation was the main driver of evolutionary change. Building on these findings, Motoo Kimura advanced the neutral theory of molecular evolution, which advocates that selection in multicellular eukaryotes is weak and that most genomic changes are neutral and due to random drift. This was further elaborated by Jack King and Thomas Jukes, in their paper “Non-Darwinian Evolution”, where they pointed out that the observed changes seen in proteins and the types of polymorphisms observed in populations only become understandable when we take into account biochemistry and Kimura’s new theory. Fifty years later, most molecular biologists remain unaware of these fundamental advances. Their adaptationist viewpoint fails to explain data collected from new powerful technologies which can detect exceedingly rare biochemical events. For example, high throughput sequencing routinely detects RNA transcripts that are produced from almost the entire genome yet are present at less than one copy per thousand cells and appear to lack any function. Molecular biologists must now reincorporate ideas from classical biochemistry and absorb modern concepts from molecular evolution, to craft a new lens through which they can evaluate the functionality of transcriptional units, and make sense of our messy, intricate, and complicated genome.
5
Xu C, Zhang J. Mammalian circular RNAs result largely from splicing errors. Cell Rep 2021; 36:109439. [PMID: 34320353 PMCID: PMC8365531 DOI: 10.1016/j.celrep.2021.109439] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Received: 10/16/2020] [Revised: 04/13/2021] [Accepted: 07/02/2021] [Indexed: 12/20/2022] Open
Abstract
Ubiquitous in eukaryotes, circular RNAs (circRNAs) comprise a large class of mostly non-coding RNAs produced by back-splicing. Although some circRNAs have demonstrated biochemical activities, whether most circRNAs are functional is unknown. Here, we test the hypothesis that circRNA production primarily results from splicing error and so is deleterious instead of beneficial. In support of the error hypothesis, our analysis of RNA sequencing data from 11 shared tissues of humans, macaques, and mice finds that (1) back-splicing is much rarer than linear-splicing, (2) the rate of back-splicing diminishes with the splicing amount, (3) the overall prevalence of back-splicing in a species declines with its effective population size, and (4) circRNAs are overall evolutionarily unconserved. We estimate that more than 97% of the observed circRNA production is deleterious. We identify a small number of functional circRNA candidates, and the genome-wide trend strongly suggests that circRNAs are largely non-functional products of splicing errors.
Affiliation(s)
- Chuan Xu
  - Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders of Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
6
Ho AT, Hurst LD. Effective Population Size Predicts Local Rates but Not Local Mitigation of Read-through Errors. Mol Biol Evol 2021; 38:244-262. [PMID: 32797190 PMCID: PMC7783166 DOI: 10.1093/molbev/msaa210] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/15/2022] Open
Abstract
In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has a good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective, thus predicting a positive correlation between ASC enrichment and Ne. Contrary to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.
Affiliation(s)
- Alexander T Ho
  - Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D Hurst
  - Milner Centre for Evolution, University of Bath, Bath, United Kingdom
7
Xu C, Zhang J. Mammalian Alternative Translation Initiation Is Mostly Nonadaptive. Mol Biol Evol 2021; 37:2015-2028. [PMID: 32145028 DOI: 10.1093/molbev/msaa063] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Indexed: 01/03/2023] Open
Abstract
Alternative translation initiation (ATLI) refers to the existence of multiple translation initiation sites per gene and is a widespread phenomenon in eukaryotes. ATLI is commonly assumed to be advantageous through creating proteome diversity or regulating protein synthesis. We here propose an alternative hypothesis that ATLI arises primarily from nonadaptive initiation errors presumably due to the limited ability of ribosomes to distinguish sequence motifs truly signaling translation initiation from similar sequences. Our hypothesis, but not the adaptive hypothesis, predicts a series of global patterns of ATLI, all of which are confirmed at the genomic scale by quantitative translation initiation sequencing in multiple human and mouse cell lines and tissues. Similarly, although many codons differing from AUG by one nucleotide can serve as start codons, our analysis suggests that using non-AUG start codons is mostly disadvantageous. These and other findings strongly suggest that ATLI predominantly results from molecular error, requiring a major revision of our understanding of the precision and regulation of translation initiation.
Affiliation(s)
- Chuan Xu
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
8
Palazzo AF, Kang YM. GC-content biases in protein-coding genes act as an "mRNA identity" feature for nuclear export. Bioessays 2020; 43:e2000197. [PMID: 33165929 DOI: 10.1002/bies.202000197] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 07/22/2020] [Revised: 09/30/2020] [Accepted: 10/01/2020] [Indexed: 01/11/2023]
Abstract
It has long been observed that human protein-coding genes have a particular distribution of GC-content: the 5' end of these genes has high GC-content while the 3' end has low GC-content. In 2012, it was proposed that this pattern of GC-content could act as an mRNA identity feature that would lead to it being better recognized by the cellular machinery to promote its nuclear export. In contrast, junk RNA, which largely lacks this feature, would be retained in the nucleus and targeted for decay. Now two recent papers have provided evidence that GC-content does promote the nuclear export of many mRNAs in human cells.
Affiliation(s)
- Alexander F Palazzo
  - Department of Biochemistry, University of Toronto, Toronto, ON, M5G 1M1, Canada
- Yoon Mo Kang
  - Department of Biochemistry, University of Toronto, Toronto, ON, M5G 1M1, Canada
9
Palazzo AF, Koonin EV. Functional Long Non-coding RNAs Evolve from Junk Transcripts. Cell 2020; 183:1151-1161. [PMID: 33068526 DOI: 10.1016/j.cell.2020.09.047] [Citation(s) in RCA: 125] [Impact Index Per Article: 31.3] [Received: 05/19/2020] [Revised: 08/20/2020] [Accepted: 09/17/2020] [Indexed: 12/30/2022]
Abstract
Transcriptome studies reveal pervasive transcription of complex genomes, such as those of mammals. Despite popular arguments for functionality of most, if not all, of these transcripts, genome-wide analysis of selective constraints indicates that most of the produced RNAs are junk. However, junk is not garbage. On the contrary, junk transcripts provide the raw material for the evolution of diverse long non-coding (lnc) RNAs by non-adaptive mechanisms, such as constructive neutral evolution. The generation of many novel functional entities, such as lncRNAs, that fuels organismal complexity does not seem to be driven by strong positive selection. Rather, the weak selection regime that dominates the evolution of most multicellular eukaryotes provides ample material for functional innovation with relatively little adaptation involved.
Affiliation(s)
- Alexander F Palazzo
  - Department of Biochemistry, University of Toronto, Toronto, ON M5G 1M1, Canada
- Eugene V Koonin
  - National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
10
Xu H, Liu JJ, Liu Z, Li Y, Jin YS, Zhang J. Synchronization of stochastic expressions drives the clustering of functionally related genes. Science Advances 2019; 5:eaax6525. [PMID: 31633028 PMCID: PMC6785257 DOI: 10.1126/sciadv.aax6525] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 04/10/2019] [Accepted: 09/10/2019] [Indexed: 05/18/2023]
Abstract
Functionally related genes tend to be chromosomally clustered in eukaryotic genomes even after the exclusion of tandem duplicates, but the biological significance of this widespread phenomenon is unclear. We propose that stochastic expression fluctuations of neighboring genes resulting from chromatin dynamics are more or less synchronized such that their expression ratio is more stable than that for unlinked genes. Consequently, chromosomal clustering could be advantageous when the expression ratio of the clustered genes needs to stay constant, for example, because of the accumulation of toxic compounds when this ratio is altered. Evidence from manipulative experiments on the yeast GAL cluster, comprising three chromosomally adjacent genes encoding enzymes catalyzing consecutive reactions in galactose catabolism, unequivocally supports this hypothesis and elucidates how disorder in one biological phenomenon (gene expression noise) could prompt the emergence of order in another (genome organization).
Affiliation(s)
- Haiqing Xu
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Jing-Jing Liu
  - Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Zhen Liu
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
  - State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Ying Li
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Yong-Su Jin
  - Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
  - Department of Food Science and Human Nutrition, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
11
Ho AT, Hurst LD. In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons. PLoS Genet 2019; 15:e1008386. [PMID: 31527909 PMCID: PMC6764699 DOI: 10.1371/journal.pgen.1008386] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 06/11/2019] [Revised: 09/27/2019] [Accepted: 08/27/2019] [Indexed: 12/23/2022] Open
Abstract
Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense ‘codons’ downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single-celled eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contrary to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3’ sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense-mediated decay, bacteria may solve read-through via gene truncation, and that in eukaryotes certain prion states cause raised read-through rates.
In all organisms, gene expression is error-prone. One such error, translational read-through, occurs where the primary stop codon of an expressed gene is missed by the translational machinery. Failure to terminate is likely to be costly, hence genomes are under selection to prevent this from happening. One proposed error-proofing strategy involves in-frame proximal additional stop codons (ASCs) which may act as a ‘fail-safe’ mechanism by providing another opportunity for translation to terminate. There is evidence for ASC enrichment in several eukaryotes. We extend this evidence showing it to be common but not universal in single-celled eukaryotes. However, the situation in bacteria is poorly understood, despite bacteria having high read-through rates. Here, we test the fail-safe hypothesis within a broad range of bacteria. To our surprise, we find that not only are ASCs not enriched, but they may even be selected against. This provides evidence for an unusual circumstance where eukaryotes and prokaryotes could solve the same problem the same way but don’t. What are we to make of this? We suggest that if read-through is the problem, ASCs are not necessarily the expected solution. Owing to the absence of nonsense-mediated decay, a process that makes gene truncation in eukaryotes less viable, we propose bacteria may rescue a leaky stop by mutation that creates a new stop upstream. Alternatively, raised read-through rates in some particular conditions in eukaryotes might explain the difference.
Affiliation(s)
- Alexander T. Ho
  - Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D. Hurst
  - Milner Centre for Evolution, University of Bath, Bath, United Kingdom
12
Demongeot J, Seligmann H. Spontaneous evolution of circular codes in theoretical minimal RNA rings. Gene 2019; 705:95-102. [DOI: 10.1016/j.gene.2019.03.069] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Received: 11/13/2018] [Revised: 03/08/2019] [Accepted: 03/29/2019] [Indexed: 02/06/2023]
13
Seligmann H. Localized Context-Dependent Effects of the "Ambush" Hypothesis: More Off-Frame Stop Codons Downstream of Shifty Codons. DNA Cell Biol 2019; 38:786-795. [PMID: 31157984 DOI: 10.1089/dna.2019.4725] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Indexed: 12/13/2022] Open
Abstract
The ambush hypothesis speculates that off-frame stop codons increase translational efficiency after ribosomal frameshifts by stopping early frameshifted translation. Some evidence fits this hypothesis: (1) synonymous codon usages increase with their potential contribution to off-frame stops; (2) the genetic code assigns frequent amino acids to codon families contributing to off-frame stops; (3) positive biases for off-frame stops (AT rich) occur despite adverse nucleotide (GC) biases; and (4) mitochondrial off-frame stop codon densities increase with ribosomal structural instability, a potential proxy of frameshift frequencies. In this study, analyses of vertebrate mitogenes and tRNA synthetase genes from all superkingdoms and viruses test a new prediction of the ambush hypothesis: sequences immediately downstream of frameshift-inducing homopolymer codons (AAA, CCC, GGG, and TTT) are off-frame stop rich. Codons immediately downstream of homopolymer codons form more than average off-frame stops; biases are stronger than for corresponding upstream distances and for any other group of synonymous codons. Sequences downstream of that high-density region are off-frame stop depleted. This decrease suggests that off-frame stops, combined with suppressor tRNAs, regulate translation of overlapping coding sequences. Results show the predictive power of the ambush hypothesis, from macroevolutionary (genetic code structure) to detailed gene sequence anatomy.
Affiliation(s)
- Hervé Seligmann
  - The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel
14
Li C, Zhang J. Stop-codon read-through arises largely from molecular errors and is generally nonadaptive. PLoS Genet 2019; 15:e1008141. [PMID: 31120886 PMCID: PMC6550407 DOI: 10.1371/journal.pgen.1008141] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Received: 02/18/2019] [Revised: 06/05/2019] [Accepted: 04/16/2019] [Indexed: 12/02/2022] Open
Abstract
Stop-codon read-through refers to the phenomenon that a ribosome goes past the stop codon and continues translating into the otherwise untranslated region (UTR) of a transcript. Recent ribosome-profiling experiments in eukaryotes uncovered widespread stop-codon read-through that also varies among tissues, prompting the adaptive hypothesis that stop-codon read-through is an important, regulated mechanism for generating proteome diversity. Here we propose and test a competing hypothesis that stop-codon read-through arises mostly from molecular errors and is largely nonadaptive. The error hypothesis makes distinct predictions about the probability of read-through, frequency of sequence motifs for read-through, and conservation of the read-through region, each of which is supported by genome-scale data from yeasts and fruit flies. Thus, except for the few cases with demonstrated functions, stop-codon read-through is generally nonadaptive. This finding, along with other molecular errors recently quantified, reveals a much less precise or orderly cellular life than is commonly thought.
Affiliation(s)
- Chuan Li
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States of America
- Jianzhi Zhang
  - Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States of America
15
Abrahams L, Hurst LD. Refining the Ambush Hypothesis: Evidence That GC- and AT-Rich Bacteria Employ Different Frameshift Defence Strategies. Genome Biol Evol 2018; 10:1153-1173. [PMID: 29617761 PMCID: PMC5909447 DOI: 10.1093/gbe/evy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Accepted: 03/30/2018] [Indexed: 12/13/2022] Open
Abstract
Stop codons are frequently selected for beyond their regular termination function for error control. The “ambush hypothesis” proposes out-of-frame stop codons (OSCs) terminating frameshifted translations are selected for. Although early indirect evidence was partially supportive, recent evidence suggests OSC frequencies are not exceptional when considering underlying nucleotide content. However, prior null tests fail to control amino acid/codon usages or possible local mutational biases. We therefore return to the issue using bacterial genomes, considering several tests defining and testing against a null. We employ simulation approaches preserving amino acid order but shuffling synonymous codons or preserving codons while shuffling amino acid order. Additionally, we compare codon usage in amino acid pairs, where one codon can but the next, otherwise identical codon, cannot encode an OSC. OSC frequencies exceed expectations typically in AT-rich genomes, the +1 frame and for TGA/TAA but not TAG. With this complex evidence, simply rejecting or accepting the ambush hypothesis is not warranted. We propose a refined post hoc model, whereby AT-rich genomes have more accidental frameshifts, handled by RF2–RF3 complexes (associated with TGA/TAA) and are mostly +1 (or −2) slips. Supporting this, excesses positively correlate with in silico predicted frameshift probabilities. Thus, we propose a more viable framework, whereby genomes broadly adopt one of the two strategies to combat frameshifts: preventing frameshifting (GC-rich) or permitting frameshifts but minimizing impacts when most are caught early (AT-rich). Our refined framework holds promise yet some features, such as the bias of out-of-frame sense codons, remain unexplained.
Affiliation(s)
- Liam Abrahams
  - Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom
- Laurence D Hurst
  - Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom
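The null test described in the Abrahams and Hurst abstract (preserving amino acid order while shuffling synonymous codons, then comparing out-of-frame stop codon counts) can be illustrated with a minimal sketch. This is not the authors' code: the function names, the toy sequences, and the number of simulations are assumptions made for illustration, and only the standard genetic code is considered.

```python
import random
from collections import defaultdict

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
STOPS = {"TAA", "TAG", "TGA"}


def split_codons(cds):
    """Split an in-frame coding sequence into codons (trailing bases dropped)."""
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def translate(cds):
    """Translate a coding sequence with the standard code ('*' marks a stop)."""
    return "".join(CODON_TABLE[codon] for codon in split_codons(cds))


def count_out_of_frame_stops(cds, offset):
    """Count stop codons read when starting at offset 1 or 2."""
    return sum(cds[i:i + 3] in STOPS for i in range(offset, len(cds) - 2, 3))


def shuffle_synonymous(cds, rng):
    """Permute codons among positions that encode the same amino acid.

    Amino acid order and overall codon usage are both preserved.
    """
    codons = split_codons(cds)
    positions = defaultdict(list)
    for index, codon in enumerate(codons):
        positions[CODON_TABLE[codon]].append(index)
    shuffled = list(codons)
    for indices in positions.values():
        pool = [codons[i] for i in indices]
        rng.shuffle(pool)
        for i, codon in zip(indices, pool):
            shuffled[i] = codon
    return "".join(shuffled)


def osc_observed_vs_null(cds, offset, n_sim=200, seed=0):
    """Return (observed OSC count, mean OSC count over shuffled sequences)."""
    rng = random.Random(seed)
    observed = count_out_of_frame_stops(cds, offset)
    null = [
        count_out_of_frame_stops(shuffle_synonymous(cds, rng), offset)
        for _ in range(n_sim)
    ]
    return observed, sum(null) / len(null)
```

An observed count well above the null mean would be read as an excess of out-of-frame stops in that frame; a real analysis would also need a significance test and many genes per genome, which this sketch omits.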
16
Wei X, Zhang J. On the Origin of Compositional Features of Ribosomes. Genome Biol Evol 2018; 10:2010-2016. [PMID: 30059996 PMCID: PMC6097593 DOI: 10.1093/gbe/evy169] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Accepted: 07/28/2018] [Indexed: 01/08/2023] Open
Abstract
Ribosomes are highly abundant in cells and comprise, besides RNAs of varying lengths, 55–80 similarly sized, short proteins. This seemingly unusual composition is thought to have resulted from selection for rapid autocatalytic ribosome production. Here, we demonstrate that ribosomal protein-splitting mutations cannot accelerate ribosome production. The autocatalytic explanation is also unnecessary, because protein lengths generally decline with expression levels. Although ribosomal proteins are shorter than expected from their expression levels, they are not outliers among members of large protein complexes in mean protein length or coefficient of variation. These observations are explainable because 1) shortening proteins lowers their synthetic cost and reduces the waste from mistranslation-induced protein dysfunction and degradation, 2) such benefits rise with expression levels, and 3) members of large complexes participate in more protein–protein interactions so are less tolerant to mistranslation. These and other considerations suggest that the compositional features of ribosomes originate from cellular energy economics.
Affiliation(s)
- Xinzhu Wei: Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
- Jianzhi Zhang: Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
|
17
|
Abrahams L, Hurst LD. Adenine Enrichment at the Fourth CDS Residue in Bacterial Genes Is Consistent with Error Proofing for +1 Frameshifts. Mol Biol Evol 2018; 34:3064-3080. [PMID: 28961919 PMCID: PMC5850271 DOI: 10.1093/molbev/msx223] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Indexed: 12/21/2022] Open
Abstract
Beyond selection for optimal protein functioning, coding sequences (CDSs) are under selection at the RNA and DNA levels. Here, we identify a possible signature of “dual-coding,” namely extensive adenine (A) enrichment at bacterial CDS fourth sites. In 99.07% of studied bacterial genomes, fourth site A use is greater than expected given genomic A-starting codon use. Arguing for nucleotide level selection, A-starting serine and arginine second codons are heavily utilized when compared with their non-A starting synonyms. Several models have the ability to explain some of this trend. In part, A-enrichment likely reduces 5′ mRNA stability, promoting translation initiation. However T/U, which may also reduce stability, is avoided. Further, +1 frameshifts on the initiating ATG encode a stop codon (TGA) provided A is the fourth residue, acting either as a frameshift “catch and destroy” or a frameshift stop and adjust mechanism and hence implicated in translation initiation. Consistent with both, genomes lacking TGA stop codons exhibit weaker fourth site A-enrichment. Sequences lacking a Shine–Dalgarno sequence and those without upstream leader genes, that may be more error prone during initiation, have greater utilization of A, again suggesting a role in initiation. The frameshift correction model is consistent with the notion that many genomic features are error-mitigation factors and provides the first evidence for site-specific out of frame stop codon selection. We conjecture that the NTG universal start codon may have evolved as a consequence of TGA being a stop codon and the ability of NTGA to rapidly terminate or adjust a ribosome.
Affiliation(s)
- Liam Abrahams: Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D Hurst: Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, United Kingdom
|
18
|
Keller M, Hu Y, Mesihovic A, Fragkostefanakis S, Schleiff E, Simm S. Alternative splicing in tomato pollen in response to heat stress. DNA Res 2018; 24:205-217. [PMID: 28025318 PMCID: PMC5397606 DOI: 10.1093/dnares/dsw051] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Received: 06/23/2016] [Accepted: 10/26/2016] [Indexed: 01/08/2023] Open
Abstract
Alternative splicing (AS) is a key control mechanism influencing signal response cascades in different developmental stages and under stress conditions. In this study, we examined heat stress (HS)-induced AS in the heat sensitive pollen tissue of two tomato cultivars. To obtain the entire spectrum of HS-related AS, samples taken directly after HS and after recovery were combined and analysed by RNA-seq. For nearly 9,200 genes per cultivar, we observed at least one AS event under HS. In comparison to control, for one cultivar we observed 76% more genes with intron retention (IR) or exon skipping (ES) under HS. Furthermore, 2,343 genes had at least one transcript with IR or ES accumulated under HS in both cultivars. These genes are involved in biological processes like protein folding, gene expression and heat response. Transcriptome assembly of these genes revealed that most of the alternatively spliced transcripts possess truncated coding sequences resulting in partial or total loss of functional domains. Moreover, 141 HS specific and 22 HS repressed transcripts were identified. Furthermore, we propose AS as a layer of stress response regulating constitutively expressed genes under HS by isoform abundance.
Affiliation(s)
- Mario Keller: Department of Biosciences, Molecular Cell Biology of Plants
- Yangjie Hu: Department of Biosciences, Molecular Cell Biology of Plants
- Enrico Schleiff: Department of Biosciences, Molecular Cell Biology of Plants; Cluster of Excellence Frankfurt; Buchmann Institute for Molecular Life Sciences (BMLS), Goethe University, D-60438 Frankfurt am Main, Germany
- Stefan Simm: Department of Biosciences, Molecular Cell Biology of Plants; Cluster of Excellence Frankfurt
|
19
|
de Oliveira LL, Freitas AA, Tinós R. Multi-objective genetic algorithms in the study of the genetic code’s adaptability. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2017.10.022] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Indexed: 02/04/2023]
|
20
|
Loviglio MN, Arbogast T, Jønch AE, Collins SC, Popadin K, Bonnet CS, Giannuzzi G, Maillard AM, Jacquemont S, Yalcin B, Katsanis N, Golzio C, Reymond A. The Immune Signaling Adaptor LAT Contributes to the Neuroanatomical Phenotype of 16p11.2 BP2-BP3 CNVs. Am J Hum Genet 2017; 101:564-577. [PMID: 28965845 PMCID: PMC5630231 DOI: 10.1016/j.ajhg.2017.08.016] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 06/21/2017] [Accepted: 08/21/2017] [Indexed: 02/04/2023] Open
Abstract
Copy-number changes in 16p11.2 contribute significantly to neuropsychiatric traits. Besides the 600 kb BP4-BP5 CNV found in 0.5%-1% of individuals with autism spectrum disorders and schizophrenia and whose rearrangement causes reciprocal defects in head size and body weight, a second distal 220 kb BP2-BP3 CNV is likewise a potent driver of neuropsychiatric, anatomical, and metabolic pathologies. These two CNVs are engaged in complex reciprocal chromatin looping, intimating a functional relationship between genes in these regions that might be relevant to pathomechanism. We assessed the drivers of the distal 16p11.2 duplication by overexpressing each of the nine encompassed genes in zebrafish. Only overexpression of LAT induced a reduction of brain proliferating cells and concomitant microcephaly. Consistently, suppression of the zebrafish ortholog induced an increase of proliferation and macrocephaly. These phenotypes were not unique to zebrafish; Lat knockout mice show brain volumetric changes. Consistent with the hypothesis that LAT dosage is relevant to the CNV pathology, we observed similar effects upon overexpression of CD247 and ZAP70, encoding members of the LAT signalosome. We also evaluated whether LAT was interacting with KCTD13, MVP, and MAPK3, major driver and modifiers of the proximal 16p11.2 600 kb BP4-BP5 syndromes, respectively. Co-injected embryos exhibited an increased microcephaly, suggesting the presence of genetic interaction. Correspondingly, carriers of 1.7 Mb BP1-BP5 rearrangements that encompass both the BP2-BP3 and BP4-BP5 loci showed more severe phenotypes. Taken together, our results suggest that LAT, besides its well-recognized function in T cell development, is a major contributor of the 16p11.2 220 kb BP2-BP3 CNV-associated neurodevelopmental phenotypes.
MESH Headings
- Adaptor Proteins, Signal Transducing/genetics
- Adaptor Proteins, Signal Transducing/metabolism
- Adaptor Proteins, Signal Transducing/physiology
- Adolescent
- Adult
- Aged
- Aged, 80 and over
- Animals
- Autistic Disorder/genetics
- Autistic Disorder/immunology
- Autistic Disorder/pathology
- Brain/metabolism
- Brain/pathology
- Child
- Child, Preschool
- Chromosome Deletion
- Chromosome Disorders/genetics
- Chromosome Disorders/immunology
- Chromosome Disorders/pathology
- Chromosomes, Human, Pair 16/genetics
- Chromosomes, Human, Pair 16/immunology
- Cohort Studies
- DNA Copy Number Variations
- Embryo, Nonmammalian/metabolism
- Embryo, Nonmammalian/pathology
- Female
- Gene Expression Regulation, Developmental
- Humans
- Infant
- Intellectual Disability/genetics
- Intellectual Disability/immunology
- Intellectual Disability/pathology
- Male
- Membrane Proteins/genetics
- Membrane Proteins/metabolism
- Membrane Proteins/physiology
- Mice
- Mice, Inbred C57BL
- Mice, Knockout
- Microcephaly/genetics
- Microcephaly/pathology
- Middle Aged
- Phenotype
- Phosphoproteins/physiology
- Signal Transduction
- Young Adult
- Zebrafish/embryology
- Zebrafish/genetics
- Zebrafish Proteins/genetics
- Zebrafish Proteins/metabolism
Affiliation(s)
- Maria Nicla Loviglio: Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
- Thomas Arbogast: Center for Human Disease Modeling, Duke University, Durham, NC 27701, USA
- Aia Elise Jønch: Service of Medical Genetics, Lausanne University Hospital (CHUV), 1011 Lausanne, Switzerland
- Stephan C Collins: Institut de Génétique et de Biologie Moléculaire et Cellulaire, Department of Translational Medicine and Neurogenetics; Centre National de la Recherche Scientifique, UMR7104; Institut National de la Santé et de la Recherche Médicale, U964; Université de Strasbourg, 67400 Illkirch-Graffenstaden, France
- Konstantin Popadin: Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland; Immanuel Kant Baltic Federal University, 14 A. Nevskogo ul., Kaliningrad 236041, Russia
- Camille S Bonnet: Center for Human Disease Modeling, Duke University, Durham, NC 27701, USA
- Giuliana Giannuzzi: Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
- Anne M Maillard: Service of Medical Genetics, Lausanne University Hospital (CHUV), 1011 Lausanne, Switzerland
- Sébastien Jacquemont: Service of Medical Genetics, Lausanne University Hospital (CHUV), 1011 Lausanne, Switzerland
- Binnaz Yalcin: Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland; Institut de Génétique et de Biologie Moléculaire et Cellulaire, Department of Translational Medicine and Neurogenetics; Centre National de la Recherche Scientifique, UMR7104; Institut National de la Santé et de la Recherche Médicale, U964; Université de Strasbourg, 67400 Illkirch-Graffenstaden, France
- Nicholas Katsanis: Center for Human Disease Modeling, Duke University, Durham, NC 27701, USA
- Christelle Golzio: Center for Human Disease Modeling, Duke University, Durham, NC 27701, USA
- Alexandre Reymond: Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
|
21
|
Penalva LO, Sanford JR. From mechanisms to therapy: RNA processing's impact on human genetics. Hum Genet 2017; 136:1013-1014. [PMID: 28866814 DOI: 10.1007/s00439-017-1841-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/25/2022]
Affiliation(s)
- Luiz O Penalva: Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, San Antonio, TX, 78229, USA
- Jeremy R Sanford: Department of Molecular, Cellular and Developmental Biology, University of California Santa Cruz, Santa Cruz, CA, 95060, USA
|
22
|
Reviewing evidence for systematic transcriptional deletions, nucleotide exchanges, and expanded codons, and peptide clusters in human mitochondria. Biosystems 2017; 160:10-24. [PMID: 28807694 DOI: 10.1016/j.biosystems.2017.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 03/29/2017] [Revised: 07/26/2017] [Accepted: 08/04/2017] [Indexed: 12/12/2022]
Abstract
Polymerization sometimes transforms sequences by (a) systematic deletions of mono-, dinucleotides after trinucleotides, or (b) 23 systematic nucleotide exchanges (9 symmetric, X<>Y, e.g. G<>T, 14 asymmetric, X > Y > Z > X, e.g. A > G > T > A), producing del- and swinger RNAs. Some peptides correspond to del- and swinger RNA translations, also according to tetracodons, codons expanded by a silent nucleotide. Here new analyses assume different proteolytic patterns, partially alleviating false negative peptide detection biases, expanding noncanonical mitoproteome profiles. Mito-genomic, -transcriptomic and -proteomic evidence for noncanonical transcriptions and translations is reviewed and clusters of del- and swinger peptides (also along tetracodons) are described. Noncanonical peptide clusters indicate regulated expression of cryptically encoded mitochondrial protein coding genes. These candidate noncanonical proteins don't resemble known proteins.
|
23
|
Drift Barriers to Quality Control When Genes Are Expressed at Different Levels. Genetics 2016; 205:397-407. [PMID: 27838629 DOI: 10.1534/genetics.116.192567] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Received: 06/11/2016] [Accepted: 11/02/2016] [Indexed: 11/18/2022] Open
Abstract
Gene expression is imperfect, sometimes leading to toxic products. Solutions take two forms: globally reducing error rates, or ensuring that the consequences of erroneous expression are relatively harmless. The latter is optimal, but because it must evolve independently at so many loci, it is subject to a stringent "drift barrier": a limit to how weak the effects of a deleterious mutation s can be while still being effectively purged by selection, expressed in terms of the population size N of an idealized population such that purging requires s < -1/N. In previous work, only large populations evolved the optimal local solution, small populations instead evolved globally low error rates, and intermediate populations were bistable, with either solution possible. Here, we take into consideration the fact that the effectiveness of purging varies among loci, because of variation in gene expression level, and variation in the intrinsic vulnerabilities of different gene products to error. The previously found dichotomy between the two kinds of solution breaks down, replaced by a gradual transition as a function of population size. In the extreme case of a small enough population, selection fails to maintain even the global solution against deleterious mutations, explaining the nonmonotonic relationship between effective population size and transcriptional error rate that was recently observed in experiments on Escherichia coli, Caenorhabditis elegans, and Buchnera aphidicola.
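The drift-barrier condition stated in this abstract, s < -1/N, can be made concrete with a short Python sketch. It encodes only that inequality; the function names are hypothetical, and the sketch does not reproduce the paper's model of expression-dependent selection.

```python
def effectively_purged(s, n):
    """Return True if a deleterious mutation with selection coefficient s
    (s < 0) lies beyond the drift barrier for population size n, i.e. s < -1/n."""
    if n <= 0:
        raise ValueError("population size must be positive")
    return s < -1.0 / n


def minimum_population_size(s):
    """Population size above which a deleterious mutation with coefficient s
    satisfies s < -1/N (purging requires N > -1/s)."""
    if s >= 0:
        raise ValueError("s must be negative for a deleterious mutation")
    return -1.0 / s
```

Under this condition a mutation with s = -0.001 is purged in a population of 10,000 but not in a population of 100, which is why weakly selected local solutions are expected only in large populations.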
|
24
|
Codon Distribution in Error-Detecting Circular Codes. Life (Basel) 2016; 6:life6010014. [PMID: 26999215 PMCID: PMC4810245 DOI: 10.3390/life6010014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Received: 12/15/2015] [Revised: 02/24/2016] [Accepted: 03/10/2016] [Indexed: 11/17/2022] Open
Abstract
In 1957, Francis Crick et al. suggested an ingenious explanation for the process of frame maintenance. The idea was based on the notion of comma-free codes. Although Crick’s hypothesis proved to be wrong, in 1996, Arquès and Michel discovered the existence of a weaker version of such codes in eukaryote and prokaryote genomes, namely the so-called circular codes. Since then, circular code theory has invariably evoked great interest and made significant progress. In this article, the codon distributions in maximal comma-free, maximal self-complementary C3 and maximal self-complementary circular codes are discussed, i.e., we investigate in how many of such codes a given codon participates. As the main (and surprising) result, it is shown that the codons can be separated into very few classes (three, or five, or six) with respect to their frequency. Moreover, the distribution classes can be hierarchically ordered as refinements from maximal comma-free codes via maximal self-complementary C3 codes to maximal self-complementary circular codes.
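The comma-free property mentioned in this abstract can be checked mechanically. The Python sketch below uses the usual definition: a set of codons is comma-free if, for every ordered pair of its codons (including a codon paired with itself), neither of the two frame-shifted trinucleotides of their concatenation belongs to the set. The function name is hypothetical, and the weaker circular-code property is not covered.

```python
from itertools import product


def is_comma_free(code):
    """Return True if the set of trinucleotides is a comma-free code."""
    codons = {c.upper() for c in code}
    if any(len(c) != 3 for c in codons):
        raise ValueError("all words must be trinucleotides")
    for x, y in product(codons, repeat=2):
        pair = x + y
        # Trinucleotides read in the two shifted frames of the pair.
        if pair[1:4] in codons or pair[2:5] in codons:
            return False
    return True
```

For example, `is_comma_free({"ACG", "TCG"})` holds, whereas `{"AAA"}` fails because reading `AAAAAA` in a shifted frame yields `AAA` again.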
|
25
|
Deng W, Babu IR, Su D, Yin S, Begley TJ, Dedon PC. Trm9-Catalyzed tRNA Modifications Regulate Global Protein Expression by Codon-Biased Translation. PLoS Genet 2015; 11:e1005706. [PMID: 26670883 PMCID: PMC4689569 DOI: 10.1371/journal.pgen.1005706] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Received: 08/06/2015] [Accepted: 11/06/2015] [Indexed: 12/30/2022] Open
Abstract
Post-transcriptional modifications of transfer RNAs (tRNAs) have long been recognized to play crucial roles in regulating the rate and fidelity of translation. However, the extent to which they determine global protein production remains poorly understood. Here we use quantitative proteomics to show a direct link between wobble uridine 5-methoxycarbonylmethyl (mcm5) and 5-methoxy-carbonyl-methyl-2-thio (mcm5s2) modifications catalyzed by tRNA methyltransferase 9 (Trm9) in tRNAArg(UCU) and tRNAGlu(UUC) and selective translation of proteins from genes enriched with their cognate codons. Controlling for bias in protein expression and alterations in mRNA expression, we find that loss of Trm9 selectively impairs expression of proteins from genes enriched with AGA and GAA codons under both normal and stress conditions. Moreover, we show that AGA and GAA codons occur with high frequency in clusters along the transcripts, which may play a role in modulating translation. Consistent with these results, proteins subject to enhanced ribosome pausing in yeast lacking mcm5U and mcm5s2U are more likely to be down-regulated and contain a larger number of AGA/GAA clusters. Together, these results suggest that Trm9-catalyzed tRNA modifications play a significant role in regulating protein expression within the cell. Here we present evidence for a more complicated role for transfer RNAs (tRNAs) than as mere adapters that link the genetic code in messenger RNA (mRNA) to the amino acid sequence of a protein during translation. tRNAs have long been known to be modified with dozens of different chemical structures other than the 4 canonical ribonucleosides, though the role of these modifications in controlling translation is poorly understood. By quantifying the expression of thousands of proteins in the yeast S. cerevisiae, we identified a mechanistic link between modified ribonucleosides located at the wobble position of two tRNAs, tRNAArg(UCU) and tRNAGlu(UUC), and the translation of proteins derived from genes enriched with codons read by these tRNAs: AGA and GAA. In cells lacking the enzyme that inserts these modifications, tRNA methyltransferase 9 (Trm9), we found a significant reduction in proteins from genes enriched with AGA and GAA codons and with runs of these codons. Also, mRNAs enriched with runs of AGA and GAA codons are subject to stalled translation on ribosomes in yeast lacking mcm5U and mcm5s2U. Together, these results reveal a distinct role for Trm9-catalyzed tRNA modifications in selectively regulating the expression of proteins enriched with AGA and GAA codons.
Affiliation(s)
- Wenjun Deng: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- I. Ramesh Babu: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Dan Su: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Shanye Yin: Department of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
- Thomas J. Begley: SUNY College of Nanoscale Science and Engineering, Albany, New York, United States of America; RNA Institute and Cancer Research Center, University at Albany, State University of New York, Albany, New York, United States of America
- Peter C. Dedon: Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America; Singapore-MIT Alliance for Research and Technology, Singapore
|
26
|
Evolution of Robustness to Protein Mistranslation by Accelerated Protein Turnover. PLoS Biol 2015; 13:e1002291. [PMID: 26544557 PMCID: PMC4636289 DOI: 10.1371/journal.pbio.1002291] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Received: 04/15/2015] [Accepted: 09/30/2015] [Indexed: 11/19/2022] Open
Abstract
Translational errors occur at high rates, and they influence organism viability and the onset of genetic diseases. To investigate how organisms mitigate the deleterious effects of protein synthesis errors during evolution, a mutant yeast strain was engineered to translate a codon ambiguously (mistranslation). It thereby overloads the protein quality-control pathways and disrupts cellular protein homeostasis. This strain was used to study the capacity of the yeast genome to compensate the deleterious effects of protein mistranslation. Laboratory evolutionary experiments revealed that fitness loss due to mistranslation can rapidly be mitigated. Genomic analysis demonstrated that adaptation was primarily mediated by large-scale chromosomal duplication and deletion events, suggesting that errors during protein synthesis promote the evolution of genome architecture. By altering the dosages of numerous, functionally related proteins simultaneously, these genetic changes introduced large phenotypic leaps that enabled rapid adaptation to mistranslation. Evolution increased the level of tolerance to mistranslation through acceleration of ubiquitin-proteasome–mediated protein degradation and protein synthesis. As a consequence of rapid elimination of erroneous protein products, evolution reduced the extent of toxic protein aggregation in mistranslating cells. However, there was a strong evolutionary trade-off between adaptation to mistranslation and survival upon starvation: the evolved lines showed fitness defects and impaired capacity to degrade mature ribosomes upon nutrient limitation. Moreover, as a response to an enhanced energy demand of accelerated protein turnover, the evolved lines exhibited increased glucose uptake by selective duplication of hexose transporter genes. We conclude that adjustment of proteome homeostasis to mistranslation evolves rapidly, but this adaptation has several side effects on cellular physiology. 
Our work also indicates that translational fidelity and the ubiquitin-proteasome system are functionally linked to each other and may, therefore, co-evolve in nature. Tolerance to errors during protein synthesis evolves rapidly through acceleration of protein turnover—a process determined by the combined rates of protein synthesis and degradation. However, this adaptation has deleterious side effects due to its energy costs. Although fidelity of information transfer has a substantial impact on cellular survival, many steps in protein production are strikingly error-prone. Such errors during protein synthesis can have a substantial influence on viability and the onset of genetic diseases. These considerations raise the question as to how organisms can tolerate errors during protein synthesis. In this paper, for the first time, we study organisms’ capacity to evolve robustness against mistranslation and explore the underlying cellular mechanisms. A mutant yeast strain was engineered to translate a codon ambiguously (mistranslation). This thereby overloads the protein quality-control pathways and disrupts cellular protein homeostasis. This strain was used to study the capacity of the yeast genome to compensate for the deleterious effects of protein mistranslation. We found that mistranslation led to rapid evolution of genomic rearrangements, including chromosomal duplications and deletions. By altering the dosages of numerous, functionally related proteins simultaneously, these genetic changes introduce large phenotypic leaps that enable adaptation to mistranslation. Robustness against mistranslation during laboratory evolution was achieved through acceleration of protein turnover—a process that was determined by the combined rates of protein synthesis and ubiquitin-proteasome system-mediated degradation. 
However, as both translation and active degradation of proteins are exceptionally energy-consuming cellular processes, accelerated proteome turnover has substantial energy costs.
|
27
|
Yanagida H, Gispan A, Kadouri N, Rozen S, Sharon M, Barkai N, Tawfik DS. The Evolutionary Potential of Phenotypic Mutations. PLoS Genet 2015; 11:e1005445. [PMID: 26244544 PMCID: PMC4526572 DOI: 10.1371/journal.pgen.1005445] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Received: 03/19/2015] [Accepted: 07/15/2015] [Indexed: 01/08/2023] Open
Abstract
Errors in protein synthesis, so-called phenotypic mutations, are orders-of-magnitude more frequent than genetic mutations. Here, we provide direct evidence that alternative protein forms and phenotypic variability derived from translational errors paved the path to genetic, evolutionary adaptations via gene duplication. We explored the evolutionary origins of Saccharomyces cerevisiae IDP3 - an NADP-dependent isocitrate dehydrogenase mediating fatty acids β-oxidation in the peroxisome. Following the yeast whole genome duplication, IDP3 diverged from a cytosolic ancestral gene by acquisition of a C-terminal peroxisomal targeting signal. We discovered that the pre-duplicated cytosolic IDPs are partially localized to the peroxisome owing to +1 translational frameshifts that bypass the stop codon and unveil cryptic peroxisomal targeting signals within the 3'-UTR. Exploring putative cryptic signals in all 3'-UTRs of yeast genomes, we found that other enzymes related to NADPH production such as pyruvate carboxylase 1 (PYC1) might be prone to peroxisomal localization via cryptic signals. Using laboratory evolution we found that these translational frameshifts are rapidly imprinted via genetic single base deletions occurring within the very same gene location. Further, as exemplified here, the sequences that promote translational frameshifts are also more prone to genetic deletions. Thus, genotypes conferring higher phenotypic variability not only meet immediate challenges by unveiling cryptic 3'-UTR sequences, but also boost the potential for future genetic adaptations.
Affiliation(s)
- Hayato Yanagida: Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- Ariel Gispan: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Noam Kadouri: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Shelly Rozen: Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- Michal Sharon: Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- Naama Barkai: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Dan S. Tawfik: Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
|
28
|
Abstract
The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what determines functional constraint has remained unclear. The increasing availability of genomic data has enabled much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses has identified multiple mechanisms behind these observations and demonstrated a prominent role in protein evolution of selection against errors in molecular and cellular processes.
Affiliation(s)
- Jianzhi Zhang: Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
- Jian-Rong Yang: Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
|
29
|
Schüler A, Ghanbarian AT, Hurst LD. Purifying selection on splice-related motifs, not expression level nor RNA folding, explains nearly all constraint on human lincRNAs. Mol Biol Evol 2014; 31:3164-83. [PMID: 25158797 PMCID: PMC4245815 DOI: 10.1093/molbev/msu249] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Indexed: 12/28/2022] Open
Abstract
There are two strong and equally important predictors of rates of human protein evolution: The amount the gene is expressed and the proportion of exonic sequence devoted to control splicing, mediated largely by selection on exonic splice enhancer (ESE) motifs. Is the same true for noncoding RNAs, known to be under very weak purifying selection? Prior evidence suggests that selection at splice sites in long intergenic noncoding RNAs (lincRNAs) is important. We now report multiple lines of evidence indicating that the great majority of purifying selection operating on lincRNAs in humans is splice related. Splice-related parameters explain much of the between-gene variation in evolutionary rate in humans. Expression rate is not a relevant predictor, although expression breadth is weakly so. In contrast to protein-coding RNAs, we observe no relationship between evolutionary rate and lincRNA stability. As in protein-coding genes, ESEs are especially abundant near splice junctions and evolve slower than non-ESE sequence equidistant from boundaries. Nearly all constraint in lincRNAs is at exon ends (N.B. the same is not witnessed in Drosophila). Although we cannot definitely answer the question as to why splice-related selection is so important, we find no evidence that splicing might enable the nonsense-mediated decay pathway to capture transcripts incorrectly processed by ribosomes. We find evidence consistent with the notion that splicing modifies the underlying chromatin through recruitment of splice-coupled chromatin modifiers, such as CHD1, which in turn might modulate neighbor gene activity. We conclude that most selection on human lincRNAs is splice mediated and suggest that the possibility of splice-chromatin coupling is worthy of further scrutiny.
Collapse
Affiliation(s)
- Andreas Schüler
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Avazeh T Ghanbarian
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| |
|
30
|
Seligmann H, Labra A. Tetracoding increases with body temperature in Lepidosauria. Biosystems 2013; 114:155-63. [DOI: 10.1016/j.biosystems.2013.09.002] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Received: 07/11/2013] [Revised: 09/04/2013] [Accepted: 09/05/2013] [Indexed: 10/26/2022]
|
31
|
Xu G, Liu B, Wang F, Wei C, Zhang Y, Sheng J, Wang G, Li F. High-throughput screen of essential gene modules in Mycobacterium tuberculosis: a bibliometric approach. BMC Infect Dis 2013; 13:227. [PMID: 23687949 PMCID: PMC3680244 DOI: 10.1186/1471-2334-13-227] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Received: 04/28/2012] [Accepted: 05/15/2013] [Indexed: 01/24/2023] Open
Abstract
Background: Tuberculosis (TB) is an infectious disease caused by Mycobacterium tuberculosis (M. tuberculosis). The annotation of the functional genome and signaling network of M. tuberculosis is still not systematic. Essential gene modules are collections of functionally related essential genes in the same signaling or metabolic pathway. Determining essential genes and essential gene modules at the genomic level may be important for better understanding the physiology and pathology of M. tuberculosis, and may also help the development of drugs against this pathogen. The establishment of the genomic operon database (DOOR) and the annotation of gene pathways have facilitated the genomic analysis of the essential gene modules of M. tuberculosis. Method: A bibliometric approach was used to perform a high-throughput screen for essential genes of M. tuberculosis strain H37Rv. An ant colony algorithm was used to identify the essential genes in other M. tuberculosis reference strains. Essential gene modules were analyzed with the operon database DOOR. The pathways of essential genes were assessed with Biocarta, KEGG, NCI-PID, HumanCyc and Reactome. Functions of essential genes were predicted with Pfam. Results: A total of approximately 700 essential genes were identified in the M. tuberculosis genome. 40% of operons consisted of two or more essential genes. The essential genes were distributed across 92 pathways in M. tuberculosis. In function prediction, 61.79% of essential genes were categorized into virulence, intermediary metabolism/respiration, cell wall related and lipid metabolism, which are fundamental functions that exist in most bacterial species. Conclusion: We have identified the essential genes of M. tuberculosis at the genomic level using a bibliometric approach. The essential gene modules were further identified and analyzed.
Affiliation(s)
- Guangyu Xu
- Key Laboratory of Zoonosis, Ministry of Education, Norman Bethune College of Medicine, Jilin University, Changchun, Jilin, China
| |
|
32
|
Seligmann H. Polymerization of non-complementary RNA: systematic symmetric nucleotide exchanges mainly involving uracil produce mitochondrial RNA transcripts coding for cryptic overlapping genes. Biosystems 2013; 111:156-74. [PMID: 23410796 DOI: 10.1016/j.biosystems.2013.01.011] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Received: 10/24/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 12/23/2022]
Abstract
Usual DNA→RNA transcription exchanges T→U. Assuming different systematic symmetric nucleotide exchanges during transcription, some GenBank RNAs match exactly human mitochondrial sequences (exchange rules listed in decreasing transcript frequencies): C↔U, A↔U, A↔U+C↔G (two nucleotide pairs exchanged), G↔U, A↔G, C↔G, none for A↔C, A↔G+C↔U, and A↔C+G↔U. Most unusual transcripts involve exchanging uracil. Independent measures of rates of rare replicational enzymatic DNA nucleotide misinsertions predict frequencies of RNA transcripts systematically exchanging the corresponding misinserted nucleotides. Exchange transcripts self-hybridize less than other gene regions, and self-hybridization increases with length, suggesting endoribonuclease-limited elongation. BLAST detects stop codon-depleted putative protein coding overlapping genes within exchange-transcribed mitochondrial genes. These align with existing GenBank proteins (mainly metazoan origins, prokaryotic and viral origins underrepresented). These GenBank proteins frequently interact with RNA/DNA, are membrane transporters, or are typical of mitochondrial metabolism. Nucleotide exchange transcript frequencies increase with overlapping gene densities and stop densities, indicating finely tuned counterbalancing regulation of expression of systematic symmetric nucleotide exchange-encrypted proteins. Such expression necessitates combined activities of suppressor tRNAs matching stops, and nucleotide exchange transcription. Two independent properties confirm predicted exchanged overlap coding genes: discrepancy of third codon nucleotide contents from replicational deamination gradients, and codon usage according to circular code predictions. Predictions from both properties converge, especially for frequent nucleotide exchange types. Nucleotide exchanging transcription apparently increases coding densities of protein coding genes without lengthening genomes, revealing unsuspected functional DNA coding potential.
Affiliation(s)
- Hervé Seligmann
- National Natural History Museum Collections, The Hebrew University of Jerusalem, 91904 Jerusalem, Israel.
| |
|
33
|
The layout of a bacterial genome. FEBS Lett 2012; 586:2043-8. [DOI: 10.1016/j.febslet.2012.03.051] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Received: 03/02/2012] [Revised: 03/25/2012] [Accepted: 03/26/2012] [Indexed: 12/25/2022]
|
34
|
|