1
|
Radrizzani S, Kudla G, Izsvák Z, Hurst LD. Selection on synonymous sites: the unwanted transcript hypothesis. Nat Rev Genet 2024; 25:431-448. [PMID: 38297070 DOI: 10.1038/s41576-023-00686-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/04/2023] [Indexed: 02/02/2024]
Abstract
Although translational selection to favour codons that match the most abundant tRNAs is not readily observed in humans, there is nonetheless selection in humans on synonymous mutations. We hypothesize that much of this synonymous site selection can be explained in terms of protection against unwanted RNAs - spurious transcripts, mis-spliced forms or RNAs derived from transposable elements or viruses. We propose not only that selection on synonymous sites functions to reduce the rate of creation of unwanted transcripts (for example, through selection on exonic splice enhancers and cryptic splice sites) but also that high-GC content (but low-CpG content), together with intron presence and position, is both particular to functional native mRNAs and used to recognize transcripts as native. In support of this hypothesis, transcription, nuclear export, liquid phase condensation and RNA degradation have all recently been shown to promote GC-rich transcripts and suppress AU/CpG-rich ones. With such 'traps' being set against AU/CpG-rich transcripts, the codon usage of native genes has, in turn, evolved to avoid such suppression. That parallel filters against AU/CpG-rich transcripts also affect the endosomal import of RNAs further supports the unwanted transcript hypothesis of synonymous site selection and explains the similar design rules that have enabled the successful use of transgenes and RNA vaccines.
Collapse
Affiliation(s)
- Sofia Radrizzani
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK
- Milner Therapeutics Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
| | - Grzegorz Kudla
- MRC Human Genetics Unit, Institute for Genetics and Cancer, The University of Edinburgh, Edinburgh, UK
| | - Zsuzsanna Izsvák
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Society, Berlin, Germany
| | - Laurence D Hurst
- Milner Centre for Evolution, Department of Life Sciences, University of Bath, Bath, UK.
| |
Collapse
|
2
|
Picard MAL, Leblay F, Cassan C, Willemsen A, Daron J, Bauffe F, Decourcelle M, Demange A, Bravo IG. Transcriptomic, proteomic, and functional consequences of codon usage bias in human cells during heterologous gene expression. Protein Sci 2023; 32:e4576. [PMID: 36692287 PMCID: PMC9926478 DOI: 10.1002/pro.4576] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/12/2023] [Accepted: 01/14/2023] [Indexed: 01/25/2023]
Abstract
Differences in codon frequency between genomes, genes, or positions along a gene, modulate transcription and translation efficiency, leading to phenotypic and functional differences. Here, we present a multiscale analysis of the effects of synonymous codon recoding during heterologous gene expression in human cells, quantifying the phenotypic consequences of codon usage bias at different molecular and cellular levels, with an emphasis on translation elongation. Six synonymous versions of an antibiotic resistance gene were generated, fused to a fluorescent reporter, and independently expressed in HEK293 cells. Multiscale phenotype was analyzed by means of quantitative transcriptome and proteome assessment, as proxies for gene expression; cellular fluorescence, as a proxy for single-cell level expression; and real-time cell proliferation in absence or presence of antibiotic, as a proxy for the cell fitness. We show that differences in codon usage bias strongly impact the molecular and cellular phenotype: (i) they result in large differences in mRNA levels and protein levels, leading to differences of over 15 times in translation efficiency; (ii) they introduce unpredicted splicing events; (iii) they lead to reproducible phenotypic heterogeneity; and (iv) they lead to a trade-off between the benefit of antibiotic resistance and the burden of heterologous expression. In human cells in culture, codon usage bias modulates gene expression by modifying mRNA availability and suitability for translation, leading to differences in protein levels and eventually eliciting functional phenotypic changes.
Collapse
Affiliation(s)
- Marion A. L. Picard
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Fiona Leblay
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Cécile Cassan
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Anouk Willemsen
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Josquin Daron
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Frédérique Bauffe
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Mathilde Decourcelle
- BioCampus Montpellier (University of Montpellier, CNRS, INSERM)MontpellierFrance
| | - Antonin Demange
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| | - Ignacio G. Bravo
- French National Center for Scientific ResearchLaboratory MIVEGEC (CNRS, IRD, University of Montpellier)MontpellierFrance
| |
Collapse
|
3
|
Xiao G, Zhou J, Huo Z, Wu T, Li Y, Li Y, Wang Y, Wang M. The Shift in Synonymous Codon Usage Reveals Similar Genomic Variation during Domestication of Asian and African Rice. Int J Mol Sci 2022; 23:12860. [PMID: 36361651 PMCID: PMC9656316 DOI: 10.3390/ijms232112860] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 10/20/2022] [Accepted: 10/24/2022] [Indexed: 10/29/2023] Open
Abstract
The domestication of wild rice occurred together with genomic variation, including the synonymous nucleotide substitutions that result in synonymous codon usage bias (SCUB). SCUB mirrors the evolutionary specialization of plants, but its characteristics during domestication were not yet addressed. Here, we found cytosine- and guanidine-ending (NNC and NNG) synonymous codons (SCs) were more pronounced than adenosine- and thymine-ending SCs (NNA and NNT) in both wild and cultivated species of Asian and African rice. The ratios of NNC/G to NNA/T codons gradually decreased following the rise in the number of introns, and the preference for NNA/T codons became more obvious in genes with more introns in cultivated rice when compared with those in wild rice. SCUB frequencies were heterogeneous across the exons, with a higher preference for NNA/T in internal exons than in terminal exons. The preference for NNA/T in internal but not terminal exons was more predominant in cultivated rice than in wild rice, with the difference between wild and cultivated rice becoming more remarkable with the rise in exon numbers. The difference in the ratios of codon combinations representing DNA methylation-mediated conversion from cytosine to thymine between wild and cultivated rice coincided with their difference in SCUB frequencies, suggesting that SCUB reveals the possible association between genetic and epigenetic variation during the domestication of rice. Similar patterns of SCUB shift in Asian and African rice indicate that genomic variation occurs in the same non-random manner. SCUB representing non-neutral synonymous mutations can provide insight into the mechanism of genomic variation in domestication and can be used for the genetic dissection of agricultural traits in rice and other crops.
Collapse
Affiliation(s)
- Guilian Xiao
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Junzhi Zhou
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Zhiheng Huo
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Tong Wu
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Yingchun Li
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Yajing Li
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| | - Yanxia Wang
- Shijiazhuang Academy of Agriculture and Forestry Sciences, Shijiazhuang 050041, China
| | - Mengcheng Wang
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao 266237, China
| |
Collapse
|
4
|
Tian G, Xiao G, Wu T, Zhou J, Xu W, Wang Y, Xia G, Wang M. Alteration of synonymous codon usage bias accompanies polyploidization in wheat. Front Genet 2022; 13:979902. [PMID: 36313462 PMCID: PMC9614214 DOI: 10.3389/fgene.2022.979902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 10/03/2022] [Indexed: 11/13/2022] Open
Abstract
The diploidization of polyploid genomes is accompanied by genomic variation, including synonymous nucleotide substitutions that may lead to synonymous codon usage bias (SCUB). SCUB can mirror the evolutionary specialization of plants, but its effect on the formation of polyploidies is not well documented. We explored this issue here with hexaploid wheat and its progenitors. Synonymous codons (SCs) ending in either cytosine (NNC) or guanidine (NNG) were more frequent than those ending in either adenosine (NNA) or thymine (NNT), and the preference for NNC/G codons followed the increase in genome ploidy. The ratios between NNC/G and NNA/T codons gradually decreased in genes with more introns, and the difference in these ratios between wheat and its progenitors diminished with increasing ploidy. SCUB frequencies were heterogeneous among exons, and the bias preferred to NNA/T in more internal exons, especially for genes with more exons; while the preference did not appear to associate with ploidy. The SCUB alteration of the progenitors was different during the formation of hexaploid wheat, so that SCUB was the homogeneous among A, B and D subgenomes. DNA methylation-mediated conversion from cytosine to thymine weakened following the increase of genome ploidy, coinciding with the stronger bias for NNC/G SCs in the genome as a function of ploidy, suggesting that SCUB contribute to the epigenetic variation in hexaploid wheat. The patterns in SCUB mirrored the formation of hexaploid wheat, which provides new insight into genome shock-induced genetic variation during polyploidization. SCs representing non-neutral synonymous mutations can be used for genetic dissection and improvement of agricultural traits of wheat and other polyploidies.
Collapse
Affiliation(s)
- Geng Tian
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Guilian Xiao
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Tong Wu
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Junzhi Zhou
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Wenjing Xu
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Yanxia Wang
- Shijiazhuang Academy of Agriculture and Forestry Sciences, Shijiazhuang, China
| | - Guangmin Xia
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
| | - Mengcheng Wang
- The Key Laboratory of Plant Development and Environment Adaptation Biology, Ministry of Education, School of Life Science, Shandong University, Qingdao, China
- *Correspondence: Mengcheng Wang,
| |
Collapse
|
5
|
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Collapse
Affiliation(s)
- Alexander Thomas Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | | |
Collapse
|
6
|
Xu W, Li Y, Li Y, Liu C, Wang Y, Xia G, Wang M. Asymmetric Somatic Hybridization Affects Synonymous Codon Usage Bias in Wheat. Front Genet 2021; 12:682324. [PMID: 34178040 PMCID: PMC8226224 DOI: 10.3389/fgene.2021.682324] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 05/07/2021] [Indexed: 11/24/2022] Open
Abstract
Asymmetric somatic hybridization is an efficient strategy for crop breeding by introducing exogenous chromatin fragments, which leads to whole genomic shock and local chromosomal shock that induces genome-wide genetic variation including indel (insertion and deletion) and nucleotide substitution. Nucleotide substitution causes synonymous codon usage bias (SCUB), an indicator of genomic mutation and natural selection. However, how asymmetric somatic hybridization affects SCUB has not been addressed. Here, we explored this issue by comparing expressed sequence tags of a common wheat cultivar and its asymmetric somatic hybrid line. Asymmetric somatic hybridization affected SCUB and promoted the bias to A- and T-ending synonymous codon (SCs). SCUB frequencies in chromosomes introgressed with exogenous fragments were comparable to those in chromosomes without exogenous fragments, showing that exogenous fragments had no local chromosomal effect. Asymmetric somatic hybridization affected SCUB frequencies in indel-flanking sequences more strongly than in non-flanking sequences, and this stronger effect was present in both chromosomes with and without exogenous fragments. DNA methylation-driven SCUB shift was more pronounced than other SC pairs. SCUB shift was similar among seven groups of allelic chromosomes as well as three sub-genomes. Our work demonstrates that the SCUB shift induced by asymmetric somatic hybridization is attributed to the whole genomic shock, and DNA methylation is a putative force of SCUB shift during asymmetric somatic hybridization. Asymmetric somatic hybridization provides an available method for deepening the nature of SCUB shift and genetic variation induced by genomic shock.
Collapse
Affiliation(s)
- Wenjing Xu
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yingchun Li
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yajing Li
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Chun Liu
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Yanxia Wang
- Shijiazhuang Academy of Agriculture and Forestry Sciences, Shijiazhuang, China
| | - Guangmin Xia
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| | - Mengcheng Wang
- The Key Laboratory of Plant Development and Environmental Adaption, Ministry of Education, School of Life Science, Shandong University, Jinan, China
| |
Collapse
|
7
|
Castellano D, James J, Eyre-Walker A. Nearly Neutral Evolution across the Drosophila melanogaster Genome. Mol Biol Evol 2019; 35:2685-2694. [PMID: 30418639 DOI: 10.1093/molbev/msy164] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Under the nearly neutral theory of molecular evolution, the proportion of effectively neutral mutations is expected to depend upon the effective population size (Ne). Here, we investigate whether this is the case across the genome of Drosophila melanogaster using polymorphism data from North American and African lines. We show that the ratio of the number of nonsynonymous and synonymous polymorphisms is negatively correlated to the number of synonymous polymorphisms, even when the nonindependence is accounted for. The relationship is such that the proportion of effectively neutral nonsynonymous mutations increases by ∼45% as Ne is halved. However, we also show that this relationship is steeper than expected from an independent estimate of the distribution of fitness effects from the site frequency spectrum. We investigate a number of potential explanations for this and show, using simulation, that this is consistent with a model of genetic hitchhiking: Genetic hitchhiking depresses diversity at neutral and weakly selected sites, but has little effect on the diversity of strongly selected sites.
Collapse
Affiliation(s)
- David Castellano
- Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
| | - Jennifer James
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
8
|
Im EH, Choi SS. Synonymous Codon Usage Controls Various Molecular Aspects. Genomics Inform 2017; 15:123-127. [PMID: 29307137 PMCID: PMC5769864 DOI: 10.5808/gi.2017.15.4.123] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 09/25/2017] [Indexed: 12/19/2022] Open
Abstract
Synonymous sites are generally considered to be functionally neutral. However, there are recent contradictory findings suggesting that synonymous alleles might have functional roles in various molecular aspects. For instance, a recent study demonstrated that synonymous single nucleotide polymorphisms have a similar effect size as nonsynonymous single nucleotide polymorphisms in human disease association studies. Researchers have recognized synonymous codon usage bias (SCUB) in the genomes of almost all species and have investigated whether SCUB is due to random nucleotide compositional bias or to natural selection of any functional exposure generated by synonymous mutations. One of the most prominent observations on the non-neutrality of synonymous codons is the correlation between SCUB and levels of gene expression, such that highly expressed genes tend to have a higher preference toward so-called optimal codons than lowly expressed genes. In relation, it is known that amounts of cognate tRNAs that bind to optimal codons are significantly higher than the amounts of cognate tRNAs that bind to non-optimal codons in genomes. In the present paper, we review various functions that synonymous codons might have other than regulating expression levels.
Collapse
Affiliation(s)
- Eu-Hyun Im
- Division of Biomedical Convergence, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon 24341, Korea
| | - Sun Shim Choi
- Division of Biomedical Convergence, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon 24341, Korea
| |
Collapse
|
9
|
Mazumder GA, Uddin A, Chakraborty S. Comparative analysis of codon usage pattern and its influencing factors in Schistosoma japonicum and Ascaris suum. Acta Parasitol 2017; 62:748-761. [PMID: 29035868 DOI: 10.1515/ap-2017-0090] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Accepted: 07/21/2017] [Indexed: 12/30/2022]
Abstract
Schistosoma japonicum and Ascaris suum are considered as the major parasites of human which cause various life threatening diseases such as schistomiasis and ascariasis. The codon usage bias (CUB) is known as the phenomenon of more usage of a specific codon than the other synonymous codons for an amino acid. The factors that influence the codon usage bias are mutation pressure, natural selection, gene expression, gene length, GC content, RNA stability, recombination rates, codon position etc. Here we had used various bioinformatic tools and statistical analyses to understand the compositional features, expression level and codon usage bias in the genes of these two species.After estimating the effective number of codon (ENC) in both the species, codon usage bias was found to be low and gene expression was high. The nucleobase A and T were used most often than C and G. From neutrality plot and correspondence analysis it was found that both natural selection and mutation pressure played an important role in shaping the codon usage pattern of both species. Moreover, natural selection played a major role while mutation pressure played a minor role in shaping the codon usage bias in S. japonicum and A.suum. This is the first report on the codon usage biology in S. japonicum and A.suum, and the factors influencing their codon usage bias. These results are expected to be useful for genetic engineering and evolutionary studies.
Collapse
|
10
|
Zhao F, Yu CH, Liu Y. Codon usage regulates protein structure and function by affecting translation elongation speed in Drosophila cells. Nucleic Acids Res 2017; 45:8484-8492. [PMID: 28582582 PMCID: PMC5737824 DOI: 10.1093/nar/gkx501] [Citation(s) in RCA: 74] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Accepted: 05/26/2017] [Indexed: 11/14/2022] Open
Abstract
Codon usage biases are found in all eukaryotic and prokaryotic genomes and have been proposed to regulate different aspects of translation process. Codon optimality has been shown to regulate translation elongation speed in fungal systems, but its effect on translation elongation speed in animal systems is not clear. In this study, we used a Drosophila cell-free translation system to directly compare the velocity of mRNA translation elongation. Our results demonstrate that optimal synonymous codons speed up translation elongation while non-optimal codons slow down translation. In addition, codon usage regulates ribosome movement and stalling on mRNA during translation. Finally, we show that codon usage affects protein structure and function in vitro and in Drosophila cells. Together, these results suggest that the effect of codon usage on translation elongation speed is a conserved mechanism from fungi to animals that can affect protein folding in eukaryotic organisms.
Collapse
Affiliation(s)
- Fangzhou Zhao
- Department of Physiology, The University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
| | - Chien-Hung Yu
- Department of Physiology, The University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
| | - Yi Liu
- Department of Physiology, The University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
| |
Collapse
|
11
|
Pancsa R, Tompa P. Coding Regions of Intrinsic Disorder Accommodate Parallel Functions. Trends Biochem Sci 2016; 41:898-906. [DOI: 10.1016/j.tibs.2016.08.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2016] [Revised: 08/16/2016] [Accepted: 08/19/2016] [Indexed: 02/01/2023]
|
12
|
Genome-wide comparative analysis of codon usage bias and codon context patterns among cyanobacterial genomes. Mar Genomics 2016; 32:31-39. [PMID: 27733306 DOI: 10.1016/j.margen.2016.10.001] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 09/11/2016] [Accepted: 10/03/2016] [Indexed: 11/20/2022]
Abstract
With the increasing accumulation of genomic sequence information of prokaryotes, the study of codon usage bias has gained renewed attention. The purpose of this study was to examine codon selection pattern within and across cyanobacterial species belonging to diverse taxonomic orders and habitats. We performed detailed comparative analysis of cyanobacterial genomes with respect to codon bias. Our analysis reflects that in cyanobacterial genomes, A- and/or T-ending codons were used predominantly in the genes whereas G- and/or C-ending codons were largely avoided. Variation in the codon context usage of cyanobacterial genes corresponded to the clustering of cyanobacteria as per their GC content. Analysis of codon adaptation index (CAI) and synonymous codon usage order (SCUO) revealed that majority of genes are associated with low codon bias. Codon selection pattern in cyanobacterial genomes reflected compositional constraints as major influencing factor. It is also identified that although, mutational constraint may play some role in affecting codon usage bias in cyanobacteria, compositional constraint in terms of genomic GC composition coupled with environmental factors affected codon selection pattern in cyanobacterial genomes.
Collapse
|
13
|
Homma K, Noguchi T, Fukuchi S. Codon usage is less optimized in eukaryotic gene segments encoding intrinsically disordered regions than in those encoding structural domains. Nucleic Acids Res 2016; 44:10051-10061. [PMID: 27915289 PMCID: PMC5137448 DOI: 10.1093/nar/gkw899] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2016] [Revised: 09/15/2016] [Accepted: 09/29/2016] [Indexed: 12/14/2022] Open
Abstract
Codon usage tends to be optimized in highly expressed genes. A plausible explanation for this phenomenon is that translational accuracy is increased in highly expressed genes with infrequent use of rare codons. Besides structural domains (SDs), eukaryotic proteins generally have intrinsically disordered regions (IDRs) that by themselves do not assume unique three-dimensional structures. As IDRs are free from structural constraint, they can probably accommodate more translational errors than SDs can. Thus, codon usage in IDRs is likely to be less optimized than that in SDs. Codon usage in all the genes of seven eukaryotes was examined in terms of both tRNA adaptation index and codon adaptation index. Different amino acid compositions in different protein regions were taken into account in calculating expected adaptation indices, to which observed indices were compared. Codon usage is less optimized in gene regions encoding IDRs than in those corresponding to SDs. The finding does not depend on whether IDRs are located at the N-terminus, in the middle, or at the C-terminus of proteins. Furthermore, the observation remains unchanged in two different algorithms used to predict IDRs in proteins. The result is consistent with the idea that IDRs tolerate more translational errors than SDs.
Collapse
Affiliation(s)
- Keiichi Homma
- Department of Life Science and Informatics, Maebashi Institute of Technology, 460-1 Kamisadori-machi, Maebashi-shi 371-0816, Japan
| | - Tamotsu Noguchi
- Pharmaceutical Education Research Center, Meiji Pharmaceutical University, 2-522-1 Noshio, Kiyose, Tokyo 204-8588, Japan
| | - Satoshi Fukuchi
- Department of Life Science and Informatics, Maebashi Institute of Technology, 460-1 Kamisadori-machi, Maebashi-shi 371-0816, Japan
| |
Collapse
|
14
|
Holloway AK, Bruneau BG, Sukonnik T, Rubenstein JL, Pollard KS. Accelerated Evolution of Enhancer Hotspots in the Mammal Ancestor. Mol Biol Evol 2016; 33:1008-18. [PMID: 26715627 PMCID: PMC4776709 DOI: 10.1093/molbev/msv344] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
Mammals have evolved remarkably different sensory, reproductive, metabolic, and skeletal systems. To explore the genetic basis for these differences, we developed a comparative genomics approach to scan whole-genome multiple sequence alignments to identify regions that evolved rapidly in an ancestral lineage but are conserved within extant species. This pattern suggests that ancestral changes in function were maintained in descendants. After applying this test to therian mammals, we identified 4,797 accelerated regions, many of which are noncoding and located near developmental transcription factors. We then used mouse transgenic reporter assays to test if noncoding accelerated regions are enhancers and to determine how therian-specific substitutions affect their activity in vivo. We discovered enhancers with expression specific to the therian version in brain regions involved in the hormonal control of milk ejection, uterine contractions, blood pressure, temperature, and visual processing. This work underscores the idea that changes in developmental gene expression are important for mammalian evolution, and it pinpoints candidate genes for unique aspects of mammalian biology.
Collapse
Affiliation(s)
- Alisha K Holloway
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA
| | - Benoit G Bruneau
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Pediatrics and Cardiovascular Research Institute, University of California, San Francisco, CA
| | - Tatyana Sukonnik
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA
| | - John L Rubenstein
- Nina Ireland Laboratory of Developmental Neurobiology and Department of Psychiatry, University of California, San Francisco
| | - Katherine S Pollard
- Gladstone Institute of Cardiovascular Disease, San Francisco, CA Department of Epidemiology and Biostatistics, University of California, San Francisco, CA Institute for Human Genetics, University of California, San Francisco, CA
| |
Collapse
|
15
|
Deng W, Babu IR, Su D, Yin S, Begley TJ, Dedon PC. Trm9-Catalyzed tRNA Modifications Regulate Global Protein Expression by Codon-Biased Translation. PLoS Genet 2015; 11:e1005706. [PMID: 26670883 PMCID: PMC4689569 DOI: 10.1371/journal.pgen.1005706] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 11/06/2015] [Indexed: 12/30/2022] Open
Abstract
Post-transcriptional modifications of transfer RNAs (tRNAs) have long been recognized to play crucial roles in regulating the rate and fidelity of translation. However, the extent to which they determine global protein production remains poorly understood. Here we use quantitative proteomics to show a direct link between wobble uridine 5-methoxycarbonylmethyl (mcm5) and 5-methoxy-carbonyl-methyl-2-thio (mcm5s2) modifications catalyzed by tRNA methyltransferase 9 (Trm9) in tRNAArg(UCU) and tRNAGlu(UUC) and selective translation of proteins from genes enriched with their cognate codons. Controlling for bias in protein expression and alternations in mRNA expression, we find that loss of Trm9 selectively impairs expression of proteins from genes enriched with AGA and GAA codons under both normal and stress conditions. Moreover, we show that AGA and GAA codons occur with high frequency in clusters along the transcripts, which may play a role in modulating translation. Consistent with these results, proteins subject to enhanced ribosome pausing in yeast lacking mcm5U and mcm5s2U are more likely to be down-regulated and contain a larger number of AGA/GAA clusters. Together, these results suggest that Trm9-catalyzed tRNA modifications play a significant role in regulating protein expression within the cell. Here we present evidence for a more complicated role for transfer RNAs (tRNAs) than as mere adapters that link the genetic code in messenger RNA (mRNA) to the amino acid sequence of a protein during translation. tRNAs have long been known to be modified with dozens of different chemical structures other than the 4 canonical ribonucleosides, though the role of these modifications in controlling translation is poorly understood. By quantifying the expression of thousands of proteins in the yeast S. cerevisiae, we identified a mechanistic link between modified ribonucleosides located at the wobble position of two tRNAs, tRNAArg(UCU) and tRNAGlu(UUC), and the translation of proteins derived from genes enriched with codons read by these tRNAs: AGA and GAA. In cells lacking the enzyme that inserts these modifications, tRNA methyltransferase 9 (Trm9), we found a significant reduction in proteins from genes enriched with AGA and GAA codons and with runs of these codons. Also, mRNAs enriched with runs of AGA and GAA codons are subject to stalled translation on ribosomes in yeast lacking mcm5U and mcm5s2U. Together, these results reveal a distinct role for Trm9-catalyzed tRNA modifications in selectively regulating the expression of proteins enriched with AGA and GAA codons.
Collapse
Affiliation(s)
- Wenjun Deng
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - I. Ramesh Babu
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Dan Su
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Shanye Yin
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Thomas J. Begley
- SUNY College of Nanoscale Science and Engineering, Albany, New York, United States of America
- RNA Institute and Cancer Research Center, University at Albany, State University of New York, Albany, New York, United States of America
| | - Peter C. Dedon
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Singapore-MIT Alliance for Research and Technology, Singapore
- * E-mail:
| |
Collapse
|
16
|
Supek F. The Code of Silence: Widespread Associations Between Synonymous Codon Biases and Gene Function. J Mol Evol 2015; 82:65-73. [PMID: 26538122 DOI: 10.1007/s00239-015-9714-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 10/30/2015] [Indexed: 02/07/2023]
Abstract
Some mutations in gene coding regions exchange one synonymous codon for another, and thus do not alter the amino acid sequence of the encoded protein. Even though they are often called 'silent,' these mutations may exhibit a plethora of effects on the living cell. Therefore, they are often selected during evolution, causing synonymous codon usage biases in genomes. Comparative analyses of bacterial, archaeal, fungal, and human cancer genomes have found many links between a gene's biological role and the accrual of synonymous mutations during evolution. In particular, highly expressed genes in certain functional categories are enriched with optimal codons, which are decoded by the abundant tRNAs, thus enhancing the speed and accuracy of the translating ribosome. The set of genes exhibiting codon adaptation differs between genomes, and these differences show robust associations to organismal phenotypes. In addition to selection for translation efficiency, other distinct codon bias patterns have been found in: amino acid starvation genes, cyclically expressed genes, tissue-specific genes in animals and plants, oxidative stress response genes, cellular differentiation genes, and oncogenes. In addition, genomes of organisms harboring tRNA modifications exhibit particular codon preferences. The evolutionary trace of codon bias patterns across orthologous genes may be examined to learn about a gene's relevance to various phenotypes, or, more generally, its function in the cell.
Collapse
Affiliation(s)
- Fran Supek
- Division of electronics, Rudjer Boskovic Institute, 10000, Zagreb, Croatia.
- EMBL-CRG Systems Biology Unit, Centre for Genomic Regulation (CRG), 08003, Barcelona, Spain.
- Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain.
| |
Collapse
|
17
|
Castellano D, Coronado-Zamora M, Campos JL, Barbadilla A, Eyre-Walker A. Adaptive Evolution Is Substantially Impeded by Hill-Robertson Interference in Drosophila. Mol Biol Evol 2015; 33:442-55. [PMID: 26494843 PMCID: PMC4794616 DOI: 10.1093/molbev/msv236] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Hill-Robertson interference (HRi) is expected to reduce the efficiency of natural selection when two or more linked selected sites do not segregate freely, but no attempt has been done so far to quantify the overall impact of HRi on the rate of adaptive evolution for any given genome. In this work, we estimate how much HRi impedes the rate of adaptive evolution in the coding genome of Drosophila melanogaster. We compiled a data set of 6,141 autosomal protein-coding genes from Drosophila, from which polymorphism levels in D. melanogaster and divergence out to D. yakuba were estimated. The rate of adaptive evolution was calculated using a derivative of the McDonald-Kreitman test that controls for slightly deleterious mutations. We find that the rate of adaptive amino acid substitution at a given position of the genome is positively correlated to both the rate of recombination and the mutation rate, and negatively correlated to the gene density of the region. These correlations are robust to controlling for each other, for synonymous codon bias and for gene functions related to immune response and testes. We show that HRi diminishes the rate of adaptive evolution by approximately 27%. Interestingly, genes with low mutation rates embedded in gene poor regions lose approximately 17% of their adaptive substitutions whereas genes with high mutation rates embedded in gene rich regions lose approximately 60%. We conclude that HRi hampers the rate of adaptive evolution in Drosophila and that the variation in recombination, mutation, and gene density along the genome affects the HRi effect.
Collapse
Affiliation(s)
- David Castellano
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Marta Coronado-Zamora
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Jose L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Antonio Barbadilla
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Adam Eyre-Walker
- Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
18
|
Ressayre A, Glémin S, Montalent P, Serre-Giardi L, Dillmann C, Joets J. Introns Structure Patterns of Variation in Nucleotide Composition in Arabidopsis thaliana and Rice Protein-Coding Genes. Genome Biol Evol 2015; 7:2913-28. [PMID: 26450849 PMCID: PMC4684703 DOI: 10.1093/gbe/evv189] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Plant genomes present a continuous range of variation in nucleotide composition (G + C content). In coding regions, G + C-poor species tend to have unimodal distributions of G + C content among genes within genomes and slight 5′–3′ gradients along genes. In contrast, G + C-rich species display bimodal distributions of G + C content among genes and steep 5′–3′ decreasing gradients along genes. The causes of these peculiar patterns are still poorly understood. Within two species (Arabidopsis thaliana and rice), each representative of one side of the continuum, we studied the consequences of intron presence on coding region and intron G + C content at different scales. By properly taking intron structure into account, we showed that, in both species, intron presence is associated with step changes in nucleotide, codon, and amino acid composition. This suggests that introns have a barrier effect structuring G + C content along genes and that previous continuous characterizations of the 5′–3′ gradients were artifactual. In external gene regions (located upstream first or downstream last introns), species-specific factors, such as GC-biased gene conversion, are shaping G + C content whereas in internal gene regions (surrounded by introns), G + C content is likely constrained to remain within a range common to both species.
Collapse
Affiliation(s)
- Adrienne Ressayre
- UMR 0320/UMR 8120 Génétique Quantitative et Evolution-Le Moulon, INRA, Gif-sur-Yvette, France
| | - Sylvain Glémin
- Institut des Sciences de l'Evolution (ISEM), UMR 5554, Université de Montpellier, CNRS-IRD-EPHE, France Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Sweden
| | - Pierre Montalent
- UMR 0320/UMR 8120 Génétique Quantitative et Evolution-Le Moulon, INRA, Gif-sur-Yvette, France
| | - Laurana Serre-Giardi
- UMR 1345 IRHS Institut de Recherche en Horticulture et Semences, INRA, Centre de Recherche Angers-Nantes, Beaucousé, France
| | - Christine Dillmann
- UMR 0320/UMR 8120 Génétique Quantitative et Evolution-Le Moulon, Université Paris-Sud, Gif-sur-Yvette, France
| | - Johann Joets
- UMR 0320/UMR 8120 Génétique Quantitative et Evolution-Le Moulon, INRA, Gif-sur-Yvette, France
| |
Collapse
|
19
|
Dohra H, Fujishima M, Suzuki H. Analysis of amino acid and codon usage in Paramecium bursaria. FEBS Lett 2015; 589:3113-8. [PMID: 26341535 DOI: 10.1016/j.febslet.2015.08.033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2015] [Revised: 08/20/2015] [Accepted: 08/21/2015] [Indexed: 01/28/2023]
Abstract
The ciliate Paramecium bursaria harbors the green-alga Chlorella symbionts. We reassembled the P. bursaria transcriptome to minimize falsely fused transcripts, and investigated amino acid and codon usage using the transcriptome data. Surface proteins preferentially use smaller amino acid residues like cysteine. Unusual synonymous codon and amino acid usage in highly expressed genes can reflect a balance between translational selection and other factors. A correlation of gene expression level with synonymous codon or amino acid usage is emphasized in genes down-regulated in symbiont-bearing cells compared to symbiont-free cells. Our results imply that the selection is associated with P. bursaria-Chlorella symbiosis.
Collapse
Affiliation(s)
- Hideo Dohra
- Instrumental Research Support Office, Research Institute of Green Science and Technology, Shizuoka University, 836 Ohya, Suruga-ku, Shizuoka 422-8529, Japan; Department of Biological Science, Graduate School of Science, Shizuoka University, 836 Ohya, Suruga-ku, Shizuoka 422-8529, Japan
| | - Masahiro Fujishima
- Department of Environmental Science and Engineering, Graduate School of Science and Engineering, Yamaguchi University, 1677-1 Yoshida, Yamaguchi 753-8512, Japan; National Bio-Resource Project of Japan Agency for Medical Research and Development, Japan
| | - Haruo Suzuki
- Department of Environmental Science and Engineering, Graduate School of Science and Engineering, Yamaguchi University, 1677-1 Yoshida, Yamaguchi 753-8512, Japan.
| |
Collapse
|
20
|
Bush SJ, Kover PX, Urrutia AO. Lineage-specific sequence evolution and exon edge conservation partially explain the relationship between evolutionary rate and expression level in A. thaliana. Mol Ecol 2015; 24:3093-106. [PMID: 25930165 PMCID: PMC4480654 DOI: 10.1111/mec.13221] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2014] [Revised: 04/21/2015] [Accepted: 04/28/2015] [Indexed: 02/06/2023]
Abstract
Rapidly evolving proteins can aid the identification of genes underlying phenotypic adaptation across taxa, but functional and structural elements of genes can also affect evolutionary rates. In plants, the ‘edges’ of exons, flanking intron junctions, are known to contain splice enhancers and to have a higher degree of conservation compared to the remainder of the coding region. However, the extent to which these regions may be masking indicators of positive selection or account for the relationship between dN/dS and other genomic parameters is unclear. We investigate the effects of exon edge conservation on the relationship of dN/dS to various sequence characteristics and gene expression parameters in the model plant Arabidopsis thaliana. We also obtain lineage-specific dN/dS estimates, making use of the recently sequenced genome of Thellungiella parvula, the second closest sequenced relative after the sister species Arabidopsis lyrata. Overall, we find that the effect of exon edge conservation, as well as the use of lineage-specific substitution estimates, upon dN/dS ratios partly explains the relationship between the rates of protein evolution and expression level. Furthermore, the removal of exon edges shifts dN/dS estimates upwards, increasing the proportion of genes potentially under adaptive selection. We conclude that lineage-specific substitutions and exon edge conservation have an important effect on dN/dS ratios and should be considered when assessing their relationship with other genomic parameters.
Collapse
Affiliation(s)
- Stephen J Bush
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Paula X Kover
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| | - Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
| |
Collapse
|
21
|
Haerty W, Ponting CP. Unexpected selection to retain high GC content and splicing enhancers within exons of multiexonic lncRNA loci. RNA (NEW YORK, N.Y.) 2015; 21:333-46. [PMID: 25589248 PMCID: PMC4338330 DOI: 10.1261/rna.047324.114] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2014] [Accepted: 11/25/2014] [Indexed: 06/04/2023]
Abstract
If sequencing was possible only for genomes, and not for RNAs or proteins, then functional protein-coding exons would be recognizable by their unusual patterns of nucleotide composition, specifically a high GC content across the body of exons, and an unusual nucleotide content near their edges. RNAs and proteins can, of course, be sequenced but the extent of functionality of intergenic long noncoding RNAs (lncRNAs) remains under question owing to their low nucleotide conservation. Inspired by the nucleotide composition patterns of protein-coding exons, we sought evidence for functionality across lncRNA loci from diverse species. We found that such patterns across multiexonic lncRNA loci mirror those of proteincoding genes, although to a lesser degree: Specifically, compared with introns, lncRNA exons are GC rich. Additionally we report evidence for the action of purifying selection to preserve exonic splicing enhancers within human multiexonic lncRNAs and nucleotide composition in fruit fly lncRNAs. Our findings provide evidence for selection for more efficient rates of transcription and splicing within lncRNA loci. Despite only a minor proportion of their RNA bases being constrained, multiexonic intergenic lncRNAs appear to require accurate splicing of their exons to transact their function.
Collapse
|
22
|
Wei X, Zhang J. A simple method for estimating the strength of natural selection on overlapping genes. Genome Biol Evol 2014; 7:381-90. [PMID: 25552532 PMCID: PMC4316641 DOI: 10.1093/gbe/evu294] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Overlapping genes, where one DNA sequence codes for two proteins with different reading frames, are not uncommon in viruses and cellular organisms. Estimating the direction and strength of natural selection acting on overlapping genes is important for understanding their functionality, origin, evolution, maintenance, and potential interaction. However, the standard methods for estimating synonymous (dS) and nonsynonymous (dN) nucleotide substitution rates are inapplicable here because a nucleotide change can be simultaneously synonymous and nonsynonymous when both reading frames involved are considered. We have developed a simple method that can estimate dN/dS and test for the action of natural selection in each relevant reading frame of the overlapping genes. Our method is an extension of the modified Nei-Gojobori method previously developed for nonoverlapping genes. We confirmed the reliability of our method using extensive computer simulation. Applying this method, we studied the longest human sense–antisense overlapping gene pair, LRRC8E and ENSG00000214248. Although LRRC8E (leucine-rich repeat containing eight family, member E) is known to regulate cell size, the function of ENSG00000214248 is unknown. Our analysis revealed purifying selection on ENSG00000214248 and suggested that it originated in the common ancestor of bony vertebrates.
Collapse
Affiliation(s)
- Xinzhu Wei
- Department of Ecology and Evolutionary Biology, University of Michigan
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan
| |
Collapse
|
23
|
Schüler A, Ghanbarian AT, Hurst LD. Purifying selection on splice-related motifs, not expression level nor RNA folding, explains nearly all constraint on human lincRNAs. Mol Biol Evol 2014; 31:3164-83. [PMID: 25158797 PMCID: PMC4245815 DOI: 10.1093/molbev/msu249] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
There are two strong and equally important predictors of rates of human protein evolution: The amount the gene is expressed and the proportion of exonic sequence devoted to control splicing, mediated largely by selection on exonic splice enhancer (ESE) motifs. Is the same true for noncoding RNAs, known to be under very weak purifying selection? Prior evidence suggests that selection at splice sites in long intergenic noncoding RNAs (lincRNAs) is important. We now report multiple lines of evidence indicating that the great majority of purifying selection operating on lincRNAs in humans is splice related. Splice-related parameters explain much of the between-gene variation in evolutionary rate in humans. Expression rate is not a relevant predictor, although expression breadth is weakly so. In contrast to protein-coding RNAs, we observe no relationship between evolutionary rate and lincRNA stability. As in protein-coding genes, ESEs are especially abundant near splice junctions and evolve slower than non-ESE sequence equidistant from boundaries. Nearly all constraint in lincRNAs is at exon ends (N.B. the same is not witnessed in Drosophila). Although we cannot definitely answer the question as to why splice-related selection is so important, we find no evidence that splicing might enable the nonsense-mediated decay pathway to capture transcripts incorrectly processed by ribosomes. We find evidence consistent with the notion that splicing modifies the underlying chromatin through recruitment of splice-coupled chromatin modifiers, such as CHD1, which in turn might modulate neighbor gene activity. We conclude that most selection on human lincRNAs is splice mediated and suggest that the possibility of splice-chromatin coupling is worthy of further scrutiny.
Collapse
Affiliation(s)
- Andreas Schüler
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Avazeh T Ghanbarian
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| |
Collapse
|
24
|
Fares M. Identifying Natural Selection with Molecular Data. NATURAL SELECTION 2014:48-82. [DOI: 10.1201/b17795-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
|
25
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022] Open
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Collapse
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| |
Collapse
|
26
|
Singha HS, Chakraborty S, Deka H. Stress induced MAPK genes show distinct pattern of codon usage in Arabidopsis thaliana, Glycine max and Oryza sativa. Bioinformation 2014; 10:436-42. [PMID: 25187684 PMCID: PMC4135292 DOI: 10.6026/97320630010436] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Accepted: 06/27/2014] [Indexed: 01/19/2023] Open
Abstract
Mitogen activated protein kinase (MAPK) genes provide resistance to various biotic and abiotic stresses. Codon usage profiling of the genes reveals the characteristic features of the genes like nucleotide composition, gene expressivity, optimal codons etc. The present study is a comparative analysis of codon usage patterns for different MAPK genes in three organisms, viz. Arabidopsis thaliana, Glycine max (soybean) and Oryza sativa (rice). The study has revealed a high AT content in MAPK genes of Arabidopsis and soybean whereas in rice a balanced AT-GC content at the third synonymous position of codon. The genes show a low bias in codon usage profile as reflected in the higher values (50.83 to 56.55) of effective number of codons (Nc). The prediction of gene expression profile in the MAPK genes revealed that these genes might be under the selective pressure of translational optimization as reflected in the low codon adaptation index (CAI) values ranging from 0.147 to 0.208.
Collapse
Affiliation(s)
| | | | - Himangshu Deka
- Department of Biotechnology, Assam University, Silchar-788011, Assam, India
| |
Collapse
|
27
|
Hunt RC, Simhadri VL, Iandoli M, Sauna ZE, Kimchi-Sarfaty C. Exposing synonymous mutations. Trends Genet 2014; 30:308-21. [PMID: 24954581 DOI: 10.1016/j.tig.2014.04.006] [Citation(s) in RCA: 230] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2013] [Revised: 04/16/2014] [Accepted: 04/17/2014] [Indexed: 12/12/2022]
Abstract
Synonymous codon changes, which do not alter protein sequence, were previously thought to have no functional consequence. Although this concept has been overturned in recent years, there is no unique mechanism by which these changes exert biological effects. A large repertoire of both experimental and bioinformatic methods has been developed to understand the effects of synonymous variants. Results from this body of work have provided global insights into how biological systems exploit the degeneracy of the genetic code to control gene expression, protein folding efficiency, and the coordinated expression of functionally related gene families. Although it is now clear that synonymous variants are important in a variety of contexts, from human disease to the safety and efficacy of therapeutic proteins, there is no clear consensus on the approaches to identify and validate these changes. Here, we review the diverse methods to understand the effects of synonymous mutations.
Collapse
Affiliation(s)
- Ryan C Hunt
- Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, MD, USA.
| | - Vijaya L Simhadri
- Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, MD, USA
| | - Matthew Iandoli
- Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, MD, USA
| | - Zuben E Sauna
- Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, MD, USA.
| | - Chava Kimchi-Sarfaty
- Division of Hematology, Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, MD, USA.
| |
Collapse
|
28
|
Du J, Dungan SZ, Sabouhanian A, Chang BSW. Selection on synonymous codons in mammalian rhodopsins: a possible role in optimizing translational processes. BMC Evol Biol 2014; 14:96. [PMID: 24884412 PMCID: PMC4021273 DOI: 10.1186/1471-2148-14-96] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 04/11/2014] [Indexed: 01/21/2023] Open
Abstract
Background Synonymous codon usage can affect many cellular processes, particularly those associated with translation such as polypeptide elongation and folding, mRNA degradation/stability, and splicing. Highly expressed genes are thought to experience stronger selection pressures on synonymous codons. This should result in codon usage bias even in species with relatively low effective population sizes, like mammals, where synonymous site selection is thought to be weak. Here we use phylogenetic codon-based likelihood models to explore patterns of codon usage bias in a dataset of 18 mammalian rhodopsin sequences, the protein mediating the first step in vision in the eye, and one of the most highly expressed genes in vertebrates. We use these patterns to infer selection pressures on key translational mechanisms including polypeptide elongation, protein folding, mRNA stability, and splicing. Results Overall, patterns of selection in mammalian rhodopsin appear to be correlated with post-transcriptional and translational processes. We found significant evidence for selection at synonymous sites using phylogenetic mutation-selection likelihood models, with C-ending codons found to have the highest relative fitness, and to be significantly more abundant at conserved sites. In general, these codons corresponded with the most abundant tRNAs in mammals. We found significant differences in codon usage bias between rhodopsin loops versus helices, though there was no significant difference in mean synonymous substitution rate between these motifs. We also found a significantly higher proportion of GC-ending codons at paired sites in rhodopsin mRNA secondary structure, and significantly lower synonymous mutation rates in putative exonic splicing enhancer (ESE) regions than in non-ESE regions. Conclusions By focusing on a single highly expressed gene we both distinguish synonymous codon selection from mutational effects and analytically explore underlying functional mechanisms. Our results suggest that codon bias in mammalian rhodopsin arises from selection to optimally balance high overall translational speed, accuracy, and proper protein folding, especially in structurally complicated regions. Selection at synonymous sites may also be contributing to mRNA stability and splicing efficiency at exonic-splicing-enhancer (ESE) regions. Our results highlight the importance of investigating highly expressed genes in a broader phylogenetic context in order to better understand the evolution of synonymous substitutions.
Collapse
Affiliation(s)
| | | | | | - Belinda S W Chang
- Department of Ecology & Evolutionary Biology, University of Toronto, 25 Harbord Street, Toronto, ON M5S 3G5, Canada.
| |
Collapse
|
29
|
Gu W, Li M, Xu Y, Wang T, Ko JH, Zhou T. The impact of RNA structure on coding sequence evolution in both bacteria and eukaryotes. BMC Evol Biol 2014; 14:87. [PMID: 24758737 PMCID: PMC4021280 DOI: 10.1186/1471-2148-14-87] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2014] [Accepted: 04/07/2014] [Indexed: 12/03/2022] Open
Abstract
Background Many studies have found functional RNA secondary structures are selectively conserved among species. But, the effect of RNA structure selection on coding sequence evolution remains unknown. To address this problem, we systematically investigated the relationship between nucleotide conservation level and its structural sensitivity in four model organisms, Escherichia coli, yeast, fly, and mouse. Results We define structurally sensitive sites as those with putative local structure-disruptive mutations. Using both the Mantel-Haenszel procedure and association test, we found structurally sensitive nucleotide sites evolved more slowly than non-sensitive sites in all four organisms. Furthermore, we observed that this association is more obvious in highly expressed genes and region near the start codon. Conclusion We conclude that structurally sensitive sites in mRNA sequences normally have less nucleotide divergence in all species we analyzed. This study extends our understanding of the impact of RNA structure on coding sequence evolution, and is helpful to the development of a codon model with RNA structure information.
Collapse
Affiliation(s)
- Wanjun Gu
- Research Center for Learning Sciences, Southeast University, Nanjing, Jiangsu 210096, China.
| | | | | | | | | | | |
Collapse
|
30
|
Matthijs S, Ye L, Stijlemans B, Cornelis P, Bossuyt F, Roelants K. Low structural variation in the host-defense peptide repertoire of the dwarf clawed frog Hymenochirus boettgeri (Pipidae). PLoS One 2014; 9:e86339. [PMID: 24466037 PMCID: PMC3899252 DOI: 10.1371/journal.pone.0086339] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2013] [Accepted: 12/06/2013] [Indexed: 02/05/2023] Open
Abstract
THE skin secretion of many amphibians contains peptides that are able to kill a broad range of microorganisms (antimicrobial peptides: AMPs) and potentially play a role in innate immune defense. Similar to the toxin arsenals of various animals, amphibian AMP repertoires typically show major structural variation, and previous studies have suggested that this may be the result of diversifying selection in adaptation to a diverse spectrum of pathogens. Here we report on transcriptome analyses that indicate a very different pattern in the dwarf clawed frog H. boettgeri. Our analyses reveal a diverse set of transcripts containing two to six tandem repeats, together encoding 14 distinct peptides. Five of these have recently been identified as AMPs, while three more are shown here to potently inhibit the growth of gram-negative bacteria, including multi-drug resistant strains of the medically important Pseudomonas aeruginosa. Although the number of predicted peptides is similar to the numbers of related AMPs in Xenopus and Silurana frog species, they show significantly lower structural variation. Selection analyses confirm that, in contrast to the AMPs of other amphibians, the H. boettgeri peptides did not evolve under diversifying selection. Instead, the low sequence variation among tandem repeats resulted from purifying selection, recent duplication and/or concerted gene evolution. Our study demonstrates that defense peptide repertoires of closely related taxa, after diverging from each other, may evolve under differential selective regimes, leading to contrasting patterns of structural diversity.
Collapse
Affiliation(s)
- Severine Matthijs
- Amphibian Evolution Lab, Biology Department, Vrije Universiteit Brussel, Brussels, Belgium
| | - Lumeng Ye
- Department of Bioengineering Sciences, Research Group of Microbiology and Vlaams Instituut voor Biotechnologie, Vrije Universiteit Brussel, Brussels, Belgium
| | - Benoit Stijlemans
- Unit of Cellular and Molecular Immunology, Vrije Universiteit Brussel, Brussels, Belgium
- Laboratory of Myeloid Cell Immunology, Vlaams Instituut voor Biotechnologie, Brussels, Belgium
| | - Pierre Cornelis
- Department of Bioengineering Sciences, Research Group of Microbiology and Vlaams Instituut voor Biotechnologie, Vrije Universiteit Brussel, Brussels, Belgium
| | - Franky Bossuyt
- Amphibian Evolution Lab, Biology Department, Vrije Universiteit Brussel, Brussels, Belgium
| | - Kim Roelants
- Amphibian Evolution Lab, Biology Department, Vrije Universiteit Brussel, Brussels, Belgium
- * E-mail:
| |
Collapse
|
31
|
Affiliation(s)
- Robert J Weatheritt
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
| | | |
Collapse
|
32
|
Behura SK, Singh BK, Severson DW. Antagonistic relationships between intron content and codon usage bias of genes in three mosquito species: functional and evolutionary implications. Evol Appl 2013; 6:1079-89. [PMID: 24187589 PMCID: PMC3804240 DOI: 10.1111/eva.12088] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2013] [Accepted: 06/14/2013] [Indexed: 12/17/2022] Open
Abstract
Genome biology of mosquitoes holds potential in developing knowledge-based control strategies against vectorborne diseases such as malaria, dengue, West Nile, and others. Although the genomes of three major vector mosquitoes have been sequenced, attempts to elucidate the relationship between intron and codon usage bias across species in phylogenetic contexts are limited. In this study, we investigated the relationship between intron content and codon bias of orthologous genes among three vector mosquito species. We found an antagonistic relationship between codon usage bias and the intron number of genes in each mosquito species. The pattern is further evident among the intronless and the intron-containing orthologous genes associated with either low or high codon bias among the three species. Furthermore, the covariance between codon bias and intron number has a directional component associated with the species phylogeny when compared with other nonmosquito insects. By applying a maximum likelihood-based continuous regression method, we show that codon bias and intron content of genes vary among the insects in a phylogeny-dependent manner, but with no evidence of adaptive radiation or species-specific adaptation. We discuss the functional and evolutionary significance of antagonistic relationships between intron content and codon bias.
Collapse
Affiliation(s)
- Susanta K Behura
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame Notre Dame, IN, USA
| | | | | |
Collapse
|
33
|
Gu W, Wang X, Zhai C, Zhou T, Xie X. Biological basis of miRNA action when their targets are located in human protein coding region. PLoS One 2013; 8:e63403. [PMID: 23671676 PMCID: PMC3646042 DOI: 10.1371/journal.pone.0063403] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Accepted: 03/30/2013] [Indexed: 01/09/2023] Open
Abstract
Recent analyses have revealed many functional microRNA (miRNA) targets in mammalian protein coding regions. But, the mechanisms that ensure miRNA function when their target sites are located in protein coding regions of mammalian mRNA transcripts are largely unknown. In this paper, we investigate some potential biological factors, such as target site accessibility and local translation efficiency. We computationally analyze these two factors using experimentally identified miRNA targets in human protein coding region. We find site accessibility is significantly increased in miRNA target region to facilitate miRNA binding. At the mean time, local translation efficiency is also selectively decreased near miRNA target region. GC-poor codons are preferred in the flank region of miRNA target sites to ease the access of miRNA targets. Within-genome analysis shows substantial variations of site accessibility and local translation efficiency among different miRNA targets in the genome. Further analyses suggest target gene’s GC content and conservation level could explain some of the differences in site accessibility. On the other hand, target gene’s functional importance and conservation level can affect local translation efficiency near miRNA target region. We hence propose both site accessibility and local translation efficiency are important in miRNA action when miRNA target sites are located in mammalian protein coding regions.
Collapse
Affiliation(s)
- Wanjun Gu
- Research Center of Learning Sciences, Southeast University, Nanjing, Jiangsu, China
- * E-mail: (WG); (TZ); (XX)
| | - Xiaofei Wang
- Research Center of Learning Sciences, Southeast University, Nanjing, Jiangsu, China
| | - Chuanying Zhai
- Research Center of Learning Sciences, Southeast University, Nanjing, Jiangsu, China
| | - Tong Zhou
- Institute for Personalized Respiratory Medicine, The University of Illinois at Chicago, Chicago, Illinois, United States of America
- Section of Pulmonary, Critical Care, Sleep & Allergy, Department of Medicine, The University of Illinois at Chicago, Chicago, Illinois, United States of America
- * E-mail: (WG); (TZ); (XX)
| | - Xueying Xie
- Research Center of Learning Sciences, Southeast University, Nanjing, Jiangsu, China
- * E-mail: (WG); (TZ); (XX)
| |
Collapse
|
34
|
Nabiyouni M, Prakash A, Fedorov A. Vertebrate codon bias indicates a highly GC-rich ancestral genome. Gene 2013; 519:113-9. [PMID: 23376453 DOI: 10.1016/j.gene.2013.01.033] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2012] [Revised: 01/10/2013] [Accepted: 01/17/2013] [Indexed: 11/16/2022]
Abstract
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences.
Collapse
Affiliation(s)
- Maryam Nabiyouni
- Program in Bioinformatics and Proteomics/Genomics, University of Toledo, Health Science Campus, Toledo, OH 43614, USA.
| | | | | |
Collapse
|
35
|
Wu X, Tronholm A, Cáceres EF, Tovar-Corona JM, Chen L, Urrutia AO, Hurst LD. Evidence for deep phylogenetic conservation of exonic splice-related constraints: splice-related skews at exonic ends in the brown alga Ectocarpus are common and resemble those seen in humans. Genome Biol Evol 2013; 5:1731-45. [PMID: 23902749 PMCID: PMC3787667 DOI: 10.1093/gbe/evt115] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/25/2013] [Indexed: 12/22/2022] Open
Abstract
The control of RNA splicing is often modulated by exonic motifs near splice sites. Chief among these are exonic splice enhancers (ESEs). Well-described ESEs in mammals are purine rich and cause predictable skews in codon and amino acid usage toward exonic ends. Looking across species, those with relatively abundant intronic sequence are those with the more profound end of exon skews, indicative of exonization of splice site recognition. To date, the only intron-rich species that have been analyzed are mammals, precluding any conclusions about the likely ancestral condition. Here, we examine the patterns of codon and amino acid usage in the vicinity of exon-intron junctions in the brown alga Ectocarpus siliculosus, a species with abundant large introns, known SR proteins, and classical splice sites. We find that amino acids and codons preferred/avoided at both 3' and 5' ends in Ectocarpus, of which there are many, tend, on average, to also be preferred/avoided at the same exon ends in humans. Moreover, the preferences observed at the 5' ends of exons are largely the same as those at the 3' ends, a symmetry trend only previously observed in animals. We predict putative hexameric ESEs in Ectocarpus and show that these are purine rich and that there are many more of these identified as functional ESEs in humans than expected by chance. These results are consistent with deep phylogenetic conservation of SR protein binding motifs. Assuming codons preferred near boundaries are "splice optimal" codons, in Ectocarpus, unlike Drosophila, splice optimal and translationally optimal codons are not mutually exclusive. The exclusivity of translationally optimal and splice optimal codon sets is thus not universal.
Collapse
Affiliation(s)
- XianMing Wu
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
| | - Ana Tronholm
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
- Present address: Department of Biological Sciences, University of Alabama, Mary Harmon Bryant Hall, Tuscaloosa, AL
| | - Eva Fernández Cáceres
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
| | - Jaime M. Tovar-Corona
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
| | - Lu Chen
- Human Genetics, Wellcome Trust Sanger Institute, Genome Campus, Hinxton, United Kingdom
| | - Araxi O. Urrutia
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
| | - Laurence D. Hurst
- Department of Biology and Biochemistry, University of Bath, Somerset, United Kingdom
| |
Collapse
|
36
|
Hershberg R, Petrov DA. On the limitations of using ribosomal genes as references for the study of codon usage: a rebuttal. PLoS One 2012; 7:e49060. [PMID: 23284622 PMCID: PMC3527481 DOI: 10.1371/journal.pone.0049060] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 10/05/2012] [Indexed: 01/08/2023] Open
Abstract
In a recent paper published in PLOS ONE, Wang et al. challenge our finding that the identity of optimal codons in different genomes follows a set of clear rules. Here we provide a rebuttal of their paper and demonstrate that the results of our original PLOS Genetics paper stand. This provides us with an opportunity to bring up an aspect of how codon usage has been studied that should be of general interest. The Wang et al. study, as well as many other studies, used ribosomal genes as a reference set for the study of patterns of codon usage. We discuss here the assumptions that are made in order to justify using ribosomal genes to study codon bias, suggest that this practice can at times be problematic, and discuss its limitations.
Collapse
Affiliation(s)
- Ruth Hershberg
- Rachel & Menachem Mendelovitch Evolutionary Processes of Mutation & Natural Selection Research Laboratory, Department of Genetics, Technion-Israel Institute of Technology, Haifa, Israel.
| | | |
Collapse
|
37
|
Abstract
Human metapneumovirus (HMPV) is an important agent of acute respiratory tract infection in children, while its pathogenicity and molecular evolution are lacking. Herein, we firstly report the synonymous codon usage patterns of HMPV genome. The relative synonymous codon usage (RSCU) values, effective number of codon (ENC) values, nucleotide contents, and correlation analysis were performed among 17 available whole genome of HMPV, including different genotypes. All preferred codons in HMPV are ended with A/U nucleotide and exhibited a great association with its high proportion of these two nucleotides in their genomes. Mutation pressure rather than natural selection is the main influence factor that determines the bias of synonymous codon usage in HMPV. The complementary pattern of codon usage bias between HMPV and human cell was observed, and this phenomenon suggests that host cells might be also act as an important factor to affect the codon usage bias. Moreover, the codon usage biases in each HMPV genotypes are separated into different clades, which suggest that phylogenetic distance might involve in codon usage bias formation as well. These analyses of synonymous codon usage bias in HMPV provide more information for better understanding its evolution and pathogenicity.
Collapse
|
38
|
Zhou T, Ko EA, Gu W, Lim I, Bang H, Ko JH. Non-silent story on synonymous sites in voltage-gated ion channel genes. PLoS One 2012; 7:e48541. [PMID: 23119053 PMCID: PMC3485311 DOI: 10.1371/journal.pone.0048541] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Accepted: 09/26/2012] [Indexed: 02/06/2023] Open
Abstract
Synonymous mutations are usually referred to as “silent”, but increasing evidence shows that they are not neutral in a wide range of organisms. We looked into the relationship between synonymous codon usage bias and residue importance of voltage-gated ion channel proteins in mice, rats, and humans. We tested whether translationally optimal codons are associated with transmembrane or channel-forming regions, i.e., the sites that are particularly likely to be involved in the closing and opening of an ion channel. Our hypothesis is that translationally optimal codons are preferred at the sites within transmembrane domains or channel-forming regions in voltage-gated ion channel genes to avoid mistranslation-induced protein misfolding or loss-of-function. Using the Mantel-Haenszel procedure, which applies to categorical data, we found that translationally optimal codons are more likely to be used at transmembrane residues and the residues involved in channel-forming. We also found that the conservation level at synonymous sites in the transmembrane region is significantly higher than that in the non-transmembrane region. This study provides evidence that synonymous sites in voltage-gated ion channel genes are not neutral. Silent mutations at channel-related sites may lead to dysfunction of the ion channel.
Collapse
Affiliation(s)
- Tong Zhou
- Institute for Personalized Respiratory Medicine, University of Illinois at Chicago, Chicago, Illinois, United States of America
- Section of Pulmonary, Critical Care, Sleep & Allergy, Department of Medicine, University of Illinois at Chicago, Chicago, Illinois, United States of America
| | - Eun A. Ko
- Department of Medicine, University of California San Francisco, San Francisco, California, United States of America
| | - Wanjun Gu
- Key Laboratory of Child Development and Learning Science of Ministry of Education of China, Southeast University, Nanjing, Jiangsu, China
| | - Inja Lim
- Department of Physiology, College of Medicine, Chung-Ang University, Seoul, South Korea
| | - Hyoweon Bang
- Department of Physiology, College of Medicine, Chung-Ang University, Seoul, South Korea
| | - Jae-Hong Ko
- Department of Physiology, College of Medicine, Chung-Ang University, Seoul, South Korea
- * E-mail:
| |
Collapse
|
39
|
Gu W, Zhai C, Wang X, Xie X, Parinandi G, Zhou T. Translation Efficiency in Upstream Region of microRNA Targets in Arabidopsis thaliana. Evol Bioinform Online 2012; 8:565-74. [PMID: 23071387 PMCID: PMC3469488 DOI: 10.4137/ebo.s10362] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
With respect to upstream regions of microRNA (miRNA) target sites located in protein coding sequences, experimental studies have suggested rare codons, rather than frequent codons, are important for miRNA function, because they slow down the local translational process. But, whether there is a trend of reduced translation efficiency near miRNA targets is still unknown. Using Arabidopsis thaliana, we perform genome-wide analysis of synonymous codon usage in upstream regions of miRNA target sites. At the whole genome level, we find no significant selection signals for decreased translational efficiency. However, the same genome analyses do show substantial variations of translation efficiency reduction among miRNA targets. We find that miRNA conservation level, gene codon usage bias, and the mechanism of miRNA action can account for the differences in translation efficiency. But gene's GC content, gene expression level, and miRNA target's conservation level have no effect on local translation efficiency of miRNA targets. Although local translation efficiency in the upstream region of miRNA targets is related to miRNA function in A. thaliana, the selection signal of rare codon usage in that region is weak. We propose some other biological factors are more important than local translation efficiency in miRNA action when miRNA targets are located in protein coding sequences.
Collapse
Affiliation(s)
- Wanjun Gu
- Key Laboratory of Child Development and Learning Science of Ministry of Education of China, Southeast University, Nanjing, Jiangsu 210096, China
| | | | | | | | | | | |
Collapse
|
40
|
Selection on Synonymous Sites for Increased Accessibility around miRNA Binding Sites in Plants. Mol Biol Evol 2012; 29:3037-44. [DOI: 10.1093/molbev/mss109] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
|
41
|
Error prevention and mitigation as forces in the evolution of genes and genomes. Nat Rev Genet 2011; 12:875-81. [PMID: 22094950 DOI: 10.1038/nrg3092] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Why are short introns rarely a multiple of three nucleotides long? Why do essential genes cluster? Why are genes in operons often lined up in the order in which they are needed in the encoded pathway? In this Opinion article, we argue that these and many other - ostensibly disparate - observations are all pieces of an emerging picture in which multiple aspects of gene anatomy and genome architecture have evolved in response to error-prone gene expression.
Collapse
|
42
|
Weber CC, Pink CJ, Hurst LD. Late-replicating domains have higher divergence and diversity in Drosophila melanogaster. Mol Biol Evol 2011; 29:873-82. [PMID: 22046001 DOI: 10.1093/molbev/msr265] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Several reports from mammals indicate that an increase in the mutation rate in late-replicating regions may, in part, be responsible for the observed genomic heterogeneity in neutral substitution rates and levels of diversity, although the mechanisms for this remain poorly understood. Recent evidence also suggests that late replication is associated with high mutability in yeast. This then raises the question as to whether a similar effect is operating across all eukaryotes. Limited evidence from one chromosome arm in Drosophila melanogaster suggests the opposite pattern, with regions overlapping early-firing origins showing increased levels of diversity and divergence. Given the availability of genome-wide replication timing profiles for D. melanogaster, we now return to this issue. Consistent with what is seen in other taxa, we find that divergence at synonymous sites in exon cores, as well as divergence at putatively unconstrained intronic sites, is elevated in late-replicating regions. Analysis of genes with low codon usage bias suggests a ∼30% difference in mutation rate between the earliest and the latest replicating sequence. Intronic sequence suggests a more modest difference. We additionally show that an increase in diversity in late-replicating sequences is not owing to replication timing covarying with the local recombination rate. If anything, the effects of recombination mask the impact of replication timing. We conclude that, contrary to prior reports and consistent with what is seen in mammals and yeast, there is indeed a relationship between rates of nucleotide divergence and diversity and replication timing that is consistent with an increase in the mutation rate during late S-phase in D. melanogaster. It is therefore plausible that such an effect might be common among eukaryotes. The result may have implications for the inference of positive selection.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, United Kingdom
| | | | | |
Collapse
|
43
|
Ezawa K, Ikeo K, Gojobori T, Saitou N. Evolutionary patterns of recently emerged animal duplogs. Genome Biol Evol 2011; 3:1119-35. [PMID: 21859807 PMCID: PMC3194840 DOI: 10.1093/gbe/evr074] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Duplogs, or intraspecies paralogs, constitute the important portion of eukaryote genomes and serve as a major source of functional innovation. We conducted detailed analyses of recently emerged animal duplogs. Genome data of three vertebrate species (Homo sapiens, Mus musculus, and Danio rerio), Caenorhabditis elegans, and two Drosophila species (Drosophila melanogaster and D. pseudoobscura) were used. Duplication events were divided into six age-groups according to the synonymous distance (dS) up to 0.6. Duplogs were classified into four equal-sized classes on physical distances and into three classes on relative orientations. We observed the following shared characteristics among intrachromosomal multiexon duplogs: 1) inverted duplogs account for 20-50%, and about a half of the physically most distant 25%; 2) except for C. elegans, the composition of physical distances, that of relative orientations, and the proportion of inverted duplogs in each physical distance category are more or less uniform; 3) except for C. elegans, the characteristics of the youngest (dS < 0.01) duplogs are similar to the overall characteristics of the entire set. These results suggest that intrachromosomal duplogs with fairly long physical distances were generated at once, rather than resulting from tandem duplications and subsequent genomic rearrangements. This is different from the three well-known modes of gene duplication: tandem duplication, retrotransposition, and genome duplication. We termed this new mode as "drift" duplication. The drift duplication has been producing duplicate copies at paces comparable with tandem duplications since the common ancestor of vertebrates, and it may have already operated in the common ancestor of bilateral animals.
Collapse
Affiliation(s)
- Kiyoshi Ezawa
- Division of Population Genetics, National Institute of Genetics, Mishima, Japan
| | | | | | | |
Collapse
|
44
|
Haerty W, Golding GB. Increased polymorphism near low-complexity sequences across the genomes of Plasmodium falciparum isolates. Genome Biol Evol 2011; 3:539-50. [PMID: 21602572 PMCID: PMC3140889 DOI: 10.1093/gbe/evr045] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Low-complexity regions (LCRs) within proteins sequences are often considered to evolve neutrally even though recent studies reported evidence for selection acting on some of them. Because of their widespread distribution among eukaryotes genomes and the potential deleterious effect of expansion/contraction of some of them in humans, low-complexity sequences are of major interest and numerous studies have attempted to describe their dynamic between genomes as well as the factors correlated to their variation and to assess their selective value. However, due to the scarcity of individual genomes within a species, most of the analyses so far have been performed at the species level with the implicit assumption that the variation both in composition and size within species is too small relative to the between-species divergence to affect the conclusions of the analysis. Here we used the available genomes of 14 Plasmodium falciparum isolates to assess the relationship between low-complexity sequence variation and factors such as nucleotide polymorphism across strains, sequence composition, and protein expression. We report that more than half of the 7,711 low-complexity sequences found within aligned coding sequences are variable in size among strains. Across strains, we observed an increasing density of polymorphic sites toward the LCR boundaries. This observation strongly suggests the joint effects of lowered selective constraints on low-complexity sequences and a mutagenic effect of these simple sequences.
Collapse
Affiliation(s)
- Wilfried Haerty
- Department of Biology, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
45
|
Lee Y, Zhou T, Tartaglia GG, Vendruscolo M, Wilke CO. Translationally optimal codons associate with aggregation-prone sites in proteins. Proteomics 2011; 10:4163-71. [PMID: 21046618 DOI: 10.1002/pmic.201000229] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
We analyze the relationship between codon usage bias and residue aggregation propensity in the genomes of four model organisms, Escherichia coli, yeast, fly, and mouse, as well as the archaeon Halobacterium species NRC-1. Using the Mantel-Haenszel procedure, we find that translationally optimal codons associate with aggregation-prone residues. Our results are qualitatively and quantitatively similar to those of an earlier study where we found an association between translationally optimal codons and buried residues. We also combine the aggregation-propensity data with solvent-accessibility data. Although the resulting data set is small, and hence statistical power low, results indicate that the association between optimal codons and aggregation-prone residues exists both at buried and at exposed sites. By comparing codon usage at different combinations of sites (exposed, aggregation-prone sites versus buried, non-aggregation-prone sites; buried, aggregation-prone sites versus exposed, non-aggregation-prone sites), we find that aggregation propensity and solvent accessibility seem to have independent effects of (on average) comparable magnitude on codon usage. Finally, in fly, we assess whether optimal codons associate with sites at which amino acid substitutions lead to an increase in aggregation propensity, and find only a very weak effect. These results suggest that optimal codons may be required to reduce the frequency of translation errors at aggregation-prone sites that coincide with certain functional sites, such as protein-protein interfaces. Alternatively, optimal codons may be required for rapid translation of aggregation-prone regions.
Collapse
Affiliation(s)
- Yaelim Lee
- Institute for Cell and Molecular Biology, The University of Texas at Austin, Austin, TX, USA
| | | | | | | | | |
Collapse
|
46
|
Abstract
Despite their name, synonymous mutations have significant consequences for cellular processes in all taxa. As a result, an understanding of codon bias is central to fields as diverse as molecular evolution and biotechnology. Although recent advances in sequencing and synthetic biology have helped to resolve longstanding questions about codon bias, they have also uncovered striking patterns that suggest new hypotheses about protein synthesis. Ongoing work to quantify the dynamics of initiation and elongation is as important for understanding natural synonymous variation as it is for designing transgenes in applied contexts.
Collapse
Affiliation(s)
- Joshua B Plotkin
- Department of Biology and Program in Applied Mathematics and Computational Science, University of Pennsylvania, 433 South University Avenue, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
47
|
Weber CC, Hurst LD. Intronic AT skew is a defendable proxy for germline transcription but does not predict crossing-over or protein evolution rates in Drosophila melanogaster. J Mol Evol 2010; 71:415-26. [PMID: 20938653 DOI: 10.1007/s00239-010-9395-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 09/17/2010] [Indexed: 01/28/2023]
Abstract
Recent evidence suggests that germline transcription may affect both protein evolutionary rates, possibly mediated by repair processes, and recombination rates, possibly mediated by chromatin and epigenetic modification. Here, we test these propositions in Drosophila melanogaster. The challenge for such analyses is to provide defendable measures of germline gene expression. Intronic AT skew is a good candidate measure as it is thought to be a consequence, at least in part, of transcription-coupled repair. Prior evidence suggests that intronic AT skew in D. melanogaster is not affected by proximity to intron extremities and differs between transcribed DNA and flanking sequence. We now also establish that intronic AT skew is a defendable proxy for germline expression as (a) it is more similar than expected by chance between introns of the same gene (which is not accounted for by physical proximity), (b) is correlated with male germline expression, and (c) is more pronounced in broadly expressed genes. Furthermore, (d) a trend for intronic skew to differ between 3' and 5' ends of genes is particular to broadly expressed genes. Finally, (e) controlling for physical distance, introns of proximate genes are most different in skew if they have different tissue specificity. We find that intronic AT skew, employed as a proxy for germline transcription, correlates neither with recombination rates nor with the rate of protein evolution. We conclude that there is no prima facie evidence that germline expression modulates recombination rates or monotonically affects protein evolution rates in D. melanogaster.
Collapse
Affiliation(s)
- Claudia C Weber
- Department of Biology and Biochemistry, University of Bath, Bath, UK
| | | |
Collapse
|
48
|
Zhou T, Gu W, Wilke CO. Detecting positive and purifying selection at synonymous sites in yeast and worm. Mol Biol Evol 2010; 27:1912-22. [PMID: 20231333 DOI: 10.1093/molbev/msq077] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
We present a new computational method to identify positive and purifying selection at synonymous sites in yeast and worm. We define synonymous substitutions that change codons from preferred to unpreferred or vice versa as nonconservative synonymous substitutions and all other substitutions as conservative. Using a maximum-likelihood framework, we then test whether conservative and nonconservative synonymous substitutions occur at equal rates. Our approach replaces the standard rate of synonymous substitutions per synonymous site, dS, with two new rates, the conservative synonymous substitution rate (dS(C)) and the nonconservative synonymous substitution rate (dS(N)). Based on the ratio dS(N)/dS(C), we find that 0.05% of all yeast genes and none of worm genes show evidence of positive selection at synonymous sites (dS(N)/dS(C) > 1). On the other hand, 9.44% of all yeast genes and 5.12% of all worm genes show evidence of significant purifying selection on synonymous sites (dS(N)/dS(C) < 1). We also find that dS(N) correlates strongly with gene expression level, whereas the correlation between expression level and dS(C) is very weak. Thus, dS(N) captures most of the signal of selection for translational accuracy and speed, whereas dS(C) is not strongly influenced by this selection pressure. We suggest that the ratio dN/dS(C) may be more appropriate than the ratio dN/dS to identify positive or purifying selection on amino acids.
Collapse
Affiliation(s)
- Tong Zhou
- Center for Computational Biology and Bioinformatics, University of Texas at Austin, TX, USA
| | | | | |
Collapse
|
49
|
Gu W, Zhou T, Wilke CO. A universal trend of reduced mRNA stability near the translation-initiation site in prokaryotes and eukaryotes. PLoS Comput Biol 2010; 6:e1000664. [PMID: 20140241 PMCID: PMC2816680 DOI: 10.1371/journal.pcbi.1000664] [Citation(s) in RCA: 232] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2009] [Accepted: 12/30/2009] [Indexed: 12/05/2022] Open
Abstract
Recent studies have suggested that the thermodynamic stability of mRNA secondary structure near the start codon can regulate translation efficiency in Escherichia coli, and that translation is more efficient the less stable the secondary structure. We survey the complete genomes of 340 species for signals of reduced mRNA secondary structure near the start codon. Our analysis includes bacteria, archaea, fungi, plants, insects, fishes, birds, and mammals. We find that nearly all species show evidence for reduced mRNA stability near the start codon. The reduction in stability generally increases with increasing genomic GC content. In prokaryotes, the reduction also increases with decreasing optimal growth temperature. Within genomes, there is variation in the stability among genes, and this variation correlates with gene GC content, codon bias, and gene expression level. For birds and mammals, however, we do not find a genome-wide trend of reduced mRNA stability near the start codon. Yet the most GC rich genes in these organisms do show such a signal. We conclude that reduced stability of the mRNA secondary structure near the start codon is a universal feature of all cellular life. We suggest that the origin of this reduction is selection for efficient recognition of the start codon by initiator-tRNA. Synonymous mutations are mutations that change the nucleotide sequence of a gene without changing the amino-acid sequence. Because these mutations don't alter the expressed protein, they are frequently also called silent mutations. Yet increasing evidence demonstrates that synonymous mutations are not that silent. In particular, experimental work in Escherichia coli has shown that the choice of synonymous codons near the start codon can greatly influence protein production. Codons that allow the mRNA to fold into a stable secondary structure seem to inhibit efficient translation initiation. This observation suggests that selection should prefer reduced mRNA stability near the start codon in many organisms. Here, we show that this prediction generally holds true in most organisms, including bacteria, archaea, fungi, plants, insects, and fishes. In birds and mammals it doesn't hold true genome-wide, but it does hold true in the most GC-rich genes. In all organisms, the extent to which mRNA stability is reduced increases with increasing GC content. In prokaryotes, it also increases with decreasing optimal growing temperature. Thus, it seems that all organisms have to optimize their synonymous sites near the start codon to guarantee efficient protein translation.
Collapse
Affiliation(s)
- Wanjun Gu
- Key Laboratory of Child Development and Learning Science of Ministry of Education of China, Southeast University, Nanjing, Jiangsu, China
| | - Tong Zhou
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, United States of America
- Section of Integrative Biology, The University of Texas at Austin, Austin, Texas, United States of America
| | - Claus O. Wilke
- Center for Computational Biology and Bioinformatics, The University of Texas at Austin, Austin, Texas, United States of America
- Section of Integrative Biology, The University of Texas at Austin, Austin, Texas, United States of America
- Institute for Cell and Molecular Biology, The University of Texas at Austin, Austin, Texas, United States of America
- * E-mail:
| |
Collapse
|
50
|
Experimentally increased codon bias in the Drosophila Adh gene leads to an increase in larval, but not adult, alcohol dehydrogenase activity. Genetics 2009; 184:547-55. [PMID: 19966063 DOI: 10.1534/genetics.109.111294] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Although most amino acids can be encoded by more than one codon, the synonymous codons are not used with equal frequency. This phenomenon is known as codon bias and appears to be a universal feature of genomes. The translational selection hypothesis posits that the use of optimal codons, which match the most abundant species of isoaccepting tRNAs, results in increased translational efficiency and accuracy. Previous work demonstrated that the experimental reduction of codon bias in the Drosophila alcohol dehydrogenase (Adh) gene led to a significant decrease in ADH protein expression. In this study we performed the converse experiment: we replaced seven suboptimal leucine codons that occur naturally in the Drosophila melanogaster Adh gene with the optimal codon. We then compared the in vivo ADH activities imparted by the wild-type and mutant alleles. The introduction of optimal leucine codons led to an increase in ADH activity in third-instar larvae. In adult flies, however, the introduction of optimal codons led to a decrease in ADH activity. There is no evidence that other selectively constrained features of the Adh gene, or its rate of transcription, were altered by the synonymous replacements. These results are consistent with translational selection for codon bias being stronger in the larval stage and suggest that there may be a selective conflict over optimal codon usage between different developmental stages.
Collapse
|