1
|
Farookhi H, Xia X. Differential Selection for Translation Efficiency Shapes Translation Machineries in Bacterial Species. Microorganisms 2024; 12:768. [PMID: 38674712 PMCID: PMC11052298 DOI: 10.3390/microorganisms12040768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/01/2024] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open
Abstract
Different bacterial species have dramatically different generation times, from 20-30 min in Escherichia coli to about two weeks in Mycobacterium leprae. The translation machinery in a cell needs to synthesize all proteins for a new cell in each generation. The three subprocesses of translation, i.e., initiation, elongation, and termination, are expected to be under stronger selection pressure to optimize in short-generation bacteria (SGB) such as Vibrio natriegens than in the long-generation Mycobacterium leprae. The initiation efficiency depends on the start codon decoded by the initiation tRNA, the optimal Shine-Dalgarno (SD) decoded by the anti-SD (aSD) sequence on small subunit rRNA, and the secondary structure that may embed the initiation signals and prevent them from being decoded. The elongation efficiency depends on the tRNA pool and codon usage. The termination efficiency in bacteria depends mainly on the nature of the stop codon and the nucleotide immediately downstream of the stop codon. By contrasting SGB with long-generation bacteria (LGB), we predict (1) SGB to have more ribosome RNA operons to produce ribosomes, and more tRNA genes for carrying amino acids to ribosomes, (2) SGB to have a higher percentage of genes using AUG as the start codon and UAA as the stop codon than LGB, (3) SGB to exhibit better codon and anticodon adaptation than LGB, and (4) SGB to have a weaker secondary structure near the translation initiation signals than LGB. These differences between SGB and LGB should be more pronounced in highly expressed genes than the rest of the genes. We present empirical evidence in support of these predictions.
Collapse
Affiliation(s)
- Heba Farookhi
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
| | - Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| |
Collapse
|
2
|
Ho AT, Hurst LD. Stop codon usage as a window into genome evolution: mutation, selection, biased gene conversion and the TAG paradox. Genome Biol Evol 2022; 14:6648529. [PMID: 35867377 PMCID: PMC9348620 DOI: 10.1093/gbe/evac115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2022] [Indexed: 11/16/2022] Open
Abstract
Protein coding genes terminate with one of three stop codons (TAA, TGA, or TAG) that, like synonymous codons, are not employed equally. With TGA and TAG having identical nucleotide content, analysis of their differential usage provides an unusual window into the forces operating on what are ostensibly functionally identical residues. Across genomes and between isochores within the human genome, TGA usage increases with G + C content but, with a common G + C → A + T mutation bias, this cannot be explained by mutation bias-drift equilibrium. Increased usage of TGA in G + C-rich genomes or genomic regions is also unlikely to reflect selection for the optimal stop codon, as TAA appears to be universally optimal, probably because it has the lowest read-through rate. Despite TAA being favored by selection and mutation bias, as with codon usage bias G + C pressure is the prime determinant of between-species TGA usage trends. In species with strong G + C-biased gene conversion (gBGC), such as mammals and birds, the high usage and conservation of TGA is best explained by an A + T → G + C repair bias. How to explain TGA enrichment in other G + C-rich genomes is less clear. Enigmatically, across bacterial and archaeal species and between human isochores TAG usage is mostly unresponsive to G + C pressure. This unresponsiveness we dub the TAG paradox as currently no mutational, selective, or gBGC model provides a well-supported explanation. That TAG does increase with G + C usage across eukaryotes makes the usage elsewhere yet more enigmatic. We suggest resolution of the TAG paradox may provide insights into either an unknown but common selective preference (probably at the DNA/RNA level) or an unrecognized complexity to the action of gBGC.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, UK
| | | |
Collapse
|
3
|
Why Is the UAG (Amber) Stop Codon Almost Absent in Highly Expressed Bacterial Genes? Life (Basel) 2022; 12:life12030431. [PMID: 35330182 PMCID: PMC8954436 DOI: 10.3390/life12030431] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 03/14/2022] [Accepted: 03/14/2022] [Indexed: 11/17/2022] Open
Abstract
The genome hypothesis postulates that genes in a genome tend to conform to their species’ usage of the codon catalog and the GC content of the DNA. Thus, codon frequencies differ across organisms, including the three termination codons in the standard genetic code. Here, we analyze the frequencies of stop codons in a group of highly expressed genes from 196 prokaryotes under strong translational selection. The occurrence of the three translation termination codons is highly biased, with UAA (ochre) being the most prevalent in almost all bacteria. In contrast, UAG (amber) is the least frequent termination codon, e.g., only 321 occurrences (7.4%) in E. coli K-12 substr. W3110. Of the 253 highly expressed genes, only two end with an UAG codon. The strength of the selective bias against UAG in highly expressed genes varies among bacterial genomes, but it is not affected by the GC content of these genomes. In contrast, increased GC content results in a decrease in UAA abundance with a concomitant increase in UGA abundance. We propose that readthrough efficiency and context effects could explain the prevalence of UAA over UAG, particularly in highly expressed genes. Findings from this communication can be utilized for the optimization of gene expression.
Collapse
|
4
|
Ho AT, Hurst LD. Variation in Release Factor Abundance Is Not Needed to Explain Trends in Bacterial Stop Codon Usage. Mol Biol Evol 2022; 39:msab326. [PMID: 34751397 PMCID: PMC8789281 DOI: 10.1093/molbev/msab326] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
5
|
Ho AT, Hurst LD. Effective Population Size Predicts Local Rates but Not Local Mitigation of Read-through Errors. Mol Biol Evol 2021; 38:244-262. [PMID: 32797190 PMCID: PMC7783166 DOI: 10.1093/molbev/msaa210] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Corresponding author: E-mail:
| | - Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
6
|
Adachi H, Yu YT. Pseudouridine-mediated stop codon readthrough in S. cerevisiae is sequence context-independent. RNA (NEW YORK, N.Y.) 2020; 26:1247-1256. [PMID: 32434780 PMCID: PMC7430670 DOI: 10.1261/rna.076042.120] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 05/19/2020] [Indexed: 05/26/2023]
Abstract
We have previously shown that when the uridine of a stop codon (UAA, UAG, or UGA) is pseudouridylated, the ribosome reads through the modified stop codon. However, it is not clear as to whether or not the pseudouridine (Ψ)-mediated readthrough is dependent on the sequence context of mRNA. Here, we use several different approaches and the yeast system to address this question. We show that when a stop codon (premature termination codon, PTC) is introduced into the coding region of a reporter mRNA at several different positions (with different sequence contexts) and pseudouridylated, we detect similar levels of readthrough. Using mutational and selection/screen analyses, we also show that the upstream sequence (relative to PTC) as well as the nucleotides surrounding the PTC (upstream and downstream) play a minimal role (if at all) in Ψ-mediated ribosome readthrough. Interestingly, we detect no suppression of NMD (nonsense-mediated mRNA decay) by targeted PTC pseudouridylation in the yeast system. Our results indicate that Ψ-mediated nonsense suppression occurs at the translational level, and that the suppression is sequence context-independent, unlike some previously characterized rare stop codon readthrough events.
Collapse
Affiliation(s)
- Hironori Adachi
- Department of Biochemistry and Biophysics, Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Yi-Tao Yu
- Department of Biochemistry and Biophysics, Center for RNA Biology, University of Rochester Medical Center, Rochester, New York 14642, USA
| |
Collapse
|
7
|
Sharma J, Keeling KM, Rowe SM. Pharmacological approaches for targeting cystic fibrosis nonsense mutations. Eur J Med Chem 2020; 200:112436. [PMID: 32512483 DOI: 10.1016/j.ejmech.2020.112436] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 05/04/2020] [Accepted: 05/06/2020] [Indexed: 12/11/2022]
Abstract
Cystic fibrosis (CF) is a monogenic autosomal recessive disorder. The clinical manifestations of the disease are caused by ∼2,000 mutations in the cystic fibrosis transmembrane conductance regulator (CFTR) protein. It is unlikely that any one approach will be efficient in correcting all defects. The recent approvals of ivacaftor, lumacaftor/ivacaftor and elexacaftor/tezacaftor/ivacaftor represent the genesis of a new era of precision combination medicine for the CF patient population. In this review, we discuss targeted translational readthrough approaches as mono and combination therapies for CFTR nonsense mutations. We examine the current status of efficacy of translational readthrough/nonsense suppression therapies and their limitations, including non-native amino acid incorporation at PTCs and nonsense-mediated mRNA decay (NMD), along with approaches to tackle these limitations. We further elaborate on combining various therapies such as readthrough agents, NMD inhibitors, and corrector/potentiators to improve the efficacy and safety of suppression therapy. These mutation specific strategies that are directed towards the basic CF defects should positively impact CF patients bearing nonsense mutations.
Collapse
Affiliation(s)
- Jyoti Sharma
- Department of Medicine, University of Alabama at Birmingham (UAB), USA; Department of Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham (UAB), USA
| | - Kim M Keeling
- Department of Biochemistry and Molecular Genetics, University of Alabama at Birmingham (UAB), USA; Department of Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham (UAB), USA
| | - Steven M Rowe
- Department of Medicine, University of Alabama at Birmingham (UAB), USA; Department of Pediatrics, University of Alabama at Birmingham (UAB), USA; Department of Gregory Fleming James Cystic Fibrosis Research Center, University of Alabama at Birmingham (UAB), USA.
| |
Collapse
|
8
|
Ho AT, Hurst LD. In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons. PLoS Genet 2019; 15:e1008386. [PMID: 31527909 PMCID: PMC6764699 DOI: 10.1371/journal.pgen.1008386] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 09/27/2019] [Accepted: 08/27/2019] [Indexed: 12/23/2022] Open
Abstract
Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense ‘codons’ downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3’ sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates. In all organisms, gene expression is error-prone. One such error, translational read-through, occurs where the primary stop codon of an expressed gene is missed by the translational machinery. Failure to terminate is likely to be costly, hence genomes are under selection to prevent this from happening. One proposed error-proofing strategy involves in-frame proximal additional stop codons (ASCs) which may act as a ‘fail-safe’ mechanism by providing another opportunity for translation to terminate. There is evidence for ASC enrichment in several eukaryotes. We extend this evidence showing it to be common but not universal in single celled eukaryotes. However, the situation in bacteria is poorly understood, despite bacteria having high read-through rates. Here, we test the fail-safe hypothesis within a broad range of bacteria. To our surprise, we find that not only are ASCs not enriched, but they may even be selected against. This provides evidence for an unusual circumstance where eukaryotes and prokaryotes could solve the same problem the same way but don’t. What are we to make of this? We suggest that if read-through is the problem, ASCs are not necessarily the expected solution. Owing to the absence of nonsense-mediated decay, a process that makes gene truncation in eukaryotes less viable, we propose bacteria may rescue a leaky stop by mutation that creates a new stop upstream. Alternatively, raised read-through rates in some particular conditions in eukaryotes might explain the difference.
Collapse
Affiliation(s)
- Alexander T. Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | - Laurence D. Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
9
|
Xia X. Translation Control of HAC1 by Regulation of Splicing in Saccharomyces cerevisiae. Int J Mol Sci 2019; 20:ijms20122860. [PMID: 31212749 PMCID: PMC6627864 DOI: 10.3390/ijms20122860] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Revised: 05/30/2019] [Accepted: 06/10/2019] [Indexed: 12/19/2022] Open
Abstract
Hac1p is a key transcription factor regulating the unfolded protein response (UPR) induced by abnormal accumulation of unfolded/misfolded proteins in the endoplasmic reticulum (ER) in Saccharomyces cerevisiae. The accumulation of unfolded/misfolded proteins is sensed by protein Ire1p, which then undergoes trans-autophosphorylation and oligomerization into discrete foci on the ER membrane. HAC1 pre-mRNA, which is exported to the cytoplasm but is blocked from translation by its intron sequence looping back to its 5’UTR to form base-pair interaction, is transported to the Ire1p foci to be spliced, guided by a cis-acting bipartite element at its 3’UTR (3’BE). Spliced HAC1 mRNA can be efficiently translated. The resulting Hac1p enters the nucleus and activates, together with coactivators, a large number of genes encoding proteins such as protein chaperones to restore and maintain ER homeostasis and secretary protein quality control. This review details the translation regulation of Hac1p production, mediated by the nonconventional splicing, in the broad context of translation control and summarizes the evolution and diversification of the UPR signaling pathway among fungal, metazoan and plant lineages.
Collapse
Affiliation(s)
- Xuhua Xia
- Department of Biology, University of Ottawa, Marie-Curie Private, Ottawa, ON K1N 9A7, Canada.
| |
Collapse
|
10
|
'Stop' in protein synthesis is modulated with exquisite subtlety by an extended RNA translation signal. Biochem Soc Trans 2018; 46:1615-1625. [PMID: 30420414 DOI: 10.1042/bst20180190] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Revised: 09/30/2018] [Accepted: 10/04/2018] [Indexed: 02/08/2023]
Abstract
Translational stop codons, UAA, UAG, and UGA, form an integral part of the universal genetic code. They are of significant interest today for their underlying fundamental role in terminating protein synthesis, but also for their potential utilisation for programmed alternative translation events. In diverse organisms, UAA has wide usage, but it is puzzling that the high fidelity UAG is selected against and yet UGA, vulnerable to suppression, is widely used, particularly in those archaeal and bacterial genomes with a high GC content. In canonical protein synthesis, stop codons are interpreted by protein release factors that structurally and functionally mimic decoding tRNAs and occupy the decoding site on the ribosome. The release factors make close contact with the decoding complex through multiple interactions. Correct interactions cause conformational changes resulting in new and enhanced contacts with the ribosome, particularly between specific bases in the mRNA and rRNA. The base following the stop codon (fourth or +4 base) may strongly influence decoding efficiency, facilitating alternative non-canonical events like frameshifting or selenocysteine incorporation. The fourth base is drawn into the decoding site with a compacted stop codon in the eukaryotic termination complex. Surprisingly, mRNA sequences upstream and downstream of this core tetranucleotide signal have a significant influence on the strength of the signal. Since nine bases downstream of the stop codon are within the mRNA channel, their interactions with rRNA, and r-proteins may affect efficiency. With this understanding, it is now possible to design stop signals of desired strength for specific applied purposes.
Collapse
|
11
|
Abrahams L, Hurst LD. Refining the Ambush Hypothesis: Evidence That GC- and AT-Rich Bacteria Employ Different Frameshift Defence Strategies. Genome Biol Evol 2018; 10:1153-1173. [PMID: 29617761 PMCID: PMC5909447 DOI: 10.1093/gbe/evy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2018] [Indexed: 12/13/2022] Open
Abstract
Stop codons are frequently selected for beyond their regular termination function for error control. The “ambush hypothesis” proposes out-of-frame stop codons (OSCs) terminating frameshifted translations are selected for. Although early indirect evidence was partially supportive, recent evidence suggests OSC frequencies are not exceptional when considering underlying nucleotide content. However, prior null tests fail to control amino acid/codon usages or possible local mutational biases. We therefore return to the issue using bacterial genomes, considering several tests defining and testing against a null. We employ simulation approaches preserving amino acid order but shuffling synonymous codons or preserving codons while shuffling amino acid order. Additionally, we compare codon usage in amino acid pairs, where one codon can but the next, otherwise identical codon, cannot encode an OSC. OSC frequencies exceed expectations typically in AT-rich genomes, the +1 frame and for TGA/TAA but not TAG. With this complex evidence, simply rejecting or accepting the ambush hypothesis is not warranted. We propose a refined post hoc model, whereby AT-rich genomes have more accidental frameshifts, handled by RF2–RF3 complexes (associated with TGA/TAA) and are mostly +1 (or −2) slips. Supporting this, excesses positively correlate with in silico predicted frameshift probabilities. Thus, we propose a more viable framework, whereby genomes broadly adopt one of the two strategies to combat frameshifts: preventing frameshifting (GC-rich) or permitting frameshifts but minimizing impacts when most are caught early (AT-rich). Our refined framework holds promise yet some features, such as the bias of out-of-frame sense codons, remain unexplained.
Collapse
Affiliation(s)
- Liam Abrahams
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom
| |
Collapse
|
12
|
Abstract
Codon usage depends on mutation bias, tRNA-mediated selection, and the need for high efficiency and accuracy in translation. One codon in a synonymous codon family is often strongly over-used, especially in highly expressed genes, which often leads to a high dN/dS ratio because dS is very small. Many different codon usage indices have been proposed to measure codon usage and codon adaptation. Sense codon could be misread by release factors and stop codons misread by tRNAs, which also contribute to codon usage in rare cases. This chapter outlines the conceptual framework on codon evolution, illustrates codon-specific and gene-specific codon usage indices, and presents their applications. A new index for codon adaptation that accounts for background mutation bias (Index of Translation Elongation) is presented and contrasted with codon adaptation index (CAI) which does not consider background mutation bias. They are used to re-analyze data from a recent paper claiming that translation elongation efficiency matters little in protein production. The reanalysis disproves the claim.
Collapse
|
13
|
Purifying and positive selection in the evolution of stop codons. Sci Rep 2018; 8:9260. [PMID: 29915293 PMCID: PMC6006363 DOI: 10.1038/s41598-018-27570-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 05/18/2018] [Indexed: 12/13/2022] Open
Abstract
Modes of evolution of stop codons in protein-coding genes, especially the conservation of UAA, have been debated for many years. We reconstructed the evolution of stop codons in 40 groups of closely related prokaryotic and eukaryotic genomes. The results indicate that the UAA codons are maintained by purifying selection in all domains of life. In contrast, positive selection appears to drive switches from UAG to other stop codons in prokaryotes but not in eukaryotes. Changes in stop codons are significantly associated with increased substitution frequency immediately downstream of the stop. These positions are otherwise more strongly conserved in evolution compared to sites farther downstream, suggesting that such substitutions are compensatory. Although GC content has a major impact on stop codon frequencies, its contribution to the decreased frequency of UAA differs between bacteria and archaea, presumably, due to differences in their translation termination mechanisms.
Collapse
|
14
|
Xia X. DAMBE6: New Tools for Microbial Genomics, Phylogenetics, and Molecular Evolution. J Hered 2017; 108:431-437. [PMID: 28379490 PMCID: PMC5434544 DOI: 10.1093/jhered/esx033] [Citation(s) in RCA: 238] [Impact Index Per Article: 29.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Accepted: 04/01/2017] [Indexed: 12/21/2022] Open
Abstract
DAMBE is a comprehensive software workbench for data analysis in molecular biology, phylogenetics, and evolution. Several important new functions have been added since version 5 of DAMBE: 1) comprehensive genomic profiling of translation initiation efficiency of different genes in different prokaryotic species, 2) a new index of translation elongation (ITE) that takes into account both tRNA-mediated selection and background mutation on codon–anticodon adaptation, 3) a new and accurate phylogenetic approach based on pairwise alignment only, which is useful for highly divergent sequences from which a reliable multiple sequence alignment is difficult to obtain. Many other functions have been updated and improved including PWM for motif characterization, Gibbs sampler for de novo motif discovery, hidden Markov models for protein secondary structure prediction, self-organizing map for nonlinear clustering of transcriptomic data, comprehensive sequence alignment, and phylogenetic functions. DAMBE features a graphic, user-friendly and intuitive interface, and is freely available from http://dambe.bio.uottawa.ca.
Collapse
Affiliation(s)
- Xuhua Xia
- From the Department of Biology and Center for Advanced Research in Environmental Genomics, University of Ottawa, 30 Marie Curie, PO Box 450, Station A, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
15
|
Devi M, Lyngdoh RD. Favored and less favored codon–anticodon duplexes arising from the GC codon family box encoding for alanine: some computational perspectives. J Biomol Struct Dyn 2017; 36:1029-1049. [DOI: 10.1080/07391102.2017.1308886] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Martina Devi
- Department of Chemistry, North-Eastern Hill University, Shillong 793022, India
| | - R.H. Duncan Lyngdoh
- Department of Chemistry, North-Eastern Hill University, Shillong 793022, India
| |
Collapse
|