1
|
Ward C, Beharry A, Tennakoon R, Rozik P, Wilhelm SDP, Heinemann IU, O'Donoghue P. Mechanisms and Delivery of tRNA Therapeutics. Chem Rev 2024; 124:7976-8008. [PMID: 38801719 DOI: 10.1021/acs.chemrev.4c00142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Transfer ribonucleic acid (tRNA) therapeutics will provide personalized and mutation specific medicines to treat human genetic diseases for which no cures currently exist. The tRNAs are a family of adaptor molecules that interpret the nucleic acid sequences in our genes into the amino acid sequences of proteins that dictate cell function. Humans encode more than 600 tRNA genes. Interestingly, even healthy individuals contain some mutant tRNAs that make mistakes. Missense suppressor tRNAs insert the wrong amino acid in proteins, and nonsense suppressor tRNAs read through premature stop signals to generate full length proteins. Mutations that underlie many human diseases, including neurodegenerative diseases, cancers, and diverse rare genetic disorders, result from missense or nonsense mutations. Thus, specific tRNA variants can be strategically deployed as therapeutic agents to correct genetic defects. We review the mechanisms of tRNA therapeutic activity, the nature of the therapeutic window for nonsense and missense suppression as well as wild-type tRNA supplementation. We discuss the challenges and promises of delivering tRNAs as synthetic RNAs or as gene therapies. Together, tRNA medicines will provide novel treatments for common and rare genetic diseases in humans.
Collapse
|
2
|
Mangkalaphiban K, Fu L, Du M, Thrasher K, Keeling KM, Bedwell DM, Jacobson A. Extended stop codon context predicts nonsense codon readthrough efficiency in human cells. Nat Commun 2024; 15:2486. [PMID: 38509072 PMCID: PMC10954755 DOI: 10.1038/s41467-024-46703-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Accepted: 03/06/2024] [Indexed: 03/22/2024] Open
Abstract
Protein synthesis terminates when a stop codon enters the ribosome's A-site. Although termination is efficient, stop codon readthrough can occur when a near-cognate tRNA outcompetes release factors during decoding. Seeking to understand readthrough regulation we used a machine learning approach to analyze readthrough efficiency data from published HEK293T ribosome profiling experiments and compared it to comparable yeast experiments. We obtained evidence for the conservation of identities of the stop codon, its context, and 3'-UTR length (when termination is compromised), but not the P-site codon, suggesting a P-site tRNA role in readthrough regulation. Models trained on data from cells treated with the readthrough-promoting drug, G418, accurately predicted readthrough of premature termination codons arising from CFTR nonsense alleles that cause cystic fibrosis. This predictive ability has the potential to aid development of nonsense suppression therapies by predicting a patient's likelihood of improvement in response to drugs given their nonsense mutation sequence context.
Collapse
Affiliation(s)
- Kotchaphorn Mangkalaphiban
- Department of Microbiology and Physiological Systems, UMass Chan Medical School, 368 Plantation Street, Worcester, MA, 01655, USA
- Department of Genomics and Computational Biology, UMass Chan Medical School, 368 Plantation Street, Worcester, MA, 01655, USA
| | - Lianwu Fu
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, 845 19th Street South, Birmingham, AL, 35294, USA
| | - Ming Du
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, 845 19th Street South, Birmingham, AL, 35294, USA
| | - Kari Thrasher
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, 845 19th Street South, Birmingham, AL, 35294, USA
| | - Kim M Keeling
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, 845 19th Street South, Birmingham, AL, 35294, USA
| | - David M Bedwell
- Department of Biochemistry and Molecular Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, 845 19th Street South, Birmingham, AL, 35294, USA
| | - Allan Jacobson
- Department of Microbiology and Physiological Systems, UMass Chan Medical School, 368 Plantation Street, Worcester, MA, 01655, USA.
| |
Collapse
|
3
|
Dimonaco NJ, Clare A, Kenobi K, Aubrey W, Creevey CJ. StORF-Reporter: finding genes between genes. Nucleic Acids Res 2023; 51:11504-11517. [PMID: 37897345 PMCID: PMC10682499 DOI: 10.1093/nar/gkad814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 09/04/2023] [Accepted: 09/27/2023] [Indexed: 10/30/2023] Open
Abstract
Large regions of prokaryotic genomes are currently without any annotation, in part due to well-established limitations of annotation tools. For example, it is routine for genes using alternative start codons to be misreported or completely omitted. Therefore, we present StORF-Reporter, a tool that takes an annotated genome and returns regions that may contain missing CDS genes from unannotated regions. StORF-Reporter consists of two parts. The first begins with the extraction of unannotated regions from an annotated genome. Next, Stop-ORFs (StORFs) are identified in these unannotated regions. StORFs are open reading frames that are delimited by stop codons and thus can capture those genes most often missing in genome annotations. We show this methodology recovers genes missing from canonical genome annotations. We inspect the results of the genomes of model organisms, the pangenome of Escherichia coli, and a set of 5109 prokaryotic genomes of 247 genera from the Ensembl Bacteria database. StORF-Reporter extended the core, soft-core and accessory gene collections, identified novel gene families and extended families into additional genera. The high levels of sequence conservation observed between genera suggest that many of these StORFs are likely to be functional genes that should now be considered for inclusion in canonical annotations.
Collapse
Affiliation(s)
- Nicholas J Dimonaco
- Institute of Biological, Environmental and Rural Sciences, Aberystwyth University, Aberystwyth SY23 3PD, Wales, UK
- Department of Computer Science, Aberystwyth University, Aberystwyth SY23 3DB, Wales, UK
- Department of Medicine, McMaster University, Hamilton, ON, Canada
- Farncombe Family Digestive Health Research Institute, McMaster University, Hamilton, ON, Canada
- School of Biological Sciences, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, UK
| | - Amanda Clare
- Department of Computer Science, Aberystwyth University, Aberystwyth SY23 3DB, Wales, UK
| | - Kim Kenobi
- Department of Mathematics, Aberystwyth University, Aberystwyth SY23 3BZ, Wales, UK
| | - Wayne Aubrey
- Department of Computer Science, Aberystwyth University, Aberystwyth SY23 3DB, Wales, UK
| | - Christopher J Creevey
- School of Biological Sciences, Queen’s University Belfast, Belfast BT7 1NN, Northern Ireland, UK
| |
Collapse
|
4
|
Trexler M, Bányai L, Kerekes K, Patthy L. Evolution of termination codons of proteins and the TAG-TGA paradox. Sci Rep 2023; 13:14294. [PMID: 37653005 PMCID: PMC10471768 DOI: 10.1038/s41598-023-41410-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 08/25/2023] [Indexed: 09/02/2023] Open
Abstract
In most eukaryotes and prokaryotes TGA is used at a significantly higher frequency than TAG as termination codon of protein-coding genes. Although this phenomenon has been recognized several years ago, there is no generally accepted explanation for the TAG-TGA paradox. Our analyses of human mutation data revealed that out of the eighteen sense codons that can give rise to a nonsense codon by single base substitution, the CGA codon is exceptional: it gives rise to the TGA stop codon at an order of magnitude higher rate than the other codons. Here we propose that the TAG-TGA paradox is due to methylation and hypermutabilty of CpG dinucleotides. In harmony with this explanation, we show that the coding genomes of organisms with strong CpG methylation have a significant bias for TGA whereas those from organisms that lack CpG methylation use TGA and TAG termination codons with similar probability.
Collapse
Affiliation(s)
- Mária Trexler
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary
| | - László Bányai
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary
| | - Krisztina Kerekes
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary
| | - László Patthy
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, 1117, Hungary.
| |
Collapse
|
5
|
Belinky F, Bykova A, Yurchenko V, Rogozin IB. No evidence for widespread positive selection on double substitutions within codons in primates and yeasts. Front Genet 2022; 13:991249. [PMID: 36159983 PMCID: PMC9500374 DOI: 10.3389/fgene.2022.991249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open
Abstract
Nucleotide substitutions in protein-coding genes can be divided into synonymous (S) and non-synonymous (N) ones that alter amino acids (including nonsense mutations causing stop codons). The S substitutions are expected to have little effect on function. The N substitutions almost always are affected by strong purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases can modulate the deleterious effect of single N substitutions and, thus, could be subjected to the positive selection. This effect has been demonstrated for mutations in the serine codons, stop codons and double N substitutions in prokaryotes. In all abovementioned cases, a novel technique was applied that allows elucidating the effects of selection on double substitutions considering mutational biases. Here, we applied the same technique to study double N substitutions in eukaryotic lineages of primates and yeast. We identified markedly fewer cases of purifying selection relative to prokaryotes and no evidence of codon double substitutions under positive selection. This is consistent with previous studies of serine codons in primates and yeast. In general, the obtained results strongly suggest that there are major differences between studied pro- and eukaryotes; double substitutions in primates and yeasts largely reflect mutational biases and are not hallmarks of selection. This is especially important in the context of detection of positive selection in codons because it has been suggested that multiple mutations in codons cause false inferences of lineage-specific site positive selection. It is likely that this concern is applicable to previously studied prokaryotes but not to primates and yeasts where markedly fewer double substitutions are affected by positive selection.
Collapse
Affiliation(s)
- Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Anastassia Bykova
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
- *Correspondence: Vyacheslav Yurchenko, ; Igor B. Rogozin,
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
- *Correspondence: Vyacheslav Yurchenko, ; Igor B. Rogozin,
| |
Collapse
|
6
|
Ho AT, Hurst LD. Stop codon usage as a window into genome evolution: mutation, selection, biased gene conversion and the TAG paradox. Genome Biol Evol 2022; 14:6648529. [PMID: 35867377 PMCID: PMC9348620 DOI: 10.1093/gbe/evac115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2022] [Indexed: 11/16/2022] Open
Abstract
Protein coding genes terminate with one of three stop codons (TAA, TGA, or TAG) that, like synonymous codons, are not employed equally. With TGA and TAG having identical nucleotide content, analysis of their differential usage provides an unusual window into the forces operating on what are ostensibly functionally identical residues. Across genomes and between isochores within the human genome, TGA usage increases with G + C content but, with a common G + C → A + T mutation bias, this cannot be explained by mutation bias-drift equilibrium. Increased usage of TGA in G + C-rich genomes or genomic regions is also unlikely to reflect selection for the optimal stop codon, as TAA appears to be universally optimal, probably because it has the lowest read-through rate. Despite TAA being favored by selection and mutation bias, as with codon usage bias G + C pressure is the prime determinant of between-species TGA usage trends. In species with strong G + C-biased gene conversion (gBGC), such as mammals and birds, the high usage and conservation of TGA is best explained by an A + T → G + C repair bias. How to explain TGA enrichment in other G + C-rich genomes is less clear. Enigmatically, across bacterial and archaeal species and between human isochores TAG usage is mostly unresponsive to G + C pressure. This unresponsiveness we dub the TAG paradox as currently no mutational, selective, or gBGC model provides a well-supported explanation. That TAG does increase with G + C usage across eukaryotes makes the usage elsewhere yet more enigmatic. We suggest resolution of the TAG paradox may provide insights into either an unknown but common selective preference (probably at the DNA/RNA level) or an unrecognized complexity to the action of gBGC.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, UK
| | | |
Collapse
|
7
|
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Collapse
Affiliation(s)
- Alexander Thomas Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | | |
Collapse
|
8
|
Why Is the UAG (Amber) Stop Codon Almost Absent in Highly Expressed Bacterial Genes? Life (Basel) 2022; 12:life12030431. [PMID: 35330182 PMCID: PMC8954436 DOI: 10.3390/life12030431] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 03/14/2022] [Accepted: 03/14/2022] [Indexed: 11/17/2022] Open
Abstract
The genome hypothesis postulates that genes in a genome tend to conform to their species’ usage of the codon catalog and the GC content of the DNA. Thus, codon frequencies differ across organisms, including the three termination codons in the standard genetic code. Here, we analyze the frequencies of stop codons in a group of highly expressed genes from 196 prokaryotes under strong translational selection. The occurrence of the three translation termination codons is highly biased, with UAA (ochre) being the most prevalent in almost all bacteria. In contrast, UAG (amber) is the least frequent termination codon, e.g., only 321 occurrences (7.4%) in E. coli K-12 substr. W3110. Of the 253 highly expressed genes, only two end with an UAG codon. The strength of the selective bias against UAG in highly expressed genes varies among bacterial genomes, but it is not affected by the GC content of these genomes. In contrast, increased GC content results in a decrease in UAA abundance with a concomitant increase in UGA abundance. We propose that readthrough efficiency and context effects could explain the prevalence of UAA over UAG, particularly in highly expressed genes. Findings from this communication can be utilized for the optimization of gene expression.
Collapse
|
9
|
Ho AT, Hurst LD. Variation in Release Factor Abundance Is Not Needed to Explain Trends in Bacterial Stop Codon Usage. Mol Biol Evol 2022; 39:msab326. [PMID: 34751397 PMCID: PMC8789281 DOI: 10.1093/molbev/msab326] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
10
|
Papadopoulos C, Callebaut I, Gelly JC, Hatin I, Namy O, Renard M, Lespinet O, Lopes A. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res 2021; 31:2303-2315. [PMID: 34810219 PMCID: PMC8647833 DOI: 10.1101/gr.275638.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 09/23/2021] [Indexed: 01/08/2023]
Abstract
The noncoding genome plays an important role in de novo gene birth and in the emergence of genetic novelty. Nevertheless, how noncoding sequences' properties could promote the birth of novel genes and shape the evolution and the structural diversity of proteins remains unclear. Therefore, by combining different bioinformatic approaches, we characterized the fold potential diversity of the amino acid sequences encoded by all intergenic open reading frames (ORFs) of S. cerevisiae with the aim of (1) exploring whether the structural states' diversity of proteomes is already present in noncoding sequences, and (2) estimating the potential of the noncoding genome to produce novel protein bricks that could either give rise to novel genes or be integrated into pre-existing proteins, thus participating in protein structure diversity and evolution. We showed that amino acid sequences encoded by most yeast intergenic ORFs contain the elementary building blocks of protein structures. Moreover, they encompass the large structural state diversity of canonical proteins, with the majority predicted as foldable. Then, we investigated the early stages of de novo gene birth by reconstructing the ancestral sequences of 70 yeast de novo genes and characterized the sequence and structural properties of intergenic ORFs with a strong translation signal. This enabled us to highlight sequence and structural factors determining de novo gene emergence. Finally, we showed a strong correlation between the fold potential of de novo proteins and one of their ancestral amino acid sequences, reflecting the relationship between the noncoding genome and the protein structure universe.
Collapse
Affiliation(s)
- Chris Papadopoulos
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005 Paris, France
| | - Jean-Christophe Gelly
- Université de Paris, Biologie Intégrée du Globule Rouge, UMR_S1134, BIGR, INSERM, F-75015 Paris, France
- Laboratoire d'Excellence GR-Ex, 75015 Paris, France
- Institut National de la Transfusion Sanguine, F-75015 Paris, France
| | - Isabelle Hatin
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Namy
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Maxime Renard
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Olivier Lespinet
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Anne Lopes
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| |
Collapse
|
11
|
Abstract
Codon usage bias is the preferential or non-random use of synonymous codons, a ubiquitous phenomenon observed in bacteria, plants and animals. Different species have consistent and characteristic codon biases. Codon bias varies not only with species, family or group within kingdom, but also between the genes within an organism. Codon usage bias has evolved through mutation, natural selection, and genetic drift in various organisms. Genome composition, GC content, expression level and length of genes, position and context of codons in the genes, recombination rates, mRNA folding, and tRNA abundance and interactions are some factors influencing codon bias. The factors shaping codon bias may also be involved in evolution of the universal genetic code. Codon-usage bias is critical factor determining gene expression and cellular function by influencing diverse processes such as RNA processing, protein translation and protein folding. Codon usage bias reflects the origin, mutation patterns and evolution of the species or genes. Investigations of codon bias patterns in genomes can reveal phylogenetic relationships between organisms, horizontal gene transfers, molecular evolution of genes and identify selective forces that drive their evolution. Most important application of codon bias analysis is in the design of transgenes, to increase gene expression levels through codon optimization, for development of transgenic crops. The review gives an overview of deviations of genetic code, factors influencing codon usage or bias, codon usage bias of nuclear and organellar genes, computational methods to determine codon usage and the significance as well as applications of codon usage analysis in biological research, with emphasis on plants.
Collapse
Affiliation(s)
| | - Varatharajalu Udayasuriyan
- Department of Biotechnology, Centre for Plant Molecular Biology and Biotechnology, Tamil Nadu Agricultural University, Coimbatore, 641003, India
| | - Vijaipal Bhadana
- ICAR-Indian Institute of Agricultural Biotechnology, Ranchi, Jharkhand, 834010, India
| |
Collapse
|
12
|
Bioinformatics Applications to Reveal Molecular Mechanisms of Gene Expression Regulation in Model Organisms. Int J Mol Sci 2021; 22:ijms222111973. [PMID: 34769403 PMCID: PMC8584893 DOI: 10.3390/ijms222111973] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2021] [Revised: 11/01/2021] [Accepted: 11/03/2021] [Indexed: 02/06/2023] Open
|
13
|
Katyal G, Ebanks B, Lucassen M, Papetti C, Chakrabarti L. Sequence and structure comparison of ATP synthase F0 subunits 6 and 8 in notothenioid fish. PLoS One 2021; 16:e0245822. [PMID: 34613983 PMCID: PMC8494342 DOI: 10.1371/journal.pone.0245822] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Accepted: 09/09/2021] [Indexed: 11/20/2022] Open
Abstract
Mitochondrial changes such as tight coupling of the mitochondria have facilitated sustained oxygen and respiratory activity in haemoglobin-less icefish of the Channichthyidae family. We aimed to characterise features in the sequence and structure of the proteins directly involved in proton transport, which have potential physiological implications. ATP synthase subunit a (ATP6) and subunit 8 (ATP8) are proteins that function as part of the F0 component (proton pump) of the F0F1complex. Both proteins are encoded by the mitochondrial genome and involved in oxidative phosphorylation. To explore mitochondrial sequence variation for ATP6 and ATP8 we analysed sequences from C. gunnari and C. rastrospinosus and compared them with their closely related red-blooded species and eight other vertebrate species. Our comparison of the amino acid sequence of these proteins reveals important differences that could underlie aspects of the unique physiology of the icefish. In this study we find that changes in the sequence of subunit a of the icefish C. gunnari at position 35 where there is a hydrophobic alanine which is not seen in the other notothenioids we analysed. An amino acid change of this type is significant since it may have a structural impact. The biology of the haemoglobin-less icefish is necessarily unique and any insights about these animals will help to generate a better overall understanding of important physiological pathways.
Collapse
Affiliation(s)
- Gunjan Katyal
- School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington, United Kingdom
| | - Brad Ebanks
- School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington, United Kingdom
| | | | | | - Lisa Chakrabarti
- School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington, United Kingdom
- MRC-Versus Arthritis Centre for Musculoskeletal Ageing Research, Nottingham, United Kingdom
| |
Collapse
|
14
|
Georgakopoulos-Soares I, Yizhar-Barnea O, Mouratidis I, Hemberg M, Ahituv N. Absent from DNA and protein: genomic characterization of nullomers and nullpeptides across functional categories and evolution. Genome Biol 2021; 22:245. [PMID: 34433494 PMCID: PMC8386077 DOI: 10.1186/s13059-021-02459-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 08/09/2021] [Indexed: 11/13/2022] Open
Abstract
Nullomers and nullpeptides are short DNA or amino acid sequences that are absent from a genome or proteome, respectively. One potential cause for their absence could be their having a detrimental impact on an organism. RESULTS: Here, we identify all possible nullomers and nullpeptides in the genomes and proteomes of thirty eukaryotes and demonstrate that a significant proportion of these sequences are under negative selection. We also identify nullomers that are unique to specific functional categories: coding sequences, exons, introns, 5'UTR, 3'UTR, promoters, and show that coding sequence and promoter nullomers are most likely to be selected against. By analyzing all protein sequences across the tree of life, we further identify 36,081 peptides up to six amino acids in length that do not exist in any known organism, termed primes. We next characterize all possible single base pair mutations that can lead to the appearance of a nullomer in the human genome, observing a significantly higher number of mutations than expected by chance for specific nullomer sequences in transposable elements, likely due to their suppression. We also annotate nullomers that appear due to naturally occurring variants and show that a subset of them can be used to distinguish between different human populations. Analysis of nullomers and nullpeptides across vertebrate evolution shows they can also be used as phylogenetic classifiers. CONCLUSIONS: We provide a catalog of nullomers and nullpeptides in distinct functional categories, develop methods to systematically study them, and highlight the use of variability in these sequences in other analyses.
Collapse
Affiliation(s)
- Ilias Georgakopoulos-Soares
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Ofer Yizhar-Barnea
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Ioannis Mouratidis
- Department of Computer Science, Katholieke Universiteit Leuven, Leuven, Belgium
| | - Martin Hemberg
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA.
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
15
|
Ho AT, Hurst LD. Effective Population Size Predicts Local Rates but Not Local Mitigation of Read-through Errors. Mol Biol Evol 2021; 38:244-262. [PMID: 32797190 PMCID: PMC7783166 DOI: 10.1093/molbev/msaa210] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Corresponding author: E-mail:
| | - Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
16
|
Belinky F, Ganguly I, Poliakov E, Yurchenko V, Rogozin IB. Analysis of Stop Codons within Prokaryotic Protein-Coding Genes Suggests Frequent Readthrough Events. Int J Mol Sci 2021; 22:ijms22041876. [PMID: 33672790 PMCID: PMC7918605 DOI: 10.3390/ijms22041876] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Revised: 02/05/2021] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open
Abstract
Nonsense mutations turn a coding (sense) codon into an in-frame stop codon that is assumed to result in a truncated protein product. Thus, nonsense substitutions are the hallmark of pseudogenes and are used to identify them. Here we show that in-frame stop codons within bacterial protein-coding genes are widespread. Their evolutionary conservation suggests that many of them are not pseudogenes, since they maintain dN/dS values (ratios of substitution rates at non-synonymous and synonymous sites) significantly lower than 1 (this is a signature of purifying selection in protein-coding regions). We also found that double substitutions in codons—where an intermediate step is a nonsense substitution—show a higher rate of evolution compared to null models, indicating that a stop codon was introduced and then changed back to sense via positive selection. This further supports the notion that nonsense substitutions in bacteria are relatively common and do not necessarily cause pseudogenization. In-frame stop codons may be an important mechanism of regulation: Such codons are likely to cause a substantial decrease of protein expression levels.
Collapse
Affiliation(s)
- Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (F.B.); (I.G.)
| | - Ishan Ganguly
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (F.B.); (I.G.)
| | - Eugenia Poliakov
- National Eye Institute, National Institutes of Health, Bethesda, MD 20892, USA;
| | - Vyacheslav Yurchenko
- Life Science Research Centre, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
- Martsinovsky Institute of Medical Parasitology, Tropical and Vector Borne Diseases, Sechenov University, 119435 Moscow, Russia
- Correspondence: (V.Y.); (I.B.R.)
| | - Igor B. Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA; (F.B.); (I.G.)
- Correspondence: (V.Y.); (I.B.R.)
| |
Collapse
|
17
|
Seoighe C, Kiniry SJ, Peters A, Baranov PV, Yang H. Selection Shapes Synonymous Stop Codon Use in Mammals. J Mol Evol 2020; 88:549-561. [PMID: 32617614 DOI: 10.1007/s00239-020-09957-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Accepted: 06/19/2020] [Indexed: 12/15/2022]
Abstract
Phylogenetic models of the evolution of protein-coding sequences can provide insights into the selection pressures that have shaped them. In the application of these models synonymous nucleotide substitutions, which do not alter the encoded amino acid, are often assumed to have limited functional consequences and used as a proxy for the neutral rate of evolution. The ratio of nonsynonymous to synonymous substitution rates is then used to categorize the selective regime that applies to the protein (e.g., purifying selection, neutral evolution, diversifying selection). Here, we extend the Muse and Gaut model of codon evolution to explore the extent of purifying selection acting on substitutions between synonymous stop codons. Using a large collection of coding sequence alignments, we estimate that a high proportion (approximately 57%) of mammalian genes are affected by selection acting on stop codon preference. This proportion varies substantially by codon, with UGA stop codons far more likely to be conserved. Genes with evidence of selection acting on synonymous stop codons have distinctive characteristics, compared to unconserved genes with the same stop codon, including longer [Formula: see text] untranslated regions (UTRs) and shorter mRNA half-life. The coding regions of these genes are also much more likely to be under strong purifying selection pressure. Our results suggest that the preference for UGA stop codons found in many multicellular eukaryotes is selective rather than mutational in origin.
Collapse
Affiliation(s)
- Cathal Seoighe
- School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Ireland.
| | - Stephen J Kiniry
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Andrew Peters
- School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Ireland
| | - Pavel V Baranov
- School of Biochemistry and Cell Biology, University College Cork, Cork, Ireland
| | - Haixuan Yang
- School of Mathematics, Statistics and Applied Mathematics, National University of Ireland Galway, Galway, Ireland
| |
Collapse
|
18
|
Wangen JR, Green R. Stop codon context influences genome-wide stimulation of termination codon readthrough by aminoglycosides. eLife 2020; 9:52611. [PMID: 31971508 PMCID: PMC7089771 DOI: 10.7554/elife.52611] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 01/22/2020] [Indexed: 12/14/2022] Open
Abstract
Stop codon readthrough (SCR) occurs when the ribosome miscodes at a stop codon. Such readthrough events can be therapeutically desirable when a premature termination codon (PTC) is found in a critical gene. To study SCR in vivo in a genome-wide manner, we treated mammalian cells with aminoglycosides and performed ribosome profiling. We find that in addition to stimulating readthrough of PTCs, aminoglycosides stimulate readthrough of normal termination codons (NTCs) genome-wide. Stop codon identity, the nucleotide following the stop codon, and the surrounding mRNA sequence context all influence the likelihood of SCR. In comparison to NTCs, downstream stop codons in 3′UTRs are recognized less efficiently by ribosomes, suggesting that targeting of critical stop codons for readthrough may be achievable without general disruption of translation termination. Finally, we find that G418-induced miscoding alters gene expression with substantial effects on translation of histone genes, selenoprotein genes, and S-adenosylmethionine decarboxylase (AMD1). Many genes provide a set of instructions needed to build a protein, which are read by structures called ribosomes through a process called translation. The genetic information contains a short, coded instruction called a stop codon which marks the end of the protein. When a ribosome finds a stop codon it should stop building and release the protein it has made. Ribosomes do not always stop at stop codons. Certain chemicals can actually prevent ribosomes from detecting stop codons correctly, and aminoglycosides are drugs that have exactly this effect. Aminoglycosides can be used as antibiotics at low doses because they interfere with ribosomes in bacteria, but at higher doses they can also prevent ribosomes from detecting stop codons in human cells. When ribosomes do not stop at a stop codon this is called readthrough. There are different types of stop codons and some are naturally more effective at stopping ribosomes than others. Wangen and Green have now examined the effect of an aminoglycoside called G418 on ribosomes in human cells grown in the laboratory. The results showed how ribosomes interacted with genetic information and revealed that certain stop codons are more affected by G418 than others. The stop codon and other genetic sequences around it affect the likelihood of readthrough. Wangen and Green also showed that sequences that encourage translation to stop are more common in the area around stop codons. These findings highlight an evolutionary pressure driving more genes to develop strong stop codons that resist readthrough. Despite this, some are still more affected by drugs like G418 than others. Some genetic conditions, like cystic fibrosis, result from incorrect stop codons in genes. Drugs that promote readthrough specifically in these genes could be useful new treatments.
Collapse
Affiliation(s)
- Jamie R Wangen
- Department of Molecular Biology and Genetics, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, United States
| | - Rachel Green
- Department of Molecular Biology and Genetics, Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, United States
| |
Collapse
|
19
|
Belinky F, Sela I, Rogozin IB, Koonin EV. Crossing fitness valleys via double substitutions within codons. BMC Biol 2019; 17:105. [PMID: 31842858 PMCID: PMC6916188 DOI: 10.1186/s12915-019-0727-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Accepted: 11/20/2019] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Single nucleotide substitutions in protein-coding genes can be divided into synonymous (S), with little fitness effect, and non-synonymous (N) ones that alter amino acids and thus generally have a greater effect. Most of the N substitutions are affected by purifying selection that eliminates them from evolving populations. However, additional mutations of nearby bases potentially could alleviate the deleterious effect of single substitutions, making them subject to positive selection. To elucidate the effects of selection on double substitutions in all codons, it is critical to differentiate selection from mutational biases. RESULTS We addressed the evolutionary regimes of within-codon double substitutions in 37 groups of closely related prokaryotic genomes from diverse phyla by comparing the fractions of double substitutions within codons to those of the equivalent double S substitutions in adjacent codons. Under the assumption that substitutions occur one at a time, all within-codon double substitutions can be represented as "ancestral-intermediate-final" sequences (where "intermediate" refers to the first single substitution and "final" refers to the second substitution) and can be partitioned into four classes: (1) SS, S intermediate-S final; (2) SN, S intermediate-N final; (3) NS, N intermediate-S final; and (4) NN, N intermediate-N final. We found that the selective pressure on the second substitution markedly differs among these classes of double substitutions. Analogous to single S (synonymous) substitutions, SS double substitutions evolve neutrally, whereas analogous to single N (non-synonymous) substitutions, SN double substitutions are subject to purifying selection. In contrast, NS show positive selection on the second step because the original amino acid is recovered. The NN double substitutions are heterogeneous and can be subject to either purifying or positive selection, or evolve neutrally, depending on the amino acid similarity between the final or intermediate and the ancestral states. CONCLUSIONS The results of the present, comprehensive analysis of the evolutionary landscape of within-codon double substitutions reaffirm the largely conservative regime of protein evolution. However, the second step of a double substitution can be subject to positive selection when the first step is deleterious. Such positive selection can result in frequent crossing of valleys on the fitness landscape.
Collapse
Affiliation(s)
- Frida Belinky
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Itamar Sela
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
20
|
Ho AT, Hurst LD. In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons. PLoS Genet 2019; 15:e1008386. [PMID: 31527909 PMCID: PMC6764699 DOI: 10.1371/journal.pgen.1008386] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Revised: 09/27/2019] [Accepted: 08/27/2019] [Indexed: 12/23/2022] Open
Abstract
Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense ‘codons’ downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3’ sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates. In all organisms, gene expression is error-prone. One such error, translational read-through, occurs where the primary stop codon of an expressed gene is missed by the translational machinery. Failure to terminate is likely to be costly, hence genomes are under selection to prevent this from happening. One proposed error-proofing strategy involves in-frame proximal additional stop codons (ASCs) which may act as a ‘fail-safe’ mechanism by providing another opportunity for translation to terminate. There is evidence for ASC enrichment in several eukaryotes. We extend this evidence showing it to be common but not universal in single celled eukaryotes. However, the situation in bacteria is poorly understood, despite bacteria having high read-through rates. Here, we test the fail-safe hypothesis within a broad range of bacteria. To our surprise, we find that not only are ASCs not enriched, but they may even be selected against. This provides evidence for an unusual circumstance where eukaryotes and prokaryotes could solve the same problem the same way but don’t. What are we to make of this? We suggest that if read-through is the problem, ASCs are not necessarily the expected solution. Owing to the absence of nonsense-mediated decay, a process that makes gene truncation in eukaryotes less viable, we propose bacteria may rescue a leaky stop by mutation that creates a new stop upstream. Alternatively, raised read-through rates in some particular conditions in eukaryotes might explain the difference.
Collapse
Affiliation(s)
- Alexander T. Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | - Laurence D. Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
21
|
'Stop' in protein synthesis is modulated with exquisite subtlety by an extended RNA translation signal. Biochem Soc Trans 2018; 46:1615-1625. [PMID: 30420414 DOI: 10.1042/bst20180190] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Revised: 09/30/2018] [Accepted: 10/04/2018] [Indexed: 02/08/2023]
Abstract
Translational stop codons, UAA, UAG, and UGA, form an integral part of the universal genetic code. They are of significant interest today for their underlying fundamental role in terminating protein synthesis, but also for their potential utilisation for programmed alternative translation events. In diverse organisms, UAA has wide usage, but it is puzzling that the high fidelity UAG is selected against and yet UGA, vulnerable to suppression, is widely used, particularly in those archaeal and bacterial genomes with a high GC content. In canonical protein synthesis, stop codons are interpreted by protein release factors that structurally and functionally mimic decoding tRNAs and occupy the decoding site on the ribosome. The release factors make close contact with the decoding complex through multiple interactions. Correct interactions cause conformational changes resulting in new and enhanced contacts with the ribosome, particularly between specific bases in the mRNA and rRNA. The base following the stop codon (fourth or +4 base) may strongly influence decoding efficiency, facilitating alternative non-canonical events like frameshifting or selenocysteine incorporation. The fourth base is drawn into the decoding site with a compacted stop codon in the eukaryotic termination complex. Surprisingly, mRNA sequences upstream and downstream of this core tetranucleotide signal have a significant influence on the strength of the signal. Since nine bases downstream of the stop codon are within the mRNA channel, their interactions with rRNA, and r-proteins may affect efficiency. With this understanding, it is now possible to design stop signals of desired strength for specific applied purposes.
Collapse
|