1. Cheng H, Yu J, Wong CC. Adenosine-to-Inosine RNA editing in cancer: molecular mechanisms and downstream targets. Protein Cell 2024:pwae039. [PMID: 39126156 DOI: 10.1093/procel/pwae039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/27/2024] [Indexed: 08/12/2024]
Abstract
Adenosine-to-Inosine (A-to-I), one of the most prevalent RNA modifications, has recently garnered significant attention. The A-to-I modification actively contributes to biological and pathological processes by affecting the structure and function of various RNA molecules, including double stranded RNA, transfer RNA, microRNA, and viral RNA. Increasing evidence suggests that A-to-I plays a crucial role in the development of human disease, particularly in cancer, and aberrant A-to-I levels are closely associated with tumorigenesis and progression through regulation of the expression of multiple oncogenes and tumor suppressor genes. Currently, the underlying molecular mechanisms of A-to-I modification in cancer are not comprehensively understood. Here, we review the latest advances regarding the A-to-I editing pathways implicated in cancer, describing their biological functions and their connections to the disease.
Affiliation(s)
- Hao Cheng
- Institute of Digestive Disease and Department of Medicine and Therapeutics, State Key Laboratory of Digestive Disease, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong SAR 518172, China
- Jun Yu
- Institute of Digestive Disease and Department of Medicine and Therapeutics, State Key Laboratory of Digestive Disease, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong SAR 518172, China
- Chi Chun Wong
- Institute of Digestive Disease and Department of Medicine and Therapeutics, State Key Laboratory of Digestive Disease, Li Ka Shing Institute of Health Sciences, The Chinese University of Hong Kong, Hong Kong SAR 518172, China

2. Fages‐Lartaud M, Hundvin K, Hohmann‐Marriott MF. Mechanisms governing codon usage bias and the implications for protein expression in the chloroplast of Chlamydomonas reinhardtii. The Plant Journal 2022; 112:919-945. [PMID: 36071273 PMCID: PMC9828097 DOI: 10.1111/tpj.15970] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 03/08/2022] [Revised: 08/29/2022] [Accepted: 09/01/2022] [Indexed: 05/30/2023]
Abstract
Chloroplasts possess a considerably reduced genome that is decoded via an almost minimal set of tRNAs. These features make an excellent platform for gaining insights into fundamental mechanisms that govern protein expression. Here, we present a comprehensive and revised perspective of the mechanisms that drive codon selection in the chloroplast of Chlamydomonas reinhardtii and the functional consequences for protein expression. In order to extract this information, we applied several codon usage descriptors to genes with different expression levels. We show that highly expressed genes strongly favor translationally optimal codons, while genes with lower functional importance are rather affected by directional mutational bias. We demonstrate that codon optimality can be deduced from codon-anticodon pairing affinity and, for a small number of amino acids (leucine, arginine, serine, and isoleucine), tRNA concentrations. Finally, we review, analyze, and expand on the impact of codon usage on protein yield, secondary structures of mRNA, translation initiation and termination, and amino acid composition of proteins, as well as cotranslational protein folding. The comprehensive analysis of codon choice provides crucial insights into heterologous gene expression in the chloroplast of C. reinhardtii, which may also be applicable to other chloroplast-containing organisms and bacteria.
Affiliation(s)
- Maxime Fages‐Lartaud
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim N‐7491, Norway
- Kristoffer Hundvin
- Department of Biotechnology, Norwegian University of Science and Technology, Trondheim N‐7491, Norway

3. Gillen SL, Waldron JA, Bushell M. Codon optimality in cancer. Oncogene 2021; 40:6309-6320. [PMID: 34584217 PMCID: PMC8585667 DOI: 10.1038/s41388-021-02022-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 07/16/2021] [Revised: 08/24/2021] [Accepted: 09/10/2021] [Indexed: 12/14/2022]
Abstract
A key characteristic of cancer cells is their increased proliferative capacity, which requires elevated levels of protein synthesis. The process of protein synthesis involves the translation of codons within the mRNA coding sequence into a string of amino acids to form a polypeptide chain. As most amino acids are encoded by multiple codons, the nucleotide sequence of a coding region can vary dramatically without altering the polypeptide sequence of the encoded protein. Although mutations that do not alter the final amino acid sequence are often thought of as silent/synonymous, these can still have dramatic effects on protein output. Because each codon has a distinct translation elongation rate and can differentially impact mRNA stability, each codon has a different degree of 'optimality' for protein synthesis. Recent data demonstrate that the codon preference of a transcriptome matches the abundance of tRNAs within the cell and that this supply and demand between tRNAs and mRNAs varies between different cell types. The largest observed distinction is between mRNAs encoding proteins associated with proliferation or differentiation. Nevertheless, precisely how codon optimality and tRNA expression levels regulate cell fate decisions, and their role in malignancy, is not fully understood. This review describes the current mechanistic understanding of codon optimality and its role in malignancy, and discusses the potential to target codon optimality therapeutically in the context of cancer.
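The redundancy of the genetic code described in this abstract can be illustrated with a minimal Python sketch. It builds the standard genetic code table and compares two coding sequences that differ at three nucleotides yet encode the same peptide. The function name `translate` and both sequences (`SEQ_A`, `SEQ_B`) are made up for illustration and are not taken from the review.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), written in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence codon by codon (no stop handling)."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical sequences: same peptide (Met-Leu-Ser), different codons.
SEQ_A = "ATGCTGAGC"
SEQ_B = "ATGTTAAGT"

if __name__ == "__main__":
    print(translate(SEQ_A), translate(SEQ_B))
    print(sum(x != y for x, y in zip(SEQ_A, SEQ_B)), "nucleotide differences")
```

The sketch only demonstrates synonymy; it says nothing about which of the two sequences would be translated faster, which depends on the tRNA pool of the cell in question.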
Affiliation(s)
- Sarah L Gillen
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
- Joseph A Waldron
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK
- Martin Bushell
- Cancer Research UK Beatson Institute, Garscube Estate, Switchback Road, Glasgow, G61 1BD, UK.
- Institute of Cancer Sciences, University of Glasgow, Glasgow, UK, G61 1QH.

4. Santoni D. The impact of codon choice on translation process in Saccharomyces cerevisiae: folding class, protein function and secondary structure. J Theor Biol 2021; 526:110806. [PMID: 34111456 DOI: 10.1016/j.jtbi.2021.110806] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 11/20/2020] [Revised: 05/26/2021] [Accepted: 06/03/2021] [Indexed: 11/28/2022]
Abstract
The genetic code consists of a set of rules used by living organisms to translate genomic information, contained in genes, into proteins; every amino acid is coded by a set of nucleotide triplets, or codons. We refer to codon choice as the choice of a given codon, among the available synonymous ones, to code a given amino acid occurrence. The aim of this work is to shed light on the pivotal role that codon choice plays in regulating the timing of the translation process, through patterns of low and high translation efficiency codons. A translation efficiency value, namely the codon score, was associated with each codon through a formula based on the number of tRNA gene copies able to translate the given codon. By using codon scores, those k-mers of the proteome of Saccharomyces cerevisiae showing low and high average scores for the corresponding codons were computed. The analysis of the distribution of both low and high average score k-mers clearly showed that, in particular for larger k-mer sizes, they occur much more often than expected, strongly suggesting a functional role. Moreover, the analysis highlighted that significant k-mers preferentially occur in some protein folding classes, such as those containing alpha helices, and in some functional classes mainly involved in the transcription process, while codon choice seems to have a very low impact in proteins associated with energy production and metabolism. The relationship between secondary structures and significant k-mers was investigated, revealing that low score k-mers tend to preferentially occur in coil or close to coil regions and almost never in beta sheets, while high score k-mers preferentially occur in alpha helices, avoiding beta sheets, and close to coil regions for high k-mer sizes. Finally, the analysis of the distribution of significant codon patterns along the proteins highlighted a relevant enrichment of low average score k-mers at the 5' end of protein-coding sequences, in the region from the 5th to the 25th amino acid.
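The idea of averaging per-codon scores over k-mers can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not give the codon score formula, so `TOY_SCORES` below holds invented placeholder values rather than scores derived from tRNA gene copy numbers, and the function names are hypothetical.

```python
def split_codons(cds: str) -> list[str]:
    """Split a coding sequence into consecutive codons, dropping any remainder."""
    usable = len(cds) - len(cds) % 3
    return [cds[i:i + 3] for i in range(0, usable, 3)]


def kmer_average_scores(cds: str, codon_score: dict[str, float], k: int) -> list[float]:
    """Mean codon score of every window of k consecutive codons."""
    scores = [codon_score[c] for c in split_codons(cds)]
    return [sum(scores[i:i + k]) / k for i in range(len(scores) - k + 1)]


# Placeholder scores (assumed values, NOT from the paper).
TOY_SCORES = {"GCT": 1.0, "GCC": 0.5, "AAA": 0.25, "AAG": 0.75}

if __name__ == "__main__":
    # Codons: GCT GCC AAA AAG, windows of k = 2 codons.
    print(kmer_average_scores("GCTGCCAAAAAG", TOY_SCORES, k=2))
```

Windows with unusually low or high averages would then be the candidate "significant" k-mers; deciding significance requires a null model of expected occurrences, which this sketch does not attempt.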
Affiliation(s)
- Daniele Santoni
- Institute for System Analysis and Computer Science "Antonio Ruberti", National Research Council of Italy, Via dei Taurini 19, Rome 00185, Italy.

5. Bahiri-Elitzur S, Tuller T. Codon-based indices for modeling gene expression and transcript evolution. Comput Struct Biotechnol J 2021; 19:2646-2663. [PMID: 34025951 PMCID: PMC8122159 DOI: 10.1016/j.csbj.2021.04.042] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Received: 02/14/2021] [Revised: 04/17/2021] [Accepted: 04/18/2021] [Indexed: 11/21/2022]
Abstract
Codon usage bias (CUB) refers to the phenomenon that synonymous codons are used at different frequencies in most genes and organisms. The general assumption is that codon biases reflect a balance between mutational biases and natural selection. Today we understand that codon content is related to, and can affect, all gene expression steps. Starting from the 1980s, codon-based indices have been used for answering different questions in all biomedical fields, including systems biology, agriculture, medicine, and biotechnology. In general, codon usage bias indices weigh each codon or a small set of codons to estimate the fit of a certain coding sequence to a certain phenomenon (e.g., bias in codons, adaptation to the tRNA pool, frequencies of certain codons, transcription elongation speed, etc.) and are usually easy to implement. Today there are dozens of such indices; thus, this paper aims to review and compare the different codon usage bias indices, their applications, and advantages. In addition, we perform an analysis that demonstrates that most indices tend to correlate even though they aim to capture different aspects. Due to the centrality of codon usage bias to different gene expression steps, it is important to keep developing new indices that can capture additional aspects that are not modeled with the current indices.
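As an illustration of how simple such an index can be, the sketch below computes relative synonymous codon usage (RSCU), a classic codon-bias measure: the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. RSCU is chosen here as an example only; the abstract does not single it out, and the function name `rscu` and the test sequence are assumptions of this sketch.

```python
from collections import Counter
from itertools import product

# Standard genetic code (NCBI translation table 1), written in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict[str, float]:
    """RSCU per codon, for amino acids that occur in the sequence."""
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    # Group codons into synonymous families, skipping stop codons.
    families: dict[str, list[str]] = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)

    result: dict[str, float] = {}
    for family in families.values():
        total = sum(counts[c] for c in family)
        if total == 0:
            continue  # amino acid absent: RSCU undefined
        for c in family:
            # observed / (total / family size)
            result[c] = counts[c] * len(family) / total
    return result


if __name__ == "__main__":
    # Hypothetical sequence: four leucine codons, three of them CTG.
    values = rscu("CTGCTGCTGTTA")
    print({codon: v for codon, v in values.items() if v > 0})
```

An RSCU of 1 means a codon is used exactly as often as expected under uniform synonymous usage; values above 1 indicate a preferred codon within its family.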
Affiliation(s)
- Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel Aviv, Israel

6. Srinivasan S, Torres AG, Ribas de Pouplana L. Inosine in Biology and Disease. Genes (Basel) 2021; 12:600. [PMID: 33921764 PMCID: PMC8072771 DOI: 10.3390/genes12040600] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Received: 03/11/2021] [Revised: 04/13/2021] [Accepted: 04/15/2021] [Indexed: 02/06/2023]
Abstract
The nucleoside inosine plays an important role in purine biosynthesis, gene translation, and modulation of the fate of RNAs. The editing of adenosine to inosine is a widespread post-transcriptional modification in transfer RNAs (tRNAs) and messenger RNAs (mRNAs). At the wobble position of tRNA anticodons, inosine profoundly modifies codon recognition, while in mRNA, inosines can modify the sequence of the translated polypeptide or modulate the stability, localization, and splicing of transcripts. Inosine is also found in non-coding and exogenous RNAs, where it plays key structural and functional roles. In addition, molecular inosine is an important secondary metabolite in purine metabolism that also acts as a molecular messenger in cell signaling pathways. Here, we review the functional roles of inosine in biology and their connections to human health.
Affiliation(s)
- Sundaramoorthy Srinivasan
- Institute for Research in Biomedicine, Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain; (S.S.); (A.G.T.)
- Adrian Gabriel Torres
- Institute for Research in Biomedicine, Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain; (S.S.); (A.G.T.)
- Lluís Ribas de Pouplana
- Institute for Research in Biomedicine, Barcelona Institute of Science and Technology, 08028 Barcelona, Catalonia, Spain; (S.S.); (A.G.T.)
- Catalan Institution for Research and Advanced Studies, 08010 Barcelona, Catalonia, Spain

7. Thompson BA, Walters R, Parsons MT, Dumenil T, Drost M, Tiersma Y, Lindor NM, Tavtigian SV, de Wind N, Spurdle AB. Contribution of mRNA Splicing to Mismatch Repair Gene Sequence Variant Interpretation. Front Genet 2020; 11:798. [PMID: 32849802 PMCID: PMC7398121 DOI: 10.3389/fgene.2020.00798] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 02/20/2020] [Accepted: 07/03/2020] [Indexed: 12/25/2022]
Abstract
Functional assays that assess mRNA splicing can be used in interpretation of the clinical significance of sequence variants, including the Lynch syndrome-associated mismatch repair (MMR) genes. The purpose of this study was to investigate the contribution of splicing assay data to the classification of MMR gene sequence variants. We assayed mRNA splicing for 24 sequence variants in MLH1, MSH2, and MSH6, including 12 missense variants that were also assessed using a cell-free in vitro MMR activity (CIMRA) assay. Multifactorial likelihood analysis was conducted for each variant, combining CIMRA outputs and clinical data where available. We collated these results with existing public data to provide a dataset of splicing assay results for a total of 671 MMR gene sequence variants (328 missense/in-frame indel), and published and unpublished repair activity measurements for 154 of these variants. There were 241 variants for which a splicing aberration was detected: 92 complete impact, 33 incomplete impact, and 116 where it was not possible to determine complete versus incomplete splicing impact. Splicing results mostly aided in the interpretation of intronic (72%) and silent (92%) variants and were the least useful for missense substitutions/in-frame indels (10%). MMR protein functional activity assays were more useful in the analysis of these exonic variants but by design they were not able to detect clinically important splicing aberrations identified by parallel mRNA assays. The development of high throughput assays that can quantitatively assess impact on mRNA transcript expression and protein function in parallel will streamline classification of MMR gene sequence variants.
Affiliation(s)
- Bryony A Thompson
- Department of Pathology, The Royal Melbourne Hospital, Melbourne, VIC, Australia; Department of Clinical Pathology, The University of Melbourne, Melbourne, VIC, Australia
- Rhiannon Walters
- Genetics and Computational Biology Department, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- Michael T Parsons
- Genetics and Computational Biology Department, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- Troy Dumenil
- Genetics and Computational Biology Department, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
- Mark Drost
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Yvonne Tiersma
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Noralane M Lindor
- Department of Health Sciences Research, Mayo Clinic, Scottsdale, AZ, United States
- Sean V Tavtigian
- Department of Oncological Sciences, University of Utah School of Medicine, Salt Lake City, UT, United States
- Niels de Wind
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Amanda B Spurdle
- Genetics and Computational Biology Department, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia

8. Ghoneim DH, Zhang X, Brule CE, Mathews DH, Grayhack EJ. Conservation of location of several specific inhibitory codon pairs in the Saccharomyces sensu stricto yeasts reveals translational selection. Nucleic Acids Res 2019; 47:1164-1177. [PMID: 30576464 PMCID: PMC6379720 DOI: 10.1093/nar/gky1262] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 07/30/2018] [Revised: 11/19/2018] [Accepted: 12/06/2018] [Indexed: 12/30/2022]
Abstract
Synonymous codons provide redundancy in the genetic code that influences translation rates in many organisms, in which overall codon use is driven by selection for optimal codons. It is unresolved if or to what extent translational selection drives use of suboptimal codons or codon pairs. In Saccharomyces cerevisiae, 17 specific inhibitory codon pairs, each comprised of adjacent suboptimal codons, inhibit translation efficiency in a manner distinct from their constituent codons, and many are translated slowly in native genes. We show here that selection operates within Saccharomyces sensu stricto yeasts to conserve nine of these codon pairs at defined positions in genes. Conservation of these inhibitory codon pairs is significantly greater than expected, relative to conservation of their constituent codons, with seven pairs more highly conserved than any other synonymous pair. Conservation is strongly correlated with slow translation of the pairs. Conservation of suboptimal codon pairs extends to two related Candida species, fungi that diverged from Saccharomyces ∼270 million years ago, with an enrichment for codons decoded by I•A and U•G wobble in both Candida and Saccharomyces. Thus, conservation of inhibitory codon pairs strongly implies selection for slow translation at particular gene locations, executed by suboptimal codon pairs.
Affiliation(s)
- Dalia H Ghoneim
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA
- Xiaoju Zhang
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA
- Christina E Brule
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA
- David H Mathews
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA
- Elizabeth J Grayhack
- Department of Biochemistry and Biophysics, School of Medicine and Dentistry, University of Rochester, Rochester, NY 14642, USA; Center for RNA Biology, University of Rochester, Rochester, NY 14642, USA

9. Zhang Z, Ye Y, Gong J, Ruan H, Liu CJ, Xiang Y, Cai C, Guo AY, Ling J, Diao L, Weinstein JN, Han L. Global analysis of tRNA and translation factor expression reveals a dynamic landscape of translational regulation in human cancers. Commun Biol 2018; 1:234. [PMID: 30588513 PMCID: PMC6303286 DOI: 10.1038/s42003-018-0239-8] [Citation(s) in RCA: 51] [Impact Index Per Article: 8.5] [Received: 07/19/2018] [Accepted: 11/27/2018] [Indexed: 12/14/2022]
Abstract
The protein translational system, including transfer RNAs (tRNAs) and several categories of enzymes, plays a key role in regulating cell proliferation. Translation dysregulation also contributes to cancer development, though relatively little is known about the changes that occur to the translational system in cancer. Here, we present global analyses of tRNAs and three categories of enzymes involved in translational regulation in ~10,000 cancer patients across 31 cancer types from The Cancer Genome Atlas. By analyzing the expression levels of tRNAs at the gene, codon, and amino acid levels, we identified unequal alterations in tRNA expression, likely due to the uneven distribution of tRNAs decoding different codons. We find that overexpression of tRNAs recognizing codons with a low observed-over-expected ratio may overcome the translational bottleneck in tumorigenesis. We further observed overall overexpression and amplification of tRNA modification enzymes, aminoacyl-tRNA synthetases, and translation factors, which may play synergistic roles with overexpression of tRNAs to activate the translational systems across multiple cancer types.
Affiliation(s)
- Zhao Zhang
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Youqiong Ye
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Jing Gong
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Hang Ruan
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Chun-Jie Liu
- Department of Bioinformatics and Systems Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology Wuhan, 430074 Hubei, People’s Republic of China
- Yu Xiang
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Chunyan Cai
- Department of Internal Medicine, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- An-Yuan Guo
- Department of Bioinformatics and Systems Biology, Hubei Bioinformatics and Molecular Imaging Key Laboratory, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology Wuhan, 430074 Hubei, People’s Republic of China
- Jiqiang Ling
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742 USA
- Lixia Diao
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030 USA
- John N. Weinstein
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030 USA
- Leng Han
- Department of Biochemistry and Molecular Biology, McGovern Medical School at The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
- Center for Precision Health, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA

10. Pellizza L, Smal C, Rodrigo G, Arán M. Codon usage clusters correlation: towards protein solubility prediction in heterologous expression systems in E. coli. Sci Rep 2018; 8:10618. [PMID: 30006617 PMCID: PMC6045634 DOI: 10.1038/s41598-018-29035-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Received: 03/20/2018] [Accepted: 06/21/2018] [Indexed: 12/15/2022]
Abstract
Production of soluble recombinant proteins is crucial to the development of industry and basic research. However, aggregation due to incorrect folding of the nascent polypeptides is still a major bottleneck. Understanding the factors governing protein solubility is important to grasp the underlying mechanisms and improve the design of recombinant proteins. Here we show a quantitative study of the expression and solubility of a set of proteins from Bizionia argentinensis. Through the analysis of different features known to modulate protein production, we defined two parameters based on the %MinMax algorithm to compare codon usage clusters between the host and the target genes. We demonstrate that the absolute difference between all %MinMax frequencies of the host and the target gene is significantly negatively correlated with protein expression levels. Most importantly, a strong positive correlation between solubility and the degree of conservation of codon usage clusters is observed for two independent datasets. Moreover, we show that this correlation is higher in codon usage clusters involved in less compact protein secondary structure regions. Our results provide important tools for protein design and support the notion that codon usage may dictate translation rate and modulate co-translational folding.
Affiliation(s)
- Leonardo Pellizza
- Laboratory of Nuclear Magnetic Resonance, Fundación Instituto Leloir, IIBBA-CONICET, Av. Patricias Argentinas 435, C1405BWE, CABA, Argentina
- Clara Smal
- Laboratory of Nuclear Magnetic Resonance, Fundación Instituto Leloir, IIBBA-CONICET, Av. Patricias Argentinas 435, C1405BWE, CABA, Argentina
- Guido Rodrigo
- Laboratory of Nuclear Magnetic Resonance, Fundación Instituto Leloir, IIBBA-CONICET, Av. Patricias Argentinas 435, C1405BWE, CABA, Argentina
- Martín Arán
- Laboratory of Nuclear Magnetic Resonance, Fundación Instituto Leloir, IIBBA-CONICET, Av. Patricias Argentinas 435, C1405BWE, CABA, Argentina.

11. Franzo G, Segales J, Tucciarone CM, Cecchinato M, Drigo M. The analysis of genome composition and codon bias reveals distinctive patterns between avian and mammalian circoviruses which suggest a potential recombinant origin for Porcine circovirus 3. PLoS One 2018; 13:e0199950. [PMID: 29958294 PMCID: PMC6025852 DOI: 10.1371/journal.pone.0199950] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Received: 04/27/2018] [Accepted: 06/15/2018] [Indexed: 01/30/2023]
Abstract
Members of the genus Circovirus are host-specific viruses, which are totally dependent on cell machinery for their replication. Consequently, a certain mimicry of the host genome features is expected to maximize exploitation of the cellular replicative system and minimize recognition by the innate immune system. In the present study, the analysis of several genome composition and codon bias parameters of circoviruses infecting avian and mammalian species demonstrated the presence of quite distinctive patterns between the two groups. Remarkably, a higher deviation from the expected values based only on mutational patterns was observed for mammalian circoviruses at both the dinucleotide and codon levels. Accordingly, a stronger selective pressure was estimated to shape the genome of mammalian circoviruses, particularly in the Cap encoding gene, compared to avian circoviruses. These differences could be attributed to different physiological and immunological features of the two host classes and suggest a trade-off between a tendency to optimize the capsid protein translation while minimizing the recognition of the genome and the transcript molecules. Interestingly, the recently identified Porcine circovirus 3 (PCV-3) had an intermediate pattern in terms of genome composition and codon bias. In particular, its Rep gene appeared closely related to other mammalian circoviruses (especially bat circoviruses) while the Cap gene more closely resembled avian circoviruses. This evidence, coupled with the high selective forces apparently modelling the PCV-3 Cap gene composition, suggests a potential recombinant origin of this virus, followed or preceded by a host jump.
Affiliation(s)
- Giovanni Franzo
- Department of Animal Medicine, Production and Health (MAPS), University of Padua, Legnaro, Padua, Italy
- Joaquim Segales
- Departament de Sanitat i Anatomia Animals, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
- UAB, Centre de Recerca en Sanitat Animal (CReSA, IRTA- UAB), Campus de la Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
- Claudia Maria Tucciarone
- Department of Animal Medicine, Production and Health (MAPS), University of Padua, Legnaro, Padua, Italy
- Mattia Cecchinato
- Department of Animal Medicine, Production and Health (MAPS), University of Padua, Legnaro, Padua, Italy
- Michele Drigo
- Department of Animal Medicine, Production and Health (MAPS), University of Padua, Legnaro, Padua, Italy

12. Ma XX, Ma P, Chang QY, Li LJ, Zhou XK, Zhang DR, Li MS, Cao X, Ma ZR. The analyses of relationships among nucleotide, synonymous codon and amino acid usages for E2 gene of bovine viral diarrhea virus. Gene 2018; 660:62-67. [DOI: 10.1016/j.gene.2018.03.065] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 12/09/2017] [Revised: 03/08/2018] [Accepted: 03/20/2018] [Indexed: 10/17/2022]

13. Hui CY, Guo Y, Zhang W, Huang XQ. Rapid monitoring of the target protein expression with a fluorescent signal based on a dicistronic construct in Escherichia coli. AMB Express 2018; 8:81. [PMID: 29785487 PMCID: PMC5962521 DOI: 10.1186/s13568-018-0612-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Received: 03/09/2018] [Accepted: 05/12/2018] [Indexed: 01/02/2023]
Abstract
Real-time quantification of recombinant proteins is important in studies on fermentation engineering, cell engineering, etc. Measurement of the expression level of heterologous proteins in bacterial fermentation broth has traditionally relied on time-consuming and labor-intensive procedures, such as polyacrylamide gel electrophoresis, immunoblot analysis, and biological activity assays. In this study, we describe a simple, fast, and highly sensitive assay for detecting heterologous protein production in bacteria either at the overall level (fluorescence spectrophotometry) or at the individual level (fluorescence microscopic image). Based on a dicistronic model, the translation of the target gene in the upstream open reading frame (ORF) was coupled with the synthesis of the mCherry reporter in the downstream ORF in E. coli cells, and this demonstrated a positive correlation between the expression of the target gene and mCherry. Although a time lag exists between the expression of the target protein and the mCherry reporter, the method described here allows facile monitoring of dynamic changes in target protein expression, relying on indirect determination of the fluorescence intensity of mCherry during fermentation in real-time models. Additionally, the performance of a single bacterial cell factory could be checked under the fluorescence microscope field.
|
14
|
Chaney JL, Steele A, Carmichael R, Rodriguez A, Specht AT, Ngo K, Li J, Emrich S, Clark PL. Widespread position-specific conservation of synonymous rare codons within coding sequences. PLoS Comput Biol 2017; 13:e1005531. [PMID: 28475588 PMCID: PMC5438181 DOI: 10.1371/journal.pcbi.1005531] [Citation(s) in RCA: 67] [Impact Index Per Article: 9.6] [Received: 08/03/2016] [Revised: 05/19/2017] [Accepted: 04/21/2017] [Indexed: 02/01/2023]
Abstract
Synonymous rare codons are considered to be sub-optimal for gene expression because they are translated more slowly than common codons. Yet surprisingly, many protein coding sequences include large clusters of synonymous rare codons. Rare codons at the 5’ terminus of coding sequences have been shown to increase translational efficiency. Although a general functional role for synonymous rare codons farther within coding sequences has not yet been established, several recent reports have identified rare-to-common synonymous codon substitutions that impair folding of the encoded protein. Here we test the hypothesis that although the usage frequencies of synonymous codons change from organism to organism, codon rarity will be conserved at specific positions in a set of homologous coding sequences, for example to tune translation rate without altering a protein sequence. Such conservation of rarity–rather than specific codon identity–could coordinate co-translational folding of the encoded protein. We demonstrate that many rare codon cluster positions are indeed conserved within homologous coding sequences across diverse eukaryotic, bacterial, and archaeal species, suggesting they result from positive selection and have a functional role. Most conserved rare codon clusters occur within rather than between conserved protein domains, challenging the view that their primary function is to facilitate co-translational folding after synthesis of an autonomous structural unit. Instead, many conserved rare codon clusters separate smaller protein structural motifs within structural domains. These smaller motifs typically fold faster than an entire domain, on a time scale more consistent with translation rate modulation by synonymous codon usage. 
While proteins with conserved rare codon clusters are structurally and functionally diverse, they are enriched in functions associated with organism growth and development, suggesting an important role for synonymous codon usage in organism physiology. The identification of conserved rare codon clusters advances our understanding of distinct, functional roles for otherwise synonymous codons and enables experimental testing of the impact of synonymous codon usage on the production of functional proteins. Proteins are long linear polymers that must fold into complex three-dimensional shapes in order to carry out their cellular functions. Every protein is synthesized by the ribosome, which decodes each trinucleotide codon in an mRNA coding sequence in order to select the amino acid residue that will occupy each position in the protein sequence. Most amino acids can be encoded by more than one codon, but these synonymous codons are not used with equal frequency. Rare codons are associated with generally slower rates for protein synthesis, and for this reason have traditionally been considered mildly deleterious for efficient protein production. However, because synonymous codon substitutions do not change the sequence of the encoded protein, the majority view is that they merely reflect genomic ‘background noise’. To the contrary, here we show that the positions of many synonymous rare codons are conserved in mRNA sequences that encode structurally similar proteins from a diverse range of organisms. These results suggest that rare codons have a functional role related to the production of functional proteins, potentially to regulate the rate of protein synthesis and the earliest steps of protein folding, while synthesis is still underway.
Affiliation(s)
- Julie L. Chaney
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Aaron Steele
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Rory Carmichael
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Anabel Rodriguez
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Alicia T. Specht
- Department of Applied and Computational Mathematics & Statistics, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Kim Ngo
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Jun Li
- Department of Applied and Computational Mathematics & Statistics, University of Notre Dame, Notre Dame, Indiana, United States of America
| | - Scott Emrich
- Department of Computer Science & Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- * E-mail: (PLC); (SE)
| | - Patricia L. Clark
- Department of Chemistry & Biochemistry, University of Notre Dame, Notre Dame, Indiana, United States of America
- Department of Chemical & Biomolecular Engineering, University of Notre Dame, Notre Dame, Indiana, United States of America
- * E-mail: (PLC); (SE)
|
15
|
When mRNA translation meets decay. Biochem Soc Trans 2017; 45:339-351. [DOI: 10.1042/bst20160243] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Received: 08/13/2016] [Revised: 12/19/2016] [Accepted: 01/11/2017] [Indexed: 12/26/2022]
Abstract
Messenger RNA (mRNA) translation and mRNA degradation are important determinants of protein output, and they are interconnected. Previously, it was thought that translation of an mRNA, as a rule, prevents its degradation. mRNA surveillance mechanisms, which degrade mRNAs as a consequence of their translation, were considered to be exceptions to this rule. Recently, however, it has become clear that many mRNAs are degraded co-translationally, and it has emerged that codon choice, by influencing the rate of ribosome elongation, affects the rate of mRNA decay. In this review, we discuss the links between translation and mRNA stability, with an emphasis on emerging data suggesting that codon optimality may regulate mRNA degradation.
|
16
|
Frequent GU wobble pairings reduce translation efficiency in Plasmodium falciparum. Sci Rep 2017; 7:723. [PMID: 28389662 PMCID: PMC5429705 DOI: 10.1038/s41598-017-00801-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Received: 12/06/2016] [Accepted: 03/13/2017] [Indexed: 11/08/2022]
Abstract
The Plasmodium falciparum genome has 81% A+T content. This nucleotide bias leads to extreme codon usage bias and culminates in frequent insertion of asparagine homorepeats in the proteome. Using recodonized GFP sequences, we show that codons decoded via G:U wobble pairing are suboptimal codons that are negatively associated with protein translation efficiency. Despite this, one third of all codons in the genome are GU wobble codons, suggesting that codon usage in P. falciparum has not been driven to maximize translation efficiency, but may have evolved as a translational regulatory mechanism. In particular, asparagine homorepeats are generally encoded by locally clustered GU wobble AAT codons, and we demonstrate that this GU wobble-rich codon context is the determining factor that causes the reduction in protein level. Moreover, insertion of clustered AAT codons also destabilizes the transcripts. Interestingly, asparagine homorepeat insertion is more frequent in single-exon genes, suggesting that transcripts of these genes may have been programmed for rapid mRNA decay to compensate for the inefficiency of mRNA surveillance regulation on intronless genes. To our knowledge, this is the first study that addresses P. falciparum codon usage in vitro and provides new insights into translational regulation and genome evolution of this parasite.
|
17
|
Goz E, Tuller T. Evidence of a Direct Evolutionary Selection for Strong Folding and Mutational Robustness Within HIV Coding Regions. J Comput Biol 2016; 23:641-50. [PMID: 27347769 DOI: 10.1089/cmb.2016.0052] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Indexed: 11/13/2022]
Abstract
A large number of studies have demonstrated the importance of different HIV RNA structural elements at all stages of the viral life cycle. Nevertheless, the significance of many of these structures is unknown, and plausibly new regions containing RNA structure-mediated regulatory signals remain to be identified. An important characteristic of genomic regions carrying functionally significant secondary structures is their mutational robustness, that is, the extent to which a sequence remains constant, in terms of its underlying secondary structure, despite mutations. Structural robustness to mutations is expected to be important in the case of functional RNA structures in viruses with a high mutation rate; it may prevent fitness loss due to disruption of possibly functional conformations, pointing to the specific significance of the corresponding genomic region. In the current work, we perform a genome-wide computational analysis to detect signals of a direct evolutionary selection for strong folding and RNA structure-based mutational robustness within HIV coding sequences. We provide evidence that specific regions of HIV structural genes undergo an evolutionary selection for strong folding; in addition, we demonstrate that the HIV Rev responsive element seems to undergo a direct evolutionary selection for increased secondary structure robustness to point mutations. We believe that our analysis may enable a better understanding of viral evolutionary dynamics at the RNA structural level and may benefit practical efforts to engineer antiviral vaccines and novel therapeutic approaches.
Affiliation(s)
- Eli Goz
- 1 Department of Biomedical Engineering, Tel-Aviv University, Ramat Aviv, Israel; 2 SynVaccine Ltd., Ramat Hachayal, Tel Aviv, Israel
| | - Tamir Tuller
- 1 Department of Biomedical Engineering, Tel-Aviv University, Ramat Aviv, Israel; 2 SynVaccine Ltd., Ramat Hachayal, Tel Aviv, Israel; 3 Sagol School of Neuroscience, Tel-Aviv University, Ramat Aviv, Israel
|
18
|
Deng W, Babu IR, Su D, Yin S, Begley TJ, Dedon PC. Trm9-Catalyzed tRNA Modifications Regulate Global Protein Expression by Codon-Biased Translation. PLoS Genet 2015; 11:e1005706. [PMID: 26670883 PMCID: PMC4689569 DOI: 10.1371/journal.pgen.1005706] [Citation(s) in RCA: 75] [Impact Index Per Article: 8.3] [Received: 08/06/2015] [Accepted: 11/06/2015] [Indexed: 12/30/2022]
Abstract
Post-transcriptional modifications of transfer RNAs (tRNAs) have long been recognized to play crucial roles in regulating the rate and fidelity of translation. However, the extent to which they determine global protein production remains poorly understood. Here we use quantitative proteomics to show a direct link between wobble uridine 5-methoxycarbonylmethyl (mcm5) and 5-methoxy-carbonyl-methyl-2-thio (mcm5s2) modifications catalyzed by tRNA methyltransferase 9 (Trm9) in tRNAArg(UCU) and tRNAGlu(UUC) and selective translation of proteins from genes enriched with their cognate codons. Controlling for bias in protein expression and alterations in mRNA expression, we find that loss of Trm9 selectively impairs expression of proteins from genes enriched with AGA and GAA codons under both normal and stress conditions. Moreover, we show that AGA and GAA codons occur with high frequency in clusters along the transcripts, which may play a role in modulating translation. Consistent with these results, proteins subject to enhanced ribosome pausing in yeast lacking mcm5U and mcm5s2U are more likely to be down-regulated and contain a larger number of AGA/GAA clusters. Together, these results suggest that Trm9-catalyzed tRNA modifications play a significant role in regulating protein expression within the cell. Here we present evidence for a more complicated role for transfer RNAs (tRNAs) than as mere adapters that link the genetic code in messenger RNA (mRNA) to the amino acid sequence of a protein during translation. tRNAs have long been known to be modified with dozens of different chemical structures other than the 4 canonical ribonucleosides, though the role of these modifications in controlling translation is poorly understood. By quantifying the expression of thousands of proteins in the yeast S. cerevisiae, we identified a mechanistic link between modified ribonucleosides located at the wobble position of two tRNAs, tRNAArg(UCU) and tRNAGlu(UUC), and the translation of proteins derived from genes enriched with codons read by these tRNAs: AGA and GAA. In cells lacking the enzyme that inserts these modifications, tRNA methyltransferase 9 (Trm9), we found a significant reduction in proteins from genes enriched with AGA and GAA codons and with runs of these codons. Also, mRNAs enriched with runs of AGA and GAA codons are subject to stalled translation on ribosomes in yeast lacking mcm5U and mcm5s2U. Together, these results reveal a distinct role for Trm9-catalyzed tRNA modifications in selectively regulating the expression of proteins enriched with AGA and GAA codons.
Affiliation(s)
- Wenjun Deng
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - I. Ramesh Babu
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Dan Su
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Shanye Yin
- Department of Cell Biology, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Thomas J. Begley
- SUNY College of Nanoscale Science and Engineering, Albany, New York, United States of America
- RNA Institute and Cancer Research Center, University at Albany, State University of New York, Albany, New York, United States of America
| | - Peter C. Dedon
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- Singapore-MIT Alliance for Research and Technology, Singapore
|
19
|
Goz E, Tuller T. Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes. BMC Genomics 2015; 16 Suppl 10:S4. [PMID: 26449467 PMCID: PMC4602183 DOI: 10.1186/1471-2164-16-s10-s4] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Indexed: 11/10/2022]
Abstract
BACKGROUND It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. RESULTS Here we performed the first large-scale comprehensive genomic analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, we recognized 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype), each an aggregate of positions with significantly conserved folding energy signals (related to partially overlapping local genomic regions). In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. CONCLUSIONS The fact that many of the positions with significant folding-related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti-Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutation rates.
|
20
|
Daniel E, Onwukwe GU, Wierenga RK, Quaggin SE, Vainio SJ, Krause M. ATGme: Open-source web application for rare codon identification and custom DNA sequence optimization. BMC Bioinformatics 2015; 16:303. [PMID: 26391121 PMCID: PMC4578782 DOI: 10.1186/s12859-015-0743-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Received: 03/31/2015] [Accepted: 09/16/2015] [Indexed: 02/06/2023]
Abstract
Background Codon usage plays a crucial role when recombinant proteins are expressed in different organisms. This is especially the case if the codon usage frequency of the organism of origin and the target host organism differ significantly, for example when a human gene is expressed in E. coli. Therefore, to enable or enhance efficient gene expression it is of great importance to identify rare codons in any given DNA sequence and subsequently mutate these to codons which are more frequently used in the expression host. Results We describe an open-source web-based application, ATGme, which can in a first step identify rare and highly rare codons from most organisms, and secondly gives the user the possibility to optimize the sequence. Conclusions This application provides a simple user-friendly interface utilizing three optimization strategies: 1. one-click optimization, 2. bulk optimization (by codon-type), 3. individualized custom (codon-by-codon) optimization. ATGme is an open-source application which is freely available at: http://atgme.org
Affiliation(s)
- Edward Daniel
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Goodluck U Onwukwe
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Rik K Wierenga
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Susan E Quaggin
- Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
| | - Seppo J Vainio
- Biocenter Oulu, Laboratory of Developmental Biology, InfoTech Oulu, Center for Cell Matrix Research, Faculty of Biochemistry and Molecular Medicine, University of Oulu, Aapistie 5A, FIN-90220, Oulu, Finland.
| | - Mirja Krause
- Biocenter Oulu, Laboratory of Developmental Biology, InfoTech Oulu, Center for Cell Matrix Research, Faculty of Biochemistry and Molecular Medicine, University of Oulu, Aapistie 5A, FIN-90220, Oulu, Finland.
|
21
|
The importance of codon–anticodon interactions in translation elongation. Biochimie 2015; 114:72-9. [DOI: 10.1016/j.biochi.2015.04.013] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Received: 11/04/2014] [Accepted: 04/16/2015] [Indexed: 11/16/2022]
|
22
|
Pizzo L, Iriarte A, Alvarez-Valin F, Marín M. Conservation of CFTR codon frequency through primates suggests synonymous mutations could have a functional effect. Mutat Res 2015; 775:19-25. [PMID: 25839760 DOI: 10.1016/j.mrfmmm.2015.03.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Received: 09/07/2014] [Revised: 02/05/2015] [Accepted: 03/09/2015] [Indexed: 06/04/2023]
Abstract
Cystic fibrosis is an inherited chronic disease that affects the lungs and digestive system, with a prevalence of about 1:3000 people. Cystic fibrosis is caused by mutations in the CFTR gene, which lead to defective function of the chloride channel, the cystic fibrosis transmembrane conductance regulator (CFTR). To date, more than 1900 mutations have been reported in CFTR. However, for an important proportion of them, their functional effects and relation to disease are still not understood. Many of these mutations are silent (or synonymous), namely they do not alter the encoded amino acid. These synonymous mutations have been considered neutral to protein function. However, more recent evidence in bacterial and human proteins has put this concept under revision. With the aim of understanding possible functional effects of synonymous mutations in CFTR, we analyzed human and primate CFTR codon usage and divergence patterns. We report the presence of regions enriched in rare and frequent codons. This spatial pattern of codon preferences is conserved in primates, but this cannot be explained by sequence conservation alone. In sum, the results presented herein suggest a functional implication of these regions of the gene that may be maintained by purifying selection acting to preserve a particular codon usage pattern along the sequence. Overall, these results support the idea that several synonymous mutations in CFTR may have functional importance and could be involved in the disease.
Affiliation(s)
- Lucilla Pizzo
- Sección Bioquímica-Biología Molecular, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay
| | - Andrés Iriarte
- Dpto. de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, Montevideo, Uruguay; Dpto. de Genómica, Instituto de Investigaciones Biológicas Clemente Estable, IIBCE, Montevideo, Uruguay; Dpto. de Bioquímica y Genómica Microbianas, Instituto de Investigaciones Biológicas Clemente Estable, IIBCE, Montevideo, Uruguay
| | - Fernando Alvarez-Valin
- Sección Biomatemática, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Mónica Marín
- Sección Bioquímica-Biología Molecular, Facultad de Ciencias, Universidad de la República, Iguá 4225, 11400 Montevideo, Uruguay.
|
23
|
Abstract
Owing to the degeneracy of the genetic code, a protein sequence can be encoded by many different synonymous mRNA coding sequences. Synonymous codon usage was once thought to be functionally neutral, but evidence now indicates it is shaped by evolutionary selection and affects other aspects of protein biogenesis beyond specifying the amino acid sequence of the protein. Synonymous rare codons, once thought to have only negative impacts on the speed and accuracy of translation, are now known to play an important role in diverse functions, including regulation of cotranslational folding, covalent modifications, secretion, and expression level. Mutations altering synonymous codon usage are linked to human diseases. However, much remains unknown about the molecular mechanisms connecting synonymous codon usage to efficient protein biogenesis and proper cell physiology. Here we review recent literature on the functional effects of codon usage, including bioinformatics approaches aimed at identifying general roles for synonymous codon usage.
|
24
|
Ou KC, Wang CY, Liu KT, Chen YL, Chen YC, Lai MD, Yen MC. Optimization protein productivity of human interleukin-2 through codon usage, gene copy number and intracellular tRNA concentration in CHO cells. Biochem Biophys Res Commun 2014; 454:347-52. [DOI: 10.1016/j.bbrc.2014.10.097] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Received: 10/11/2014] [Accepted: 10/17/2014] [Indexed: 11/17/2022]
|
25
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022]
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
|
26
|
The effects of the context-dependent codon usage bias on the structure of the nsp1α of porcine reproductive and respiratory syndrome virus. Biomed Res Int 2014; 2014:765320. [PMID: 25162025 PMCID: PMC4137607 DOI: 10.1155/2014/765320] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 03/18/2014] [Revised: 06/05/2014] [Accepted: 06/19/2014] [Indexed: 11/18/2022]
Abstract
The crystal structure of the porcine reproductive and respiratory syndrome virus (PRRSV) leader protease nsp1α is available, making it possible to analyze the roles of pig tRNA abundance and codon usage of the nsp1α gene in the formation of this protease. The effects of pig tRNA abundance, synonymous codon usage, and the context-dependent codon bias (CDCB) of the nsp1α gene on shaping the specific folding units (α-helix, β-strand, and coil) in nsp1α were analyzed based on the structural information about this protease from the Protein Data Bank (PDB: 3IFU) and the nsp1α gene of 191 PRRSV strains. By mapping the overall tRNA abundance along the nsp1α gene, we found that there is no link between the fluctuation of the overall tRNA abundance and the specific folding units in nsp1α, and that regions of low ribosome translation speed attributable to tRNA abundance exist in the nsp1α gene. A strong correlation between the usage of some synonymous codons and the specific folding units in nsp1α was found, and CDCB exists in the specific folding units of nsp1α. These findings provide insight into the roles of synonymous codon usage and CDCB in the formation of the PRRSV nsp1α structure.
|
27
|
Abstract
Whole-genome and functional analyses suggest a wealth of secondary or auxiliary genetic information (AGI) within the redundancy component of the genetic code. Although there are multiple aspects of biased codon use, we focus on two types of auxiliary information: codon-specific translational pauses that can be used by particular proteins toward their unique folding, and biased codon patterns shared by groups of functionally related mRNAs with coordinate regulation. AGI is important to genetics in general and to human disease; here, we consider influences of its three major components: biased codon use itself, variations in the tRNAome, and anticodon modifications that distinguish synonymous decoding. AGI is plastic and can be used by different species to different extents, with tissue specificity and in stress responses. Because AGI is species-specific, it is important to consider codon-sensitive experiments when using heterologous systems; for this we focus on the tRNA anticodon loop modification enzyme, CDKAL1, and its link to type 2 diabetes. Newly uncovered tRNAome variability among humans suggests roles in penetrance and as a genetic modifier and disease modifier. Development of experimental and bioinformatics methods is needed to uncover additional means of auxiliary genetic information.
Affiliation(s)
- Richard J. Maraia
- Intramural Research Program on Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
- Corresponding author
| | - James R. Iben
- Intramural Research Program on Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland 20892, USA
|
28
|
Ponce de Leon M, de Miranda AB, Alvarez-Valin F, Carels N. The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins. Bioinform Biol Insights 2014; 8:93-108. [PMID: 24899802 PMCID: PMC4039185 DOI: 10.4137/bbi.s13161] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 09/08/2013] [Revised: 11/24/2013] [Accepted: 11/24/2013] [Indexed: 01/02/2023]
Abstract
For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. 
Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
Affiliation(s)
- Miguel Ponce de Leon
- Sección Biomatemática, Facultad de Ciencias, Universidad de la República, Iguá, Montevideo, Uruguay
- Antonio Basilio de Miranda
- Fundação Oswaldo Cruz (FIOCRUZ), Instituto Oswaldo Cruz (IOC), Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, RJ, Brazil
- Fernando Alvarez-Valin
- Sección Biomatemática, Facultad de Ciencias, Universidad de la República, Iguá, Montevideo, Uruguay
- Nicolas Carels
- Fundação Oswaldo Cruz (FIOCRUZ), Instituto Oswaldo Cruz (IOC), Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, RJ, Brazil
|
29
|
Gandin V, Topisirovic I. Co-translational mechanisms of quality control of newly synthesized polypeptides. Translation (Austin) 2014; 2:e28109. [PMID: 26779401 PMCID: PMC4705825 DOI: 10.4161/trla.28109] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 08/15/2013] [Revised: 12/30/2013] [Accepted: 02/04/2014] [Indexed: 01/23/2023]
Abstract
During protein synthesis, nascent polypeptides emerge from ribosomes to fold into functional proteins. Misfolding of newly synthesized polypeptides (NSPs) at this stage leads to their aggregation. These misfolded NSPs must be expediently cleared to circumvent the deleterious effects of protein aggregation on cell physiology. To this end, a sizable portion of NSPs are ubiquitinated and rapidly degraded by the proteasome. This suggests the existence of co-translational mechanisms that play a pivotal role in the quality control of NSPs. It is generally thought that ribosomes play a central role in this process. During mRNA translation, ribosomes sense errors that lead to the accumulation of aberrant polypeptides, and serve as a hub for protein complexes that are required for optimal folding and/or proteasome-dependent degradation of misfolded polypeptides. In this review, we discuss recent findings that shed light on the molecular underpinnings of the co-translational quality control of NSPs.
Affiliation(s)
- Valentina Gandin
- Lady Davis Institute for Medical Research; Sir Mortimer B. Davis-Jewish General Hospital; Montréal, QC Canada; Department of Oncology; McGill University; Montréal, QC Canada
- Ivan Topisirovic
- Lady Davis Institute for Medical Research; Sir Mortimer B. Davis-Jewish General Hospital; Montréal, QC Canada; Department of Oncology; McGill University; Montréal, QC Canada
|
30
|
Duong-Ly KC, Gabelli SB. Explanatory chapter: troubleshooting recombinant protein expression: general. Methods Enzymol 2014; 541:209-29. [PMID: 24674074 DOI: 10.1016/b978-0-12-420119-4.00017-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 12/19/2022]
Abstract
One of the most daunting problems for biochemists is the expression of recombinant proteins. Often, the host organism differs from the organism from which the gene coding for the protein of interest was derived. This article provides guidelines to determine whether or not protein expression is a problem, describes possible reasons for low protein expression, and covers several possible solutions. A protocol for measuring protein expression during E. coli cell growth and after induction is given. The reader should note that low protein expression is a complex problem that often stems from a variety of factors. Combinations of the solutions presented in this article may be required to solve a problem of protein expression. A brief overview of host cell expression systems is given, but the article primarily focuses on expression in E. coli as this is the most commonly used host organism. Some of the methods discussed here, however, may be applied to other expression systems.
Affiliation(s)
- Krisna C Duong-Ly
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Sandra B Gabelli
- Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD, USA
|
31
|
Zhou JH, Zhang J, Sun DJ, Ma Q, Chen HT, Ma LN, Ding YZ, Liu YS. The distribution of synonymous codon choice in the translation initiation region of dengue virus. PLoS One 2013; 8:e77239. [PMID: 24204777 PMCID: PMC3808402 DOI: 10.1371/journal.pone.0077239] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Received: 04/24/2013] [Accepted: 08/30/2013] [Indexed: 11/18/2022] Open
Abstract
Dengue is the most common arthropod-borne viral (arboviral) illness in humans. The genetic features concerning the codon usage of dengue virus (DENV) were analyzed by the relative synonymous codon usage, the effective number of codons and the codon adaptation index. The evolutionary distance between DENV and its natural hosts (Homo sapiens, Pan troglodytes, Aedes albopictus and Aedes aegypti) was estimated by a novel formula. Finally, the synonymous codon usage preference for the translation initiation region of this virus was also analyzed. The results indicate that the general trends in the usage of the 59 synonymous codons are similar across the four genotypes of DENV, and this pattern has no link with the geographic distribution of the virus. The effect of the codon usage patterns of Aedes albopictus and Aedes aegypti on the formation of codon usage of DENV is stronger than that of the two primates. Turning to the codon usage preference of the translation initiation region of this virus, some codons pairing to tRNAs with low copy numbers in the two primates have a stronger tendency to occur in the translation initiation region than in the open reading frame of DENV. Although DENV, like other RNA viruses, has a high mutation rate that helps it adapt to its hosts, regulatory features of synonymous codon usage have been 'branded' on the translation initiation region of this virus in order to hijack the translational mechanisms of the hosts.
Affiliation(s)
- Jian-hua Zhou
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Jie Zhang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Dong-jie Sun
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Qi Ma
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Hao-tai Chen
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Li-na Ma
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Yao-zhong Ding
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
- Yong-sheng Liu
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, Gansu, P.R. China
|
32
|
Zhou JH, Zhang J, Sun DJ, Ma Q, Ma B, Pejsak Z, Chen HT, Ma LN, Ding YZ, Liu YS. Potential roles of synonymous codon usage and tRNA concentration in hosts on the two initiation regions of foot-and-mouth disease virus RNA. Virus Res 2013; 176:298-302. [DOI: 10.1016/j.virusres.2013.06.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Received: 03/05/2013] [Revised: 06/10/2013] [Accepted: 06/14/2013] [Indexed: 12/01/2022]
|
33
|
Zhou JH, You YN, Chen HT, Zhang J, Ma LN, Ding YZ, Pejsak Z, Liu YS. The effects of the synonymous codon usage and tRNA abundance on protein folding of the 3C protease of foot-and-mouth disease virus. Infect Genet Evol 2013; 16:270-4. [PMID: 23499709 DOI: 10.1016/j.meegid.2013.02.017] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Received: 11/28/2012] [Revised: 02/18/2013] [Accepted: 02/22/2013] [Indexed: 11/22/2022]
Abstract
The 3C protease of foot-and-mouth disease virus (FMDV) has a conserved amino acid sequence and is responsible for most cleavages in the viral polyprotein. The effects of the synonymous codon usage of the FMDV 3C gene and the tRNA abundance of the hosts on shaping different folding units (α-helix, β-strand and coil) in the 3C protease were analyzed based on the structural information of the FMDV 3C protease from the Protein Data Bank (PDB: 2BHG) and 210 3C genes covering all serotypes of FMDV. A strong correlation was found between the usage of some codons and specific folding units in the FMDV 3C protease. As for the effect of translation speed, as determined by tRNA abundance, on the formation of folding units, codon positions with low-abundance tRNAs are scattered across the 3C gene, and tRNA abundance fluctuates markedly at the transition boundaries from the β-strand to the α-helix and to the coil. However, no clusters of codon positions with low-abundance tRNAs were found at these boundaries, suggesting that translation speed is likely adjusted by single codon positions with low tRNA abundance rather than by clusters. These observations provide insight into the role of synonymous codon usage in the formation of the FMDV 3C protease and into the effect of host tRNA abundance on this process.
Affiliation(s)
- Jian-hua Zhou
- State Key Laboratory of Veterinary Etiological Biology, National Foot-and-Mouth Disease Reference Laboratory, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046 Gansu, PR China
|
34
|
The analysis of codon bias of foot-and-mouth disease virus and the adaptation of this virus to the hosts. Infect Genet Evol 2013; 14:105-10. [DOI: 10.1016/j.meegid.2012.09.020] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Received: 05/23/2012] [Revised: 09/23/2012] [Accepted: 09/25/2012] [Indexed: 11/24/2022]
|
35
|
van der Kuyl AC, Berkhout B. The biased nucleotide composition of the HIV genome: a constant factor in a highly variable virus. Retrovirology 2012; 9:92. [PMID: 23131071 PMCID: PMC3511177 DOI: 10.1186/1742-4690-9-92] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Received: 07/17/2012] [Accepted: 10/14/2012] [Indexed: 01/09/2023] Open
Abstract
Viruses often deviate from their hosts in the nucleotide composition of their genomes. The RNA genome of the lentivirus family of retroviruses, including human immunodeficiency virus (HIV), contains e.g. an above average percentage of adenine (A) nucleotides, while being extremely poor in cytosine (C). Such a deviant base composition has implications for the amino acids that are encoded by the open reading frames (ORFs), both in the requirement of specific tRNA species and in the preference for amino acids encoded by e.g. A-rich codons. Nucleotide composition does obviously affect the secondary and tertiary structure of the RNA genome and its biological functions, but it does also influence phylogenetic analysis of viral genome sequences, and possibly the activity of the integrated DNA provirus. Over time, the nucleotide composition of the HIV-1 genome is exceptionally conserved, varying by less than 1% per base position per isolate within either group M, N, or O during 1983–2009. This extreme stability of the nucleotide composition may possibly be achieved by negative selection, perhaps conserving semi-stable RNA secondary structure as reverse transcription would be significantly affected for a less A-rich genome where secondary structures are expected to be more stable and thus more difficult to unfold. This review will discuss all aspects of the lentiviral genome composition, both of the RNA and of its derived double-stranded DNA genome, with a focus on HIV-1, the nucleotide composition over time, the effects of artificially humanized codons as well as contributions of immune system pressure on HIV nucleotide bias.
Affiliation(s)
- Antoinette C van der Kuyl
- Laboratory of Experimental Virology, Department of Medical Microbiology, Center for Infection and Immunity Amsterdam, Academic Medical Center of the University of Amsterdam, Meibergdreef 15, Amsterdam, AZ 1105, The Netherlands
|
36
|
Novoa EM, Ribas de Pouplana L. Speeding with control: codon usage, tRNAs, and ribosomes. Trends Genet 2012; 28:574-81. [PMID: 22921354 DOI: 10.1016/j.tig.2012.07.006] [Citation(s) in RCA: 218] [Impact Index Per Article: 18.2] [Received: 05/28/2012] [Revised: 07/19/2012] [Accepted: 07/20/2012] [Indexed: 11/26/2022]
Abstract
Codon usage and tRNA abundance are critical parameters for gene synthesis. However, the forces determining codon usage bias within genomes and between organisms, as well as the functional roles of biased codon compositions, remain poorly understood. Similarly, the composition and dynamics of mature tRNA populations in cells in terms of isoacceptor abundances, and the prevalence and function of base modifications are not well understood. As we begin to decipher some of the rules that govern codon usage and tRNA abundances, it is becoming clear that these parameters are a way to not only increase gene expression, but also regulate the speed of ribosomal translation, the efficiency of protein folding, and the coordinated expression of functionally related gene families. Here, we discuss the importance of codon-anticodon interactions in translation regulation and highlight the contribution of non-random codon distributions and post-transcriptional base modifications to this regulation.
Affiliation(s)
- Eva Maria Novoa
- Institute for Research in Biomedicine (IRB), c/Baldiri Reixac 15-21 08028, Barcelona, Catalonia, Spain
|
37
|
Chartier M, Gaudreault F, Najmanovich R. Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events. Bioinformatics 2012; 28:1438-45. [PMID: 22467916 DOI: 10.1093/bioinformatics/bts149] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Indexed: 01/19/2023]
Abstract
MOTIVATION: An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most of the studies on rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and how it can be used for large-scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database covering over 70% of the known protein sequence universe. Our program, Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output. RESULTS: Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins which are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families. AVAILABILITY: The Sherlocc program and search interface are open access and are available at http://bcb.med.usherbrooke.ca
Affiliation(s)
- Matthieu Chartier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, 12e Avenue Nord, Sherbrooke, Québec, Canada
|
38
|
Wang J, Zhang W, Yi Z, Wang S, Li Z. Identification of a thrombin cleavage site and a short form of ADAMTS-18. Biochem Biophys Res Commun 2012; 419:692-7. [PMID: 22386991 PMCID: PMC3313623 DOI: 10.1016/j.bbrc.2012.02.081] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Received: 02/07/2012] [Accepted: 02/14/2012] [Indexed: 12/20/2022]
Abstract
We previously reported that a C-terminal fragment of ADAMTS-18 induces platelet fragmentation through ROS release. We have shown that thrombin cleaves ADAMTS-18 and that a short form of ADAMTS-18 is present in an in vitro translational assay. However, the exact thrombin cleavage site and whether a short form of ADAMTS-18 is present in vivo are not clear. In this study, we first identified the thrombin cleavage site as lying between Arg775 and Ser776 by thrombin cleavage of an ADAMTS-18 peptide followed by a mass spectrum assay. We then showed that a short form of ADAMTS-18 is present in brain, kidney, lung, and testicle from C57BL/6 mouse embryos. Since an alternative form of ADAMTS-18 could be a mechanism to regulate its activity, we then investigated the mechanism involved in the generation of the ADAMTS-18 short form. However, neither protease inhibitors nor mutations in the catalytic domain of ADAMTS-18 had any significant effect on the generation of the ADAMTS-18 short form. Thus, our data demonstrate a thrombin cleavage site and confirm that a short form of ADAMTS-18 is present in vivo.
Affiliation(s)
- Jianhui Wang
- Department of Medicine, NYU Cancer Institute, New York University School of Medicine, 550 First Avenue, New York, NY 10016
- Wei Zhang
- Department of Medicine, NYU Cancer Institute, New York University School of Medicine, 550 First Avenue, New York, NY 10016
- Zanhua Yi
- Department of Medicine, NYU Cancer Institute, New York University School of Medicine, 550 First Avenue, New York, NY 10016
- Shiyang Wang
- Department of Medicine, NYU Cancer Institute, New York University School of Medicine, 550 First Avenue, New York, NY 10016
- Zongdong Li
- Department of Medicine, NYU Cancer Institute, New York University School of Medicine, 550 First Avenue, New York, NY 10016
|
39
|
Genes adopt non-optimal codon usage to generate cell cycle-dependent oscillations in protein levels. Mol Syst Biol 2012; 8:572. [PMID: 22373820 PMCID: PMC3293633 DOI: 10.1038/msb.2012.3] [Citation(s) in RCA: 95] [Impact Index Per Article: 7.9] [Received: 11/28/2011] [Accepted: 01/11/2012] [Indexed: 11/17/2022] Open
Abstract
Most cell cycle-regulated genes adopt non-optimal codon usage, namely, their translation involves wobbly matching codons. Here, the authors show that tRNA expression is cyclic and that codon usage, therefore, can give rise to cell-cycle regulation of proteins.
Most cell cycle-regulated genes adopt non-optimal codon usage. Non-optimal codon usage can give rise to cell-cycle dynamics at the protein level. The high expression of transfer RNAs (tRNAs) observed in G2 phase enables cell cycle-regulated genes to adopt non-optimal codon usage, and conversely the lower expression of tRNAs at the end of G1 phase is associated with optimal codon usage. The protein levels of aminoacyl-tRNA synthetases oscillate, peaking in G2/M phase, consistent with the observed cyclic expression of tRNAs.
The cell cycle is a temporal program that regulates DNA synthesis and cell division. When we compared the codon usage of cell cycle-regulated genes with that of other genes, we discovered that there is a significant preference for non-optimal codons. Moreover, genes encoding proteins that cycle at the protein level exhibit non-optimal codon preferences. Remarkably, cell cycle-regulated genes expressed in different phases display different codon preferences. Here, we show empirically that transfer RNA (tRNA) expression is indeed highest in the G2 phase of the cell cycle, consistent with the non-optimal codon usage of genes expressed at this time, and lowest toward the end of G1, reflecting the optimal codon usage of G1 genes. Accordingly, protein levels of human glycyl-, threonyl-, and glutamyl-prolyl tRNA synthetases were found to oscillate, peaking in G2/M phase. In light of our findings, we propose that non-optimal (wobbly) matching codons influence protein synthesis during the cell cycle. We describe a new mathematical model that shows how codon usage can give rise to cell-cycle regulation. In summary, our data indicate that cells exploit wobbling to generate cell cycle-dependent dynamics of proteins.
|
40
|
Rao Y, Wu G, Wang Z, Chai X, Nie Q, Zhang X. Mutation bias is the driving force of codon usage in the Gallus gallus genome. DNA Res 2011; 18:499-512. [PMID: 22039174 PMCID: PMC3223081 DOI: 10.1093/dnares/dsr035] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Indexed: 11/25/2022] Open
Abstract
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which departs significantly from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNAs (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with G + C content and recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from that in other multicellular organisms, as this trend has only been found in unicellular organisms. Our data showed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.
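For readers unfamiliar with the RSCU measure named in this abstract, the following is a minimal sketch of the commonly used definition: for each amino acid, the observed count of a codon divided by the count expected if all of its synonymous codons were used equally. It is not the authors' pipeline. The function name `rscu`, the toy input sequence, and the choice to skip amino acids that are absent from the input are assumptions made for illustration only.

```python
from collections import Counter, defaultdict
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def rscu(cds):
    """Return {codon: RSCU} for a coding sequence read in frame from its first base.

    Hypothetical helper: amino acids with a single codon (Met, Trp), stop codons,
    and amino acids that never occur in the input are left out of the result.
    """
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    # Group codons into synonymous families, one family per amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families[aa].append(codon)

    result = {}
    for codons in families.values():
        total = sum(counts[c] for c in codons)
        if len(codons) < 2 or total == 0:
            continue
        expected = total / len(codons)  # count per codon under equal usage
        for c in codons:
            result[c] = counts[c] / expected
    return result


values = rscu("GCTGCTGCTGCC")  # toy input: four alanine codons
print(values["GCT"], values["GCC"], values["GCA"])
```

A value above 1 means the codon is used more often than expected under equal usage of its synonyms, and a value below 1 means it is used less often. Real analyses typically pool counts over many genes before computing the ratio.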
Affiliation(s)
- Yousheng Rao
- Department of Biological Technology, Jiangxi Educational Institute, Nanchang, China
|
41
|
Waldman YY, Tuller T, Keinan A, Ruppin E. Selection for translation efficiency on synonymous polymorphisms in recent human evolution. Genome Biol Evol 2011; 3:749-61. [PMID: 21803767 PMCID: PMC3163469 DOI: 10.1093/gbe/evr076] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Indexed: 01/25/2023] Open
Abstract
Synonymous mutations are considered to be "silent" as they do not affect protein sequence. However, different silent codons have different translation efficiency (TE), which raises the question to what extent such mutations are really neutral. We perform the first genome-wide study of natural selection operating on TE in recent human evolution, surveying 13,798 synonymous single nucleotide polymorphisms (SNPs) in 1,198 unrelated individuals from 11 populations. We find evidence for both negative and positive selection on TE, as measured based on differentiation in allele frequencies between populations. Notably, the likelihood of an SNP to be targeted by positive or negative selection is correlated with the magnitude of its effect on the TE of the corresponding protein. Furthermore, negative selection acting against changes in TE is more marked in highly expressed genes, highly interacting proteins, complex members, and regulatory genes. It is also more common in functional regions and in the initial segments of highly expressed genes. Positive selection targeting sites with a large effect on TE is stronger in lowly interacting proteins and in regulatory genes. Similarly, essential genes are enriched for negative TE selection while underrepresented for positive TE selection. Taken together, these results point to the significant role of TE as a selective force operating in humans and hence underscore the importance of considering silent SNPs in interpreting associations with complex human diseases. Testifying to this potential, we describe two synonymous SNPs that may have clinical implications in phenylketonuria and in Best's macular dystrophy due to TE differences between alleles.
Affiliation(s)
- Yedael Y Waldman
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
|
42
|
Angov E. Codon usage: nature's roadmap to expression and folding of proteins. Biotechnol J 2011; 6:650-9. [PMID: 21567958 PMCID: PMC3166658 DOI: 10.1002/biot.201000332] [Citation(s) in RCA: 153] [Impact Index Per Article: 11.8] [Received: 01/11/2011] [Revised: 04/11/2011] [Accepted: 04/13/2011] [Indexed: 02/06/2023]
Abstract
Biomedical and biotechnological research relies on processes leading to the successful expression and production of key biological products. High-quality proteins are required for many purposes, including protein structural and functional studies. Protein expression is the culmination of multistep processes involving regulation at the level of transcription, mRNA turnover, protein translation, and post-translational modifications leading to the formation of a stable product. Although significant strides have been achieved over the past decade, advances toward integrating genomic and proteomic information are essential, and until such time, many target genes and their products may not be fully realized. Thus, the focus of this review is to provide some experimental support and a brief overview of how codon usage bias has evolved relative to regulating gene expression levels.
Affiliation(s)
- Evelina Angov
- Division of Malaria Vaccine Development, Walter Reed Army Institute of Research, Silver Spring, MD 20910, USA
|
43
|
Iben JR, Epstein JA, Bayfield MA, Bruinsma MW, Hasson S, Bacikova D, Ahmad D, Rockwell D, Kittler ELW, Zapp ML, Maraia RJ. Comparative whole genome sequencing reveals phenotypic tRNA gene duplication in spontaneous Schizosaccharomyces pombe La mutants. Nucleic Acids Res 2011; 39:4728-42. [PMID: 21317186 PMCID: PMC3113579 DOI: 10.1093/nar/gkr066] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Indexed: 12/17/2022] Open
Abstract
We used a genetic screen based on tRNA-mediated suppression (TMS) in a Schizosaccharomyces pombe La protein (Sla1p) mutant. Suppressor pre-tRNASerUCA-C47:6U with a debilitating substitution in its variable arm fails to produce tRNA in a sla1-rrm mutant deficient for RNA chaperone-like activity. The parent strain and spontaneous mutant were analyzed using Solexa sequencing. One synonymous single-nucleotide polymorphism (SNP), unrelated to the phenotype, was identified. Further sequence analyses found a duplication of the tRNASerUCA-C47:6U gene, which was shown to cause the phenotype. Ninety percent of 28 isolated mutants contain duplicated tRNASerUCA-C47:6U genes. The tRNA gene duplication led to a disproportionately large increase in tRNASerUCA-C47:6U levels in sla1-rrm but not sla1-null cells, consistent with non-specific low-affinity interactions contributing to the RNA chaperone-like activity of La, similar to other RNA chaperones. Our analysis also identified 24 SNPs between ours and S. pombe 972h- strain yFS101 that was recently sequenced using Solexa. By including mitochondrial (mt) DNA in our analysis, overall coverage increased from 52% to 96%. mtDNA from our strain and yFS101 shared 14 mtSNPs relative to a ‘reference’ mtDNA, providing the first identification of these S. pombe mtDNA discrepancies. Thus, strain-specific and spontaneous phenotypic mutations can be mapped in S. pombe by Solexa sequencing.
Affiliation(s)
- James R Iben
- Intramural Research Program on Genomics of Differentiation, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, University of Massachusetts Medical School, Worcester, MA, USA
|
44
|
Letzring DP, Dean KM, Grayhack EJ. Control of translation efficiency in yeast by codon-anticodon interactions. RNA 2010; 16:2516-28. [PMID: 20971810 PMCID: PMC2995412 DOI: 10.1261/rna.2411710] [Citation(s) in RCA: 110] [Impact Index Per Article: 7.9] [Received: 08/12/2010] [Accepted: 09/24/2010] [Indexed: 05/17/2023]
Abstract
The choice of synonymous codons used to encode a polypeptide contributes to substantial differences in translation efficiency between genes. However, both the magnitude and the mechanisms of codon-mediated effects are unknown, as neither the effects of individual codons nor the parameters that modulate codon-mediated regulation are understood, particularly in eukaryotes. To explore this problem in Saccharomyces cerevisiae, we performed the first systematic analysis of codon effects on expression. We find that the arginine codon CGA is strongly inhibitory, resulting in progressively and sharply reduced expression with increased CGA codon dosage. CGA-mediated inhibition of expression is primarily due to wobble decoding of CGA, since it is nearly completely suppressed by coexpression of an exact match anticodon-mutated tRNA(Arg(UCG)), and is associated with generation of a smaller RNA fragment, likely due to endonucleolytic cleavage at a stalled ribosome. Moreover, CGA codon pairs are more effective inhibitors of expression than individual CGA codons. These results directly implicate decoding by the ribosome and interactions at neighboring sites within the ribosome as mediators of codon-specific translation efficiency.
Affiliation(s)
- Daniel P Letzring
- Department of Biochemistry and Biophysics, University of Rochester Medical School, Rochester, New York 14642, USA
|
45
|
Supek F, Škunca N, Repar J, Vlahoviček K, Šmuc T. Translational selection is ubiquitous in prokaryotes. PLoS Genet 2010; 6:e1001004. [PMID: 20585573 PMCID: PMC2891978 DOI: 10.1371/journal.pgen.1001004] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Received: 12/11/2009] [Accepted: 05/26/2010] [Indexed: 11/29/2022] Open
Abstract
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome (between 5% and 33%, depending on genome size) while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl-tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.
Synonymous codons are not equally common in genomes. The main causes of unequal codon usage are varying nucleotide substitution patterns, as manifested in the wide range of genomic nucleotide compositions. However, since the first E. coli and yeast genes were sequenced, it became evident that there was also a bias towards codons that can be translated to protein faster and more accurately. This bias was stronger in highly expressed genes, and its driving force was termed translational selection. Researchers sought for effects of translational selection in microbial genomes as they became available, employing a flurry of mathematical approaches which sometimes led to contradictory conclusions. We introduce a sensitive and accurate machine learning-based methodology and find that highly expressed genes have a recognizable codon usage pattern in almost every bacterial and archaeal genome analyzed, even after accounting for large differences in background nucleotide composition. We also show that the gene functional category has a great bearing on whether that gene is subject to translational selection. Since presence of codon optimizations can be used as a purely sequence-derived proxy for expression levels, we can delineate “adaptomes” by relating predicted gene activity to organisms' phenotypes, which we demonstrate on genomes of temperature-resistant Bacteria and Archaea.
Affiliation(s)
- Fran Supek
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Nives Škunca
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Jelena Repar
- Division of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
- Kristian Vlahoviček
- Division of Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
- Department of Informatics, University of Oslo, Oslo, Norway
- Tomislav Šmuc
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
|
46
|
Zhou T, Gu W, Wilke CO. Detecting positive and purifying selection at synonymous sites in yeast and worm. Mol Biol Evol 2010; 27:1912-22. [PMID: 20231333 DOI: 10.1093/molbev/msq077] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.7] [Indexed: 12/12/2022] Open
Abstract
We present a new computational method to identify positive and purifying selection at synonymous sites in yeast and worm. We define synonymous substitutions that change codons from preferred to unpreferred or vice versa as nonconservative synonymous substitutions and all other substitutions as conservative. Using a maximum-likelihood framework, we then test whether conservative and nonconservative synonymous substitutions occur at equal rates. Our approach replaces the standard rate of synonymous substitutions per synonymous site, dS, with two new rates, the conservative synonymous substitution rate (dS(C)) and the nonconservative synonymous substitution rate (dS(N)). Based on the ratio dS(N)/dS(C), we find that 0.05% of all yeast genes and none of worm genes show evidence of positive selection at synonymous sites (dS(N)/dS(C) > 1). On the other hand, 9.44% of all yeast genes and 5.12% of all worm genes show evidence of significant purifying selection on synonymous sites (dS(N)/dS(C) < 1). We also find that dS(N) correlates strongly with gene expression level, whereas the correlation between expression level and dS(C) is very weak. Thus, dS(N) captures most of the signal of selection for translational accuracy and speed, whereas dS(C) is not strongly influenced by this selection pressure. We suggest that the ratio dN/dS(C) may be more appropriate than the ratio dN/dS to identify positive or purifying selection on amino acids.
Affiliation(s)
- Tong Zhou
- Center for Computational Biology and Bioinformatics, University of Texas at Austin, TX, USA
|
47
|
Increased incidence of rare codon clusters at 5' and 3' gene termini: implications for function. BMC Genomics 2010; 11:118. [PMID: 20167116 PMCID: PMC2833160 DOI: 10.1186/1471-2164-11-118] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.1] [Received: 10/02/2009] [Accepted: 02/18/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: The process of translation can be affected by the use of rare versus common codons within the mRNA transcript. RESULTS: Here, we show that rare codons are enriched at the 5' and 3' termini of genes from E. coli and other prokaryotes. Genes predicted to be secreted show significant enrichment in 5' rare codon clusters, but not 3' rare codon clusters. Surprisingly, no correlation between 5' mRNA structure and rare codon usage was observed. CONCLUSIONS: Potential functional roles for the enrichment of rare codons at terminal positions are explored.
|