1
|
Komar AA, Samatova E, Rodnina MV. Translation Rates and Protein Folding. J Mol Biol 2024; 436:168384. [PMID: 38065274 DOI: 10.1016/j.jmb.2023.168384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/01/2023] [Accepted: 12/02/2023] [Indexed: 12/19/2023]
Abstract
The mRNA coding sequence defines not only the amino acid sequence of the protein, but also the speed at which the ribosomes move along the mRNA while making the protein. The non-uniform local kinetics - denoted as translational rhythm - is similar among mRNAs coding for related protein folds. Deviations from this conserved rhythm can result in protein misfolding. In this review we summarize the experimental evidence demonstrating how local translation rates affect cotranslational protein folding, with the focus on the synonymous codons and patches of charged residues in the nascent peptide as best-studied examples. Alterations in nascent protein conformations due to disturbed translational rhythm can persist off the ribosome, as demonstrated by the effects of synonymous codon variants of several disease-related proteins. Charged amino acid patches in nascent chains also modulate translation and cotranslational protein folding, and can abrogate translation when placed at the N-terminus of the nascent peptide. During cotranslational folding, incomplete nascent chains navigate through a unique conformational landscape in which earlier intermediate states become inaccessible as the nascent peptide grows. Precisely tuned local translation rates, as well as interactions with the ribosome, guide the folding pathway towards the native structure, whereas deviations from the natural translation rhythm may favor pathways leading to trapped misfolded states. Deciphering the 'folding code' of the mRNA will contribute to understanding the diseases caused by protein misfolding and to rational protein design.
Collapse
Affiliation(s)
- Anton A Komar
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, 2121 Euclid Avenue, Cleveland, OH 44115, USA; Department of Biochemistry and Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, USA.
| | - Ekaterina Samatova
- Max Planck Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, 37077 Goettingen, Germany
| | - Marina V Rodnina
- Max Planck Department of Physical Biochemistry, Max Planck Institute for Multidisciplinary Sciences, 37077 Goettingen, Germany.
| |
Collapse
|
2
|
Komar AA. A Code Within a Code: How Codons Fine-Tune Protein Folding in the Cell. BIOCHEMISTRY (MOSCOW) 2021; 86:976-991. [PMID: 34488574 DOI: 10.1134/s0006297921080083] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The genetic code sets the correspondence between the sequence of a given nucleotide triplet in an mRNA molecule, called a codon, and the amino acid that is added to the growing polypeptide chain during protein synthesis. With four bases (A, G, U, and C), there are 64 possible triplet codons: 61 sense codons (encoding amino acids) and 3 nonsense codons (so-called, stop codons that define termination of translation). In most organisms, there are 20 common/standard amino acids used in protein synthesis; thus, the genetic code is redundant with most amino acids (with the exception of Met and Trp) are being encoded by more than one (synonymous) codon. Synonymous codons were initially presumed to have entirely equivalent functions, however, the finding that synonymous codons are not present at equal frequencies in mRNA suggested that the specific codon choice might have functional implications beyond coding for amino acid. Observation of nonequivalent use of codons in mRNAs implied a possibility of the existence of auxiliary information in the genetic code. Indeed, it has been found that genetic code contains several layers of such additional information and that synonymous codons are strategically placed within mRNAs to ensure a particular translation kinetics facilitating and fine-tuning co-translational protein folding in the cell via step-wise/sequential structuring of distinct regions of the polypeptide chain emerging from the ribosome at different points in time. This review summarizes key findings in the field that have identified the role of synonymous codons and their usage in protein folding in the cell.
Collapse
Affiliation(s)
- Anton A Komar
- Center for Gene Regulation in Health and Disease and Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH 44115, USA. .,Department of Biochemistry and Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, OH 44106, USA.,Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA.,DAPCEL, Inc., Cleveland, OH 44106, USA
| |
Collapse
|
3
|
Hia F, Takeuchi O. The effects of codon bias and optimality on mRNA and protein regulation. Cell Mol Life Sci 2021; 78:1909-1928. [PMID: 33128106 PMCID: PMC11072601 DOI: 10.1007/s00018-020-03685-7] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Revised: 10/05/2020] [Accepted: 10/12/2020] [Indexed: 12/25/2022]
Abstract
The central dogma of molecular biology entails that genetic information is transferred from nucleic acid to proteins. Notwithstanding retro-transcribing genetic elements, DNA is transcribed to RNA which in turn is translated into proteins. Recent advancements have shown that each stage is regulated to control protein abundances for a variety of essential physiological processes. In this regard, mRNA regulation is essential in fine-tuning or calibrating protein abundances. In this review, we would like to discuss one of several mRNA-intrinsic features of mRNA regulation that has been gaining traction of recent-codon bias and optimality. Specifically, we address the effects of codon bias with regard to codon optimality in several biological processes centred on translation, such as mRNA stability and protein folding among others. Finally, we examine how different organisms or cell types, through this system, are able to coordinate physiological pathways to respond to a variety of stress or growth conditions.
Collapse
Affiliation(s)
- Fabian Hia
- Department of Medical Chemistry, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Osamu Takeuchi
- Department of Medical Chemistry, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
| |
Collapse
|
4
|
|
5
|
Seligmann H, Warthi G. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes. Comput Struct Biotechnol J 2017; 15:412-424. [PMID: 28924459 PMCID: PMC5591391 DOI: 10.1016/j.csbj.2017.08.001] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2017] [Revised: 07/20/2017] [Accepted: 08/05/2017] [Indexed: 12/14/2022] Open
Abstract
A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Collapse
Affiliation(s)
- Hervé Seligmann
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
- Dept. Ecol Evol Behav, Alexander Silberman Inst Life Sci, The Hebrew University of Jerusalem, IL-91904 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille Univ, Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes, UM 63, CNRS UMR7278, IRD 198, INSERM U1095, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, Postal code 13385, France
| |
Collapse
|
6
|
Meyer IM. In silico methods for co-transcriptional RNA secondary structure prediction and for investigating alternative RNA structure expression. Methods 2017; 120:3-16. [PMID: 28433606 DOI: 10.1016/j.ymeth.2017.04.009] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Revised: 03/16/2017] [Accepted: 04/14/2017] [Indexed: 01/26/2023] Open
Abstract
RNA transcripts are the primary products of active genes in any living organism, including many viruses. Their cellular destiny not only depends on primary sequence signals, but can also be determined by RNA structure. Recent experimental evidence shows that many transcripts can be assigned more than a single functional RNA structure throughout their cellular life and that structure formation happens co-transcriptionally, i.e. as the transcript is synthesised in the cell. Moreover, functional RNA structures are not limited to non-coding transcripts, but can also feature in coding transcripts. The picture that now emerges is that RNA structures constitute an additional layer of information that can be encoded in any RNA transcript (and on top of other layers of information such as protein-context) in order to exert a wide range of functional roles. Moreover, different encoded RNA structures can be expressed at different stages of a transcript's life in order to alter the transcript's behaviour depending on its actual cellular context. Similar to the concept of alternative splicing for protein-coding genes, where a single transcript can yield different proteins depending on cellular context, it is thus appropriate to propose the notion of alternative RNA structure expression for any given transcript. This review introduces several computational strategies that my group developed to detect different aspects of RNA structure expression in vivo. Two aspects are of particular interest to us: (1) RNA secondary structure features that emerge during co-transcriptional folding and (2) functional RNA structure features that are expressed at different times of a transcript's life and potentially mutually exclusive.
Collapse
Affiliation(s)
- Irmtraud M Meyer
- Laboratory of Bioinformatics of RNA Structure and Transcriptome Regulation, Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Robert-Rössle-Str. 10, 13125 Berlin-Buch, Germany; Institute of Chemistry and Biochemistry, Free University, Thielallee 63, 14195 Berlin, Germany.
| |
Collapse
|
7
|
Gandin V, Topisirovic I. Co-translational mechanisms of quality control of newly synthesized polypeptides. ACTA ACUST UNITED AC 2014; 2:e28109. [PMID: 26779401 PMCID: PMC4705825 DOI: 10.4161/trla.28109] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Revised: 12/30/2013] [Accepted: 02/04/2014] [Indexed: 01/23/2023]
Abstract
During protein synthesis, nascent polypeptides emerge from ribosomes to fold into functional proteins. Misfolding of newly synthesized polypeptides (NSPs) at this stage leads to their aggregation. These misfolded NSPs must be expediently cleared to circumvent the deleterious effects of protein aggregation on cell physiology. To this end, a sizable portion of NSPs are ubiquitinated and rapidly degraded by the proteasome. This suggests the existence of co-translational mechanisms that play a pivotal role in the quality control of NSPs. It is generally thought that ribosomes play a central role in this process. During mRNA translation, ribosomes sense errors that lead to the accumulation of aberrant polypeptides, and serve as a hub for protein complexes that are required for optimal folding and/or proteasome-dependent degradation of misfolded polypeptides. In this review, we discuss recent findings that shed light on the molecular underpinnings of the co-translational quality control of NSPs.
Collapse
Affiliation(s)
- Valentina Gandin
- Lady Davis Institute for Medical Research; Sir Mortimer B. Davis-Jewish General Hospital; Montréal, QC Canada; Department of Oncology; McGill University; Montréal, QC Canada
| | - Ivan Topisirovic
- Lady Davis Institute for Medical Research; Sir Mortimer B. Davis-Jewish General Hospital; Montréal, QC Canada; Department of Oncology; McGill University; Montréal, QC Canada
| |
Collapse
|
8
|
Bhardwaj A. Investigating the role of site specific synonymous variation in disease association studies. Mitochondrion 2014; 16:83-8. [PMID: 24434286 DOI: 10.1016/j.mito.2013.12.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2013] [Revised: 12/15/2013] [Accepted: 12/24/2013] [Indexed: 02/05/2023]
Abstract
Synonymous codon changes may not always be neutral indicating their significance in disease association studies, which is almost always overlooked. Synonymous substitutions may affect protein-folding rates leading to protein misfolding and aggregation. Genome wide analysis of 2301 mitochondrial genomes is performed to evaluate the significance of synonymous codons in disease association studies. The analysis revealed usage of rare codons at several sites in mitochondrial genes with rare codon usage higher for hydrophobic amino acids. The analysis suggests that variation data in association studies should be analyzed using site-specific codon usage values to infer the potential phenotypic impact of synonymous changes.
Collapse
Affiliation(s)
- Anshu Bhardwaj
- Open Source Drug Discovery Unit, Council of Scientific and Industrial Research (CSIR), Delhi 110001, India.
| |
Collapse
|
9
|
Jha SS, Komar AA. Isolation of ribosome bound nascent polypeptides in vitro to identify translational pause sites along mRNA. J Vis Exp 2012:4026. [PMID: 22806127 DOI: 10.3791/4026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
The rate of translational elongation is non-uniform. mRNA secondary structure, codon usage and mRNA associated proteins may alter ribosome movement on the message(for review see 1). However, it's now widely accepted that synonymous codon usage is the primary cause of non-uniform translational elongation rates(1). Synonymous codons are not used with identical frequency. A bias exists in the use of synonymous codons with some codons used more frequently than others(2). Codon bias is organism as well as tissue specific(2,3). Moreover, frequency of codon usage is directly proportional to the concentrations of cognate tRNAs(4). Thus, a frequently used codon will have higher multitude of corresponding tRNAs, which further implies that a frequent codon will be translated faster than an infrequent one. Thus, regions on mRNA enriched in rare codons (potential pause sites) will as a rule slow down ribosome movement on the message and cause accumulation of nascent peptides of the respective sizes(5-8). These pause sites can have functional impact on the protein expression, mRNA stability and protein folding(for review see 9). Indeed, it was shown that alleviation of such pause sites can alter ribosome movement on mRNA and subsequently may affect the efficiency of co-translational (in vivo) protein folding(1,7,10,11). To understand the process of protein folding in vivo, in the cell, that is ultimately coupled to the process of protein synthesis it is essential to gain comprehensive insights into the impact of codon usage/tRNA content on the movement of ribosomes along mRNA during translational elongation. Here we describe a simple technique that can be used to locate major translation pause sites for a given mRNA translated in various cell-free systems(6-8). This procedure is based on isolation of nascent polypeptides accumulating on ribosomes during in vitro translation of a target mRNA. The rationale is that at low-frequency codons, the increase in the residence time of the ribosomes results in increased amounts of nascent peptides of the corresponding sizes. In vitro transcribed mRNA is used for in vitro translational reactions in the presence of radioactively labeled amino acids to allow the detection of the nascent chains. In order to isolate ribosome bound nascent polypeptide complexes the translation reaction is layered on top of 30% glycerol solution followed by centrifugation. Nascent polypeptides in polysomal pellet are further treated with ribonuclease A and resolved by SDS PAGE. This technique can be potentially used for any protein and allows analysis of ribosome movement along mRNA and the detection of the major pause sites. Additionally, this protocol can be adapted to study factors and conditions that can alter ribosome movement and thus potentially can also alter the function/conformation of the protein.
Collapse
Affiliation(s)
- Sujata S Jha
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, OH, USA
| | | |
Collapse
|
10
|
Chartier M, Gaudreault F, Najmanovich R. Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events. ACTA ACUST UNITED AC 2012; 28:1438-45. [PMID: 22467916 DOI: 10.1093/bioinformatics/bts149] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
MOTIVATION An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most of the studies on rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and how it can be used for large scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database covering over 70% of the known protein sequence universe. Our program Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output. RESULTS Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins which are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families. AVAILABILITY The Sherlocc program and search interface are open access and are available at http://bcb.med.usherbrooke.ca
Collapse
Affiliation(s)
- Matthieu Chartier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, 12e Avenue Nord, Sherbrooke, Québec, Canada
| | | | | |
Collapse
|
11
|
Su JH, Ma XX, He YL, Li JD, Ma XS, Dou YX, Luo XN, Cai XP. Mapping codon usage of the translation initiation region in porcine reproductive and respiratory syndrome virus genome. Virol J 2011; 8:476. [PMID: 22014033 PMCID: PMC3219751 DOI: 10.1186/1743-422x-8-476] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Accepted: 10/21/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Porcine reproductive and respitatory syndrome virus (PRRSV) is a recently emerged pathogen and severely affects swine populations worldwide. The replication of PRRSV is tightly controlled by viral gene expression and the codon usage of translation initiation region within each gene could potentially regulate the translation rate. Therefore, a better understanding of the codon usage pattern of the initiation translation region would shed light on the regulation of PRRSV gene expression. RESULTS In this study, the codon usage in the translation initiation region and in the whole coding sequence was compared in PRRSV ORF1a and ORFs2-7. To investigate the potential role of codon usage in affecting the translation initiation rate, we established a codon usage model for PRRSV translation initiation region. We observed that some non-preferential codons are preferentially used in the translation initiation region in particular ORFs. Although some positions vary with codons, they intend to use codons with negative CUB. Furthermore, our model of codon usage showed that the conserved pattern of CUB is not directly consensus with the conserved sequence, but shaped under the translation selection. CONCLUSIONS The non-variation pattern with negative CUB in the PRRSV translation initiation region scanned by ribosomes is considered the rate-limiting step in the translation process.
Collapse
Affiliation(s)
- Jun-hong Su
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China
| | | | | | | | | | | | | | | |
Collapse
|
12
|
Deane CM, Saunders R. The imprint of codons on protein structure. Biotechnol J 2011; 6:641-9. [DOI: 10.1002/biot.201000329] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2011] [Revised: 03/10/2011] [Accepted: 03/23/2011] [Indexed: 12/23/2022]
|
13
|
Jha S, Komar AA. Birth, life and death of nascent polypeptide chains. Biotechnol J 2011; 6:623-40. [PMID: 21538896 PMCID: PMC3130931 DOI: 10.1002/biot.201000327] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2011] [Revised: 02/26/2011] [Accepted: 03/15/2011] [Indexed: 01/16/2023]
Abstract
The journey of nascent polypeptides from synthesis at the peptidyl transferase center of the ribosome (“birth”) to full function (“maturity”) involves multiple interactions, constraints, modifications and folding events. Each step of this journey impacts the ultimate expression level and functional capacity of the translated protein. It has become clear that the kinetics of protein translation is predominantly modulated by synonymous codon usage along the mRNA, and that this provides an active mechanism for coordinating the synthesis, maturation and folding of nascent polypeptides. Multiple quality control systems ensure that proteins achieve their native, functional form. Unproductive co-translational folding intermediates that arise during protein synthesis may undergo enhanced interaction with components of these systems, such as chaperones, and/or be subjects of co-translational degradation (“death”). This review provides an overview of our current understanding of the complex co-translational events that accompany the synthesis, maturation, folding and degradation of nascent polypeptide chains.
Collapse
Affiliation(s)
- Sujata Jha
- Center for Gene Regulation in Health and Disease, Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH 44115, USA
| | | |
Collapse
|
14
|
Saunders R, Deane CM. Synonymous codon usage influences the local protein structure observed. Nucleic Acids Res 2010; 38:6719-28. [PMID: 20530529 PMCID: PMC2965230 DOI: 10.1093/nar/gkq495] [Citation(s) in RCA: 112] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
Translation of mRNA into protein is a unidirectional information flow process. Analysing the input (mRNA) and output (protein) of translation, we find that local protein structure information is encoded in the mRNA nucleotide sequence. The Coding Sequence and Structure (CSandS) database developed in this work provides a detailed mapping between over 4000 solved protein structures and their mRNA. CSandS facilitates a comprehensive analysis of codon usage over many organisms. In assigning translation speed, we find that relative codon usage is less informative than tRNA concentration. For all speed measures, no evidence was found that domain boundaries are enriched with slow codons. In fact, genes seemingly avoid slow codons around structurally defined domain boundaries. Translation speed, however, does decrease at the transition into secondary structure. Codons are identified that have structural preferences significantly different from the amino acid they encode. However, each organism has its own set of ‘significant codons’. Our results support the premise that codons encode more information than merely amino acids and give insight into the role of translation in protein folding.
Collapse
Affiliation(s)
- Rhodri Saunders
- Department of Statistics, Oxford University, 1 South Parks Road, Oxford OX1 3TG, UK.
| | | |
Collapse
|
15
|
Conformational diseases: looking into the eyes. Brain Res Bull 2010; 81:12-24. [PMID: 19808079 DOI: 10.1016/j.brainresbull.2009.09.015] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2009] [Revised: 09/11/2009] [Accepted: 09/29/2009] [Indexed: 01/09/2023]
Abstract
Conformational diseases, a general term comprising more than 40 disorders are caused by the accumulation of unfolded or misfolded proteins. Improper protein folding (misfolding) as well as accrual of unfolded proteins can lead to the formation of disordered (amorphous) or ordered (amyloid fibril) aggregates. The gradual accumulation of protein aggregates and the acceleration of their formation by stress explain the characteristic late or episodic onset of the diseases. The best studied in this group are neurodegenerative diseases and amyloidosis accompanied by the deposition of a specific aggregation-prone proteins or protein fragments and formation of insoluble fibrils. Amyloidogenic protein accumulation often occurs in the brain tissues, e.g. in Alzheimer's disease with the deposition of amyloid-beta and Tau, in scrapie and bovine spongiform encephalopathy with the accumulation of prion protein, in Parkinson's disease with the deposition of alpha-synuclein. Other examples of amyloid proteins are transthyretin, immunoglobulin light chain, gelsolin, etc. In addition to the brain, the accumulation of unfolded or misfolded proteins leading to pathology takes place in a wide variety of organs and tissues, including different parts of the eye. The best studied ocular conformational diseases are cataract in the lens and retinitis pigmentosa in the retina, but accumulation of misfolded proteins also occurs in other parts of the eye causing various disorders. Furthermore, ocular manifestation of systemic amyloidosis often causes the deposition of amyloidogenic proteins in different ocular tissues. Here we present the data regarding naturally unfolded and misfolded proteins in eye tissues, their structure-function relationships, and molecular mechanisms underlying their involvement in diseases. We also summarize the etiology of ocular conformational diseases and discuss approaches to their treatment.
Collapse
|
16
|
Clustering of codons with rare cognate tRNAs in human genes suggests an extra level of expression regulation. PLoS Genet 2009; 5:e1000548. [PMID: 19578405 PMCID: PMC2697378 DOI: 10.1371/journal.pgen.1000548] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2008] [Accepted: 06/03/2009] [Indexed: 12/31/2022] Open
Abstract
In species with large effective population sizes, highly expressed genes tend to be encoded by codons with highly abundant cognate tRNAs to maximize translation rate. However, there has been little evidence for a similar bias of synonymous codons in highly expressed human genes. Here, we ask instead whether there is evidence for the selection for codons associated with low abundance tRNAs. Rather than averaging the codon usage of complete genes, we scan the genes for windows with deviating codon usage. We show that there is a significant over representation of human genes that contain clusters of codons with low abundance cognate tRNAs. We name these regions, which on average have a 50% reduction in the amount of cognate tRNA available compared to the remainder of the gene, RTS (rare tRNA score) clusters. We observed a significant reduction in the substitution rate between the human RTS clusters and their orthologous chimp sequence, when compared to non-RTS cluster sequences. Overall, the genes with an RTS cluster have higher tissue specificity than the non-RTS cluster genes. Furthermore, these genes are functionally enriched for transcription regulation. As genes that regulate transcription in lower eukaryotes are known to be involved in translation on demand, this suggests that the mechanism of translation level expression regulation also exists within the human genome.
Collapse
|
17
|
Tartaglia GG, Pechmann S, Dobson CM, Vendruscolo M. A relationship between mRNA expression levels and protein solubility in E. coli. J Mol Biol 2009; 388:381-9. [PMID: 19281824 DOI: 10.1016/j.jmb.2009.03.002] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2008] [Revised: 02/26/2009] [Accepted: 03/03/2009] [Indexed: 10/21/2022]
Abstract
Each step in the process of gene expression, from the transcription of DNA into mRNA to the folding and posttranslational modification of proteins, is regulated by complex cellular mechanisms. At the same time, stringent conditions on the physicochemical properties of proteins, and hence on the nature of their amino acids, are imposed by the need to avoid aggregation at the concentrations required for optimal cellular function. A relationship is therefore expected to exist between mRNA expression levels and protein solubility in the cell. By investigating such a relationship, we formulate a method that enables the prediction of the maximal levels of mRNA expression in Escherichia coli with an accuracy of 83% and of the solubility of recombinant human proteins expressed in E. coli with an accuracy of 86%.
Collapse
Affiliation(s)
- Gian Gaetano Tartaglia
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| | | | | | | |
Collapse
|
18
|
Komar AA. A pause for thought along the co-translational folding pathway. Trends Biochem Sci 2008; 34:16-24. [PMID: 18996013 DOI: 10.1016/j.tibs.2008.10.002] [Citation(s) in RCA: 250] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2008] [Revised: 10/09/2008] [Accepted: 10/13/2008] [Indexed: 11/26/2022]
Abstract
A unifying concept that combines the basic features governing self-organization of proteins into complex three-dimensional structures in vitro and in vivo is still lacking. Recent experimental results and theoretical in silico modeling studies provide evidence showing that mRNA might contain an additional layer of information, beyond the amino acid sequence, that fine-tunes in vivo protein folding, which is largely believed to start as a co-translational process. These findings indicate that translation kinetics might direct the co-translational folding pathway and that translational pausing at rare codons might provide a time delay to enable independent and sequential folding of the defined portions of the nascent polypeptide emerging from the ribosome.
Collapse
Affiliation(s)
- Anton A Komar
- Department of Biological, Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH 44115, USA.
| |
Collapse
|
19
|
Abstract
Evaluation of: Kimchi-Sarfaty C, Oh JM, Kim IW et al.: A ‘silent’ polymorphism in the MDR1 gene changes substrate specificity. Science 315, 525–528 (2007) [1] . Individuals carrying silent SNPs in the MDR1 gene encoding P-glycoprotein sometimes reveal altered P-glycoprotein pharmacokinetics. There is no rational explanation for why silent SNPs might have such effects, especially when no change in P-glycoprotein mRNA and protein expression levels has been observed. The purpose of this study was to perform careful ex vivo (in cells) analysis of the effects of the three polymorphisms (C1236T, G2677T C3435T) on P-glycoprotein expression and activity. As a result, it has been shown that silent polymorphisms (in particular, C3435T) in MDR1 can alter P-glycoprotein conformation and protein activity/substrate specificity. This study is of immense importance as it demonstrates for the first time that naturally occurring silent SNPs can lead to the synthesis of protein product with the same amino acid sequence but different structural and functional properties. Thus, silent SNPs should no longer be neglected in determining the likelihood of development of various diseases, and should be taken into account in personalized drug treatment and development programs.
Collapse
Affiliation(s)
- Anton A Komar
- Cleveland State University, Department of Biological, Geological and Environmental Sciences, Cleveland, OH 44115, USA.
| |
Collapse
|
20
|
Mrázek J. Analysis of distribution indicates diverse functions of simple sequence repeats in Mycoplasma genomes. Mol Biol Evol 2006; 23:1370-85. [PMID: 16618962 DOI: 10.1093/molbev/msk023] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Simple sequence repeats (SSRs) composed of extensive tandem iterations of a single nucleotide or a short oligonucleotide are rare in most bacterial genomes, but they are common among Mycoplasma. Some of these repeats act as contingency loci in association with families of surface antigens. By contraction or expansion during replication, these SSRs increase genetic variance of the population and facilitate avoidance of the immune response of the host. Occurrence and distribution of SSRs are analyzed in complete genomes of 11 Mycoplasma and 3 related Mollicutes in order to gain insights into functional and evolutionary diversity of the SSRs in Mycoplasma. The results revealed an unexpected variety of SSRs with respect to their distribution and composition and suggest that it is unlikely that all SSRs function as contingency loci or recombination hot spots. Various types of SSRs are most abundant in Mycoplasma hyopneumoniae, whereas Mycoplasma penetrans, Mycoplasma mobile, and Mycoplasma synoviae do not contain unusually long SSRs. Mycoplasma hyopneumoniae and Mycoplasma pulmonis feature abundant short adenine and thymine runs periodically spaced at 11 and 12 bp, respectively, which likely affect the supercoiling propensities of the DNA molecule. Physiological roles of long adenine and thymine runs in M. hyopneumoniae appear independent of location upstream or downstream of genes, unlike contingency loci that are typically located in protein-coding regions or upstream regulatory regions. Comparisons among 3 M. hyopneumoniae strains suggest that the adenine and thymine runs are rarely involved in genome rearrangements. The results indicate that the SSRs in the Mycoplasma genomes play diverse roles, including modulating gene expression as contingency loci, facilitating genome rearrangements via recombination, affecting protein structure and possibly protein-protein interactions, and contributing to the organization of the DNA molecule in the cell.
Collapse
Affiliation(s)
- Jan Mrázek
- Department of Microbiology and Institute of Bioinformatics, University of Georgia, USA.
| |
Collapse
|
21
|
Makhoul CH, Trifonov EN. Distribution of rare triplets along mRNA and their relation to protein folding. J Biomol Struct Dyn 2002; 20:413-20. [PMID: 12437379 DOI: 10.1080/07391102.2002.10506859] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
It is believed that pausing during mRNA translation plays some role in ensuring proper folding of newly synthesized sections of a protein chain. Such pausing occurs when rare triplets are encountered in the mRNA, as it takes additional time for the corresponding rare species of tRNA to be delivered. To determine whether pause sites are non-randomly distributed along prokaryotic mRNA (cDNA), we have located clusters of rare triplets in cDNA sequences from 21 different bacteria. From the individual profiles of local codon frequencies calculated with various windows, the positions of the clusters of the rarest codons were taken for generation of the combined histograms of positional preferences of the pause sites. The histograms show that in the prokaryotic sequences, the pause sites are located preferentially at the start positions and at about 155 triplets from the starts. To verify the generality of these observations, the data are grouped in six independent sets about 500 sequences each, all revealing the same features. A less prominent maximum is also seen at the triplet position 75. Judging by the amplitude of the peak at 155 triplets, an optimal cluster size is estimated to equal 18 triplets. The distance 155 closely corresponds to the sizes of typical protein folds and to earlier estimated prokaryotic protein sequence segments. This supports the suggestion of a role for translation pausing in the cotranslational folding of protein domains. The profiles of rare codons in mRNA can serve in the detection or prediction of boundaries between protein domains.
Collapse
Affiliation(s)
- Cameel H Makhoul
- Department of Structural Biology, The Weizmann Institute of Science, Rehovot, 76100, Israel
| | | |
Collapse
|
22
|
Komar AA, Lesnik T, Reiss C. Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett 1999; 462:387-91. [PMID: 10622731 DOI: 10.1016/s0014-5793(99)01566-5] [Citation(s) in RCA: 294] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
To investigate the possible influence of the local rates of translation on protein folding, 16 consecutive rare (in Escherichia coli) codons in the chloramphenicol acetyltransferase (CAT) gene have been replaced by frequent ones. Site-directed silent mutagenesis reduced the pauses in translation of CAT in E. coli S30 extract cell-free system and led to the acceleration of the overall rate of CAT protein synthesis. At the same time, the silently mutated protein (with unaltered protein sequence) synthesized in the E. coli S30 extract system was shown to possess 20% lower specific activity. The data suggest that kinetics of protein translation can affect the in vivo protein-folding pathway, leading to increased levels of protein misfolding.
Collapse
Affiliation(s)
- A A Komar
- Centre de Génétique Moléculaire, CNRS, Gif-sur-Yvette, France.
| | | | | |
Collapse
|
23
|
Tsalkova T, Kramer G, Hardesty B. The effect of a hydrophobic N-terminal probe on translational pausing of chloramphenicol acetyl transferase and rhodanese. J Mol Biol 1999; 286:71-81. [PMID: 9931250 DOI: 10.1006/jmbi.1998.2481] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The effect on translational pausing of a hydrophobic probe, coumarin, at the N terminus of nascent peptides was investigated. Two different proteins, bacterial chloramphenicol acetyltransferase and bovine rhodanese, were synthesized by coupled transcription/translation in a cell-free system derived from Escherichia coli. Protein synthesis was initiated with N-formyl-Met-tRNAf or N-acetyl-S-coumarin-Met-tRNAf. Cotranslational incorporation of the coumarin derivative generated nascent polypeptides with a hydrophobic residue at their N termini. The effect of the two N-terminal groups on the size distribution and quantity of the peptides formed by translational pausing was investigated. The N-terminal coumarin caused an accumulation of nascent chloramphenicol acetyltransferase peptides in the mass range of 3.5-4.0 kDa that reflects a delay in translation at this point. No similar effect on rhodanese pause-site peptides was observed. This effect on translational pausing cannot be explained by either mRNA secondary structure or rare codons and tRNA abundance. It is suggested that the effect of N-terminal coumarin on translational pausing is the result of the interaction of the nascent peptide with components of the large ribosomal subunit along the path it follows between the peptidyl transferase center and the exit site on the distal surface.
Collapse
Affiliation(s)
- T Tsalkova
- Department of Chemistry and Biochemistry, University of Texas at Austin, Austin, TX, 78712, USA
| | | | | |
Collapse
|
24
|
Oresic M, Shalloway D. Specific correlations between relative synonymous codon usage and protein secondary structure. J Mol Biol 1998; 281:31-48. [PMID: 9680473 DOI: 10.1006/jmbi.1998.1921] [Citation(s) in RCA: 121] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We found significant species-specific correlations between the use of two synonymous codons and protein secondary structure units by comparing the three-dimensional structures of human and Escherichia coli proteins with their mRNA sequences. The correlations are not explained by codon-context, expression level, GC/AU content, or positional effects. The E. coli correlation is between Asn AAC and the C-terminal regions of beta-sheet segments; it may result from selection for translational accuracy, suggesting the hypothesis that downstream Asn residues are important for beta-sheet formation. The correlation in human proteins is between Asp GAU and the N termini of alpha-helices; it may be important for eukaryote-specific sequential, cotranslational folding. The kingdom-specific correlations may reflect kingdom-specific differences in translational mechanisms. The correlations may help identify residues that are important for secondary structure formation, be useful in secondary structure prediction algorithms, and have implications for recombinant gene expression.
Collapse
Affiliation(s)
- M Oresic
- Section of Biochemistry Molecular and Cell Biology, Cornell University, Ithaca, NY 14853, USA
| | | |
Collapse
|
25
|
Abstract
Tropomyosin (TM) is a component of microfilaments of most eukaryotic cells. In striated muscle, TM helps confer calcium sensitivity to the actin-myosin interaction. TM is a fibrillar, self-associating protein that binds to the extended actin filament system. We hypothesized that these structural features would permit TM to undergo assembly into the cytoskeleton during translation, or cotranslational assembly. Pulse-chase experiments with [35S]methionine and pulse experiments with [3H]puromycin followed by extraction and immunoprecipitation of TM were performed to examine the mechanism of assembly of TM into the cytoskeleton in cultured avian muscle cells. Pulse-chase experiments provide kinetic evidence for cotranslational assembly of TM in skeletal and cardiac muscle. Demonstration of a large majority of completed TM on purified skeletal muscle microfilaments after a short labeling period confirms that these kinetic data are not related to trapping of TM within the actin network of the cytoskeleton. Nascent TM peptides are demonstrated on the cytoskeleton of muscle cells after a short metabolic pulse followed by puromycin treatment to release nascent peptides from ribosomes or after labeling with [3H]puromycin. Nascent chain localization to the cytoskeleton independent of ribosomal attachment further confirms the high degree of cotranslational assembly of this protein. The extent of cotranslational assembly is similar before and after the formation of significant myofibril in myotubes, suggesting that cotranslational assembly of TM is active during contractile apparatus assembly in muscle differentiation. This is the first report where assembly mechanism has been predicted to be cotranslational based upon structural features of a cytoskeletal protein.
Collapse
Affiliation(s)
- T J L'Ecuyer
- Department of Pediatrics, Wayne State University School of Medicine, Children's Hospital of Michigan, Detroit 48201, USA
| | | | | |
Collapse
|
26
|
Adzhubei IA, Adzhubei AA, Neidle S. An Integrated Sequence-Structure Database incorporating matching mRNA sequence, amino acid sequence and protein three-dimensional structure data. Nucleic Acids Res 1998; 26:327-31. [PMID: 9399866 PMCID: PMC147252 DOI: 10.1093/nar/26.1.327] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We have constructed a non-homologous database, termed the Integrated Sequence-Structure Database (ISSD) which comprises the coding sequences of genes, amino acid sequences of the corresponding proteins, their secondary structure and straight phi,psi angles assignments, and polypeptide backbone coordinates. Each protein entry in the database holds the alignment of nucleotide sequence, amino acid sequence and the PDB three-dimensional structure data. The nucleotide and amino acid sequences for each entry are selected on the basis of exact matches of the source organism and cell environment. The current version 1.0 of ISSD is available on the WWW at http://www.protein.bio.msu.su/issd/ and includes 107 non-homologous mammalian proteins, of which 80 are human proteins. The database has been used by us for the analysis of synonymous codon usage patterns in mRNA sequences showing their correlation with the three-dimensional structure features in the encoded proteins. Possible ISSD applications include optimisation of protein expression, improvement of the protein structure prediction accuracy, and analysis of evolutionary aspects of the nucleotide sequence-protein structure relationship.
Collapse
Affiliation(s)
- I A Adzhubei
- CRC Biomolecular Structure Unit, The Institute of Cancer Research, Sutton, Surrey SM2 5NG, UK
| | | | | |
Collapse
|
27
|
Phoenix DA, Korotkov E. Evidence of rare codon clusters within Escherichia coli coding regions. FEMS Microbiol Lett 1997; 155:63-6. [PMID: 9345765 DOI: 10.1111/j.1574-6968.1997.tb12686.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
It is known that there is a high occurrence of rare codons at the start of coding region. Here it is shown that although the remainder of the gene is likely to contain a relatively low number of rare codons, rare and non-rare codons do not form a random sequence. It is apparent that throughout the coding region there is a higher than expected number of rare codon clusters. For example once a rare codon has occurred there is a greater chance than expected of the next six codons containing another rare codon. This non-random distribution implies that rare codons may have an as yet unidentified biological role.
Collapse
Affiliation(s)
- D A Phoenix
- Department of Applied Biology, University of Central Lancashire, Preston, UK.
| | | |
Collapse
|
28
|
Abstract
Globin synthesis in a wheat germ cell-free translation system was performed in the presence of [3H]hemin and [35S]methionine to determine the minimal length of the nascent ribosome-bound globin chain capable of heme binding. Nascent polypeptides of predetermined size were synthesized on ribosomes by translation of truncated mRNA molecules. Analysis with the use of sucrose gradient centrifugation and puromycin reaction revealed that the ribosome-bound N-terminal alpha-globin fragments of 140, 100, and 86 amino acid residues are capable of an efficient heme binding, whereas those of 75, 65, and 34 amino acid residues display a significantly weaker, or just nonspecific, affinity to heme. This indicates that the ribosome-bound nascent chain of 86 amino acid residues has already acquired a spatial structure that allows its interaction with the heme group or that heme attachment promotes the formation of the proper tertiary structure in the ribosome-bound nascent peptide. In any case the cotranslational folding of globin is suggested.
Collapse
Affiliation(s)
- A A Komar
- Department of Molecular Biology, Faculty of Biology, Moscow State University, 119899 Moscow, Russia
| | | | | | | |
Collapse
|
29
|
Adzhubei AA, Adzhubei IA, Krasheninnikov IA, Neidle S. Non-random usage of 'degenerate' codons is related to protein three-dimensional structure. FEBS Lett 1996; 399:78-82. [PMID: 8980124 DOI: 10.1016/s0014-5793(96)01287-2] [Citation(s) in RCA: 74] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
We report an analysis of a novel sequence-structure database of mammalian proteins incorporating nucleotide sequences of the exon regions of their genes together with protein sequence and structural information. We find that synonymous codon families (i.e. coding the same residue) have non-random codon distribution frequencies between protein secondary structure types. Their structural preferences are related to the third, 'silent' nucleotide position in a codon. We also find that some synonymous codons show very different or even opposite structural preferences at the N- or C-termini of structure fragments, relative to those observed for their amino acid residues.
Collapse
Affiliation(s)
- A A Adzhubei
- CRC Biomolecular Structure Unit, The Institute of Cancer Research, Sutton, Surrey, UK
| | | | | | | |
Collapse
|
30
|
Thanaraj TA, Argos P. Protein secondary structural types are differentially coded on messenger RNA. Protein Sci 1996; 5:1973-83. [PMID: 8897597 PMCID: PMC2143259 DOI: 10.1002/pro.5560051003] [Citation(s) in RCA: 126] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Tricodon regions on messenger RNAs corresponding to a set of proteins from Escherichia coli were scrutinized for their translation speed. The fractional frequency values of the individual codons as they occur in mRNAs of highly expressed genes from Escherichia coli were taken as an indicative measure of the translation speed. The tricodons were classified by the sum of the frequency values of the constituent codons. Examination of the conformation of the encoded amino acid residues in the corresponding protein tertiary structures revealed a correlation between codon usage in mRNA and topological features of the encoded proteins. Alpha helices on proteins tend to be preferentially coded by translationally fast mRNA regions while the slow segments often code for beta strands and coil regions. Fast regions correspondingly avoid coding for beta strands and coil regions while the slow regions similarly move away from encoding alpha helices. Structural and mechanistic aspects of the ribosome peptide channel support the relevance of sequence fragment translation and subsequent conformation. A discussion is presented relating the observation to the reported kinetic data on the formation and stabilization of protein secondary structural types during protein folding. The observed absence of such strong positive selection for codons in non-highly expressed genes is compatible with existing theories that mutation pressure may well dominate codon selection in non-highly expressed genes.
Collapse
Affiliation(s)
- T A Thanaraj
- European Molecular Biology Laboratory, Heidelberg, Germany.
| | | |
Collapse
|
31
|
Brunak S, Engelbrecht J. Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level. Proteins 1996; 25:237-52. [PMID: 8811739 DOI: 10.1002/(sici)1097-0134(199606)25:2<237::aid-prot9>3.0.co;2-e] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
Collapse
Affiliation(s)
- S Brunak
- Department of Physical Chemistry, Technical University of Denmark, Lyngby, Denmark
| | | |
Collapse
|
32
|
Komar AA, Jaenicke R. Kinetics of translation of gamma B crystallin and its circularly permutated variant in an in vitro cell-free system: possible relations to codon distribution and protein folding. FEBS Lett 1995; 376:195-8. [PMID: 7498540 DOI: 10.1016/0014-5793(95)01275-0] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Analysis of nascent gamma B-crystallin peptides accumulating during in vitro translation in a rabbit reticulocyte lysate cell-free system was carried out. As a consequence of the irregular distribution of rare codons along the polypeptide chain of gamma B-crystallin, translation of the two-domain protein is a non-uniform process characterized by specific pauses. One of the major delays occurs during the translation of the connecting peptide between the domains. Comparing the kinetics of translation of natural gamma B-crystallin and its circularly permutated variant (with the order of the N- and C-terminal domains exchanged) reveals that the natural N-terminal domain is translated faster than the C-terminal one. Since the N-terminal domain in natural gamma B-crystallin is known to be more stable and to fold faster than the C-terminal one [E.-M. Mayr et al. (1994) J. Mol. Biol. 235, 84-88], the present data suggest that the translation rates are optimized to tune the synthesis and folding of the nascent polypeptide chain. In this connection, the pause in the linker region between the domains provides a delay allowing the correct folding of the N-terminal domain and its subsequent assistance in the stabilization of the C-terminal one.
Collapse
Affiliation(s)
- A A Komar
- Institut für Biophysik und Physikalische Biochemie, Universität Regensburg, Germany
| | | |
Collapse
|
33
|
Komar AA, Kommer A, Krasheninnikov IA, Spirin AS. Cotranslational heme binding to nascent globin chains. FEBS Lett 1993; 326:261-3. [PMID: 8325375 DOI: 10.1016/0014-5793(93)81803-8] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Globin synthesis in cell-free extracts of rabbit reticulocytes was carried out in the presence of 3H-labeled hemin. Sucrose gradient centrifugation analysis revealed [3H]hemin in the polyribosome fraction. The addition of puromycin resulted in the release of both [3H]hemin- and [14C]leucine-labeled polypeptide from the polyribosomes. The data suggest cotranslational folding of the globin molecule on the ribosome and cotranslational heme binding to the nascent globin chain.
Collapse
Affiliation(s)
- A A Komar
- Department of Molecular Biology, Faculty of Biology, Moscow State University, Russian Federation
| | | | | | | |
Collapse
|