1
|
Arai M. Unified understanding of folding and binding mechanisms of globular and intrinsically disordered proteins. Biophys Rev 2018; 10:163-181. [PMID: 29307002 PMCID: PMC5899706 DOI: 10.1007/s12551-017-0346-7] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 11/13/2017] [Indexed: 12/18/2022] Open
Abstract
Extensive experimental and theoretical studies have advanced our understanding of the mechanisms of folding and binding of globular proteins, and coupled folding and binding of intrinsically disordered proteins (IDPs). The forces responsible for conformational changes and binding are common in both proteins; however, these mechanisms have been separately discussed. Here, we attempt to integrate the mechanisms of coupled folding and binding of IDPs, folding of small and multi-subdomain proteins, folding of multimeric proteins, and ligand binding of globular proteins in terms of conformational selection and induced-fit mechanisms as well as the nucleation–condensation mechanism that is intermediate between them. Accumulating evidence has shown that both the rate of conformational change and apparent rate of binding between interacting elements can determine reaction mechanisms. Coupled folding and binding of IDPs occurs mainly by induced-fit because of the slow folding in the free form, while ligand binding of globular proteins occurs mainly by conformational selection because of rapid conformational change. Protein folding can be regarded as the binding of intramolecular segments accompanied by secondary structure formation. Multi-subdomain proteins fold mainly by the induced-fit (hydrophobic collapse) mechanism, as the connection of interacting segments enhances the binding (compaction) rate. Fewer hydrophobic residues in small proteins reduce the intramolecular binding rate, resulting in the nucleation–condensation mechanism. Thus, the folding and binding of globular proteins and IDPs obey the same general principle, suggesting that the coarse-grained, statistical mechanical model of protein folding is promising for a unified theoretical description of all mechanisms.
Collapse
Affiliation(s)
- Munehito Arai
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro, Tokyo, 153-8902, Japan.
| |
Collapse
|
2
|
Villada JC, Brustolini OJB, Batista da Silveira W. Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design. DNA Res 2017; 24:419-434. [PMID: 28449100 PMCID: PMC5737324 DOI: 10.1093/dnares/dsx014] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Revised: 03/22/2017] [Accepted: 03/23/2017] [Indexed: 01/21/2023] Open
Abstract
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation.
Collapse
Affiliation(s)
- Juan C. Villada
- Department of Microbiology, Universidade Federal de Viçosa, Viçosa 36570-900, Brazil
| | | | | |
Collapse
|
3
|
Abstract
Based on the Shannon's information communication theory, information amount of the entire length of a polymeric macromolecule can be calculated in bits through adding the entropies of each building block. Proteins, DNA and RNA are such macromolecules. When only the building blocks' variation is considered as the source of entropy, there is seemingly lower information in case of the protein if this approach is applied directly on a protein of specific size and the coding sequence size of the mRNA corresponding to the particular length of the protein. This decrease in the information amount seems contradictory but this apparent conflict is resolved by considering the conformational variations in proteins as a new variable in the calculation and balancing the approximated entropy of the coding part of the mRNA and the protein. Probabilities can change therefore we also assigned hypothetical probabilities to the conformational states, which represent the uneven distribution as the time spent in one conformation, providing the probability of the presence in either or one of the possible conformations. Results that are obtained by using hypothetical probabilities are in line with the experimental values of variations in the conformational-state of protein populations. This equalization approach has further biological relevance that it compensates for the degeneracy in the codon usage during protein translation and it leads to the conclusion that the alphabet size for the protein is rather optimal for the proper protein functioning within the thermodynamic milieu of the cell. The findings were also discussed in relation to the codon bias and have implications in relation to the codon evolution concept. Eventually, this work brings the fields of protein structural studies and molecular protein translation processes together with a novel approach.
Collapse
Affiliation(s)
- Y Adiguzel
- Biophysics Department, School of Medicine, Istanbul Kemerburgaz University, Istanbul, Turkey.
| |
Collapse
|
4
|
Sharma AK, Bukau B, O'Brien EP. Physical Origins of Codon Positions That Strongly Influence Cotranslational Folding: A Framework for Controlling Nascent-Protein Folding. J Am Chem Soc 2016; 138:1180-95. [PMID: 26716464 DOI: 10.1021/jacs.5b08145] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
An emerging paradigm in the field of in vivo protein biophysics is that nascent-protein behavior is a type of nonequilibrium phenomenon, where translation-elongation kinetics can be more important in determining nascent-protein behavior than the thermodynamic properties of the protein. Synonymous codon substitutions, which change the translation rate at select codon positions along a transcript, have been shown to alter cotranslational protein folding, suggesting that evolution may have shaped synonymous codon usage in the genomes of organisms in part to increase the amount of folded and functional nascent protein. Here, we develop a Monte Carlo-master-equation method that allows for the control of nascent-chain folding during translation through the rational design of mRNA sequences to guide the cotranslational folding process. We test this framework using coarse-grained molecular dynamics simulations and find it provides optimal mRNA sequences to control the simulated, cotranslational folding of a protein in a user-prescribed manner. With this approach we discover that some codon positions in a transcript can have a much greater impact on nascent-protein folding than others because they tend to be positions where the nascent chain populates states that are far from equilibrium, as well as being dependent on a complex ratio of time scales. As a consequence, different cotranslational profiles of the same protein can have different critical codon positions and different numbers of synonymous mRNA sequences that encode for them. These findings explain that there is a fundamental connection between the nonequilibrium nature of cotranslational processes, nascent-protein behavior, and synonymous codon usage.
Collapse
Affiliation(s)
- Ajeet K Sharma
- Department of Chemistry, Pennsylvania State University , University Park, Pennsylvania 16802, United States
| | - Bernd Bukau
- Center for Molecular Biology of the University of Heidelberg (ZMBH), DKFZ-ZMBH Alliance, Im Neuenheimer Feld 282, Heidelberg D-69120, Germany
| | - Edward P O'Brien
- Department of Chemistry, Pennsylvania State University , University Park, Pennsylvania 16802, United States
| |
Collapse
|
5
|
Daniel E, Onwukwe GU, Wierenga RK, Quaggin SE, Vainio SJ, Krause M. ATGme: Open-source web application for rare codon identification and custom DNA sequence optimization. BMC Bioinformatics 2015; 16:303. [PMID: 26391121 PMCID: PMC4578782 DOI: 10.1186/s12859-015-0743-5] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 09/16/2015] [Indexed: 02/06/2023] Open
Abstract
Background Codon usage plays a crucial role when recombinant proteins are expressed in different organisms. This is especially the case if the codon usage frequency of the organism of origin and the target host organism differ significantly, for example when a human gene is expressed in E. coli. Therefore, to enable or enhance efficient gene expression it is of great importance to identify rare codons in any given DNA sequence and subsequently mutate these to codons which are more frequently used in the expression host. Results We describe an open-source web-based application, ATGme, which can in a first step identify rare and highly rare codons from most organisms, and secondly gives the user the possibility to optimize the sequence. Conclusions This application provides a simple user-friendly interface utilizing three optimization strategies: 1. one-click optimization, 2. bulk optimization (by codon-type), 3. individualized custom (codon-by-codon) optimization. ATGme is an open-source application which is freely available at: http://atgme.org
Collapse
Affiliation(s)
- Edward Daniel
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Goodluck U Onwukwe
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Rik K Wierenga
- Biocenter Oulu, Faculty of Biochemistry and Molecular Medicine, Structural Biochemistry, University of Oulu, Oulu, Finland.
| | - Susan E Quaggin
- Feinberg School of Medicine, Northwestern University, Chicago, IL, 60611, USA.
| | - Seppo J Vainio
- Biocenter Oulu, Laboratory of Developmental Biology, InfoTech Oulu, Center for Cell Matrix Research, Faculty of Biochemistry and Molecular Medicine, University of Oulu, Aapistie 5A, FIN-90220, Oulu, Finland.
| | - Mirja Krause
- Biocenter Oulu, Laboratory of Developmental Biology, InfoTech Oulu, Center for Cell Matrix Research, Faculty of Biochemistry and Molecular Medicine, University of Oulu, Aapistie 5A, FIN-90220, Oulu, Finland.
| |
Collapse
|
6
|
Exploring codon context bias for synthetic gene design of a thermostable invertase in Escherichia coli. Enzyme Microb Technol 2015; 75-76:57-63. [DOI: 10.1016/j.enzmictec.2015.04.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2015] [Revised: 03/31/2015] [Accepted: 04/22/2015] [Indexed: 02/07/2023]
|
7
|
Decoding mechanisms by which silent codon changes influence protein biogenesis and function. Int J Biochem Cell Biol 2015; 64:58-74. [PMID: 25817479 DOI: 10.1016/j.biocel.2015.03.011] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Revised: 03/02/2015] [Accepted: 03/14/2015] [Indexed: 02/07/2023]
Abstract
SCOPE Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. PURPOSE This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. PHYSIOLOGICAL AND MEDICAL RELEVANCE Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies.
Collapse
|
8
|
KDM5C mutational screening among males with intellectual disability suggestive of X-Linked inheritance and review of the literature. Eur J Med Genet 2014; 57:138-44. [PMID: 24583395 DOI: 10.1016/j.ejmg.2014.02.011] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Accepted: 02/17/2014] [Indexed: 01/18/2023]
Abstract
An increasing number of neurodevelopmental diseases have been associated with disruption of chromatin remodeling in eukaryotes. Lysine(K)-specific demethylase 5C (KDM5C) is a versatile epigenetic regulator that removes di- and tri-methyl groups of lysine 4 on histone H3 (H3K4) from transcriptional targets and is essential for neuronal survival and dendritic growth. Mutations in KDM5C gene, located at Xp11.22, have been reported as an important cause of both syndromic and non-syndromic X-linked intellectual disability (XLID) in males. The aim of this study was to evaluate the prevalence and spectrum of KDM5C mutations among Brazilian patients with XLID. To access the impact of KDM5C variants on XLID, a cohort of 143 males with a family history of intellectual disability (ID) suggestive of X-linked inheritance were enrolled. Common genetic causes of XLID were previously excluded and the entire coding and flanking intronic sequences of KDM5C gene were screened by direct Sanger sequencing. Seven nucleotide changes were observed: one pathogenic mutation (c.2172C>A, p.Cys724*), one novel variant with unknown value (c.633G>C, p.Arg211Arg) and five apparently benign sequence changes. In silico analysis of the variants revealed a putative creation of an Exonic Splicing Enhancer sequence by the silent c.633G>C mutation, which co-segregates with the ID phenotype. Our results point out to a KDM5C pathogenic mutational frequency of 0.7% among males with probable XLID. This is the first KDM5C screening among ID males from a country in Latin America and provides new clues about the significance of KDM5C mutations for genetic counseling.
Collapse
|
9
|
Biro JC. Coding nucleic acids are chaperons for protein folding: a novel theory of protein folding. Gene 2012; 515:249-57. [PMID: 23266645 DOI: 10.1016/j.gene.2012.12.048] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2012] [Revised: 12/04/2012] [Accepted: 12/06/2012] [Indexed: 11/29/2022]
Abstract
The arguments for nucleic acid chaperons are reviewed and three new lines of evidence are added. (1) It was found that amino acids encoded by codons in short nucleic acid loops frequently form turns and helices in the corresponding protein structures. (2) The amino acids encoded by partially complementary (1st and 3rd nucleotides) codons are more frequently co-located in the encoded proteins than expected by chance. (3) There are significant correlations between thermodynamic changes (ddG) caused by codon mutations in nucleic acids and the thermodynamic changes caused by the corresponding amino acid mutations in the encoded proteins. We conclude that the concept of the Proteomic Code and nucleic acid chaperons seems correct from the bioinformatics point of view, and we expect to see direct biochemical experiments and evidence in the near future.
Collapse
Affiliation(s)
- Jan C Biro
- Karolinska Institute, Stockholm, Sweden.
| |
Collapse
|
10
|
Silent substitutions predictably alter translation elongation rates and protein folding efficiencies. J Mol Biol 2012; 422:328-35. [PMID: 22705285 DOI: 10.1016/j.jmb.2012.06.010] [Citation(s) in RCA: 166] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2012] [Revised: 05/31/2012] [Accepted: 06/02/2012] [Indexed: 11/23/2022]
Abstract
Genetic code redundancy allows most amino acids to be encoded by multiple codons that are non-randomly distributed along coding sequences. An accepted theory explaining the biological significance of such non-uniform codon selection is that codons are translated at different speeds. Thus, varying codon placement along a message may confer variable rates of polypeptide emergence from the ribosome, which may influence the capacity to fold toward the native state. Previous studies report conflicting results regarding whether certain codons correlate with particular structural or folding properties of the encoded protein. This is partly due to different criteria traditionally utilized for predicting translation speeds of codons, including their usage frequencies and the concentration of tRNA species capable of decoding them, which do not always correlate. Here, we developed a metric to predict organism-specific relative translation rates of codons based on the availability of tRNA decoding mechanisms: Watson-Crick, non-Watson-Crick or both types of interactions. We determine translation rates of messages by pulse-chase analyses in living Escherichia coli cells and show that sequence engineering based on these concepts predictably modulates translation rates in a manner that is superior to codon usage frequency, which occur during the elongation phase, and significantly impacts folding of the encoded polypeptide. Finally, we demonstrate that sequence harmonization based on expression host tRNA pools, designed to mimic ribosome movement of the original organism, can significantly increase the folding of the encoded polypeptide. These results illuminate how genetic code degeneracy may function to specify properties beyond amino acid encoding, including folding.
Collapse
|
11
|
Chartier M, Gaudreault F, Najmanovich R. Large-scale analysis of conserved rare codon clusters suggests an involvement in co-translational molecular recognition events. ACTA ACUST UNITED AC 2012; 28:1438-45. [PMID: 22467916 DOI: 10.1093/bioinformatics/bts149] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
MOTIVATION An increasing amount of evidence from experimental and computational analysis suggests that rare codon clusters are functionally important for protein activity. Most of the studies on rare codon clusters were performed on a limited number of proteins or protein families. In the present study, we present the Sherlocc program and how it can be used for large scale protein family analysis of evolutionarily conserved rare codon clusters and their relation to protein function and structure. This large-scale analysis was performed using the whole Pfam database covering over 70% of the known protein sequence universe. Our program Sherlocc, detects statistically relevant conserved rare codon clusters and produces a user-friendly HTML output. RESULTS Statistically significant rare codon clusters were detected in a multitude of Pfam protein families. The most statistically significant rare codon clusters were predominantly identified in N-terminal Pfam families. Many of the longest rare codon clusters are found in membrane-related proteins which are required to interact with other proteins as part of their function, for example in targeting or insertion. We identified some cases where rare codon clusters can play a regulating role in the folding of catalytically important domains. Our results support the existence of a widespread functional role for rare codon clusters across species. Finally, we developed an online filter-based search interface that provides access to Sherlocc results for all Pfam families. AVAILABILITY The Sherlocc program and search interface are open access and are available at http://bcb.med.usherbrooke.ca
Collapse
Affiliation(s)
- Matthieu Chartier
- Department of Biochemistry, Faculty of Medicine and Health Sciences, Université de Sherbrooke, 12e Avenue Nord, Sherbrooke, Québec, Canada
| | | | | |
Collapse
|
12
|
Venetianer P. Are synonymous codons indeed synonymous? Biomol Concepts 2012; 3:21-8. [DOI: 10.1515/bmc.2011.050] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2011] [Accepted: 10/25/2011] [Indexed: 12/17/2022] Open
Abstract
AbstractIt has long been known that the distribution and frequency of occurence of synonymous codons can vary greatly among different species, and that the abundance of isoaccepting tRNA species could also be very different. The interaction of these two factors may influence the rate and efficiency of protein synthesis and therefore synonymous mutations might influence the fitness of the organism and cannot be treated generally as ‘neutral’ in an evolutionary sense. These general effects of synonymous mutations, and their possible role in evolution, have been discussed in several recent papers. This review, however, will only deal with the influence of synonymous codon replacements on the expression of individual genes. It will describe the possible mechanisms of such effects and will present examples demonstrating the existence and effects of each of these mechanisms.
Collapse
Affiliation(s)
- Pál Venetianer
- 1Biological Research Centre, Hungarian Academy of Sciences, Temesvári krt. 62, H-6726 Szeged, Hungary
| |
Collapse
|