1
|
Erez K, Jangid A, Feldheim ON, Friedlander T. The role of promiscuous molecular recognition in the evolution of RNase-based self-incompatibility in plants. Nat Commun 2024; 15:4864. [PMID: 38849350 PMCID: PMC11161657 DOI: 10.1038/s41467-024-49163-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Accepted: 05/22/2024] [Indexed: 06/09/2024] Open
Abstract
How do biological networks evolve and expand? We study these questions in the context of the plant collaborative-non-self recognition self-incompatibility system. Self-incompatibility evolved to avoid self-fertilization among hermaphroditic plants. It relies on specific molecular recognition between highly diverse proteins of two families: female and male determinants, such that the combination of genes an individual possesses determines its mating partners. Though highly polymorphic, previous models struggled to pinpoint the evolutionary trajectories by which new specificities evolved. Here, we construct a novel theoretical framework, that crucially affords interaction promiscuity and multiple distinct partners per protein, as is seen in empirical findings disregarded by previous models. We demonstrate spontaneous self-organization of the population into distinct "classes" with full between-class compatibility and a dynamic long-term balance between class emergence and decay. Our work highlights the importance of molecular recognition promiscuity to network evolvability. Promiscuity was found in additional systems suggesting that our framework could be more broadly applicable.
Collapse
Affiliation(s)
- Keren Erez
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Faculty of Agriculture, The Hebrew University of Jerusalem, P.O. Box 12, Rehovot, 7610001, Israel
| | - Amit Jangid
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Faculty of Agriculture, The Hebrew University of Jerusalem, P.O. Box 12, Rehovot, 7610001, Israel
| | - Ohad Noy Feldheim
- The Einstein Institute of Mathematics, Faculty of Natural Sciences, The Hebrew University of Jerusalem, Jerusalem, 9190401, Israel
| | - Tamar Friedlander
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture, Faculty of Agriculture, The Hebrew University of Jerusalem, P.O. Box 12, Rehovot, 7610001, Israel.
| |
Collapse
|
2
|
Saikia B, Baruah A. In silico design of misfolding resistant proteins: the role of structural similarity of a competing conformational ensemble in the optimization of frustration. SOFT MATTER 2024; 20:3283-3298. [PMID: 38529658 DOI: 10.1039/d4sm00171k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
Most state-of-the-art in silico design methods fail due to misfolding of designed sequences to a conformation other than the target. Thus, a method to design misfolding resistant proteins will provide a better understanding of the misfolding phenomenon and will also increase the success rate of in silico design methods. In this work, we optimize the conformational ensemble to be selected for negative design purposes based on the similarity of the conformational ensemble to the target. Five ensembles with different degrees of similarity to the target are created and destabilized and the target is stabilized while designing sequences using mean field theory and Monte Carlo simulation methods. The results suggest that the degree of similarity of the non-native conformations to the target plays a prominent role in designing misfolding resistant protein sequences. The design procedures that destabilize the conformational ensemble with moderate similarity to the target have proven to be more promising. Incorporation of either highly similar or highly dissimilar conformations to the target conformation into the non-native ensemble to be destabilized may lead to sequences with a higher misfolding propensity. This will significantly reduce the conformational space to be considered in any protein design procedure. Interestingly, the results suggest that a sequence with higher frustration in the target structure does not necessarily lead to a misfold prone sequence. A successful design method may purposefully choose a frustrated sequence in the target conformation if that sequence is even more frustrated in the competing non-native conformations.
Collapse
Affiliation(s)
- Bondeepa Saikia
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India.
| | - Anupaul Baruah
- Department of Chemistry, Dibrugarh University, Dibrugarh 786004, India.
| |
Collapse
|
3
|
Abstract
Protein function can be allosterically regulated by changes in structure or dynamics. PDZ domains are classic examples for studies of allostery in single protein domains. However, PDZ domains are often found in multidomain proteins; in particular, PDZ3 is located in a supramodule containing three domains. The allosteric network in PDZ3 has never been studied in the presence of the adjacent domains. Here we map the allosteric network for a PDZ3:ligand complex, both in isolation and in the context of a supramodule. We demonstrate that the allosteric network is highly dependent on this supertertiary structure, with broad implications for studies of allostery in single domains. The notion that protein function is allosterically regulated by structural or dynamic changes in proteins has been extensively investigated in several protein domains in isolation. In particular, PDZ domains have represented a paradigm for these studies, despite providing conflicting results. Furthermore, it is still unknown how the association between protein domains in supramodules, consitituting so-called supertertiary structures, affects allosteric networks. Here, we experimentally mapped the allosteric network in a PDZ:ligand complex, both in isolation and in the context of a supramodular structure, and show that allosteric networks in a PDZ domain are highly dependent on the supertertiary structure in which they are present. This striking sensitivity of allosteric networks to the presence of adjacent protein domains is likely a common property of supertertiary structures in proteins. Our findings have general implications for prediction of allosteric networks from primary and tertiary structures and for quantitative descriptions of allostery.
Collapse
|
4
|
Affiliation(s)
- Lavi S. Bigman
- Department of Structural BiologyWeizmann Institute of Science Rehovot 76100 Israel
| | - Yaakov Levy
- Department of Structural BiologyWeizmann Institute of Science Rehovot 76100 Israel
| |
Collapse
|
5
|
Bigman LS, Levy Y. Proteins: molecules defined by their trade-offs. Curr Opin Struct Biol 2020; 60:50-56. [DOI: 10.1016/j.sbi.2019.11.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2019] [Revised: 10/07/2019] [Accepted: 11/11/2019] [Indexed: 12/30/2022]
|
6
|
Horovitz A, Fleisher RC, Mondal T. Double-mutant cycles: new directions and applications. Curr Opin Struct Biol 2019; 58:10-17. [PMID: 31029859 DOI: 10.1016/j.sbi.2019.03.025] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2019] [Accepted: 03/20/2019] [Indexed: 11/17/2022]
Abstract
Double-mutant cycle (DMC) analysis is a powerful approach for detecting and quantifying the energetics of both direct and long-range interactions in proteins and other chemical systems. It can also be used to unravel higher-order interactions (e.g. three-body effects) that lead to cooperativity in protein folding and function. In this review, we describe new applications of DMC analysis based on advances in native mass spectrometry and high-throughput methods such as next generation sequencing and protein complementation assays. These developments have facilitated carrying out high-throughput DMC analysis, which can be used to characterize increasingly higher-order interactions and very large interaction networks in proteins. Such studies have provided insights into the extent of cooperativity (epistasis) in protein structures. High-throughput DMC studies have also been used to validate correlated mutation analysis and can provide restraints for protein docking.
Collapse
Affiliation(s)
- Amnon Horovitz
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel.
| | - Rachel C Fleisher
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Tridib Mondal
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
7
|
How Often Do Protein Genes Navigate Valleys of Low Fitness? Genes (Basel) 2019; 10:genes10040283. [PMID: 30965625 PMCID: PMC6523826 DOI: 10.3390/genes10040283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 03/27/2019] [Accepted: 04/02/2019] [Indexed: 11/17/2022] Open
Abstract
To escape from local fitness peaks, a population must navigate across valleys of low fitness. How these transitions occur, and what role they play in adaptation, have been subjects of active interest in evolutionary genetics for almost a century. However, to our knowledge, this problem has never been addressed directly by considering the evolution of a gene, or group of genes, as a whole, including the complex effects of fitness interactions among multiple loci. Here, we use a precise model of protein fitness to compute the probability P ( s , Δ t ) that an allele, randomly sampled from a population at time t, has crossed a fitness valley of depth s during an interval t - Δ t , t in the immediate past. We study populations of model genes evolving under equilibrium conditions consistent with those in mammalian mitochondria. From this data, we estimate that genes encoding small protein motifs navigate fitness valleys of depth 2 N s ≳ 30 with probability P ≳ 0 . 1 on a time scale of human evolution, where N is the (mitochondrial) effective population size. The results are consistent with recent findings for Watson⁻Crick switching in mammalian mitochondrial tRNA molecules.
Collapse
|
8
|
Rodriguez PM, Stratmann D, Duprat E, Papandreou N, Acuna R, Lacroix Z, Chomilier J. Correlating topology and thermodynamics to predict protein structure sensitivity to point mutations. BIO-ALGORITHMS AND MED-SYSTEMS 2018. [DOI: 10.1515/bams-2018-0026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractThe relation between distribution of hydrophobic amino acids along with protein chains and their structure is far from being completely understood. No reliable method allowsab initioprediction of the folded structure from this distribution of physicochemical properties, even when they are highly degenerated by considering only two classes: hydrophobic and polar. Establishment of long-range hydrophobic three dimension (3D) contacts is essential for the formation of the nucleus, a key process in the early steps of protein folding. Thus, a large number of 3D simulation studies were developed to challenge this issue. They are nowadays evaluated in a specific chapter of the molecular modeling competition, Critical Assessment of Protein Structure Prediction. We present here a simulation of the early steps of the folding process for 850 proteins, performed in a discrete 3D space, which results in peaks in the predicted distribution of intra-chain noncovalent contacts. The residues located at these peak positions tend to be buried in the core of the protein and are expected to correspond to critical positions in the sequence, important both for folding and structural (or similarly, energetic in the thermodynamic hypothesis) stability. The degree of stabilization or destabilization due to a point mutation at the critical positions involved in numerous contacts is estimated from the calculated folding free energy difference between mutated and native structures. The results show that these critical positions are not tolerant towards mutation. This simulation of the noncovalent contacts only needs a sequence as input, and this paper proposes a validation of the method by comparison with the prediction of stability by well-established programs.
Collapse
|
9
|
Bigman LS, Levy Y. Stability Effects of Protein Mutations: The Role of Long-Range Contacts. J Phys Chem B 2018; 122:11450-11459. [DOI: 10.1021/acs.jpcb.8b07379] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Lavi S. Bigman
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Yaakov Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
10
|
Abstract
Mutations in the GBA1 gene are associated with increased risk of Parkinson's disease, and the protein produced by the gene, glucocerebrosidase, interacts with α-synuclein, the protein at the center of the disease etiology. One possibility is that the mutations disrupt a beneficial interaction between the proteins, and a beneficial interaction would imply that the proteins have coevolved. To explore this possibility, a correlated mutation analysis has been performed for all 72 vertebrate species where complete sequences of α-synuclein and glucocerebrosidase are known. The most highly correlated pair of residue variations is α-synuclein A53T and glucocerebrosidase G115E. Intriguingly, the A53T mutation is a Parkinson's disease risk factor in humans, suggesting the pathology associated with this mutation and interaction with glucocerebrosidase might be connected. Correlations with β-synuclein are also evaluated. To assess the impact of lowered species number on accuracy, intra and inter-chain correlations are also calculated for hemoglobin, using mutual information Z-value and direct coupling analyses.
Collapse
Affiliation(s)
- James M. Gruschus
- Laboratory of Structural Biophysics, NHLBI, NIH, Bethesda, Maryland, United States of America
- * E-mail:
| |
Collapse
|
11
|
Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface 2015; 11:20140419. [PMID: 25165599 DOI: 10.1098/rsif.2014.0419] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
Affiliation(s)
- Tobias Sikosek
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
12
|
Detecting selection on protein stability through statistical mechanical models of folding and evolution. Biomolecules 2014; 4:291-314. [PMID: 24970217 PMCID: PMC4030984 DOI: 10.3390/biom4010291] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2013] [Revised: 02/13/2014] [Accepted: 02/14/2014] [Indexed: 12/31/2022] Open
Abstract
The properties of biomolecules depend both on physics and on the evolutionary process that formed them. These two points of view produce a powerful synergism. Physics sets the stage and the constraints that molecular evolution has to obey, and evolutionary theory helps in rationalizing the physical properties of biomolecules, including protein folding thermodynamics. To complete the parallelism, protein thermodynamics is founded on the statistical mechanics in the space of protein structures, and molecular evolution can be viewed as statistical mechanics in the space of protein sequences. In this review, we will integrate both points of view, applying them to detecting selection on the stability of the folded state of proteins. We will start discussing positive design, which strengthens the stability of the folded against the unfolded state of proteins. Positive design justifies why statistical potentials for protein folding can be obtained from the frequencies of structural motifs. Stability against unfolding is easier to achieve for longer proteins. On the contrary, negative design, which consists in destabilizing frequently formed misfolded conformations, is more difficult to achieve for longer proteins. The folding rate can be enhanced by strengthening short-range native interactions, but this requirement contrasts with negative design, and evolution has to trade-off between them. Finally, selection can accelerate functional movements by favoring low frequency normal modes of the dynamics of the native state that strongly correlate with the functional conformation change.
Collapse
|
13
|
Maher B, Albrecht AA, Loomes M, Yang XS, Steinhöfel K. A firefly-inspired method for protein structure prediction in lattice models. Biomolecules 2014; 4:56-75. [PMID: 24970205 PMCID: PMC4030990 DOI: 10.3390/biom4010056] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2013] [Revised: 12/17/2013] [Accepted: 12/27/2013] [Indexed: 02/05/2023] Open
Abstract
We introduce a Firefly-inspired algorithmic approach for protein structure prediction over two different lattice models in three-dimensional space. In particular, we consider three-dimensional cubic and three-dimensional face-centred-cubic (FCC) lattices. The underlying energy models are the Hydrophobic-Polar (H-P) model, the Miyazawa–Jernigan (M-J) model and a related matrix model. The implementation of our approach is tested on ten H-P benchmark problems of a length of 48 and ten M-J benchmark problems of a length ranging from 48 until 61. The key complexity parameter we investigate is the total number of objective function evaluations required to achieve the optimum energy values for the H-P model or competitive results in comparison to published values for the M-J model. For H-P instances and cubic lattices, where data for comparison are available, we obtain an average speed-up over eight instances of 2.1, leaving out two extreme values (otherwise, 8.8). For six M-J instances, data for comparison are available for cubic lattices and runs with a population size of 100, where, a priori, the minimum free energy is a termination criterion. The average speed-up over four instances is 1.2 (leaving out two extreme values, otherwise 1.1), which is achieved for a population size of only eight instances. The present study is a test case with initial results for ad hoc parameter settings, with the aim of justifying future research on larger instances within lattice model settings, eventually leading to the ultimate goal of implementations for off-lattice models.
Collapse
Affiliation(s)
- Brian Maher
- Department of Informatics, King's College London, Strand, London WC2R 2LS, UK.
| | - Andreas A Albrecht
- School of Science and Technology, Middlesex University, The Burroughs, London, NW4 4BT, UK.
| | - Martin Loomes
- School of Science and Technology, Middlesex University, The Burroughs, London, NW4 4BT, UK.
| | - Xin-She Yang
- School of Science and Technology, Middlesex University, The Burroughs, London, NW4 4BT, UK.
| | - Kathleen Steinhöfel
- Department of Informatics, King's College London, Strand, London WC2R 2LS, UK.
| |
Collapse
|
14
|
Minning J, Porto M, Bastolla U. Detecting selection for negative design in proteins through an improved model of the misfolded state. Proteins 2013; 81:1102-12. [PMID: 23280507 DOI: 10.1002/prot.24244] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Accepted: 12/17/2012] [Indexed: 11/05/2022]
Abstract
Proteins that need to be structured in their native state must be stable both against the unfolded ensemble and against incorrectly folded (misfolded) conformations with low free energy. Positive design targets the first type of stability by strengthening native interactions. The second type of stability is achieved by destabilizing interactions that occur frequently in the misfolded ensemble, a strategy called negative design. Here, we investigate negative design adopting a statistical mechanical model of the misfolded ensemble, which improves the usual Gaussian approximation by taking into account the third moment of the energy distribution and contact correlations. Applying this model, we detect and quantify selection for negative design in most natural proteins, and we analytically design protein sequences that are stable both against unfolding and against misfolding.
Collapse
Affiliation(s)
- Jonas Minning
- Institut für Festkörperphysik, Technische Universität Darmstadt, Darmstadt, Germany
| | | | | |
Collapse
|
15
|
Bechini A. On the characterization and software implementation of general protein lattice models. PLoS One 2013; 8:e59504. [PMID: 23555684 PMCID: PMC3612044 DOI: 10.1371/journal.pone.0059504] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2012] [Accepted: 02/13/2013] [Indexed: 11/19/2022] Open
Abstract
models of proteins have been widely used as a practical means to computationally investigate general properties of the system. In lattice models any sterically feasible conformation is represented as a self-avoiding walk on a lattice, and residue types are limited in number. So far, only two- or three-dimensional lattices have been used. The inspection of the neighborhood of alpha carbons in the core of real proteins reveals that also lattices with higher coordination numbers, possibly in higher dimensional spaces, can be adopted. In this paper, a new general parametric lattice model for simplified protein conformations is proposed and investigated. It is shown how the supporting software can be consistently designed to let algorithms that operate on protein structures be implemented in a lattice-agnostic way. The necessary theoretical foundations are developed and organically presented, pinpointing the role of the concept of main directions in lattice-agnostic model handling. Subsequently, the model features across dimensions and lattice types are explored in tests performed on benchmark protein sequences, using a Python implementation. Simulations give insights on the use of square and triangular lattices in a range of dimensions. The trend of potential minimum for sequences of different lengths, varying the lattice dimension, is uncovered. Moreover, an extensive quantitative characterization of the usage of the so-called "move types" is reported for the first time. The proposed general framework for the development of lattice models is simple yet complete, and an object-oriented architecture can be proficiently employed for the supporting software, by designing ad-hoc classes. The proposed framework represents a new general viewpoint that potentially subsumes a number of solutions previously studied. The adoption of the described model pushes to look at protein structure issues from a more general and essential perspective, making computational investigations over simplified models more straightforward as well.
Collapse
Affiliation(s)
- Alessio Bechini
- Department of Information Engineering, University of Pisa, Pisa, Italy.
| |
Collapse
|
16
|
Moreno-Hernández S, Levitt M. Comparative modeling and protein-like features of hydrophobic-polar models on a two-dimensional lattice. Proteins 2012; 80:1683-93. [PMID: 22411636 DOI: 10.1002/prot.24067] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2011] [Revised: 02/26/2012] [Accepted: 03/03/2012] [Indexed: 11/07/2022]
Abstract
Lattice models of proteins have been extensively used to study protein thermodynamics, folding dynamics, and evolution. Our study considers two different hydrophobic-polar (HP) models on the 2D square lattice: the purely HP model and a model where a compactness-favoring term is added. We exhaustively enumerate all the possible structures in our models and perform the study of their corresponding folds, HP arrangements in space and shapes. The two models considered differ greatly in their numbers of structures, folds, arrangements, and shapes. Despite their differences, both lattice models have distinctive protein-like features: (1) Shapes are compact in both models, especially when a compactness-favoring energy term is added. (2) The residue composition is independent of the chain length and is very close to 50% hydrophobic in both models, as we observe in real proteins. (3) Comparative modeling works well in both models, particularly in the more compact one. The fact that our models show protein-like features suggests that lattice models incorporate the fundamental physical principles of proteins. Our study supports the use of lattice models to study questions about proteins that require exactness and extensive calculations, such as protein design and evolution, which are often too complex and computationally demanding to be addressed with more detailed models.
Collapse
Affiliation(s)
- Sergio Moreno-Hernández
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | |
Collapse
|
17
|
Brodkin HR, Novak WRP, Milne AC, D'Aquino JA, Karabacak NM, Goldberg IG, Agar JN, Payne MS, Petsko GA, Ondrechen MJ, Ringe D. Evidence of the participation of remote residues in the catalytic activity of Co-type nitrile hydratase from Pseudomonas putida. Biochemistry 2011; 50:4923-35. [PMID: 21473592 DOI: 10.1021/bi101761e] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Active sites may be regarded as layers of residues, whereby the residues that interact directly with substrate also interact with residues in a second shell and these in turn interact with residues in a third shell. These residues in the second and third layers may have distinct roles in maintaining the essential chemical properties of the first-shell catalytic residues, particularly their spatial arrangement relative to the substrate binding pocket, and their electrostatic and dynamic properties. The extent to which these remote residues participate in catalysis and precisely how they affect first-shell residues remains unexplored. To improve our understanding of the roles of second- and third-shell residues in catalysis, we used THEMATICS to identify residues in the second and third shells of the Co-type nitrile hydratase from Pseudomonas putida (ppNHase) that may be important for catalysis. Five of these predicted residues, and three additional, conserved residues that were not predicted, have been conservatively mutated, and their effects have been studied both kinetically and structurally. The eight residues have no direct contact with the active site metal ion or bound substrate. These results demonstrate that three of the predicted second-shell residues (α-Asp164, β-Glu56, and β-His147) and one predicted third-shell residue (β-His71) have significant effects on the catalytic efficiency of the enzyme. One of the predicted residues (α-Glu168) and the three residues not predicted (α-Arg170, α-Tyr171, and β-Tyr215) do not have any significant effects on the catalytic efficiency of the enzyme.
Collapse
Affiliation(s)
- Heather R Brodkin
- Department of Chemistry and Chemical Biology and Institute for Complex Scientific Software, Northeastern University, Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Chen Y, Ding J. Roles of non-native hydrogen-bonding interaction in helix-coil transition of a single polypeptide as revealed by comparison between Gō-like and non-Gō models. Proteins 2010; 78:2090-100. [PMID: 20455265 DOI: 10.1002/prot.22724] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
To explore the role of non-native interactions in the helix-coil transition, a detailed comparison between a Gō-like model and a non-Gō model has been performed via lattice Monte Carlo simulations. Only native hydrogen bonding interactions occur in the Gō-like model, and the non-native ones with sequence interval more than 4 is also included into the non-Gō model. Some significant differences between the results from those two models have been found. The non-native hydrogen bonds were found most populated at temperature around the helix-coil transition. The rearrangement of non-native hydrogen bonds into native ones in the formation of alpha-helix leads to the increase of susceptibility of chain conformation, and even two peaks of susceptibility of radius of gyration versus temperature exist in the case of non-Gō model for a non-short peptide, while just a single peak exists in the case of Gō model for a single polypeptide chain with various chain lengths. The non-native hydrogen bonds have complicated the temperature-dependence of Zimm-Bragg nucleation constant. The increase of relative probability of non-native hydrogen bonding for long polypeptide chains leads to non-monotonous chain length effect on the transition temperature.
Collapse
Affiliation(s)
- Yantao Chen
- Key Laboratory of Molecular Engineering of Polymers of Ministry of Education, Department of Macromolecular Science, Laboratory of Advanced Materials, Fudan University, Shanghai 200433, China
| | | |
Collapse
|
19
|
Kowarsch A, Fuchs A, Frishman D, Pagel P. Correlated mutations: a hallmark of phenotypic amino acid substitutions. PLoS Comput Biol 2010; 6. [PMID: 20862353 PMCID: PMC2940720 DOI: 10.1371/journal.pcbi.1000923] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2009] [Accepted: 08/09/2010] [Indexed: 11/18/2022] Open
Abstract
Point mutations resulting in the substitution of a single amino acid can cause severe functional consequences, but can also be completely harmless. Understanding what determines the phenotypical impact is important both for planning targeted mutation experiments in the laboratory and for analyzing naturally occurring mutations found in patients. Common wisdom suggests using the extent of evolutionary conservation of a residue or a sequence motif as an indicator of its functional importance and thus vulnerability in case of mutation. In this work, we put forward the hypothesis that in addition to conservation, co-evolution of residues in a protein influences the likelihood of a residue to be functionally important and thus associated with disease. While the basic idea of a relation between co-evolution and functional sites has been explored before, we have conducted the first systematic and comprehensive analysis of point mutations causing disease in humans with respect to correlated mutations. We included 14,211 distinct positions with known disease-causing point mutations in 1,153 human proteins in our analysis. Our data show that (1) correlated positions are significantly more likely to be disease-associated than expected by chance, and that (2) this signal cannot be explained by conservation patterns of individual sequence positions. Although correlated residues have primarily been used to predict contact sites, our data are in agreement with previous observations that (3) many such correlations do not relate to physical contacts between amino acid residues. Access to our analysis results are provided at http://webclu.bio.wzw.tum.de/~pagel/supplements/correlated-positions/. Point mutations (i.e., changes of a single sequence element) can have a severe impact on protein function. Many diseases are caused by such minute defects. On the other hand, the majority of such mutations does not lead to noticeable effects. Although previous research has revealed important aspects that influence or predict the chance of a mutation to cause disease, much remains to be learned before we fully understand this complex problem. In our work, we use the observation that sometimes certain positions in a protein mutate in an apparently correlated fashion and analyze this correlation with respect to mutation vulnerability. Our results show that positions exhibiting evolutionary correlation are significantly more likely to be vulnerable to mutation than average positions. On one hand, our data further support the concept of correlated positions to not only be associated with protein contacts but also functional sites and/or disease positions (as introduced by others). On the other hand, this could be useful to further improve the understanding and prediction of the consequences of mutations. Our work is the first to attempt a large-scale quantitation of this relationship.
Collapse
Affiliation(s)
- Andreas Kowarsch
- Lehrstuhl für Genomorientierte Bioinformatik, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
- Institut für Bioinformatik und Systembiologie/MIPS, Helmholtz Zentrum München – Deutsches Forschungszentrum für Gesundheit und Umwelt, Neuherberg, Germany
| | - Angelika Fuchs
- Lehrstuhl für Genomorientierte Bioinformatik, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
| | - Dmitrij Frishman
- Lehrstuhl für Genomorientierte Bioinformatik, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
- Institut für Bioinformatik und Systembiologie/MIPS, Helmholtz Zentrum München – Deutsches Forschungszentrum für Gesundheit und Umwelt, Neuherberg, Germany
| | - Philipp Pagel
- Lehrstuhl für Genomorientierte Bioinformatik, Technische Universität München, Wissenschaftszentrum Weihenstephan, Freising, Germany
- Institut für Bioinformatik und Systembiologie/MIPS, Helmholtz Zentrum München – Deutsches Forschungszentrum für Gesundheit und Umwelt, Neuherberg, Germany
- * E-mail:
| |
Collapse
|
20
|
Boyer JA, Clay CJ, Luce KS, Edgell MH, Lee AL. Detection of native-state nonadditivity in double mutant cycles via hydrogen exchange. J Am Chem Soc 2010; 132:8010-9. [PMID: 20481530 DOI: 10.1021/ja1003922] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Proteins have evolved to exploit long-range structural and dynamic effects as a means of regulating function. Understanding communication between sites in proteins is therefore vital to our comprehension of such phenomena as allostery, catalysis, and ligand binding/ejection. Double mutant cycle analysis has long been used to determine the existence of communication between pairs of sites, proximal or distal, in proteins. Typically, nonadditivity (or "thermodynamic coupling") is measured from global transitions in concert with a single probe. Here, we have applied the atomic resolution of NMR in tandem with native-state hydrogen exchange (HX) to probe the structure/energy landscape for information transduction between a large number of distal sites in a protein. Considering the event of amide proton exchange as an energetically quantifiable structural perturbation, m n-dimensional cycles can be constructed from mutation of n-1 residues, where m is the number of residues for which HX data is available. Thus, efficient mapping of a large number of couplings is made possible. We have applied this technique to one additive and two nonadditive double mutant cycles in a model system, eglin c. We find heterogeneity of HX-monitored couplings for each cycle, yet averaging results in strong agreement with traditionally measured values. Furthermore, long-range couplings observed at locally exchanging residues indicate that the basis for communication can occur within the native state ensemble, a conclusion not apparent from traditional measurements. We propose that higher-order couplings can be obtained and show that such couplings provide a mechanistic basis for understanding lower-order couplings via "spheres of perturbation". The method is presented as an additional tool for identifying a large number of couplings with greater coverage of the protein of interest.
Collapse
Affiliation(s)
- Joshua A Boyer
- Department of Biochemistry and Biophysics, School of Medicine, University of North Carolina, Chapel Hill, North Carolina 27599, USA
| | | | | | | | | |
Collapse
|
21
|
Chen T, Vernazobres D, Yomo T, Bornberg-Bauer E, Chan HS. Evolvability and single-genotype fluctuation in phenotypic properties: a simple heteropolymer model. Biophys J 2010; 98:2487-96. [PMID: 20513392 PMCID: PMC2877360 DOI: 10.1016/j.bpj.2010.02.046] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Revised: 02/15/2010] [Accepted: 02/26/2010] [Indexed: 11/26/2022] Open
Abstract
Experiment showed that the response of a genotype to mutation, i.e., the magnitude of mutational change in a phenotypic property, can be correlated with the extent of phenotypic fluctuation among genetic clones. To address a possible statistical mechanical basis for such phenomena at the protein level, we consider a simple hydrophobic-polar lattice protein-chain model with an exhaustive mapping between sequence (genotype) and conformational (phenotype) spaces. Using squared end-to-end distance, R(N)(2), as an example conformational property, we study how the thermal fluctuation of a sequence's R(N)(2) may be predictive of the changes in the Boltzmann average R(N)(2) caused by single-point mutations on that sequence. We found that sequences with the same ground-state (R(N)(2))(0) exhibit a funnel-like organization under conditions favorable to chain collapse or folding: fluctuation (standard deviation sigma) of R(N)(2) tends to increase with mutational distance from a prototype sequence whose R(N)(2) deviates little from its (R(N)(2))(0). In general, large mutational decreases in R(N)(2) or in sigma are only possible for some, though not all, sequences with large sigma values. This finding suggests that single-genotype phenotypic fluctuation is a necessary, though not sufficient, indicator of evolvability toward genotypes with less phenotypic fluctuations.
Collapse
Affiliation(s)
- Tao Chen
- Departments of Biochemistry and of Molecular Genetics, Faculty of Medicine, and Department of Physics, University of Toronto, Toronto, Ontario, Canada
| | - David Vernazobres
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Münster, Germany
| | - Tetsuya Yomo
- Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, and the Graduate School of Frontier Bioscience, Osaka University, Osaka, Japan
- Exploratory Research for Advanced Technology, Japan Science and Technology Agency, Osaka, Japan
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, School of Biological Sciences, University of Münster, Münster, Germany
| | - Hue Sun Chan
- Departments of Biochemistry and of Molecular Genetics, Faculty of Medicine, and Department of Physics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
22
|
Ashkenazy H, Kliger Y. Reducing phylogenetic bias in correlated mutation analysis. Protein Eng Des Sel 2010; 23:321-6. [PMID: 20067922 DOI: 10.1093/protein/gzp078] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Correlated mutation analysis (CMA) is a sequence-based approach for ab initio protein contact map prediction. The basis of this approach is the observed correlation between mutations in interacting amino acid residues. These correlations are often estimated by either calculating the Pearson's correlation coefficient (PCC) or the mutual information (MI) between columns in a multiple sequence alignment (MSA) of the protein of interest and its homologs. A major challenge of CMA is to filter out the background noise originating from phylogenetic relatedness between sequences included in the MSA. Recently, a procedure to reduce this background noise was demonstrated to improve an MI-based predictor. Herein, we tested whether a similar approach can also improve the performance of the classical PCC-based method. Indeed, performance improvements were achieved for all four major SCOP classes. Furthermore, the results reveal that the improved PCC-based method is superior to MI-based methods for proteins having MSAs of up to 100 sequences.
Collapse
|
23
|
Noivirt-Brik O, Horovitz A, Unger R. Trade-off between positive and negative design of protein stability: from lattice models to real proteins. PLoS Comput Biol 2009; 5:e1000592. [PMID: 20011105 PMCID: PMC2781108 DOI: 10.1371/journal.pcbi.1000592] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2009] [Accepted: 11/03/2009] [Indexed: 11/18/2022] Open
Abstract
Two different strategies for stabilizing proteins are (i) positive design in which the native state is stabilized and (ii) negative design in which competing non-native conformations are destabilized. Here, the circumstances under which one strategy might be favored over the other are explored in the case of lattice models of proteins and then generalized and discussed with regard to real proteins. The balance between positive and negative design of proteins is found to be determined by their average "contact-frequency", a property that corresponds to the fraction of states in the conformational ensemble of the sequence in which a pair of residues is in contact. Lattice model proteins with a high average contact-frequency are found to use negative design more than model proteins with a low average contact-frequency. A mathematical derivation of this result indicates that it is general and likely to hold also for real proteins. Comparison of the results of correlated mutation analysis for real proteins with typical contact-frequencies to those of proteins likely to have high contact-frequencies (such as disordered proteins and proteins that are dependent on chaperonins for their folding) indicates that the latter tend to have stronger interactions between residues that are not in contact in their native conformation. Hence, our work indicates that negative design is employed when insufficient stabilization is achieved via positive design owing to high contact-frequencies.
Collapse
Affiliation(s)
- Orly Noivirt-Brik
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Amnon Horovitz
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| | - Ron Unger
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel
| |
Collapse
|
24
|
Liu Z, Chen J, Thirumalai D. On the accuracy of inferring energetic coupling between distant sites in protein families from evolutionary imprints: Illustrations using lattice model. Proteins 2009; 77:823-31. [DOI: 10.1002/prot.22498] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
25
|
Chan HS, Zhang Z. Liaison amid disorder: non-native interactions may underpin long-range coupling in proteins. J Biol 2009; 8:27. [PMID: 19344476 PMCID: PMC2689430 DOI: 10.1186/jbiol126] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
A lattice-model study of double-mutant cycles published in BMC Structural Biology underscores how interactions in non-native conformations can lead to thermodynamic coupling between distant residues in globular proteins, adding to recent advances in delineating the often crucial roles played by disordered conformational ensembles in protein behavior.
Collapse
Affiliation(s)
- Hue Sun Chan
- Department of Biochemistry, University of Toronto, 1 King's College Circle, Toronto, Ontario, Canada.
| | | |
Collapse
|