1
|
Melnik BS, Nagibina GS, Glukhov AS, Melnik TN, Uversky VN. Substitutions of Amino Acids with Large Number of Contacts in the Native State Have no Effect on the Rates of Protein Folding. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2016; 1864:1809-1817. [PMID: 27639966 DOI: 10.1016/j.bbapap.2016.09.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Revised: 09/09/2016] [Accepted: 09/13/2016] [Indexed: 11/28/2022]
Abstract
Various effects of amino acid substitutions on properties of globular proteins have been described in a large number of research papers. Nevertheless, no definite "rule" has been formulated as of yet that could be used by experimentalists to introduce desirable changes in the properties of proteins. Herein we attempt to establish such a "rule". To this end, a hypothesis is proposed on the effects of substitutions of hydrophobic residues with large number of contacts on free energies of different states of a globular protein. The hypothesis states: Substitutions of hydrophobic residues engaged in a large number of residue-residue contacts would not change the folding rate of a protein but could affect its unfolding rate. This hypothesis was verified by both theoretical and experimental analyses, generating a general rule that can facilitate the work of experimentalists on constructing mutant forms of proteins.
Collapse
Affiliation(s)
- Bogdan S Melnik
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Moscow, Region, Russia.
| | - Galina S Nagibina
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Moscow, Region, Russia
| | - Anatoly S Glukhov
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Moscow, Region, Russia
| | - Tatiana N Melnik
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Moscow, Region, Russia
| | - Vladimir N Uversky
- Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia; Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA.
| |
Collapse
|
2
|
Smock RG, Yadid I, Dym O, Clarke J, Tawfik DS. De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints. Cell 2016; 164:476-86. [PMID: 26806127 PMCID: PMC4735018 DOI: 10.1016/j.cell.2015.12.024] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2015] [Revised: 10/05/2015] [Accepted: 12/07/2015] [Indexed: 01/02/2023]
Abstract
Molecular evolution has focused on the divergence of molecular functions, yet we know little about how structurally distinct protein folds emerge de novo. We characterized the evolutionary trajectories and selection forces underlying emergence of β-propeller proteins, a globular and symmetric fold group with diverse functions. The identification of short propeller-like motifs (<50 amino acids) in natural genomes indicated that they expanded via tandem duplications to form extant propellers. We phylogenetically reconstructed 47-residue ancestral motifs that form five-bladed lectin propellers via oligomeric assembly. We demonstrate a functional trajectory of tandem duplications of these motifs leading to monomeric lectins. Foldability, i.e., higher efficiency of folding, was the main parameter leading to improved functionality along the entire evolutionary trajectory. However, folding constraints changed along the trajectory: initially, conflicts between monomer folding and oligomer assembly dominated, whereas subsequently, upon tandem duplication, tradeoffs between monomer stability and foldability took precedence. Inferred 47-aminoacid ancestral motifs fold into functional β-propeller assemblies Motif duplication, fusion, and diversification yield functional monomeric propellers Folding efficiency was the key parameter optimized throughout propeller emergence Single-motif precursors in extant genomes support the reconstructed emergence pathway
Collapse
Affiliation(s)
- Robert G Smock
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Itamar Yadid
- Metabolic Pathways and Enzyme Evolution Laboratory, Migal Galilee Research Institute, Kiryat Shmona 11016, Israel
| | - Orly Dym
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Jane Clarke
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK
| | - Dan S Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
3
|
Schmidt am Busch M, Sedano A, Simonson T. Computational protein design: validation and possible relevance as a tool for homology searching and fold recognition. PLoS One 2010; 5:e10410. [PMID: 20463972 PMCID: PMC2864755 DOI: 10.1371/journal.pone.0010410] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2009] [Accepted: 03/31/2010] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Protein fold recognition usually relies on a statistical model of each fold; each model is constructed from an ensemble of natural sequences belonging to that fold. A complementary strategy may be to employ sequence ensembles produced by computational protein design. Designed sequences can be more diverse than natural sequences, possibly avoiding some limitations of experimental databases. METHODOLOGY/PRINCIPAL FINDINGS WE EXPLORE THIS STRATEGY FOR FOUR SCOP FAMILIES: Small Kunitz-type inhibitors (SKIs), Interleukin-8 chemokines, PDZ domains, and large Caspase catalytic subunits, represented by 43 structures. An automated procedure is used to redesign the 43 proteins. We use the experimental backbones as fixed templates in the folded state and a molecular mechanics model to compute the interaction energies between sidechain and backbone groups. Calculations are done with the Proteins@Home volunteer computing platform. A heuristic algorithm is used to scan the sequence and conformational space, yielding 200,000-300,000 sequences per backbone template. The results confirm and generalize our earlier study of SH2 and SH3 domains. The designed sequences ressemble moderately-distant, natural homologues of the initial templates; e.g., the SUPERFAMILY, profile Hidden-Markov Model library recognizes 85% of the low-energy sequences as native-like. Conversely, Position Specific Scoring Matrices derived from the sequences can be used to detect natural homologues within the SwissProt database: 60% of known PDZ domains are detected and around 90% of known SKIs and chemokines. Energy components and inter-residue correlations are analyzed and ways to improve the method are discussed. CONCLUSIONS/SIGNIFICANCE For some families, designed sequences can be a useful complement to experimental ones for homologue searching. However, improved tools are needed to extract more information from the designed profiles before the method can be of general use.
Collapse
Affiliation(s)
- Marcel Schmidt am Busch
- Laboratoire de Biochimie (CNRS UMR7654), Department of Biology, Ecole Polytechnique, Palaiseau, France
| | - Audrey Sedano
- Laboratoire de Biochimie (CNRS UMR7654), Department of Biology, Ecole Polytechnique, Palaiseau, France
| | - Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Department of Biology, Ecole Polytechnique, Palaiseau, France
| |
Collapse
|
4
|
am Busch MS, Mignon D, Simonson T. Computational protein design as a tool for fold recognition. Proteins 2009; 77:139-58. [PMID: 19408297 DOI: 10.1002/prot.22426] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Computationally designed protein sequences have been proposed as a basis to perform fold recognition and homology searching. To investigate this possibility, an automated procedure is used to completely redesign 24 SH3 proteins and 22 SH2 proteins. We use the experimental backbone coordinates as fixed templates in the folded state and a molecular mechanics model to compute the pairwise interaction energies between all sidechain types and conformations. Energy calculations are done with the Proteins@Home volunteer computing platform. A heuristic algorithm is then used to scan the sequence and conformational space for optimal solutions. We produced 200,000-450,000 sequences for each backbone template. The designed sequences ressemble moderately-distant, natural homologues of the initial templates, according to their identity scores and their similarity with respect to the Pfam sets of SH2 and SH3 domains. Standard homology detection tools document their native-like character: the Conserved Domain Database recognizes 61% (52%) of our low-energy sequences as SH3 (SH2) domains; the SUPERFAMILY, Hidden-Markov Model library recognizes 81% (84%). Conversely, position specific scoring matrices (PSSMs) derived from our designed sequences can be used to detect natural homologues in sequence databases. Within SwissProt, a set of natural SH3 PSSMs detects 772 SH3 domains, for example; our designed PSSMs detect 67% of these, plus one additional sequence and two false positives. If six amino acids involved in substrate binding (a selective pressure not accounted for in our design) are reset to their experimental types, then 77% of the experimental SH3 domains are detected. Results for the SH2 domains are similar. Several directions to improve the method further are discussed.
Collapse
Affiliation(s)
- Marcel Schmidt am Busch
- Laboratoire de Biochimie (CNRS UMR7654), Department of Biology, Ecole Polytechnique, 91128 Palaiseau, France
| | | | | |
Collapse
|
5
|
Schmidt Am Busch M, Lopes A, Mignon D, Simonson T. Computational protein design: Software implementation, parameter optimization, and performance of a simple model. J Comput Chem 2008; 29:1092-102. [DOI: 10.1002/jcc.20870] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
6
|
Mitomo D, Nakamura HK, Ikeda K, Yamagishi A, Higo J. Transition state of a SH3 domain detected with principle component analysis and a charge-neutralized all-atom protein model. Proteins 2006; 64:883-94. [PMID: 16807919 DOI: 10.1002/prot.21069] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The src SH3 domain has been known to be a two-state folder near room temperature. However, in a previous study with an all-atom model simulation near room temperature, the transition state of this protein was not successfully detected on a free-energy profile using two axes: the radius of gyration (R(g)) and native contact reproduction ratio (Q value). In this study, we focused on an atom packing effect to characterize the transition state and tried another analysis to detect it. To explore the atom packing effect more efficiently, we introduced a charge-neutralized all-atom model, where all of the atoms in the protein and water molecules were treated explicitly, but their partial atomic charges were set to zero. Ten molecular dynamics simulations were performed starting from the native structure at 300 K, where the simulation length of each run was 90 ns, and the protein unfolded in all runs. The integrated trajectories (10 x 90 = 900 ns) were analyzed by a principal component analysis (PCA) and showed a clear free-energy barrier between folded- and unfolded-state conformational clusters in a conformational space generated by PCA. There were segments that largely deformed when the conformation passed through the free-energy barrier. These segments correlated well with the structural core regions characterized by large phi-values, and the atom-packing changes correlated with the conformational deformations. Interestingly, using the same simulation data, no significant barrier was found in a free-energy profile using the R(g) and Q values for the coordinate axes. These results suggest that the atom packing effect may be one of the most important determinants of the transition state.
Collapse
Affiliation(s)
- Daisuke Mitomo
- School of Life Science, Tokyo University of Pharmacy and Life Science, Hachioji, Japan
| | | | | | | | | |
Collapse
|
7
|
Campbell-Valois FX, Tarassov K, Michnick SW. Massive sequence perturbation of the Raf ras binding domain reveals relationships between sequence conservation, secondary structure propensity, hydrophobic core organization and stability. J Mol Biol 2006; 362:151-71. [PMID: 16916524 DOI: 10.1016/j.jmb.2006.06.061] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 05/23/2006] [Accepted: 06/21/2006] [Indexed: 11/25/2022]
Abstract
The contributions of specific residues to the delicate balance between function, stability and folding rates could be determined, in part by [corrected] comparing the sequences of structures having identical folds, but insignificant sequence homology. Recently, we have devised an experimental strategy to thoroughly explore residue substitutions consistent with a specific class of structure. Using this approach, the amino acids tolerated at virtually all residues of the c-Raf/Raf1 ras binding domain (Raf RBD), an exemplar of the common beta-grasp ubiquitin-like topology, were obtained and used to define the sequence determinants of this fold. Herein, we present analyses suggesting that more subtle sequence selection pressure, including propensity for secondary structure, the hydrophobic core organization and charge distribution are imposed on the Raf RBD sequence. Secondly, using the Gibbs free energies (DeltaG(F-U)) obtained for 51 mutants of Raf RBD, we demonstrate a strong correlation between amino acid conservation and the destabilization induced by truncating mutants. In addition, four mutants are shown to significantly stabilize Raf RBD native structure. Two of these mutations, including the well-studied R89L, are known to severely compromise binding affinity for ras. Another stabilized mutant consisted of a deletion of amino acid residues E104-K106. This deletion naturally occurs in the homologues a-Raf and b-Raf and could indicate functional divergence. Finally, the combination of mutations affecting five of 78 residues of Raf RBD results in stabilization of the structure by approximately 12 kJ mol(-1) (DeltaG(F-U) is -22 and -34 kJ mol(-1) for wt and mutant, respectively). The sequence perturbation approach combined with sequence/structure analysis of the ubiquitin-like fold provide a basis for the identification of sequence-specific requirements for function, stability and folding rate of the Raf RBD and structural analogues, highlighting the utility of conservation profiles as predictive tools of structural organization.
Collapse
Affiliation(s)
- F-X Campbell-Valois
- Département de Biochimie, Université de Montréal, C.P. 6128, Succ. centre-ville, Montréal, Québec, Canada H3C 3J7
| | | | | |
Collapse
|
8
|
Vermeulen W, Van Troys M, Bourry D, Dewitte D, Rossenu S, Goethals M, Borremans FAM, Vandekerckhove J, Martins JC, Ampe C. Identification of the PXW sequence as a structural gatekeeper of the headpiece C-terminal subdomain fold. J Mol Biol 2006; 359:1277-92. [PMID: 16697408 DOI: 10.1016/j.jmb.2006.04.042] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2006] [Revised: 04/16/2006] [Accepted: 04/20/2006] [Indexed: 11/22/2022]
Abstract
The HeadPiece (HP) domain, present in several F-actin-binding multi-domain proteins, features a well-conserved, solvent-exposed PXWK motif in its C-terminal subdomain. The latter is an autonomously folding subunit comprised of three alpha-helices organised around a hydrophobic core, with the sequence motif preceding the last helix. We report the contributions of each conserved residue in the PXWK motif to human villin HP function and structure, as well as the structural implications of the naturally occurring Pro to Ala mutation in dematin HP. NMR shift perturbation mapping reveals that substitution of each residue by Ala induces only minor, local perturbations in the full villin HP structure. CD spectroscopic thermal analysis, however, shows that the Pro and Trp residues in the PXWK motif afford stabilising interactions. This indicates that, in addition to the residues in the hydrophobic core, the Trp-Pro stacking within the motif contributes to HP stability. This is reinforced by our data on isolated C-terminal HP subdomains where the Pro is also essential for structure formation, since the villin, but not the dematin, C-terminal subdomain is structured. Proper folding can be induced in the dematin C-terminal subdomain by exchanging the Ala for Pro. Conversely, the reverse substitution in the villin C-terminal subdomain leads to loss of structure. Thus, we demonstrate a crucial role for this proline residue in structural stability and folding potential of HP (sub)domains consistent with Pro-Trp stacking as a more general determinant of protein stability.
Collapse
Affiliation(s)
- Wim Vermeulen
- NMR and Structure Analysis Unit, Department of Organic Chemistry, Faculty of Sciences, Ghent University, Belgium
| | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Abstract
Naturally occurring proteins comprise a special subset of all plausible sequences and structures selected through evolution. Simulating protein evolution with simplified and all-atom models has shed light on the evolutionary dynamics of protein populations, the nature of evolved sequences and structures, and the extent to which today's proteins are shaped by selection pressures on folding, structure and function. Extensive mapping of the native structure, stability and folding rate in sequence space using lattice proteins has revealed organizational principles of the sequence/structure map important for evolutionary dynamics. Evolutionary simulations with lattice proteins have highlighted the importance of fitness landscapes, evolutionary mechanisms, population dynamics and sequence space entropy in shaping the generic properties of proteins. Finally, evolutionary-like simulations with all-atom models, in particular computational protein design, have helped identify the dominant selection pressures on naturally occurring protein sequences and structures.
Collapse
Affiliation(s)
- Yu Xia
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Avenue, New Haven, CT 06520, USA
| | | |
Collapse
|
10
|
Ventura S, Zurdo J, Narayanan S, Parreño M, Mangues R, Reif B, Chiti F, Giannoni E, Dobson CM, Aviles FX, Serrano L. Short amino acid stretches can mediate amyloid formation in globular proteins: the Src homology 3 (SH3) case. Proc Natl Acad Sci U S A 2004; 101:7258-63. [PMID: 15123800 PMCID: PMC409906 DOI: 10.1073/pnas.0308249101] [Citation(s) in RCA: 202] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2003] [Indexed: 11/18/2022] Open
Abstract
Protein misfolding and deposition underlie an increasing number of debilitating human disorders. We have shown that model proteins unrelated to disease, such as the Src homology 3 (SH3) domain of the p58alpha subunit of bovine phosphatidyl-inositol-3'-kinase (PI3-SH3), can be converted in vitro into assemblies with structural and cytotoxic properties similar to those of pathological aggregates. By contrast, homologous proteins, such as alpha-spectrin-SH3, lack the capability of forming amyloid fibrils at a measurable rate under any of the conditions we have so far examined. However, transplanting a small sequence stretch (6 aa) from PI3-SH3 to alpha-spectrin-SH3, comprising residues of the diverging turn and adjacent RT loop, creates an amyloidogenic protein closely similar in its behavior to the original PI3-SH3. Analysis of specific PI3-SH3 mutants further confirms the involvement of this region in conferring amyloidogenic properties to this domain. Moreover, the inclusion in this stretch of two consensus residues favored in SH3 sequences substantially inhibits aggregation. These findings show that short specific amino acid stretches can act as mediators or facilitators in the incorporation of globular proteins into amyloid structures, and they support the suggestion that natural protein sequences have evolved in part to code for structural characteristics other than those included in the native fold, such as avoidance of aggregation.
Collapse
Affiliation(s)
- Salvador Ventura
- Institut de Biotecnologia i de Biomedicina and Departament de Bioquimica i Biologia Molecular, Universitat Autonoma de Barcelona, 08193 Bellaterra, Spain.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|