51
|
Lapidoth GD, Baran D, Pszolla GM, Norn C, Alon A, Tyka MD, Fleishman SJ. AbDesign: An algorithm for combinatorial backbone design guided by natural conformations and sequences. Proteins 2015; 83:1385-406. [PMID: 25670500 PMCID: PMC4881815 DOI: 10.1002/prot.24779] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 01/13/2015] [Accepted: 01/26/2015] [Indexed: 12/20/2022]
Abstract
Computational design of protein function has made substantial progress, generating new enzymes, binders, inhibitors, and nanomaterials not previously seen in nature. However, the ability to design new protein backbones for function--essential to exert control over all polypeptide degrees of freedom--remains a critical challenge. Most previous attempts to design new backbones computed the mainchain from scratch. Here, instead, we describe a combinatorial backbone and sequence optimization algorithm called AbDesign, which leverages the large number of sequences and experimentally determined molecular structures of antibodies to construct new antibody models, dock them against target surfaces and optimize their sequence and backbone conformation for high stability and binding affinity. We used the algorithm to produce antibody designs that target the same molecular surfaces as nine natural, high-affinity antibodies; in five cases interface sequence identity is above 30%, and in four of those the backbone conformation at the core of the antibody binding surface is within 1 Å root-mean square deviation from the natural antibodies. Designs recapitulate polar interaction networks observed in natural complexes, and amino acid sidechain rigidity at the designed binding surface, which is likely important for affinity and specificity, is high compared to previous design studies. In designed anti-lysozyme antibodies, complementarity-determining regions (CDRs) at the periphery of the interface, such as L1 and H2, show greater backbone conformation diversity than the CDRs at the core of the interface, and increase the binding surface area compared to the natural antibody, potentially enhancing affinity and specificity.
Collapse
Affiliation(s)
- Gideon D. Lapidoth
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Dror Baran
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Gabriele M. Pszolla
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Christoffer Norn
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Assaf Alon
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Michael D. Tyka
- Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA 94043
| | - Sarel J. Fleishman
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
52
|
Cheng CY, Chou FC, Kladwang W, Tian S, Cordero P, Das R. Consistent global structures of complex RNA states through multidimensional chemical mapping. eLife 2015; 4:e07600. [PMID: 26035425 PMCID: PMC4495719 DOI: 10.7554/elife.07600] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 06/02/2015] [Indexed: 11/13/2022] Open
Abstract
Accelerating discoveries of non-coding RNA (ncRNA) in myriad biological processes pose major challenges to structural and functional analysis. Despite progress in secondary structure modeling, high-throughput methods have generally failed to determine ncRNA tertiary structures, even at the 1-nm resolution that enables visualization of how helices and functional motifs are positioned in three dimensions. We report that integrating a new method called MOHCA-seq (Multiplexed •OHCleavage Analysis with paired-end sequencing) with mutate-and-map secondary structure inference guides Rosetta 3D modeling to consistent 1-nm accuracy for intricately folded ncRNAs with lengths up to 188 nucleotides, including a blind RNA-puzzle challenge, the lariat-capping ribozyme. This multidimensional chemical mapping (MCM) pipeline resolves unexpected tertiary proximities for cyclic-di-GMP, glycine, and adenosylcobalamin riboswitch aptamers without their ligands and a loose structure for the recently discovered human HoxA9D internal ribosome entry site regulon. MCM offers a sequencing-based route to uncovering ncRNA 3D structure, applicable to functionally important but potentially heterogeneous states. DOI:http://dx.doi.org/10.7554/eLife.07600.001 Our genetic material, in the form of molecules of DNA, provides instructions for many different processes in our cells. To issue these instructions, particular sections of DNA are copied to make a type of molecule called ribonucleic acid (RNA). Some of these RNA molecules contain instructions to make proteins, but others—known as non-coding RNAs—regulate the activity of genes in cells. The genetic information within RNA is encoded by the sequence of four different chemical parts called ‘nucleotides’. RNA can exist as a single strand of nucleotides, but the nucleotides can also pair up in specific combinations to form sections of double-stranded RNA. Therefore, a single strand of non-coding RNA can fold into a complex three-dimensional shape that contains loops, twists, and bulges. The three-dimensional structures of non-coding RNAs are crucial for their roles in cells, but the variety and complexity of shapes that they can form makes it technically difficult to study them. In 2008, researchers developed a new method called MOHCA that can map the positions of nucleotides that are close together in the three-dimensional structure. Highly reactive chemicals are attached to the nucleotides and these can react with, and damage, other nearby nucleotides. By detecting which nucleotides have been damaged, it is possible to map the positions of these nucleotides and decipher the structure of the RNA molecule using computer algorithms. MOHCA is a promising approach, but the initial methods to find the damaged nucleotides were tedious and required specialized equipment. Now, Cheng, Das et al.—including some of the researchers involved in the 2008 work—have developed an improved version of MOHCA that uses readily available RNA sequencing techniques to find the damaged nucleotides. The RNA sequencing data are then analyzed by a new algorithm in the Rosetta computer modeling software. Cheng, Das et al. used this newly developed ‘MOHCA-seq’ and Rosetta to reveal the structures of a human non-coding RNA and several other non-coding RNA molecules to a much higher level of detail than before. Together, MOHCA-seq and Rosetta provide a rapid method for researchers to decipher the three-dimensional structure of non-coding RNAs. This method is likely to speed up the analysis of the complex structures of non-coding RNAs. It will be useful in future efforts to work out what roles these RNAs play in cells, including their activity in cancer, neurodegeneration, and other diseases. DOI:http://dx.doi.org/10.7554/eLife.07600.002
Collapse
Affiliation(s)
- Clarence Yu Cheng
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Fang-Chieh Chou
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Wipapat Kladwang
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Siqi Tian
- Department of Biochemistry, Stanford University, Stanford, United States
| | - Pablo Cordero
- Biomedical Informatics Program, Stanford University, Stanford, United States
| | - Rhiju Das
- Department of Biochemistry, Stanford University, Stanford, United States
| |
Collapse
|
53
|
Wang Y, Barth P. Evolutionary-guided de novo structure prediction of self-associated transmembrane helical proteins with near-atomic accuracy. Nat Commun 2015; 6:7196. [PMID: 25995083 DOI: 10.1038/ncomms8196] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 04/15/2015] [Indexed: 11/09/2022] Open
Abstract
How specific protein associations regulate the function of membrane receptors remains poorly understood. Conformational flexibility currently hinders the structure determination of several classes of membrane receptors and associated oligomers. Here we develop EFDOCK-TM, a general method to predict self-associated transmembrane protein helical (TMH) structures from sequence guided by co-evolutionary information. We show that accurate intermolecular contacts can be identified using a combination of protein sequence covariation and TMH binding surfaces predicted from sequence. When applied to diverse TMH oligomers, including receptors characterized in multiple conformational and functional states, the method reaches unprecedented near-atomic accuracy for most targets. Blind predictions of structurally uncharacterized receptor tyrosine kinase TMH oligomers provide a plausible hypothesis on the molecular mechanisms of disease-associated point mutations and binding surfaces for the rational design of selective inhibitors. The method sets the stage for uncovering novel determinants of molecular recognition and signalling in single-spanning eukaryotic membrane receptors.
Collapse
Affiliation(s)
- Y Wang
- Structural and Computational Biology and Molecular Biophysics Graduate Program, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| | - P Barth
- 1] Structural and Computational Biology and Molecular Biophysics Graduate Program, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA [2] Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA [3] Department of Pharmacology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA
| |
Collapse
|
54
|
Abstract
This review paper discusses the reciprocal kinetic behaviours of enzymes and the evolution of structure–function dichotomy. Kinetic mechanisms have evolved in response to alterations in ecological and metabolic conditions. The kinetic mechanisms of single-substrate mono-substrate enzyme reactions are easier to understand and much simpler than those of bi–bi substrate enzyme reactions. The increasing complexities of kinetic mechanisms, as well as the increasing number of enzyme subunits, can be used to shed light on the evolution of kinetic mechanisms. Enzymes with heterogeneous kinetic mechanisms attempt to achieve specific products to subsist. In many organisms, kinetic mechanisms have evolved to aid survival in response to changing environmental factors. Enzyme promiscuity is defined as adaptation to changing environmental conditions, such as the introduction of a toxin or a new carbon source. Enzyme promiscuity is defined as adaptation to changing environmental conditions, such as the introduction of a toxin or a new carbon source. Enzymes with broad substrate specificity and promiscuous properties are believed to be more evolved than single-substrate enzymes. This group of enzymes can adapt to changing environmental substrate conditions and adjust catalysing mechanisms according to the substrate’s properties, and their kinetic mechanisms have evolved in response to substrate variability.
Collapse
Affiliation(s)
- Nuriye Nuray Ulusu
- School of Medicine, Koç University, Rumelifeneri yolu, Sarıyer, Istanbul, Turkey,
| |
Collapse
|
55
|
Pilla KB, Leman JK, Otting G, Huber T. Capturing conformational States in proteins using sparse paramagnetic NMR data. PLoS One 2015; 10:e0127053. [PMID: 25992808 PMCID: PMC4436263 DOI: 10.1371/journal.pone.0127053] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2015] [Accepted: 04/10/2015] [Indexed: 12/20/2022] Open
Abstract
Capturing conformational changes in proteins or protein-protein complexes is a challenge for both experimentalists and computational biologists. Solution nuclear magnetic resonance (NMR) is unique in that it permits structural studies of proteins under greatly varying conditions, and thus allows us to monitor induced structural changes. Paramagnetic effects are increasingly used to study protein structures as they give ready access to rich structural information of orientation and long-range distance restraints from the NMR signals of backbone amides, and reliable methods have become available to tag proteins with paramagnetic metal ions site-specifically and at multiple sites. In this study, we show how sparse pseudocontact shift (PCS) data can be used to computationally model conformational states in a protein system, by first identifying core structural elements that are not affected by the environmental change, and then computationally completing the remaining structure based on experimental restraints from PCS. The approach is demonstrated on a 27 kDa two-domain NS2B-NS3 protease system of the dengue virus serotype 2, for which distinct closed and open conformational states have been observed in crystal structures. By changing the input PCS data, the observed conformational states in the dengue virus protease are reproduced without modifying the computational procedure. This data driven Rosetta protocol enables identification of conformational states of a protein system, which are otherwise difficult to obtain either experimentally or computationally.
Collapse
Affiliation(s)
- Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Julia Koehler Leman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, United States of America
| | - Gottfried Otting
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
- * E-mail:
| |
Collapse
|
56
|
A compact native 24-residue supersecondary structure derived from the villin headpiece subdomain. Biophys J 2015; 108:678-86. [PMID: 25650934 DOI: 10.1016/j.bpj.2014.11.3482] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Revised: 10/23/2014] [Accepted: 11/20/2014] [Indexed: 11/22/2022] Open
Abstract
Many small proteins fold highly cooperatively in an all-or-none fashion and thus their native states are well protected from thermal fluctuations by an extensive network of interactions across the folded structure. Because protein structures are stabilized by local and nonlocal interactions among distal residues, dissecting individual substructures from the context of folded proteins results in large destabilization and loss of unique three-dimensional structure. Thus, mini-protein substructures can only rarely be derived from natural templates. Here, we describe a compact native 24-residues-long supersecondary structure derived from the hyperstable villin headpiece subdomain consisting of helices 2 and 3 (HP24). Using a combination of experimental techniques, including NMR and small-angle x-ray scattering, as well as all-atom replica exchange molecular-dynamics simulations, we show that a variant with stabilizing substitutions (HP24stab) forms a densely packed and compact conformation. In HP24stab, interactions between helices 2 and 3 are similar to those observed in native folded HP35, and the two helices cooperatively stabilize each other by completing the hydrophobic core lining the central part of HP35. Interestingly, even though the HP24wt fragment shows a more expanded and less structured conformation, NMR and simulations demonstrate a preference for a native-like topology. Thus, the two stabilizing residues in HP24stab shift the energy balance toward the native state, leading to a minimal folding motif.
Collapse
|
57
|
Warszawski S, Netzer R, Tawfik DS, Fleishman SJ. A "fuzzy"-logic language for encoding multiple physical traits in biomolecules. J Mol Biol 2014; 426:4125-4138. [PMID: 25311857 PMCID: PMC4270444 DOI: 10.1016/j.jmb.2014.10.002] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Revised: 09/21/2014] [Accepted: 10/02/2014] [Indexed: 12/16/2022]
Abstract
To carry out their activities, biological macromolecules balance different physical traits, such as stability, interaction affinity, and selectivity. How such often opposing traits are encoded in a macromolecular system is critical to our understanding of evolutionary processes and ability to design new molecules with desired functions. We present a framework for constraining design simulations to balance different physical characteristics. Each trait is represented by the equilibrium fractional occupancy of the desired state relative to its alternatives, ranging from none to full occupancy, and the different traits are combined using Boolean operators to effect a "fuzzy"-logic language for encoding any combination of traits. In another paper, we presented a new combinatorial backbone design algorithm AbDesign where the fuzzy-logic framework was used to optimize protein backbones and sequences for both stability and binding affinity in antibody-design simulation. We now extend this framework and find that fuzzy-logic design simulations reproduce sequence and structure design principles seen in nature to underlie exquisite specificity on the one hand and multispecificity on the other hand. The fuzzy-logic language is broadly applicable and could help define the space of tolerated and beneficial mutations in natural biomolecular systems and design artificial molecules that encode complex characteristics.
Collapse
Affiliation(s)
- Shira Warszawski
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Ravit Netzer
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Dan S Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sarel J Fleishman
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel.
| |
Collapse
|
58
|
Intrinsic disorder as a generalizable strategy for the rational design of highly responsive, allosterically cooperative receptors. Proc Natl Acad Sci U S A 2014; 111:15048-53. [PMID: 25288724 DOI: 10.1073/pnas.1410796111] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Control over the sensitivity with which biomolecular receptors respond to small changes in the concentration of their target ligand is critical for the proper function of many cellular processes. Such control could likewise be of utility in artificial biotechnologies, such as biosensors, genetic logic gates, and "smart" materials, in which highly responsive behavior is of value. In nature, the control of molecular responsiveness is often achieved using "Hill-type" cooperativity, a mechanism in which sequential binding events on a multivalent receptor are coupled such that the first enhances the affinity of the next, producing a steep, higher-order dependence on target concentration. Here, we use an intrinsic-disorder-based mechanism that can be implemented without requiring detailed structural knowledge to rationally introduce this potentially useful property into several normally noncooperative biomolecules. To do so, we fabricate a tandem repeat of the receptor that is destabilized (unfolded) via the introduction of a long, unstructured loop. The first binding event requires the energetically unfavorable closing of this loop, reducing its affinity relative to that of the second binding event, which, in contrast occurs at a preformed site. Using this approach, we have rationally introduced cooperativity into three unrelated DNA aptamers, achieving in the best of these a Hill coefficient experimentally indistinguishable from the theoretically expected maximum. The extent of cooperativity and thus the steepness of the binding transition are, moreover, well modeled as simple functions of the energetic cost of binding-induced folding, speaking to the quantitative nature of this design strategy.
Collapse
|
59
|
Nunes-Alves A, Arantes GM. Ligand-receptor affinities computed by an adapted linear interaction model for continuum electrostatics and by protein conformational averaging. J Chem Inf Model 2014; 54:2309-19. [PMID: 25076043 DOI: 10.1021/ci500301s] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Accurate calculations of free energies involved in small-molecule binding to a receptor are challenging. Interactions between ligand, receptor, and solvent molecules have to be described precisely, and a large number of conformational microstates has to be sampled, particularly for ligand binding to a flexible protein. Linear interaction energy models are computationally efficient methods that have found considerable success in the prediction of binding free energies. Here, we parametrize a linear interaction model for implicit solvation with coefficients adapted by ligand and binding site relative polarities in order to predict ligand binding free energies. Results obtained for a diverse series of ligands suggest that the model has good predictive power and transferability. We also apply implicit ligand theory and propose approximations to average contributions of multiple ligand-receptor poses built from a protein conformational ensemble and find that exponential averages require proper energy discrimination between plausible binding poses and false-positives (i.e., decoys). The linear interaction model and the averaging procedures presented can be applied independently of each other and of the method used to obtain the receptor structural representation.
Collapse
Affiliation(s)
- Ariane Nunes-Alves
- Department of Biochemistry, Instituto de Química, Universidade de São Paulo , Av. Prof. Lineu Prestes 748, 05508-900, São Paulo, SP, Brazil
| | | |
Collapse
|
60
|
Zhou AQ, Caballero D, O'Hern CS, Regan L. New insights into the interdependence between amino acid stereochemistry and protein structure. Biophys J 2014; 105:2403-11. [PMID: 24268152 DOI: 10.1016/j.bpj.2013.09.018] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2013] [Revised: 07/30/2013] [Accepted: 09/16/2013] [Indexed: 12/29/2022] Open
Abstract
To successfully design new proteins and understand the effects of mutations in natural proteins, we must understand the geometric and physicochemical principles underlying protein structure. The side chains of amino acids in peptides and proteins adopt specific dihedral angle combinations; however, we still do not have a fundamental quantitative understanding of why some side-chain dihedral angle combinations are highly populated and others are not. Here we employ a hard-sphere plus stereochemical constraint model of dipeptide mimetics to enumerate the side-chain dihedral angles of leucine (Leu) and isoleucine (Ile), and identify those conformations that are sterically allowed versus those that are not as a function of the backbone dihedral angles ϕ and ψ. We compare our results with the observed distributions of side-chain dihedral angles in proteins of known structure. With the hard-sphere plus stereochemical constraint model, we obtain agreement between the model predictions and the observed side-chain dihedral angle distributions for Leu and Ile. These results quantify the extent to which local, geometrical constraints determine protein side-chain conformations.
Collapse
Affiliation(s)
- Alice Qinhua Zhou
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut; Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, Connecticut
| | | | | | | |
Collapse
|
61
|
Zhang C, Tang B, Wang Q, Lai L. Discovery of binding proteins for a protein target using protein-protein docking-based virtual screening. Proteins 2014; 82:2472-82. [PMID: 24854898 DOI: 10.1002/prot.24611] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Revised: 05/06/2014] [Accepted: 05/09/2014] [Indexed: 12/18/2022]
Abstract
Target structure-based virtual screening, which employs protein-small molecule docking to identify potential ligands, has been widely used in small-molecule drug discovery. In the present study, we used a protein-protein docking program to identify proteins that bind to a specific target protein. In the testing phase, an all-to-all protein-protein docking run on a large dataset was performed. The three-dimensional rigid docking program SDOCK was used to examine protein-protein docking on all protein pairs in the dataset. Both the binding affinity and features of the binding energy landscape were considered in the scoring function in order to distinguish positive binding pairs from negative binding pairs. Thus, the lowest docking score, the average Z-score, and convergency of the low-score solutions were incorporated in the analysis. The hybrid scoring function was optimized in the all-to-all docking test. The docking method and the hybrid scoring function were then used to screen for proteins that bind to tumor necrosis factor-α (TNFα), which is a well-known therapeutic target for rheumatoid arthritis and other autoimmune diseases. A protein library containing 677 proteins was used for the screen. Proteins with scores among the top 20% were further examined. Sixteen proteins from the top-ranking 67 proteins were selected for experimental study. Two of these proteins showed significant binding to TNFα in an in vitro binding study. The results of the present study demonstrate the power and potential application of protein-protein docking for the discovery of novel binding proteins for specific protein targets.
Collapse
Affiliation(s)
- Changsheng Zhang
- BNLMS, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, Peking-Tsinghua Center for Life Sciences at College of Chemistry and Molecular Engineering, Peking University, Beijing, 100871, China
| | | | | | | |
Collapse
|
62
|
Shrestha R, Zhang KYJ. Improving fragment quality for de novo structure prediction. Proteins 2014; 82:2240-52. [PMID: 24753351 DOI: 10.1002/prot.24587] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2014] [Revised: 04/03/2014] [Accepted: 04/15/2014] [Indexed: 11/08/2022]
Abstract
De novo structure prediction can be defined as a search in conformational space under the guidance of an energy function. The most successful de novo structure prediction methods, such as Rosetta, assemble the fragments from known structures to reduce the search space. Therefore, the fragment quality is an important factor in structure prediction. In our study, a method is proposed to generate a new set of fragments from the lowest energy de novo models. These fragments were subsequently used to predict the next-round of models. In a benchmark of 30 proteins, the new set of fragments showed better performance when used to predict de novo structures. The lowest energy model predicted using our method was closer to native structure than Rosetta for 22 proteins. Following a similar trend, the best model among top five lowest energy models predicted using our method was closer to native structure than Rosetta for 20 proteins. In addition, our experiment showed that the C-alpha root mean square deviation was improved from 5.99 to 5.03 Å on average compared to Rosetta when the lowest energy models were picked as the best predicted models.
Collapse
Affiliation(s)
- Rojan Shrestha
- Zhang Initiative Research Unit, Institute Laboratories, RIKEN, 2-1 Hirosawa, Wako, Saitama, 351-0198, Japan; Department of Computational Biology, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba, 277-0882, Japan
| | | |
Collapse
|
63
|
Baker D. Centenary Award and Sir Frederick Gowland Hopkins Memorial Lecture. Protein folding, structure prediction and design. Biochem Soc Trans 2014; 42:225-9. [PMID: 24646222 DOI: 10.1042/bst20130055] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
I describe how experimental studies of protein folding have led to advances in protein structure prediction and protein design. I describe the finding that protein sequences are not optimized for rapid folding, the contact order-protein folding rate correlation, the incorporation of experimental insights into protein folding into the Rosetta protein structure production methodology and the use of this methodology to determine structures from sparse experimental data. I then describe the inverse problem (protein design) and give an overview of recent work on designing proteins with new structures and functions. I also describe the contributions of the general public to these efforts through the Rosetta@home distributed computing project and the FoldIt interactive protein folding and design game.
Collapse
Affiliation(s)
- David Baker
- *Department of Biochemistry, University of Washington/HMMI, Seattle, WA 98195, U.S.A
| |
Collapse
|
64
|
Sripakdeevong P, Cevec M, Chang AT, Erat MC, Ziegeler M, Zhao Q, Fox GE, Gao X, Kennedy SD, Kierzek R, Nikonowicz EP, Schwalbe H, Sigel RKO, Turner DH, Das R. Structure determination of noncanonical RNA motifs guided by ¹H NMR chemical shifts. Nat Methods 2014; 11:413-6. [PMID: 24584194 PMCID: PMC3985481 DOI: 10.1038/nmeth.2876] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 01/06/2014] [Indexed: 12/31/2022]
Abstract
Structured noncoding RNAs underlie fundamental cellular processes, but determining their three-dimensional structures remains challenging. We demonstrate that integrating ¹H NMR chemical shift data with Rosetta de novo modeling can be used to consistently determine high-resolution RNA structures. On a benchmark set of 23 noncanonical RNA motifs, including 11 'blind' targets, chemical-shift Rosetta for RNA (CS-Rosetta-RNA) recovered experimental structures with high accuracy (0.6-2.0 Å all-heavy-atom r.m.s. deviation) in 18 cases.
Collapse
Affiliation(s)
| | - Mirko Cevec
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Andrew T Chang
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Michèle C Erat
- 1] Department of Biochemistry, University of Oxford, Oxford, UK. [2] Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Melanie Ziegeler
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Qin Zhao
- Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
| | - George E Fox
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Xiaolian Gao
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, USA
| | - Scott D Kennedy
- Department of Biochemistry and Biophysics, University of Rochester School of Medicine and Dentistry, Rochester, New York, USA
| | - Ryszard Kierzek
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Edward P Nikonowicz
- Department of Biochemistry and Cell Biology, Rice University, Houston, Texas, USA
| | - Harald Schwalbe
- Center for Biomolecular Magnetic Resonance, Institute for Organic Chemistry and Chemical Biology, Johann Wolfgang Goethe University Frankfurt, Frankfurt, Germany
| | - Roland K O Sigel
- Institute of Inorganic Chemistry, University of Zurich, Zurich, Switzerland
| | - Douglas H Turner
- Department of Chemistry, University of Rochester, Rochester, New York, USA
| | - Rhiju Das
- 1] Biophysics Program, Stanford University, Stanford, California, USA. [2] Department of Biochemistry, Stanford University, Stanford, California, USA. [3] Department of Physics, Stanford University, Stanford, California, USA
| |
Collapse
|
65
|
Jana B, Morcos F, Onuchic JN. From structure to function: the convergence of structure based models and co-evolutionary information. Phys Chem Chem Phys 2014; 16:6496-507. [PMID: 24603809 DOI: 10.1039/c3cp55275f] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Understanding protein folding and function is one of the most important problems in biological research. Energy landscape theory and the folding funnel concept have provided a framework to investigate the mechanisms associated to these processes. Since protein energy landscapes are in most cases minimally frustrated, structure based models (SMBs) have successfully determined the geometrical features associated with folding and functional transitions. However, structural information is limited, particularly with respect to different functional configurations. This is a major limitation for SBMs. Alternatively, statistical methods to study amino acid co-evolution provide information on residue-residue interactions useful for the study of structure and function. Here, we show how the combination of these two methods gives rise to a novel way to investigate the mechanisms associated with folding and function. We use this methodology to explore the mechanistic aspects of protein translocation in the integral membrane protease FtsH. Dual basin-SBM simulations using the open and closed state of this hexameric motor reveals a functionally important paddling motion in the catalytic cycle. We also find that Direct Coupling Analysis (DCA) predicts physical contacts between AAA and peptidase domains of the motor, which are crucial for the open to close transition. Our combined method, which uses structural information from the open state experimental structure and co-evolutionary couplings, suggests that this methodology can be used to explore the functional landscape of complex biological macromolecules previously inaccessible to methods dependent on experimental structural information. This efficient way to sample the conformational space of large systems creates a theoretical/computational framework capable of better characterizing the functional landscape in large biomolecular assemblies.
Collapse
Affiliation(s)
- Biman Jana
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005-1827, USA.
| | | | | |
Collapse
|
66
|
An accurate binding interaction model in de novo computational protein design of interactions: If you build it, they will bind. J Struct Biol 2014; 185:136-46. [DOI: 10.1016/j.jsb.2013.03.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2012] [Revised: 03/15/2013] [Accepted: 03/21/2013] [Indexed: 01/07/2023]
|
67
|
Computational design of protein–protein interactions. Curr Opin Struct Biol 2013; 23:903-10. [DOI: 10.1016/j.sbi.2013.08.003] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Revised: 08/06/2013] [Accepted: 08/08/2013] [Indexed: 11/23/2022]
|
68
|
Whittredge PB, Taraska JW. Bridging the spectral gap in fluorescent proteins through directed evolution. CHEMISTRY & BIOLOGY 2013; 20:1203-5. [PMID: 24210004 PMCID: PMC3856577 DOI: 10.1016/j.chembiol.2013.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Fluorescent proteins are used as noninvasive tags for protein trafficking, structure, and action. In this issue of Chemistry & Biology, Hoi and colleagues present a new optimized zFP538 yellow fluorescent protein called mPapaya1 that is bright, monomeric, and an excellent fusion partner for cellular proteins.
Collapse
Affiliation(s)
- Paul B. Whittredge
- Laboratory of Molecular Biophysics, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Justin W. Taraska
- Laboratory of Molecular Biophysics, National Heart Lung and Blood Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| |
Collapse
|
69
|
Das R. Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz. PLoS One 2013; 8:e74830. [PMID: 24204571 PMCID: PMC3804535 DOI: 10.1371/journal.pone.0074830] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 08/07/2013] [Indexed: 11/18/2022] Open
Abstract
Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. Current approaches, such as the protein local optimization protocol or kinematic inversion closure (KIC) Monte Carlo, involve stages that coarse-grain proteins, simplifying modeling but precluding a systematic search of all-atom configurations. This article introduces an alternative modeling strategy based on a ‘stepwise ansatz’, recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth ‘RNA-puzzle’ competition. These results establish all-atom enumeration as an unusually systematic approach to ab initio protein structure modeling that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic accuracy.
Collapse
Affiliation(s)
- Rhiju Das
- Departments of Biochemistry and Physics, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
70
|
Yan Z, Wang J. Optimizing scoring function of protein-nucleic acid interactions with both affinity and specificity. PLoS One 2013; 8:e74443. [PMID: 24098651 PMCID: PMC3787031 DOI: 10.1371/journal.pone.0074443] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2013] [Accepted: 08/02/2013] [Indexed: 12/14/2022] Open
Abstract
Protein-nucleic acid (protein-DNA and protein-RNA) recognition is fundamental to the regulation of gene expression. Determination of the structures of the protein-nucleic acid recognition and insight into their interactions at molecular level are vital to understanding the regulation function. Recently, quantitative computational approach has been becoming an alternative of experimental technique for predicting the structures and interactions of biomolecular recognition. However, the progress of protein-nucleic acid structure prediction, especially protein-RNA, is far behind that of the protein-ligand and protein-protein structure predictions due to the lack of reliable and accurate scoring function for quantifying the protein-nucleic acid interactions. In this work, we developed an accurate scoring function (named as SPA-PN, SPecificity and Affinity of the Protein-Nucleic acid interactions) for protein-nucleic acid interactions by incorporating both the specificity and affinity into the optimization strategy. Specificity and affinity are two requirements of highly efficient and specific biomolecular recognition. Previous quantitative descriptions of the biomolecular interactions considered the affinity, but often ignored the specificity owing to the challenge of specificity quantification. We applied our concept of intrinsic specificity to connect the conventional specificity, which circumvents the challenge of specificity quantification. In addition to the affinity optimization, we incorporated the quantified intrinsic specificity into the optimization strategy of SPA-PN. The testing results and comparisons with other scoring functions validated that SPA-PN performs well on both the prediction of binding affinity and identification of native conformation. In terms of its performance, SPA-PN can be widely used to predict the protein-nucleic acid structures and quantify their interactions.
Collapse
Affiliation(s)
- Zhiqiang Yan
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York, United States of America
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
| | - Jin Wang
- Department of Chemistry & Physics, State University of New York at Stony Brook, Stony Brook, New York, United States of America
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, China
| |
Collapse
|
71
|
Tinberg CE, Khare SD, Dou J, Doyle L, Nelson JW, Schena A, Jankowski W, Kalodimos CG, Johnsson K, Stoddard BL, Baker D. Computational design of ligand-binding proteins with high affinity and selectivity. Nature 2013; 501:212-216. [PMID: 24005320 DOI: 10.1038/nature12443] [Citation(s) in RCA: 304] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2013] [Accepted: 07/11/2013] [Indexed: 01/27/2023]
Abstract
The ability to design proteins with high affinity and selectivity for any given small molecule is a rigorous test of our understanding of the physiochemical principles that govern molecular recognition. Attempts to rationally design ligand-binding proteins have met with little success, however, and the computational design of protein-small-molecule interfaces remains an unsolved problem. Current approaches for designing ligand-binding proteins for medical and biotechnological uses rely on raising antibodies against a target antigen in immunized animals and/or performing laboratory-directed evolution of proteins with an existing low affinity for the desired ligand, neither of which allows complete control over the interactions involved in binding. Here we describe a general computational method for designing pre-organized and shape complementary small-molecule-binding sites, and use it to generate protein binders to the steroid digoxigenin (DIG). Of seventeen experimentally characterized designs, two bind DIG; the model of the higher affinity binder has the most energetically favourable and pre-organized interface in the design set. A comprehensive binding-fitness landscape of this design, generated by library selections and deep sequencing, was used to optimize its binding affinity to a picomolar level, and X-ray co-crystal structures of two variants show atomic-level agreement with the corresponding computational models. The optimized binder is selective for DIG over the related steroids digitoxigenin, progesterone and β-oestradiol, and this steroid binding preference can be reprogrammed by manipulation of explicitly designed hydrogen-bonding interactions. The computational design method presented here should enable the development of a new generation of biosensors, therapeutics and diagnostics.
Collapse
Affiliation(s)
- Christine E Tinberg
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Sagar D Khare
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Jiayi Dou
- Department of Bioengineering, University of Washington, Seattle, WA 98195, USA.,Graduate Program in Biological Physics, Structure, and Design, University of Washington, Seattle, WA 98195, USA
| | - Lindsey Doyle
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jorgen W Nelson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Alberto Schena
- Institute of Chemical Sciences and Engineering, Institute of Bioengineering, National Centre of Competence in Research (NCCR) of Chemical Biology, École Polytechnique Fédérale de Laussane (EPFL), Laussane, Switzerland
| | - Wojciech Jankowski
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | | | - Kai Johnsson
- Institute of Chemical Sciences and Engineering, Institute of Bioengineering, National Centre of Competence in Research (NCCR) of Chemical Biology, École Polytechnique Fédérale de Laussane (EPFL), Laussane, Switzerland
| | - Barry L Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
72
|
van der Schot G, Zhang Z, Vernon R, Shen Y, Vranken WF, Baker D, Bonvin AMJJ, Lange OF. Improving 3D structure prediction from chemical shift data. JOURNAL OF BIOMOLECULAR NMR 2013; 57:27-35. [PMID: 23912841 DOI: 10.1007/s10858-013-9762-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 07/16/2013] [Indexed: 05/22/2023]
Abstract
We report advances in the calculation of protein structures from chemical shift nuclear magnetic resonance data alone. Our previously developed method, CS-Rosetta, assembles structures from a library of short protein fragments picked from a large library of protein structures using chemical shifts and sequence information. Here we demonstrate that combination of a new and improved fragment picker and the iterative sampling algorithm RASREC yield significant improvements in convergence and accuracy. Moreover, we introduce improved criteria for assessing the accuracy of the models produced by the method. The method was tested on 39 proteins in the 50-100 residue size range and yields reliable structures in 70 % of the cases. All structures that passed the reliability filter were accurate (<2 Å RMSD from the reference).
Collapse
Affiliation(s)
- Gijs van der Schot
- Computational Structural Biology, Bijvoet Center for Biomolecular Research, Faculty of Science-Chemistry, Utrecht University, Padualaan 8, 3584 CH, Utrecht, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
73
|
Zhang Z, Lange OF. Replica exchange improves sampling in low-resolution docking stage of RosettaDock. PLoS One 2013; 8:e72096. [PMID: 24009670 PMCID: PMC3756964 DOI: 10.1371/journal.pone.0072096] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Accepted: 07/10/2013] [Indexed: 11/18/2022] Open
Abstract
Many protein-protein docking protocols are based on a shotgun approach, in which thousands of independent random-start trajectories minimize the rigid-body degrees of freedom. Another strategy is enumerative sampling as used in ZDOCK. Here, we introduce an alternative strategy, ReplicaDock, using a small number of long trajectories of temperature replica exchange. We compare replica exchange sampling as low-resolution stage of RosettaDock with RosettaDock's original shotgun sampling as well as with ZDOCK. A benchmark of 30 complexes starting from structures of the unbound binding partners shows improved performance for ReplicaDock and ZDOCK when compared to shotgun sampling at equal or less computational expense. ReplicaDock and ZDOCK consistently reach lower energies and generate significantly more near-native conformations than shotgun sampling. Accordingly, they both improve typical metrics of prediction quality of complex structures after refinement. Additionally, the refined ReplicaDock ensembles reach significantly lower interface energies and many previously hidden features of the docking energy landscape become visible when ReplicaDock is applied.
Collapse
Affiliation(s)
- Zhe Zhang
- Biomolecular NMR and Munich Center for Integrated Protein Science, Department Chemie, Technische Universität München, Garching, Germany
| | - Oliver F. Lange
- Biomolecular NMR and Munich Center for Integrated Protein Science, Department Chemie, Technische Universität München, Garching, Germany
- Institute of Structural Biology, Helmholtz Zentrum München, Neuherberg, Germany
- * E-mail:
| |
Collapse
|
74
|
Moal IH, Moretti R, Baker D, Fernández-Recio J. Scoring functions for protein-protein interactions. Curr Opin Struct Biol 2013; 23:862-7. [PMID: 23871100 DOI: 10.1016/j.sbi.2013.06.017] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2013] [Revised: 06/26/2013] [Accepted: 06/29/2013] [Indexed: 12/24/2022]
Abstract
The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, C/ Jordi Girona 29, 08034 Barcelona, Spain
| | | | | | | |
Collapse
|
75
|
Nguyen ED, Norn C, Frimurer TM, Meiler J. Assessment and challenges of ligand docking into comparative models of G-protein coupled receptors. PLoS One 2013; 8:e67302. [PMID: 23844000 PMCID: PMC3699586 DOI: 10.1371/journal.pone.0067302] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2013] [Accepted: 05/16/2013] [Indexed: 01/09/2023] Open
Abstract
The rapidly increasing number of high-resolution X-ray structures of G-protein coupled receptors (GPCRs) creates a unique opportunity to employ comparative modeling and docking to provide valuable insight into the function and ligand binding determinants of novel receptors, to assist in virtual screening and to design and optimize drug candidates. However, low sequence identity between receptors, conformational flexibility, and chemical diversity of ligands present an enormous challenge to molecular modeling approaches. It is our hypothesis that rapid Monte-Carlo sampling of protein backbone and side-chain conformational space with Rosetta can be leveraged to meet this challenge. This study performs unbiased comparative modeling and docking methodologies using 14 distinct high-resolution GPCRs and proposes knowledge-based filtering methods for improvement of sampling performance and identification of correct ligand-receptor interactions. On average, top ranked receptor models built on template structures over 50% sequence identity are within 2.9 Å of the experimental structure, with an average root mean square deviation (RMSD) of 2.2 Å for the transmembrane region and 5 Å for the second extracellular loop. Furthermore, these models are consistently correlated with low Rosetta energy score. To predict their binding modes, ligand conformers of the 14 ligands co-crystalized with the GPCRs were docked against the top ranked comparative models. In contrast to the comparative models themselves, however, it remains difficult to unambiguously identify correct binding modes by score alone. On average, sampling performance was improved by 103 fold over random using knowledge-based and energy-based filters. In assessing the applicability of experimental constraints, we found that sampling performance is increased by one order of magnitude for every 10 residues known to contact the ligand. Additionally, in the case of DOR, knowledge of a single specific ligand-protein contact improved sampling efficiency 7 fold. These findings offer specific guidelines which may lead to increased success in determining receptor-ligand complexes.
Collapse
Affiliation(s)
- Elizabeth Dong Nguyen
- Center for Structural Biology, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Christoffer Norn
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Thomas M. Frimurer
- Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen, Copenhagen, Denmark
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Department of Pharmacology, Vanderbilt Program in Drug Discovery, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- Department of Chemistry and the Institute for Chemical Biology, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
76
|
Employing directed evolution for the functional analysis of multi-specific proteins. Bioorg Med Chem 2013; 21:3511-6. [DOI: 10.1016/j.bmc.2013.04.052] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2013] [Revised: 04/11/2013] [Accepted: 04/18/2013] [Indexed: 01/17/2023]
|
77
|
Stein A, Kortemme T. Improvements to robotics-inspired conformational sampling in rosetta. PLoS One 2013; 8:e63090. [PMID: 23704889 PMCID: PMC3660577 DOI: 10.1371/journal.pone.0063090] [Citation(s) in RCA: 140] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 03/28/2013] [Indexed: 02/04/2023] Open
Abstract
To accurately predict protein conformations in atomic detail, a computational method must be capable of sampling models sufficiently close to the native structure. All-atom sampling is difficult because of the vast number of possible conformations and extremely rugged energy landscapes. Here, we test three sampling strategies to address these difficulties: conformational diversification, intensification of torsion and omega-angle sampling and parameter annealing. We evaluate these strategies in the context of the robotics-based kinematic closure (KIC) method for local conformational sampling in Rosetta on an established benchmark set of 45 12-residue protein segments without regular secondary structure. We quantify performance as the fraction of sub-Angstrom models generated. While improvements with individual strategies are only modest, the combination of intensification and annealing strategies into a new “next-generation KIC” method yields a four-fold increase over standard KIC in the median percentage of sub-Angstrom models across the dataset. Such improvements enable progress on more difficult problems, as demonstrated on longer segments, several of which could not be accurately remodeled with previous methods. Given its improved sampling capability, next-generation KIC should allow advances in other applications such as local conformational remodeling of multiple segments simultaneously, flexible backbone sequence design, and development of more accurate energy functions.
Collapse
Affiliation(s)
- Amelie Stein
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (AS); (TK)
| | - Tanja Kortemme
- California Institute for Quantitative Biomedical Research and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, United States of America
- * E-mail: (AS); (TK)
| |
Collapse
|
78
|
Ollikainen N, Smith CA, Fraser JS, Kortemme T. Flexible backbone sampling methods to model and design protein alternative conformations. Methods Enzymol 2013; 523:61-85. [PMID: 23422426 DOI: 10.1016/b978-0-12-394292-0.00004-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Sampling alternative conformations is key to understanding how proteins work and engineering them for new functions. However, accurately characterizing and modeling protein conformational ensembles remain experimentally and computationally challenging. These challenges must be met before protein conformational heterogeneity can be exploited in protein engineering and design. Here, as a stepping stone, we describe methods to detect alternative conformations in proteins and strategies to model these near-native conformational changes based on backrub-type Monte Carlo moves in Rosetta. We illustrate how Rosetta simulations that apply backrub moves improve modeling of point mutant side-chain conformations, native side-chain conformational heterogeneity, functional conformational changes, tolerated sequence space, protein interaction specificity, and amino acid covariation across protein-protein interfaces. We include relevant Rosetta command lines and RosettaScripts to encourage the application of these types of simulations to other systems. Our work highlights that critical scoring and sampling improvements will be necessary to approximate conformational landscapes. Challenges for the future development of these methods include modeling conformational changes that propagate away from designed mutation sites and modulating backbone flexibility to predictively design functionally important conformational heterogeneity.
Collapse
Affiliation(s)
- Noah Ollikainen
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, USA
| | | | | | | |
Collapse
|
79
|
Yan Z, Zheng X, Wang E, Wang J. Thermodynamic and kinetic specificities of ligand binding. Chem Sci 2013. [DOI: 10.1039/c3sc50478f] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
|
80
|
Franconetti A, Jatunov S, Borrachero P, Gómez-Guillén M, Cabrera-Escribano F. Synthesis of cyclically constrained sugar derived α/β- and α/γ-peptides. Org Biomol Chem 2013; 11:676-86. [DOI: 10.1039/c2ob26992a] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
81
|
Whitehead TA, Baker D, Fleishman SJ. Computational Design of Novel Protein Binders and Experimental Affinity Maturation. Methods Enzymol 2013; 523:1-19. [DOI: 10.1016/b978-0-12-394292-0.00001-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
82
|
Emerging themes in the computational design of novel enzymes and protein-protein interfaces. FEBS Lett 2012; 587:1147-54. [DOI: 10.1016/j.febslet.2012.12.009] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2012] [Revised: 12/10/2012] [Accepted: 12/10/2012] [Indexed: 11/21/2022]
|
83
|
Abstract
High-throughput screening (HTS) of enzymatic activity is important for directed evolution-based enzyme engineering. However, substrate and product diffusion can severely compromise these HTS assays. In this issue of Chemistry & Biology, Kintses and coworkers describe a microfluidic platform for the directed evolution of enzymes in droplets that allows for the screening of 10(7) mutants per round of evolution.
Collapse
|
84
|
Kastritis PL, Bonvin AMJJ. On the binding affinity of macromolecular interactions: daring to ask why proteins interact. J R Soc Interface 2012; 10:20120835. [PMID: 23235262 PMCID: PMC3565702 DOI: 10.1098/rsif.2012.0835] [Citation(s) in RCA: 276] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Interactions between proteins are orchestrated in a precise and time-dependent manner, underlying cellular function. The binding affinity, defined as the strength of these interactions, is translated into physico-chemical terms in the dissociation constant (Kd), the latter being an experimental measure that determines whether an interaction will be formed in solution or not. Predicting binding affinity from structural models has been a matter of active research for more than 40 years because of its fundamental role in drug development. However, all available approaches are incapable of predicting the binding affinity of protein–protein complexes from coordinates alone. Here, we examine both theoretical and experimental limitations that complicate the derivation of structure–affinity relationships. Most work so far has concentrated on binary interactions. Systems of increased complexity are far from being understood. The main physico-chemical measure that relates to binding affinity is the buried surface area, but it does not hold for flexible complexes. For the latter, there must be a significant entropic contribution that will have to be approximated in the future. We foresee that any theoretical modelling of these interactions will have to follow an integrative approach considering the biology, chemistry and physics that underlie protein–protein recognition.
Collapse
Affiliation(s)
- Panagiotis L Kastritis
- Bijvoet Center for Biomolecular Research, Faculty of Science, Chemistry, Utrecht University, , Padualaan 8, Utrecht, The Netherlands
| | | |
Collapse
|
85
|
Levy ED, De S, Teichmann SA. Cellular crowding imposes global constraints on the chemistry and evolution of proteomes. Proc Natl Acad Sci U S A 2012; 109:20461-6. [PMID: 23184996 PMCID: PMC3528536 DOI: 10.1073/pnas.1209312109] [Citation(s) in RCA: 124] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In living cells, functional protein-protein interactions compete with a much larger number of nonfunctional, or promiscuous, interactions. Several cellular properties contribute to avoiding unwanted protein interactions, including regulation of gene expression, cellular compartmentalization, and high specificity and affinity of functional interactions. Here we investigate whether other mechanisms exist that shape the sequence and structure of proteins to favor their correct assembly into functional protein complexes. To examine this question, we project evolutionary and cellular abundance information onto 397, 196, and 631 proteins of known 3D structure from Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens, respectively. On the basis of amino acid frequencies in interface patches versus the solvent-accessible protein surface, we define a propensity or "stickiness" scale for each of the 20 amino acids. We find that the propensity to interact in a nonspecific manner is inversely correlated with abundance. In other words, high abundance proteins have less sticky surfaces. We also find that stickiness constrains protein evolution, whereby residues in sticky surface patches are more conserved than those found in nonsticky patches. Finally, we find that the constraint imposed by stickiness on protein divergence is proportional to protein abundance, which provides mechanistic insights into the correlation between protein conservation and protein abundance. Overall, the avoidance of nonfunctional interactions significantly influences the physico-chemical and evolutionary properties of proteins. Remarkably, the effects observed are consistently larger in E. coli and S. cerevisiae than in H. sapiens, suggesting that promiscuous protein-protein interactions may be freer to accumulate in the human lineage.
Collapse
Affiliation(s)
- Emmanuel D. Levy
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
- Département de Biochimie, Université de Montréal, Montréal, QC, Canada H3T 1J4
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Subhajyoti De
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045; and
- Molecular Oncology Program, University of Colorado Cancer Center, Aurora, CO 80045
| | - Sarah A. Teichmann
- Medical Research Council Laboratory of Molecular Biology, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
86
|
Sandler I, Abu-Qarn M, Aharoni A. Protein co-evolution: how do we combine bioinformatics and experimental approaches? MOLECULAR BIOSYSTEMS 2012; 9:175-81. [PMID: 23151606 DOI: 10.1039/c2mb25317h] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Molecular co-evolution is manifested by compensatory changes in proteins designed to enable adaptation to their natural environment. In recent years, bioinformatics approaches allowed for the detection of co-evolution at the level of the whole protein or of specific residues. Such efforts enabled prediction of protein-protein interactions, functional assignments of proteins and the identification of interacting residues, thereby providing information on protein structure. Still, despite such advances, relatively little is known regarding the functional implications of sequence divergence resulting from protein co-evolution. While bioinformatics approaches usually analyze thousands of proteins to obtain a broad view of protein co-evolution, experimental evaluation of protein co-evolution serves to study only individual proteins. In this review, we describe recent advances in bioinformatics and experimental efforts aimed at examining protein co-evolution. Accordingly, we discuss possible modes of crosstalk between the bioinformatics and experimental approaches to facilitate the identification of co-evolutionary signals in proteins and to understand their implications for the structure and function of proteins.
Collapse
Affiliation(s)
- Inga Sandler
- Department of Life Sciences, Ben-Gurion University of the Negev, Be'er Sheva 84105, Israel
| | | | | |
Collapse
|
87
|
Moal IH, Fernández-Recio J. SKEMPI: a Structural Kinetic and Energetic database of Mutant Protein Interactions and its use in empirical models. ACTA ACUST UNITED AC 2012; 28:2600-7. [PMID: 22859501 DOI: 10.1093/bioinformatics/bts489] [Citation(s) in RCA: 179] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
MOTIVATION Empirical models for the prediction of how changes in sequence alter protein-protein binding kinetics and thermodynamics can garner insights into many aspects of molecular biology. However, such models require empirical training data and proper validation before they can be widely applied. Previous databases contained few stabilizing mutations and no discussion of their inherent biases or how this impacts model construction or validation. RESULTS We present SKEMPI, a database of 3047 binding free energy changes upon mutation assembled from the scientific literature, for protein-protein heterodimeric complexes with experimentally determined structures. This represents over four times more data than previously collected. Changes in 713 association and dissociation rates and 127 enthalpies and entropies were also recorded. The existence of biases towards specific mutations, residues, interfaces, proteins and protein families is discussed in the context of how the data can be used to construct predictive models. Finally, a cross-validation scheme is presented which is capable of estimating the efficacy of derived models on future data in which these biases are not present. AVAILABILITY The database is available online at http://life.bsc.es/pid/mutation_database/.
Collapse
Affiliation(s)
- Iain H Moal
- Joint BSC-IRB Research Program in Computational Biology, Life Science Department, Barcelona Supercomputing Center, Barcelona, Spain
| | | |
Collapse
|
88
|
Visualizing transient protein-folding intermediates by tryptophan-scanning mutagenesis. Nat Struct Mol Biol 2012; 19:731-6. [PMID: 22683996 DOI: 10.1038/nsmb.2322] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Accepted: 05/11/2012] [Indexed: 11/08/2022]
Abstract
To understand how proteins fold, assemble and function, it is necessary to characterize the structure and dynamics of each state they adopt during their lifetime. Experimental characterization of the transient states of proteins remains a major challenge because high-resolution structural techniques, including NMR and X-ray crystallography, cannot be directly applied to study short-lived protein states. To circumvent this limitation, we show that transient states during protein folding can be characterized by measuring the fluorescence of tryptophan residues, introduced at many solvent-exposed positions to determine whether each position is native-like, denatured-like or non-native-like in the intermediate state. We use this approach to characterize a late-folding-intermediate state of the small globular mammalian protein ubiquitin, and we show the presence of productive non-native interactions that suggest a 'flycatcher' mechanism of concerted binding and folding.
Collapse
|