251
|
Quan L, Lü Q, Li H, Xia X, Wu H. Improved packing of protein side chains with parallel ant colonies. BMC Bioinformatics 2014; 15 Suppl 12:S5. [PMID: 25474164 PMCID: PMC4251090 DOI: 10.1186/1471-2105-15-s12-s5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
INTRODUCTION The accurate packing of protein side chains is important for many computational biology problems, such as ab initio protein structure prediction, homology modelling, and protein design and ligand docking applications. Many of existing solutions are modelled as a computational optimisation problem. As well as the design of search algorithms, most solutions suffer from an inaccurate energy function for judging whether a prediction is good or bad. Even if the search has found the lowest energy, there is no certainty of obtaining the protein structures with correct side chains. METHODS We present a side-chain modelling method, pacoPacker, which uses a parallel ant colony optimisation strategy based on sharing a single pheromone matrix. This parallel approach combines different sources of energy functions and generates protein side-chain conformations with the lowest energies jointly determined by the various energy functions. We further optimised the selected rotamers to construct subrotamer by rotamer minimisation, which reasonably improved the discreteness of the rotamer library. RESULTS We focused on improving the accuracy of side-chain conformation prediction. For a testing set of 442 proteins, 87.19% of X1 and 77.11% of X12 angles were predicted correctly within 40° of the X-ray positions. We compared the accuracy of pacoPacker with state-of-the-art methods, such as CIS-RR and SCWRL4. We analysed the results from different perspectives, in terms of protein chain and individual residues. In this comprehensive benchmark testing, 51.5% of proteins within a length of 400 amino acids predicted by pacoPacker were superior to the results of CIS-RR and SCWRL4 simultaneously. Finally, we also showed the advantage of using the subrotamers strategy. All results confirmed that our parallel approach is competitive to state-of-the-art solutions for packing side chains. CONCLUSIONS This parallel approach combines various sources of searching intelligence and energy functions to pack protein side chains. It provides a frame-work for combining different inaccuracy/usefulness objective functions by designing parallel heuristic search algorithms.
Collapse
|
252
|
Dasmeh P, Serohijos AWR, Kepp KP, Shakhnovich EI. The influence of selection for protein stability on dN/dS estimations. Genome Biol Evol 2014; 6:2956-67. [PMID: 25355808 PMCID: PMC4224349 DOI: 10.1093/gbe/evu223] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Understanding the relative contributions of various evolutionary processes-purifying selection, neutral drift, and adaptation-is fundamental to evolutionary biology. A common metric to distinguish these processes is the ratio of nonsynonymous to synonymous substitutions (i.e., dN/dS) interpreted from the neutral theory as a null model. However, from biophysical considerations, mutations have non-negligible effects on the biophysical properties of proteins such as folding stability. In this work, we investigated how stability affects the rate of protein evolution in phylogenetic trees by using simulations that combine explicit protein sequences with associated stability changes. We first simulated myoglobin evolution in phylogenetic trees with a biophysically realistic approach that accounts for 3D structural information and estimates of changes in stability upon mutation. We then compared evolutionary rates inferred directly from simulation to those estimated using maximum-likelihood (ML) methods. We found that the dN/dS estimated by ML methods (ωML) is highly predictive of the per gene dN/dS inferred from the simulated phylogenetic trees. This agreement is strong in the regime of high stability where protein evolution is neutral. At low folding stabilities and under mutation-selection balance, we observe deviations from neutrality (per gene dN/dS > 1 and dN/dS < 1). We showed that although per gene dN/dS is robust to these deviations, ML tests for positive selection detect statistically significant per site dN/dS > 1. Altogether, we show how protein biophysics affects the dN/dS estimations and its subsequent interpretation. These results are important for improving the current approaches for detecting positive selection.
Collapse
Affiliation(s)
- Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University DTU Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark Present address: Max Planck Institute of Immunobiology and Epigenetics, Stübeweg, Freiburg, Germany
| | | | - Kasper P Kepp
- DTU Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
| | | |
Collapse
|
253
|
Muthu P, Chen HX, Lutz S. Redesigning human 2'-deoxycytidine kinase enantioselectivity for L-nucleoside analogues as reporters in positron emission tomography. ACS Chem Biol 2014; 9:2326-33. [PMID: 25079348 PMCID: PMC4201336 DOI: 10.1021/cb500463f] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
Recent advances in
nuclear medicine have allowed for positron emission
tomography (PET) to track transgenes in cell-based therapies using
PET reporter gene/probe pairs. A promising example for such reporter
gene/probe pairs are engineered nucleoside kinases that effectively
phosphorylate isotopically labeled nucleoside analogues. Upon expression
in target cells, the kinase facilitates the intracellular accumulation
of radionuclide monophosphate, which can be detected by PET imaging.
We have employed computational design for the semi-rational engineering
of human 2′-deoxycytidine kinase to create a reporter gene
with selectivity for l-nucleosides including l-thymidine
and 1-(2′-fluoro-5-methyl-β-l-arabinofuranosyl)
uracil. Our design strategy relied on a combination of preexisting
data from kinetic and structural studies of native kinases, as well
as two small, focused libraries of kinase variants to generate an in silico model for assessing the effects of single amino
acid changes on favorable activation of l-nucleosides over
their corresponding d-enantiomers. The approach identified
multiple amino acid positions distal to the active site that conferred
desired l-enantioselectivity. Recombination of individual
amino acid substitutions yielded orthogonal kinase variants with significantly
improved catalytic performance for unnatural l-nucleosides
but reduced activity for natural d-nucleosides.
Collapse
Affiliation(s)
- Pravin Muthu
- Department of Chemistry, Emory University, 1515 Dickey
Drive, Atlanta, Georgia 30322, United States
| | - Hannah X. Chen
- Department of Chemistry, Emory University, 1515 Dickey
Drive, Atlanta, Georgia 30322, United States
| | - Stefan Lutz
- Department of Chemistry, Emory University, 1515 Dickey
Drive, Atlanta, Georgia 30322, United States
| |
Collapse
|
254
|
Zhou Y, Xu W, Donald BR, Zeng J. An efficient parallel algorithm for accelerating computational protein design. ACTA ACUST UNITED AC 2014; 30:i255-i263. [PMID: 24931991 PMCID: PMC4058937 DOI: 10.1093/bioinformatics/btu264] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Motivation: Structure-based computational protein design (SCPR) is an important topic in protein engineering. Under the assumption of a rigid backbone and a finite set of discrete conformations of side-chains, various methods have been proposed to address this problem. A popular method is to combine the dead-end elimination (DEE) and A* tree search algorithms, which provably finds the global minimum energy conformation (GMEC) solution. Results: In this article, we improve the efficiency of computing A* heuristic functions for protein design and propose a variant of A* algorithm in which the search process can be performed on a single GPU in a massively parallel fashion. In addition, we make some efforts to address the memory exceeding problem in A* search. As a result, our enhancements can achieve a significant speedup of the A*-based protein design algorithm by four orders of magnitude on large-scale test data through pre-computation and parallelization, while still maintaining an acceptable memory overhead. We also show that our parallel A* search algorithm could be successfully combined with iMinDEE, a state-of-the-art DEE criterion, for rotamer pruning to further improve SCPR with the consideration of continuous side-chain flexibility. Availability: Our software is available and distributed open-source under the GNU Lesser General License Version 2.1 (GNU, February 1999). The source code can be downloaded from http://www.cs.duke.edu/donaldlab/osprey.php or http://iiis.tsinghua.edu.cn/∼compbio/software.html. Contact:zengjy321@tsinghua.edu.cn Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yichao Zhou
- Institute for Theoretical Computer Science (ITCS), Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, P. R. China, Department of Computer Science, Duke University, Durham, NC 27708, USA and Department of Biochemistry, Duke University Medical Center, Durham, NC 27708, USA
| | - Wei Xu
- Institute for Theoretical Computer Science (ITCS), Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, P. R. China, Department of Computer Science, Duke University, Durham, NC 27708, USA and Department of Biochemistry, Duke University Medical Center, Durham, NC 27708, USA
| | - Bruce R Donald
- Institute for Theoretical Computer Science (ITCS), Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, P. R. China, Department of Computer Science, Duke University, Durham, NC 27708, USA and Department of Biochemistry, Duke University Medical Center, Durham, NC 27708, USAInstitute for Theoretical Computer Science (ITCS), Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, P. R. China, Department of Computer Science, Duke University, Durham, NC 27708, USA and Department of Biochemistry, Duke University Medical Center, Durham, NC 27708, USA
| | - Jianyang Zeng
- Institute for Theoretical Computer Science (ITCS), Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, P. R. China, Department of Computer Science, Duke University, Durham, NC 27708, USA and Department of Biochemistry, Duke University Medical Center, Durham, NC 27708, USA
| |
Collapse
|
255
|
Gao M, London N, Cheng K, Tamura R, Jin J, Schueler-Furman O, Yin H. Rationally Designed Macrocyclic Peptides as Synergistic Agonists of LPS-Induced Inflammatory Response. Tetrahedron 2014; 70:7664-7668. [PMID: 25400297 DOI: 10.1016/j.tet.2014.07.026] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Toll-like receptor 4 (TLR4) plays an important role in the regulation of the innate and adaptive immune response. Both agonists and antagonists of TLR4 are of considerable interest as drug leads for various disease indications. We herein report the rational design of two myeloid differentiation factor 2 (MD2)-derived macrocyclic peptides as TLR4 modulators, using the Rosetta Macromolecular Modeling software. The designed cyclic peptides, but not their linear counterparts, displayed synergistic activation of TLR signaling when co-administered with lipopolysaccharide (LPS). Although the understanding of the mechanism of action of these peptides remains elusive; these results underscore the utility of peptide cyclization for the discovery of biologically active agents, and also lead to valuable tools for the investigation of TLR4 signaling.
Collapse
Affiliation(s)
- Meng Gao
- Center of Basic Molecular Science and Department of Chemistry, Tsinghua University, Beijing, China, 100082
| | - Nir London
- Department of Microbiology and Molecular Genetics, Institute for Medical Research Israel-Canada, Hadassah Medical School, The Hebrew University, POB 12272, Jerusalem, 91120 Israel
| | - Kui Cheng
- Department of Chemistry and Biochemistry, the BioFrontiers Institute, University of Colorado Boulder, Boulder, Colorado 80309-0596, USA
| | - Ryo Tamura
- Department of Chemistry and Biochemistry, the BioFrontiers Institute, University of Colorado Boulder, Boulder, Colorado 80309-0596, USA
| | - Jialin Jin
- Center of Basic Molecular Science and Department of Chemistry, Tsinghua University, Beijing, China, 100082
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Institute for Medical Research Israel-Canada, Hadassah Medical School, The Hebrew University, POB 12272, Jerusalem, 91120 Israel
| | - Hang Yin
- Center of Basic Molecular Science and Department of Chemistry, Tsinghua University, Beijing, China, 100082.,Department of Chemistry and Biochemistry, the BioFrontiers Institute, University of Colorado Boulder, Boulder, Colorado 80309-0596, USA
| |
Collapse
|
256
|
Gregoire S, Glitzos K, Kwon I. Suppressing mutation-induced protein aggregation in mammalian cells by mutating residues significantly displaced upon the original mutation. Biochem Eng J 2014; 91:196-203. [PMID: 26190933 DOI: 10.1016/j.bej.2014.08.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Mutations introduced to wild-type proteins naturally, or intentionally via protein engineering, often lead to protein aggregation. In particular, protein aggregation within mammalian cells has significant implications in the disease pathology and biologics production; making protein aggregation modulation within mammalian cells a very important engineering topic. Previously, we showed that the semi-rational design approach can be used to reduce the intracellular aggregation of a protein by recovering the conformational stability that was lowered by the mutation. However, this approach has limited utility when no rational design approach to enhance conformational stability is readily available. In order to overcome this limitation, we investigated whether the modification of residues significantly displaced upon the original mutation is an effective way to reduce protein aggregation in mammalian cells. As a model system, human copper, zinc superoxide dismutase mutant containing glycine to alanine mutation at position 93 (SOD1G93A) was used. A panel of mutations was introduced into residues substantially displaced upon the G93A mutation. By using cell-based aggregation assays, we identified several novel variants of SOD1G93A with reduced aggregation propensity within mammalian cells. Our findings successfully demonstrate that the aggregation of a mutant protein can be suppressed by mutating the residues significantly displaced upon the original mutation.
Collapse
Affiliation(s)
- Simpson Gregoire
- Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia 22904-4741, United States
| | - Kelly Glitzos
- Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia 22904-4741, United States
| | - Inchan Kwon
- Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia 22904-4741, United States ; School of Materials Science and Engineering, Gwangju Institute of Science and Technology (GIST), Gwangju 500-712, Republic of Korea
| |
Collapse
|
257
|
Peterson LX, Kang X, Kihara D. Assessment of protein side-chain conformation prediction methods in different residue environments. Proteins 2014; 82:1971-84. [PMID: 24619909 PMCID: PMC5007623 DOI: 10.1002/prot.24552] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Revised: 03/02/2014] [Accepted: 03/07/2014] [Indexed: 11/09/2022]
Abstract
Computational prediction of side-chain conformation is an important component of protein structure prediction. Accurate side-chain prediction is crucial for practical applications of protein structure models that need atomic-detailed resolution such as protein and ligand design. We evaluated the accuracy of eight side-chain prediction methods in reproducing the side-chain conformations of experimentally solved structures deposited to the Protein Data Bank. Prediction accuracy was evaluated for a total of four different structural environments (buried, surface, interface, and membrane-spanning) in three different protein types (monomeric, multimeric, and membrane). Overall, the highest accuracy was observed for buried residues in monomeric and multimeric proteins. Notably, side-chains at protein interfaces and membrane-spanning regions were better predicted than surface residues even though the methods did not all use multimeric and membrane proteins for training. Thus, we conclude that the current methods are as practically useful for modeling protein docking interfaces and membrane-spanning regions as for modeling monomers.
Collapse
Affiliation(s)
- Lenna X. Peterson
- Department of Biological Sciences, Purdue University, West Lafayette IN, 47907, USA
| | - Xuejiao Kang
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette IN, 47907, USA
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| |
Collapse
|
258
|
Zhang H, Li C, Yang F, Su J, Tan J, Zhang X, Wang C. Cation-pi interactions at non-redundant protein-RNA interfaces. BIOCHEMISTRY (MOSCOW) 2014; 79:643-52. [DOI: 10.1134/s0006297914070062] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
259
|
Allouche D, André I, Barbe S, Davies J, de Givry S, Katsirelos G, O'Sullivan B, Prestwich S, Schiex T, Traoré S. Computational protein design as an optimization problem. ARTIF INTELL 2014. [DOI: 10.1016/j.artint.2014.03.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
260
|
Li Z, Yang Y, Faraggi E, Zhan J, Zhou Y. Direct prediction of profiles of sequences compatible with a protein structure by neural networks with fragment-based local and energy-based nonlocal profiles. Proteins 2014; 82:2565-73. [PMID: 24898915 DOI: 10.1002/prot.24620] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2014] [Revised: 05/28/2014] [Accepted: 05/30/2014] [Indexed: 12/13/2022]
Abstract
Locating sequences compatible with a protein structural fold is the well-known inverse protein-folding problem. While significant progress has been made, the success rate of protein design remains low. As a result, a library of designed sequences or profile of sequences is currently employed for guiding experimental screening or directed evolution. Sequence profiles can be computationally predicted by iterative mutations of a random sequence to produce energy-optimized sequences, or by combining sequences of structurally similar fragments in a template library. The latter approach is computationally more efficient but yields less accurate profiles than the former because of lacking tertiary structural information. Here we present a method called SPIN that predicts Sequence Profiles by Integrated Neural network based on fragment-derived sequence profiles and structure-derived energy profiles. SPIN improves over the fragment-derived profile by 6.7% (from 23.6 to 30.3%) in sequence identity between predicted and wild-type sequences. The method also reduces the number of residues in low complex regions by 15.7% and has a significantly better balance of hydrophilic and hydrophobic residues at protein surface. The accuracy of sequence profiles obtained is comparable to those generated from the protein design program RosettaDesign 3.5. This highly efficient method for predicting sequence profiles from structures will be useful as a single-body scoring term for improving scoring functions used in protein design and fold recognition. It also complements protein design programs in guiding experimental design of the sequence library for screening and directed evolution of designed sequences. The SPIN server is available at http://sparks-lab.org.
Collapse
Affiliation(s)
- Zhixiu Li
- School of Informatics and Computing, Indiana University-Purdue University, Indianapolis, Indiana, 46202
| | | | | | | | | |
Collapse
|
261
|
Renfrew PD, Craven TW, Butterfoss G, Kirshenbaum K, Bonneau R. A rotamer library to enable modeling and design of peptoid foldamers. J Am Chem Soc 2014; 136:8772-82. [PMID: 24823488 PMCID: PMC4227732 DOI: 10.1021/ja503776z] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Indexed: 01/08/2023]
Abstract
Peptoids are a family of synthetic oligomers composed of N-substituted glycine units. Along with other "foldamer" systems, peptoid oligomer sequences can be predictably designed to form a variety of stable secondary structures. It is not yet evident if foldamer design can be extended to reliably create tertiary structure features that mimic more complex biomolecular folds and functions. Computational modeling and prediction of peptoid conformations will likely play a critical role in enabling complex biomimetic designs. We introduce a computational approach to provide accurate conformational and energetic parameters for peptoid side chains needed for successful modeling and design. We find that peptoids can be described by a "rotamer" treatment, similar to that established for proteins, in which the peptoid side chains display rotational isomerism to populate discrete regions of the conformational landscape. Because of the insufficient number of solved peptoid structures, we have calculated the relative energies of side-chain conformational states to provide a backbone-dependent (BBD) rotamer library for a set of 54 different peptoid side chains. We evaluated two rotamer library development methods that employ quantum mechanics (QM) and/or molecular mechanics (MM) energy calculations to identify side-chain rotamers. We show by comparison to experimental peptoid structures that both methods provide an accurate prediction of peptoid side chain placements in folded peptoid oligomers and at protein interfaces. We have incorporated our peptoid rotamer libraries into ROSETTA, a molecular design package previously validated in the context of protein design and structure prediction.
Collapse
Affiliation(s)
- P. Douglas Renfrew
- Center for Genomics and
Systems Biology, Department
of Biology, Department of Chemistry, and Courant Institute of Mathematical
Sciences, Computer Science Department, New
York University, New York, New York 10003, United States
| | - Timothy W. Craven
- Center for Genomics and
Systems Biology, Department
of Biology, Department of Chemistry, and Courant Institute of Mathematical
Sciences, Computer Science Department, New
York University, New York, New York 10003, United States
| | - Glenn
L. Butterfoss
- Center
for Genomics and Systems Biology, New York
University Abu Dhabi, Abu Dhabi, United Arab Emirates
| | - Kent Kirshenbaum
- Center for Genomics and
Systems Biology, Department
of Biology, Department of Chemistry, and Courant Institute of Mathematical
Sciences, Computer Science Department, New
York University, New York, New York 10003, United States
| | - Richard Bonneau
- Center for Genomics and
Systems Biology, Department
of Biology, Department of Chemistry, and Courant Institute of Mathematical
Sciences, Computer Science Department, New
York University, New York, New York 10003, United States
| |
Collapse
|
262
|
Dudek MJ. A detailed representation of electrostatic energy in prediction of sequence and pH dependence of protein stability. Proteins 2014; 82:2497-511. [DOI: 10.1002/prot.24613] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2014] [Revised: 05/11/2014] [Accepted: 05/15/2014] [Indexed: 11/05/2022]
Affiliation(s)
- Michael J. Dudek
- Protabit LLC; 250 S Oak Knoll Ave. #211 Pasadena California 91101
| |
Collapse
|
263
|
Sammond DW, Yarbrough JM, Mansfield E, Bomble YJ, Hobdey SE, Decker SR, Taylor LE, Resch MG, Bozell JJ, Himmel ME, Vinzant TB, Crowley MF. Predicting enzyme adsorption to lignin films by calculating enzyme surface hydrophobicity. J Biol Chem 2014; 289:20960-9. [PMID: 24876380 DOI: 10.1074/jbc.m114.573642] [Citation(s) in RCA: 76] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The inhibitory action of lignin on cellulase cocktails is a major challenge to the biological saccharification of plant cell wall polysaccharides. Although the mechanism remains unclear, hydrophobic interactions between enzymes and lignin are hypothesized to drive adsorption. Here we evaluate the role of hydrophobic interactions in enzyme-lignin binding. The hydrophobicity of the enzyme surface was quantified using an estimation of the clustering of nonpolar atoms, identifying potential interaction sites. The adsorption of enzymes to lignin surfaces, measured using the quartz crystal microbalance, correlates to the hydrophobic cluster scores. Further, these results suggest a minimum hydrophobic cluster size for a protein to preferentially adsorb to lignin. The impact of electrostatic contribution was ruled out by comparing the isoelectric point (pI) values to the adsorption of proteins to lignin surfaces. These results demonstrate the ability to predict enzyme-lignin adsorption and could potentially be used to design improved cellulase cocktails, thus lowering the overall cost of biofuel production.
Collapse
Affiliation(s)
| | | | - Elisabeth Mansfield
- the Applied Chemicals and Materials Division, National Institute for Standards and Technology, Boulder, Colorado 80305, and
| | | | | | | | | | - Michael G Resch
- From the Biosciences Center and National Bioenergy Center, National Renewable Energy Laboratory, Golden, Colorado 80401
| | - Joseph J Bozell
- the Center for Renewable Carbon, Center for the Catalytic Conversion of Biomass (C3Bio), University of Tennessee, Knoxville, Tennessee 37917
| | | | | | | |
Collapse
|
264
|
Accurate design of co-assembling multi-component protein nanomaterials. Nature 2014; 510:103-8. [PMID: 24870237 DOI: 10.1038/nature13404] [Citation(s) in RCA: 426] [Impact Index Per Article: 42.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 04/25/2014] [Indexed: 12/19/2022]
Abstract
The self-assembly of proteins into highly ordered nanoscale architectures is a hallmark of biological systems. The sophisticated functions of these molecular machines have inspired the development of methods to engineer self-assembling protein nanostructures; however, the design of multi-component protein nanomaterials with high accuracy remains an outstanding challenge. Here we report a computational method for designing protein nanomaterials in which multiple copies of two distinct subunits co-assemble into a specific architecture. We use the method to design five 24-subunit cage-like protein nanomaterials in two distinct symmetric architectures and experimentally demonstrate that their structures are in close agreement with the computational design models. The accuracy of the method and the number and variety of two-component materials that it makes accessible suggest a route to the construction of functional protein nanomaterials tailored to specific applications.
Collapse
|
265
|
Gaillard T, Simonson T. Pairwise decomposition of an MMGBSA energy function for computational protein design. J Comput Chem 2014; 35:1371-87. [PMID: 24854675 DOI: 10.1002/jcc.23637] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Revised: 04/14/2014] [Accepted: 05/01/2014] [Indexed: 02/02/2023]
Abstract
Computational protein design (CPD) aims at predicting new proteins or modifying existing ones. The computational challenge is huge as it requires exploring an enormous sequence and conformation space. The difficulty can be reduced by considering a fixed backbone and a discrete set of sidechain conformations. Another common strategy consists in precalculating a pairwise energy matrix, from which the energy of any sequence/conformation can be quickly obtained. In this work, we examine the pairwise decomposition of protein MMGBSA energy functions from a general theoretical perspective, and an implementation proposed earlier for CPD. It includes a Generalized Born term, whose many-body character is overcome using an effective dielectric environment, and a Surface Area term, for which we present an improved pairwise decomposition. A detailed evaluation of the error introduced by the decomposition on the different energy components is performed. We show that the error remains reasonable, compared to other uncertainties.
Collapse
Affiliation(s)
- Thomas Gaillard
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128, Palaiseau, France
| | | |
Collapse
|
266
|
Austin TM, Nannemann DP, Deluca SL, Meiler J, Delpire E. In silico analysis and experimental verification of OSR1 kinase - Peptide interaction. J Struct Biol 2014; 187:58-65. [PMID: 24821279 DOI: 10.1016/j.jsb.2014.05.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2013] [Revised: 04/25/2014] [Accepted: 05/04/2014] [Indexed: 10/25/2022]
Abstract
The oxidative-stress-responsive kinase 1 (OSR1) and the STE20/SPS1-related proline/alanine-rich kinase (SPAK) are key enzymes in a signaling cascade regulating the activity of Na(+)-K(+)-2Cl(-) cotransporters (NKCC1-2) and Na(+)-Cl(-) cotransporter (NCC). Both kinases have a conserved carboxyl-terminal (CCT) domain, which recognizes a unique peptide motif present in OSR1- and SPAK-activating kinases (with-no-lysine kinase 1 (WNK1) and WNK4) as well as their substrates (NKCC1, NKCC2, and NCC). Utilizing various modalities of the Rosetta Molecular Modeling Software Suite including flexible peptide docking and protein design, we comprehensively explored the sequence space recognized by the CCT domain. Specifically, we studied single residue mutations as well as complete unbiased designs of a hexapeptide substrate. The computational study started from a crystal structure of the CCT domain of OSR1 in complex with a hexapeptide derived from WNK4. Point mutations predicted to be favorable include Arg to His or Trp substitutions at position 2 and a Phe to Tyr substitution at position 3 of the hexapeptide. In addition, de novo design yielded two peptides predicted to bind to the CCT domain: FRFQVT and TRFDVT. These results, which indicate a little bit more freedom in the composition of the peptide, were confirmed through the use of yeast two-hybrid screening.
Collapse
Affiliation(s)
- Thomas M Austin
- Department of Anesthesiology, Vanderbilt University School of Medicine, Nashville, TN 37232, United States
| | - David P Nannemann
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
| | - Samuel L Deluca
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States; Departments of Pharmacology and Bioinformatics, Vanderbilt University, Nashville, TN, United States
| | - Eric Delpire
- Department of Anesthesiology, Vanderbilt University School of Medicine, Nashville, TN 37232, United States.
| |
Collapse
|
267
|
Jensen JH, Willemoës M, Winther JR, De Vico L. In silico prediction of mutant HIV-1 proteases cleaving a target sequence. PLoS One 2014; 9:e95833. [PMID: 24796579 PMCID: PMC4010418 DOI: 10.1371/journal.pone.0095833] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2013] [Accepted: 03/31/2014] [Indexed: 11/17/2022] Open
Abstract
HIV-1 protease represents an appealing system for directed enzyme re-design, since it has various different endogenous targets, a relatively simple structure and it is well studied. Recently Chaudhury and Gray (Structure (2009) 17: 1636–1648) published a computational algorithm to discern the specificity determining residues of HIV-1 protease. In this paper we present two computational tools aimed at re-designing HIV-1 protease, derived from the algorithm of Chaudhuri and Gray. First, we present an energy-only based methodology to discriminate cleavable and non cleavable peptides for HIV-1 proteases, both wild type and mutant. Secondly, we show an algorithm we developed to predict mutant HIV-1 proteases capable of cleaving a new target substrate peptide, different from the natural targets of HIV-1 protease. The obtained in silico mutant enzymes were analyzed in terms of cleavability and specificity towards the target peptide using the energy-only methodology. We found two mutant proteases as best candidates for specificity and cleavability towards the target sequence.
Collapse
Affiliation(s)
- Jan H Jensen
- Department of Chemistry, University of Copenhagen, Copenhagen, Denmark
| | - Martin Willemoës
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Jakob R Winther
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Luca De Vico
- Department of Chemistry, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
268
|
Smadbeck J, Peterson MB, Zee BM, Garapaty S, Mago A, Lee C, Giannis A, Trojer P, Garcia BA, Floudas CA. De novo peptide design and experimental validation of histone methyltransferase inhibitors. PLoS One 2014; 9:e90095. [PMID: 24587223 PMCID: PMC3938834 DOI: 10.1371/journal.pone.0090095] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 01/30/2014] [Indexed: 11/18/2022] Open
Abstract
Histones are small proteins critical to the efficient packaging of DNA in the nucleus. DNA–protein complexes, known as nucleosomes, are formed when the DNA winds itself around the surface of the histones. The methylation of histone residues by enhancer of zeste homolog 2 (EZH2) maintains gene repression over successive cell generations. Overexpression of EZH2 can silence important tumor suppressor genes leading to increased invasiveness of many types of cancers. This makes the inhibition of EZH2 an important target in the development of cancer therapeutics. We employed a three-stage computational de novo peptide design method to design inhibitory peptides of EZH2. The method consists of a sequence selection stage and two validation stages for fold specificity and approximate binding affinity. The sequence selection stage consists of an integer linear optimization model that was solved to produce a rank-ordered list of amino acid sequences with increased stability in the bound peptide-EZH2 structure. These sequences were validated through the calculation of the fold specificity and approximate binding affinity of the designed peptides. Here we report the discovery of novel EZH2 inhibitory peptides using the de novo peptide design method. The computationally discovered peptides were experimentally validated in vitro using dose titrations and mechanism of action enzymatic assays. The peptide with the highest in vitro response, SQ037, was validated in nucleo using quantitative mass spectrometry-based proteomics. This peptide had an IC50 of 13.5 M, demonstrated greater potency as an inhibitor when compared to the native and K27A mutant control peptides, and demonstrated competitive inhibition versus the peptide substrate. Additionally, this peptide demonstrated high specificity to the EZH2 target in comparison to other histone methyltransferases. The validated peptides are the first computationally designed peptides that directly inhibit EZH2. These inhibitors should prove useful for further chromatin biology investigations.
Collapse
Affiliation(s)
- James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| | - Meghan B. Peterson
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| | - Barry M. Zee
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Shivani Garapaty
- Constellation Pharmaceuticals, Cambridge, Massachusetts, United States of America
| | - Aashna Mago
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Christina Lee
- Constellation Pharmaceuticals, Cambridge, Massachusetts, United States of America
| | | | - Patrick Trojer
- Constellation Pharmaceuticals, Cambridge, Massachusetts, United States of America
| | - Benjamin A. Garcia
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, United States of America
- Department of Chemistry, Princeton University, Princeton, New Jersey, United States of America
- Epigenetics Program, Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Christodoulos A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
- * E-mail:
| |
Collapse
|
269
|
Gasior P, Kotulska M. FISH Amyloid - a new method for finding amyloidogenic segments in proteins based on site specific co-occurrence of aminoacids. BMC Bioinformatics 2014; 15:54. [PMID: 24564523 PMCID: PMC3941796 DOI: 10.1186/1471-2105-15-54] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Accepted: 02/03/2014] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Amyloids are proteins capable of forming fibrils whose intramolecular contact sites assume densely packed zipper pattern. Their oligomers can underlie serious diseases, e.g. Alzheimer's and Parkinson's diseases. Recent studies show that short segments of aminoacids can be responsible for amyloidogenic properties of a protein. A few hundreds of such peptides have been experimentally found but experimental testing of all candidates is currently not feasible. Here we propose an original machine learning method for classification of aminoacid sequences, based on discovering a segment with a discriminative pattern of site-specific co-occurrences between sequence elements. The pattern is based on the positions of residues with correlated occurrence over a sliding window of a specified length. The algorithm first recognizes the most relevant training segment in each positive training instance. Then the classification is based on maximal distances between co-occurrence matrix of the relevant segments in positive training sequences and the matrix from negative training segments. The method was applied for studying sequences of aminoacids with regard to their amyloidogenic properties. RESULTS Our method was first trained on available datasets of hexapeptides with the amyloidogenic classification, using 5 or 6-residue sliding windows. Depending on the choice of training and testing datasets, the area under ROC curve obtained the value up to 0.80 for experimental, and 0.95 for computationally generated (with 3D profile method) datasets. Importantly, the results on 5-residue segments were not significantly worse, although the classification required that algorithm first recognized the most relevant training segments. The dataset of long sequences, such as sup35 prion and a few other amyloid proteins, were applied to test the method and gave encouraging results. Our web tool FISH Amyloid was trained on all available experimental data 4-10 residues long, offers prediction of amyloidogenic segments in protein sequences. CONCLUSIONS We proposed a new original classification method which recognizes co-occurrence patterns in sequences. The method reveals characteristic classification pattern of the data and finds the segments where its scoring is the strongest, also in long training sequences. Applied to the problem of amyloidogenic segments recognition, it showed a good potential for classification problems in bioinformatics.
Collapse
Affiliation(s)
| | - Malgorzata Kotulska
- Institute of Biomedical Engineering and Instrumentation, Wroclaw University of Technology, 50-370 Wroclaw, Poland.
| |
Collapse
|
270
|
Borgo B, Havranek JJ. Motif-directed redesign of enzyme specificity. Protein Sci 2014; 23:312-20. [PMID: 24407908 PMCID: PMC3945839 DOI: 10.1002/pro.2417] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 12/29/2013] [Indexed: 11/21/2022]
Abstract
Computational protein design relies on several approximations, including the use of fixed backbones and rotamers, to reduce protein design to a computationally tractable problem. However, allowing backbone and off-rotamer flexibility leads to more accurate designs and greater conformational diversity. Exhaustive sampling of this additional conformational space is challenging, and often impossible. Here, we report a computational method that utilizes a preselected library of native interactions to direct backbone flexibility to accommodate placement of these functional contacts. Using these native interaction modules, termed motifs, improves the likelihood that the interaction can be realized, provided that suitable backbone perturbations can be identified. Furthermore, it allows a directed search of the conformational space, reducing the sampling needed to find low energy conformations. We implemented the motif-based design algorithm in Rosetta, and tested the efficacy of this method by redesigning the substrate specificity of methionine aminopeptidase. In summary, native enzymes have evolved to catalyze a wide range of chemical reactions with extraordinary specificity. Computational enzyme design seeks to generate novel chemical activities by altering the target substrates of these existing enzymes. We have implemented a novel approach to redesign the specificity of an enzyme and demonstrated its effectiveness on a model system.
Collapse
Affiliation(s)
- Benjamin Borgo
- Program in Computational and Systems Biology, Washington University in St. Louis, St. Louis, Missouri, 63110
| | | |
Collapse
|
271
|
An accurate binding interaction model in de novo computational protein design of interactions: If you build it, they will bind. J Struct Biol 2014; 185:136-46. [DOI: 10.1016/j.jsb.2013.03.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2012] [Revised: 03/15/2013] [Accepted: 03/21/2013] [Indexed: 01/07/2023]
|
272
|
Szamborska-Gbur A, Rymarczyk G, Orłowski M, Kuzynowski T, Jakób M, Dziedzic-Letka A, Górecki A, Dobryszycki P, Ożyhar A. The molecular basis of conformational instability of the ecdysone receptor DNA binding domain studied by in silico and in vitro experiments. PLoS One 2014; 9:e86052. [PMID: 24465866 PMCID: PMC3900457 DOI: 10.1371/journal.pone.0086052] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2013] [Accepted: 12/04/2013] [Indexed: 11/19/2022] Open
Abstract
The heterodimer of the ecdysone receptor (EcR) and ultraspiracle (Usp), members of the nuclear receptors superfamily, regulates gene expression associated with molting and metamorphosis in insects. The DNA binding domains (DBDs) of the Usp and EcR play an important role in their DNA-dependent heterodimerization. Analysis of the crystal structure of the UspDBD/EcRDBD heterocomplex from Drosophila melanogaster on the hsp27 gene response element, suggested an appreciable similarity between both DBDs. However, the chemical denaturation experiments showed a categorically lower stability for the EcRDBD in contrast to the UspDBD. The aim of our study was an elucidation of the molecular basis of this intriguing instability. Toward this end, we mapped the EcRDBD amino acid sequence positions which have an impact on the stability of the EcRDBD. The computational protein design and in vitro analyses of the EcRDBD mutants indicate that non-conserved residues within the α-helix 2, forming the EcRDBD hydrophobic core, represent a specific structural element that contributes to instability. In particular, the L58 appears to be a key residue which differentiates the hydrophobic cores of UspDBD and EcRDBD and is the main reason for the low stability of the EcRDBD. Our results might serve as a benchmark for further studies of the intricate nature of the EcR molecule.
Collapse
Affiliation(s)
| | - Grzegorz Rymarczyk
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Marek Orłowski
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Tomasz Kuzynowski
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Michał Jakób
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Agnieszka Dziedzic-Letka
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Andrzej Górecki
- Department of Physical Biochemistry, Faculty of Biochemistry, Biophysics and Biotechnology, Jagiellonian University, Kraków, Poland
| | - Piotr Dobryszycki
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
| | - Andrzej Ożyhar
- Department of Biochemistry, Faculty of Chemistry, Wrocław University of Technology, Wrocław, Poland
- * E-mail:
| |
Collapse
|
273
|
Smith MD, Zanghellini A, Grabs-Röthlisberger D. Computational design of novel enzymes without cofactors. Methods Mol Biol 2014; 1216:197-210. [PMID: 25213417 DOI: 10.1007/978-1-4939-1486-9_10] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
In this review we present a recently developed computational method to design de novo enzymes. Starting from the three-dimensional arrangement of the transition state structure and the catalytic side chains around it (theozyme), RosettaMatch identifies successful placements of the theozyme into protein scaffolds. Subsequently, RosettaEnzDes (for EnzymeDesign) redesigns the active site around the theozyme for binding and stabilization of the transition state and the catalytic residues. The resulting computationally designed enzymes are expressed and experimentally tested for catalytic activity.
Collapse
Affiliation(s)
- Matthew D Smith
- Molecular and Cellular Biology Program, University of Washington, 351655, Seattle, WA, 98195, USA
| | | | | |
Collapse
|
274
|
Nemoto W, Saito A, Oikawa H. Recent advances in functional region prediction by using structural and evolutionary information - Remaining problems and future extensions. Comput Struct Biotechnol J 2013; 8:e201308007. [PMID: 24688747 PMCID: PMC3962155 DOI: 10.5936/csbj.201308007] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2013] [Revised: 11/12/2013] [Accepted: 11/13/2013] [Indexed: 11/22/2022] Open
Abstract
Structural genomics projects have solved many new structures with unknown functions. One strategy to investigate the function of a structure is to computationally find the functionally important residues or regions on it. Therefore, the development of functional region prediction methods has become an important research subject. An effective approach is to use a method employing structural and evolutionary information, such as the evolutionary trace (ET) method. ET ranks the residues of a protein structure by calculating the scores for relative evolutionary importance, and locates functionally important sites by identifying spatial clusters of highly ranked residues. After ET was developed, numerous ET-like methods were subsequently reported, and many of them are in practical use, although they require certain conditions. In this mini review, we first introduce the remaining problems and the recent improvements in the methods using structural and evolutionary information. We then summarize the recent developments of the methods. Finally, we conclude by describing possible extensions of the evolution- and structure-based methods.
Collapse
Affiliation(s)
- Wataru Nemoto
- Division of Life Science and Engineering, School of Science and Engineering, Tokyo Denki University (TDU), Ishizaka, Hatoyama-cho, Hiki-gun, Saitama, 350-0394, Japan
| | - Akira Saito
- Division of Life Science and Engineering, School of Science and Engineering, Tokyo Denki University (TDU), Ishizaka, Hatoyama-cho, Hiki-gun, Saitama, 350-0394, Japan
| | - Hayato Oikawa
- Division of Life Science and Engineering, School of Science and Engineering, Tokyo Denki University (TDU), Ishizaka, Hatoyama-cho, Hiki-gun, Saitama, 350-0394, Japan
| |
Collapse
|
275
|
Petrella RJ. OPTIMIZATION BIAS IN ENERGY-BASED STRUCTURE PREDICTION. JOURNAL OF THEORETICAL & COMPUTATIONAL CHEMISTRY 2013; 12:1341014. [PMID: 25552783 PMCID: PMC4278582 DOI: 10.1142/s0219633613410149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Physics-based computational approaches to predicting the structure of macromolecules such as proteins are gaining increased use, but there are remaining challenges. In the current work, it is demonstrated that in energy-based prediction methods, the degree of optimization of the sampled structures can influence the prediction results. In particular, discrepancies in the degree of local sampling can bias the predictions in favor of the oversampled structures by shifting the local probability distributions of the minimum sampled energies. In simple systems, it is shown that the magnitude of the errors can be calculated from the energy surface, and for certain model systems, derived analytically. Further, it is shown that for energy wells whose forms differ only by a randomly assigned energy shift, the optimal accuracy of prediction is achieved when the sampling around each structure is equal. Energy correction terms can be used in cases of unequal sampling to reproduce the total probabilities that would occur under equal sampling, but optimal corrections only partially restore the prediction accuracy lost to unequal sampling. For multiwell systems, the determination of the correction terms is a multibody problem; it is shown that the involved cross-correlation multiple integrals can be reduced to simpler integrals. The possible implications of the current analysis for macromolecular structure prediction are discussed.
Collapse
Affiliation(s)
- Robert J. Petrella
- Department of Chemistry and Chemical Biology, Harvard University, 12 Oxford Street, Cambridge, MA 02138, USA
- Department of Medicine, Harvard Medical School, 25 Shattuck Street, Boston, MA 02115, USA
| |
Collapse
|
276
|
Rouet R, Lowe D, Christ D. Stability engineering of the human antibody repertoire. FEBS Lett 2013; 588:269-77. [PMID: 24291820 DOI: 10.1016/j.febslet.2013.11.029] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Revised: 11/20/2013] [Accepted: 11/20/2013] [Indexed: 10/26/2022]
Abstract
Human monoclonal antibodies often display limited thermodynamic and colloidal stabilities. This behavior hinders their production, and places limitations on the development of novel formulation conditions and therapeutic applications. Antibodies are highly diverse molecules, with much of the sequence variation observed within variable domain families and, in particular, their complementarity determining regions. This has complicated the development of comprehensive strategies for the stability engineering of the human antibody repertoire. Here we provide an overview of the field, and discuss recent advances in the development of robust and aggregation resistant antibody therapeutics.
Collapse
Affiliation(s)
- Romain Rouet
- Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, Sydney, NSW 2010, Australia.
| | - David Lowe
- MedImmune, Milstein Building, Granta Park, Cambridge CB21 6GH, United Kingdom
| | - Daniel Christ
- Garvan Institute of Medical Research, 384 Victoria Street, Darlinghurst, Sydney, NSW 2010, Australia; The University of New South Wales, Faculty of Medicine, St Vincent's Clinical School, Darlinghurst, Sydney, NSW 2010, Australia
| |
Collapse
|
277
|
Nivón LG, Bjelic S, King C, Baker D. Automating human intuition for protein design. Proteins 2013; 82:858-66. [DOI: 10.1002/prot.24463] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2013] [Revised: 09/25/2013] [Accepted: 10/21/2013] [Indexed: 11/11/2022]
Affiliation(s)
- Lucas G. Nivón
- Department of BiochemistryUniversity of WashingtonSeattle Washington98195
| | - Sinisa Bjelic
- Department of BiochemistryUniversity of WashingtonSeattle Washington98195
| | - Chris King
- Department of BiochemistryUniversity of WashingtonSeattle Washington98195
| | - David Baker
- Department of BiochemistryUniversity of WashingtonSeattle Washington98195
- Howard Hughes Medical Institute (HHMI)University of WashingtonSeattle Washington98195
| |
Collapse
|
278
|
Abstract
Diverse engineering strategies have been developed to create enzymes with novel catalytic activities. Among these, computational approaches hold particular promise. Enzymes have been computationally designed to promote several nonbiological reactions, including a Diels-Alder cycloaddition, proton transfer, multistep retroaldol transformations, and metal-dependent hydrolysis of phosphotriesters. Although their efficiencies (kcat/KM = 0.1-100 M(-1) s(-1)) are typically low compared with those of the best natural enzymes (10(6)-10(8) M(-1) s(-1)), these catalysts are excellent starting points for laboratory evolution. This review surveys recent progress in combining computational and evolutionary approaches to enzyme design, together with insights into enzyme function gained from studies of the engineered catalysts.
Collapse
Affiliation(s)
- Donald Hilvert
- Laboratory of Organic Chemistry, ETH Zürich, 8093 Zürich, Switzerland.
| |
Collapse
|
279
|
Cherny I, Greisen P, Ashani Y, Khare SD, Oberdorfer G, Leader H, Baker D, Tawfik DS. Engineering V-type nerve agents detoxifying enzymes using computationally focused libraries. ACS Chem Biol 2013; 8:2394-403. [PMID: 24041203 DOI: 10.1021/cb4004892] [Citation(s) in RCA: 75] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
VX and its Russian (RVX) and Chinese (CVX) analogues rapidly inactivate acetylcholinesterase and are the most toxic stockpile nerve agents. These organophosphates have a thiol leaving group with a choline-like moiety and are hydrolyzed very slowly by natural enzymes. We used an integrated computational and experimental approach to increase Brevundimonas diminuta phosphotriesterase's (PTE) detoxification rate of V-agents by 5000-fold. Computational models were built of the complex between PTE and V-agents. On the basis of these models, the active site was redesigned to be complementary in shape to VX and RVX and to include favorable electrostatic interactions with their choline-like leaving group. Small libraries based on designed sequences were constructed. The libraries were screened by a direct assay for V-agent detoxification, as our initial studies showed that colorimetric surrogates fail to report the detoxification rates of the actual agents. The experimental results were fed back to improve the computational models. Overall, five rounds of iterating between experiment and model refinement led to variants that hydrolyze the toxic SP isomers of all three V-agents with kcat/KM values of up to 5 × 10(6) M(-1) min(-1) and also efficiently detoxify G-agents. These new catalysts provide the basis for broad spectrum nerve agent detoxification.
Collapse
Affiliation(s)
- Izhack Cherny
- Department
of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Per Greisen
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Yacov Ashani
- Department
of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - Sagar D. Khare
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Gustav Oberdorfer
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Haim Leader
- Department
of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Dan S. Tawfik
- Department
of Biological Chemistry, Weizmann Institute of Science, Rehovot 76100, Israel
| |
Collapse
|
280
|
Ollikainen N, Kortemme T. Computational protein design quantifies structural constraints on amino acid covariation. PLoS Comput Biol 2013; 9:e1003313. [PMID: 24244128 PMCID: PMC3828131 DOI: 10.1371/journal.pcbi.1003313] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2013] [Accepted: 09/20/2013] [Indexed: 02/02/2023] Open
Abstract
Amino acid covariation, where the identities of amino acids at different sequence positions are correlated, is a hallmark of naturally occurring proteins. This covariation can arise from multiple factors, including selective pressures for maintaining protein structure, requirements imposed by a specific function, or from phylogenetic sampling bias. Here we employed flexible backbone computational protein design to quantify the extent to which protein structure has constrained amino acid covariation for 40 diverse protein domains. We find significant similarities between the amino acid covariation in alignments of natural protein sequences and sequences optimized for their structures by computational protein design methods. These results indicate that the structural constraints imposed by protein architecture play a dominant role in shaping amino acid covariation and that computational protein design methods can capture these effects. We also find that the similarity between natural and designed covariation is sensitive to the magnitude and mechanism of backbone flexibility used in computational protein design. Our results thus highlight the necessity of including backbone flexibility to correctly model precise details of correlated amino acid changes and give insights into the pressures underlying these correlations. Proteins generally fold into specific three-dimensional structures to perform their cellular functions, and the presence of misfolded proteins is often deleterious for cellular and organismal fitness. For these reasons, maintenance of protein structure is thought to be one of the major fitness pressures acting on proteins. Consequently, the sequences of today's naturally occurring proteins contain signatures reflecting the constraints imposed by protein structure. Here we test the ability of computational protein design methods to recapitulate and explain these signatures. We focus on the physical basis of evolutionary pressures that act on interactions between amino acids in folded proteins, which are critical in determining protein structure and function. Such pressures can be observed from the appearance of amino acid covariation, where the amino acids at certain positions in protein sequences are correlated with each other. We find similar patterns of amino acid covariation in natural sequences and sequences optimized for their structures using computational protein design, demonstrating the importance of structural constraints in protein molecular evolution and providing insights into the structural mechanisms leading to covariation. In addition, these results characterize the ability of computational methods to model the precise details of correlated amino acid changes, which is critical for engineering new proteins with useful functions beyond those seen in nature.
Collapse
Affiliation(s)
- Noah Ollikainen
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
| | - Tanja Kortemme
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, California, United States of America
- California Institute for Quantitative Biosciences (QB3), University of California San Francisco, San Francisco, California, United States of America
- Department of Bioengineering and Therapeutic Science, University of California San Francisco, San Francisco, California, United States of America
- * E-mail:
| |
Collapse
|
281
|
Jackson EL, Ollikainen N, Covert AW, Kortemme T, Wilke CO. Amino-acid site variability among natural and designed proteins. PeerJ 2013; 1:e211. [PMID: 24255821 PMCID: PMC3828621 DOI: 10.7717/peerj.211] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2013] [Accepted: 10/24/2013] [Indexed: 11/20/2022] Open
Abstract
Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.
Collapse
Affiliation(s)
- Eleisha L. Jackson
- Institute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| | - Noah Ollikainen
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Arthur W. Covert
- Institute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| | - Tanja Kortemme
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
- California Institute for Quantitative Biosciences (QB3) and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Claus O. Wilke
- Institute of Cellular and Molecular Biology, Center for Computational Biology and Bioinformatics, and Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
282
|
Mitra P, Shultis D, Brender JR, Czajka J, Marsh D, Gray F, Cierpicki T, Zhang Y. An evolution-based approach to De Novo protein design and case study on Mycobacterium tuberculosis. PLoS Comput Biol 2013; 9:e1003298. [PMID: 24204234 PMCID: PMC3812052 DOI: 10.1371/journal.pcbi.1003298] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2012] [Accepted: 09/09/2013] [Indexed: 01/31/2023] Open
Abstract
Computational protein design is a reverse procedure of protein folding and structure prediction, where constructing structures from evolutionarily related proteins has been demonstrated to be the most reliable method for protein 3-dimensional structure prediction. Following this spirit, we developed a novel method to design new protein sequences based on evolutionarily related protein families. For a given target structure, a set of proteins having similar fold are identified from the PDB library by structural alignments. A structural profile is then constructed from the protein templates and used to guide the conformational search of amino acid sequence space, where physicochemical packing is accommodated by single-sequence based solvation, torsion angle, and secondary structure predictions. The method was tested on a computational folding experiment based on a large set of 87 protein structures covering different fold classes, which showed that the evolution-based design significantly enhances the foldability and biological functionality of the designed sequences compared to the traditional physics-based force field methods. Without using homologous proteins, the designed sequences can be folded with an average root-mean-square-deviation of 2.1 Å to the target. As a case study, the method is extended to redesign all 243 structurally resolved proteins in the pathogenic bacteria Mycobacterium tuberculosis, which is the second leading cause of death from infectious disease. On a smaller scale, five sequences were randomly selected from the design pool and subjected to experimental validation. The results showed that all the designed proteins are soluble with distinct secondary structure and three have well ordered tertiary structure, as demonstrated by circular dichroism and NMR spectroscopy. Together, these results demonstrate a new avenue in computational protein design that uses knowledge of evolutionary conservation from protein structural families to engineer new protein molecules of improved fold stability and biological functionality. The goal of computational protein design is to create new protein sequences of desirable structure and biological function. Most protein design methods are developed to search for sequences with the lowest free-energy based on physics-based force fields following Anfinsen's thermodynamic hypothesis. A major obstacle of such approaches is the inaccuracy of the force-field design, which cannot accurately describe atomic interactions or correctly recognize protein folds. We propose a novel method which uses evolutionary information, in the form of sequence profiles from structure families, to guide the sequence design. Since sequence profiles are generally more accurate than physics-based potentials in protein fold recognition, a unique advantage lies on that it targets the design procedure to a family of protein sequence profiles to enhance the robustness of designed sequences. The method was tested on 87 proteins and the designed sequences can be folded by I-TASSER to models with an average RMSD 2.1 Å. As a case study of large-scale application, the method is extended to redesign all structurally resolved proteins in the human pathogenic bacteria, Mycobacterium tuberculosis. Five sequences varying in fold and sizes were characterized by circular dichroism and NMR spectroscopy experiments and three were shown to have ordered tertiary structure.
Collapse
Affiliation(s)
- Pralay Mitra
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David Shultis
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Jeffrey R. Brender
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Jeff Czajka
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - David Marsh
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Felicia Gray
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Tomasz Cierpicki
- Department of Pathology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
283
|
Xiao X, Hall CK, Agris PF. The design of a peptide sequence to inhibit HIV replication: a search algorithm combining Monte Carlo and self-consistent mean field techniques. J Biomol Struct Dyn 2013; 32:1523-36. [PMID: 24147736 DOI: 10.1080/07391102.2013.825757] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
We developed a search algorithm combining Monte Carlo (MC) and self-consistent mean field techniques to evolve a peptide sequence that has good binding capability to the anticodon stem and loop (ASL) of human lysine tRNA species, tRNA(Lys3), with the ultimate purpose of breaking the replication cycle of human immunodeficiency virus-1. The starting point is the 15-amino-acid sequence, RVTHHAFLGAHRTVG, found experimentally by Agris and co-workers to bind selectively to hypermodified tRNA(Lys3). The peptide backbone conformation is determined via atomistic simulation of the peptide-ASL(Lys3) complex and then held fixed throughout the search. The proportion of amino acids of various types (hydrophobic, polar, charged, etc.) is varied to mimic different peptide hydration properties. Three different sets of hydration properties were examined in the search algorithm to see how this affects evolution to the best-binding peptide sequences. Certain amino acids are commonly found at fixed sites for all three hydration states, some necessary for binding affinity and some necessary for binding specificity. Analysis of the binding structure and the various contributions to the binding energy shows that: 1) two hydrophilic residues (asparagine at site 11 and the cysteine at site 12) "recognize" the ASL(Lys3) due to the VDW energy, and thereby contribute to its binding specificity and 2) the positively charged arginines at sites 4 and 13 preferentially attract the negatively charged sugar rings and the phosphate linkages, and thereby contribute to the binding affinity.
Collapse
Affiliation(s)
- Xingqing Xiao
- a Chemical and Biomolecular Engineering Department , North Carolina State University , Raleigh , NC , 27695-7905 , USA
| | | | | |
Collapse
|
284
|
Das R. Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz. PLoS One 2013; 8:e74830. [PMID: 24204571 PMCID: PMC3804535 DOI: 10.1371/journal.pone.0074830] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 08/07/2013] [Indexed: 11/18/2022] Open
Abstract
Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. Current approaches, such as the protein local optimization protocol or kinematic inversion closure (KIC) Monte Carlo, involve stages that coarse-grain proteins, simplifying modeling but precluding a systematic search of all-atom configurations. This article introduces an alternative modeling strategy based on a ‘stepwise ansatz’, recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth ‘RNA-puzzle’ competition. These results establish all-atom enumeration as an unusually systematic approach to ab initio protein structure modeling that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic accuracy.
Collapse
Affiliation(s)
- Rhiju Das
- Departments of Biochemistry and Physics, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
285
|
Gregoire S, Zhang S, Costanzo J, Wilson K, Fernandez EJ, Kwon I. Cis-suppression to arrest protein aggregation in mammalian cells. Biotechnol Bioeng 2013; 111:462-74. [PMID: 24114411 DOI: 10.1002/bit.25119] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 08/18/2013] [Accepted: 09/09/2013] [Indexed: 12/20/2022]
Abstract
Protein misfolding and aggregation are implicated in numerous human diseases and significantly lower production yield of proteins expressed in mammalian cells. Despite the importance of understanding and suppressing protein aggregation in mammalian cells, a protein design and selection strategy to modulate protein misfolding/aggregation in mammalian cells has not yet been reported. In this work, we address the particular challenge presented by mutation-induced protein aggregation in mammalian cells. We hypothesize that an additional mutation(s) can be introduced in an aggregation-prone protein variant, spatially near the original mutation, to suppress misfolding and aggregation (cis-suppression). As a model protein, we chose human copper, zinc superoxide dismutase mutant (SOD1(A4V) ) containing an alanine to valine mutation at residue 4, associated with the familial form of amyotrophic lateral sclerosis. We used the program RosettaDesign to identify Phe20 in SOD1(A4V) as a key residue responsible for SOD1(A4V) conformational destabilization. This information was used to rationally develop a pool of candidate mutations at the Phe20 site. After two rounds of mammalian-cell based screening of the variants, three novel SOD1(A4V) variants with a significantly reduced aggregation propensity inside cells were selected. The enhanced stability and reduced aggregation propensity of the three novel SOD1(A4V) variants were verified using cell fractionation and in vitro stability assays.
Collapse
Affiliation(s)
- Simpson Gregoire
- Department of Chemical Engineering, University of Virginia, Charlottesville, Virginia, 22904-4741
| | | | | | | | | | | |
Collapse
|
286
|
Grigoryan G. Absolute free energies of biomolecules from unperturbed ensembles. J Comput Chem 2013; 34:2726-41. [PMID: 24132787 DOI: 10.1002/jcc.23448] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2013] [Revised: 07/11/2013] [Accepted: 08/31/2013] [Indexed: 01/31/2023]
Abstract
Computing the absolute free energy of a macromolecule's structural state, F, is a challenging problem of high relevance. This study presents a method that computes F using only information from an unperturbed simulation of the macromolecule in the relevant conformational state, ensemble, and environment. Absolute free energies produced by this method, dubbed Valuation of Local Configuration Integral with Dynamics (VALOCIDY), enable comparison of alternative states. For example, comparing explicitly solvated and vaporous states of amino acid side-chain analogs produces solvation free energies in good agreement with experiments. Also, comparisons between alternative conformational states of model heptapeptides (including the unfolded state) produce free energy differences in agreement with data from μs molecular-dynamics simulations and experimental propensities. The potential of using VALOCIDY in computational protein design is explored via a small design problem of stabilizing a β-turn structure. When VALOCIDY-based estimation of folding free energy is used as the design metric, the resulting sequence folds into the desired structure within the atomistic force field used in design. The VALOCIDY-based approach also recognizes the distinct status of the native sequence regardless of minor details of the starting template structure, in stark contrast with a traditional fixed-backbone approach.
Collapse
Affiliation(s)
- Gevorg Grigoryan
- Department of Computer Science and Department of Biology, Dartmouth College, Hanover, New Hampshire, 03755
| |
Collapse
|
287
|
Alexander NS, Stein RA, Koteiche HA, Kaufmann KW, Mchaourab HS, Meiler J. RosettaEPR: rotamer library for spin label structure and dynamics. PLoS One 2013; 8:e72851. [PMID: 24039810 PMCID: PMC3764097 DOI: 10.1371/journal.pone.0072851] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Accepted: 07/15/2013] [Indexed: 11/18/2022] Open
Abstract
An increasingly used parameter in structural biology is the measurement of distances between spin labels bound to a protein. One limitation to these measurements is the unknown position of the spin label relative to the protein backbone. To overcome this drawback, we introduce a rotamer library of the methanethiosulfonate spin label (MTSSL) into the protein modeling program Rosetta. Spin label rotamers were derived from conformations observed in crystal structures of spin labeled T4 lysozyme and previously published molecular dynamics simulations. Rosetta’s ability to accurately recover spin label conformations and EPR measured distance distributions was evaluated against 19 experimentally determined MTSSL labeled structures of T4 lysozyme and the membrane protein LeuT and 73 distance distributions from T4 lysozyme and the membrane protein MsbA. For a site in the core of T4 lysozyme, the correct spin label conformation (Χ1 and Χ2) is recovered in 99.8% of trials. In surface positions 53% of the trajectories agree with crystallized conformations in Χ1 and Χ2. This level of recovery is on par with Rosetta performance for the 20 natural amino acids. In addition, Rosetta predicts the distance between two spin labels with a mean error of 4.4 Å. The width of the experimental distance distribution, which reflects the flexibility of the two spin labels, is predicted with a mean error of 1.3 Å. RosettaEPR makes full-atom spin label modeling available to a wide scientific community in conjunction with the powerful suite of modeling methods within Rosetta.
Collapse
Affiliation(s)
- Nathan S. Alexander
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Richard A. Stein
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Hanane A. Koteiche
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Kristian W. Kaufmann
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Hassane S. Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- * E-mail:
| |
Collapse
|
288
|
Tinberg CE, Khare SD, Dou J, Doyle L, Nelson JW, Schena A, Jankowski W, Kalodimos CG, Johnsson K, Stoddard BL, Baker D. Computational design of ligand-binding proteins with high affinity and selectivity. Nature 2013; 501:212-216. [PMID: 24005320 DOI: 10.1038/nature12443] [Citation(s) in RCA: 304] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2013] [Accepted: 07/11/2013] [Indexed: 01/27/2023]
Abstract
The ability to design proteins with high affinity and selectivity for any given small molecule is a rigorous test of our understanding of the physiochemical principles that govern molecular recognition. Attempts to rationally design ligand-binding proteins have met with little success, however, and the computational design of protein-small-molecule interfaces remains an unsolved problem. Current approaches for designing ligand-binding proteins for medical and biotechnological uses rely on raising antibodies against a target antigen in immunized animals and/or performing laboratory-directed evolution of proteins with an existing low affinity for the desired ligand, neither of which allows complete control over the interactions involved in binding. Here we describe a general computational method for designing pre-organized and shape complementary small-molecule-binding sites, and use it to generate protein binders to the steroid digoxigenin (DIG). Of seventeen experimentally characterized designs, two bind DIG; the model of the higher affinity binder has the most energetically favourable and pre-organized interface in the design set. A comprehensive binding-fitness landscape of this design, generated by library selections and deep sequencing, was used to optimize its binding affinity to a picomolar level, and X-ray co-crystal structures of two variants show atomic-level agreement with the corresponding computational models. The optimized binder is selective for DIG over the related steroids digitoxigenin, progesterone and β-oestradiol, and this steroid binding preference can be reprogrammed by manipulation of explicitly designed hydrogen-bonding interactions. The computational design method presented here should enable the development of a new generation of biosensors, therapeutics and diagnostics.
Collapse
Affiliation(s)
- Christine E Tinberg
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Sagar D Khare
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Jiayi Dou
- Department of Bioengineering, University of Washington, Seattle, WA 98195, USA.,Graduate Program in Biological Physics, Structure, and Design, University of Washington, Seattle, WA 98195, USA
| | - Lindsey Doyle
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Jorgen W Nelson
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Alberto Schena
- Institute of Chemical Sciences and Engineering, Institute of Bioengineering, National Centre of Competence in Research (NCCR) of Chemical Biology, École Polytechnique Fédérale de Laussane (EPFL), Laussane, Switzerland
| | - Wojciech Jankowski
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | | | - Kai Johnsson
- Institute of Chemical Sciences and Engineering, Institute of Bioengineering, National Centre of Competence in Research (NCCR) of Chemical Biology, École Polytechnique Fédérale de Laussane (EPFL), Laussane, Switzerland
| | - Barry L Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
289
|
Huang YM, Bystroff C. Expanded explorations into the optimization of an energy function for protein design. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:1176-1187. [PMID: 24384706 PMCID: PMC3919130 DOI: 10.1109/tcbb.2013.113] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Nature possesses a secret formula for the energy as a function of the structure of a protein. In protein design, approximations are made to both the structural representation of the molecule and to the form of the energy equation, such that the existence of a general energy function for proteins is by no means guaranteed. Here, we present new insights toward the application of machine learning to the problem of finding a general energy function for protein design. Machine learning requires the definition of an objective function, which carries with it the implied definition of success in protein design. We explored four functions, consisting of two functional forms, each with two criteria for success. Optimization was carried out by a Monte Carlo search through the space of all variable parameters. Cross-validation of the optimized energy function against a test set gave significantly different results depending on the choice of objective function, pointing to relative correctness of the built-in assumptions. Novel energy cross terms correct for the observed nonadditivity of energy terms and an imbalance in the distribution of predicted amino acids. This paper expands on the work presented at the 2012 ACM-BCB.
Collapse
|
290
|
Mills JH, Khare SD, Bolduc JM, Forouhar F, Mulligan VK, Lew S, Seetharaman J, Tong L, Stoddard BL, Baker D. Computational design of an unnatural amino acid dependent metalloprotein with atomic level accuracy. J Am Chem Soc 2013; 135:13393-9. [PMID: 23924187 DOI: 10.1021/ja403503m] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Genetically encoded unnatural amino acids could facilitate the design of proteins and enzymes of novel function, but correctly specifying sites of incorporation and the identities and orientations of surrounding residues represents a formidable challenge. Computational design methods have been used to identify optimal locations for functional sites in proteins and design the surrounding residues but have not incorporated unnatural amino acids in this process. We extended the Rosetta design methodology to design metalloproteins in which the amino acid (2,2'-bipyridin-5yl)alanine (Bpy-Ala) is a primary ligand of a bound metal ion. Following initial results that indicated the importance of buttressing the Bpy-Ala amino acid, we designed a buried metal binding site with octahedral coordination geometry consisting of Bpy-Ala, two protein-based metal ligands, and two metal-bound water molecules. Experimental characterization revealed a Bpy-Ala-mediated metalloprotein with the ability to bind divalent cations including Co(2+), Zn(2+), Fe(2+), and Ni(2+), with a Kd for Zn(2+) of ∼40 pM. X-ray crystal structures of the designed protein bound to Co(2+) and Ni(2+) have RMSDs to the design model of 0.9 and 1.0 Å respectively over all atoms in the binding site.
Collapse
Affiliation(s)
- Jeremy H Mills
- Department of Biochemistry and ⊥Biomolecular Structure and Design Program, University of Washington , Seattle, Washington, United States
| | | | | | | | | | | | | | | | | | | |
Collapse
|
291
|
Figueroa M, Oliveira N, Lejeune A, Kaufmann KW, Dorr BM, Matagne A, Martial JA, Meiler J, Van de Weerdt C. Octarellin VI: using rosetta to design a putative artificial (β/α)8 protein. PLoS One 2013; 8:e71858. [PMID: 23977165 PMCID: PMC3747059 DOI: 10.1371/journal.pone.0071858] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2012] [Accepted: 07/10/2013] [Indexed: 11/22/2022] Open
Abstract
The computational protein design protocol Rosetta has been applied successfully to a wide variety of protein engineering problems. Here the aim was to test its ability to design de novo a protein adopting the TIM-barrel fold, whose formation requires about twice as many residues as in the largest proteins successfully designed de novo to date. The designed protein, Octarellin VI, contains 216 residues. Its amino acid composition is similar to that of natural TIM-barrel proteins. When produced and purified, it showed a far-UV circular dichroism spectrum characteristic of folded proteins, with α-helical and β-sheet secondary structure. Its stable tertiary structure was confirmed by both tryptophan fluorescence and circular dichroism in the near UV. It proved heat stable up to 70°C. Dynamic light scattering experiments revealed a unique population of particles averaging 4 nm in diameter, in good agreement with our model. Although these data suggest the successful creation of an artificial α/β protein of more than 200 amino acids, Octarellin VI shows an apparent noncooperative chemical unfolding and low solubility.
Collapse
Affiliation(s)
- Maximiliano Figueroa
- GIGA-Research, Molecular Biology and Genetic Engineering Unit, University of Liège, Liège, Belgium
| | - Nicolas Oliveira
- GIGA-Research, Molecular Biology and Genetic Engineering Unit, University of Liège, Liège, Belgium
| | - Annabelle Lejeune
- GIGA-Research, Molecular Biology and Genetic Engineering Unit, University of Liège, Liège, Belgium
| | - Kristian W. Kaufmann
- Departments of Chemistry and Pharmacology, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Brent M. Dorr
- Departments of Chemistry and Pharmacology, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - André Matagne
- Laboratoire d’Enzymologie et Repliement des Protéines, Centre for Protein Engineering, University of Liège, Liège, Belgium
| | - Joseph A. Martial
- GIGA-Research, Molecular Biology and Genetic Engineering Unit, University of Liège, Liège, Belgium
| | - Jens Meiler
- Departments of Chemistry and Pharmacology, Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Cécile Van de Weerdt
- GIGA-Research, Molecular Biology and Genetic Engineering Unit, University of Liège, Liège, Belgium
- * E-mail:
| |
Collapse
|
292
|
Smadbeck J, Peterson MB, Khoury GA, Taylor MS, Floudas CA. Protein WISDOM: a workbench for in silico de novo design of biomolecules. J Vis Exp 2013. [PMID: 23912941 DOI: 10.3791/50476] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
The aim of de novo protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity. To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.
Collapse
Affiliation(s)
- James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, USA
| | | | | | | | | |
Collapse
|
293
|
Jiang L, Liu C, Leibly D, Landau M, Zhao M, Hughes MP, Eisenberg DS. Structure-based discovery of fiber-binding compounds that reduce the cytotoxicity of amyloid beta. eLife 2013. [PMID: 23878726 DOI: 10.7554/elife.00857.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Amyloid protein aggregates are associated with dozens of devastating diseases including Alzheimer's, Parkinson's, ALS, and diabetes type 2. While structure-based discovery of compounds has been effective in combating numerous infectious and metabolic diseases, ignorance of amyloid structure has hindered similar approaches to amyloid disease. Here we show that knowledge of the atomic structure of one of the adhesive, steric-zipper segments of the amyloid-beta (Aβ) protein of Alzheimer's disease, when coupled with computational methods, identifies eight diverse but mainly flat compounds and three compound derivatives that reduce Aβ cytotoxicity against mammalian cells by up to 90%. Although these compounds bind to Aβ fibers, they do not reduce fiber formation of Aβ. Structure-activity relationship studies of the fiber-binding compounds and their derivatives suggest that compound binding increases fiber stability and decreases fiber toxicity, perhaps by shifting the equilibrium of Aβ from oligomers to fibers. DOI:http://dx.doi.org/10.7554/eLife.00857.001.
Collapse
Affiliation(s)
- Lin Jiang
- Departments of Chemistry and Biochemistry and Biological Chemistry , Howard Hughes Medical Institute, UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles , Los Angeles , United States
| | | | | | | | | | | | | |
Collapse
|
294
|
Jiang L, Liu C, Leibly D, Landau M, Zhao M, Hughes MP, Eisenberg DS. Structure-based discovery of fiber-binding compounds that reduce the cytotoxicity of amyloid beta. eLife 2013; 2:e00857. [PMID: 23878726 PMCID: PMC3713518 DOI: 10.7554/elife.00857] [Citation(s) in RCA: 82] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2013] [Accepted: 06/10/2013] [Indexed: 12/15/2022] Open
Abstract
Amyloid protein aggregates are associated with dozens of devastating diseases including Alzheimer’s, Parkinson’s, ALS, and diabetes type 2. While structure-based discovery of compounds has been effective in combating numerous infectious and metabolic diseases, ignorance of amyloid structure has hindered similar approaches to amyloid disease. Here we show that knowledge of the atomic structure of one of the adhesive, steric-zipper segments of the amyloid-beta (Aβ) protein of Alzheimer’s disease, when coupled with computational methods, identifies eight diverse but mainly flat compounds and three compound derivatives that reduce Aβ cytotoxicity against mammalian cells by up to 90%. Although these compounds bind to Aβ fibers, they do not reduce fiber formation of Aβ. Structure-activity relationship studies of the fiber-binding compounds and their derivatives suggest that compound binding increases fiber stability and decreases fiber toxicity, perhaps by shifting the equilibrium of Aβ from oligomers to fibers. DOI:http://dx.doi.org/10.7554/eLife.00857.001 Alzheimer’s disease is the most common form of dementia, estimated to affect roughly five million people in the United States, and its incidence is steadily increasing as the population ages. A pathological hallmark of Alzheimer’s disease is the presence in the brain of aggregates of two proteins: tangles of a protein called tau; and fibers and smaller units (oligomers) of a peptide called amyloid beta. Many attempts have been made to screen libraries of natural and synthetic compounds to identify substances that might prevent the aggregation and toxicity of amyloid. Such studies revealed that polyphenols found in green tea and in the spice turmeric can inhibit the formation of amyloid fibrils. Moreover, a number of dyes reduce the toxic effects of amyloid on cells, although significant side effects prevent these from being used as drugs. Structure-based drug design, in which the structure of a target protein is used to help identify compounds that will interact with it, has been used to generate therapeutic agents for a number of diseases. Here, Jiang et al. report the first application of this technique in the hunt for compounds that inhibit the cytotoxicity of amyloid beta. Using the known atomic structure of the protein in complex with a dye, Jiang et al. performed a computational screen of 18,000 compounds in search of those that are likely to bind effectively. The compounds that showed the strongest predicted binding were then tested for their ability to interfere with the aggregation of amyloid beta and to protect cells grown in culture from its toxic effects. Compounds that reduced toxicity did not reduce the abundance of protein aggregates, but they appear to increase the stability of fibrils. This is consistent with other evidence suggesting that small, soluble forms (oligomers) of amyloid beta that break free from the fibrils may be the toxic agent in Alzheimer’s disease, rather than the fibrils themselves. In addition to uncovering compounds with therapeutic potential in Alzheimer’s disease, this work presents a new approach for identifying proteins that bind to amyloid fibrils. Given that amyloid accumulation is a feature of many other diseases, including Parkinson’s disease, Huntington’s disease and type 2 diabetes, the approach could have broad therapeutic applications. DOI:http://dx.doi.org/10.7554/eLife.00857.002
Collapse
Affiliation(s)
- Lin Jiang
- Departments of Chemistry and Biochemistry and Biological Chemistry , Howard Hughes Medical Institute, UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles , Los Angeles , United States
| | | | | | | | | | | | | |
Collapse
|
295
|
Parker AS, Choi Y, Griswold KE, Bailey-Kellogg C. Structure-guided deimmunization of therapeutic proteins. J Comput Biol 2013; 20:152-65. [PMID: 23384000 DOI: 10.1089/cmb.2012.0251] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Therapeutic proteins continue to yield revolutionary new treatments for a growing spectrum of human disease, but the development of these powerful drugs requires solving a unique set of challenges. For instance, it is increasingly apparent that mitigating potential anti-therapeutic immune responses, driven by molecular recognition of a therapeutic protein's peptide fragments, may be best accomplished early in the drug development process. One may eliminate immunogenic peptide fragments by mutating the cognate amino acid sequences, but deimmunizing mutations are constrained by the need for a folded, stable, and functional protein structure. These two concerns may be competing, as the mutations that are best at reducing immunogenicity often involve amino acids that are substantially different physicochemically. We develop a novel approach, called EpiSweep, that simultaneously optimizes both concerns. Our algorithm identifies sets of mutations making such Pareto optimal trade-offs between structure and immunogenicity, embodied by a molecular mechanics energy function and a T-cell epitope predictor, respectively. EpiSweep integrates structure-based protein design, sequence-based protein deimmunization, and algorithms for finding the Pareto frontier of a design space. While structure-based protein design is NP-hard, we employ integer programming techniques that are efficient in practice. Furthermore, EpiSweep only invokes the optimizer once per identified Pareto optimal design. We show that EpiSweep designs of regions of the therapeutics erythropoietin and staphylokinase are predicted to outperform previous experimental efforts. We also demonstrate EpiSweep's capacity for deimmunization of the entire proteins, case analyses involving dozens of predicted epitopes, and tens of thousands of unique side-chain interactions. Ultimately, Epi-Sweep is a powerful protein design tool that guides the protein engineer toward the most promising immunotolerant biotherapeutic candidates.
Collapse
Affiliation(s)
- Andrew S Parker
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA
| | | | | | | |
Collapse
|
296
|
Traoré S, Allouche D, André I, de Givry S, Katsirelos G, Schiex T, Barbe S. A new framework for computational protein design through cost function network optimization. Bioinformatics 2013; 29:2129-36. [DOI: 10.1093/bioinformatics/btt374] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
297
|
Adolf-Bryfogle J, Dunbrack Jr. RL. The PyRosetta Toolkit: a graphical user interface for the Rosetta software suite. PLoS One 2013; 8:e66856. [PMID: 23874400 PMCID: PMC3706480 DOI: 10.1371/journal.pone.0066856] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 05/11/2013] [Indexed: 01/25/2023] Open
Abstract
The Rosetta Molecular Modeling suite is a command-line-only collection of applications that enable high-resolution modeling and design of proteins and other molecules. Although extremely useful, Rosetta can be difficult to learn for scientists with little computational or programming experience. To that end, we have created a Graphical User Interface (GUI) for Rosetta, called the PyRosetta Toolkit, for creating and running protocols in Rosetta for common molecular modeling and protein design tasks and for analyzing the results of Rosetta calculations. The program is highly extensible so that developers can add new protocols and analysis tools to the PyRosetta Toolkit GUI.
Collapse
Affiliation(s)
- Jared Adolf-Bryfogle
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania, United States of America
- Drexel University College of Medicine, Program in Molecular and Cell Biology and Genetics, Philadelphia, Pennsylvania, United States of America
| | - Roland L. Dunbrack Jr.
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
298
|
Rodriguez VB, Kidd BA, Interlandi G, Tchesnokova V, Sokurenko EV, Thomas WE. Allosteric coupling in the bacterial adhesive protein FimH. J Biol Chem 2013; 288:24128-39. [PMID: 23821547 DOI: 10.1074/jbc.m113.461376] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The protein FimH is expressed by the majority of commensal and uropathogenic strains of Escherichia coli on the tips of type 1 fimbriae and mediates adhesion via a catch bond to its ligand mannose. Crystal structures of FimH show an allosteric conformational change, but it remains unclear whether all of the observed structural differences are part of the allosteric mechanism. Here we use the protein structural analysis tool RosettaDesign combined with human insight to identify and synthesize 10 mutations in four regions that we predicted would stabilize one of the conformations of that region. The function of each variant was characterized by measuring binding to the ligand mannose, whereas the allosteric state was determined using a conformation-specific monoclonal antibody. These studies demonstrated that each region investigated was indeed part of the FimH allosteric mechanism. However, the studies strongly suggested that some regions were more tightly coupled to mannose binding and others to antibody binding. In addition, we identified many FimH variants that appear locked in the low affinity state. Knowledge of regulatory sites outside the active and effector sites as well as the ability to make FimH variants locked in the low affinity state may be crucial to the future development of novel antiadhesive and antimicrobial therapies using allosteric regulation to inhibit FimH.
Collapse
Affiliation(s)
- Victoria B Rodriguez
- Department of Bioengineering, University of Washington, Seattle, Washington 98195-5061, USA
| | | | | | | | | | | |
Collapse
|
299
|
Suárez-Diez M, Pujol AM, Matzapetakis M, Jaramillo A, Iranzo O. Computational protein design with electrostatic focusing: experimental characterization of a conditionally folded helical domain with a reduced amino acid alphabet. Biotechnol J 2013; 8:855-64. [PMID: 23788466 DOI: 10.1002/biot.201200380] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2012] [Revised: 04/22/2013] [Accepted: 06/03/2013] [Indexed: 11/12/2022]
Abstract
Automated methodologies to design synthetic proteins from first principles use energy computations to estimate the ability of the sequences to adopt a targeted structure. This approach is still far from systematically producing native-like sequences, due, most likely, to inaccuracies when modeling the interactions between the protein and its aqueous environment. This is particularly challenging when engineering small protein domains (with less polar pair interactions than with the solvent). We have re-designed a three-helix bundle, domain B, using a fixed backbone and a four amino acid alphabet. We have enlarged the rotamer library with conformers that increase the weight of electrostatic interactions within the design process without altering the energy function used to compute the folding free energy. Our synthetic sequences show less than 15% similarity to any Swissprot sequence. We have characterized our sequences in different solvents using circular dichroism and nuclear magnetic resonance. The targeted structure achieved is dependent on the solvent used. This method can be readily extended to larger domains. Our method will be useful for the engineering of proteins that become active only in a given solvent and for designing proteins in the context of hydrophobic solvents, an important fraction of the situations in the cell.
Collapse
Affiliation(s)
- Maria Suárez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University, Wageningen, The Netherlands
| | | | | | | | | |
Collapse
|
300
|
Combs SA, Deluca SL, Deluca SH, Lemmon GH, Nannemann DP, Nguyen ED, Willis JR, Sheehan JH, Meiler J. Small-molecule ligand docking into comparative models with Rosetta. Nat Protoc 2013; 8:1277-98. [PMID: 23744289 DOI: 10.1038/nprot.2013.074] [Citation(s) in RCA: 120] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Structure-based drug design is frequently used to accelerate the development of small-molecule therapeutics. Although substantial progress has been made in X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy, the availability of high-resolution structures is limited owing to the frequent inability to crystallize or obtain sufficient NMR restraints for large or flexible proteins. Computational methods can be used to both predict unknown protein structures and model ligand interactions when experimental data are unavailable. This paper describes a comprehensive and detailed protocol using the Rosetta modeling suite to dock small-molecule ligands into comparative models. In the protocol presented here, we review the comparative modeling process, including sequence alignment, threading and loop building. Next, we cover docking a small-molecule ligand into the protein comparative model. In addition, we discuss criteria that can improve ligand docking into comparative models. Finally, and importantly, we present a strategy for assessing model quality. The entire protocol is presented on a single example selected solely for didactic purposes. The results are therefore not representative and do not replace benchmarks published elsewhere. We also provide an additional tutorial so that the user can gain hands-on experience in using Rosetta. The protocol should take 5-7 h, with additional time allocated for computer generation of models.
Collapse
Affiliation(s)
- Steven A Combs
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, USA
| | | | | | | | | | | | | | | | | |
Collapse
|