1
|
San Fabián J, Ema I, Omar S, García de la Vega JM. Toward a Computational NMR Procedure for Modeling Dipeptide Side-Chain Conformation. J Chem Inf Model 2021; 61:6012-6023. [PMID: 34762416 PMCID: PMC8715507 DOI: 10.1021/acs.jcim.1c00773] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Theoretical relationships between
the vicinal spin–spin
coupling constants (SSCCs) and the χ1 torsion angles
have been studied to predict the conformations of protein side chains.
An efficient computational procedure is developed to obtain the conformation
of dipeptides through theoretical and experimental SSCCs, Karplus
equations, and quantum chemistry methods, and it is applied to three
aliphatic hydrophobic residues (Val, Leu, and Ile). Three models are
proposed: unimodal-static, trimodal-static-stepped, and trimodal-static-trigonal,
where the most important factors are incorporated (coupled nuclei,
nature and orientation of the substituents, and local geometric properties).
Our results are validated by comparison with NMR and X-ray empirical
data described in the literature, obtaining successful results on
the 29 residues considered. Using out trimodal residue treatment,
it is possible to detect and resolve residues with a simple conformation
and those with two or three staggered conformers. In four residues,
a deeper analysis explains that they do not have a unique conformation
and that the population of each conformation plays an important role.
Collapse
Affiliation(s)
- Jesús San Fabián
- Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Ignacio Ema
- Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Salama Omar
- Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | | |
Collapse
|
2
|
Barad BA, Echols N, Wang RYR, Cheng Y, DiMaio F, Adams PD, Fraser JS. EMRinger: side chain-directed model and map validation for 3D cryo-electron microscopy. Nat Methods 2015; 12:943-6. [PMID: 26280328 PMCID: PMC4589481 DOI: 10.1038/nmeth.3541] [Citation(s) in RCA: 617] [Impact Index Per Article: 68.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 06/19/2015] [Indexed: 12/20/2022]
Abstract
Advances in high resolution electron cryomicroscopy (cryo-EM) have been accompanied by the development of validation metrics to independently assess map quality and model geometry. EMRinger assesses the precise fitting of an atomic model into the map during refinement and shows how radiation damage alters scattering from negatively charged amino acids. EMRinger will be useful for monitoring progress in resolving and modeling high-resolution features in cryo-EM.
Collapse
Affiliation(s)
- Benjamin A Barad
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, USA.,Graduate Group in Biophysics, University of California, San Francisco, San Francisco, California, USA
| | - Nathaniel Echols
- Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA
| | - Ray Yu-Ruei Wang
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, Washington, USA.,Department of Biochemistry, University of Washington, Seattle, Washington, USA
| | - Yifan Cheng
- Keck Advanced Microscopy Laboratory, Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, California, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, Washington, USA.,Institute for Protein Design, Seattle, Washington, USA
| | - Paul D Adams
- Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, USA.,Department of Bioengineering, University of California, Berkeley, Berkeley, California, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California, USA
| |
Collapse
|
3
|
Manipulating the substrate specificity of murine dihydrofolate reductase enzyme using an expanded set of amino acids. Biochem Eng J 2015. [DOI: 10.1016/j.bej.2015.03.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
4
|
Joyce AP, Zhang C, Bradley P, Havranek JJ. Structure-based modeling of protein: DNA specificity. Brief Funct Genomics 2014; 14:39-49. [PMID: 25414269 DOI: 10.1093/bfgp/elu044] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein:DNA interactions are essential to a range of processes that maintain and express the information encoded in the genome. Structural modeling is an approach that aims to understand these interactions at the physicochemical level. It has been proposed that structural modeling can lead to deeper understanding of the mechanisms of protein:DNA interactions, and that progress in this field can not only help to rationalize the observed specificities of DNA-binding proteins but also to allow researchers to engineer novel DNA site specificities. In this review we discuss recent developments in the structural description of protein:DNA interactions and specificity, as well as the challenges facing the field in the future.
Collapse
|
5
|
Micale G, Pulvirenti A, Giugno R, Ferro A. Proteins comparison through probabilistic optimal structure local alignment. Front Genet 2014; 5:302. [PMID: 25228906 PMCID: PMC4151033 DOI: 10.3389/fgene.2014.00302] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2014] [Accepted: 08/12/2014] [Indexed: 11/13/2022] Open
Abstract
Multiple local structure comparison helps to identify common structural motifs or conserved binding sites in 3D structures in distantly related proteins. Since there is no best way to compare structures and evaluate the alignment, a wide variety of techniques and different similarity scoring schemes have been proposed. Existing algorithms usually compute the best superposition of two structures or attempt to solve it as an optimization problem in a simpler setting (e.g., considering contact maps or distance matrices). Here, we present PROPOSAL (PROteins comparison through Probabilistic Optimal Structure local ALignment), a stochastic algorithm based on iterative sampling for multiple local alignment of protein structures. Our method can efficiently find conserved motifs across a set of protein structures. Only the distances between all pairs of residues in the structures are computed. To show the accuracy and the effectiveness of PROPOSAL we tested it on a few families of protein structures. We also compared PROPOSAL with two state-of-the-art tools for pairwise local alignment on a dataset of manually annotated motifs. PROPOSAL is available as a Java 2D standalone application or a command line program at http://ferrolab.dmi.unict.it/proposal/proposal.html.
Collapse
Affiliation(s)
- Giovanni Micale
- Department of Computer Science, University of Pisa Pisa, Italy
| | - Alfredo Pulvirenti
- Department of Clinical and Molecular Biomedicine, University of Catania Catania, Italy
| | - Rosalba Giugno
- Department of Clinical and Molecular Biomedicine, University of Catania Catania, Italy
| | - Alfredo Ferro
- Department of Clinical and Molecular Biomedicine, University of Catania Catania, Italy
| |
Collapse
|
6
|
Borgo B, Havranek JJ. Motif-directed redesign of enzyme specificity. Protein Sci 2014; 23:312-20. [PMID: 24407908 PMCID: PMC3945839 DOI: 10.1002/pro.2417] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 12/29/2013] [Indexed: 11/21/2022]
Abstract
Computational protein design relies on several approximations, including the use of fixed backbones and rotamers, to reduce protein design to a computationally tractable problem. However, allowing backbone and off-rotamer flexibility leads to more accurate designs and greater conformational diversity. Exhaustive sampling of this additional conformational space is challenging, and often impossible. Here, we report a computational method that utilizes a preselected library of native interactions to direct backbone flexibility to accommodate placement of these functional contacts. Using these native interaction modules, termed motifs, improves the likelihood that the interaction can be realized, provided that suitable backbone perturbations can be identified. Furthermore, it allows a directed search of the conformational space, reducing the sampling needed to find low energy conformations. We implemented the motif-based design algorithm in Rosetta, and tested the efficacy of this method by redesigning the substrate specificity of methionine aminopeptidase. In summary, native enzymes have evolved to catalyze a wide range of chemical reactions with extraordinary specificity. Computational enzyme design seeks to generate novel chemical activities by altering the target substrates of these existing enzymes. We have implemented a novel approach to redesign the specificity of an enzyme and demonstrated its effectiveness on a model system.
Collapse
Affiliation(s)
- Benjamin Borgo
- Program in Computational and Systems Biology, Washington University in St. Louis, St. Louis, Missouri, 63110
| | | |
Collapse
|
7
|
Abstract
Building protein tools that can selectively bind or cleave specific DNA sequences requires efficient technologies for modifying protein-DNA interactions. Computational design is one method for accomplishing this goal. In this chapter, we present the current state of protein-DNA interface design with the Rosetta macromolecular modeling program. The LAGLIDADG endonuclease family of DNA-cleaving enzymes, under study as potential gene therapy reagents, has been the main testing ground for these in silico protocols. At this time, the computational methods are most useful for designing endonuclease variants that can accommodate small numbers of target site substitutions. Attempts to engineer for more extensive interface changes will likely benefit from an approach that uses the computational design results in conjunction with a high-throughput directed evolution or screening procedure. The family of enzymes presents an engineering challenge because their interfaces are highly integrated and there is significant coordination between the binding and catalysis events. Future developments in the computational algorithms depend on experimental feedback to improve understanding and modeling of these complex enzymatic features. This chapter presents both the basic method of design that has been successfully used to modulate specificity and more advanced procedures that incorporate DNA flexibility and other properties that are likely necessary for reliable modeling of more extensive target site changes.
Collapse
Affiliation(s)
- Summer Thyme
- Department of Biological Sciences, University of Washington, Seattle, WA, USA
| | | |
Collapse
|
8
|
Thyme SB, Boissel SJS, Arshiya Quadri S, Nolan T, Baker DA, Park RU, Kusak L, Ashworth J, Baker D. Reprogramming homing endonuclease specificity through computational design and directed evolution. Nucleic Acids Res 2013; 42:2564-76. [PMID: 24270794 PMCID: PMC3936771 DOI: 10.1093/nar/gkt1212] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Homing endonucleases (HEs) can be used to induce targeted genome modification to reduce the fitness of pathogen vectors such as the malaria-transmitting Anopheles gambiae and to correct deleterious mutations in genetic diseases. We describe the creation of an extensive set of HE variants with novel DNA cleavage specificities using an integrated experimental and computational approach. Using computational modeling and an improved selection strategy, which optimizes specificity in addition to activity, we engineered an endonuclease to cleave in a gene associated with Anopheles sterility and another to cleave near a mutation that causes pyruvate kinase deficiency. In the course of this work we observed unanticipated context-dependence between bases which will need to be mechanistically understood for reprogramming of specificity to succeed more generally.
Collapse
Affiliation(s)
- Summer B Thyme
- Department of Biochemistry, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA, Graduate Program in Biomolecular Structure and Design, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA, Graduate Program in Molecular and Cellular Biology, University of Washington, UW Box 357275, 1959 NE Pacific St., Seattle, WA 98195, USA, Department of Life Sciences, Sir Alexander Fleming Building, Imperial College London, Imperial College Road, London SW7 2AZ, UK, Department of Genetics, University of Cambridge, Downing Street, Cambridge CB1 3QA, UK, Institute for Systems Biology, 401 Terry Avenue N, Seattle, WA 98109, USA and Howard Hughes Medical Institute, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Murphy GS, Mills JL, Miley MJ, Machius M, Szyperski T, Kuhlman B. Increasing sequence diversity with flexible backbone protein design: the complete redesign of a protein hydrophobic core. Structure 2012; 20:1086-96. [PMID: 22632833 PMCID: PMC3372604 DOI: 10.1016/j.str.2012.03.026] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2011] [Revised: 02/15/2012] [Accepted: 03/30/2012] [Indexed: 01/07/2023]
Abstract
Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helix bundle protein. Only small perturbations to the backbone, 1-2 Å, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140°C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 Å).
Collapse
Affiliation(s)
- Grant S. Murphy
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-3290, USA
| | - Jeffrey L. Mills
- Department of Chemistry, State University of New York at Buffalo, Buffalo, NY, 14260, USA
,Northeast Structural Genomics Consortium
| | - Michael J. Miley
- Center for Structural Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Mischa Machius
- Center for Structural Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Thomas Szyperski
- Department of Chemistry, State University of New York at Buffalo, Buffalo, NY, 14260, USA
,Northeast Structural Genomics Consortium
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599-7260, USA
,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
,corresponding author. , Phone: 919-843-0188, Fax: 919-966-2852
| |
Collapse
|
10
|
Zhou AQ, O'Hern CS, Regan L. The power of hard-sphere models: explaining side-chain dihedral angle distributions of Thr and Val. Biophys J 2012; 102:2345-52. [PMID: 22677388 DOI: 10.1016/j.bpj.2012.01.061] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2011] [Revised: 01/26/2012] [Accepted: 01/27/2012] [Indexed: 10/28/2022] Open
Abstract
The energy functions used to predict protein structures typically include both molecular-mechanics and knowledge-based terms. In contrast, our approach is to develop robust physics- and geometry-based methods. Here, we investigate to what extent simple hard-sphere models can be used to predict side-chain conformations. The distributions of the side-chain dihedral angle χ(1) of Val and Thr in proteins of known structure show distinctive features: Val side chains predominantly adopt χ(1) = 180°, whereas Thr side chains typically adopt χ(1) = 60° and 300° (i.e., χ(1) = ±60° or g- and g(+) configurations). Several hypotheses have been proposed to explain these differences, including interresidue steric clashes and hydrogen-bonding interactions. In contrast, we show that the observed side-chain dihedral angle distributions for both Val and Thr can be explained using only local steric interactions in a dipeptide mimetic. Our results emphasize the power of simple physical approaches and their importance for future advances in protein engineering and design.
Collapse
Affiliation(s)
- Alice Qinhua Zhou
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, USA
| | | | | |
Collapse
|
11
|
Thyme SB, Baker D, Bradley P. Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design. J Mol Biol 2012; 419:255-74. [PMID: 22426128 DOI: 10.1016/j.jmb.2012.03.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Revised: 02/09/2012] [Accepted: 03/09/2012] [Indexed: 12/30/2022]
Abstract
Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent.
Collapse
Affiliation(s)
- Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
| | | | | |
Collapse
|
12
|
Increased Diels-Alderase activity through backbone remodeling guided by Foldit players. Nat Biotechnol 2012; 30:190-2. [PMID: 22267011 PMCID: PMC3566767 DOI: 10.1038/nbt.2109] [Citation(s) in RCA: 220] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 01/04/2012] [Indexed: 12/03/2022]
Abstract
Computational enzyme design holds promise for the production of renewable fuels, drugs and chemicals. De novo enzyme design has generated catalysts for several reactions, but with lower catalytic efficiencies than naturally occurring enzymes. Here we report the use of game-driven crowdsourcing to enhance the activity of a computationally designed enzyme through the functional remodeling of its structure. Players of the online game Foldit were challenged to remodel the backbone of a computationally designed bimolecular Diels-Alderase to enable additional interactions with substrates. Several iterations of design and characterization generated a 24-residue helix-turn-helix motif, including a 13-residue insertion, that increased enzyme activity >18-fold. X-ray crystallography showed that the large insertion adopts a helix-turn-helix structure positioned as in the Foldit model. These results demonstrate that human creativity can extend beyond the macroscopic challenges encountered in everyday life to molecular-scale design problems.
Collapse
|
13
|
Fleishman SJ, Corn JE, Strauch EM, Whitehead TA, Karanicolas J, Baker D. Hotspot-centric de novo design of protein binders. J Mol Biol 2011; 413:1047-62. [PMID: 21945116 DOI: 10.1016/j.jmb.2011.09.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2011] [Revised: 08/31/2011] [Accepted: 09/02/2011] [Indexed: 11/24/2022]
Abstract
Protein-protein interactions play critical roles in biology, and computational design of interactions could be useful in a range of applications. We describe in detail a general approach to de novo design of protein interactions based on computed, energetically optimized interaction hotspots, which was recently used to produce high-affinity binders of influenza hemagglutinin. We present several alternative approaches to identify and build the key hotspot interactions within both core secondary structural elements and variable loop regions and evaluate the method's performance in natural-interface recapitulation. We show that the method generates binding surfaces that are more conformationally restricted than previous design methods, reducing opportunities for off-target interactions.
Collapse
Affiliation(s)
- Sarel J Fleishman
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | | | | | | | | | | |
Collapse
|
14
|
Huang PS, Ban YEA, Richter F, Andre I, Vernon R, Schief WR, Baker D. RosettaRemodel: a generalized framework for flexible backbone protein design. PLoS One 2011; 6:e24109. [PMID: 21909381 PMCID: PMC3166072 DOI: 10.1371/journal.pone.0024109] [Citation(s) in RCA: 240] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 07/29/2011] [Indexed: 12/12/2022] Open
Abstract
We describe RosettaRemodel, a generalized framework for flexible protein design that provides a versatile and convenient interface to the Rosetta modeling suite. RosettaRemodel employs a unified interface, called a blueprint, which allows detailed control over many aspects of flexible backbone protein design calculations. RosettaRemodel allows the construction and elaboration of customized protocols for a wide range of design problems ranging from loop insertion and deletion, disulfide engineering, domain assembly, loop remodeling, motif grafting, symmetrical units, to de novo structure modeling.
Collapse
Affiliation(s)
- Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Yih-En Andrew Ban
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
| | - Florian Richter
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- Interdisciplinary Program in Biomolecular Structure and Design, University of Washington, Seattle, Washington, United States of America
| | - Ingemar Andre
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Robert Vernon
- Program in Molecular Structure and Function, Hospital for Sick Children, Toronto, Ontario, Canada
| | - William R. Schief
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- * E-mail: (WRS); (DB)
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington, United States of America
- * E-mail: (WRS); (DB)
| |
Collapse
|
15
|
Pantazes RJ, Grisewood MJ, Maranas CD. Recent advances in computational protein design. Curr Opin Struct Biol 2011; 21:467-72. [PMID: 21600758 DOI: 10.1016/j.sbi.2011.04.005] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 04/28/2011] [Indexed: 11/30/2022]
Affiliation(s)
- Robert J Pantazes
- The Pennsylvania State University, Department of Chemical Engineering, 112 Fenske Lab, University Park, PA 16802, USA
| | | | | |
Collapse
|
16
|
RosettaScripts: a scripting language interface to the Rosetta macromolecular modeling suite. PLoS One 2011; 6:e20161. [PMID: 21731610 PMCID: PMC3123292 DOI: 10.1371/journal.pone.0020161] [Citation(s) in RCA: 425] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2011] [Accepted: 04/12/2011] [Indexed: 11/19/2022] Open
Abstract
Macromolecular modeling and design are increasingly useful in basic research, biotechnology, and teaching. However, the absence of a user-friendly modeling framework that provides access to a wide range of modeling capabilities is hampering the wider adoption of computational methods by non-experts. RosettaScripts is an XML-like language for specifying modeling tasks in the Rosetta framework. RosettaScripts provides access to protocol-level functionalities, such as rigid-body docking and sequence redesign, and allows fast testing and deployment of complex protocols without need for modifying or recompiling the underlying C++ code. We illustrate these capabilities with RosettaScripts protocols for the stabilization of proteins, the generation of computationally constrained libraries for experimental selection of higher-affinity binding proteins, loop remodeling, small-molecule ligand docking, design of ligand-binding proteins, and specificity redesign in DNA-binding proteins.
Collapse
|
17
|
Richter F, Leaver-Fay A, Khare SD, Bjelic S, Baker D. De novo enzyme design using Rosetta3. PLoS One 2011; 6:e19230. [PMID: 21603656 PMCID: PMC3095599 DOI: 10.1371/journal.pone.0019230] [Citation(s) in RCA: 221] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2011] [Accepted: 03/23/2011] [Indexed: 11/30/2022] Open
Abstract
The Rosetta de novo enzyme design protocol has been used to design enzyme
catalysts for a variety of chemical reactions, and in principle can be applied
to any arbitrary chemical reaction of interest, The process has four stages: 1)
choice of a catalytic mechanism and corresponding minimal model active site, 2)
identification of sites in a set of scaffold proteins where this minimal active
site can be realized, 3) optimization of the identities of the surrounding
residues for stabilizing interactions with the transition state and primary
catalytic residues, and 4) evaluation and ranking the resulting designed
sequences. Stages two through four of this process can be carried out with the
Rosetta package, while stage one needs to be done externally. Here, we
demonstrate how to carry out the Rosetta enzyme design protocol from start to
end in detail using for illustration the triosephosphate isomerase reaction.
Collapse
Affiliation(s)
- Florian Richter
- Department of Biochemistry, University of Washington, Seattle, Washington, United States of America.
| | | | | | | | | |
Collapse
|
18
|
Aubert M, Ryu BY, Banks L, Rawlings DJ, Scharenberg AM, Jerome KR. Successful targeting and disruption of an integrated reporter lentivirus using the engineered homing endonuclease Y2 I-AniI. PLoS One 2011; 6:e16825. [PMID: 21399673 PMCID: PMC3036713 DOI: 10.1371/journal.pone.0016825] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2010] [Accepted: 01/11/2011] [Indexed: 11/19/2022] Open
Abstract
Current antiviral therapy does not cure HIV-infected individuals because the virus establishes lifelong latent infection within long-lived memory T cells as integrated HIV proviral DNA. Here, we report a new therapeutic approach that aims to cure cells of latent HIV infection by rendering latent virus incapable of replication and pathogenesis via targeted cellular mutagenesis of essential viral genes. This is achieved by using a homing endonuclease to introduce DNA double-stranded breaks (dsb) within the integrated proviral DNA, which is followed by triggering of the cellular DNA damage response and error-prone repair. To evaluate this concept, we developed an in vitro culture model of viral latency, consisting of an integrated lentiviral vector with an easily evaluated reporter system to detect targeted mutagenesis events. Using this system, we demonstrate that homing endonucleases can efficiently and selectively target an integrated reporter lentivirus within the cellular genome, leading to mutation in the proviral DNA and loss of reporter gene expression. This new technology offers the possibility of selectively disabling integrated HIV provirus within latently infected cells.
Collapse
Affiliation(s)
- Martine Aubert
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Byoung Y. Ryu
- Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, Washington, United States of America
| | - Lindsey Banks
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - David J. Rawlings
- Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, Washington, United States of America
| | - Andrew M. Scharenberg
- Department of Pediatrics, University of Washington and Seattle Children's Hospital, Seattle, Washington, United States of America
| | - Keith R. Jerome
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- Department of Laboratory Medicine, University of Washington, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
19
|
Lassila JK. Conformational diversity and computational enzyme design. Curr Opin Chem Biol 2010; 14:676-82. [PMID: 20829099 PMCID: PMC2953567 DOI: 10.1016/j.cbpa.2010.08.010] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Revised: 08/06/2010] [Accepted: 08/06/2010] [Indexed: 11/22/2022]
Abstract
The application of computational protein design methods to the design of enzyme active sites offers potential routes to new catalysts and new reaction specificities. Computational design methods have typically treated the protein backbone as a rigid structure for the sake of computational tractability. However, this fixed-backbone approximation introduces its own special challenges for enzyme design and it contrasts with an emerging picture of natural enzymes as dynamic ensembles with multiple conformations and motions throughout a reaction cycle. This review considers the impact of conformational variation and dynamics on computational enzyme design and it highlights new approaches to addressing protein conformational diversity in enzyme design including recent advances in multi-state design, backbone flexibility, and computational library design.
Collapse
Affiliation(s)
- Jonathan K Lassila
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
20
|
Barakat NH, Barakat NH, Love JJ. Combined use of experimental and computational screens to characterize protein stability. Protein Eng Des Sel 2010; 23:799-807. [PMID: 20805093 DOI: 10.1093/protein/gzq052] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
One of the primary goals of protein design is to engineer proteins with improved stability. Protein stability is a key issue for chemical, biotechnology and pharmaceutical industries. The development of robust proteins/enzymes with the ability to withstand the potentially harsh conditions of industrial operations is of high importance. A number of strategies are currently being employed to achieve this goal. Two particular approaches, (i) directed evolution and (ii) computational protein design, are quite powerful yet have only recently been combined or applied and analyzed in parallel. In directed evolution, libraries of variants are searched experimentally for clones possessing the desired properties. With computational methods, protein design algorithms are utilized to perform in silico screening for stable protein sequences. Here, we used gene libraries of an unstable variant of streptococcal protein G (Gbeta1) and an in vivo screening method to identify stabilized variants. Many variants with notably increased thermal stabilities were isolated and characterized. Concomitantly, computational techniques and protein design algorithms were used to perform in silico screening of the same destabilized variant of Gbeta1. The combined use, and critical analysis, of these methods promises to advance the field of protein design.
Collapse
Affiliation(s)
- Nora H Barakat
- Department of Chemistry and Biochemistry, San Diego State University, 5500 Campanile Dr, San Diego, CA 92182-1030, USA
| | | | | |
Collapse
|
21
|
Abstract
A long-standing goal of computational protein design is to create proteins similar to those found in Nature. One motivation is to harness the exquisite functional capabilities of proteins for our own purposes. The extent of similarity between designed and natural proteins also reports on how faithfully our models represent the selective pressures that determine protein sequences. As the field of protein design shifts emphasis from reproducing native-like protein structure to function, it has become important that these models treat the notion of specificity in molecular interactions. Although specificity may, in some cases, be achieved by optimization of a desired protein in isolation, methods have been developed to address directly the desire for proteins that exhibit specific functions and interactions.
Collapse
Affiliation(s)
- James J Havranek
- Department of Genetics, Washington University School of Medicine, St Louis, Missouri 63110, USA.
| |
Collapse
|
22
|
Rzepiela AJ, Schäfer LV, Goga N, Risselada HJ, De Vries AH, Marrink SJ. Reconstruction of atomistic details from coarse-grained structures. J Comput Chem 2010; 31:1333-43. [PMID: 20087907 DOI: 10.1002/jcc.21415] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
We present an algorithm to reconstruct atomistic structures from their corresponding coarse-grained (CG) representations and its implementation into the freely available molecular dynamics (MD) program package GROMACS. The central part of the algorithm is a simulated annealing MD simulation in which the CG and atomistic structures are coupled via restraints. A number of examples demonstrate the application of the reconstruction procedure to obtain low-energy atomistic structural ensembles from their CG counterparts. We reconstructed individual molecules in vacuo (NCQ tripeptide, dipalmitoylphosphatidylcholine, and cholesterol), bulk water, and a WALP transmembrane peptide embedded in a solvated lipid bilayer. The first examples serve to optimize the parameters for the reconstruction procedure, whereas the latter examples illustrate the applicability to condensed-phase biomolecular systems.
Collapse
Affiliation(s)
- Andrzej J Rzepiela
- Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Nijenborgh 4, 9747 AG Groningen, The Netherlands
| | | | | | | | | | | |
Collapse
|
23
|
Abstract
Predictive methods for the computational design of proteins search for amino acid sequences adopting desired structures that perform specific functions. Typically, design of 'function' is formulated as engineering new and altered binding activities into proteins. Progress in the design of functional protein-protein interactions is directed toward engineering proteins to precisely control biological processes by specifically recognizing desired interaction partners while avoiding competitors. The field is aiming for strategies to harness recent advances in high-resolution computational modeling-particularly those exploiting protein conformational variability-to engineer new functions and incorporate many functional requirements simultaneously.
Collapse
Affiliation(s)
- Daniel J Mandell
- Graduate Program in Bioinformatics and Computational Biology, California Institute for Quantitative Biosciences, and Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, USA
| | | |
Collapse
|
24
|
Murphy PM, Bolduc JM, Gallaher JL, Stoddard BL, Baker D. Alteration of enzyme specificity by computational loop remodeling and design. Proc Natl Acad Sci U S A 2009; 106:9215-20. [PMID: 19470646 PMCID: PMC2685249 DOI: 10.1073/pnas.0811070106] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2008] [Indexed: 11/18/2022] Open
Abstract
Altering the specificity of an enzyme requires precise positioning of side-chain functional groups that interact with the modified groups of the new substrate. This requires not only sequence changes that introduce the new functional groups but also sequence changes that remodel the structure of the protein backbone so that the functional groups are properly positioned. We describe a computational design method for introducing specific enzyme-substrate interactions by directed remodeling of loops near the active site. Benchmark tests on 8 native protein-ligand complexes show that the method can recover native loop lengths and, often, native loop conformations. We then use the method to redesign a critical loop in human guanine deaminase such that a key side-chain interaction is made with the substrate ammelide. The redesigned enzyme is 100-fold more active on ammelide and 2.5e4-fold less active on guanine than wild-type enzyme: The net change in specificity is 2.5e6-fold. The structure of the designed protein was confirmed by X-ray crystallographic analysis: The remodeled loop adopts a conformation that is within 1-A Calpha RMSD of the computational model.
Collapse
Affiliation(s)
- Paul M. Murphy
- Department of Biochemistry
- Molecular and Cellular Biology Program
- Medical Scientist Training Program, University of Washington, Seattle, WA 98195
| | - Jill M. Bolduc
- Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | | | | | - David Baker
- Department of Biochemistry
- Howard Hughes Medical Institute, Seattle, WA 98195; and
| |
Collapse
|