1
|
Zhou X, Chen G, Ye J, Wang E, Zhang J, Mao C, Li Z, Hao J, Huang X, Tang J, Heng PA. ProRefiner: an entropy-based refining strategy for inverse protein folding with global graph attention. Nat Commun 2023; 14:7434. [PMID: 37973874 PMCID: PMC10654420 DOI: 10.1038/s41467-023-43166-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 11/02/2023] [Indexed: 11/19/2023] Open
Abstract
Inverse Protein Folding (IPF) is an important task of protein design, which aims to design sequences compatible with a given backbone structure. Despite the prosperous development of algorithms for this task, existing methods tend to rely on noisy predicted residues located in the local neighborhood when generating sequences. To address this limitation, we propose an entropy-based residue selection method to remove noise in the input residue context. Additionally, we introduce ProRefiner, a memory-efficient global graph attention model to fully utilize the denoised context. Our proposed method achieves state-of-the-art performance on multiple sequence design benchmarks in different design settings. Furthermore, we demonstrate the applicability of ProRefiner in redesigning Transposon-associated transposase B, where six out of the 20 variants we propose exhibit improved gene editing activity.
Collapse
Affiliation(s)
- Xinyi Zhou
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong, China
| | | | - Junjie Ye
- Noah's Ark Lab, Huawei, Shenzhen, China
| | - Ercheng Wang
- Zhejiang Lab, Kechuang Avenue, Hangzhou, China
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Jun Zhang
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, China
| | - Cong Mao
- State Key Laboratory of Reproductive Medicine, Nanjing Medical University, Nanjing, China
| | - Zhanwei Li
- Zhejiang Lab, Kechuang Avenue, Hangzhou, China
| | | | | | - Jin Tang
- Zhejiang Lab, Kechuang Avenue, Hangzhou, China
| | - Pheng Ann Heng
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, Central Ave, Hong Kong, China
- Zhejiang Lab, Kechuang Avenue, Hangzhou, China
| |
Collapse
|
2
|
Sunsetting Binding MOAD with its last data update and the addition of 3D-ligand polypharmacology tools. Sci Rep 2023; 13:3008. [PMID: 36810894 PMCID: PMC9944886 DOI: 10.1038/s41598-023-29996-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 02/14/2023] [Indexed: 02/24/2023] Open
Abstract
Binding MOAD is a database of protein-ligand complexes and their affinities with many structured relationships across the dataset. The project has been in development for over 20 years, but now, the time has come to bring it to a close. Currently, the database contains 41,409 structures with affinity coverage for 15,223 (37%) complexes. The website BindingMOAD.org provides numerous tools for polypharmacology exploration. Current relationships include links for structures with sequence similarity, 2D ligand similarity, and binding-site similarity. In this last update, we have added 3D ligand similarity using ROCS to identify ligands which may not necessarily be similar in two dimensions but can occupy the same three-dimensional space. For the 20,387 different ligands present in the database, a total of 1,320,511 3D-shape matches between the ligands were added. Examples of the utility of 3D-shape matching in polypharmacology are presented. Finally, plans for future access to the project data are outlined.
Collapse
|
3
|
Jatzlau J, Burdzinski W, Trumpp M, Obendorf L, Roßmann K, Ravn K, Hyvönen M, Bottanelli F, Broichhagen J, Knaus P. A versatile Halo- and SNAP-tagged BMP/TGFβ receptor library for quantification of cell surface ligand binding. Commun Biol 2023; 6:34. [PMID: 36635368 PMCID: PMC9837045 DOI: 10.1038/s42003-022-04388-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 12/20/2022] [Indexed: 01/14/2023] Open
Abstract
TGFβs, BMPs and Activins regulate numerous developmental and homeostatic processes and signal through hetero-tetrameric receptor complexes composed of two types of serine/threonine kinase receptors. Each of the 33 different ligands possesses unique affinities towards specific receptor types. However, the lack of specific tools hampered simultaneous testing of ligand binding towards all BMP/TGFβ receptors. Here we present a N-terminally Halo- and SNAP-tagged TGFβ/BMP receptor library to visualize receptor complexes in dual color. In combination with fluorescently labeled ligands, we established a Ligand Surface Binding Assay (LSBA) for optical quantification of receptor-dependent ligand binding in a cellular context. We highlight that LSBA is generally applicable to test (i) binding of different ligands such as Activin A, TGFβ1 and BMP9, (ii) for mutant screens and (iii) evolutionary comparisons. This experimental set-up opens opportunities for visualizing ligand-receptor binding dynamics, essential to determine signaling specificity and is easily adaptable for other receptor signaling pathways.
Collapse
Affiliation(s)
- Jerome Jatzlau
- Institute of Chemistry and Biochemistry - Biochemistry, Berlin, Germany
| | - Wiktor Burdzinski
- Institute of Chemistry and Biochemistry - Biochemistry, Berlin, Germany
- Berlin-Brandenburg School for Regenerative Therapies (BSRT), Berlin, Germany
| | - Michael Trumpp
- Institute of Chemistry and Biochemistry - Biochemistry, Berlin, Germany
| | - Leon Obendorf
- Institute of Chemistry and Biochemistry - Biochemistry, Berlin, Germany
| | - Kilian Roßmann
- Leibniz-Forschungsinstitut für Molekulare Pharmakologie, Berlin, Germany
| | - Katharina Ravn
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Marko Hyvönen
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | | | - Petra Knaus
- Institute of Chemistry and Biochemistry - Biochemistry, Berlin, Germany.
- Berlin-Brandenburg School for Regenerative Therapies (BSRT), Berlin, Germany.
| |
Collapse
|
4
|
Ashworth MA, Bombino E, de Jong RM, Wijma HJ, Janssen DB, McLean KJ, Munro AW. Computation-Aided Engineering of Cytochrome P450 for the Production of Pravastatin. ACS Catal 2022; 12:15028-15044. [PMID: 36570080 PMCID: PMC9764288 DOI: 10.1021/acscatal.2c03974] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 10/22/2022] [Indexed: 11/29/2022]
Abstract
CYP105AS1 is a cytochrome P450 from Amycolatopsis orientalis that catalyzes monooxygenation of compactin to 6-epi-pravastatin. For fermentative production of the cholesterol-lowering drug pravastatin, the stereoselectivity of the enzyme needs to be inverted, which has been partially achieved by error-prone PCR mutagenesis and screening. In the current study, we report further optimization of the stereoselectivity by a computationally aided approach. Using the CoupledMoves protocol of Rosetta, a virtual library of mutants was designed to bind compactin in a pro-pravastatin orientation. By examining the frequency of occurrence of beneficial substitutions and rational inspection of their interactions, a small set of eight mutants was predicted to show the desired selectivity and these variants were tested experimentally. The best CYP105AS1 variant gave >99% stereoselective hydroxylation of compactin to pravastatin, with complete elimination of the unwanted 6-epi-pravastatin diastereomer. The enzyme-substrate complexes were also examined by ultrashort molecular dynamics simulations of 50 × 100 ps and 5 × 22 ns, which revealed that the frequency of occurrence of near-attack conformations agreed with the experimentally observed stereoselectivity. These results show that a combination of computational methods and rational inspection could improve CYP105AS1 stereoselectivity beyond what was obtained by directed evolution. Moreover, the work lays out a general in silico framework for specificity engineering of enzymes of known structure.
Collapse
Affiliation(s)
- Mark A. Ashworth
- Manchester
Institute of Biotechnology, School of Chemistry, The University of Manchester, Manchester M1 7DN, United Kingdom
| | - Elvira Bombino
- Department
of Biochemistry, Groningen Biomolecular Sciences and Biotechnology
Institute, University of Groningen, Nijenborgh 4, Groningen 9747 AG, Netherlands
| | - René M. de Jong
- DSM
Food & Beverage, Alexander Fleminglaan 1, 2613 AX Delft, the Netherlands
| | - Hein J. Wijma
- Department
of Biochemistry, Groningen Biomolecular Sciences and Biotechnology
Institute, University of Groningen, Nijenborgh 4, Groningen 9747 AG, Netherlands
| | - Dick B. Janssen
- Department
of Biochemistry, Groningen Biomolecular Sciences and Biotechnology
Institute, University of Groningen, Nijenborgh 4, Groningen 9747 AG, Netherlands,
| | - Kirsty J. McLean
- Manchester
Institute of Biotechnology, School of Chemistry, The University of Manchester, Manchester M1 7DN, United Kingdom,Department
of Biological and Geographical Sciences, School of Applied Sciences, University of Huddersfield, Huddersfield HD1 3DH, United Kingdom
| | - Andrew W. Munro
- Manchester
Institute of Biotechnology, School of Chemistry, The University of Manchester, Manchester M1 7DN, United Kingdom
| |
Collapse
|
5
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
6
|
Koehler Leman J, Lyskov S, Lewis SM, Adolf-Bryfogle J, Alford RF, Barlow K, Ben-Aharon Z, Farrell D, Fell J, Hansen WA, Harmalkar A, Jeliazkov J, Kuenze G, Krys JD, Ljubetič A, Loshbaugh AL, Maguire J, Moretti R, Mulligan VK, Nance ML, Nguyen PT, Ó Conchúir S, Roy Burman SS, Samanta R, Smith ST, Teets F, Tiemann JKS, Watkins A, Woods H, Yachnin BJ, Bahl CD, Bailey-Kellogg C, Baker D, Das R, DiMaio F, Khare SD, Kortemme T, Labonte JW, Lindorff-Larsen K, Meiler J, Schief W, Schueler-Furman O, Siegel JB, Stein A, Yarov-Yarovoy V, Kuhlman B, Leaver-Fay A, Gront D, Gray JJ, Bonneau R. Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks. Nat Commun 2021; 12:6947. [PMID: 34845212 PMCID: PMC8630030 DOI: 10.1038/s41467-021-27222-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 11/02/2021] [Indexed: 01/14/2023] Open
Abstract
Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Steven M Lewis
- Cyrus Biotechnology, 1201 Second Ave, Suite 900, Seattle, WA, 98101, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kyle Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Daniel Farrell
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Jason Fell
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - William A Hansen
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Jeliazko Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - Justyna D Krys
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Phuong T Nguyen
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Shane Ó Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Shannon T Smith
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Frank Teets
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Johanna K S Tiemann
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Hope Woods
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Brahm J Yachnin
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA, 02115, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Sagar D Khare
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - William Schief
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Justin B Siegel
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Brian Kuhlman
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Andrew Leaver-Fay
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
- Department of Computer Science, New York University, New York, NY, 10003, USA.
| |
Collapse
|
7
|
Loeffler FF, Viana IFT, Fischer N, Coêlho DF, Silva CS, Purificação AF, Araújo CMCS, Leite BHS, Durães-Carvalho R, Magalhães T, Morais CNL, Cordeiro MT, Lins RD, Marques ETA, Jaenisch T. Identification of a Zika NS2B epitope as a biomarker for severe clinical phenotypes. RSC Med Chem 2021; 12:1525-1539. [PMID: 34671736 DOI: 10.1039/d1md00124h] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 06/17/2021] [Indexed: 01/04/2023] Open
Abstract
The identification of specific biomarkers for Zika infection and its clinical complications is fundamental to mitigate the infection spread, which has been associated with a broad range of neurological sequelae. We present the characterization of antibody responses in serum samples from individuals infected with Zika, presenting non-severe (classical) and severe (neurological disease) phenotypes, with high-density peptide arrays comprising the Zika NS1 and NS2B proteins. The data pinpoints one strongly IgG-targeted NS2B epitope in non-severe infections, which is absent in Zika patients, where infection progressed to the severe phenotype. This differential IgG profile between the studied groups was confirmed by multivariate data analysis. Molecular dynamics simulations and circular dichroism have shown that the peptide in solution presents itself in a sub-optimal conformation for antibody recognition, which led us to computationally engineer an artificial protein able to stabilize the NS2B epitope structure. The engineered protein was used to interrogate paired samples from mothers and their babies presenting Zika-associated microcephaly and confirmed the absence of NS2B IgG response in those samples. These findings suggest that the assessment of antibody responses to the herein identified NS2B epitope is a strong candidate biomarker for the diagnosis and prognosis of Zika-associated neurological disease.
Collapse
Affiliation(s)
- Felix F Loeffler
- Max Planck Institute of Colloids and Interfaces, Department of Biomolecular Systems Potsdam Germany
| | - Isabelle F T Viana
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Nico Fischer
- Section Clinical Tropical Medicine, Department of Infectious Diseases, Heidelberg University Hospital Germany
| | - Danilo F Coêlho
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil.,Department of Fundamental Chemistry, Federal University of Pernambuco Recife PE Brazil
| | - Carolina S Silva
- Department of Chemical Engineering, Federal University of Pernambuco Recife PE Brazil
| | - Antônio F Purificação
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Catarina M C S Araújo
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Bruno H S Leite
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | | | - Tereza Magalhães
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Clarice N L Morais
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Marli T Cordeiro
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Roberto D Lins
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil
| | - Ernesto T A Marques
- Department of Virology, Aggeu Magalhães Institute, Oswaldo Cruz Foundation Recife PE Brazil.,Department of Infectious Diseases and Microbiology, University of Pittsburgh Pittsburgh PA USA
| | - Thomas Jaenisch
- Section Clinical Tropical Medicine, Department of Infectious Diseases, Heidelberg University Hospital Germany .,German Centre for Infection Research (DZIF) Heidelberg Site Heidelberg Germany
| |
Collapse
|
8
|
Nazet J, Lang E, Merkl R. Rosetta:MSF:NN: Boosting performance of multi-state computational protein design with a neural network. PLoS One 2021; 16:e0256691. [PMID: 34437621 PMCID: PMC8389498 DOI: 10.1371/journal.pone.0256691] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 08/12/2021] [Indexed: 12/05/2022] Open
Abstract
Rational protein design aims at the targeted modification of existing proteins. To reach this goal, software suites like Rosetta propose sequences to introduce the desired properties. Challenging design problems necessitate the representation of a protein by means of a structural ensemble. Thus, Rosetta multi-state design (MSD) protocols have been developed wherein each state represents one protein conformation. Computational demands of MSD protocols are high, because for each of the candidate sequences a costly three-dimensional (3D) model has to be created and assessed for all states. Each of these scores contributes one data point to a complex, design-specific energy landscape. As neural networks (NN) proved well-suited to learn such solution spaces, we integrated one into the framework Rosetta:MSF instead of the so far used genetic algorithm with the aim to reduce computational costs. As its predecessor, Rosetta:MSF:NN administers a set of candidate sequences and their scores and scans sequence space iteratively. During each iteration, the union of all candidate sequences and their Rosetta scores are used to re-train NNs that possess a design-specific architecture. The enormous speed of the NNs allows an extensive assessment of alternative sequences, which are ranked on the scores predicted by the NN. Costly 3D models are computed only for a small fraction of best-scoring sequences; these and the corresponding 3D-based scores replace half of the candidate sequences during each iteration. The analysis of two sets of candidate sequences generated for a specific design problem by means of a genetic algorithm confirmed that the NN predicted 3D-based scores quite well; the Pearson correlation coefficient was at least 0.95. Applying Rosetta:MSF:NN:enzdes to a benchmark consisting of 16 ligand-binding problems showed that this protocol converges ten-times faster than the genetic algorithm and finds sequences with comparable scores.
Collapse
Affiliation(s)
- Julian Nazet
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Elmar Lang
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
- * E-mail:
| |
Collapse
|
9
|
Bouchiba Y, Cortés J, Schiex T, Barbe S. Molecular flexibility in computational protein design: an algorithmic perspective. Protein Eng Des Sel 2021; 34:6271252. [PMID: 33959778 DOI: 10.1093/protein/gzab011] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 03/12/2021] [Accepted: 03/29/2021] [Indexed: 12/19/2022] Open
Abstract
Computational protein design (CPD) is a powerful technique for engineering new proteins, with both great fundamental implications and diverse practical interests. However, the approximations usually made for computational efficiency, using a single fixed backbone and a discrete set of side chain rotamers, tend to produce rigid and hyper-stable folds that may lack functionality. These approximations contrast with the demonstrated importance of molecular flexibility and motions in a wide range of protein functions. The integration of backbone flexibility and multiple conformational states in CPD, in order to relieve the inaccuracies resulting from these simplifications and to improve design reliability, are attracting increased attention. However, the greatly increased search space that needs to be explored in these extensions defines extremely challenging computational problems. In this review, we outline the principles of CPD and discuss recent effort in algorithmic developments for incorporating molecular flexibility in the design process.
Collapse
Affiliation(s)
- Younes Bouchiba
- Toulouse Biotechnology Institute, TBI, CNRS, INRAE, INSA, ANITI, Toulouse 31400, France.,Laboratoire d'Analyse et d'Architecture des Systèmes, LAAS CNRS, Université de Toulouse, CNRS, Toulouse 31400, France
| | - Juan Cortés
- Laboratoire d'Analyse et d'Architecture des Systèmes, LAAS CNRS, Université de Toulouse, CNRS, Toulouse 31400, France
| | - Thomas Schiex
- Université de Toulouse, ANITI, INRAE, UR MIAT, F-31320, Castanet-Tolosan, France
| | - Sophie Barbe
- Toulouse Biotechnology Institute, TBI, CNRS, INRAE, INSA, ANITI, Toulouse 31400, France
| |
Collapse
|
10
|
Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RYR, Watkins A, Zimmerman L, Bonneau R. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020; 17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Citation(s) in RCA: 434] [Impact Index Per Article: 108.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022]
Abstract
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
| | - Brian D Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Steven M Lewis
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Biochemistry, Duke University, Durham, NC, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Nawsad Alam
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Melanie Aprahamian
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kyle A Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Patrick Barth
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Benjamin Basanta
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Biological Physics Structure and Design PhD Program, University of Washington, Seattle, WA, USA
| | - Brian J Bender
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Kristin Blacklock
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Scott E Boyken
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Phil Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Chris Bystroff
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, USA
| | - Patrick Conway
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lorna Dsilva
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Alexander S Ford
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brandon Frenz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Darwin Y Fu
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Sharon Guffy
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott Horowitz
- Department of Chemistry & Biochemistry, University of Denver, Denver, CO, USA
- The Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Tim M Jacobs
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - David K Johnson
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - John Karanicolas
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Hamed Khakzad
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
| | - Karen R Khar
- Cyrus Biotechnology, Seattle, WA, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Sagar D Khare
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth, MA, USA
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Indigo C King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Robert Kleffner
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Daisuke Kuroda
- Medical Device Development and Regulation Research Center, School of Engineering, University of Tokyo, Tokyo, Japan
- Department of Bioengineering, School of Engineering, University of Tokyo, Tokyo, Japan
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Chemistry, Franklin & Marshall College, Lancaster, PA, USA
| | - Jason K Lai
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Gideon Lapidoth
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - Thomas Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nir London
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Joseph H Lubin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Lars Malmström
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine Barcelona, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Orly Marcu
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nicholas A Marze
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Departments of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Santrupti Nerli
- Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Christoffer Norn
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Shane Ó'Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Michael S Pacella
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ryan E Pavlovicz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Manasi Pethe
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Barak Raveh
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - P Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Aliza Rubenstein
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Marion F Sauer
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yuval Sedan
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alexander M Sevy
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Lei Shi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justin B Siegel
- Department of Chemistry, University of California, Davis, Davis, CA, USA
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, California, USA
- Genome Center, University of California, Davis, Davis, CA, USA
| | | | - Shannon Smith
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Amelie Stein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Maria Szegedy
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Frank D Teets
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ray Yu-Ruei Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Lior Zimmerman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
- Department of Computer Science, New York University, New York, NY, USA.
- Center for Data Science, New York University, New York, NY, USA.
| |
Collapse
|
11
|
Weinstein J, Khersonsky O, Fleishman SJ. Practically useful protein-design methods combining phylogenetic and atomistic calculations. Curr Opin Struct Biol 2020; 63:58-64. [PMID: 32505941 DOI: 10.1016/j.sbi.2020.04.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2020] [Accepted: 04/06/2020] [Indexed: 12/11/2022]
Abstract
Our ability to design new or improved biomolecular activities depends on understanding the sequence-function relationships in proteins. The large size and fold complexity of most proteins, however, obscure these relationships, and protein-optimization methods continue to rely on laborious experimental iterations. Recently, a deeper understanding of the roles of stability-threshold effects and biomolecular epistasis in proteins has led to the development of hybrid methods that combine phylogenetic analysis with atomistic design calculations. These methods enable reliable and even single-step optimization of protein stability, expressibility, and activity in proteins that were considered outside the scope of computational design. Furthermore, ancestral-sequence reconstruction produces insights on missing links in the evolution of enzymes and binders that may be used in protein design. Through the combination of phylogenetic and atomistic calculations, the long-standing goal of general computational methods that can be universally applied to study and optimize proteins finally seems within reach.
Collapse
Affiliation(s)
- Jonathan Weinstein
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Olga Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel.
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel.
| |
Collapse
|
12
|
Arabnejad H, Bombino E, Colpa DI, Jekel PA, Trajkovic M, Wijma HJ, Janssen DB. Computational Design of Enantiocomplementary Epoxide Hydrolases for Asymmetric Synthesis of Aliphatic and Aromatic Diols. Chembiochem 2020; 21:1893-1904. [PMID: 31961471 PMCID: PMC7383614 DOI: 10.1002/cbic.201900726] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 01/16/2020] [Indexed: 12/13/2022]
Abstract
The use of enzymes in preparative biocatalysis often requires tailoring enzyme selectivity by protein engineering. Herein we explore the use of computational library design and molecular dynamics simulations to create variants of limonene epoxide hydrolase that produce enantiomeric diols from meso‐epoxides. Three substrates of different sizes were targeted: cis‐2,3‐butene oxide, cyclopentene oxide, and cis‐stilbene oxide. Most of the 28 designs tested were active and showed the predicted enantioselectivity. Excellent enantioselectivities were obtained for the bulky substrate cis‐stilbene oxide, and enantiocomplementary mutants produced (S,S)‐ and (R,R)‐stilbene diol with >97 % enantiomeric excess. An (R,R)‐selective mutant was used to prepare (R,R)‐stilbene diol with high enantiopurity (98 % conversion into diol, >99 % ee). Some variants displayed higher catalytic rates (kcat) than the original enzyme, but in most cases KM values increased as well. The results demonstrate the feasibility of computational design and screening to engineer enantioselective epoxide hydrolase variants with very limited laboratory screening.
Collapse
Affiliation(s)
- Hesam Arabnejad
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Elvira Bombino
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Dana I. Colpa
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Peter A. Jekel
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Milos Trajkovic
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Hein J. Wijma
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| | - Dick B. Janssen
- Biotransformation and Biocatalysis, Groningen Biomolecular Sciences and Biotechnology InstituteUniversity of GroningenNijenborgh 49747 AGGroningenThe Netherlands
| |
Collapse
|
13
|
Ford AS, Weitzner BD, Bahl CD. Integration of the Rosetta suite with the python software stack via reproducible packaging and core programming interfaces for distributed simulation. Protein Sci 2019; 29:43-51. [PMID: 31495995 DOI: 10.1002/pro.3721] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 08/30/2019] [Accepted: 09/03/2019] [Indexed: 01/03/2023]
Abstract
The Rosetta software suite for macromolecular modeling is a powerful computational toolbox for protein design, structure prediction, and protein structure analysis. The development of novel Rosetta-based scientific tools requires two orthogonal skill sets: deep domain-specific expertise in protein biochemistry and technical expertise in development, deployment, and analysis of molecular simulations. Furthermore, the computational demands of molecular simulation necessitate large scale cluster-based or distributed solutions for nearly all scientifically relevant tasks. To reduce the technical barriers to entry for new development, we integrated Rosetta with modern, widely adopted computational infrastructure. This allows simplified deployment in large-scale cluster and cloud computing environments, and effective reuse of common libraries for simulation execution and data analysis. To achieve this, we integrated Rosetta with the Conda package manager; this simplifies installation into existing computational environments and packaging as docker images for cloud deployment. Then, we developed programming interfaces to integrate Rosetta with the PyData stack for analysis and distributed computing, including the popular tools Jupyter, Pandas, and Dask. We demonstrate the utility of these components by generating a library of a thousand de novo disulfide-rich miniproteins in a hybrid simulation that included cluster-based design and interactive notebook-based analyses. Our new tools enable users, who would otherwise not have access to the necessary computational infrastructure, to perform state-of-the-art molecular simulation and design with Rosetta.
Collapse
Affiliation(s)
- Alexander S Ford
- Institute for Protein Innovation, Boston, Massachusetts.,Institute for Protein Design, University of Washington, Seattle, Washington.,Department of Biochemistry, University of Washington, Seattle, Washington
| | - Brian D Weitzner
- Institute for Protein Design, University of Washington, Seattle, Washington.,Department of Biochemistry, University of Washington, Seattle, Washington.,Lyell Immunopharma, Inc., Seattle, Washington
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, Massachusetts.,Division of Hematology/Oncology, Boston Children's Hospital, Boston, Massachusetts.,Department of Pediatrics, Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
14
|
Jester BW, Tinberg CE, Rich MS, Baker D, Fields S. Engineered Biosensors from Dimeric Ligand-Binding Domains. ACS Synth Biol 2018; 7:2457-2467. [PMID: 30204430 DOI: 10.1021/acssynbio.8b00242] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Biosensors are important components of many synthetic biology and metabolic engineering applications. Here, we report a second generation of Saccharomyces cerevisiae digoxigenin and progesterone biosensors based on destabilized dimeric ligand-binding domains that undergo ligand-induced stabilization. The biosensors, comprising one ligand-binding domain monomer fused to a DNA-binding domain and another fused to a transcriptional activation domain, activate reporter gene expression in response to steroid binding and receptor dimerization. The introduction of a destabilizing mutation to the dimer interface increased biosensor dynamic range by an order of magnitude. Computational redesign of the dimer interface and functional selections were used to create heterodimeric pairs with further improved dynamic range. A heterodimeric biosensor built from the digoxigenin and progesterone ligand-binding domains functioned as a synthetic "AND"-gate, with 20-fold stronger response to the two ligands in combination than to either one alone. We also identified mutations that increase the sensitivity or selectivity of the biosensors to chemically similar ligands. These dimerizing biosensors provide additional flexibility for the construction of logic gates and other applications.
Collapse
Affiliation(s)
- Benjamin W. Jester
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, United States
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Christine E. Tinberg
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Matthew S. Rich
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - David Baker
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, United States
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, United States
| | - Stanley Fields
- Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, United States
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
- Department of Medicine, University of Washington, Seattle, Washington 98195, United States
| |
Collapse
|
15
|
Garcia-Borràs M, Houk KN, Jiménez-Osés G. Computational Design of Protein Function. COMPUTATIONAL TOOLS FOR CHEMICAL BIOLOGY 2017. [DOI: 10.1039/9781788010139-00087] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The computational design of enzymes is a tremendous challenge for both chemistry and biochemistry. The ability to design stable and functional biocatalysts that could operate under different conditions to perform chemical reactions without precedent in nature, allowing the large-scale production of chemicals à la carte, would revolutionise both synthetic, pharmacologic and materials chemistry. Despite the great advances achieved, this highly multidisciplinary area of research is still in its infancy. This chapter describes the ‘inside-out’ protocol for computational enzyme design and both the achievements and limitations of the current technology are highlighted. Furthermore, molecular dynamics simulations have proved to be invaluable in the enzyme design process, constituting an important tool for discovering elusive catalytically relevant conformations of the engineered or designed enzyme. As a complement to the ‘inside-out’ design protocol, different examples where hybrid QM/MM approaches have been directly applied to discover beneficial mutations in rational computational enzyme design are described.
Collapse
Affiliation(s)
- Marc Garcia-Borràs
- Department of Chemistry and Biochemistry, University of California Los Angeles California CA 90095-1569 USA
| | - Kendall N. Houk
- Department of Chemistry and Biochemistry, University of California Los Angeles California CA 90095-1569 USA
| | - Gonzalo Jiménez-Osés
- Departamento de Química, Centro de Investigación en Síntesis Química Universidad de La Rioja 26006 Logroño La Rioja Spain
| |
Collapse
|
16
|
Löffler P, Schmitz S, Hupfeld E, Sterner R, Merkl R. Rosetta:MSF: a modular framework for multi-state computational protein design. PLoS Comput Biol 2017; 13:e1005600. [PMID: 28604768 PMCID: PMC5484525 DOI: 10.1371/journal.pcbi.1005600] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 06/26/2017] [Accepted: 05/27/2017] [Indexed: 12/20/2022] Open
Abstract
Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta’s protocols optimize sequences based on a single conformation (i. e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta’s single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design. Protein engineering, i. e. the targeted modification or design of proteins has tremendous potential for medical and industrial applications. One generally applicable strategy for protein engineering is rational protein design: based on detailed knowledge of structure and function, computer programs like Rosetta propose the sequence of a protein possessing the desired properties. So far, most computer protocols have used rigid structures for design, which is a simplification because a protein’s structure is more accurately specified by a conformational ensemble. We have now implemented a framework for computational protein design that allows certain design protocols of Rosetta to make use of multiple design states like structural ensembles. An in silico assessment simulating ligand-binding design showed that this new approach generates more reliably native-like sequences than a single-state approach. As a proof-of-concept, we introduced de novo retro-aldolase activity into a scaffold protein and characterized nine variants experimentally, all of which were catalytically active.
Collapse
Affiliation(s)
- Patrick Löffler
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Samuel Schmitz
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Enrico Hupfeld
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Reinhard Sterner
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
- * E-mail:
| |
Collapse
|
17
|
Spencer RK, Hochbaum AI. X-ray Crystallographic Structure and Solution Behavior of an Antiparallel Coiled-Coil Hexamer Formed by de Novo Peptides. Biochemistry 2016; 55:3214-23. [DOI: 10.1021/acs.biochem.6b00201] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Ryan K. Spencer
- Department of Chemistry and Department of Chemical Engineering & Materials Science, University of California, Irvine, Irvine, California 92697-2575, United States
| | - Allon I. Hochbaum
- Department of Chemistry and Department of Chemical Engineering & Materials Science, University of California, Irvine, Irvine, California 92697-2575, United States
| |
Collapse
|
18
|
Abstract
Proteins that bind small molecules (ligands) can be used as biosensors, signal modulators, and sequestering agents. When naturally occurring proteins for a particular target ligand are not available, artificial proteins can be computationally designed. We present a protocol based on RosettaLigand to redesign an existing protein pocket to bind a target ligand. Starting with a protein structure and the structure of the ligand, Rosetta can optimize both the placement of the ligand in the pocket and the identity and conformation of the surrounding sidechains, yielding proteins that bind the target compound.
Collapse
|
19
|
King C, Garza EN, Mazor R, Linehan JL, Pastan I, Pepper M, Baker D. Removing T-cell epitopes with computational protein design. Proc Natl Acad Sci U S A 2014; 111:8577-82. [PMID: 24843166 PMCID: PMC4060723 DOI: 10.1073/pnas.1321126111] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Immune responses can make protein therapeutics ineffective or even dangerous. We describe a general computational protein design method for reducing immunogenicity by eliminating known and predicted T-cell epitopes and maximizing the content of human peptide sequences without disrupting protein structure and function. We show that the method recapitulates previous experimental results on immunogenicity reduction, and we use it to disrupt T-cell epitopes in GFP and Pseudomonas exotoxin A without disrupting function.
Collapse
Affiliation(s)
- Chris King
- Institute for Protein Design, Department of Biochemistry and
| | - Esteban N Garza
- Department of Immunology, University of Washington, Seattle, WA 98195; and
| | | | - Jonathan L Linehan
- National Institute of Allergy and Infectious Disease, National Institutes of Health, Bethesda, MD 20892
| | | | - Marion Pepper
- Department of Immunology, University of Washington, Seattle, WA 98195; and
| | - David Baker
- Institute for Protein Design, Department of Biochemistry and
| |
Collapse
|