101
|
Mou M, Pan Z, Zhou Z, Zheng L, Zhang H, Shi S, Li F, Sun X, Zhu F. A Transformer-Based Ensemble Framework for the Prediction of Protein-Protein Interaction Sites. RESEARCH (WASHINGTON, D.C.) 2023; 6:0240. [PMID: 37771850 PMCID: PMC10528219 DOI: 10.34133/research.0240] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 09/08/2023] [Indexed: 09/30/2023]
Abstract
The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.
Collapse
Affiliation(s)
- Minjie Mou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Ziqi Pan
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Zhimeng Zhou
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Lingyan Zheng
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Hanyu Zhang
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Shuiyang Shi
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Fengcheng Li
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Xiuna Sun
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, The Second Affiliated Hospital,
Zhejiang UniversitySchool of Medicine, National Key Laboratory of Advanced Drug Delivery and Release Systems, Zhejiang University, Hangzhou 310058, China
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou 330110, China
| |
Collapse
|
102
|
Sieg J, Rarey M. Searching similar local 3D micro-environments in protein structure databases with MicroMiner. Brief Bioinform 2023; 24:bbad357. [PMID: 37833838 DOI: 10.1093/bib/bbad357] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 08/28/2023] [Accepted: 09/18/2023] [Indexed: 10/15/2023] Open
Abstract
The available protein structure data are rapidly increasing. Within these structures, numerous local structural sites depict the details characterizing structure and function. However, searching and analyzing these sites extensively and at scale poses a challenge. We present a new method to search local sites in protein structure databases using residue-defined local 3D micro-environments. We implemented the method in a new tool called MicroMiner and demonstrate the capabilities of residue micro-environment search on the example of structural mutation analysis. Usually, experimental structures for both the wild-type and the mutant are unavailable for comparison. With MicroMiner, we extracted $>255 \times 10^{6}$ amino acid pairs in protein structures from the PDB, exemplifying single mutations' local structural changes for single chains and $>45 \times 10^{6}$ pairs for protein-protein interfaces. We further annotate existing data sets of experimentally measured mutation effects, like $\Delta \Delta G$ measurements, with the extracted structure pairs to combine the mutation effect measurement with the structural change upon mutation. In addition, we show how MicroMiner can bridge the gap between mutation analysis and structure-based drug design tools. MicroMiner is available as a command line tool and interactively on the https://proteins.plus/ webserver.
Collapse
Affiliation(s)
- Jochen Sieg
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- Universität Hamburg, ZBH - Center for Bioinformatics, Bundesstraße 43, 20146 Hamburg, Germany
| |
Collapse
|
103
|
Glasscock CJ, Pecoraro R, McHugh R, Doyle LA, Chen W, Boivin O, Lonnquist B, Na E, Politanska Y, Haddox HK, Cox D, Norn C, Coventry B, Goreshnik I, Vafeados D, Lee GR, Gordan R, Stoddard BL, DiMaio F, Baker D. Computational design of sequence-specific DNA-binding proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.20.558720. [PMID: 37790440 PMCID: PMC10542524 DOI: 10.1101/2023.09.20.558720] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Sequence-specific DNA-binding proteins (DBPs) play critical roles in biology and biotechnology, and there has been considerable interest in the engineering of DBPs with new or altered specificities for genome editing and other applications. While there has been some success in reprogramming naturally occurring DBPs using selection methods, the computational design of new DBPs that recognize arbitrary target sites remains an outstanding challenge. We describe a computational method for the design of small DBPs that recognize specific target sequences through interactions with bases in the major groove, and employ this method in conjunction with experimental screening to generate binders for 5 distinct DNA targets. These binders exhibit specificity closely matching the computational models for the target DNA sequences at as many as 6 base positions and affinities as low as 30-100 nM. The crystal structure of a designed DBP-target site complex is in close agreement with the design model, highlighting the accuracy of the design method. The designed DBPs function in both Escherichia coli and mammalian cells to repress and activate transcription of neighboring genes. Our method is a substantial step towards a general route to small and hence readily deliverable sequence-specific DBPs for gene regulation and editing.
Collapse
Affiliation(s)
- Cameron J. Glasscock
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Robert Pecoraro
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Physics, University of Washington, Seattle, WA, USA
| | - Ryan McHugh
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lindsey A. Doyle
- Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Wei Chen
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Olivier Boivin
- Program in Genetics and Genomic, Duke University, Durham, NC, USA
- Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA
| | - Beau Lonnquist
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Emily Na
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Yuliya Politanska
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hugh K. Haddox
- Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - David Cox
- Department of Biochemistry, Stanford University School of Medicine, Palo Alto, CA USA
- Department of Medicine, Division of Hematology, Stanford University, Stanford, CA, USA
| | - Christoffer Norn
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- BioInnovation Institute, DK2200 Copenhagen N, Denmark
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Inna Goreshnik
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Dionne Vafeados
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Gyu Rie Lee
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA USA
| | - Raluca Gordan
- Center for Advanced Genomic Technologies, Duke University, Durham, NC, USA
- Department of Biostatistics and Bioinformatics, Department of Computer Science, Department of Molecular Genetics and Microbiology, Duke University, Durham, NC, USA
| | - Barry L. Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Center, Seattle, Washington, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- BioInnovation Institute, DK2200 Copenhagen N, Denmark
| |
Collapse
|
104
|
Huang B, Abedi M, Ahn G, Coventry B, Sappington I, Wang R, Schlichthaerle T, Zhang JZ, Wang Y, Goreshnik I, Chiu CW, Chazin-Gray A, Chan S, Gerben S, Murray A, Wang S, O'Neill J, Yeh R, Misquith A, Wolf A, Tomasovic LM, Piraner DI, Gonzalez MJD, Bennett NR, Venkatesh P, Satoe D, Ahlrichs M, Dobbins C, Yang W, Wang X, Vafeados D, Mout R, Shivaei S, Cao L, Carter L, Stewart L, Spangler JB, Bernardes GJL, Roybal KT, Greisen P, Li X, Bertozzi C, Baker D. Designed Endocytosis-Triggering Proteins mediate Targeted Degradation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.19.553321. [PMID: 37781607 PMCID: PMC10541094 DOI: 10.1101/2023.08.19.553321] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/03/2023]
Abstract
Endocytosis and lysosomal trafficking of cell surface receptors can be triggered by interaction with endogenous ligands. Therapeutic approaches such as LYTAC1,2 and KineTAC3, have taken advantage of this to target specific proteins for degradation by fusing modified native ligands to target binding proteins. While powerful, these approaches can be limited by possible competition with the endogenous ligand(s), the requirement in some cases for chemical modification that limits genetic encodability and can complicate manufacturing, and more generally, there may not be natural ligands which stimulate endocytosis through a given receptor. Here we describe general protein design approaches for designing endocytosis triggering binding proteins (EndoTags) that overcome these challenges. We present EndoTags for the IGF-2R, ASGPR, Sortillin, and Transferrin receptors, and show that fusing these tags to proteins which bind to soluble or transmembrane protein leads to lysosomal trafficking and target degradation; as these receptors have different tissue distributions, the different EndoTags could enable targeting of degradation to different tissues. The modularity and genetic encodability of EndoTags enables AND gate control for higher specificity targeted degradation, and the localized secretion of degraders from engineered cells. The tunability and modularity of our genetically encodable EndoTags should contribute to deciphering the relationship between receptor engagement and cellular trafficking, and they have considerable therapeutic potential as targeted degradation inducers, signaling activators for endocytosis-dependent pathways, and cellular uptake inducers for targeted antibody drug and RNA conjugates.
Collapse
Affiliation(s)
- Buwei Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Mohamad Abedi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Green Ahn
- Department of Chemistry, Stanford University, Stanford, CA, USA
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Isaac Sappington
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Rong Wang
- Department of Molecular Genetics, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Thomas Schlichthaerle
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jason Z Zhang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Yujia Wang
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Inna Goreshnik
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Ching Wen Chiu
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Adam Chazin-Gray
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Sidney Chan
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Stacey Gerben
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Analisa Murray
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Shunzhi Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | | | | | - Luke M Tomasovic
- Departments of Biomedical Engineering and Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Medical Scientist Training Program, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Dan I Piraner
- Department of Microbiology and Immunology, University of California San Francisco
| | | | - Nathaniel R Bennett
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Preetham Venkatesh
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Danny Satoe
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Maggie Ahlrichs
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Craig Dobbins
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Wei Yang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Xinru Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Dionne Vafeados
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Rubul Mout
- Harvard Medical School, Harvard University, Boston, Massachusetts, USA
| | - Shirin Shivaei
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA, USA
| | - Longxing Cao
- School of Life Sciences, Westlake University, Hangzhou, China
| | - Lauren Carter
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lance Stewart
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jamie B Spangler
- Departments of Biomedical Engineering and Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Gonçalo J L Bernardes
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA, USA
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Kole T Roybal
- Department of Microbiology and Immunology, University of California San Francisco
| | | | - Xiaochun Li
- Department of Molecular Genetics, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Carolyn Bertozzi
- Department of Chemistry, Stanford University, Stanford, CA, USA
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| |
Collapse
|
105
|
Stern JA, Free TJ, Stern KL, Gardiner S, Dalley NA, Bundy BC, Price JL, Wingate D, Della Corte D. A probabilistic view of protein stability, conformational specificity, and design. Sci Rep 2023; 13:15493. [PMID: 37726313 PMCID: PMC10509192 DOI: 10.1038/s41598-023-42032-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Various approaches have used neural networks as probabilistic models for the design of protein sequences. These "inverse folding" models employ different objective functions, which come with trade-offs that have not been assessed in detail before. This study introduces probabilistic definitions of protein stability and conformational specificity and demonstrates the relationship between these chemical properties and the [Formula: see text] Boltzmann probability objective. This links the Boltzmann probability objective function to experimentally verifiable outcomes. We propose a novel sequence decoding algorithm, referred to as "BayesDesign", that leverages Bayes' Rule to maximize the [Formula: see text] objective instead of the [Formula: see text] objective common in inverse folding models. The efficacy of BayesDesign is evaluated in the context of two protein model systems, the NanoLuc enzyme and the WW structural motif. Both BayesDesign and the baseline ProteinMPNN algorithm increase the thermostability of NanoLuc and increase the conformational specificity of WW. The possible sources of error in the model are analyzed.
Collapse
Affiliation(s)
- Jacob A Stern
- Department of Computer Science, Brigham Young University, Provo, UT, USA
| | - Tyler J Free
- Department of Chemical Engineering, Brigham Young University, Provo, UT, USA
| | - Kimberlee L Stern
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - Spencer Gardiner
- Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA
| | - Nicholas A Dalley
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - Bradley C Bundy
- Department of Chemical Engineering, Brigham Young University, Provo, UT, USA
| | - Joshua L Price
- Department of Chemistry and Biochemistry, Brigham Young University, Provo, UT, USA
| | - David Wingate
- Department of Computer Science, Brigham Young University, Provo, UT, USA
| | - Dennis Della Corte
- Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA.
| |
Collapse
|
106
|
Fast E, Dhar M, Chen B. TAPIR: a T-cell receptor language model for predicting rare and novel targets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.12.557285. [PMID: 37745475 PMCID: PMC10515850 DOI: 10.1101/2023.09.12.557285] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
T-cell receptors (TCRs) are involved in most human diseases, but linking their sequences with their targets remains an unsolved grand challenge in the field. In this study, we present TAPIR (T-cell receptor and Peptide Interaction Recognizer), a T-cell receptor (TCR) language model that predicts TCR-target interactions, with a focus on novel and rare targets. TAPIR employs deep convolutional neural network (CNN) encoders to process TCR and target sequences across flexible representations (e.g., beta-chain only, unknown MHC allele, etc.) and learns patterns of interactivity via several training tasks. This flexibility allows TAPIR to train on more than 50k either paired (alpha and beta chain) or unpaired TCRs (just alpha or beta chain) from public and proprietary databases against 1933 unique targets. TAPIR demonstrates state-of-the-art performance when predicting TCR interactivity against common benchmark targets and is the first method to demonstrate strong performance when predicting TCR interactivity against novel targets, where no examples are provided in training. TAPIR is also capable of predicting TCR interaction against MHC alleles in the absence of target information. Leveraging these capabilities, we apply TAPIR to cancer patient TCR repertoires and identify and validate a novel and potent anti-cancer T-cell receptor against a shared cancer neoantigen target (PIK3CA H1047L). We further show how TAPIR, when extended with a generative neural network, is capable of directly designing T-cell receptor sequences that interact with a target of interest.
Collapse
Affiliation(s)
- Ethan Fast
- Vcreate, Inc., Menlo Park, CA, 94025, USA
| | | | | |
Collapse
|
107
|
Praetorius F, Leung PJY, Tessmer MH, Broerman A, Demakis C, Dishman AF, Pillai A, Idris A, Juergens D, Dauparas J, Li X, Levine PM, Lamb M, Ballard RK, Gerben SR, Nguyen H, Kang A, Sankaran B, Bera AK, Volkman BF, Nivala J, Stoll S, Baker D. Design of stimulus-responsive two-state hinge proteins. Science 2023; 381:754-760. [PMID: 37590357 PMCID: PMC10697137 DOI: 10.1126/science.adg7731] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Accepted: 07/11/2023] [Indexed: 08/19/2023]
Abstract
In nature, proteins that switch between two conformations in response to environmental stimuli structurally transduce biochemical information in a manner analogous to how transistors control information flow in computing devices. Designing proteins with two distinct but fully structured conformations is a challenge for protein design as it requires sculpting an energy landscape with two distinct minima. Here we describe the design of "hinge" proteins that populate one designed state in the absence of ligand and a second designed state in the presence of ligand. X-ray crystallography, electron microscopy, double electron-electron resonance spectroscopy, and binding measurements demonstrate that despite the significant structural differences the two states are designed with atomic level accuracy and that the conformational and binding equilibria are closely coupled.
Collapse
Affiliation(s)
- Florian Praetorius
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Philip J. Y. Leung
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Molecular Engineering, University of Washington, Seattle, WA, USA
| | - Maxx H. Tessmer
- Department of Chemistry, University of Washington, Seattle, WA, USA
| | - Adam Broerman
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Chemical Engineering, University of Washington, Seattle, WA, USA
| | - Cullen Demakis
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Biological Physics, Structure, and Design, University of Washington, Seattle, Washington, USA
| | - Acacia F. Dishman
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Arvind Pillai
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Abbas Idris
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - David Juergens
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Molecular Engineering, University of Washington, Seattle, WA, USA
| | - Justas Dauparas
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Paul M. Levine
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Mila Lamb
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ryanne K. Ballard
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Stacey R. Gerben
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hannah Nguyen
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alex Kang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Banumathi Sankaran
- Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Asim K. Bera
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Brian F. Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jeff Nivala
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
- Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA, USA
| | - Stefan Stoll
- Department of Chemistry, University of Washington, Seattle, WA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA,USA
| |
Collapse
|
108
|
Kell DB, Pretorius E. Are fibrinaloid microclots a cause of autoimmunity in Long Covid and other post-infection diseases? Biochem J 2023; 480:1217-1240. [PMID: 37584410 DOI: 10.1042/bcj20230241] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 08/03/2023] [Accepted: 08/07/2023] [Indexed: 08/17/2023]
Abstract
It is now well established that the blood-clotting protein fibrinogen can polymerise into an anomalous form of fibrin that is amyloid in character; the resultant clots and microclots entrap many other molecules, stain with fluorogenic amyloid stains, are rather resistant to fibrinolysis, can block up microcapillaries, are implicated in a variety of diseases including Long COVID, and have been referred to as fibrinaloids. A necessary corollary of this anomalous polymerisation is the generation of novel epitopes in proteins that would normally be seen as 'self', and otherwise immunologically silent. The precise conformation of the resulting fibrinaloid clots (that, as with prions and classical amyloid proteins, can adopt multiple, stable conformations) must depend on the existing small molecules and metal ions that the fibrinogen may (and is some cases is known to) have bound before polymerisation. Any such novel epitopes, however, are likely to lead to the generation of autoantibodies. A convergent phenomenology, including distinct conformations and seeding of the anomalous form for initiation and propagation, is emerging to link knowledge in prions, prionoids, amyloids and now fibrinaloids. We here summarise the evidence for the above reasoning, which has substantial implications for our understanding of the genesis of autoimmunity (and the possible prevention thereof) based on the primary process of fibrinaloid formation.
Collapse
Affiliation(s)
- Douglas B Kell
- Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZB, U.K
- The Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Kemitorvet 200, 2800 Kgs Lyngby, Denmark
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch 7602, South Africa
| | - Etheresia Pretorius
- Department of Biochemistry, Cell and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool L69 7ZB, U.K
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch 7602, South Africa
| |
Collapse
|
109
|
de Haas RJ, Brunette N, Goodson A, Dauparas J, Yi SY, Yang EC, Dowling Q, Nguyen H, Kang A, Bera AK, Sankaran B, de Vries R, Baker D, King NP. Rapid and automated design of two-component protein nanomaterials using ProteinMPNN. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.04.551935. [PMID: 37577478 PMCID: PMC10418170 DOI: 10.1101/2023.08.04.551935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
The design of novel protein-protein interfaces using physics-based design methods such as Rosetta requires substantial computational resources and manual refinement by expert structural biologists. A new generation of deep learning methods promises to simplify protein-protein interface design and enable its application to a wide variety of problems by researchers from various scientific disciplines. Here we test the ability of a deep learning method for protein sequence design, ProteinMPNN, to design two-component tetrahedral protein nanomaterials and benchmark its performance against Rosetta. ProteinMPNN had a similar success rate to Rosetta, yielding 13 new experimentally confirmed assemblies, but required orders of magnitude less computation and no manual refinement. The interfaces designed by ProteinMPNN were substantially more polar than those designed by Rosetta, which facilitated in vitro assembly of the designed nanomaterials from independently purified components. Crystal structures of several of the assemblies confirmed the accuracy of the design method at high resolution. Our results showcase the potential of deep learning-based methods to unlock the widespread application of designed protein-protein interfaces and self-assembling protein nanomaterials in biotechnology.
Collapse
|
110
|
Cummins MC, Tripathy A, Sondek J, Kuhlman B. De novo design of stable proteins that efficaciously inhibit oncogenic G proteins. Protein Sci 2023; 32:e4713. [PMID: 37368504 PMCID: PMC10360382 DOI: 10.1002/pro.4713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 06/23/2023] [Accepted: 06/24/2023] [Indexed: 06/29/2023]
Abstract
Many protein therapeutics are competitive inhibitors that function by binding to endogenous proteins and preventing them from interacting with native partners. One effective strategy for engineering competitive inhibitors is to graft structural motifs from a native partner into a host protein. Here, we develop and experimentally test a computational protocol for embedding binding motifs in de novo designed proteins. The protocol uses an "inside-out" approach: Starting with a structural model of the binding motif docked against the target protein, the de novo protein is built by growing new structural elements off the termini of the binding motif. During backbone assembly, a score function favors backbones that introduce new tertiary contacts within the designed protein and do not introduce clashes with the target binding partner. Final sequences are designed and optimized using the molecular modeling program Rosetta. To test our protocol, we designed small helical proteins to inhibit the interaction between Gαq and its effector PLC-β isozymes. Several of the designed proteins remain folded above 90°C and bind to Gαq with equilibrium dissociation constants tighter than 80 nM. In cellular assays with oncogenic variants of Gαq , the designed proteins inhibit activation of PLC-β isozymes and Dbl-family RhoGEFs. Our results demonstrate that computational protein design, in combination with motif grafting, can be used to directly generate potent inhibitors without further optimization via high throughput screening or selection.
Collapse
Affiliation(s)
- Matthew C. Cummins
- Department of PharmacologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Ashutosh Tripathy
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - John Sondek
- Department of PharmacologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineberger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| | - Brian Kuhlman
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineberger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| |
Collapse
|
111
|
Watson JL, Juergens D, Bennett NR, Trippe BL, Yim J, Eisenach HE, Ahern W, Borst AJ, Ragotte RJ, Milles LF, Wicky BIM, Hanikel N, Pellock SJ, Courbet A, Sheffler W, Wang J, Venkatesh P, Sappington I, Torres SV, Lauko A, De Bortoli V, Mathieu E, Ovchinnikov S, Barzilay R, Jaakkola TS, DiMaio F, Baek M, Baker D. De novo design of protein structure and function with RFdiffusion. Nature 2023; 620:1089-1100. [PMID: 37433327 PMCID: PMC10468394 DOI: 10.1038/s41586-023-06415-8] [Citation(s) in RCA: 299] [Impact Index Per Article: 299.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 07/07/2023] [Indexed: 07/13/2023]
Abstract
There has been considerable recent progress in designing new proteins using deep-learning methods1-9. Despite this progress, a general deep-learning framework for protein design that enables solution of a wide range of design challenges, including de novo binder design and design of higher-order symmetric architectures, has yet to be described. Diffusion models10,11 have had considerable success in image and language generative modelling but limited success when applied to protein modelling, probably due to the complexity of protein backbone geometry and sequence-structure relationships. Here we show that by fine-tuning the RoseTTAFold structure prediction network on protein structure denoising tasks, we obtain a generative model of protein backbones that achieves outstanding performance on unconditional and topology-constrained protein monomer design, protein binder design, symmetric oligomer design, enzyme active site scaffolding and symmetric motif scaffolding for therapeutic and metal-binding protein design. We demonstrate the power and generality of the method, called RoseTTAFold diffusion (RFdiffusion), by experimentally characterizing the structures and functions of hundreds of designed symmetric assemblies, metal-binding proteins and protein binders. The accuracy of RFdiffusion is confirmed by the cryogenic electron microscopy structure of a designed binder in complex with influenza haemagglutinin that is nearly identical to the design model. In a manner analogous to networks that produce images from user-specified inputs, RFdiffusion enables the design of diverse functional proteins from simple molecular specifications.
Collapse
Affiliation(s)
- Joseph L Watson
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - David Juergens
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Molecular Engineering, University of Washington, Seattle, WA, USA
| | - Nathaniel R Bennett
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Molecular Engineering, University of Washington, Seattle, WA, USA
| | - Brian L Trippe
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Columbia University, Department of Statistics, New York, NY, USA
- Irving Institute for Cancer Dynamics, Columbia University, New York, NY, USA
| | - Jason Yim
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Helen E Eisenach
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Woody Ahern
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | - Andrew J Borst
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Robert J Ragotte
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lukas F Milles
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Basile I M Wicky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nikita Hanikel
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Samuel J Pellock
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alexis Courbet
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- National Centre for Scientific Research, École Normale Supérieure rue d'Ulm, Paris, France
| | - William Sheffler
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jue Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Preetham Venkatesh
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA, USA
| | - Isaac Sappington
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA, USA
| | - Susana Vázquez Torres
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA, USA
| | - Anna Lauko
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Graduate Program in Biological Physics, Structure and Design, University of Washington, Seattle, WA, USA
| | - Valentin De Bortoli
- National Centre for Scientific Research, École Normale Supérieure rue d'Ulm, Paris, France
| | - Emile Mathieu
- Department of Engineering, University of Cambridge, Cambridge, UK
| | - Sergey Ovchinnikov
- Faculty of Applied Sciences, Harvard University, Cambridge, MA, USA
- John Harvard Distinguished Science Fellowship, Harvard University, Cambridge, MA, USA
| | | | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Minkyung Baek
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
112
|
Mallik BB, Stanislaw J, Alawathurage TM, Khmelinskaia A. De Novo Design of Polyhedral Protein Assemblies: Before and After the AI Revolution. Chembiochem 2023; 24:e202300117. [PMID: 37014094 DOI: 10.1002/cbic.202300117] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/03/2023] [Accepted: 04/03/2023] [Indexed: 04/05/2023]
Abstract
Self-assembling polyhedral protein biomaterials have gained attention as engineering targets owing to their naturally evolved sophisticated functions, ranging from protecting macromolecules from the environment to spatially controlling biochemical reactions. Precise computational design of de novo protein polyhedra is possible through two main types of approaches: methods from first principles, using physical and geometrical rules, and more recent data-driven methods based on artificial intelligence (AI), including deep learning (DL). Here, we retrospect first principle- and AI-based approaches for designing finite polyhedral protein assemblies, as well as advances in the structure prediction of such assemblies. We further highlight the possible applications of these materials and explore how the presented approaches can be combined to overcome current challenges and to advance the design of functional protein-based biomaterials.
Collapse
Affiliation(s)
- Bhoomika Basu Mallik
- Transdisciplinary Research Area, "Building Blocks of Matter and Fundamental Interactions (TRA Matter)", University of Bonn, 53121, Bonn, Germany
- Life and Medical Sciences Institute, University of Bonn, 53115, Bonn, Germany
| | - Jenna Stanislaw
- Transdisciplinary Research Area, "Building Blocks of Matter and Fundamental Interactions (TRA Matter)", University of Bonn, 53121, Bonn, Germany
- Life and Medical Sciences Institute, University of Bonn, 53115, Bonn, Germany
| | - Tharindu Madhusankha Alawathurage
- Transdisciplinary Research Area, "Building Blocks of Matter and Fundamental Interactions (TRA Matter)", University of Bonn, 53121, Bonn, Germany
- Life and Medical Sciences Institute, University of Bonn, 53115, Bonn, Germany
| | - Alena Khmelinskaia
- Transdisciplinary Research Area, "Building Blocks of Matter and Fundamental Interactions (TRA Matter)", University of Bonn, 53121, Bonn, Germany
- Life and Medical Sciences Institute, University of Bonn, 53115, Bonn, Germany
- Current address: Department of Chemistry, Ludwig Maximillian University, 80539, Munich, Germany
| |
Collapse
|
113
|
Ding X, Huang Y, Tian X, Zhao Y, Feng G, Gao Z. Diagnosis, Treatment, and Management of Otitis Media with Artificial Intelligence. Diagnostics (Basel) 2023; 13:2309. [PMID: 37443702 DOI: 10.3390/diagnostics13132309] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 06/04/2023] [Accepted: 06/14/2023] [Indexed: 07/15/2023] Open
Abstract
A common infectious disease, otitis media (OM) has a low rate of early diagnosis, which significantly increases the difficulty of treating the disease and the likelihood of serious complications developing including hearing loss, speech impairment, and even intracranial infection. Several areas of healthcare have shown great promise in the application of artificial intelligence (AI) systems, such as the accurate detection of diseases, the automated interpretation of images, and the prediction of patient outcomes. Several articles have reported some machine learning (ML) algorithms such as ResNet, InceptionV3 and Unet, were applied to the diagnosis of OM successfully. The use of these techniques in the OM is still in its infancy, but their potential is enormous. We present in this review important concepts related to ML and AI, describe how these technologies are currently being applied to diagnosing, treating, and managing OM, and discuss the challenges associated with developing AI-assisted OM technologies in the future.
Collapse
Affiliation(s)
- Xin Ding
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yu Huang
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Xu Tian
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Yang Zhao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Guodong Feng
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| | - Zhiqiang Gao
- Department of Otorhinolaryngology Head and Neck Surgery, The Peaking Union Medical College Hospital, No. 1, Shuaifuyuan, Dongcheng District, Beijing 100010, China
| |
Collapse
|
114
|
Hoffnagle AM, Tezcan FA. Atomically Accurate Design of Metalloproteins with Predefined Coordination Geometries. J Am Chem Soc 2023; 145:14208-14214. [PMID: 37352018 PMCID: PMC10439731 DOI: 10.1021/jacs.3c04047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/25/2023]
Abstract
We report a new computational protein design method for the construction of oligomeric protein assemblies around metal centers with predefined coordination geometries. We apply this method to design two homotrimeric assemblies, Tet4 and TP1, with tetrahedral and trigonal-pyramidal tris(histidine) metal coordination geometries, respectively, and demonstrate that both assemblies form the targeted metal centers with ≤0.2 Å accuracy. Although Tet4 and TP1 are constructed from the same parent protein building block, they are distinct in terms of their overall architectures, the environment surrounding the metal centers, and their metal-based reactivities, illustrating the versatility of our approach.
Collapse
Affiliation(s)
- Alexander M. Hoffnagle
- Department of Chemistry and Biochemistry, University of California, San Diego, CA 92093, USA
| | - F. Akif Tezcan
- Department of Chemistry and Biochemistry, University of California, San Diego, CA 92093, USA
| |
Collapse
|
115
|
de Haas RJ, Tas RP, van den Broek D, Zheng C, Nguyen H, Kang A, Bera AK, King NP, Voets IK, de Vries R. De novo designed ice-binding proteins from twist-constrained helices. Proc Natl Acad Sci U S A 2023; 120:e2220380120. [PMID: 37364125 PMCID: PMC10319034 DOI: 10.1073/pnas.2220380120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Accepted: 05/02/2023] [Indexed: 06/28/2023] Open
Abstract
Attaining molecular-level control over solidification processes is a crucial aspect of materials science. To control ice formation, organisms have evolved bewildering arrays of ice-binding proteins (IBPs), but these have poorly understood structure-activity relationships. We propose that reverse engineering using de novo computational protein design can shed light on structure-activity relationships of IBPs. We hypothesized that the model alpha-helical winter flounder antifreeze protein uses an unusual undertwisting of its alpha-helix to align its putative ice-binding threonine residues in exactly the same direction. We test this hypothesis by designing a series of straight three-helix bundles with an ice-binding helix projecting threonines and two supporting helices constraining the twist of the ice-binding helix. Our findings show that ice-recrystallization inhibition by the designed proteins increases with the degree of designed undertwisting, thus validating our hypothesis, and opening up avenues for the computational design of IBPs.
Collapse
Affiliation(s)
- Robbert J. de Haas
- Department of Physical Chemistry and Soft Matter, Wageningen University and Research, Wageningen, WE6708, The Netherlands
| | - Roderick P. Tas
- Laboratory of Self-Organizing Soft Matter, Department of Chemical Engineering and Chemistry and Institute for Complex Molecular Systems, Eindhoven University of Technology, Eindhoven, MB5600, The Netherlands
| | - Daniëlle van den Broek
- Laboratory of Self-Organizing Soft Matter, Department of Chemical Engineering and Chemistry and Institute for Complex Molecular Systems, Eindhoven University of Technology, Eindhoven, MB5600, The Netherlands
| | - Chuanbao Zheng
- Department of Physical Chemistry and Soft Matter, Wageningen University and Research, Wageningen, WE6708, The Netherlands
| | - Hannah Nguyen
- Department of Biochemistry, University of Washington, Seattle, WA98195
- Institute for Protein Design, University of Washington, Seattle, WA98195
| | - Alex Kang
- Department of Biochemistry, University of Washington, Seattle, WA98195
- Institute for Protein Design, University of Washington, Seattle, WA98195
| | - Asim K. Bera
- Department of Biochemistry, University of Washington, Seattle, WA98195
- Institute for Protein Design, University of Washington, Seattle, WA98195
| | - Neil P. King
- Department of Biochemistry, University of Washington, Seattle, WA98195
- Institute for Protein Design, University of Washington, Seattle, WA98195
| | - Ilja K. Voets
- Laboratory of Self-Organizing Soft Matter, Department of Chemical Engineering and Chemistry and Institute for Complex Molecular Systems, Eindhoven University of Technology, Eindhoven, MB5600, The Netherlands
| | - Renko de Vries
- Department of Physical Chemistry and Soft Matter, Wageningen University and Research, Wageningen, WE6708, The Netherlands
| |
Collapse
|
116
|
Casadevall G, Duran C, Osuna S. AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design. JACS AU 2023; 3:1554-1562. [PMID: 37388680 PMCID: PMC10302747 DOI: 10.1021/jacsau.3c00188] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/22/2023] [Accepted: 05/22/2023] [Indexed: 07/01/2023]
Abstract
The recent success of AlphaFold2 (AF2) and other deep learning (DL) tools in accurately predicting the folded three-dimensional (3D) structure of proteins and enzymes has revolutionized the structural biology and protein design fields. The 3D structure indeed reveals key information on the arrangement of the catalytic machinery of enzymes and which structural elements gate the active site pocket. However, comprehending enzymatic activity requires a detailed knowledge of the chemical steps involved along the catalytic cycle and the exploration of the multiple thermally accessible conformations that enzymes adopt when in solution. In this Perspective, some of the recent studies showing the potential of AF2 in elucidating the conformational landscape of enzymes are provided. Selected examples of the key developments of AF2-based and DL methods for protein design are discussed, as well as a few enzyme design cases. These studies show the potential of AF2 and DL for allowing the routine computational design of efficient enzymes.
Collapse
Affiliation(s)
- Guillem Casadevall
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Cristina Duran
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Sílvia Osuna
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
- ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
117
|
Nguyen ATN, Tran QL, Baltos JA, McNeill SM, Nguyen DTN, May LT. Small molecule allosteric modulation of the adenosine A 1 receptor. Front Endocrinol (Lausanne) 2023; 14:1184360. [PMID: 37435481 PMCID: PMC10331460 DOI: 10.3389/fendo.2023.1184360] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/11/2023] [Accepted: 05/23/2023] [Indexed: 07/13/2023] Open
Abstract
G protein-coupled receptors (GPCRs) represent the target for approximately a third of FDA-approved small molecule drugs. The adenosine A1 receptor (A1R), one of four adenosine GPCR subtypes, has important (patho)physiological roles in humans. A1R has well-established roles in the regulation of the cardiovascular and nervous systems, where it has been identified as a potential therapeutic target for a number of conditions, including cardiac ischemia-reperfusion injury, cognition, epilepsy, and neuropathic pain. A1R small molecule drugs, typically orthosteric ligands, have undergone clinical trials. To date, none have progressed into the clinic, predominantly due to dose-limiting unwanted effects. The development of A1R allosteric modulators that target a topographically distinct binding site represent a promising approach to overcome current limitations. Pharmacological parameters of allosteric ligands, including affinity, efficacy and cooperativity, can be optimized to regulate A1R activity with high subtype, spatial and temporal selectivity. This review aims to offer insights into the A1R as a potential therapeutic target and highlight recent advances in the structural understanding of A1R allosteric modulation.
Collapse
Affiliation(s)
- Anh T. N. Nguyen
- Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, VIC, Australia
| | - Quan L. Tran
- Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, VIC, Australia
| | - Jo-Anne Baltos
- Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, VIC, Australia
| | - Samantha M. McNeill
- Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, VIC, Australia
| | - Diep T. N. Nguyen
- Department of Information Technology, Faculty of Engineering and Technology, Vietnam National University, Hanoi, Vietnam
| | - Lauren T. May
- Drug Discovery Biology, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, VIC, Australia
| |
Collapse
|
118
|
Liu G, Huang L, Lian J. Alcohol acyltransferases for the biosynthesis of esters. BIOTECHNOLOGY FOR BIOFUELS AND BIOPRODUCTS 2023; 16:93. [PMID: 37264424 DOI: 10.1186/s13068-023-02343-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 05/18/2023] [Indexed: 06/03/2023]
Abstract
Esters are widely used in food, energy, spices, chemical industry, etc., becoming an indispensable part of life. However, their production heavily relies on the fossil energy industry, which presents significant challenges associated with energy shortages and environmental pollution. Consequently, there is an urgent need to identify alternative green methods for ester production. One promising solution is biosynthesis, which offers sustainable and environmentally friendly processes. In ester biosynthesis, alcohol acyltransferases (AATs) catalyze the condensation of acyl-CoAs and alcohols to form esters, enabling the biosynthesis of nearly 100 different kinds of esters, such as ethyl acetate, hexyl acetate, ethyl crotonate, isoamyl acetate, and butyl butyrate. However, low catalytic efficiency and low selectivity of AATs represent the major bottlenecks for the biosynthesis of certain specific esters, which should be addressed with protein molecular engineering approaches before practical biotechnological applications. This review provides an overview of AAT enzymes, including their sequences, structures, active sites, catalytic mechanisms, and metabolic engineering applications. Furthermore, considering the critical role of AATs in determining the final ester products, the current research progresses of AAT modification using protein molecular engineering are also discussed. This review summarized the major challenges and prospects of AAT enzymes in ester biosynthesis.
Collapse
Affiliation(s)
- Gaofei Liu
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 311215, China
| | - Lei Huang
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 311215, China
| | - Jiazhang Lian
- Key Laboratory of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China.
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, 311215, China.
- Zhejiang Key Laboratory of Smart Biomaterials, Zhejiang University, Hangzhou, 310027, China.
| |
Collapse
|
119
|
Chu AE, Cheng L, Nesr GE, Xu M, Huang PS. An all-atom protein generative model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.24.542194. [PMID: 37292974 PMCID: PMC10245864 DOI: 10.1101/2023.05.24.542194] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Proteins mediate their functions through chemical interactions; modeling these interactions, which are typically through sidechains, is an important need in protein design. However, constructing an all-atom generative model requires an appropriate scheme for managing the jointly continuous and discrete nature of proteins encoded in the structure and sequence. We describe an all-atom diffusion model of protein structure, Protpardelle, which instantiates a "superposition" over the possible sidechain states, and collapses it to conduct reverse diffusion for sample generation. When combined with sequence design methods, our model is able to co-design all-atom protein structure and sequence. Generated proteins are of good quality under the typical quality, diversity, and novelty metrics, and sidechains reproduce the chemical features and behavior of natural proteins. Finally, we explore the potential of our model conduct all-atom protein design and scaffold functional motifs in a backbone- and rotamer-free way.
Collapse
Affiliation(s)
- Alexander E. Chu
- Biophysics Program, Stanford University
- Department of Bioengineering, Stanford University
| | | | - Gina El Nesr
- Biophysics Program, Stanford University
- Department of Bioengineering, Stanford University
| | - Minkai Xu
- Department of Computer Science, Stanford University
| | - Po-Ssu Huang
- Department of Bioengineering, Stanford University
| |
Collapse
|
120
|
Mészáros B, Park E, Malinverni D, Sejdiu BI, Immadisetty K, Sandhu M, Lang B, Babu MM. Recent breakthroughs in computational structural biology harnessing the power of sequences and structures. Curr Opin Struct Biol 2023; 80:102608. [PMID: 37182396 DOI: 10.1016/j.sbi.2023.102608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/17/2023] [Indexed: 05/16/2023]
Abstract
Recent advances in computational approaches and their integration into structural biology enable tackling increasingly complex questions. Here, we discuss several key areas, highlighting breakthroughs and remaining challenges. Theoretical modeling has provided tools to accurately predict and design protein structures on a scale currently difficult to achieve using experimental approaches. Molecular Dynamics simulations have become faster and more precise, delivering actionable information inaccessible by current experimental methods. Virtual screening workflows allow a high-throughput approach to discover ligands that bind and modulate protein function, while Machine Learning methods enable the design of proteins with new functionalities. Integrative structural biology combines several of these approaches, pushing the frontiers of structural and functional characterization to ever larger systems, advancing towards a complete understanding of the living cell. These breakthroughs will accelerate and significantly impact diverse areas of science.
Collapse
Affiliation(s)
- Bálint Mészáros
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA.
| | - Electa Park
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA.
| | - Duccio Malinverni
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA. https://twitter.com/DucMalinverni
| | - Besian I Sejdiu
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA. https://twitter.com/bisejdiu
| | - Kalyan Immadisetty
- Department of Bone Marrow Transplantation & Cellular Therapy, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA. https://twitter.com/k_immadisetty
| | - Manbir Sandhu
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA. https://twitter.com/M5andhu
| | - Benjamin Lang
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA. https://twitter.com/langbnj
| | - M Madan Babu
- Department of Structural Biology and Center of Excellence for Data Driven Discovery, St Jude Children's Research Hospital, 262 Danny Thomas Place, Memphis, TN, 38105, USA.
| |
Collapse
|
121
|
Bennett NR, Coventry B, Goreshnik I, Huang B, Allen A, Vafeados D, Peng YP, Dauparas J, Baek M, Stewart L, DiMaio F, De Munck S, Savvides SN, Baker D. Improving de novo protein binder design with deep learning. Nat Commun 2023; 14:2625. [PMID: 37149653 PMCID: PMC10163288 DOI: 10.1038/s41467-023-38328-5] [Citation(s) in RCA: 56] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 04/24/2023] [Indexed: 05/08/2023] Open
Abstract
Recently it has become possible to de novo design high affinity protein binding proteins from target structural information alone. There is, however, considerable room for improvement as the overall design success rate is low. Here, we explore the augmentation of energy-based protein binder design using deep learning. We find that using AlphaFold2 or RoseTTAFold to assess the probability that a designed sequence adopts the designed monomer structure, and the probability that this structure binds the target as designed, increases design success rates nearly 10-fold. We find further that sequence design using ProteinMPNN rather than Rosetta considerably increases computational efficiency.
Collapse
Affiliation(s)
- Nathaniel R Bennett
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular Engineering Graduate Program, University of Washington, Seattle, WA, USA
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Inna Goreshnik
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Buwei Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Aza Allen
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Dionne Vafeados
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ying Po Peng
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justas Dauparas
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Minkyung Baek
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lance Stewart
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Steven De Munck
- VIB-UGent Center for Inflammation Research, Ghent, Belgium
- Unit for Structural Biology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - Savvas N Savvides
- VIB-UGent Center for Inflammation Research, Ghent, Belgium
- Unit for Structural Biology, Department of Biochemistry and Microbiology, Ghent University, Ghent, Belgium
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
122
|
Gainza P, Wehrle S, Van Hall-Beauvais A, Marchand A, Scheck A, Harteveld Z, Buckley S, Ni D, Tan S, Sverrisson F, Goverde C, Turelli P, Raclot C, Teslenko A, Pacesa M, Rosset S, Georgeon S, Marsden J, Petruzzella A, Liu K, Xu Z, Chai Y, Han P, Gao GF, Oricchio E, Fierz B, Trono D, Stahlberg H, Bronstein M, Correia BE. De novo design of protein interactions with learned surface fingerprints. Nature 2023; 617:176-184. [PMID: 37100904 PMCID: PMC10131520 DOI: 10.1038/s41586-023-05993-x] [Citation(s) in RCA: 60] [Impact Index Per Article: 60.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 03/21/2023] [Indexed: 04/28/2023]
Abstract
Physical interactions between proteins are essential for most biological processes governing life1. However, the molecular determinants of such interactions have been challenging to understand, even as genomic, proteomic and structural data increase. This knowledge gap has been a major obstacle for the comprehensive understanding of cellular protein-protein interaction networks and for the de novo design of protein binders that are crucial for synthetic biology and translational applications2-9. Here we use a geometric deep-learning framework operating on protein surfaces that generates fingerprints to describe geometric and chemical features that are critical to drive protein-protein interactions10. We hypothesized that these fingerprints capture the key aspects of molecular recognition that represent a new paradigm in the computational design of novel protein interactions. As a proof of principle, we computationally designed several de novo protein binders to engage four protein targets: SARS-CoV-2 spike, PD-1, PD-L1 and CTLA-4. Several designs were experimentally optimized, whereas others were generated purely in silico, reaching nanomolar affinity with structural and mutational characterization showing highly accurate predictions. Overall, our surface-centric approach captures the physical and chemical determinants of molecular recognition, enabling an approach for the de novo design of protein interactions and, more broadly, of artificial proteins with function.
Collapse
Affiliation(s)
- Pablo Gainza
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Monte Rosa Therapeutics, Basel, Switzerland
| | - Sarah Wehrle
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Alexandra Van Hall-Beauvais
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Anthony Marchand
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Andreas Scheck
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Zander Harteveld
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Stephen Buckley
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Dongchun Ni
- Laboratory of Biological Electron Microscopy, Institute of Physics, School of Basic Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| | - Shuguang Tan
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Freyr Sverrisson
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Casper Goverde
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Priscilla Turelli
- Laboratory of Virology and Genetics, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Charlène Raclot
- Laboratory of Virology and Genetics, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Alexandra Teslenko
- Laboratory of Biophysical Chemistry of Macromolecules, School of Basic Sciences, Institute of Chemical Sciences and Engineering (ISIC), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Martin Pacesa
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Stéphane Rosset
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Sandrine Georgeon
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Jane Marsden
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Aaron Petruzzella
- Swiss Institute for Experimental Cancer Research, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Kefang Liu
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Zepeng Xu
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Yan Chai
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Pu Han
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - George F Gao
- CAS Key Laboratory of Pathogen Microbiology and Immunology, Institute of Microbiology, Chinese Academy of Sciences, Beijing, China
| | - Elisa Oricchio
- Swiss Institute for Experimental Cancer Research, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Beat Fierz
- Laboratory of Biophysical Chemistry of Macromolecules, School of Basic Sciences, Institute of Chemical Sciences and Engineering (ISIC), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Didier Trono
- Laboratory of Virology and Genetics, School of Life Sciences, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Henning Stahlberg
- Laboratory of Biological Electron Microscopy, Institute of Physics, School of Basic Science, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| | | | - Bruno E Correia
- Laboratory of Protein Design and Immunoengineering, Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
123
|
Lee JS, Kim J, Kim PM. Score-based generative modeling for de novo protein design. NATURE COMPUTATIONAL SCIENCE 2023; 3:382-392. [PMID: 38177840 DOI: 10.1038/s43588-023-00440-3] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 03/30/2023] [Indexed: 01/06/2024]
Abstract
The generation of de novo protein structures with predefined functions and properties remains a challenging problem in protein design. Diffusion models, also known as score-based generative models (SGMs), have recently exhibited astounding empirical performance in image synthesis. Here we use image-based representations of protein structure to develop ProteinSGM, a score-based generative model that produces realistic de novo proteins. Through unconditional generation, we show that ProteinSGM can generate native-like protein structures, surpassing the performance of previously reported generative models. We experimentally validate some de novo designs and observe secondary structure compositions consistent with generated backbones. Finally, we apply conditional generation to de novo protein design by formulating it as an image inpainting problem, allowing precise and modular design of protein structure.
Collapse
Affiliation(s)
- Jin Sub Lee
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Jisun Kim
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Philip M Kim
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
124
|
New protein-protein interactions designed by a computer. Nature 2023:10.1038/d41586-023-01324-2. [PMID: 37101068 DOI: 10.1038/d41586-023-01324-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2023]
|
125
|
Ruffolo JA, Chu LS, Mahajan SP, Gray JJ. Fast, accurate antibody structure prediction from deep learning on massive set of natural antibodies. Nat Commun 2023; 14:2389. [PMID: 37185622 PMCID: PMC10129313 DOI: 10.1038/s41467-023-38063-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 04/14/2023] [Indexed: 05/17/2023] Open
Abstract
Antibodies have the capacity to bind a diverse set of antigens, and they have become critical therapeutics and diagnostic molecules. The binding of antibodies is facilitated by a set of six hypervariable loops that are diversified through genetic recombination and mutation. Even with recent advances, accurate structural prediction of these loops remains a challenge. Here, we present IgFold, a fast deep learning method for antibody structure prediction. IgFold consists of a pre-trained language model trained on 558 million natural antibody sequences followed by graph networks that directly predict backbone atom coordinates. IgFold predicts structures of similar or better quality than alternative methods (including AlphaFold) in significantly less time (under 25 s). Accurate structure prediction on this timescale makes possible avenues of investigation that were previously infeasible. As a demonstration of IgFold's capabilities, we predicted structures for 1.4 million paired antibody sequences, providing structural insights to 500-fold more antibodies than have experimentally determined structures.
Collapse
Affiliation(s)
- Jeffrey A Ruffolo
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Lee-Shin Chu
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Sai Pooja Mahajan
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Jeffrey J Gray
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, 21218, USA.
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD, 21218, USA.
| |
Collapse
|
126
|
Lutz ID, Wang S, Norn C, Courbet A, Borst AJ, Zhao YT, Dosey A, Cao L, Xu J, Leaf EM, Treichel C, Litvicov P, Li Z, Goodson AD, Rivera-Sánchez P, Bratovianu AM, Baek M, King NP, Ruohola-Baker H, Baker D. Top-down design of protein architectures with reinforcement learning. Science 2023; 380:266-273. [PMID: 37079676 DOI: 10.1126/science.adf6591] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 03/21/2023] [Indexed: 04/22/2023]
Abstract
As a result of evolutionary selection, the subunits of naturally occurring protein assemblies often fit together with substantial shape complementarity to generate architectures optimal for function in a manner not achievable by current design approaches. We describe a "top-down" reinforcement learning-based design approach that solves this problem using Monte Carlo tree search to sample protein conformers in the context of an overall architecture and specified functional constraints. Cryo-electron microscopy structures of the designed disk-shaped nanopores and ultracompact icosahedra are very close to the computational models. The icosohedra enable very-high-density display of immunogens and signaling molecules, which potentiates vaccine response and angiogenesis induction. Our approach enables the top-down design of complex protein nanomaterials with desired system properties and demonstrates the power of reinforcement learning in protein design.
Collapse
Affiliation(s)
- Isaac D Lutz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Shunzhi Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Christoffer Norn
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- BioInnovation Institute, DK2200 Copenhagen N, Denmark
| | - Alexis Courbet
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Andrew J Borst
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Yan Ting Zhao
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Oral Health Sciences, University of Washington, Seattle, WA, USA
| | - Annie Dosey
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Longxing Cao
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Jinwei Xu
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Elizabeth M Leaf
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Catherine Treichel
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Patrisia Litvicov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Zhe Li
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alexander D Goodson
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | - Minkyung Baek
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - Neil P King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hannele Ruohola-Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Oral Health Sciences, University of Washington, Seattle, WA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
127
|
Durham J, Zhang J, Humphreys IR, Pei J, Cong Q. Recent advances in predicting and modeling protein-protein interactions. Trends Biochem Sci 2023; 48:527-538. [PMID: 37061423 DOI: 10.1016/j.tibs.2023.03.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 03/03/2023] [Accepted: 03/17/2023] [Indexed: 04/17/2023]
Abstract
Protein-protein interactions (PPIs) drive biological processes, and disruption of PPIs can cause disease. With recent breakthroughs in structure prediction and a deluge of genomic sequence data, computational methods to predict PPIs and model spatial structures of protein complexes are now approaching the accuracy of experimental approaches for permanent interactions and show promise for elucidating transient interactions. As we describe here, the key to this success is rich evolutionary information deciphered from thousands of homologous sequences that coevolve in interacting partners. This covariation signal, revealed by sophisticated statistical and machine learning (ML) algorithms, predicts physiological interactions. Accurate artificial intelligence (AI)-based modeling of protein structures promises to provide accurate 3D models of PPIs at a proteome-wide scale.
Collapse
Affiliation(s)
- Jesse Durham
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Jing Zhang
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Ian R Humphreys
- Department of Biochemistry, University of Washington, Seattle, WA, USA; Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jimin Pei
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA
| | - Qian Cong
- Eugene McDermott Center for Human Growth and Development, University of Texas Southwestern Medical Center, Dallas, TX, USA; Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, USA; Harold C. Simmons Comprehensive Cancer Center, University of Texas Southwestern Medical Center, Dallas, TX, USA.
| |
Collapse
|
128
|
Dai C, Wang J, Tu L, Pan Z, Yang J, Zhou S, Luo Q, Zhu L, Ye Y. Genetically-encoded degraders as versatile modulators of intracellular therapeutic targets. CURRENT OPINION IN BIOMEDICAL ENGINEERING 2023. [DOI: 10.1016/j.cobme.2023.100458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/09/2023]
|
129
|
Makarova KS, Wolf YI, Koonin EV. In Silico Approaches for Prediction of Anti-CRISPR Proteins. J Mol Biol 2023; 435:168036. [PMID: 36868398 PMCID: PMC10073340 DOI: 10.1016/j.jmb.2023.168036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 02/18/2023] [Accepted: 02/23/2023] [Indexed: 03/05/2023]
Abstract
Numerous viruses infecting bacteria and archaea encode CRISPR-Cas system inhibitors, known as anti-CRISPR proteins (Acr). The Acrs typically are highly specific for particular CRISPR variants, resulting in remarkable sequence and structural diversity and complicating accurate prediction and identification of Acrs. In addition to their intrinsic interest for understanding the coevolution of defense and counter-defense systems in prokaryotes, Acrs could be natural, potent on-off switches for CRISPR-based biotechnological tools, so their discovery, characterization and application are of major importance. Here we discuss the computational approaches for Acr prediction. Due to the enormous diversity and likely multiple origins of the Acrs, sequence similarity searches are of limited use. However, multiple features of protein and gene organization have been successfully harnessed to this end including small protein size and distinct amino acid compositions of the Acrs, association of acr genes in virus genomes with genes encoding helix-turn-helix proteins that regulate Acr expression (Acr-associated proteins, Aca), and presence of self-targeting CRISPR spacers in bacterial and archaeal genomes containing Acr-encoding proviruses. Productive approaches for Acr prediction also involve genome comparison of closely related viruses, of which one is resistant and the other one is sensitive to a particular CRISPR variant, and "guilt by association" whereby genes adjacent to a homolog of a known Aca are identified as candidate Acrs. The distinctive features of Acrs are employed for Acr prediction both by developing dedicated search algorithms and through machine learning. New approaches will be needed to identify novel types of Acrs that are likely to exist.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, USA.
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, NIH, Bethesda, USA
| |
Collapse
|
130
|
Cummins MC, Tripathy A, Sondek J, Kuhlman B. De novo design of stable proteins that efficaciously inhibit oncogenic G proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.28.534629. [PMID: 37034763 PMCID: PMC10081213 DOI: 10.1101/2023.03.28.534629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]
Abstract
Many protein therapeutics are competitive inhibitors that function by binding to endogenous proteins and preventing them from interacting with native partners. One effective strategy for engineering competitive inhibitors is to graft structural motifs from a native partner into a host protein. Here, we develop and experimentally test a computational protocol for embedding binding motifs in de novo designed proteins. The protocol uses an "inside-out" approach: Starting with a structural model of the binding motif docked against the target protein, the de novo protein is built by growing new structural elements off the termini of the binding motif. During backbone assembly, a score function favors backbones that introduce new tertiary contacts within the designed protein and do not introduce clashes with the target binding partner. Final sequences are designed and optimized using the molecular modeling program Rosetta. To test our protocol, we designed small helical proteins to inhibit the interaction between Gα q and its effector PLC-β isozymes. Several of the designed proteins remain folded above 90°C and bind to Gα q with equilibrium dissociation constants tighter than 80 nM. In cellular assays with oncogenic variants of Gα q , the designed proteins inhibit activation of PLC-β isozymes and Dbl-family RhoGEFs. Our results demonstrate that computational protein design, in combination with motif grafting, can be used to directly generate potent inhibitors without further optimization via high throughput screening or selection. statement for broader audience Engineered proteins that bind to specific target proteins are useful as research reagents, diagnostics, and therapeutics. We used computational protein design to engineer de novo proteins that bind and competitively inhibit the G protein, Gα q , which is an oncogene for uveal melanomas. This computational method is a general approach that should be useful for designing competitive inhibitors against other proteins of interest.
Collapse
Affiliation(s)
- Matthew C. Cummins
- Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Ashutosh Tripathy
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - John Sondek
- Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
131
|
Ramírez-Palacios C, Marrink SJ. Super High-Throughput Screening of Enzyme Variants by Spectral Graph Convolutional Neural Networks. J Chem Theory Comput 2023. [PMID: 36961994 PMCID: PMC10373491 DOI: 10.1021/acs.jctc.2c01227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/26/2023]
Abstract
Finding new enzyme variants with the desired substrate scope requires screening through a large number of potential variants. In a typical in silico enzyme engineering workflow, it is possible to scan a few thousands of variants, and gather several candidates for further screening or experimental verification. In this work, we show that a Graph Convolutional Neural Network (GCN) can be trained to predict the binding energy of combinatorial libraries of enzyme complexes using only sequence information. The GCN model uses a stack of message-passing and graph pooling layers to extract information from the protein input graph and yield a prediction. The GCN model is agnostic to the identity of the ligand, which is kept constant within the mutant libraries. Using a miniscule subset of the total combinatorial space (204-208 mutants) as training data, the proposed GCN model achieves a high accuracy in predicting the binding energy of unseen variants. The network's accuracy was further improved by injecting feature embeddings obtained from a language module pretrained on 10 million protein sequences. Since no structural information is needed to evaluate new variants, the deep learning algorithm is capable of scoring an enzyme variant in under 1 ms, allowing the search of billions of candidates on a single GPU.
Collapse
Affiliation(s)
- Carlos Ramírez-Palacios
- Molecular Dynamics, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| | - Siewert J Marrink
- Molecular Dynamics, Groningen Biomolecular Sciences and Biotechnology Institute (GBB), University of Groningen, Nijenborgh 7, 9747 AG Groningen, The Netherlands
| |
Collapse
|
132
|
Baron L, Hadjerci J, Thoidingjam L, Plays M, Bucci R, Morris N, Müller S, Sindikubwabo F, Solier S, Cañeque T, Colombeau L, Blouin CM, Lamaze C, Puisieux A, Bono Y, Gaillet C, Laraia L, Vauzeilles B, Taran F, Papot S, Karoyan P, Duval R, Mahuteau-Betzer F, Arimondo P, Cariou K, Guichard G, Micouin L, Ethève-Quelquejeu M, Verga D, Versini A, Gasser G, Tang C, Belmont P, Linkermann A, Bonfio C, Gillingham D, Poulsen T, Di Antonio M, Lopez M, Guianvarc'h D, Thomas C, Masson G, Gautier A, Johannes L, Rodriguez R. PSL Chemical Biology Symposia Third Edition: A Branch of Science in its Explosive Phase. Chembiochem 2023; 24:e202300093. [PMID: 36942862 DOI: 10.1002/cbic.202300093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Indexed: 03/23/2023]
Abstract
This symposium is the third PSL (Paris Sciences & Lettres) Chemical Biology meeting (2016, 2019, 2023) held at Institut Curie. This initiative originally started at Institut de Chimie des Substances Naturelles (ICSN) in Gif-sur-Yvette (2013, 2014), under the directorship of Professor Max Malacria, with a strong focus on chemistry. It was then continued at the Institut Curie (2015) covering a larger scope, before becoming the official PSL Chemical Biology meeting. This latest edition was postponed twice for the reasons that we know. This has given us the opportunity to invite additional speakers of great standing. This year, Institut Curie hosted around 300 participants, including 220 on site and over 80 online. The pandemic has had, at least, the virtue of promoting online meetings, which we came to realize is not perfect but has its own merits. In particular, it enables those with restricted time and resources to take part in events and meetings, which can now accommodate unlimited participants. We apologize to all those who could not attend in person this time due to space limitation at Institut Curie.
Collapse
Affiliation(s)
- Leeroy Baron
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Justine Hadjerci
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Leishemba Thoidingjam
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Marina Plays
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Romain Bucci
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Nolwenn Morris
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Sebastian Müller
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Fabien Sindikubwabo
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Stéphanie Solier
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Tatiana Cañeque
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Ludovic Colombeau
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Cedric M Blouin
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Christophe Lamaze
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Alain Puisieux
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Yannick Bono
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Christine Gaillet
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Luca Laraia
- Technical University of Denmark, Department of Chemistry, 2800, Kgs. Lyngby, Denmark
| | - Boris Vauzeilles
- Université Paris-Saclay, CNRS UPR 2301, 91198, Gif-sur-Yvette, France
| | - Frédéric Taran
- Université Paris-Saclay, CEA, 91191, Gif-sur-Yvette, France
| | - Sébastien Papot
- Université de Poitiers, CNRS UMR 7285, 86073, Poitiers, France
| | - Philippe Karoyan
- PSL Université Paris, Sorbonne Université Ecole Normale Supérieure, CNRS UMR 7203, 75005, Paris, France
| | - Romain Duval
- Faculté de Pharmacie de Paris, Université Paris Cité CNRS UMR 261, 75006, Paris, France
| | | | | | - Kevin Cariou
- PSL Université Paris, Chimie ParisTech, CNRS, Institute of Chemistry and Health Sciences CNRS UMR 8060, 75005, Paris, France
| | - Gilles Guichard
- Université de Bordeaux, CNRS, Bordeaux INP CBMN, UMR 5248, 33600, Pessac, France
| | | | | | - Daniela Verga
- PSL Université Paris, Institut Curie CNRS UMR 9187, INSERM U1196, 91405, Orsay, France
| | - Antoine Versini
- University of Zurich, Department of Chemistry, 8057, Zurich, Switzerland
| | - Gilles Gasser
- PSL Université Paris, Chimie ParisTech, CNRS, Institute of Chemistry and Health Sciences CNRS UMR 8060, 75005, Paris, France
| | - Cong Tang
- Universidade de Lisboa, Instituto de Medicina Molecular João Lobo Antunes, 1649-028, Lisboa, Portugal
| | | | - Andreas Linkermann
- Technische Universität Dresden Department of Internal Medicine 3, 01062, Dresden, Germany
| | - Claudia Bonfio
- Université de Strasbourg, CNRS UMR 7006, 67000, Strasbourg, France
| | | | - Thomas Poulsen
- Aarhus University, Department of Chemistry, 8000, Aarhus C Aarhus, Denmark
| | - Marco Di Antonio
- Imperial College London, Molecular Sciences Research Hub, London, W12 0BZ, UK
| | - Marie Lopez
- Université de Montpellier, CNRS UMR 5247, 34000, Montpellier, France
| | | | - Christophe Thomas
- PSL Université Paris, Chimie ParisTech CNRS UMR 6226, 75005, Paris, France
| | - Géraldine Masson
- Université Paris-Saclay, CNRS UPR 2301, 91198, Gif-sur-Yvette, France
| | - Arnaud Gautier
- Sorbonne Université, École Normale Supérieure, Université PSL, CNRS, Laboratoire des Biomolécules, LBM, 75005, Paris, France
| | - Ludger Johannes
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| | - Raphaël Rodriguez
- Institut Curie, Department of Cellular and Chemical Biology, UMR 3666 CNRS, U1143 INSERM, PSL Université Paris, 75005, Paris, France
| |
Collapse
|
133
|
Roda S, Terholsen H, Meyer JRH, Cañellas-Solé A, Guallar V, Bornscheuer U, Kazemi M. AsiteDesign: a Semirational Algorithm for an Automated Enzyme Design. J Phys Chem B 2023; 127:2661-2670. [PMID: 36944360 PMCID: PMC10068746 DOI: 10.1021/acs.jpcb.2c07091] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
With advances in protein structure predictions, the number of available high-quality structures has increased dramatically. In light of these advances, structure-based enzyme engineering is expected to become increasingly important for optimizing biocatalysts for industrial processes. Here, we present AsiteDesign, a Monte Carlo-based protocol for structure-based engineering of active sites. AsiteDesign provides a framework for introducing new catalytic residues in a given binding pocket to either create a new catalytic activity or alter the existing one. AsiteDesign is implemented using pyRosetta and incorporates enhanced sampling techniques to efficiently explore the search space. The protocol was tested by designing an alternative catalytic triad in the active site of Pseudomonas fluorescens esterase (PFE). The designed variant was experimentally verified to be active, demonstrating that AsiteDesign can find alternative catalytic triads. Additionally, the AsiteDesign protocol was employed to enhance the hydrolysis of a bulky chiral substrate (1-phenyl-2-pentyl acetate) by PFE. The experimental verification of the designed variants demonstrated that F158L/F198A and F125A/F158L mutations increased the hydrolysis of 1-phenyl-2-pentyl acetate from 8.9 to 66.7 and 23.4%, respectively, and reversed the enantioselectivity of the enzyme from (R) to (S)-enantiopreference, with 32 and 55% enantiomeric excess (ee), respectively.
Collapse
Affiliation(s)
- Sergi Roda
- Barcelona Supercomputing Center (BSC), Plaça d'Eusebi Güell, 1-3, Barcelona 08034, Spain
| | - Henrik Terholsen
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University of Greifswald, Felix-Hausdorff-Str. 4, D-17487 Greifswald, Germany
| | - Jule Ruth Heike Meyer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University of Greifswald, Felix-Hausdorff-Str. 4, D-17487 Greifswald, Germany
| | - Albert Cañellas-Solé
- Barcelona Supercomputing Center (BSC), Plaça d'Eusebi Güell, 1-3, Barcelona 08034, Spain
| | - Victor Guallar
- Barcelona Supercomputing Center (BSC), Plaça d'Eusebi Güell, 1-3, Barcelona 08034, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig de Lluís Companys, 23, Barcelona 08010, Spain
| | - Uwe Bornscheuer
- Department of Biotechnology & Enzyme Catalysis, Institute of Biochemistry, University of Greifswald, Felix-Hausdorff-Str. 4, D-17487 Greifswald, Germany
| | - Masoud Kazemi
- Barcelona Supercomputing Center (BSC), Plaça d'Eusebi Güell, 1-3, Barcelona 08034, Spain
- Biomatter Designs, Žirmu̅n̨ g. 139A, Vilnius 09120, Lithuania
| |
Collapse
|
134
|
Malbranke C, Bikard D, Cocco S, Monasson R, Tubiana J. Machine learning for evolutionary-based and physics-inspired protein design: Current and future synergies. Curr Opin Struct Biol 2023; 80:102571. [PMID: 36947951 DOI: 10.1016/j.sbi.2023.102571] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 01/29/2023] [Accepted: 02/07/2023] [Indexed: 03/24/2023]
Abstract
Computational protein design facilitates the discovery of novel proteins with prescribed structure and functionality. Exciting designs were recently reported using novel data-driven methodologies that can be roughly divided into two categories: evolutionary-based and physics-inspired approaches. The former infer characteristic sequence features shared by sets of evolutionary-related proteins, such as conserved or coevolving positions, and recombine them to generate candidates with similar structure and function. The latter approaches estimate key biochemical properties, such as structure free energy, conformational entropy, or binding affinities using machine learning surrogates, and optimize them to yield improved designs. Here, we review recent progress along both tracks, discuss their strengths and weaknesses, and highlight opportunities for synergistic approaches.
Collapse
Affiliation(s)
- Cyril Malbranke
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France; Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, 75015 Paris, France.
| | - David Bikard
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, 75015 Paris, France
| | - Simona Cocco
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France
| | - Rémi Monasson
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France
| | - Jérôme Tubiana
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
135
|
Lin Z, Akin H, Rao R, Hie B, Zhu Z, Lu W, Smetanin N, Verkuil R, Kabeli O, Shmueli Y, Dos Santos Costa A, Fazel-Zarandi M, Sercu T, Candido S, Rives A. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science 2023; 379:1123-1130. [PMID: 36927031 DOI: 10.1126/science.ade2574] [Citation(s) in RCA: 753] [Impact Index Per Article: 753.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]
Abstract
Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a large language model. As language models of protein sequences are scaled up to 15 billion parameters, an atomic-resolution picture of protein structure emerges in the learned representations. This results in an order-of-magnitude acceleration of high-resolution structure prediction, which enables large-scale structural characterization of metagenomic proteins. We apply this capability to construct the ESM Metagenomic Atlas by predicting structures for >617 million metagenomic protein sequences, including >225 million that are predicted with high confidence, which gives a view into the vast breadth and diversity of natural proteins.
Collapse
Affiliation(s)
- Zeming Lin
- FAIR, Meta AI, New York, NY, USA
- New York University, New York, NY, USA
| | | | | | - Brian Hie
- FAIR, Meta AI, New York, NY, USA
- Stanford University, Palo Alto, CA, USA
| | | | | | | | | | | | | | | | | | | | | | - Alexander Rives
- FAIR, Meta AI, New York, NY, USA
- New York University, New York, NY, USA
| |
Collapse
|
136
|
Determinants for an Efficient Enzymatic Catalysis in Poly(Ethylene Terephthalate) Degradation. Catalysts 2023. [DOI: 10.3390/catal13030591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
The enzymatic degradation of the recalcitrant poly(ethylene terephthalate) (PET) has been an important biotechnological goal. The present review focuses on the state of the art in enzymatic degradation of PET, and the challenges ahead. This review covers (i) enzymes acting on PET, (ii) protein improvements through selection or engineering, (iii) strategies to improve biocatalyst–polymer interaction and monomer yields. Finally, this review discusses critical points on PET degradation, and their related experimental aspects, that include the control of physicochemical parameters. The search for, and engineering of, PET hydrolases, have been widely studied to achieve this, and several examples are discussed here. Many enzymes, from various microbial sources, have been studied and engineered, but recently true PET hydrolases (PETases), active at moderate temperatures, were reported. For a circular economy process, terephtalic acid (TPA) production is critical. Some thermophilic cutinases and engineered PETases have been reported to release terephthalic acid in significant amounts. Some bottlenecks in enzyme performance are discussed, including enzyme activity, thermal stability, substrate accessibility, PET microstructures, high crystallinity, molecular mass, mass transfer, and efficient conversion into reusable fragments.
Collapse
|
137
|
Yang Z, Zeng X, Zhao Y, Chen R. AlphaFold2 and its applications in the fields of biology and medicine. Signal Transduct Target Ther 2023; 8:115. [PMID: 36918529 PMCID: PMC10011802 DOI: 10.1038/s41392-023-01381-z] [Citation(s) in RCA: 104] [Impact Index Per Article: 104.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 12/27/2022] [Accepted: 02/16/2023] [Indexed: 03/16/2023] Open
Abstract
AlphaFold2 (AF2) is an artificial intelligence (AI) system developed by DeepMind that can predict three-dimensional (3D) structures of proteins from amino acid sequences with atomic-level accuracy. Protein structure prediction is one of the most challenging problems in computational biology and chemistry, and has puzzled scientists for 50 years. The advent of AF2 presents an unprecedented progress in protein structure prediction and has attracted much attention. Subsequent release of structures of more than 200 million proteins predicted by AF2 further aroused great enthusiasm in the science community, especially in the fields of biology and medicine. AF2 is thought to have a significant impact on structural biology and research areas that need protein structure information, such as drug discovery, protein design, prediction of protein function, et al. Though the time is not long since AF2 was developed, there are already quite a few application studies of AF2 in the fields of biology and medicine, with many of them having preliminarily proved the potential of AF2. To better understand AF2 and promote its applications, we will in this article summarize the principle and system architecture of AF2 as well as the recipe of its success, and particularly focus on reviewing its applications in the fields of biology and medicine. Limitations of current AF2 prediction will also be discussed.
Collapse
Affiliation(s)
- Zhenyu Yang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Xiaoxi Zeng
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
| | - Yi Zhao
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Runsheng Chen
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- Pingshan Translational Medicine Center, Shenzhen Bay Laboratory, Shenzhen, 518118, China.
| |
Collapse
|
138
|
Agajanian S, Alshahrani M, Bai F, Tao P, Verkhivker GM. Exploring and Learning the Universe of Protein Allostery Using Artificial Intelligence Augmented Biophysical and Computational Approaches. J Chem Inf Model 2023; 63:1413-1428. [PMID: 36827465 PMCID: PMC11162550 DOI: 10.1021/acs.jcim.2c01634] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
Allosteric mechanisms are commonly employed regulatory tools used by proteins to orchestrate complex biochemical processes and control communications in cells. The quantitative understanding and characterization of allosteric molecular events are among major challenges in modern biology and require integration of innovative computational experimental approaches to obtain atomistic-level knowledge of the allosteric states, interactions, and dynamic conformational landscapes. The growing body of computational and experimental studies empowered by emerging artificial intelligence (AI) technologies has opened up new paradigms for exploring and learning the universe of protein allostery from first principles. In this review we analyze recent developments in high-throughput deep mutational scanning of allosteric protein functions; applications and latest adaptations of Alpha-fold structural prediction methods for studies of protein dynamics and allostery; new frontiers in integrating machine learning and enhanced sampling techniques for characterization of allostery; and recent advances in structural biology approaches for studies of allosteric systems. We also highlight recent computational and experimental studies of the SARS-CoV-2 spike (S) proteins revealing an important and often hidden role of allosteric regulation driving functional conformational changes, binding interactions with the host receptor, and mutational escape mechanisms of S proteins which are critical for viral infection. We conclude with a summary and outlook of future directions suggesting that AI-augmented biophysical and computer simulation approaches are beginning to transform studies of protein allostery toward systematic characterization of allosteric landscapes, hidden allosteric states, and mechanisms which may bring about a new revolution in molecular biology and drug discovery.
Collapse
Affiliation(s)
- Steve Agajanian
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Fang Bai
- Shanghai Institute for Advanced Immunochemical Studies, School of Life Science and Technology and Information Science and Technology, Shanghai Tech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75205, United States
| | - Gennady M Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
| |
Collapse
|
139
|
Omar SI, Keasar C, Ben-Sasson AJ, Haber E. Protein Design Using Physics Informed Neural Networks. Biomolecules 2023; 13:biom13030457. [PMID: 36979392 PMCID: PMC10046838 DOI: 10.3390/biom13030457] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 02/16/2023] [Accepted: 02/27/2023] [Indexed: 03/06/2023] Open
Abstract
The inverse protein folding problem, also known as protein sequence design, seeks to predict an amino acid sequence that folds into a specific structure and performs a specific function. Recent advancements in machine learning techniques have been successful in generating functional sequences, outperforming previous energy function-based methods. However, these machine learning methods are limited in their interoperability and robustness, especially when designing proteins that must function under non-ambient conditions, such as high temperature, extreme pH, or in various ionic solvents. To address this issue, we propose a new Physics-Informed Neural Networks (PINNs)-based protein sequence design approach. Our approach combines all-atom molecular dynamics simulations, a PINNs MD surrogate model, and a relaxation of binary programming to solve the protein design task while optimizing both energy and the structural stability of proteins. We demonstrate the effectiveness of our design framework in designing proteins that can function under non-ambient conditions.
Collapse
Affiliation(s)
| | - Chen Keasar
- Department of Computer Science, Ben Gurion University of the Negev, Be’er Sheva 84105, Israel
| | - Ariel J. Ben-Sasson
- Independent Researcher, Haifa 3436301, Israel
- Correspondence: (A.J.B.-S.); (E.H.)
| | - Eldad Haber
- Department of Earth Ocean and Atmospheric Sciences, University of British Columbia, Vancouver, BC V6T 1Z4, Canada
- Correspondence: (A.J.B.-S.); (E.H.)
| |
Collapse
|
140
|
Rettie SA, Campbell KV, Bera AK, Kang A, Kozlov S, De La Cruz J, Adebomi V, Zhou G, DiMaio F, Ovchinnikov S, Bhardwaj G. Cyclic peptide structure prediction and design using AlphaFold. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.25.529956. [PMID: 36865323 PMCID: PMC9980166 DOI: 10.1101/2023.02.25.529956] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/28/2023]
Abstract
Deep learning networks offer considerable opportunities for accurate structure prediction and design of biomolecules. While cyclic peptides have gained significant traction as a therapeutic modality, developing deep learning methods for designing such peptides has been slow, mostly due to the small number of available structures for molecules in this size range. Here, we report approaches to modify the AlphaFold network for accurate structure prediction and design of cyclic peptides. Our results show this approach can accurately predict the structures of native cyclic peptides from a single sequence, with 36 out of 49 cases predicted with high confidence (pLDDT > 0.85) matching the native structure with root mean squared deviation (RMSD) less than 1.5 Å. Further extending our approach, we describe computational methods for designing sequences of peptide backbones generated by other backbone sampling methods and for de novo design of new macrocyclic peptides. We extensively sampled the structural diversity of cyclic peptides between 7-13 amino acids, and identified around 10,000 unique design candidates predicted to fold into the designed structures with high confidence. X-ray crystal structures for seven sequences with diverse sizes and structures designed by our approach match very closely with the design models (root mean squared deviation < 1.0 Å), highlighting the atomic level accuracy in our approach. The computational methods and scaffolds developed here provide the basis for custom-designing peptides for targeted therapeutic applications.
Collapse
Affiliation(s)
- Stephen A. Rettie
- Molecular and Cell Biology program, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Katelyn V. Campbell
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Asim K. Bera
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alex Kang
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Simon Kozlov
- FAS Division of Science, Harvard University, Cambridge, MA, USA
| | - Joshmyn De La Cruz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Victor Adebomi
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA
| | - Guangfeng Zhou
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Frank DiMaio
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship, Harvard University, Cambridge, MA, USA
- FAS Division of Science, Harvard University, Cambridge, MA, USA
| | - Gaurav Bhardwaj
- Molecular and Cell Biology program, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Medicinal Chemistry, University of Washington, Seattle, WA, USA
| |
Collapse
|
141
|
Yu T, Boob AG, Volk MJ, Liu X, Cui H, Zhao H. Machine learning-enabled retrobiosynthesis of molecules. Nat Catal 2023. [DOI: 10.1038/s41929-022-00909-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
|
142
|
Wang X, Xu K, Tan Y, Liu S, Zhou J. Possibilities of Using De Novo Design for Generating Diverse Functional Food Enzymes. Int J Mol Sci 2023; 24:3827. [PMID: 36835238 PMCID: PMC9964944 DOI: 10.3390/ijms24043827] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 02/03/2023] [Accepted: 02/03/2023] [Indexed: 02/17/2023] Open
Abstract
Food enzymes have an important role in the improvement of certain food characteristics, such as texture improvement, elimination of toxins and allergens, production of carbohydrates, enhancing flavor/appearance characteristics. Recently, along with the development of artificial meats, food enzymes have been employed to achieve more diverse functions, especially in converting non-edible biomass to delicious foods. Reported food enzyme modifications for specific applications have highlighted the significance of enzyme engineering. However, using direct evolution or rational design showed inherent limitations due to the mutation rates, which made it difficult to satisfy the stability or specific activity needs for certain applications. Generating functional enzymes using de novo design, which highly assembles naturally existing enzymes, provides potential solutions for screening desired enzymes. Here, we describe the functions and applications of food enzymes to introduce the need for food enzymes engineering. To illustrate the possibilities of using de novo design for generating diverse functional proteins, we reviewed protein modelling and de novo design methods and their implementations. The future directions for adding structural data for de novo design model training, acquiring diversified training data, and investigating the relationship between enzyme-substrate binding and activity were highlighted as challenges to overcome for the de novo design of food enzymes.
Collapse
Affiliation(s)
- Xinglong Wang
- Engineering Research Center of Ministry of Education on Food Synthetic Biotechnology, School of Biotechnology, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Kangjie Xu
- Engineering Research Center of Ministry of Education on Food Synthetic Biotechnology, School of Biotechnology, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Yameng Tan
- Engineering Research Center of Ministry of Education on Food Synthetic Biotechnology, School of Biotechnology, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Song Liu
- Engineering Research Center of Ministry of Education on Food Synthetic Biotechnology, School of Biotechnology, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
| | - Jingwen Zhou
- Engineering Research Center of Ministry of Education on Food Synthetic Biotechnology, School of Biotechnology, Jiangnan University, Wuxi 214122, China
- Science Center for Future Foods, Jiangnan University, Wuxi 214122, China
- Jiangsu Province Engineering Research Center of Food Synthetic Biotechnology, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
143
|
Guffy SL, Pulavarti SVSRK, Harrison J, Fleming D, Szyperski T, Kuhlman B. Inside-Out Design of Zinc-Binding Proteins with Non-Native Backbones. Biochemistry 2023; 62:770-781. [PMID: 36634348 PMCID: PMC9939277 DOI: 10.1021/acs.biochem.2c00595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The de novo design of functional proteins requires specification of tertiary structure and incorporation of molecular binding sites. Here, we develop an inside-out design strategy in the molecular modeling program Rosetta that begins with amino acid side chains from one or two α-helices making well-defined contacts with a ligand. A full-sized protein is then built around the ligand by adding additional helices that promote the formation of a protein core and allow additional contacts with the ligand. The protocol was tested by designing 12 zinc-binding proteins, each with 4-5 helices. Four of the designs were folded and bound to zinc with equilibrium dissociation constants varying between 95 nM and 1.1 μM. The design with the tightest affinity for zinc, N12, adopts a unique conformation in the folded state as assessed with nuclear magnetic resonance (NMR) and the design model closely matches (backbone root-mean-square deviation (RMSD) < 1 Å) an AlphaFold model of the sequence. Retrospective analysis with AlphaFold suggests that the sequences of many of the failed designs did not encode the desired tertiary packing.
Collapse
Affiliation(s)
- Sharon L. Guffy
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | | | - Joseph Harrison
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Drew Fleming
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Thomas Szyperski
- Department of Chemistry, State University of New York, Buffalo, NY, 14260, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
144
|
Kong N, Ma H, Pu Z, Wan F, Li D, Huang L, Lian J, Huang X, Ling S, Yu H, Yao Y. De Novo Design and Synthesis of Polypeptide Immunomodulators for Resetting Macrophage Polarization. BIODESIGN RESEARCH 2023; 5:0006. [PMID: 37849457 PMCID: PMC10521685 DOI: 10.34133/bdr.0006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 12/25/2022] [Indexed: 10/19/2023] Open
Abstract
Modulating the extracellular matrix microenvironment is critical for achieving the desired macrophage phenotype in immune investigations or tumor therapy. Combining de novo protein design and biosynthesis techniques, herein, we designed a biomimetic polypeptide self-assembled nano-immunomodulator to trigger the activation of a specific macrophage phenotype. It was intended to be made up of (GGSGGPGGGPASAAANSASRATSNSP)n, the RGD motif from collagen, and the IKVAV motif from laminin. The combination of these domains allows the biomimetic polypeptide to assemble into extracellular matrix-like nanofibrils, creating an extracellular matrix-like milieu for macrophages. Furthermore, changing the concentration further provides a facile route to fine-tune macrophage polarization, which enhances antitumor immune responses by precisely resetting tumor-associated macrophage immune responses into an M1-like phenotype, which is generally considered to be tumor-killing macrophages, primarily antitumor, and immune-promoting. Unlike metal or synthetic polymer-based nanoparticles, this polypeptide-based nanomaterial exhibits excellent biocompatibility, high efficacy, and precise tunability in immunomodulatory effectiveness. These encouraging findings motivate us to continue our research into cancer immunotherapy applications in the future.
Collapse
Affiliation(s)
- Na Kong
- School of Physical Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Hongru Ma
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| | - Zhongji Pu
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| | - Fengju Wan
- School of Physical Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Dongfang Li
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
| | - Lei Huang
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| | - Jiazhang Lian
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| | - Xingxu Huang
- Zhejiang Lab, Hangzhou, Zhejiang 311121, China
- School of Life Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Shengjie Ling
- School of Physical Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
- Shanghai Clinical Research and Trial Center, Shanghai 201210, China
| | - Haoran Yu
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| | - Yuan Yao
- School of Physical Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou 311215, China
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang 310027, China
| |
Collapse
|
145
|
Chang L, Perez A. Ranking Peptide Binders by Affinity with AlphaFold. Angew Chem Int Ed Engl 2023; 62:e202213362. [PMID: 36542066 DOI: 10.1002/anie.202213362] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 12/24/2022]
Abstract
AlphaFold has revolutionized structural biology by predicting highly accurate structures of proteins and their complexes with peptides and other proteins. However, for protein-peptide systems, we are also interested in identifying the highest affinity binder among a set of candidate peptides. We present a novel competitive binding assay using AlphaFold to predict structures of the receptor in the presence of two peptides. For systems in which the individual structures of the peptides are well predicted, the assay captures the higher affinity binder in the bound state, and the other peptide in the unbound form with statistical significance. We test the application on six protein receptors for which we have experimental binding affinities to several peptides. We find that the assay is best suited for identifying medium to strong peptide binders that adopt stable secondary structures upon binding.
Collapse
Affiliation(s)
- Liwei Chang
- Department of Chemistry, University of Florida, Gainesville, FL, USA.,Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Alberto Perez
- Department of Chemistry, University of Florida, Gainesville, FL, USA.,Quantum Theory Project, University of Florida, Gainesville, FL, USA
| |
Collapse
|
146
|
Chen S, Xu Z, Ding B, Zhang Y, Liu S, Cai C, Li M, Dale BE, Jin M. Big data mining, rational modification, and ancestral sequence reconstruction inferred multiple xylose isomerases for biorefinery. SCIENCE ADVANCES 2023; 9:eadd8835. [PMID: 36724227 PMCID: PMC9891696 DOI: 10.1126/sciadv.add8835] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Accepted: 12/30/2022] [Indexed: 05/28/2023]
Abstract
The isomerization of xylose to xylulose is considered the most promising approach to initiate xylose bioconversion. Here, phylogeny-guided big data mining, rational modification, and ancestral sequence reconstruction strategies were implemented to explore new active xylose isomerases (XIs) for Saccharomyces cerevisiae. Significantly, 13 new active XIs for S. cerevisiae were mined or artificially created. Moreover, the importance of the amino-terminal fragment for maintaining basic XI activity was demonstrated. With the mined XIs, four efficient xylose-utilizing S. cerevisiae were constructed and evolved, among which the strain S. cerevisiae CRD5HS contributed to ethanol titers as high as 85.95 and 94.76 g/liter from pretreated corn stover and corn cob, respectively, without detoxifying or washing pretreated biomass. Potential genetic targets obtained from adaptive laboratory evolution were further analyzed by sequencing the high-performance strains. The combined XI mining methods described here provide practical references for mining other scarce and valuable enzymes.
Collapse
Affiliation(s)
- Sitong Chen
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Zhaoxian Xu
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Boning Ding
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Yuwei Zhang
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Shuangmei Liu
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Chenggu Cai
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Muzi Li
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Bruce E. Dale
- Biomass Conversion Research Laboratory, Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI 48824, USA
- Great Lakes Bioenergy Research Centre (GLBRC), Michigan State University, East Lansing, MI, 48824 USA
| | - Mingjie Jin
- School of Environmental and Biological Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
- Biorefinery Research Institution, Nanjing University of Science and Technology, Nanjing 210094, China
| |
Collapse
|
147
|
Chen Y, Zhang S, Zhai Z, Zhang S, Ma J, Liang X, Li Q. Construction of Fusion Protein with Carbohydrate-Binding Module and Leaf-Branch Compost Cutinase to Enhance the Degradation Efficiency of Polyethylene Terephthalate. Int J Mol Sci 2023; 24:2780. [PMID: 36769118 PMCID: PMC9917269 DOI: 10.3390/ijms24032780] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 01/23/2023] [Accepted: 01/30/2023] [Indexed: 02/05/2023] Open
Abstract
Poly(ethylene terephthalate) (PET) is a manufactured plastic broadly available, whereas improper disposal of PET waste has become a serious burden on the environment. Leaf-branch compost cutinase (LCC) is one of the most powerful and promising PET hydrolases, and its mutant LCCICCG shows high catalytic activity and excellent thermal stability. However, low binding affinity with PET has been found to dramatically limit its further industrial application. Herein, TrCBM and CfCBM were rationally selected from the CAZy database to construct fusion proteins with LCCICCG, and mechanistic studies revealed that these two domains could bind with PET favorably via polar amino acids. The optimal temperatures of LCCICCG-TrCBM and CfCBM-LCCICCG were measured to be 70 and 80 °C, respectively. Moreover, these two fusion proteins exhibited favorable thermal stability, maintaining 53.1% and 48.8% of initial activity after the incubation at 90 °C for 300 min. Compared with LCCICCG, the binding affinity of LCCICCG-TrCBM and CfCBM-LCCICCG for PET has been improved by 1.4- and 1.3-fold, respectively, and meanwhile their degradation efficiency on PET films was enhanced by 3.7% and 24.2%. Overall, this study demonstrated that the strategy of constructing fusion proteins is practical and prospective to facilitate the enzymatic PET degradation ability.
Collapse
Affiliation(s)
| | | | | | | | | | - Xiao Liang
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China
| | - Quanshun Li
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China
| |
Collapse
|
148
|
Yeh AHW, Norn C, Kipnis Y, Tischer D, Pellock SJ, Evans D, Ma P, Lee GR, Zhang JZ, Anishchenko I, Coventry B, Cao L, Dauparas J, Halabiya S, DeWitt M, Carter L, Houk KN, Baker D. De novo design of luciferases using deep learning. Nature 2023; 614:774-780. [PMID: 36813896 PMCID: PMC9946828 DOI: 10.1038/s41586-023-05696-3] [Citation(s) in RCA: 119] [Impact Index Per Article: 119.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 01/03/2023] [Indexed: 02/24/2023]
Abstract
De novo enzyme design has sought to introduce active sites and substrate-binding pockets that are predicted to catalyse a reaction of interest into geometrically compatible native scaffolds1,2, but has been limited by a lack of suitable protein structures and the complexity of native protein sequence-structure relationships. Here we describe a deep-learning-based 'family-wide hallucination' approach that generates large numbers of idealized protein structures containing diverse pocket shapes and designed sequences that encode them. We use these scaffolds to design artificial luciferases that selectively catalyse the oxidative chemiluminescence of the synthetic luciferin substrates diphenylterazine3 and 2-deoxycoelenterazine. The designed active sites position an arginine guanidinium group adjacent to an anion that develops during the reaction in a binding pocket with high shape complementarity. For both luciferin substrates, we obtain designed luciferases with high selectivity; the most active of these is a small (13.9 kDa) and thermostable (with a melting temperature higher than 95 °C) enzyme that has a catalytic efficiency on diphenylterazine (kcat/Km = 106 M-1 s-1) comparable to that of native luciferases, but a much higher substrate specificity. The creation of highly active and specific biocatalysts from scratch with broad applications in biomedicine is a key milestone for computational enzyme design, and our approach should enable generation of a wide range of luciferases and other enzymes.
Collapse
Affiliation(s)
- Andy Hsien-Wei Yeh
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Department of Biomolecular Engineering, University of California, Santa Cruz, Santa Cruz, CA, USA.
| | - Christoffer Norn
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Yakov Kipnis
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Doug Tischer
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Samuel J Pellock
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Declan Evans
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
| | - Pengchen Ma
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
- School of Chemistry, Xi'an Key Laboratory of Sustainable Energy Materials Chemistry, MOE Key Laboratory for Nonequilibrium Synthesis and Modulation of Condensed Matter, Xi'an Jiaotong University, Xi'an, China
| | - Gyu Rie Lee
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Jason Z Zhang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Longxing Cao
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justas Dauparas
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Samer Halabiya
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Michelle DeWitt
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lauren Carter
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - K N Houk
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
149
|
Pearce R, Huang X, Omenn GS, Zhang Y. De novo protein fold design through sequence-independent fragment assembly simulations. Proc Natl Acad Sci U S A 2023; 120:e2208275120. [PMID: 36656852 PMCID: PMC9942881 DOI: 10.1073/pnas.2208275120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 12/22/2022] [Indexed: 01/20/2023] Open
Abstract
De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution.
Collapse
Affiliation(s)
- Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI48109
- Department of Human Genetics, University of Michigan, Ann Arbor, MI48109
- School of Public Health, University of Michigan, Ann Arbor, MI48109
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI48109
- Department of Computer Science, School of Computing, National University of Singapore117417, Singapore
- Cancer Science Institute of Singapore, National University of Singapore117599, Singapore
| |
Collapse
|
150
|
Song Z, Zhang Q, Wu W, Pu Z, Yu H. Rational design of enzyme activity and enantioselectivity. Front Bioeng Biotechnol 2023; 11:1129149. [PMID: 36761300 PMCID: PMC9902596 DOI: 10.3389/fbioe.2023.1129149] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 01/16/2023] [Indexed: 01/25/2023] Open
Abstract
The strategy of rational design to engineer enzymes is to predict the potential mutants based on the understanding of the relationships between protein structure and function, and subsequently introduce the mutations using the site-directed mutagenesis. Rational design methods are universal, relatively fast and have the potential to be developed into algorithms that can quantitatively predict the performance of the designed sequences. Compared to the protein stability, it was more challenging to design an enzyme with improved activity or selectivity, due to the complexity of enzyme molecular structure and inadequate understanding of the relationships between enzyme structures and functions. However, with the development of computational force, advanced algorithm and a deeper understanding of enzyme catalytic mechanisms, rational design could significantly simplify the process of engineering enzyme functions and the number of studies applying rational design strategy has been increasing. Here, we reviewed the recent advances of applying the rational design strategy to engineer enzyme functions including activity and enantioselectivity. Five strategies including multiple sequence alignment, strategy based on steric hindrance, strategy based on remodeling interaction network, strategy based on dynamics modification and computational protein design are discussed and the successful cases using these strategies are introduced.
Collapse
Affiliation(s)
- Zhongdi Song
- Key Laboratory of Pollution Exposure and Health Intervention of Zhejiang Province, Interdisciplinary Research Academy, Zhejiang Shuren University, Hangzhou, China
| | - Qunfeng Zhang
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang, China
| | - Wenhui Wu
- ZJU-Hangzhou Global Scientific and Technological Innovation Centre, Hangzhou, Zhejiang, China
| | - Zhongji Pu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Centre, Hangzhou, Zhejiang, China
| | - Haoran Yu
- Institute of Bioengineering, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, Zhejiang, China
- ZJU-Hangzhou Global Scientific and Technological Innovation Centre, Hangzhou, Zhejiang, China
| |
Collapse
|