1
|
Talluri S. Algorithms for protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Computational Protein Design has the potential to contribute to major advances in enzyme technology, vaccine design, receptor-ligand engineering, biomaterials, nanosensors, and synthetic biology. Although Protein Design is a challenging problem, proteins can be designed by experts in Protein Design, as well as by non-experts whose primary interests are in the applications of Protein Design. The increased accessibility of Protein Design technology is attributable to the accumulated knowledge and experience with Protein Design as well as to the availability of software and online resources. The objective of this review is to serve as a guide to the relevant literature with a focus on the novel methods and algorithms that have been developed or applied for Protein Design, and to assist in the selection of algorithms for Protein Design. Novel algorithms and models that have been introduced to utilize the enormous amount of experimental data and novel computational hardware have the potential for producing substantial increases in the accuracy, reliability and range of applications of designed proteins.
Collapse
Affiliation(s)
- Sekhar Talluri
- Department of Biotechnology, GITAM, Visakhapatnam, India.
| |
Collapse
|
2
|
ElGamacy M. Accelerating therapeutic protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:85-118. [PMID: 35534117 DOI: 10.1016/bs.apcsb.2022.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Protein structures provide for defined microenvironments that can support complex pharmacological functions, otherwise unachievable by small molecules. The advent of therapeutic proteins has thus greatly broadened the range of manageable disorders. Leveraging the knowledge and recent advances in de novo protein design methods has the prospect of revolutionizing how protein drugs are discovered and developed. This review lays out the main challenges facing therapeutic proteins discovery and development, and how present and future advancements of protein design can accelerate the protein drug pipelines.
Collapse
Affiliation(s)
- Mohammad ElGamacy
- University Hospital Tübingen, Division of Translational Oncology, Tübingen, Germany; Max Planck Institute for Biology, Tübingen, Germany.
| |
Collapse
|
3
|
Zhou J, Panaitiu AE, Grigoryan G. A general-purpose protein design framework based on mining sequence-structure relationships in known protein structures. Proc Natl Acad Sci U S A 2020; 117:1059-1068. [PMID: 31892539 PMCID: PMC6969538 DOI: 10.1073/pnas.1908723117] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework-one based on identifying and applying patterns of sequence-structure compatibility found in known proteins, rather than approximating them from models of interatomic interactions. We carry out extensive computational analyses and an experimental validation for our method. Our results strongly argue that the Protein Data Bank is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. Because our method is likely to have orthogonal strengths relative to existing techniques, it could represent an important step toward removing remaining barriers to robust CPD.
Collapse
Affiliation(s)
- Jianfu Zhou
- Department of Computer Science, Dartmouth College, Hanover, NH 03755
| | | | - Gevorg Grigoryan
- Department of Computer Science, Dartmouth College, Hanover, NH 03755;
- Department of Biological Sciences, Dartmouth College, Hanover, NH 03755
| |
Collapse
|
4
|
Joh NH, Grigoryan G, Wu Y, DeGrado WF. Design of self-assembling transmembrane helical bundles to elucidate principles required for membrane protein folding and ion transport. Philos Trans R Soc Lond B Biol Sci 2018. [PMID: 28630154 DOI: 10.1098/rstb.2016.0214] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Ion transporters and channels are able to identify and act on specific substrates among myriads of ions and molecules critical to cellular processes, such as homeostasis, cell signalling, nutrient influx and drug efflux. Recently, we designed Rocker, a minimalist model for Zn2+/H+ co-transport. The success of this effort suggests that de novo membrane protein design has now come of age so as to serve a key approach towards probing the determinants of membrane protein folding, assembly and function. Here, we review general principles that can be used to design membrane proteins, with particular reference to helical assemblies with transport function. We also provide new functional and NMR data that probe the dynamic mechanism of conduction through Rocker.This article is part of the themed issue 'Membrane pores: from structure and assembly, to medicine and technology'.
Collapse
Affiliation(s)
- Nathan H Joh
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Gevorg Grigoryan
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA.,Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, USA
| | - Yibing Wu
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - William F DeGrado
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| |
Collapse
|
5
|
Mustata GM, Kim YH, Zhang J, DeGrado WF, Grigoryan G, Wanunu M. Graphene Symmetry Amplified by Designed Peptide Self-Assembly. Biophys J 2017; 110:2507-2516. [PMID: 27276268 PMCID: PMC4906377 DOI: 10.1016/j.bpj.2016.04.037] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2016] [Revised: 04/03/2016] [Accepted: 04/08/2016] [Indexed: 11/28/2022] Open
Abstract
We present a strategy for designed self-assembly of peptides into two-dimensional monolayer crystals on the surface of graphene and graphite. As predicted by computation, designed peptides assemble on the surface of graphene to form very long, parallel, in-register β-sheets, which we call β-tapes. Peptides extend perpendicularly to the long axis of each β-tape, defining its width, with hydrogen bonds running along the axis. Tapes align on the surface to create highly regular microdomains containing 4-nm pitch striations. Moreover, in agreement with calculations, the atomic structure of the underlying graphene dictates the arrangement of the β-tapes, as they orient along one of six directions defined by graphene’s sixfold symmetry. A cationic-assembled peptide surface is shown here to strongly adhere to DNA, preferentially orienting the double helix along β-tape axes. This orientational preference is well anticipated from calculations, given the underlying peptide layer structure. These studies illustrate how designed peptides can amplify the Ångstrom-level atomic symmetry of a surface onto the micrometer scale, further imparting long-range directional order onto the next level of assembly. The remarkably stable nature of these assemblies under various environmental conditions suggests applications in enzymelike catalysis, biological interfaces for cellular recognition, and two-dimensional platforms for studying DNA-peptide interactions.
Collapse
Affiliation(s)
| | - Yong Ho Kim
- SKKU Advanced Institute of Nanotechnology and Department of Chemistry, Sungkyunkwan University, Seoul, Korea; Center for Neuroscience Imaging Research, Institute for Basic Science(IBS), Suwon, Korea.
| | - Jian Zhang
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire
| | - William F DeGrado
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco
| | - Gevorg Grigoryan
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire; Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire.
| | - Meni Wanunu
- Department of Physics, Northeastern University, Boston, Massachusetts.
| |
Collapse
|
6
|
Toward high-resolution computational design of the structure and function of helical membrane proteins. Nat Struct Mol Biol 2017; 23:475-80. [PMID: 27273630 DOI: 10.1038/nsmb.3231] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 04/20/2016] [Indexed: 02/07/2023]
Abstract
The computational design of α-helical membrane proteins is still in its infancy but has already made great progress. De novo design allows stable, specific and active minimal oligomeric systems to be obtained. Computational reengineering can improve the stability and function of naturally occurring membrane proteins. Currently, the major hurdle for the field is the experimental characterization of the designs. The emergence of new structural methods for membrane proteins will accelerate progress.
Collapse
|
7
|
Sequence statistics of tertiary structural motifs reflect protein stability. PLoS One 2017; 12:e0178272. [PMID: 28552940 PMCID: PMC5446159 DOI: 10.1371/journal.pone.0178272] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2017] [Accepted: 05/10/2017] [Indexed: 11/19/2022] Open
Abstract
The Protein Data Bank (PDB) has been a key resource for learning general rules of sequence-structure relationships in proteins. Quantitative insights have been gained by defining geometric descriptors of structure (e.g., distances, dihedral angles, solvent exposure, etc.) and observing their distributions and sequence preferences. Here we argue that as the PDB continues to grow, it may become unnecessary to reduce structure into a set of elementary descriptors. Instead, it could be possible to deduce quantitative sequence-structure relationships in the context of precisely-defined complex structural motifs by mining the PDB for closely matching backbone geometries. To validate this idea, we turned to the the task of predicting changes in protein stability upon amino-acid substitution—a difficult problem of broad significance. We defined non-contiguous tertiary motifs (TERMs) around a protein site of interest and extracted sequence preferences from ensembles of closely-matching substructures in the PDB to predict mutational stability changes at the site, ΔΔGm. We demonstrate that these ensemble statistics predict ΔΔGm on par with state-of-the-art statistical and machine-learning methods on large thermodynamic datasets, and outperform these, along with a leading structure-based modeling approach, when tested in the context of unbiased diverse mutations. Further, we show that the performance of the TERM-based method is directly related to the amount of available relevant structural data, automatically improving with the growing PDB. This enables a means of estimating prediction accuracy. Our results clearly demonstrate that: 1) statistics of non-contiguous structural motifs in the PDB encode fundamental sequence-structure relationships related to protein thermodynamic stability, and 2) the PDB is now large enough that such statistics are already useful in practice, with their accuracy expected to continue increasing as the database grows. These observations suggest new ways of using structural data towards addressing problems of computational structural biology.
Collapse
|
8
|
Abstract
Computational protein design (CPD), a yet evolving field, includes computer-aided engineering for partial or full de novo designs of proteins of interest. Designs are defined by a requested structure, function, or working environment. This chapter describes the birth and maturation of the field by presenting 101 CPD examples in a chronological order emphasizing achievements and pending challenges. Integrating these aspects presents the plethora of CPD approaches with the hope of providing a "CPD 101". These reflect on the broader structural bioinformatics and computational biophysics field and include: (1) integration of knowledge-based and energy-based methods, (2) hierarchical designated approach towards local, regional, and global motifs and the integration of high- and low-resolution design schemes that fit each such region, (3) systematic differential approaches towards different protein regions, (4) identification of key hot-spot residues and the relative effect of remote regions, (5) assessment of shape-complementarity, electrostatics and solvation effects, (6) integration of thermal plasticity and functional dynamics, (7) negative design, (8) systematic integration of experimental approaches, (9) objective cross-assessment of methods, and (10) successful ranking of potential designs. Future challenges also include dissemination of CPD software to the general use of life-sciences researchers and the emphasis of success within an in vivo milieu. CPD increases our understanding of protein structure and function and the relationships between the two along with the application of such know-how for the benefit of mankind. Applied aspects range from biological drugs, via healthier and tastier food products to nanotechnology and environmentally friendly enzymes replacing toxic chemicals utilized in the industry.
Collapse
|
9
|
Gainza P, Nisonoff HM, Donald BR. Algorithms for protein design. Curr Opin Struct Biol 2016; 39:16-26. [PMID: 27086078 PMCID: PMC5065368 DOI: 10.1016/j.sbi.2016.03.006] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 03/15/2016] [Accepted: 03/22/2016] [Indexed: 02/05/2023]
Abstract
Computational structure-based protein design programs are becoming an increasingly important tool in molecular biology. These programs compute protein sequences that are predicted to fold to a target structure and perform a desired function. The success of a program's predictions largely relies on two components: first, the input biophysical model, and second, the algorithm that computes the best sequence(s) and structure(s) according to the biophysical model. Improving both the model and the algorithm in tandem is essential to improving the success rate of current programs, and here we review recent developments in algorithms for protein design, emphasizing how novel algorithms enable the use of more accurate biophysical models. We conclude with a list of algorithmic challenges in computational protein design that we believe will be especially important for the design of therapeutic proteins and protein assemblies.
Collapse
Affiliation(s)
- Pablo Gainza
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Hunter M Nisonoff
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Bruce R Donald
- Department of Computer Science, Duke University, Durham, NC, United States; Department of Biochemistry, Duke University Medical Center, Durham, NC, United States; Department of Chemistry, Duke University, Durham, NC, United States.
| |
Collapse
|
10
|
Prediction of Stable Globular Proteins Using Negative Design with Non-native Backbone Ensembles. Structure 2015; 23:2011-21. [DOI: 10.1016/j.str.2015.07.021] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2015] [Revised: 07/26/2015] [Accepted: 07/29/2015] [Indexed: 11/21/2022]
|
11
|
Joh NH, Wang T, Bhate MP, Acharya R, Wu Y, Grabe M, Hong M, Grigoryan G, DeGrado WF. De novo design of a transmembrane Zn²⁺-transporting four-helix bundle. Science 2015; 346:1520-4. [PMID: 25525248 DOI: 10.1126/science.1261172] [Citation(s) in RCA: 226] [Impact Index Per Article: 25.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The design of functional membrane proteins from first principles represents a grand challenge in chemistry and structural biology. Here, we report the design of a membrane-spanning, four-helical bundle that transports first-row transition metal ions Zn(2+) and Co(2+), but not Ca(2+), across membranes. The conduction path was designed to contain two di-metal binding sites that bind with negative cooperativity. X-ray crystallography and solid-state and solution nuclear magnetic resonance indicate that the overall helical bundle is formed from two tightly interacting pairs of helices, which form individual domains that interact weakly along a more dynamic interface. Vesicle flux experiments show that as Zn(2+) ions diffuse down their concentration gradients, protons are antiported. These experiments illustrate the feasibility of designing membrane proteins with predefined structural and dynamic properties.
Collapse
Affiliation(s)
- Nathan H Joh
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Tuo Wang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Manasi P Bhate
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Rudresh Acharya
- School of Biological Sciences, National Institute of Science Education and Research, Bhubaneswar, Odisha, India
| | - Yibing Wu
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Michael Grabe
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA.
| | - Mei Hong
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Gevorg Grigoryan
- Department of Computer Science and Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, USA.
| | - William F DeGrado
- Department of Pharmaceutical Chemistry, Cardiovascular Research Institute, University of California, San Francisco, San Francisco, CA 94158, USA.
| |
Collapse
|