1
|
Kim DE, Watson JL, Juergens D, Majumder S, Gerben SR, Kang A, Bera AK, Li X, Baker D. Parametrically guided design of beta barrels and transmembrane nanopores using deep learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.22.604663. [PMID: 39091726 PMCID: PMC11291061 DOI: 10.1101/2024.07.22.604663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Francis Crick's global parameterization of coiled coil geometry has been widely useful for guiding design of new protein structures and functions. However, design guided by similar global parameterization of beta barrel structures has been less successful, likely due to the deviations required from ideal beta barrel geometry to maintain extensive inter-strand hydrogen bonding without introducing considerable backbone strain. Instead, beta barrels and other protein folds have been designed guided by 2D structural blueprints; while this approach has successfully generated new fluorescent proteins, transmembrane nanopores, and other structures, it requires considerable expert knowledge and provides only indirect control over the global barrel shape. Here we show that the simplicity and control over shape and structure provided by global parametric representations can be generalized beyond coiled coils by taking advantage of the rich sequence-structure relationships implicit in RoseTTAFold based inpainting and diffusion design methods. Starting from parametrically generated idealized barrel backbones, both RFjoint inpainting and RFdiffusion readily incorporate the backbone irregularities necessary for proper folding with minimal deviation from the idealized barrel geometries. We show that for beta barrels across a broad range of global beta sheet parameterizations, these methods achieve high in silico and experimental success rates, with atomic accuracy confirmed by an X-ray crystal structure of a novel beta barrel topology, and de novo designed 12, 14, and 16 stranded transmembrane nanopores with conductances ranging from 200 to 500 pS. By combining the simplicity and control of parametric generation with the high success rates of deep learning based protein design methods, our approach makes the design of proteins where global shape confers function, such as beta barrel nanopores, more precisely specifiable and accessible.
Collapse
Affiliation(s)
- David E. Kim
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- HHMI, University of Washington, Seattle, WA 98195
| | - Joseph L. Watson
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- HHMI, University of Washington, Seattle, WA 98195
| | - David Juergens
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- HHMI, University of Washington, Seattle, WA 98195
| | - Sagardip Majumder
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- HHMI, University of Washington, Seattle, WA 98195
| | - Stacey R. Gerben
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Alex Kang
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Asim K. Bera
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- HHMI, University of Washington, Seattle, WA 98195
| |
Collapse
|
2
|
Naudé M, Faller P, Lebrun V. A Closer Look at Type I Left-Handed β-Helices Provides a Better Understanding in Their Sequence-Structure Relationship: Toward Their Rational Design. Proteins 2024. [PMID: 38980225 DOI: 10.1002/prot.26726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/17/2024] [Accepted: 06/20/2024] [Indexed: 07/10/2024]
Abstract
Understanding the sequence-structure relationship in protein is of fundamental interest, but has practical applications such as the rational design of peptides and proteins. This relationship in the Type I left-handed β-helix containing proteins is updated and revisited in this study. Analyzing the available experimental structures in the Protein Data Bank, we could describe, further in detail, the structural features that are important for the stability of this fold, as well as its nucleation and termination. This study is meant to complete previous work, as it provides a separate analysis of the N-terminal and C-terminal rungs of the helix. Particular sequence motifs of these rungs are described along with the structural element they form.
Collapse
Affiliation(s)
- Maxime Naudé
- Institute of Chemistry of Strasbourg (UMR 7177), University of Strasbourg-CNRS, Strasbourg, France
| | - Peter Faller
- Institute of Chemistry of Strasbourg (UMR 7177), University of Strasbourg-CNRS, Strasbourg, France
| | - Vincent Lebrun
- Institute of Chemistry of Strasbourg (UMR 7177), University of Strasbourg-CNRS, Strasbourg, France
| |
Collapse
|
3
|
Mesdaghi S, Price RM, Madine J, Rigden DJ. Deep Learning-based structure modelling illuminates structure and function in uncharted regions of β-solenoid fold space. J Struct Biol 2023; 215:108010. [PMID: 37544372 DOI: 10.1016/j.jsb.2023.108010] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/19/2023] [Accepted: 08/03/2023] [Indexed: 08/08/2023]
Abstract
Repeat proteins are common in all domains of life and exhibit a wide range of functions. One class of repeat protein contains solenoid folds where the repeating unit consists of β-strands separated by tight turns. β-solenoids have distinguishing structural features such as handedness, twist, oligomerisation state, coil shape and size which give rise to their diversity. Characterised β-solenoid repeat proteins are known to form regions in bacterial and viral virulence factors, antifreeze proteins and functional amyloids. For many of these proteins, the experimental structure has not been solved, as they are difficult to crystallise or model. Here we use various deep learning-based structure-modelling methods to discover novel predicted β-solenoids, perform structural database searches to mine further structural neighbours and relate their predicted structure to possible functions. We find both eukaryotic and prokaryotic adhesins, confirming a known functional linkage between adhesin function and the β-solenoid fold. We further identify exceptionally long, flat β-solenoid folds as possible structures of mucin tandem repeat regions and unprecedentedly small β-solenoid structures. Additionally, we characterise a novel β-solenoid coil shape, the FapC Greek key β-solenoid as well as plausible complexes between it and other proteins involved in Pseudomonas functional amyloid fibres.
Collapse
Affiliation(s)
- Shahram Mesdaghi
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom; Computational Biology Facility, MerseyBio, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Rebecca M Price
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Jillian Madine
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| | - Daniel J Rigden
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| |
Collapse
|
4
|
Bonadio A, Wenig BL, Hockla A, Radisky ES, Shifman JM. Designed Loop Extension Followed by Combinatorial Screening Confers High Specificity to a Broad Matrix MetalloproteinaseInhibitor. J Mol Biol 2023; 435:168095. [PMID: 37068580 PMCID: PMC10312305 DOI: 10.1016/j.jmb.2023.168095] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/03/2023] [Accepted: 04/10/2023] [Indexed: 04/19/2023]
Abstract
Matrix metalloproteinases (MMPs) are key drivers of various diseases, including cancer. Development of probes and drugs capable of selectively inhibiting the individual members of the large MMP family remains a persistent challenge. The inhibitory N-terminal domain of tissue inhibitor of metalloproteinases-2 (N-TIMP2), a natural broad MMP inhibitor, can provide a scaffold for protein engineering to create more selective MMP inhibitors. Here, we pursued a unique approach harnessing both computational design and combinatorial screening to confer high binding specificity toward a target MMP in preference to an anti-target MMP. We designed a loop extension of N-TIMP2 to allow new interactions with the non-conserved MMP surface and generated an efficient focused library for yeast surface display, which was then screened for high binding to the target MMP-14 and low binding to anti-target MMP-3. Deep sequencing analysis identified the most promising variants, which were expressed, purified, and tested for selectivity of inhibition. Our best N-TIMP2 variant exhibited 29 pM binding affinity to MMP-14 and 2.4 µM affinity to MMP-3, revealing 7500-fold greater specificity than WT N-TIMP2. High-confidence structural models were obtained by including NGS data in the AlphaFold multiple sequence alignment. The modeling together with experimental mutagenesis validated our design predictions, demonstrating that the loop extension packs tightly against non-conserved residues on MMP-14 and clashes with MMP-3. This study demonstrates how introduction of loop extensions in a manner guided by target protein conservation data and loop design can offer an attractive strategy to achieve specificity in design of protein ligands.
Collapse
Affiliation(s)
- Alessandro Bonadio
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Israel
| | - Bernhard L Wenig
- Department of Cancer Biology, Mayo Clinic Comprehensive Cancer Center, Jacksonville, Florida, USA; Paracelsus Medical University, Salzburg, Austria
| | - Alexandra Hockla
- Department of Cancer Biology, Mayo Clinic Comprehensive Cancer Center, Jacksonville, Florida, USA
| | - Evette S Radisky
- Department of Cancer Biology, Mayo Clinic Comprehensive Cancer Center, Jacksonville, Florida, USA.
| | - Julia M Shifman
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Israel.
| |
Collapse
|
5
|
Talluri S. Algorithms for protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Computational Protein Design has the potential to contribute to major advances in enzyme technology, vaccine design, receptor-ligand engineering, biomaterials, nanosensors, and synthetic biology. Although Protein Design is a challenging problem, proteins can be designed by experts in Protein Design, as well as by non-experts whose primary interests are in the applications of Protein Design. The increased accessibility of Protein Design technology is attributable to the accumulated knowledge and experience with Protein Design as well as to the availability of software and online resources. The objective of this review is to serve as a guide to the relevant literature with a focus on the novel methods and algorithms that have been developed or applied for Protein Design, and to assist in the selection of algorithms for Protein Design. Novel algorithms and models that have been introduced to utilize the enormous amount of experimental data and novel computational hardware have the potential for producing substantial increases in the accuracy, reliability and range of applications of designed proteins.
Collapse
Affiliation(s)
- Sekhar Talluri
- Department of Biotechnology, GITAM, Visakhapatnam, India.
| |
Collapse
|
6
|
A backbone-centred energy function of neural networks for protein design. Nature 2022; 602:523-528. [PMID: 35140398 DOI: 10.1038/s41586-021-04383-5] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 12/23/2021] [Indexed: 12/29/2022]
Abstract
A protein backbone structure is designable if a substantial number of amino acid sequences exist that autonomously fold into it1,2. It has been suggested that the designability of backbones is governed mainly by side chain-independent or side chain type-insensitive molecular interactions3-5, indicating an approach for designing new backbones (ready for amino acid selection) based on continuous sampling and optimization of the backbone-centred energy surface. However, a sufficiently comprehensive and precise energy function has yet to be established for this purpose. Here we show that this goal is met by a statistical model named SCUBA (for Side Chain-Unknown Backbone Arrangement) that uses neural network-form energy terms. These terms are learned with a two-step approach that comprises kernel density estimation followed by neural network training and can analytically represent multidimensional, high-order correlations in known protein structures. We report the crystal structures of nine de novo proteins whose backbones were designed to high precision using SCUBA, four of which have novel, non-natural overall architectures. By eschewing use of fragments from existing protein structures, SCUBA-driven structure design facilitates far-reaching exploration of the designable backbone space, thus extending the novelty and diversity of the proteins amenable to de novo design.
Collapse
|
7
|
Pereira JM, Vieira M, Santos SM. Step-by-step design of proteins for small molecule interaction: A review on recent milestones. Protein Sci 2021; 30:1502-1520. [PMID: 33934427 PMCID: PMC8284594 DOI: 10.1002/pro.4098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 04/21/2021] [Accepted: 04/23/2021] [Indexed: 01/01/2023]
Abstract
Protein design is the field of synthetic biology that aims at developing de novo custom-made proteins and peptides for specific applications. Despite exploring an ambitious goal, recent computational advances in both hardware and software technologies have paved the way to high-throughput screening and detailed design of novel folds and improved functionalities. Modern advances in the field of protein design for small molecule targeting are described in this review, organized in a step-by-step fashion: from the conception of a new or upgraded active binding site, to scaffold design, sequence optimization, and experimental expression of the custom protein. In each step, contemporary examples are described, and state-of-the-art software is briefly explored.
Collapse
Affiliation(s)
- José M. Pereira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Maria Vieira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Sérgio M. Santos
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| |
Collapse
|
8
|
Abstract
The field of de novo protein design has met with considerable success over the past few decades. Heme, a cofactor, has often been introduced to impart a diverse array of functions to a protein, ranging from electron transport to respiration. In nature, heme is found to occur predominantly in α-helical structures over β-sheets, which has resulted in significant designs of heme proteins utilizing coiled-coil helices. By contrast, there are only a few known β-sheet proteins that bind heme and designs of β-sheets frequently result in amyloid-like aggregates. This review reflects on our success in designing a series of multistranded β-sheet heme binding peptides that are well folded in both aqueous and membrane-like environments. Initially, we designed a β-hairpin peptide that self-assembles to bind heme and performs peroxidase activity in membrane. The β-hairpin was optimized further to accommodate a heme binding pocket within multistranded β-sheets for catalysis and electron transfer in membranes. Furthermore, we de novo designed and characterized β-sheet peptides and miniproteins that are soluble in an aqueous environment capable of binding single and multiple hemes with high affinity and stability. Collectively, these studies highlight the substantial progress made toward the design of functional β-sheets.
Collapse
Affiliation(s)
- Areetha D'Souza
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551
| | - Surajit Bhattacharjya
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551
| |
Collapse
|
9
|
Perez-Riba A, Komives E, Main ERG, Itzhaki LS. Decoupling a tandem-repeat protein: Impact of multiple loop insertions on a modular scaffold. Sci Rep 2019; 9:15439. [PMID: 31659184 PMCID: PMC6817815 DOI: 10.1038/s41598-019-49905-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2019] [Accepted: 08/29/2019] [Indexed: 11/25/2022] Open
Abstract
The simple topology and modular architecture of tandem-repeat proteins such as tetratricopeptide repeats (TPRs) and ankyrin repeats makes them straightforward to dissect and redesign. Repeat-protein stability can be manipulated in a predictable way using site-specific mutations. Here we explore a different type of modification - loop insertion - that will enable a simple route to functionalisation of this versatile scaffold. We previously showed that a single loop insertion has a dramatically different effect on stability depending on its location in the repeat array. Here we dissect this effect by a combination of multiple and alternated loop insertions to understand the origins of the context-dependent loss in stability. We find that the scaffold is remarkably robust in that its overall structure is maintained. However, adjacent repeats are now only weakly coupled, and consequently the increase in solvent protection, and thus stability, with increasing repeat number that defines the tandem-repeat protein class is lost. Our results also provide us with a rulebook with which we can apply these principles to the design of artificial repeat proteins with precisely tuned folding landscapes and functional capabilities, thereby paving the way for their exploitation as a versatile and truly modular platform in synthetic biology.
Collapse
Affiliation(s)
- Albert Perez-Riba
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1PD, UK
- Donnelly Centre for Cellular & Biomolecular Research, University of Toronto, Toronto, Canada
| | - Elizabeth Komives
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0378, USA
| | - Ewan R G Main
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge, CB2 1PD, UK.
| |
Collapse
|
10
|
Grayson KJ, Anderson JLR. Designed for life: biocompatible de novo designed proteins and components. J R Soc Interface 2019; 15:rsif.2018.0472. [PMID: 30158186 PMCID: PMC6127164 DOI: 10.1098/rsif.2018.0472] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2018] [Accepted: 08/01/2018] [Indexed: 12/30/2022] Open
Abstract
A principal goal of synthetic biology is the de novo design or redesign of biomolecular components. In addition to revealing fundamentally important information regarding natural biomolecular engineering and biochemistry, functional building blocks will ultimately be provided for applications including the manufacture of valuable products and therapeutics. To fully realize this ambitious goal, the designed components must be biocompatible, working in concert with natural biochemical processes and pathways, while not adversely affecting cellular function. For example, de novo protein design has provided us with a wide repertoire of structures and functions, including those that can be assembled and function in vivo. Here we discuss such biocompatible designs, as well as others that have the potential to become biocompatible, including non-protein molecules, and routes to achieving full biological integration.
Collapse
Affiliation(s)
- Katie J Grayson
- School of Biochemistry, University of Bristol, Biomedical Sciences Building, Bristol BS8 1TD, UK
| | - J L Ross Anderson
- School of Biochemistry, University of Bristol, Biomedical Sciences Building, Bristol BS8 1TD, UK .,BrisSynBio Synthetic Biology Research Centre, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK
| |
Collapse
|
11
|
Perez-Riba A, Synakewicz M, Itzhaki LS. Folding cooperativity and allosteric function in the tandem-repeat protein class. Philos Trans R Soc Lond B Biol Sci 2019; 373:rstb.2017.0188. [PMID: 29735741 DOI: 10.1098/rstb.2017.0188] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/17/2018] [Indexed: 01/08/2023] Open
Abstract
The term allostery was originally developed to describe structural changes in one binding site induced by the interaction of a partner molecule with a distant binding site, and it has been studied in depth in the field of enzymology. Here, we discuss the concept of action at a distance in relation to the folding and function of the solenoid class of tandem-repeat proteins such as tetratricopeptide repeats (TPRs) and ankyrin repeats. Distantly located repeats fold cooperatively, even though only nearest-neighbour interactions exist in these proteins. A number of repeat-protein scaffolds have been reported to display allosteric effects, transferred through the repeat array, that enable them to direct the activity of the multi-subunit enzymes within which they reside. We also highlight a recently identified group of tandem-repeat proteins, the RRPNN subclass of TPRs, recent crystal structures of which indicate that they function as allosteric switches to modulate multiple bacterial quorum-sensing mechanisms. We believe that the folding cooperativity of tandem-repeat proteins and the biophysical mechanisms that transform them into allosteric switches are intimately intertwined. This opinion piece aims to combine our understanding of the two areas and develop ideas on their common underlying principles.This article is part of a discussion meeting issue 'Allostery and molecular machines'.
Collapse
Affiliation(s)
- Albert Perez-Riba
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Marie Synakewicz
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| |
Collapse
|
12
|
Kundert K, Kortemme T. Computational design of structured loops for new protein functions. Biol Chem 2019; 400:275-288. [PMID: 30676995 PMCID: PMC6530579 DOI: 10.1515/hsz-2018-0348] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2018] [Accepted: 12/18/2018] [Indexed: 12/20/2022]
Abstract
The ability to engineer the precise geometries, fine-tuned energetics and subtle dynamics that are characteristic of functional proteins is a major unsolved challenge in the field of computational protein design. In natural proteins, functional sites exhibiting these properties often feature structured loops. However, unlike the elements of secondary structures that comprise idealized protein folds, structured loops have been difficult to design computationally. Addressing this shortcoming in a general way is a necessary first step towards the routine design of protein function. In this perspective, we will describe the progress that has been made on this problem and discuss how recent advances in the field of loop structure prediction can be harnessed and applied to the inverse problem of computational loop design.
Collapse
Affiliation(s)
- Kale Kundert
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, CA 94158, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, 499 Illinois St, San Francisco, CA 94158, USA
| |
Collapse
|
13
|
Marcos E, Chidyausiku TM, McShan AC, Evangelidis T, Nerli S, Carter L, Nivón LG, Davis A, Oberdorfer G, Tripsianes K, Sgourakis NG, Baker D. De novo design of a non-local β-sheet protein with high stability and accuracy. Nat Struct Mol Biol 2018; 25:1028-1034. [PMID: 30374087 PMCID: PMC6219906 DOI: 10.1038/s41594-018-0141-6] [Citation(s) in RCA: 77] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 09/11/2018] [Indexed: 11/08/2022]
Abstract
β-sheet proteins carry out critical functions in biology, and hence are attractive scaffolds for computational protein design. Despite this potential, de novo design of all-β-sheet proteins from first principles lags far behind the design of all-α or mixed-αβ domains owing to their non-local nature and the tendency of exposed β-strand edges to aggregate. Through study of loops connecting unpaired β-strands (β-arches), we have identified a series of structural relationships between loop geometry, side chain directionality and β-strand length that arise from hydrogen bonding and packing constraints on regular β-sheet structures. We use these rules to de novo design jellyroll structures with double-stranded β-helices formed by eight antiparallel β-strands. The nuclear magnetic resonance structure of a hyperthermostable design closely matched the computational model, demonstrating accurate control over the β-sheet structure and loop geometry. Our results open the door to the design of a broad range of non-local β-sheet protein structures.
Collapse
Affiliation(s)
- Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona Institute of Science and Technology, Barcelona, Spain.
| | - Tamuka M Chidyausiku
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Andrew C McShan
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Thomas Evangelidis
- CEITEC-Central European Institute of Technology, Masaryk University, Brno, Czech Republic
| | - Santrupti Nerli
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, Santa Cruz, CA, USA
- Department of Computer Science, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Lauren Carter
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lucas G Nivón
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Audrey Davis
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Amazon, Seattle, WA, USA
| | - Gustav Oberdorfer
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Institute of Biochemistry, Graz University of Technology, Graz, Austria
| | | | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
| |
Collapse
|
14
|
Baarda RA, Marianchuk TL, Toney MD, Cox DL. In silico stress-strain measurements on self-assembled protein lattices. SOFT MATTER 2018; 14:8095-8104. [PMID: 30159554 DOI: 10.1039/c8sm00412a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Due to their large mechanical strength and potential for functionalization, beta-solenoid proteins show promise as building blocks in biomaterials applications such as two- and three-dimensional scaffolds. We have designed simulation models of two-dimensional square and honeycomb protein lattices by covalently linking a beta-solenoid protein, the spruce budworm antifreeze protein (SBAFP), to symmetric protein multimers. Periodic boundary conditions applied to the simulation cell allow for the simulation of an infinite lattice. We use molecular dynamics to strain the lattice by deforming the simulation cell and measuring the resulting stress tensor. We evaluate the linear portion of stress-strain curves to extract the corresponding bulk and shear elastic moduli. When strained at a rate of 0.3 nm ps-1, the lattices yield a bulk modulus of approximately 3 GPa. This large elastic modulus demonstrates that 2-dimensional structures designed from beta-solenoid proteins can be expected to retain the exceptional material strength of their building blocks.
Collapse
Affiliation(s)
- Rachel A Baarda
- Department of Physics, University of California, Davis, California, USA.
| | | | - Michael D Toney
- Department of Chemistry, University of California, Davis, California, USA
| | - Daniel L Cox
- Department of Physics, University of California, Davis, California, USA.
| |
Collapse
|
15
|
Grayson KJ, Anderson JR. The ascent of man(made oxidoreductases). Curr Opin Struct Biol 2018; 51:149-155. [PMID: 29754103 PMCID: PMC6227378 DOI: 10.1016/j.sbi.2018.04.008] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Accepted: 04/24/2018] [Indexed: 11/09/2022]
Abstract
Though established 40 years ago, the field of de novo protein design has recently come of age, with new designs exhibiting an unprecedented level of sophistication in structure and function. With respect to catalysis, de novo enzymes promise to revolutionise the industrial production of useful chemicals and materials, while providing new biomolecules as plug-and-play components in the metabolic pathways of living cells. To this end, there are now de novo metalloenzymes that are assembled in vivo, including the recently reported C45 maquette, which can catalyse a variety of substrate oxidations with efficiencies rivalling those of closely related natural enzymes. Here we explore the successful design of this de novo enzyme, which was designed to minimise the undesirable complexity of natural proteins using a minimalistic bottom-up approach.
Collapse
Affiliation(s)
- Katie J Grayson
- School of Biochemistry, Biomedical Sciences Building, University of Bristol, BS8 1TD, UK
| | - Jl Ross Anderson
- School of Biochemistry, Biomedical Sciences Building, University of Bristol, BS8 1TD, UK; BrisSynBio Synthetic Biology Research Centre, Life Sciences Building, University of Bristol, Tyndall Avenue, Bristol BS8 1TQ, UK.
| |
Collapse
|
16
|
Hexapeptide Tandem Repeats Dictate the Formation of Silkmoth Chorion, a Natural Protective Amyloid. J Mol Biol 2018; 430:3774-3783. [PMID: 29964045 DOI: 10.1016/j.jmb.2018.06.042] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 06/18/2018] [Accepted: 06/22/2018] [Indexed: 12/29/2022]
Abstract
Silkmoth chorion is a fibrous structure composed mainly of two major protein classes, families A and B. Both families of silkmoth chorion proteins present a highly conserved, in sequence and in length, central domain, consisting of Gly-rich tandem hexapeptide repetitive segments, flanked by two more variable N-terminal and C-terminal arms. Primary studies identified silkmoth chorion as a functional protective amyloid by unveiling the amyloidogenic properties of the central domain of both protein families. In this work, we attempt to detect the principal source of amyloidogenicity of the central domain by focusing on the role of the tandem hexapeptide sequence repeats. Concurrently, we discuss a possible mechanism for the self-assembly of class A protofilaments, suggesting that the aggregation-prone hexapeptide building blocks may fold into a triangle-shaped β-helical structure.
Collapse
|
17
|
Chu H, Liu H. TetraBASE: A Side Chain-Independent Statistical Energy for Designing Realistically Packed Protein Backbones. J Chem Inf Model 2018; 58:430-442. [PMID: 29314837 DOI: 10.1021/acs.jcim.7b00677] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To construct backbone structures of high designability is a primary aspect of computational protein design. We report here a side chain-independent statistical energy that aims at realistic modeling of through-space packing of polypeptide backbones. To mitigate the lack of explicit amino acid side chains, the model treats the interbackbone site packing as being dependent on peptide local conformation. In addition, new variables suitable for statistical analysis, one for relative orientation and another for distance, have been introduced to represent the intersite geometry based on the asymmetrical tetrahedron organization of distinct chemical groups surrounding the Cα-carbon atoms. The resulting tetrahedron-based backbone statistical energy (tetraBASE) model has been used to optimize the tertiary organizations of secondary structure elements (SSEs) of designated types with Monte Caro simulated annealing, starting from artificial initial configurations. The tetraBASE minimum energy structures can reproduce SSE packing frequently observed in native proteins with atomic root-mean-square deviations of 1-2 Å. The model has also been tested by examining the stability of native SSE arrangements under tetraBASE. The results suggest that tetraBASE model can be used to effectively represent interbackbone packing when designing backbone structures without explicitly knowing side chain types.
Collapse
Affiliation(s)
- Huanyu Chu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China
| | - Haiyan Liu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China.,Collaborative Innovation Center of Chemistry for Life Sciences , 230027 Hefei, Anhui China
| |
Collapse
|
18
|
Su T, Su J, Liu S, Zhang C, He J, Huang Y, Xu S, Gu L. Structural and Biochemical Characterization of BdsA from Bacillus subtilis WU-S2B, a Key Enzyme in the "4S" Desulfurization Pathway. Front Microbiol 2018; 9:231. [PMID: 29497411 PMCID: PMC5819316 DOI: 10.3389/fmicb.2018.00231] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Accepted: 01/30/2018] [Indexed: 11/13/2022] Open
Abstract
Dibenzothiophene (DBT) and their derivatives, accounting for the major part of the sulfur components in crude oil, make one of the most significant pollution sources. The DBT sulfone monooxygenase BdsA, one of the key enzymes in the “4S” desulfurization pathway, catalyzes the oxidation of DBT sulfone to 2′-hydroxybiphenyl 2-sulfonic acid (HBPSi). Here, we determined the crystal structure of BdsA from Bacillus subtilis WU-S2B, at the resolution of 2.2 Å, and the structure of the BdsA-FMN complex at 2.4 Å. BdsA and the BdsA-FMN complex exist as tetramers. DBT sulfone was placed into the active site by molecular docking. Seven residues (Phe12, His20, Phe56, Phe246, Val248, His316, and Val372) are found to be involved in the binding of DBT sulfone. The importance of these residues is supported by the study of the catalytic activity of the active site variants. Structural analysis and enzyme activity assay confirmed the importance of the right position and orientation of FMN and DBT sulfone, as well as the involvement of Ser139 as a nucleophile in catalysis. This work combined with our previous structure of DszC provides a systematic structural basis for the development of engineered desulfurization enzymes with higher efficiency and stability.
Collapse
Affiliation(s)
- Tiantian Su
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Jing Su
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China.,Faculty of Light Industry, Province Key Laboratory of Microbial Engineering, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
| | - Shiheng Liu
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Conggang Zhang
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Jing He
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Yan Huang
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Sujuan Xu
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| | - Lichuan Gu
- State Key Laboratory of Microbial Technology, School of Life Sciences, Shandong University, Jinan, China
| |
Collapse
|
19
|
Roterman I, Banach M, Konieczny L. Propagation of Fibrillar Structural Forms in Proteins Stopped by Naturally Occurring Short Polypeptide Chain Fragments. Pharmaceuticals (Basel) 2017; 10:E89. [PMID: 29144442 PMCID: PMC5748646 DOI: 10.3390/ph10040089] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Revised: 11/02/2017] [Accepted: 11/13/2017] [Indexed: 11/17/2022] Open
Abstract
Amyloids characterized by unbounded growth of fibrillar structures cause many pathological processes. Such unbounded propagation is due to the presence of a propagating hydrophobicity field around the fibril's main axis, preventing its closure (unlike in globular proteins). Interestingly, similar fragments, commonly referred to as solenoids, are present in many naturally occurring proteins, where their propagation is arrested by suitably located "stopper" fragments. In this work, we analyze the distribution of hydrophobicity in solenoids and in their corresponding "stoppers" from the point of view of the fuzzy oil drop model (called FOD in this paper). This model characterizes the unique linear propagation of local hydrophobicity in the solenoid fragment and allows us to pinpoint "stopper" sequences, where local hydrophobicity quite closely resembles conditions encountered in globular proteins. Consequently, such fragments perform their function by mediating entropically advantageous contact with the water environment. We discuss examples of amyloid-like structures in solenoids, with particular attention to "stop" segments present in properly folded proteins found in living organisms.
Collapse
Affiliation(s)
- Irena Roterman
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, 31-530 Krakow, Poland.
| | - Mateusz Banach
- Department of Bioinformatics and Telemedicine, Medical College, Jagiellonian University, 31-530 Krakow, Poland.
| | - Leszek Konieczny
- Chair of Medical Biochemistry, Medical College, Jagiellonian University, 31-034 Krakow, Poland.
| |
Collapse
|
20
|
Jiang T, Magnotti EL, Conticello VP. Geometrical frustration as a potential design principle for peptide-based assemblies. Interface Focus 2017; 7:20160141. [PMID: 29147554 DOI: 10.1098/rsfs.2016.0141] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Two-dimensional peptide and protein assemblies have been the focus of increased scientific research as they display significant potential for the creation of functional nanomaterials. Soluble subunits derived from a variety of protein motifs have been demonstrated to self-assemble into structurally defined nanosheets under environmentally benign conditions in which the components often retain their native structure and function. These types of two-dimensional assemblies may have an advantage for nanofabrication in that their extended planar shapes can be more straightforwardly incorporated into the current formats of nanoscale devices. However, significant challenges remain in the fabrication of these materials, particularly in devising methods to control the size, shape and internal structure of the resultant materials. Geometrical frustration may be envisioned as a possible mechanism to exert control over these structural parameters through rational design. While this objective has yet to be realized in practice, we discuss in this article the potential role of geometrical frustration as a principle to rationalize unusual self-assembly behaviour in several examples of two-dimensional peptide assemblies.
Collapse
Affiliation(s)
- Tao Jiang
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA 30322, USA
| | - Elizabeth L Magnotti
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA 30322, USA
| | - Vincent P Conticello
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA 30322, USA
| |
Collapse
|
21
|
Wood CW, Woolfson DN. CCBuilder 2.0: Powerful and accessible coiled-coil modeling. Protein Sci 2017; 27:103-111. [PMID: 28836317 PMCID: PMC5734305 DOI: 10.1002/pro.3279] [Citation(s) in RCA: 91] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 08/22/2017] [Indexed: 01/06/2023]
Abstract
The increased availability of user-friendly and accessible computational tools for biomolecular modeling would expand the reach and application of biomolecular engineering and design. For protein modeling, one key challenge is to reduce the complexities of 3D protein folds to sets of parametric equations that nonetheless capture the salient features of these structures accurately. At present, this is possible for a subset of proteins, namely, repeat proteins. The α-helical coiled coil provides one such example, which represents ≈ 3-5% of all known protein-encoding regions of DNA. Coiled coils are bundles of α helices that can be described by a small set of structural parameters. Here we describe how this parametric description can be implemented in an easy-to-use web application, called CCBuilder 2.0, for modeling and optimizing both α-helical coiled coils and polyproline-based collagen triple helices. This has many applications from providing models to aid molecular replacement for X-ray crystallography, in silico model building and engineering of natural and designed protein assemblies, and through to the creation of completely de novo "dark matter" protein structures. CCBuilder 2.0 is available as a web-based application, the code for which is open-source and can be downloaded freely. http://coiledcoils.chm.bris.ac.uk/ccbuilder2. LAY SUMMARY We have created CCBuilder 2.0, an easy to use web-based application that can model structures for a whole class of proteins, the α-helical coiled coil, which is estimated to account for 3-5% of all proteins in nature. CCBuilder 2.0 will be of use to a large number of protein scientists engaged in fundamental studies, such as protein structure determination, through to more-applied research including designing and engineering novel proteins that have potential applications in biotechnology.
Collapse
Affiliation(s)
- Christopher W Wood
- School of Chemistry, University of Bristol, Cantock's Close, Bristol, BS8 1TS, United Kingdom
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Cantock's Close, Bristol, BS8 1TS, United Kingdom.,School of Biochemistry, University of Bristol, Medical Sciences Building, University Walk, Bristol, BS8 1TD, United Kingdom.,BrisSynBio, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol, BS8 1TQ, United Kingdom
| |
Collapse
|
22
|
Designing repeat proteins: a modular approach to protein design. Curr Opin Struct Biol 2017; 45:116-123. [DOI: 10.1016/j.sbi.2017.02.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2017] [Revised: 02/06/2017] [Accepted: 02/16/2017] [Indexed: 01/01/2023]
|
23
|
Computational protein design with backbone plasticity. Biochem Soc Trans 2016; 44:1523-1529. [PMID: 27911735 PMCID: PMC5264498 DOI: 10.1042/bst20160155] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2016] [Revised: 08/01/2016] [Accepted: 08/03/2016] [Indexed: 11/17/2022]
Abstract
The computational algorithms used in the design of artificial proteins have become increasingly sophisticated in recent years, producing a series of remarkable successes. The most dramatic of these is the de novo design of artificial enzymes. The majority of these designs have reused naturally occurring protein structures as ‘scaffolds’ onto which novel functionality can be grafted without having to redesign the backbone structure. The incorporation of backbone flexibility into protein design is a much more computationally challenging problem due to the greatly increased search space, but promises to remove the limitations of reusing natural protein scaffolds. In this review, we outline the principles of computational protein design methods and discuss recent efforts to consider backbone plasticity in the design process.
Collapse
|