1
|
Qing R, Hao S, Smorodina E, Jin D, Zalevsky A, Zhang S. Protein Design: From the Aspect of Water Solubility and Stability. Chem Rev 2022; 122:14085-14179. [PMID: 35921495 PMCID: PMC9523718 DOI: 10.1021/acs.chemrev.1c00757] [Citation(s) in RCA: 54] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Indexed: 12/13/2022]
Abstract
Water solubility and structural stability are key merits for proteins defined by the primary sequence and 3D-conformation. Their manipulation represents important aspects of the protein design field that relies on the accurate placement of amino acids and molecular interactions, guided by underlying physiochemical principles. Emulated designer proteins with well-defined properties both fuel the knowledge-base for more precise computational design models and are used in various biomedical and nanotechnological applications. The continuous developments in protein science, increasing computing power, new algorithms, and characterization techniques provide sophisticated toolkits for solubility design beyond guess work. In this review, we summarize recent advances in the protein design field with respect to water solubility and structural stability. After introducing fundamental design rules, we discuss the transmembrane protein solubilization and de novo transmembrane protein design. Traditional strategies to enhance protein solubility and structural stability are introduced. The designs of stable protein complexes and high-order assemblies are covered. Computational methodologies behind these endeavors, including structure prediction programs, machine learning algorithms, and specialty software dedicated to the evaluation of protein solubility and aggregation, are discussed. The findings and opportunities for Cryo-EM are presented. This review provides an overview of significant progress and prospects in accurate protein design for solubility and stability.
Collapse
Affiliation(s)
- Rui Qing
- State
Key Laboratory of Microbial Metabolism, School of Life Sciences and
Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- The
David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Shilei Hao
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Key
Laboratory of Biorheological Science and Technology, Ministry of Education, College of Bioengineering, Chongqing University, Chongqing 400030, China
| | - Eva Smorodina
- Department
of Immunology, University of Oslo and Oslo
University Hospital, Oslo 0424, Norway
| | - David Jin
- Avalon GloboCare
Corp., Freehold, New Jersey 07728, United States
| | - Arthur Zalevsky
- Laboratory
of Bioinformatics Approaches in Combinatorial Chemistry and Biology, Shemyakin−Ovchinnikov Institute of Bioorganic
Chemistry RAS, Moscow 117997, Russia
| | - Shuguang Zhang
- Media
Lab, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
2
|
Chen H, Ma L, Dai H, Fu Y, Wang H, Zhang Y. Advances in Rational Protein Engineering toward Functional Architectures and Their Applications in Food Science. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:4522-4533. [PMID: 35353517 DOI: 10.1021/acs.jafc.2c00232] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Protein biomolecules including enzymes, cagelike proteins, and specific peptides have been continuously exploited as functional biomaterials applied in catalysis, nutrient delivery, and food preservation in food-related areas. However, natural proteins usually function well in physiological conditions, not industrial conditions, or may possess undesirable physical and chemical properties. Currently, rational protein design as a valuable technology has attracted extensive attention for the rational engineering or fabrication of ideal protein biomaterials with novel properties and functionality. This article starts with the underlying knowledge of protein folding and assembly and is followed by the introduction of the principles and strategies for rational protein design. Basic strategies for rational protein engineering involving experienced protein tailoring, computational prediction, computation redesign, and de novo protein design are summarized. Then, we focus on the recent progress of rational protein engineering or design in the application of food science, and a comprehensive summary ranging from enzyme manufacturing to cagelike protein nanocarriers engineering and antimicrobial peptides preparation is given. Overall, this review highlights the importance of rational protein engineering in food biomaterial preparation which could be beneficial for food science.
Collapse
Affiliation(s)
- Hai Chen
- College of Food Science, Southwest University, Chongqing 400715, China
| | - Liang Ma
- College of Food Science, Southwest University, Chongqing 400715, China
| | - Hongjie Dai
- College of Food Science, Southwest University, Chongqing 400715, China
| | - Yu Fu
- College of Food Science, Southwest University, Chongqing 400715, China
| | - Hongxia Wang
- College of Food Science, Southwest University, Chongqing 400715, China
| | - Yuhao Zhang
- College of Food Science, Southwest University, Chongqing 400715, China
| |
Collapse
|
3
|
Revolutionizing enzyme engineering through artificial intelligence and machine learning. Emerg Top Life Sci 2021; 5:113-125. [PMID: 33835131 DOI: 10.1042/etls20200257] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 03/17/2021] [Accepted: 03/22/2021] [Indexed: 12/20/2022]
Abstract
The combinatorial space of an enzyme sequence has astronomical possibilities and exploring it with contemporary experimental techniques is arduous and often ineffective. Multi-target objectives such as concomitantly achieving improved selectivity, solubility and activity of an enzyme have narrow plausibility under approaches of restricted mutagenesis and combinatorial search. Traditional enzyme engineering approaches have a limited scope for complex optimization due to the requirement of a priori knowledge or experimental burden of screening huge protein libraries. The recent surge in high-throughput experimental methods including Next Generation Sequencing and automated screening has flooded the field of molecular biology with big-data, which requires us to re-think our concurrent approaches towards enzyme engineering. Artificial Intelligence (AI) and Machine Learning (ML) have great potential to revolutionize smart enzyme engineering without the explicit need for a complete understanding of the underlying molecular system. Here, we portray the role and position of AI techniques in the field of enzyme engineering along with their scope and limitations. In addition, we explain how the traditional approaches of directed evolution and rational design can be extended through AI tools. Recent successful examples of AI-assisted enzyme engineering projects and their deviation from traditional approaches are highlighted. A comprehensive picture of current challenges and future avenues for AI in enzyme engineering are also discussed.
Collapse
|
4
|
Abstract
Atom pairwise potential functions make up an essential part of many scoring functions for protein decoy detection. With the development of machine learning (ML) tools, there are multiple ways to combine potential functions to create novel ML models and methods. Potential function parameters can be easily extracted; however, it is usually hard to directly obtain the calculated atom pairwise energies from scoring functions. Amber, as one of the most popular suites of modeling programs, has an extensive history and library of force field potential functions. In this work, we directly used the force field parameters in ff94 and ff14SB from Amber and encoded them to calculate atom pairwise energies for different interactions. Two sets of structures (single amino acid set and a dipeptide set) were used to evaluate the performance of our encoded Amber potentials. From the comparison results between energy terms obtained from our encoding and Amber, we find energy difference within ±0.06 kcal/mol for all tested structures. Previously we have shown that the Random Forest (RF) model can help to emphasize more important atom pairwise interactions and ignore insignificant ones [Pei, J.; Zheng, Z.; Merz, K. M. J. Chem. Inf. Model. 2019, 59, 1919-1929]. Here, as an example of combining ML methods with traditional potential functions, we followed the same work flow to combine the RF models with force field potential functions from Amber. To determine the performance of our RF models with force field potential functions, 224 different protein native-decoy systems were used as our training and testing sets We find that the RF models with ff94 and ff14SB force field parameters outperformed all other scoring functions (RF models with KECSA2, RWplus, DFIRE, dDFIRE, and GOAP) considered in this work for native structure detection, and they performed similarly in detecting the best decoy. Through inclusion of best decoy to decoy comparisons in building our RF models, we were able to generate models that outperformed the score functions tested herein both on accuracy and best decoy detection, again showing the performance and flexibility of our RF models to tackle this problem. Finally, the importance of the RF algorithm and force field parameters were also tested and the comparison results suggest that both the RF algorithm and force field potentials are important with the ML scoring function achieving its best performance only by combining them together. All code and data used in this work are available at https://github.com/JunPei000/FFENCODER_for_Protein_Folding_Pose_Selection.
Collapse
Affiliation(s)
- Jun Pei
- Department of Chemistry and the Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
| | - Lin Frank Song
- Department of Chemistry and the Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
| | - Kenneth M Merz
- Department of Chemistry and the Department of Biochemistry and Molecular Biology, Michigan State University, 578 South Shaw Lane, East Lansing, Michigan 48824, United States
| |
Collapse
|
5
|
Gonzalez-Rivera JC, Orr AA, Engels SM, Jakubowski JM, Sherman MW, O'Connor KN, Matteson T, Woodcock BC, Contreras LM, Tamamis P. Computational evolution of an RNA-binding protein towards enhanced oxidized-RNA binding. Comput Struct Biotechnol J 2020; 18:137-152. [PMID: 31988703 PMCID: PMC6965710 DOI: 10.1016/j.csbj.2019.12.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2019] [Revised: 12/06/2019] [Accepted: 12/06/2019] [Indexed: 12/02/2022] Open
Abstract
The oxidation of RNA has been implicated in the development of many diseases. Among the four ribonucleotides, guanosine is the most susceptible to oxidation, resulting in the formation of 8-oxo-7,8-dihydroguanosine (8-oxoG). Despite the limited knowledge about how cells regulate the detrimental effects of oxidized RNA, cellular factors involved in its regulation have begun to be identified. One of these factors is polynucleotide phosphorylase (PNPase), a multifunctional enzyme implicated in RNA turnover. In the present study, we have examined the interaction of PNPase with 8-oxoG in atomic detail to provide insights into the mechanism of 8-oxoG discrimination. We hypothesized that PNPase subunits cooperate to form a binding site using the dynamic SFF loop within the central channel of the PNPase homotrimer. We evolved this site using a novel approach that initially screened mutants from a library of beneficial mutations and assessed their interactions using multi-nanosecond Molecular Dynamics simulations. We found that evolving this single site resulted in a fold change increase in 8-oxoG affinity between 1.2 and 1.5 and/or selectivity between 1.5 and 1.9. In addition to the improvement in 8-oxoG binding, complementation of K12 Δpnp with plasmids expressing mutant PNPases caused increased cell tolerance to H2O2. This observation provides a clear link between molecular discrimination of RNA oxidation and cell survival. Moreover, this study provides a framework for the manipulation of modified-RNA protein readers, which has potential application in synthetic biology and epitranscriptomics.
Collapse
Affiliation(s)
- Juan C. Gonzalez-Rivera
- McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States
| | - Asuka A. Orr
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, 3122 TAMU Room 200, College Station, TX 77843, United States
| | - Sean M. Engels
- McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States
| | - Joseph M. Jakubowski
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, 3122 TAMU Room 200, College Station, TX 77843, United States
| | - Mark W. Sherman
- Institute of Cellular and Molecular Biology, The University of Texas at Austin, 100 E 24th Street, Stop A5000, Austin, TX 78712, United States
| | - Katherine N. O'Connor
- McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States
| | - Tomas Matteson
- Institute of Cellular and Molecular Biology, The University of Texas at Austin, 100 E 24th Street, Stop A5000, Austin, TX 78712, United States
| | - Brendan C. Woodcock
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, 3122 TAMU Room 200, College Station, TX 77843, United States
| | - Lydia M. Contreras
- McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States
- Institute of Cellular and Molecular Biology, The University of Texas at Austin, 100 E 24th Street, Stop A5000, Austin, TX 78712, United States
- Corresponding authors at: McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States (L.M. Contreras).
| | - Phanourios Tamamis
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, 3122 TAMU Room 200, College Station, TX 77843, United States
- Corresponding authors at: McKetta Department of Chemical Engineering, The University of Texas, 200 E. Dean Keeton Street Stop C0400, Austin, TX 78712, United States (L.M. Contreras).
| |
Collapse
|
6
|
Pei J, Zheng Z, Merz KM. Random Forest Refinement of the KECSA2 Knowledge-Based Scoring Function for Protein Decoy Detection. J Chem Inf Model 2019; 59:1919-1929. [DOI: 10.1021/acs.jcim.8b00734] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Jun Pei
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Zheng Zheng
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
| | - Kenneth M. Merz
- Department of Chemistry, Michigan State University, 578 S. Shaw Lane, East Lansing, Michigan 48824, United States
- Institute for Cyber Enabled Research, Michigan State University, 567 Wilson Road, East Lansing, Michigan 48824, United States
| |
Collapse
|
7
|
The state-of-the-art strategies of protein engineering for enzyme stabilization. Biotechnol Adv 2018; 37:530-537. [PMID: 31138425 DOI: 10.1016/j.biotechadv.2018.10.011] [Citation(s) in RCA: 89] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2017] [Revised: 10/12/2018] [Accepted: 10/25/2018] [Indexed: 12/11/2022]
Abstract
Enzymes generated by natural recruitment and protein engineering have greatly contribute in various sets of applications. However, their insufficient stability is a bottleneck that limit the rapid development of biocatalysis. Novel approaches based on precise and global structural dissection, advanced gene manipulation, and combination with the multidisciplinary techniques open a new horizon to generate stable enzymes efficiently. Here, we comprehensively introduced emerging advances of protein engineering strategies for enzyme stabilization. Then, we highlighted practical cases to show importance of enzyme stabilization in pharmaceutical and industrial applications. Combining computational enzyme design with molecular evolution will hold considerable promise in this field.
Collapse
|
8
|
Wang L, Dash S, Ng CY, Maranas CD. A review of computational tools for design and reconstruction of metabolic pathways. Synth Syst Biotechnol 2017; 2:243-252. [PMID: 29552648 PMCID: PMC5851934 DOI: 10.1016/j.synbio.2017.11.002] [Citation(s) in RCA: 71] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2017] [Revised: 11/06/2017] [Accepted: 11/06/2017] [Indexed: 11/28/2022] Open
Abstract
Metabolic pathways reflect an organism's chemical repertoire and hence their elucidation and design have been a primary goal in metabolic engineering. Various computational methods have been developed to design novel metabolic pathways while taking into account several prerequisites such as pathway stoichiometry, thermodynamics, host compatibility, and enzyme availability. The choice of the method is often determined by the nature of the metabolites of interest and preferred host organism, along with computational complexity and availability of software tools. In this paper, we review different computational approaches used to design metabolic pathways based on the reaction network representation of the database (i.e., graph or stoichiometric matrix) and the search algorithm (i.e., graph search, flux balance analysis, or retrosynthetic search). We also put forth a systematic workflow that can be implemented in projects requiring pathway design and highlight current limitations and obstacles in computational pathway design.
Collapse
Affiliation(s)
- Lin Wang
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Satyakam Dash
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Chiam Yu Ng
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
9
|
Computational approaches for de novo design and redesign of metal-binding sites on proteins. Biosci Rep 2017; 37:BSR20160179. [PMID: 28167677 PMCID: PMC5482196 DOI: 10.1042/bsr20160179] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2016] [Revised: 02/06/2017] [Accepted: 02/06/2017] [Indexed: 12/25/2022] Open
Abstract
Metal ions play pivotal roles in protein structure, function and stability. The functional and structural diversity of proteins in nature expanded with the incorporation of metal ions or clusters in proteins. Approximately one-third of these proteins in the databases contain metal ions. Many biological and chemical processes in nature involve metal ion-binding proteins, aka metalloproteins. Many cellular reactions that underpin life require metalloproteins. Most of the remarkable, complex chemical transformations are catalysed by metalloenzymes. Realization of the importance of metal-binding sites in a variety of cellular events led to the advancement of various computational methods for their prediction and characterization. Furthermore, as structural and functional knowledgebase about metalloproteins is expanding with advances in computational and experimental fields, the focus of the research is now shifting towards de novo design and redesign of metalloproteins to extend nature’s own diversity beyond its limits. In this review, we will focus on the computational toolbox for prediction of metal ion-binding sites, de novo metalloprotein design and redesign. We will also give examples of tailor-made artificial metalloproteins designed with the computational toolbox.
Collapse
|
10
|
Gorham R, Forest DL, Khoury GA, Smadbeck J, Beecher CN, Healy ED, Tamamis P, Archontis G, Larive C, Floudas CA, Radeke MJ, Johnson LV, Morikis D. New compstatin peptides containing N-terminal extensions and non-natural amino acids exhibit potent complement inhibition and improved solubility characteristics. J Med Chem 2015; 58:814-26. [PMID: 25494040 PMCID: PMC4306506 DOI: 10.1021/jm501345y] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Indexed: 01/21/2023]
Abstract
Compstatin peptides are complement inhibitors that bind and inhibit cleavage of complement C3. Peptide binding is enhanced by hydrophobic interactions; however, poor solubility promotes aggregation in aqueous environments. We have designed new compstatin peptides derived from the W4A9 sequence (Ac-ICVWQDWGAHRCT-NH2, cyclized between C2 and C12), based on structural, computational, and experimental studies. Furthermore, we developed and utilized a computational framework for the design of peptides containing non-natural amino acids. These new compstatin peptides contain polar N-terminal extensions and non-natural amino acid substitutions at positions 4 and 9. Peptides with α-modified non-natural alanine analogs at position 9, as well as peptides containing only N-terminal polar extensions, exhibited similar activity compared to W4A9, as quantified via ELISA, hemolytic, and cell-based assays, and showed improved solubility, as measured by UV absorbance and reverse-phase HPLC experiments. Because of their potency and solubility, these peptides are promising candidates for therapeutic development in numerous complement-mediated diseases.
Collapse
Affiliation(s)
- Ronald
D. Gorham
- Department
of Bioengineering, University of California, Riverside, California 92521, United States
| | - David L. Forest
- Center
for the Study of Macular Degeneration, Neuroscience Research Institute, University of California, Santa Barbara, California 93106, United States
| | - George A. Khoury
- Department
of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - James Smadbeck
- Department
of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Consuelo N. Beecher
- Department
of Chemistry, University of California, Riverside, California 92521, United States
| | - Evangeline D. Healy
- Center
for the Study of Macular Degeneration, Neuroscience Research Institute, University of California, Santa Barbara, California 93106, United States
| | - Phanourios Tamamis
- Department
of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
- Department
of Physics, University of Cyprus, PO20537, CY1678 Nicosia, Cyprus
| | - Georgios Archontis
- Department
of Physics, University of Cyprus, PO20537, CY1678 Nicosia, Cyprus
| | - Cynthia
K. Larive
- Department
of Chemistry, University of California, Riverside, California 92521, United States
| | - Christodoulos A. Floudas
- Department
of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Monte J. Radeke
- Center
for the Study of Macular Degeneration, Neuroscience Research Institute, University of California, Santa Barbara, California 93106, United States
| | - Lincoln V. Johnson
- Center
for the Study of Macular Degeneration, Neuroscience Research Institute, University of California, Santa Barbara, California 93106, United States
| | - Dimitrios Morikis
- Department
of Bioengineering, University of California, Riverside, California 92521, United States
| |
Collapse
|
11
|
Khoury GA, Smadbeck J, Tamamis P, Vandris AC, Kieslich CA, Floudas CA. Forcefield_NCAA: ab initio charge parameters to aid in the discovery and design of therapeutic proteins and peptides with unnatural amino acids and their application to complement inhibitors of the compstatin family. ACS Synth Biol 2014; 3:855-69. [PMID: 24932669 PMCID: PMC4277759 DOI: 10.1021/sb400168u] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
We describe the development and testing of ab initio derived, AMBER ff03 compatible charge parameters for a large library of 147 noncanonical amino acids including β- and N-methylated amino acids for use in applications such as protein structure prediction and de novo protein design. The charge parameter derivation was performed using the RESP fitting approach. Studies were performed assessing the suitability of the derived charge parameters in discriminating the activity/inactivity between 63 analogs of the complement inhibitor Compstatin on the basis of previously published experimental IC50 data and a screening procedure involving short simulations and binding free energy calculations. We found that both the approximate binding affinity (K*) and the binding free energy calculated through MM-GBSA are capable of discriminating between active and inactive Compstatin analogs, with MM-GBSA performing significantly better. Key interactions between the most potent Compstatin analog that contains a noncanonical amino acid are presented and compared to the most potent analog containing only natural amino acids and native Compstatin. We make the derived parameters and an associated web interface that is capable of performing modifications on proteins using Forcefield_NCAA and outputting AMBER-ready topology and parameter files freely available for academic use at http://selene.princeton.edu/FFNCAA . The forcefield allows one to incorporate these customized amino acids into design applications with control over size, van der Waals, and electrostatic interactions.
Collapse
Affiliation(s)
- George A. Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Phanourios Tamamis
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Andrew C. Vandris
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Chris A. Kieslich
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Christodoulos A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
12
|
Smadbeck J, Chan KH, Khoury GA, Xue B, Robinson RC, Hauser CAE, Floudas CA. De novo design and experimental characterization of ultrashort self-associating peptides. PLoS Comput Biol 2014; 10:e1003718. [PMID: 25010703 PMCID: PMC4091692 DOI: 10.1371/journal.pcbi.1003718] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Accepted: 05/31/2014] [Indexed: 12/19/2022] Open
Abstract
Self-association is a common phenomenon in biology and one that can have positive and negative impacts, from the construction of the architectural cytoskeleton of cells to the formation of fibrils in amyloid diseases. Understanding the nature and mechanisms of self-association is important for modulating these systems and in creating biologically-inspired materials. Here, we present a two-stage de novo peptide design framework that can generate novel self-associating peptide systems. The first stage uses a simulated multimeric template structure as input into the optimization-based Sequence Selection to generate low potential energy sequences. The second stage is a computational validation procedure that calculates Fold Specificity and/or Approximate Association Affinity (K*association) based on metrics that we have devised for multimeric systems. This framework was applied to the design of self-associating tripeptides using the known self-associating tripeptide, Ac-IVD, as a structural template. Six computationally predicted tripeptides (Ac-LVE, Ac-YYD, Ac-LLE, Ac-YLD, Ac-MYD, Ac-VIE) were chosen for experimental validation in order to illustrate the self-association outcomes predicted by the three metrics. Self-association and electron microscopy studies revealed that Ac-LLE formed bead-like microstructures, Ac-LVE and Ac-YYD formed fibrillar aggregates, Ac-VIE and Ac-MYD formed hydrogels, and Ac-YLD crystallized under ambient conditions. An X-ray crystallographic study was carried out on a single crystal of Ac-YLD, which revealed that each molecule adopts a β-strand conformation that stack together to form parallel β-sheets. As an additional validation of the approach, the hydrogel-forming sequences of Ac-MYD and Ac-VIE were shuffled. The shuffled sequences were computationally predicted to have lower K*association values and were experimentally verified to not form hydrogels. This illustrates the robustness of the framework in predicting self-associating tripeptides. We expect that this enhanced multimeric de novo peptide design framework will find future application in creating novel self-associating peptides based on unnatural amino acids, and inhibitor peptides of detrimental self-aggregating biological proteins.
Collapse
Affiliation(s)
- James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| | - Kiat Hwa Chan
- Institute of Bioengineering and Nanotechnology, Singapore, Singapore
| | - George A. Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| | - Bo Xue
- Institute of Molecular and Cell Biology, A*STAR (Agency of Science, Technology and Research), Biopolis, Singapore, Singapore
| | - Robert C. Robinson
- Institute of Molecular and Cell Biology, A*STAR (Agency of Science, Technology and Research), Biopolis, Singapore, Singapore
| | | | - Christodoulos A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
13
|
Elucidating a key anti-HIV-1 and cancer-associated axis: the structure of CCL5 (Rantes) in complex with CCR5. Sci Rep 2014; 4:5447. [PMID: 24965094 PMCID: PMC4894430 DOI: 10.1038/srep05447] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2014] [Accepted: 06/05/2014] [Indexed: 01/01/2023] Open
Abstract
CCL5 (RANTES) is an inflammatory chemokine which binds to chemokine receptor CCR5 and induces signaling. The CCL5:CCR5 associated chemotactic signaling is of critical biological importance and is a potential HIV-1 therapeutic axis. Several studies provided growing evidence for the expression of CCL5 and CCR5 in non-hematological malignancies. Therefore, the delineation of the CCL5:CCR5 complex structure can pave the way for novel CCR5-targeted drugs. We employed a computational protocol which is primarily based on free energy calculations and molecular dynamics simulations, and report, what is to our knowledge, the first computationally derived CCL5:CCR5 complex structure which is in excellent agreement with experimental findings and clarifies the functional role of CCL5 and CCR5 residues which are associated with binding and signaling. A wealth of polar and non-polar interactions contributes to the tight CCL5:CCR5 binding. The structure of an HIV-1 gp120 V3 loop in complex with CCR5 has recently been derived through a similar computational protocol. A comparison between the CCL5 : CCR5 and the HIV-1 gp120 V3 loop : CCR5 complex structures depicts that both the chemokine and the virus primarily interact with the same CCR5 residues. The present work provides insights into the blocking mechanism of HIV-1 by CCL5.
Collapse
|
14
|
Tamamis P, Floudas CA. Molecular recognition of CCR5 by an HIV-1 gp120 V3 loop. PLoS One 2014; 9:e95767. [PMID: 24763408 PMCID: PMC3999033 DOI: 10.1371/journal.pone.0095767] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Accepted: 03/29/2014] [Indexed: 12/04/2022] Open
Abstract
The binding of protein HIV-1 gp120 to coreceptors CCR5 or CXCR4 is a key step of the HIV-1 entry to the host cell, and is predominantly mediated through the V3 loop fragment of HIV-1 gp120. In the present work, we delineate the molecular recognition of chemokine receptor CCR5 by a dual tropic HIV-1 gp120 V3 loop, using a comprehensive set of computational tools predominantly based on molecular dynamics simulations and free energy calculations. We report, what is to our knowledge, the first complete HIV-1 gp120 V3 loop : CCR5 complex structure, which includes the whole V3 loop and the N-terminus of CCR5, and exhibits exceptional agreement with previous experimental findings. The computationally derived structure sheds light into the functional role of HIV-1 gp120 V3 loop and CCR5 residues associated with the HIV-1 coreceptor activity, and provides insights into the HIV-1 coreceptor selectivity and the blocking mechanism of HIV-1 gp120 by maraviroc. By comparing the binding of the specific dual tropic HIV-1 gp120 V3 loop with CCR5 and CXCR4, we observe that the HIV-1 gp120 V3 loop residues 13-21, which include the tip, share nearly identical structural and energetic properties in complex with both coreceptors. This result paves the way for the design of dual CCR5/CXCR4 targeted peptides as novel potential anti-AIDS therapeutics.
Collapse
Affiliation(s)
- Phanourios Tamamis
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| | - Christodoulos A. Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|
15
|
Tamamis P, Floudas CA. Elucidating a key component of cancer metastasis: CXCL12 (SDF-1α) binding to CXCR4. J Chem Inf Model 2014; 54:1174-88. [PMID: 24660779 PMCID: PMC4004218 DOI: 10.1021/ci500069y] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
![]()
The chemotactic signaling induced
by the binding of chemokine CXCL12
(SDF-1α) to chemokine receptor CXCR4 is of significant biological
importance and is a potential therapeutic axis against HIV-1. However,
as CXCR4 is overexpressed in certain cancer cells, the CXCL12:CXCR4
signaling is involved in tumor metastasis, progression, angiogenesis,
and survival. Motivated by the pivotal role of the CXCL12:CXCR4 axis
in cancer, we employed a comprehensive set of computational tools,
predominantly based on free energy calculations and molecular dynamics
simulations, to obtain insights into the molecular recognition of
CXCR4 by CXCL12. We report, what is to our knowledge, the first computationally
derived CXCL12:CXCR4 complex structure which is in remarkable agreement
with experimental findings and sheds light into the functional role
of CXCL12 and CXCR4 residues which are associated with binding and
signaling. Our results reveal that the CXCL12 N-terminal domain is
firmly bound within the CXCR4 transmembrane domain, and the central
24–50 residue domain of CXCL12 interacts with the upper N-terminal
domain of CXCR4. The stability of the CXCL12:CXCR4 complex structure
is attributed to an abundance of nonpolar and polar intermolecular
interactions, including salt bridges formed between positively charged
CXCL12 residues and negatively charged CXCR4 residues. The success
of the computational protocol can mainly be attributed to the nearly
exhaustive docking conformational search, as well as the heterogeneous
dielectric implicit water-membrane-water model used to simulate and
select the optimum conformations. We also recently utilized this protocol
to elucidate the binding of an HIV-1 gp120 V3 loop in complex with
CXCR4, and a comparison between the molecular recognition of CXCR4
by CXCL12 and the HIV-1 gp120 V3 loop shows that both CXCL12 and the
HIV-1 gp120 V3 loop share the same CXCR4 binding pocket, as they mostly
interact with the same CXCR4 residues.
Collapse
Affiliation(s)
- Phanourios Tamamis
- Department of Chemical and Biological Engineering, Princeton University , New Jersey 08544, United States
| | | |
Collapse
|
16
|
Computational tools for designing and engineering enzymes. Curr Opin Chem Biol 2013; 19:8-16. [PMID: 24780274 DOI: 10.1016/j.cbpa.2013.12.003] [Citation(s) in RCA: 132] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2013] [Revised: 12/04/2013] [Accepted: 12/04/2013] [Indexed: 01/23/2023]
Abstract
Protein engineering strategies aimed at constructing enzymes with novel or improved activities, specificities, and stabilities greatly benefit from in silico methods. Computational methods can be principally grouped into three main categories: bioinformatics; molecular modelling; and de novo design. Particularly de novo protein design is experiencing rapid development, resulting in more robust and reliable predictions. A recent trend in the field is to combine several computational approaches in an interactive manner and to complement them with structural analysis and directed evolution. A detailed investigation of designed catalysts provides valuable information on the structural basis of molecular recognition, biochemical catalysis, and natural protein evolution.
Collapse
|
17
|
Khoury GA, Smadbeck J, Kieslich CA, Floudas CA. Protein folding and de novo protein design for biotechnological applications. Trends Biotechnol 2013; 32:99-109. [PMID: 24268901 DOI: 10.1016/j.tibtech.2013.10.008] [Citation(s) in RCA: 101] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Revised: 10/10/2013] [Accepted: 10/18/2013] [Indexed: 11/19/2022]
Abstract
In the postgenomic era, the medical/biological fields are advancing faster than ever. However, before the power of full-genome sequencing can be fully realized, the connection between amino acid sequence and protein structure, known as the protein folding problem, needs to be elucidated. The protein folding problem remains elusive, with significant difficulties still arising when modeling amino acid sequences lacking an identifiable template. Understanding protein folding will allow for unforeseen advances in protein design; often referred to as the inverse protein folding problem. Despite challenges in protein folding, de novo protein design has recently demonstrated significant success via computational techniques. We review advances and challenges in protein structure prediction and de novo protein design, and highlight their interplay in successful biotechnological applications.
Collapse
Affiliation(s)
- George A Khoury
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA
| | - James Smadbeck
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA
| | - Chris A Kieslich
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA
| | - Christodoulos A Floudas
- Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ 08544, USA.
| |
Collapse
|