1
|
Son A, Park J, Kim W, Yoon Y, Lee S, Park Y, Kim H. Revolutionizing Molecular Design for Innovative Therapeutic Applications through Artificial Intelligence. Molecules 2024; 29:4626. [PMID: 39407556 PMCID: PMC11477718 DOI: 10.3390/molecules29194626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Revised: 09/19/2024] [Accepted: 09/27/2024] [Indexed: 10/20/2024] Open
Abstract
The field of computational protein engineering has been transformed by recent advancements in machine learning, artificial intelligence, and molecular modeling, enabling the design of proteins with unprecedented precision and functionality. Computational methods now play a crucial role in enhancing the stability, activity, and specificity of proteins for diverse applications in biotechnology and medicine. Techniques such as deep learning, reinforcement learning, and transfer learning have dramatically improved protein structure prediction, optimization of binding affinities, and enzyme design. These innovations have streamlined the process of protein engineering by allowing the rapid generation of targeted libraries, reducing experimental sampling, and enabling the rational design of proteins with tailored properties. Furthermore, the integration of computational approaches with high-throughput experimental techniques has facilitated the development of multifunctional proteins and novel therapeutics. However, challenges remain in bridging the gap between computational predictions and experimental validation and in addressing ethical concerns related to AI-driven protein design. This review provides a comprehensive overview of the current state and future directions of computational methods in protein engineering, emphasizing their transformative potential in creating next-generation biologics and advancing synthetic biology.
Collapse
Affiliation(s)
- Ahrum Son
- Department of Molecular Medicine, Scripps Research, La Jolla, CA 92037, USA;
| | - Jongham Park
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Woojin Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Yoonki Yoon
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Sangwoon Lee
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Yongho Park
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
| | - Hyunsoo Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea; (J.P.); (W.K.); (Y.Y.); (S.L.); (Y.P.)
- Department of Convergent Bioscience and Informatics, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- Protein AI Design Institute, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- SCICS, Prove beyond AI, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| |
Collapse
|
2
|
Aguilera-Puga MDC, Cancelarich NL, Marani MM, de la Fuente-Nunez C, Plisson F. Accelerating the Discovery and Design of Antimicrobial Peptides with Artificial Intelligence. Methods Mol Biol 2024; 2714:329-352. [PMID: 37676607 DOI: 10.1007/978-1-0716-3441-7_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
Peptides modulate many processes of human physiology targeting ion channels, protein receptors, or enzymes. They represent valuable starting points for the development of new biologics against communicable and non-communicable disorders. However, turning native peptide ligands into druggable materials requires high selectivity and efficacy, predictable metabolism, and good safety profiles. Machine learning models have gradually emerged as cost-effective and time-saving solutions to predict and generate new proteins with optimal properties. In this chapter, we will discuss the evolution and applications of predictive modeling and generative modeling to discover and design safe and effective antimicrobial peptides. We will also present their current limitations and suggest future research directions, applicable to peptide drug design campaigns.
Collapse
Affiliation(s)
- Mariana D C Aguilera-Puga
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico
| | - Natalia L Cancelarich
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Mariela M Marani
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Puerto Madryn, Argentina
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Institute for Biomedical Informatics, Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Departments of Bioengineering and Chemical and Biomolecular Engineering, School of Engineering and Applied Science, University of Pennsylvania, Philadelphia, PA, USA.
- Penn Institute for Computational Science, University of Pennsylvania, Philadelphia, PA, USA.
| | - Fabien Plisson
- Centro de Investigación y de Estudios Avanzados del IPN (CINVESTAV-IPN), Unidad de Genómica Avanzada, Laboratorio Nacional de Genómica para la Biodiversidad (Langebio), Irapuato, Guanajuato, Mexico.
- CINVESTAV-IPN, Unidad Irapuato, Departamento de Biotecnología y Bioquímica, Irapuato, Guanajuato, Mexico.
| |
Collapse
|
3
|
Talluri S. Algorithms for protein design. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:1-38. [PMID: 35534105 DOI: 10.1016/bs.apcsb.2022.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Computational Protein Design has the potential to contribute to major advances in enzyme technology, vaccine design, receptor-ligand engineering, biomaterials, nanosensors, and synthetic biology. Although Protein Design is a challenging problem, proteins can be designed by experts in Protein Design, as well as by non-experts whose primary interests are in the applications of Protein Design. The increased accessibility of Protein Design technology is attributable to the accumulated knowledge and experience with Protein Design as well as to the availability of software and online resources. The objective of this review is to serve as a guide to the relevant literature with a focus on the novel methods and algorithms that have been developed or applied for Protein Design, and to assist in the selection of algorithms for Protein Design. Novel algorithms and models that have been introduced to utilize the enormous amount of experimental data and novel computational hardware have the potential for producing substantial increases in the accuracy, reliability and range of applications of designed proteins.
Collapse
Affiliation(s)
- Sekhar Talluri
- Department of Biotechnology, GITAM, Visakhapatnam, India.
| |
Collapse
|
4
|
Pereira JM, Vieira M, Santos SM. Step-by-step design of proteins for small molecule interaction: A review on recent milestones. Protein Sci 2021; 30:1502-1520. [PMID: 33934427 PMCID: PMC8284594 DOI: 10.1002/pro.4098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 04/21/2021] [Accepted: 04/23/2021] [Indexed: 01/01/2023]
Abstract
Protein design is the field of synthetic biology that aims at developing de novo custom-made proteins and peptides for specific applications. Despite exploring an ambitious goal, recent computational advances in both hardware and software technologies have paved the way to high-throughput screening and detailed design of novel folds and improved functionalities. Modern advances in the field of protein design for small molecule targeting are described in this review, organized in a step-by-step fashion: from the conception of a new or upgraded active binding site, to scaffold design, sequence optimization, and experimental expression of the custom protein. In each step, contemporary examples are described, and state-of-the-art software is briefly explored.
Collapse
Affiliation(s)
- José M. Pereira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Maria Vieira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Sérgio M. Santos
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| |
Collapse
|
5
|
Zhou J, Panaitiu AE, Grigoryan G. A general-purpose protein design framework based on mining sequence-structure relationships in known protein structures. Proc Natl Acad Sci U S A 2020; 117:1059-1068. [PMID: 31892539 PMCID: PMC6969538 DOI: 10.1073/pnas.1908723117] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Current state-of-the-art approaches to computational protein design (CPD) aim to capture the determinants of structure from physical principles. While this has led to many successful designs, it does have strong limitations associated with inaccuracies in physical modeling, such that a reliable general solution to CPD has yet to be found. Here, we propose a design framework-one based on identifying and applying patterns of sequence-structure compatibility found in known proteins, rather than approximating them from models of interatomic interactions. We carry out extensive computational analyses and an experimental validation for our method. Our results strongly argue that the Protein Data Bank is now sufficiently large to enable proteins to be designed by using only examples of structural motifs from unrelated proteins. Because our method is likely to have orthogonal strengths relative to existing techniques, it could represent an important step toward removing remaining barriers to robust CPD.
Collapse
Affiliation(s)
- Jianfu Zhou
- Department of Computer Science, Dartmouth College, Hanover, NH 03755
| | | | - Gevorg Grigoryan
- Department of Computer Science, Dartmouth College, Hanover, NH 03755;
- Department of Biological Sciences, Dartmouth College, Hanover, NH 03755
| |
Collapse
|
6
|
Banerjee S, Mitra D. Structural Basis of Design and Engineering for Advanced Plant Optogenetics. TRENDS IN PLANT SCIENCE 2020; 25:35-65. [PMID: 31699521 DOI: 10.1016/j.tplants.2019.10.002] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Revised: 09/12/2019] [Accepted: 10/03/2019] [Indexed: 06/10/2023]
Abstract
In optogenetics, light-sensitive proteins are specifically expressed in target cells and light is used to precisely control the activity of these proteins at high spatiotemporal resolution. Optogenetics initially used naturally occurring photoreceptors to control neural circuits, but has expanded to include carefully designed and engineered photoreceptors. Several optogenetic constructs are based on plant photoreceptors, but their application to plant systems has been limited. Here, we present perspectives on the development of plant optogenetics, considering different levels of design complexity. We discuss how general principles of light-driven signal transduction can be coupled with approaches for engineering protein folding to develop novel optogenetic tools. Finally, we explore how the use of computation, networks, circular permutation, and directed evolution could enrich optogenetics.
Collapse
Affiliation(s)
- Sudakshina Banerjee
- Department of Life Sciences, Presidency University, 86/1 College Street, Kolkata 700073, India
| | - Devrani Mitra
- Department of Life Sciences, Presidency University, 86/1 College Street, Kolkata 700073, India.
| |
Collapse
|
7
|
Day AL, Greisen P, Doyle L, Schena A, Stella N, Johnsson K, Baker D, Stoddard B. Unintended specificity of an engineered ligand-binding protein facilitated by unpredicted plasticity of the protein fold. Protein Eng Des Sel 2019; 31:375-387. [PMID: 30566669 DOI: 10.1093/protein/gzy031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 10/02/2018] [Accepted: 11/07/2018] [Indexed: 12/25/2022] Open
Abstract
Attempts to create novel ligand-binding proteins often focus on formation of a binding pocket with shape complementarity against the desired ligand (particularly for compounds that lack distinct polar moieties). Although designed proteins often exhibit binding of the desired ligand, in some cases they display unintended recognition behavior. One such designed protein, that was originally intended to bind tetrahydrocannabinol (THC), was found instead to display binding of 25-hydroxy-cholecalciferol (25-D3) and was subjected to biochemical characterization, further selections for enhanced 25-D3 binding affinity and crystallographic analyses. The deviation in specificity is due in part to unexpected altertion of its conformation, corresponding to a significant change of the orientation of an α-helix and an equally large movement of a loop, both of which flank the designed ligand-binding pocket. Those changes led to engineered protein constructs that exhibit significantly more contacts and complementarity towards the 25-D3 ligand than the initial designed protein had been predicted to form towards its intended THC ligand. Molecular dynamics simulations imply that the initial computationally designed mutations may contribute to the movement of the helix. These analyses collectively indicate that accurate prediction and control of backbone dynamics conformation, through a combination of improved conformational sampling and/or de novo structure design, represents a key area of further development for the design and optimization of engineered ligand-binding proteins.
Collapse
Affiliation(s)
- Austin L Day
- Departments of Bioengineering and Biochemistry, University of Washington, Molecular Engineering and Sciences, Seattle, WA, USA
| | - Per Greisen
- Departments of Bioengineering and Biochemistry, University of Washington, Molecular Engineering and Sciences, Seattle, WA, USA
| | - Lindsey Doyle
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, WA, USA
| | - Alberto Schena
- Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Nephi Stella
- Departments of Bioengineering and Biochemistry, University of Washington, Molecular Engineering and Sciences, Seattle, WA, USA
| | - Kai Johnsson
- Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - David Baker
- Departments of Bioengineering and Biochemistry, University of Washington, Molecular Engineering and Sciences, Seattle, WA, USA
| | - Barry Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, WA, USA
| |
Collapse
|
8
|
Sana B, Chee SMQ, Wongsantichon J, Raghavan S, Robinson RC, Ghadessy FJ. Development and structural characterization of an engineered multi-copper oxidase reporter of protein-protein interactions. J Biol Chem 2019; 294:7002-7012. [PMID: 30770473 PMCID: PMC6497955 DOI: 10.1074/jbc.ra118.007141] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Revised: 02/10/2019] [Indexed: 12/13/2022] Open
Abstract
Protein–protein interactions (PPIs) are ubiquitous in almost all biological processes and are often corrupted in diseased states. A detailed understanding of PPIs is therefore key to understanding cellular physiology and can yield attractive therapeutic targets. Here, we describe the development and structural characterization of novel Escherichia coli CueO multi-copper oxidase variants engineered to recapitulate protein–protein interactions with commensurate modulation of their enzymatic activities. The fully integrated single-protein sensors were developed through modular grafting of ligand-specific peptides into a highly compliant and flexible methionine-rich loop of CueO. Sensitive detection of diverse ligand classes exemplified by antibodies, an E3 ligase, MDM2 proto-oncogene (MDM2), and protease (SplB from Staphylococcus aureus) was achieved in a simple mix and measure homogeneous format with visually observable colorimetric readouts. Therapeutic antagonism of MDM2 by small molecules and peptides in clinical development for treatment of cancer patients was assayed using the MDM2-binding CueO enzyme. Structural characterization of the free and MDM2-bound CueO variant provided functional insight into signal-transducing mechanisms of the engineered enzymes and highlighted the robustness of CueO as a stable and compliant scaffold for multiple applications.
Collapse
Affiliation(s)
- Barindra Sana
- From the p53 Laboratory, Agency for Science, Technology, and Research (A*STAR), 8A Biomedical Grove, Singapore 138648, Singapore
| | - Sharon M Q Chee
- From the p53 Laboratory, Agency for Science, Technology, and Research (A*STAR), 8A Biomedical Grove, Singapore 138648, Singapore
| | - Jantana Wongsantichon
- the Mahidol-Oxford Tropical Medicine Research Unit, Mahidol University, Bangkok 10400, Thailand, and.,the Institute of Molecular and Cellular Biology, A*STAR, 61 Biopolis Drive, Singapore 138673, Singapore
| | - Sarada Raghavan
- From the p53 Laboratory, Agency for Science, Technology, and Research (A*STAR), 8A Biomedical Grove, Singapore 138648, Singapore
| | - Robert C Robinson
- the Institute of Molecular and Cellular Biology, A*STAR, 61 Biopolis Drive, Singapore 138673, Singapore
| | - Farid J Ghadessy
- From the p53 Laboratory, Agency for Science, Technology, and Research (A*STAR), 8A Biomedical Grove, Singapore 138648, Singapore,
| |
Collapse
|
9
|
Fraczyk J, Kamiński ZJ. N-Lipidated Amino Acids and Peptides Immobilizedon Cellulose Able to Split Amide Bonds. MATERIALS 2019; 12:ma12040578. [PMID: 30769907 PMCID: PMC6416662 DOI: 10.3390/ma12040578] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/26/2019] [Revised: 02/10/2019] [Accepted: 02/11/2019] [Indexed: 12/02/2022]
Abstract
N-lipidated short peptides and amino acids immobilized on the cellulose were used as catalysts cleaved amide bonds under biomimetic conditions. In order to select catalytically most active derivatives a library of 156 N-lipidated amino acids, dipeptides and tripeptides immobilized on cellulose was obtained. The library was synthesized from serine, histidine and glutamic acid peptides N-acylated with heptanoic, octanoic, hexadecanoic and (E)-octadec-9-enoic acids. Catalytic efficiency was monitored by spectrophotometric determination of p-nitroaniline formed by the hydrolysis of a 0.1 M solution of Z-Leu-NP. The most active 8 structures contained tripeptide fragment with 1-3 serine residues. It has been found that incorporation of metal ions into catalytic pockets increase the activity of the synzymes. The structures of the 17 most active catalysts selected from the library of complexes obtained with Cu2+ ion varied from 16 derivatives complexed with Zn2+ ion. For all of them, a very high reaction rate during the preliminary phase of measurements was followed by a substantial slowdown after 1 h. The catalytic activity gradually diminished after subsequent re-use. HPLC analysis of amide bond splitting confirmed that substrate consumption proceeded in two stages. In the preliminary stage 24–40% of the substrate was rapidly hydrolysed followed by the substantially lower reaction rate. Nevertheless, using the most competent synzymes product of hydrolysis was formed with a yield of 60–83% after 48h under mild and strictly biomimetic conditions.
Collapse
Affiliation(s)
- Justyna Fraczyk
- Institute of Organic Chemistry, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland.
| | - Zbigniew J Kamiński
- Institute of Organic Chemistry, Lodz University of Technology, Zeromskiego 116, 90-924 Lodz, Poland.
| |
Collapse
|
10
|
Lechner H, Ferruz N, Höcker B. Strategies for designing non-natural enzymes and binders. Curr Opin Chem Biol 2018; 47:67-76. [PMID: 30248579 DOI: 10.1016/j.cbpa.2018.07.022] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2018] [Revised: 07/16/2018] [Accepted: 07/17/2018] [Indexed: 12/20/2022]
Abstract
The design of tailor-made enzymes is a major goal in biochemical research that can result in wide-range applications and will lead to a better understanding of how proteins fold and function. In this review we highlight recent advances in enzyme and small molecule binder design. A focus is placed on novel strategies for the design of scaffolds, developments in computational methods, and recent applications of these techniques on receptors, sensors, and enzymes. Further, the integration of computational and experimental methodologies is discussed. The outlined examples of designed enzymes and binders for various purposes highlight the importance of this topic and underline the need for tailor-made proteins.
Collapse
Affiliation(s)
- Horst Lechner
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
11
|
Wood CW, Heal JW, Thomson AR, Bartlett GJ, Ibarra AÁ, Brady RL, Sessions RB, Woolfson DN. ISAMBARD: an open-source computational environment for biomolecular analysis, modelling and design. Bioinformatics 2018; 33:3043-3050. [PMID: 28582565 PMCID: PMC5870769 DOI: 10.1093/bioinformatics/btx352] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 05/31/2017] [Indexed: 12/03/2022] Open
Abstract
Motivation The rational design of biomolecules is becoming a reality. However, further computational tools are needed to facilitate and accelerate this, and to make it accessible to more users. Results Here we introduce ISAMBARD, a tool for structural analysis, model building and rational design of biomolecules. ISAMBARD is open-source, modular, computationally scalable and intuitive to use. These features allow non-experts to explore biomolecular design in silico. ISAMBARD addresses a standing issue in protein design, namely, how to introduce backbone variability in a controlled manner. This is achieved through the generalization of tools for parametric modelling, describing the overall shape of proteins geometrically, and without input from experimentally determined structures. This will allow backbone conformations for entire folds and assemblies not observed in nature to be generated de novo, that is, to access the ‘dark matter of protein-fold space’. We anticipate that ISAMBARD will find broad applications in biomolecular design, biotechnology and synthetic biology. Availability and implementation A current stable build can be downloaded from the python package index (https://pypi.python.org/pypi/isambard/) with development builds available on GitHub (https://github.com/woolfson-group/) along with documentation, tutorial material and all the scripts used to generate the data described in this paper. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Christopher W Wood
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK
| | - Jack W Heal
- School of Chemistry, University of Bristol, Bristol BS8?1TS, UK
| | - Andrew R Thomson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Chemistry, University of Glasgow, Glasgow G12 8QQ, UK
| | - Gail J Bartlett
- School of Chemistry, University of Bristol, Bristol BS8?1TS, UK
| | - Amaurys Á Ibarra
- School of Biochemistry, University of Bristol, Bristol BS8?1TD, UK
| | - R Leo Brady
- School of Biochemistry, University of Bristol, Bristol BS8?1TD, UK
| | - Richard B Sessions
- School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK.,BrisSynBio, University of Bristol, Bristol BS8 1TQ, UK
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK.,BrisSynBio, University of Bristol, Bristol BS8 1TQ, UK
| |
Collapse
|
12
|
Chu H, Liu H. TetraBASE: A Side Chain-Independent Statistical Energy for Designing Realistically Packed Protein Backbones. J Chem Inf Model 2018; 58:430-442. [PMID: 29314837 DOI: 10.1021/acs.jcim.7b00677] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To construct backbone structures of high designability is a primary aspect of computational protein design. We report here a side chain-independent statistical energy that aims at realistic modeling of through-space packing of polypeptide backbones. To mitigate the lack of explicit amino acid side chains, the model treats the interbackbone site packing as being dependent on peptide local conformation. In addition, new variables suitable for statistical analysis, one for relative orientation and another for distance, have been introduced to represent the intersite geometry based on the asymmetrical tetrahedron organization of distinct chemical groups surrounding the Cα-carbon atoms. The resulting tetrahedron-based backbone statistical energy (tetraBASE) model has been used to optimize the tertiary organizations of secondary structure elements (SSEs) of designated types with Monte Caro simulated annealing, starting from artificial initial configurations. The tetraBASE minimum energy structures can reproduce SSE packing frequently observed in native proteins with atomic root-mean-square deviations of 1-2 Å. The model has also been tested by examining the stability of native SSE arrangements under tetraBASE. The results suggest that tetraBASE model can be used to effectively represent interbackbone packing when designing backbone structures without explicitly knowing side chain types.
Collapse
Affiliation(s)
- Huanyu Chu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China
| | - Haiyan Liu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China.,Collaborative Innovation Center of Chemistry for Life Sciences , 230027 Hefei, Anhui China
| |
Collapse
|
13
|
Ludwiczak J, Jarmula A, Dunin-Horkawicz S. Combining Rosetta with molecular dynamics (MD): A benchmark of the MD-based ensemble protein design. J Struct Biol 2018; 203:54-61. [PMID: 29454111 DOI: 10.1016/j.jsb.2018.02.004] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2017] [Revised: 01/25/2018] [Accepted: 02/13/2018] [Indexed: 01/15/2023]
Abstract
Computational protein design is a set of procedures for computing amino acid sequences that will fold into a specified structure. Rosetta Design, a commonly used software for protein design, allows for the effective identification of sequences compatible with a given backbone structure, while molecular dynamics (MD) simulations can thoroughly sample near-native conformations. We benchmarked a procedure in which Rosetta design is started on MD-derived structural ensembles and showed that such a combined approach generates 20-30% more diverse sequences than currently available methods with only a slight increase in computation time. Importantly, the increase in diversity is achieved without a loss in the quality of the designed sequences assessed by their resemblance to natural sequences. We demonstrate that the MD-based procedure is also applicable to de novo design tasks started from backbone structures without any sequence information. In addition, we implemented a protocol that can be used to assess the stability of designed models and to select the best candidates for experimental validation. In sum our results demonstrate that the MD ensemble-based flexible backbone design can be a viable method for protein design, especially for tasks that require a large pool of diverse sequences.
Collapse
Affiliation(s)
- Jan Ludwiczak
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland; Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, Pasteura 3, 02-093 Warsaw, Poland
| | - Adam Jarmula
- Laboratory of Bioinformatics, Nencki Institute of Experimental Biology, Pasteura 3, 02-093 Warsaw, Poland
| | - Stanislaw Dunin-Horkawicz
- Laboratory of Structural Bioinformatics, Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland.
| |
Collapse
|
14
|
Komar AA. Unraveling co-translational protein folding: Concepts and methods. Methods 2017; 137:71-81. [PMID: 29221924 DOI: 10.1016/j.ymeth.2017.11.007] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 11/13/2017] [Indexed: 12/26/2022] Open
Abstract
Advances in techniques such as nuclear magnetic resonance spectroscopy, cryo-electron microscopy, and single-molecule and time-resolved fluorescent approaches are transforming our ability to study co-translational protein folding both in vivo in living cells and in vitro in reconstituted cell-free translation systems. These approaches provide comprehensive information on the spatial organization and dynamics of nascent polypeptide chains and the kinetics of co-translational protein folding. This information has led to an improved understanding of the process of protein folding in living cells and should allow remaining key questions in the field, such as what structures are formed within nascent chains during protein synthesis and when, to be answered. Ultimately, studies using these techniques will facilitate development of a unified concept of protein folding, a process that is essential for proper cell function and organism viability. This review describes current methods for analysis of co-translational protein folding with an emphasis on some of the recently developed techniques that allow monitoring of co-translational protein folding in real-time.
Collapse
Affiliation(s)
- Anton A Komar
- Center for Gene Regulation in Health and Disease and Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH 44115, USA; Department of Biochemistry and the Center for RNA Science and Therapeutics, Case Western Reserve University, Cleveland, OH 44106, USA; Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA.
| |
Collapse
|
15
|
Advances in design of protein folds and assemblies. Curr Opin Chem Biol 2017; 40:65-71. [DOI: 10.1016/j.cbpa.2017.06.020] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Revised: 05/29/2017] [Accepted: 06/27/2017] [Indexed: 11/20/2022]
|