1
|
Sánchez IE, Galpern EA, Garibaldi MM, Ferreiro DU. Molecular Information Theory Meets Protein Folding. J Phys Chem B 2022; 126:8655-8668. [PMID: 36282961 DOI: 10.1021/acs.jpcb.2c04532] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
We propose an application of molecular information theory to analyze the folding of single domain proteins. We analyze results from various areas of protein science, such as sequence-based potentials, reduced amino acid alphabets, backbone configurational entropy, secondary structure content, residue burial layers, and mutational studies of protein stability changes. We found that the average information contained in the sequences of evolved proteins is very close to the average information needed to specify a fold ∼2.2 ± 0.3 bits/(site·operation). The effective alphabet size in evolved proteins equals the effective number of conformations of a residue in the compact unfolded state at around 5. We calculated an energy-to-information conversion efficiency upon folding of around 50%, lower than the theoretical limit of 70%, but much higher than human-built macroscopic machines. We propose a simple mapping between molecular information theory and energy landscape theory and explore the connections between sequence evolution, configurational entropy, and the energetics of protein folding.
Collapse
Affiliation(s)
- Ignacio E Sánchez
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Ezequiel A Galpern
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Martín M Garibaldi
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| | - Diego U Ferreiro
- Facultad de Ciencias Exactas y Naturales, Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos AiresCP1428, Argentina
| |
Collapse
|
2
|
Rodríguez-Soacha DA, Steinmüller SAM, Işbilir A, Fender J, Deventer MH, Ramírez YA, Tutov A, Sotriffer C, Stove CP, Lorenz K, Lohse MJ, Hislop JN, Decker M. Development of an Indole-Amide-Based Photoswitchable Cannabinoid Receptor Subtype 1 (CB 1R) "Cis-On" Agonist. ACS Chem Neurosci 2022; 13:2410-2435. [PMID: 35881914 DOI: 10.1021/acschemneuro.2c00160] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Activation of the human cannabinoid receptor type 1 (hCB1R) with high spatiotemporal control is useful to study processes involved in different pathologies related to nociception, metabolic alterations, and neurological disorders. To synthesize new agonist ligands for hCB1R, we have designed different classes of photoswitchable molecules based on an indole core. The modifications made to the central core have allowed us to understand the molecular characteristics necessary to design an agonist with optimal pharmacological properties. Compound 27a shows high affinity for CB1R (Ki (cis-form) = 0.18 μM), with a marked difference in affinity with respect to its inactive "trans-off" form (CB1R Ki trans/cis ratio = 5.4). The novel compounds were evaluated by radioligand binding studies, receptor internalization, sensor receptor activation (GRABeCB2.0), Western blots for analysis of ERK1/2 activation, NanoBiT βarr2 recruitment, and calcium mobilization assays, respectively. The data show that the novel agonist 27a is a candidate for studying the optical modulation of cannabinoid receptors (CBRs), serving as a new molecular tool for investigating the involvement of hCB1R in disorders associated with the endocannabinoid system.
Collapse
Affiliation(s)
- Diego A Rodríguez-Soacha
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Sophie A M Steinmüller
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Ali Işbilir
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Str. 9, D-97078 Würzburg, Germany.,Receptor Signaling Group, Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany
| | - Julia Fender
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Str. 9, D-97078 Würzburg, Germany
| | - Marie H Deventer
- Laboratory of Toxicology, Department of Bioanalysis, Faculty of Pharmaceutical Sciences, Ghent University, Ottergemsesteenweg 460, 9000 Ghent, Belgium
| | - Yesid A Ramírez
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany.,Departamento de Ciencias Farmacéuticas, Facultad de Ciencias Naturales, Universidad Icesi, Valle del Cauca, 760031 Cali, Colombia
| | - Anna Tutov
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Christoph Sotriffer
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| | - Christophe P Stove
- Laboratory of Toxicology, Department of Bioanalysis, Faculty of Pharmaceutical Sciences, Ghent University, Ottergemsesteenweg 460, 9000 Ghent, Belgium
| | - Kristina Lorenz
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Str. 9, D-97078 Würzburg, Germany.,Leibniz-Institut für Analytische Wissenschaften─ISAS e.V., Bunsen-Kirchhoff-Str. 11, 44139 Dortmund, Germany
| | - Martin J Lohse
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Str. 9, D-97078 Würzburg, Germany.,Receptor Signaling Group, Max Delbrück Center for Molecular Medicine, 13125 Berlin, Germany.,ISAR Bioscience Institut, 82152 Planegg/Munich, Germany
| | - James N Hislop
- School of Medicine, Medical Sciences and Nutrition, Institute of Medical Sciences, University of Aberdeen, Foresterhill, Aberdeen AB25 2ZD, United Kingdom
| | - Michael Decker
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, 97074 Würzburg, Germany
| |
Collapse
|
3
|
Zhu J, Avakyan N, Kakkis AA, Hoffnagle AM, Han K, Li Y, Zhang Z, Choi TS, Na Y, Yu CJ, Tezcan FA. Protein Assembly by Design. Chem Rev 2021; 121:13701-13796. [PMID: 34405992 PMCID: PMC9148388 DOI: 10.1021/acs.chemrev.1c00308] [Citation(s) in RCA: 107] [Impact Index Per Article: 35.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Proteins are nature's primary building blocks for the construction of sophisticated molecular machines and dynamic materials, ranging from protein complexes such as photosystem II and nitrogenase that drive biogeochemical cycles to cytoskeletal assemblies and muscle fibers for motion. Such natural systems have inspired extensive efforts in the rational design of artificial protein assemblies in the last two decades. As molecular building blocks, proteins are highly complex, in terms of both their three-dimensional structures and chemical compositions. To enable control over the self-assembly of such complex molecules, scientists have devised many creative strategies by combining tools and principles of experimental and computational biophysics, supramolecular chemistry, inorganic chemistry, materials science, and polymer chemistry, among others. Owing to these innovative strategies, what started as a purely structure-building exercise two decades ago has, in short order, led to artificial protein assemblies with unprecedented structures and functions and protein-based materials with unusual properties. Our goal in this review is to give an overview of this exciting and highly interdisciplinary area of research, first outlining the design strategies and tools that have been devised for controlling protein self-assembly, then describing the diverse structures of artificial protein assemblies, and finally highlighting the emergent properties and functions of these assemblies.
Collapse
Affiliation(s)
| | | | - Albert A. Kakkis
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Alexander M. Hoffnagle
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Kenneth Han
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Yiying Li
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Zhiyin Zhang
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Tae Su Choi
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Youjeong Na
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - Chung-Jui Yu
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| | - F. Akif Tezcan
- Department of Chemistry and Biochemistry, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0340, United States
| |
Collapse
|
4
|
Rodríguez-Soacha DA, Fender J, Ramírez YA, Collado JA, Muñoz E, Maitra R, Sotriffer C, Lorenz K, Decker M. "Photo-Rimonabant": Synthesis and Biological Evaluation of Novel Photoswitchable Molecules Derived from Rimonabant Lead to a Highly Selective and Nanomolar " Cis-On" CB 1R Antagonist. ACS Chem Neurosci 2021; 12:1632-1647. [PMID: 33856764 DOI: 10.1021/acschemneuro.1c00086] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Human cannabinoid receptor type 1 (hCB1R) plays important roles in the regulation of appetite and development of addictive behaviors. Herein, we describe the design, synthesis, photocharacterization, molecular docking, and in vitro characterization of "photo-rimonabant", i.e., azo-derivatives of the selective hCB1R antagonist SR1411716A (rimonabant). By applying azo-extension strategies, we yielded compound 16a, which shows marked affinity for CB1R (Ki (cis form) = 29 nM), whose potency increases by illumination with ultraviolet light (CB1R Kitrans/cis ratio = 15.3). Through radioligand binding, calcium mobilization, and cell luminescence assays, we established that 16a is highly selective for hCB1R over hCB2R. These selective antagonists can be valuable molecular tools for optical modulation of CBRs and better understanding of disorders associated with the endocannabinoid system.
Collapse
Affiliation(s)
- Diego A. Rodríguez-Soacha
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, D-97074 Würzburg, Germany
| | - Julia Fender
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Straße 9, D-97078 Würzburg, Germany
| | - Yesid A. Ramírez
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, D-97074 Würzburg, Germany
- Departmento de Ciencias Farmacéuticas, Facultad de Ciencias Naturales, Universidad Icesi, 760031 Cali, Valle del Cauca, Colombia
| | - Juan Antonio Collado
- Instituto Maimónides de Investigación Biomédica de Córdoba, Departamento de Biología Celular, Fisiología e Inmunología, Universidad de Córdoba, Hospital Universitario Reina Sofía, Avda Menendez Pidal s/n, 14004 Córdoba, Spain
| | - Eduardo Muñoz
- Instituto Maimónides de Investigación Biomédica de Córdoba, Departamento de Biología Celular, Fisiología e Inmunología, Universidad de Córdoba, Hospital Universitario Reina Sofía, Avda Menendez Pidal s/n, 14004 Córdoba, Spain
| | - Rangan Maitra
- Discovery Science and Technology, RTI International, 3040 Cornwallis Road, Research Triangle Park, North Carolina 27709-2194, United States
| | - Christoph Sotriffer
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, D-97074 Würzburg, Germany
| | - Kristina Lorenz
- Institut für Pharmakologie und Toxikologie, Julius-Maximilians-Universität Würzburg, Versbacher Straße 9, D-97078 Würzburg, Germany
- Leibniz-Institut für Analytische Wissenschaften—ISAS e.V., Bunsen-Kirchhoff-Straße 11, 44139 Dortmund, Germany
| | - Michael Decker
- Pharmazeutische und Medizinische Chemie, Institut für Pharmazie und Lebensmittelchemie, Julius-Maximilians-Universität Würzburg, Am Hubland, D-97074 Würzburg, Germany
| |
Collapse
|
5
|
Michael E, Polydorides S, Simonson T, Archontis G. Hybrid MC/MD for protein design. J Chem Phys 2021; 153:054113. [PMID: 32770896 DOI: 10.1063/5.0013320] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Computational protein design relies on simulations of a protein structure, where selected amino acids can mutate randomly, and mutations are selected to enhance a target property, such as stability. Often, the protein backbone is held fixed and its degrees of freedom are modeled implicitly to reduce the complexity of the conformational space. We present a hybrid method where short molecular dynamics (MD) segments are used to explore conformations and alternate with Monte Carlo (MC) moves that apply mutations to side chains. The backbone is fully flexible during MD. As a test, we computed side chain acid/base constants or pKa's in five proteins. This problem can be considered a special case of protein design, with protonation/deprotonation playing the role of mutations. The solvent was modeled as a dielectric continuum. Due to cost, in each protein we allowed just one side chain position to change its protonation state and the other position to change its type or mutate. The pKa's were computed with a standard method that scans a range of pH values and with a new method that uses adaptive landscape flattening (ALF) to sample all protonation states in a single simulation. The hybrid method gave notably better accuracy than standard, fixed-backbone MC. ALF decreased the computational cost a factor of 13.
Collapse
Affiliation(s)
- Eleni Michael
- Department of Physics, University of Cyprus, P.O 20537, CY678 Nicosia, Cyprus
| | - Savvas Polydorides
- Department of Physics, University of Cyprus, P.O 20537, CY678 Nicosia, Cyprus
| | - Thomas Simonson
- Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, France
| | - Georgios Archontis
- Department of Physics, University of Cyprus, P.O 20537, CY678 Nicosia, Cyprus
| |
Collapse
|
6
|
Mignon D, Druart K, Michael E, Opuu V, Polydorides S, Villa F, Gaillard T, Panel N, Archontis G, Simonson T. Physics-Based Computational Protein Design: An Update. J Phys Chem A 2020; 124:10637-10648. [DOI: 10.1021/acs.jpca.0c07605] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- David Mignon
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Karen Druart
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Eleni Michael
- Department of Physics, University of Cyprus, PO20537, CY1678 Nicosia, Cyprus
| | - Vaitea Opuu
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Savvas Polydorides
- Department of Physics, University of Cyprus, PO20537, CY1678 Nicosia, Cyprus
| | - Francesco Villa
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Thomas Gaillard
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Nicolas Panel
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| | - Georgios Archontis
- Department of Physics, University of Cyprus, PO20537, CY1678 Nicosia, Cyprus
| | - Thomas Simonson
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654), Ecole Polytechnique, 91128 Palaiseau, France
| |
Collapse
|
7
|
Kuroda D, Tsumoto K. Engineering Stability, Viscosity, and Immunogenicity of Antibodies by Computational Design. J Pharm Sci 2020; 109:1631-1651. [DOI: 10.1016/j.xphs.2020.01.011] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Revised: 12/25/2019] [Accepted: 01/10/2020] [Indexed: 12/18/2022]
|
8
|
Xu G, Ma T, Du J, Wang Q, Ma J. OPUS-Rota2: An Improved Fast and Accurate Side-Chain Modeling Method. J Chem Theory Comput 2019; 15:5154-5160. [PMID: 31412199 DOI: 10.1021/acs.jctc.9b00309] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Side-chain modeling plays a critical role in protein structure prediction. However, in many current methods, balancing the speed and accuracy is still challenging. In this paper, on the basis of our previous work OPUS-Rota (Protein Sci. 2008, 17, 1576-1585), we introduce a new side-chain modeling method, OPUS-Rota2, which is tested on both a 65-protein test set (DB65) in the OPUS-Rota paper and a 379-protein test set (DB379) in the SCWRL4 paper. If the main chain is native, OPUS-Rota2 is more accurate than OPUS-Rota, SCWRL4, and OSCAR-star but slightly less accurate than OSCAR-o. Also, if the main chain is non-native, OPUS-Rota2 is more accurate than any other method. Moreover, OPUS-Rota2 is significantly faster than any other method, in particular, 2 orders of magnitude faster than OSCAR-o. Thus, the combination of higher accuracy and speed of OPUS-Rota2 in modeling side chains on both the native and non-native main chains makes OPUS-Rota2 a very useful tool in protein structure modeling.
Collapse
Affiliation(s)
- Gang Xu
- Multiscale Research Institute of Complex Systems , Fudan University , Shanghai 200433 , China.,School of Life Sciences , Tsinghua University , Beijing 100084 , China
| | | | - Junqing Du
- Verna and Marrs Mclean Department of Biochemistry and Molecular Biology , Baylor College of Medicine , One Baylor Plaza, BCM-125 , Houston , Texas 77030 , United States
| | - Qinghua Wang
- Verna and Marrs Mclean Department of Biochemistry and Molecular Biology , Baylor College of Medicine , One Baylor Plaza, BCM-125 , Houston , Texas 77030 , United States
| | - Jianpeng Ma
- Multiscale Research Institute of Complex Systems , Fudan University , Shanghai 200433 , China.,School of Life Sciences , Tsinghua University , Beijing 100084 , China.,Verna and Marrs Mclean Department of Biochemistry and Molecular Biology , Baylor College of Medicine , One Baylor Plaza, BCM-125 , Houston , Texas 77030 , United States.,School of Life Sciences , Fudan University , Shanghai 200433 , China
| |
Collapse
|
9
|
Tabasinezhad M, Talebkhan Y, Wenzel W, Rahimi H, Omidinia E, Mahboudi F. Trends in therapeutic antibody affinity maturation: From in-vitro towards next-generation sequencing approaches. Immunol Lett 2019; 212:106-113. [PMID: 31247224 DOI: 10.1016/j.imlet.2019.06.009] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 06/08/2019] [Accepted: 06/24/2019] [Indexed: 12/12/2022]
Abstract
Current advances in antibody engineering driving the strongest growth area in biotherapeutic agents development. Affinity improvement that is mainly important for biological activity and clinical efficacy of therapeutic antibodies, has still remained a challenging task. In the human body, during a course of immune response affinity maturation increase antibody activity by several rounds of somatic hypermutation and clonal selection in the germinal center. The final outputs are antibodies representing higher affinity and specificity against a particular antigen. In the realm of biotechnology, exploring of mutations which improve antibody affinity while preserving its specificity and stability is an extremely time-consuming and laborious process. Recent advances in computational algorithms and DNA sequencing technologies help researchers to redesign antibody structure to achieve desired properties such as improved binding affinity. In this review, we briefly described the principle of affinity maturation and different corresponding in vitro techniques. Also, we recapitulated the most recent advancements in the field of antibody affinity maturation including computational approaches and next-generation sequencing (NGS).
Collapse
Affiliation(s)
- Maryam Tabasinezhad
- Biotechnology Research Centre, Pasteur Institute of Iran, Tehran, Iran; Institute of Nanotechnology, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Yeganeh Talebkhan
- Biotechnology Research Centre, Pasteur Institute of Iran, Tehran, Iran
| | - Wolfgang Wenzel
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Hamzeh Rahimi
- Molecular Medicine Department, Pasteur Institute of Iran, Tehran, Iran
| | - Eskandar Omidinia
- Genetics & Metabolism Research Centre, Pasteur Institute of Iran, Tehran, Iran.
| | | |
Collapse
|
10
|
Ramirez YA, Adler TB, Altmann E, Klemm T, Tiesmeyer C, Sauer F, Kathman SG, Statsyuk AV, Sotriffer C, Kisker C. Structural Basis of Substrate Recognition and Covalent Inhibition of Cdu1 from Chlamydia trachomatis. ChemMedChem 2018; 13:2014-2023. [PMID: 30028574 PMCID: PMC6177307 DOI: 10.1002/cmdc.201800364] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 07/17/2018] [Indexed: 01/12/2023]
Abstract
Based on the similarity between the active sites of the deubiquitylating and deneddylating enzyme ChlaDub1 (Cdu1) and the evolutionarily related protease adenain, a target-hopping screening approach on a focused set of adenain inhibitors was investigated. The cyanopyrimidine-based inhibitors identified represent the first active-site-directed small-molecule inhibitors of Cdu1. High-resolution crystal structures of Cdu1 in complex with two covalently bound cyanopyrimidines, as well as with its substrate ubiquitin, were obtained. These structural data were complemented by enzymatic assays and covalent docking studies to provide insight into the substrate recognition of Cdu1, active-site pocket flexibility and potential hotspots for ligand interaction. Combined, these data provide a strong basis for future structure-guided medicinal chemistry optimization of this cyanopyrimidine scaffold into more potent and selective Cdu1 inhibitors.
Collapse
Affiliation(s)
- Yesid A. Ramirez
- Prof. Dr. C. Kisker, Y.A. Ramirez, T. Klemm, C. Tiesmeyer, Dr. F. Sauer, Rudolf Virchow Center for Experimental Biomedicine, Institute of Structural Biology, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,
- Prof. Dr. C. Sotriffer, Dr. T. B. Adler, Institute of Pharmacy and Food Chemistry, University of Würzburg, Am Hubland C7, 97074 Würzburg, Germany.,
| | - Thomas B. Adler
- Prof. Dr. C. Sotriffer, Dr. T. B. Adler, Institute of Pharmacy and Food Chemistry, University of Würzburg, Am Hubland C7, 97074 Würzburg, Germany.,
| | - Eva Altmann
- Dr. E. Altmann, Novartis Institute for Biomedical Research, Novartis Campus, 4002 Basel, Switzerland
| | - Theresa Klemm
- Prof. Dr. C. Kisker, Y.A. Ramirez, T. Klemm, C. Tiesmeyer, Dr. F. Sauer, Rudolf Virchow Center for Experimental Biomedicine, Institute of Structural Biology, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,
| | - Christian Tiesmeyer
- Prof. Dr. C. Kisker, Y.A. Ramirez, T. Klemm, C. Tiesmeyer, Dr. F. Sauer, Rudolf Virchow Center for Experimental Biomedicine, Institute of Structural Biology, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,
| | - Florian Sauer
- Prof. Dr. C. Kisker, Y.A. Ramirez, T. Klemm, C. Tiesmeyer, Dr. F. Sauer, Rudolf Virchow Center for Experimental Biomedicine, Institute of Structural Biology, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,
| | - Stefan G. Kathman
- S. G. Kathman, Northwestern University, Department of Chemistry., Silverman Hall, Sheridan Road 2145, Evanston, 60208 Illinois, USA
| | - Alexander V. Statsyuk
- Dr. A. V. Statsyuk, University of Houston College of Pharmacy, Calhoun Road 7037, 4849 Houston, USA
| | - Christoph Sotriffer
- Prof. Dr. C. Sotriffer, Dr. T. B. Adler, Institute of Pharmacy and Food Chemistry, University of Würzburg, Am Hubland C7, 97074 Würzburg, Germany.,
| | - Caroline Kisker
- Prof. Dr. C. Kisker, Y.A. Ramirez, T. Klemm, C. Tiesmeyer, Dr. F. Sauer, Rudolf Virchow Center for Experimental Biomedicine, Institute of Structural Biology, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany.,
| |
Collapse
|
11
|
Hogues H, Gaudreault F, Corbeil CR, Deprez C, Sulea T, Purisima EO. ProPOSE: Direct Exhaustive Protein-Protein Docking with Side Chain Flexibility. J Chem Theory Comput 2018; 14:4938-4947. [PMID: 30107730 DOI: 10.1021/acs.jctc.8b00225] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Despite decades of development, protein-protein docking remains a largely unsolved problem. The main difficulties are the immense space spanned by the translational and rotational degrees of freedom and the prediction of the conformational changes of proteins upon binding. FFT is generally the preferred method to exhaustively explore the translation-rotation space at a fine grid resolution, albeit with the trade-off of approximating force fields with correlation functions. This work presents a direct search alternative that samples the states in Cartesian space at the same resolution and computational cost as standard FFT methods. Operating in real space allows the use of standard force field functional forms used in typical non-FFT methods as well as the implementation of strategies for focused exploration of conformational flexibility. Currently, a few misplaced side chains can cause docking programs to fail. This work specifically addresses the problem of side chain rearrangements upon complex formation. Based on the observation that most side chains retain their unbound conformation upon binding, each rigidly docked pose is initially scored ignoring up to a limited number of side chain overlaps which are resolved in subsequent repacking and minimization steps. On test systems where side chains are altered and backbones held in their bound state, this implementation provides significantly better native pose recovery and higher quality (lower RMSD) predictions when compared with five of the most popular docking programs. The method is implemented in the software program ProPOSE (Protein Pose Optimization by Systematic Enumeration).
Collapse
Affiliation(s)
- Hervé Hogues
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| | - Francis Gaudreault
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| | - Christopher R Corbeil
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| | - Christophe Deprez
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| | - Traian Sulea
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| | - Enrico O Purisima
- Human Health Therapeutics , National Research Council Canada , 6100 Royalmount Avenue , Montreal , Quebec H4P 2R2 , Canada
| |
Collapse
|
12
|
Abstract
The immune systems protect our bodies from foreign molecules or antigens, where antibodies play important roles. Antibodies evolve over time upon antigen encounter by somatically mutating their genome sequences. The end result is a series of antibodies that display higher affinities and specificities to specific antigens. This process is called affinity maturation. Recent improvements in computer hardware and modeling algorithms now enable the rational design of protein structures and functions, and several works on computer-aided antibody design have been published. In this chapter, we briefly describe computational methods for antibody affinity maturation, focusing on methods for sampling antibody conformations and for scoring designed antibody variants. We also discuss lessons learned from the successful computer-aided design of antibodies.
Collapse
Affiliation(s)
- Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.
- Medical Proteomics Laboratory, The Institute of Medical Science, The University of Tokyo, Tokyo, Japan.
| |
Collapse
|
13
|
Gaillard T, Simonson T. Full Protein Sequence Redesign with an MMGBSA Energy Function. J Chem Theory Comput 2017; 13:4932-4943. [DOI: 10.1021/acs.jctc.7b00202] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Thomas Gaillard
- Laboratoire de Biochimie
(CNRS UMR7654), Department of Biology, Ecole Polytechnique, 91128 Palaiseau, France
| | - Thomas Simonson
- Laboratoire de Biochimie
(CNRS UMR7654), Department of Biology, Ecole Polytechnique, 91128 Palaiseau, France
| |
Collapse
|
14
|
Traoré S, Allouche D, André I, Schiex T, Barbe S. Deterministic Search Methods for Computational Protein Design. Methods Mol Biol 2017; 1529:107-123. [PMID: 27914047 DOI: 10.1007/978-1-4939-6637-0_4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
One main challenge in Computational Protein Design (CPD) lies in the exploration of the amino-acid sequence space, while considering, to some extent, side chain flexibility. The exorbitant size of the search space urges for the development of efficient exact deterministic search methods enabling identification of low-energy sequence-conformation models, corresponding either to the global minimum energy conformation (GMEC) or an ensemble of guaranteed near-optimal solutions. In contrast to stochastic local search methods that are not guaranteed to find the GMEC, exact deterministic approaches always identify the GMEC and prove its optimality in finite but exponential worst-case time. After a brief overview on these two classes of methods, we discuss the grounds and merits of four deterministic methods that have been applied to solve CPD problems. These approaches are based either on the Dead-End-Elimination theorem combined with A* algorithm (DEE/A*), on Cost Function Networks algorithms (CFN), on Integer Linear Programming solvers (ILP) or on Markov Random Fields solvers (MRF). The way two of these methods (DEE/A* and CFN) can be used in practice to identify low-energy sequence-conformation models starting from a pairwise decomposed energy matrix is detailed in this review.
Collapse
Affiliation(s)
- Seydou Traoré
- INSA, UPS, INP, Université de Toulouse, 135 Avenue de Rangueil, 31077, Toulouse, France
- Laboratoire d'Ingénierie Ingénierie des Systèmes Biologiques et des Procédés - INSA, INRA, UMR792, 31400, Toulouse, France
- CNRS, UMR5504, 31400, Toulouse, France
| | - David Allouche
- Unité de Mathématiques et Informatique de Toulouse, UR 875, INRA, 31320, Castanet Tolosan, France
| | - Isabelle André
- INSA, UPS, INP, Université de Toulouse, 135 Avenue de Rangueil, 31077, Toulouse, France
- Laboratoire d'Ingénierie Ingénierie des Systèmes Biologiques et des Procédés - INSA, INRA, UMR792, 31400, Toulouse, France
- CNRS, UMR5504, 31400, Toulouse, France
| | - Thomas Schiex
- Unité de Mathématiques et Informatique de Toulouse, UR 875, INRA, 31320, Castanet Tolosan, France
| | - Sophie Barbe
- INSA, UPS, INP, Université de Toulouse, 135 Avenue de Rangueil, 31077, Toulouse, France.
- Laboratoire d'Ingénierie Ingénierie des Systèmes Biologiques et des Procédés - INSA, INRA, UMR792, 31400, Toulouse, France.
- CNRS, UMR5504, 31400, Toulouse, France.
| |
Collapse
|
15
|
Kumar A, Ranbhor R, Patel K, Ramakrishnan V, Durani S. Automated protein design: Landmarks and operational principles. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2016; 125:24-35. [PMID: 27979438 DOI: 10.1016/j.pbiomolbio.2016.12.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Accepted: 12/06/2016] [Indexed: 11/25/2022]
Abstract
Protein design has an eventful history spanning over three decades, with handful of success stories reported, and numerous failures not reported. Design practices have benefited tremendously from improvements in computer hardware and advances in scientific algorithms. Though protein folding problem still remains unsolved, the possibility of having multiple sequence solutions for a single fold makes protein design a more tractable problem than protein folding. One of the most significant advancement in this area is the implementation of automated design algorithms on pre-defined templates or completely new folds, optimized through deterministic and heuristic search algorithms. This progress report provides a succinct presentation of important landmarks in automated design attempts, followed by brief account of operational principles in automated design methods.
Collapse
Affiliation(s)
- Anil Kumar
- Department of Chemistry, University of Toronto, ON, M5S3H6, Canada.
| | | | | | - Vibin Ramakrishnan
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, 781039, India.
| | - Susheel Durani
- Department of Chemistry, Indian Institute of Technology, Bombay, 400076, India
| |
Collapse
|
16
|
Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A. Coarse-Grained Protein Models and Their Applications. Chem Rev 2016; 116:7898-936. [DOI: 10.1021/acs.chemrev.6b00163] [Citation(s) in RCA: 555] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Sebastian Kmiecik
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Dominik Gront
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| | - Michal Kolinski
- Bioinformatics
Laboratory, Mossakowski Medical Research Center of the Polish Academy of Sciences, Pawinskiego 5, 02-106 Warsaw, Poland
| | - Lukasz Wieteska
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
- Department
of Medical Biochemistry, Medical University of Lodz, Mazowiecka 6/8, 92-215 Lodz, Poland
| | | | - Andrzej Kolinski
- Faculty
of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
17
|
Mignon D, Simonson T. Comparing three stochastic search algorithms for computational protein design: Monte Carlo, replica exchange Monte Carlo, and a multistart, steepest-descent heuristic. J Comput Chem 2016; 37:1781-93. [PMID: 27197555 DOI: 10.1002/jcc.24393] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Revised: 02/26/2016] [Accepted: 03/27/2016] [Indexed: 01/11/2023]
Abstract
Computational protein design depends on an energy function and an algorithm to search the sequence/conformation space. We compare three stochastic search algorithms: a heuristic, Monte Carlo (MC), and a Replica Exchange Monte Carlo method (REMC). The heuristic performs a steepest-descent minimization starting from thousands of random starting points. The methods are applied to nine test proteins from three structural families, with a fixed backbone structure, a molecular mechanics energy function, and with 1, 5, 10, 20, 30, or all amino acids allowed to mutate. Results are compared to an exact, "Cost Function Network" method that identifies the global minimum energy conformation (GMEC) in favorable cases. The designed sequences accurately reproduce experimental sequences in the hydrophobic core. The heuristic and REMC agree closely and reproduce the GMEC when it is known, with a few exceptions. Plain MC performs well for most cases, occasionally departing from the GMEC by 3-4 kcal/mol. With REMC, the diversity of the sequences sampled agrees with exact enumeration where the latter is possible: up to 2 kcal/mol above the GMEC. Beyond, room temperature replicas sample sequences up to 10 kcal/mol above the GMEC, providing thermal averages and a solution to the inverse protein folding problem. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- David Mignon
- Laboratoire De Biochimie (UMR CNRS 7654), Department Of Biology, Ecole Polytechnique, Palaiseau, France
| | - Thomas Simonson
- Laboratoire De Biochimie (UMR CNRS 7654), Department Of Biology, Ecole Polytechnique, Palaiseau, France
| |
Collapse
|
18
|
Gaillard T, Panel N, Simonson T. Protein side chain conformation predictions with an MMGBSA energy function. Proteins 2016; 84:803-19. [PMID: 26948696 DOI: 10.1002/prot.25030] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2015] [Revised: 02/22/2016] [Accepted: 02/27/2016] [Indexed: 12/17/2022]
Abstract
The prediction of protein side chain conformations from backbone coordinates is an important task in structural biology, with applications in structure prediction and protein design. It is a difficult problem due to its combinatorial nature. We study the performance of an "MMGBSA" energy function, implemented in our protein design program Proteus, which combines molecular mechanics terms, a Generalized Born and Surface Area (GBSA) solvent model, with approximations that make the model pairwise additive. Proteus is not a competitor to specialized side chain prediction programs due to its cost, but it allows protein design applications, where side chain prediction is an important step and MMGBSA an effective energy model. We predict the side chain conformations for 18 proteins. The side chains are first predicted individually, with the rest of the protein in its crystallographic conformation. Next, all side chains are predicted together. The contributions of individual energy terms are evaluated and various parameterizations are compared. We find that the GB and SA terms, with an appropriate choice of the dielectric constant and surface energy coefficients, are beneficial for single side chain predictions. For the prediction of all side chains, however, errors due to the pairwise additive approximation overcome the improvement brought by these terms. We also show the crucial contribution of side chain minimization to alleviate the rigid rotamer approximation. Even without GB and SA terms, we obtain accuracies comparable to SCWRL4, a specialized side chain prediction program. In particular, we obtain a better RMSD than SCWRL4 for core residues (at a higher cost), despite our simpler rotamer library. Proteins 2016; 84:803-819. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Thomas Gaillard
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, 91128, France
| | - Nicolas Panel
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, 91128, France
| | - Thomas Simonson
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, Palaiseau, 91128, France
| |
Collapse
|
19
|
Tak Kam VW, Goddard WA. Flat-Bottom Strategy for Improved Accuracy in Protein Side-Chain Placements. J Chem Theory Comput 2015; 4:2160-9. [PMID: 26620487 DOI: 10.1021/ct800196k] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We present a new strategy for protein side-chain placement that uses flat-bottom potentials for rotamer scoring. The extent of the flat bottom depends on the coarseness of the rotamer library and is optimized for libraries ranging from diversities of 0.2 Å to 5.0 Å. The parameters reported here were optimized for forcefields using Lennard-Jones 12-6 van der Waals potential with DREIDING parameters but are expected to be similar for AMBER, CHARMM, and other forcefields. This Side-Chain Rotamer Excitation Analysis Method is implemented in the SCREAM software package. Similar scoring function strategies should be useful for ligand docking, virtual ligand screening, and protein folding applications.
Collapse
Affiliation(s)
- Victor Wai Tak Kam
- Materials and Process Simulation Center (MC-139-74), California Institute of Technology, Pasadena, California 91125
| | - William A Goddard
- Materials and Process Simulation Center (MC-139-74), California Institute of Technology, Pasadena, California 91125
| |
Collapse
|
20
|
Allouche D, André I, Barbe S, Davies J, de Givry S, Katsirelos G, O'Sullivan B, Prestwich S, Schiex T, Traoré S. Computational protein design as an optimization problem. ARTIF INTELL 2014. [DOI: 10.1016/j.artint.2014.03.005] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
21
|
Gaillard T, Simonson T. Pairwise decomposition of an MMGBSA energy function for computational protein design. J Comput Chem 2014; 35:1371-87. [PMID: 24854675 DOI: 10.1002/jcc.23637] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2014] [Revised: 04/14/2014] [Accepted: 05/01/2014] [Indexed: 02/02/2023]
Abstract
Computational protein design (CPD) aims at predicting new proteins or modifying existing ones. The computational challenge is huge as it requires exploring an enormous sequence and conformation space. The difficulty can be reduced by considering a fixed backbone and a discrete set of sidechain conformations. Another common strategy consists in precalculating a pairwise energy matrix, from which the energy of any sequence/conformation can be quickly obtained. In this work, we examine the pairwise decomposition of protein MMGBSA energy functions from a general theoretical perspective, and an implementation proposed earlier for CPD. It includes a Generalized Born term, whose many-body character is overcome using an effective dielectric environment, and a Surface Area term, for which we present an improved pairwise decomposition. A detailed evaluation of the error introduced by the decomposition on the different energy components is performed. We show that the error remains reasonable, compared to other uncertainties.
Collapse
Affiliation(s)
- Thomas Gaillard
- Department of Biology, Laboratoire de Biochimie (CNRS UMR7654), Ecole Polytechnique, 91128, Palaiseau, France
| | | |
Collapse
|
22
|
Francis-Lyon P, Koehl P. Protein side-chain modeling with a protein-dependent optimized rotamer library. Proteins 2014; 82:2000-17. [PMID: 24623614 DOI: 10.1002/prot.24555] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2013] [Revised: 02/28/2014] [Accepted: 03/07/2014] [Indexed: 12/16/2022]
Abstract
Despite years of effort, the problem of predicting the conformations of protein side chains remains a subject of inquiry. This problem has three major issues, namely defining the conformations that a side chain may adopt within a protein, developing a sampling procedure for generating possible side-chain packings, and defining a scoring function that can rank these possible packings. To solve the former of these issues, most procedures rely on a rotamer library derived from databases of known protein structures. We introduce an alternative method that is free of statistics. We begin with a rotamer library that is based only on stereochemical considerations; this rotamer library is then optimized independently for each protein under study. We show that this optimization step restores the diversity of conformations observed in native proteins. We combine this protein-dependent rotamer library (PDRL) method with the self-consistent mean field (SCMF) sampling approach and a physics-based scoring function into a new side-chain prediction method, SCMF-PDRL. Using two large test sets of 831 and 378 proteins, respectively, we show that this new method compares favorably with competing methods such as SCAP, OPUS-Rota, and SCWRL4 for energy-minimized structures.
Collapse
Affiliation(s)
- Patricia Francis-Lyon
- Department of Computer Science, University of San Francisco, San Francisco, California, 94117
| | | |
Collapse
|
23
|
Webb B, Eswar N, Fan H, Khuri N, Pieper U, Dong G, Sali A. Comparative Modeling of Drug Target Proteins☆. REFERENCE MODULE IN CHEMISTRY, MOLECULAR SCIENCES AND CHEMICAL ENGINEERING 2014. [PMCID: PMC7157477 DOI: 10.1016/b978-0-12-409547-2.11133-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In this perspective, we begin by describing the comparative protein structure modeling technique and the accuracy of the corresponding models. We then discuss the significant role that comparative prediction plays in drug discovery. We focus on virtual ligand screening against comparative models and illustrate the state-of-the-art by a number of specific examples.
Collapse
|
24
|
Huang YM, Bystroff C. Expanded explorations into the optimization of an energy function for protein design. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:1176-1187. [PMID: 24384706 PMCID: PMC3919130 DOI: 10.1109/tcbb.2013.113] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Nature possesses a secret formula for the energy as a function of the structure of a protein. In protein design, approximations are made to both the structural representation of the molecule and to the form of the energy equation, such that the existence of a general energy function for proteins is by no means guaranteed. Here, we present new insights toward the application of machine learning to the problem of finding a general energy function for protein design. Machine learning requires the definition of an objective function, which carries with it the implied definition of success in protein design. We explored four functions, consisting of two functional forms, each with two criteria for success. Optimization was carried out by a Monte Carlo search through the space of all variable parameters. Cross-validation of the optimized energy function against a test set gave significantly different results depending on the choice of objective function, pointing to relative correctness of the built-in assumptions. Novel energy cross terms correct for the observed nonadditivity of energy terms and an imbalance in the distribution of predicted amino acids. This paper expands on the work presented at the 2012 ACM-BCB.
Collapse
|
25
|
Hanf KJM, Arndt JW, Chen LL, Jarpe M, Boriack-Sjodin PA, Li Y, van Vlijmen HWT, Pepinsky RB, Simon KJ, Lugovskoy A. Antibody humanization by redesign of complementarity-determining region residues proximate to the acceptor framework. Methods 2013; 65:68-76. [PMID: 23816785 DOI: 10.1016/j.ymeth.2013.06.024] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Revised: 05/29/2013] [Accepted: 06/18/2013] [Indexed: 12/01/2022] Open
Abstract
Antibodies are key components of the adaptive immune system and are well-established protein therapeutic agents. Typically high-affinity antibodies are obtained by immunization of rodent species that need to be humanized to reduce their immunogenicity. The complementarity-determining regions (CDRs) contain the residues in a defined loop structure that confer antigen binding, which must be retained in the humanized antibody. To design a humanized antibody, we graft the mature murine CDRs onto a germline human acceptor framework. Structural defects due to mismatches at the graft interface can be fixed by mutating some framework residues to murine, or by mutating some residues on the CDRs' backside to human or to a de novo designed sequence. The first approach, framework redesign, can yield an antibody with binding better than the CDR graft and one equivalent to the mature murine, and reduced immunogenicity. The second approach, CDR redesign, is presented here as a new approach, yielding an antibody with binding better than the CDR graft, and immunogenicity potentially less than that from framework redesign. Application of both approaches to the humanization of anti-α4 integrin antibody HP1/2 is presented and the concept of the hybrid humanization approach that retains "difficult to match" murine framework amino acids and uses de novo CDR design to minimize murine amino acid content and reduce cell-mediated cytotoxicity liabilities is discussed.
Collapse
Affiliation(s)
- Karl J M Hanf
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States.
| | - Joseph W Arndt
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | - Ling Ling Chen
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | - Matthew Jarpe
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | | | - You Li
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | | | - R Blake Pepinsky
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | - Kenneth J Simon
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States
| | - Alexey Lugovskoy
- Biogen Idec, 12 Cambridge Center, Cambridge, MA 02142, United States.
| |
Collapse
|
26
|
|
27
|
Linder M. Computational Enzyme Design: Advances, hurdles and possible ways forward. Comput Struct Biotechnol J 2012; 2:e201209009. [PMID: 24688650 PMCID: PMC3962231 DOI: 10.5936/csbj.201209009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Revised: 09/30/2012] [Accepted: 10/12/2012] [Indexed: 12/13/2022] Open
Abstract
This mini review addresses recent developments in computational enzyme design. Successful protocols as well as known issues and limitations are discussed from an energetic perspective. It will be argued that improved results can be obtained by including a dynamic treatment in the design protocol. Finally, a molecular dynamics-based approach for evaluating and refining computational designs is presented.
Collapse
Affiliation(s)
- Mats Linder
- Applied Physical Chemistry, KTH Royal Institute of Technology, Teknikringen 30, SE-100 44, Stockholm, Sweden
| |
Collapse
|
28
|
Kuroda D, Shirai H, Jacobson MP, Nakamura H. Computer-aided antibody design. Protein Eng Des Sel 2012; 25:507-21. [PMID: 22661385 PMCID: PMC3449398 DOI: 10.1093/protein/gzs024] [Citation(s) in RCA: 169] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2012] [Revised: 04/14/2012] [Accepted: 04/19/2012] [Indexed: 11/12/2022] Open
Abstract
Recent clinical trials using antibodies with low toxicity and high efficiency have raised expectations for the development of next-generation protein therapeutics. However, the process of obtaining therapeutic antibodies remains time consuming and empirical. This review summarizes recent progresses in the field of computer-aided antibody development mainly focusing on antibody modeling, which is divided essentially into two parts: (i) modeling the antigen-binding site, also called the complementarity determining regions (CDRs), and (ii) predicting the relative orientations of the variable heavy (V(H)) and light (V(L)) chains. Among the six CDR loops, the greatest challenge is predicting the conformation of CDR-H3, which is the most important in antigen recognition. Further computational methods could be used in drug development based on crystal structures or homology models, including antibody-antigen dockings and energy calculations with approximate potential functions. These methods should guide experimental studies to improve the affinities and physicochemical properties of antibodies. Finally, several successful examples of in silico structure-based antibody designs are reviewed. We also briefly review structure-based antigen or immunogen design, with application to rational vaccine development.
Collapse
Affiliation(s)
- Daisuke Kuroda
- Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, Japan.
| | | | | | | |
Collapse
|
29
|
Designing electrostatic interactions in biological systems via charge optimization or combinatorial approaches: insights and challenges with a continuum electrostatic framework. Theor Chem Acc 2012. [DOI: 10.1007/s00214-012-1252-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
30
|
Li SC, Bu D, Li M. Residues with similar hexagon neighborhoods share similar side-chain conformations. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:240-248. [PMID: 21519113 DOI: 10.1109/tcbb.2011.74] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
We present in this study a new approach to code protein side-chain conformations into hexagon substructures. Classical side-chain packing methods consist of two steps: first, side-chain conformations, known as rotamers, are extracted from known protein structures as candidates for each residue; second, a searching method along with an energy function is used to resolve conflicts among residues and to optimize the combinations of side chain conformations for all residues. These methods benefit from the fact that the number of possible side-chain conformations is limited, and the rotamer candidates are readily extracted; however, these methods also suffer from the inaccuracy of energy functions. Inspired by threading and Ab Initio approaches to protein structure prediction, we propose to use hexagon substructures to implicitly capture subtle issues of energy functions. Our initial results indicate that even without guidance from an energy function, hexagon structures alone can capture side-chain conformations at an accuracy of 83.8 percent, higher than 82.6 percent by the state-of-art side-chain packing methods.
Collapse
|
31
|
Allouche D, Traoré S, André I, de Givry S, Katsirelos G, Barbe S, Schiex T. Computational Protein Design as a Cost Function Network Optimization Problem. LECTURE NOTES IN COMPUTER SCIENCE 2012. [DOI: 10.1007/978-3-642-33558-7_60] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
32
|
Patsalo V, Raleigh DP, Green DF. Rational and computational design of stabilized variants of cyanovirin-N that retain affinity and specificity for glycan ligands. Biochemistry 2011; 50:10698-712. [PMID: 22032696 PMCID: PMC3234137 DOI: 10.1021/bi201411c] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Cyanovirin-N (CVN) is an 11 kDa pseudosymmetric cyanobacterial lectin that has been shown to inhibit infection by the human immunodeficiency virus by binding to high-mannose oligosaccharides on the surface of the viral envelope glycoprotein gp120. In this work, we describe rationally designed CVN variants that stabilize the protein fold while maintaining high affinity and selectivity for their glycan targets. Poisson-Boltzmann calculations and protein repacking algorithms were used to select stabilizing mutations in the protein core. By substituting the buried polar side chains of Ser11, Ser20, and Thr61 with aliphatic groups, we stabilized CVN by nearly 12 °C against thermal denaturation, and by 1 M GuaHCl against chemical denaturation, relative to a previously characterized stabilized mutant. Glycan microarray binding experiments confirmed that the specificity profile of carbohydrate binding is unperturbed by the mutations and is identical for all variants. In particular, the variants selectively bound glycans containing the Manα(1→2)Man linkage, which is the known minimal binding unit of CVN. We also report the slow denaturation kinetics of CVN and show that they can complicate thermodynamic analysis; in particular, the unfolding of CVN cannot be described as a fixed two-state transition. Accurate thermodynamic parameters are needed to describe the complicated free energy landscape of CVN, and we provide updated values for CVN unfolding.
Collapse
Affiliation(s)
- Vadim Patsalo
- Department of Applied Mathematics and Statistics Stony Brook University Stony Brook, New York 11794 USA
- Laufer Center for Physical and Quantitative Biology Stony Brook University Stony Brook, New York 11794 USA
| | - Daniel P. Raleigh
- Department of Chemistry Stony Brook University Stony Brook, New York 11794 USA
- Graduate Program in Biochemistry and Structural Biology Stony Brook University Stony Brook, New York 11794 USA
| | - David F. Green
- Department of Applied Mathematics and Statistics Stony Brook University Stony Brook, New York 11794 USA
- Laufer Center for Physical and Quantitative Biology Stony Brook University Stony Brook, New York 11794 USA
- Department of Chemistry Stony Brook University Stony Brook, New York 11794 USA
- Graduate Program in Biochemistry and Structural Biology Stony Brook University Stony Brook, New York 11794 USA
| |
Collapse
|
33
|
Koehl P, Orland H, Delarue M. Adapting Poisson-Boltzmann to the self-consistent mean field theory: application to protein side-chain modeling. J Chem Phys 2011; 135:055104. [PMID: 21823735 DOI: 10.1063/1.3621831] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We present an extension of the self-consistent mean field theory for protein side-chain modeling in which solvation effects are included based on the Poisson-Boltzmann (PB) theory. In this approach, the protein is represented with multiple copies of its side chains. Each copy is assigned a weight that is refined iteratively based on the mean field energy generated by the rest of the protein, until self-consistency is reached. At each cycle, the variational free energy of the multi-copy system is computed; this free energy includes the internal energy of the protein that accounts for vdW and electrostatics interactions and a solvation free energy term that is computed using the PB equation. The method converges in only a few cycles and takes only minutes of central processing unit time on a commodity personal computer. The predicted conformation of each residue is then set to be its copy with the highest weight after convergence. We have tested this method on a database of hundred highly refined NMR structures to circumvent the problems of crystal packing inherent to x-ray structures. The use of the PB-derived solvation free energy significantly improves prediction accuracy for surface side chains. For example, the prediction accuracies for χ(1) for surface cysteine, serine, and threonine residues improve from 68%, 35%, and 43% to 80%, 53%, and 57%, respectively. A comparison with other side-chain prediction algorithms demonstrates that our approach is consistently better in predicting the conformations of exposed side chains.
Collapse
Affiliation(s)
- Patrice Koehl
- Department of Biological Sciences, National University of Singapore, Singapore.
| | | | | |
Collapse
|
34
|
GORIELY ALAIN, HAUSRATH ANDREW, NEUKIRCH SÉBASTIEN. THE DIFFERENTIAL GEOMETRY OF PROTEINS AND ITS APPLICATIONS TO STRUCTURE DETERMINATION. ACTA ACUST UNITED AC 2011. [DOI: 10.1142/s1793048008000629] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Understanding the three-dimensional structure of proteins is critical to understand their function. While great progress is being made in understanding the structures of soluble proteins, large classes of proteins such as membrane proteins, large macromolecular assemblies, and partially organized or heterogeneous structures are being comparatively neglected. Part of the difficulty is that the coordinate models we use to represent protein structure are discrete and static, whereas the molecules themselves are flexible and dynamic. In this article, we review methods to develop a continuous description of proteins more general than the traditional coordinate models and which can describe smooth changes in form. This description can be shown to be strictly equivalent to the traditional atomic coordinate description.
Collapse
Affiliation(s)
- ALAIN GORIELY
- Program in Applied Mathematics and Department of Mathematics, University of Arizona, Tucson, AZ 85721, USA
| | - ANDREW HAUSRATH
- Department of Biochemistry and Molecular Biophysics, University of Arizona, Tucson, AZ 85721, USA
| | - SÉBASTIEN NEUKIRCH
- Laboratoire de Modélisation en Mécanique, UMR 7607, CNRS & Université Pierre et Marie Curie, Paris, France
| |
Collapse
|
35
|
Zeng J, Roberts KE, Zhou P, Donald BR. A Bayesian approach for determining protein side-chain rotamer conformations using unassigned NOE data. J Comput Biol 2011; 18:1661-79. [PMID: 21970619 DOI: 10.1089/cmb.2011.0172] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A major bottleneck in protein structure determination via nuclear magnetic resonance (NMR) is the lengthy and laborious process of assigning resonances and nuclear Overhauser effect (NOE) cross peaks. Recent studies have shown that accurate backbone folds can be determined using sparse NMR data, such as residual dipolar couplings (RDCs) or backbone chemical shifts. This opens a question of whether we can also determine the accurate protein side-chain conformations using sparse or unassigned NMR data. We attack this question by using unassigned nuclear Overhauser effect spectroscopy (NOESY) data, which records the through-space dipolar interactions between protons nearby in three-dimensional (3D) space. We propose a Bayesian approach with a Markov random field (MRF) model to integrate the likelihood function derived from observed experimental data, with prior information (i.e., empirical molecular mechanics energies) about the protein structures. We unify the side-chain structure prediction problem with the side-chain structure determination problem using unassigned NMR data, and apply the deterministic dead-end elimination (DEE) and A* search algorithms to provably find the global optimum solution that maximizes the posterior probability. We employ a Hausdorff-based measure to derive the likelihood of a rotamer or a pairwise rotamer interaction from unassigned NOESY data. In addition, we apply a systematic and rigorous approach to estimate the experimental noise in NMR data, which also determines the weighting factor of the data term in the scoring function derived from the Bayesian framework. We tested our approach on real NMR data of three proteins: the FF Domain 2 of human transcription elongation factor CA150 (FF2), the B1 domain of Protein G (GB1), and human ubiquitin. The promising results indicate that our algorithm can be applied in high-resolution protein structure determination. Since our approach does not require any NOE assignment, it can accelerate the NMR structure determination process.
Collapse
Affiliation(s)
- Jianyang Zeng
- Department of Computer Science, Duke University, Durham, NC 27708, USA
| | | | | | | |
Collapse
|
36
|
Zeng J, Zhou P, Donald BR. Protein side-chain resonance assignment and NOE assignment using RDC-defined backbones without TOCSY data. JOURNAL OF BIOMOLECULAR NMR 2011; 50:371-95. [PMID: 21706248 PMCID: PMC3155202 DOI: 10.1007/s10858-011-9522-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Accepted: 05/19/2011] [Indexed: 05/31/2023]
Abstract
One bottleneck in NMR structure determination lies in the laborious and time-consuming process of side-chain resonance and NOE assignments. Compared to the well-studied backbone resonance assignment problem, automated side-chain resonance and NOE assignments are relatively less explored. Most NOE assignment algorithms require nearly complete side-chain resonance assignments from a series of through-bond experiments such as HCCH-TOCSY or HCCCONH. Unfortunately, these TOCSY experiments perform poorly on large proteins. To overcome this deficiency, we present a novel algorithm, called NASCA: (NOE Assignment and Side-Chain Assignment), to automate both side-chain resonance and NOE assignments and to perform high-resolution protein structure determination in the absence of any explicit through-bond experiment to facilitate side-chain resonance assignment, such as HCCH-TOCSY. After casting the assignment problem into a Markov Random Field (MRF), NASCA: extends and applies combinatorial protein design algorithms to compute optimal assignments that best interpret the NMR data. The MRF captures the contact map information of the protein derived from NOESY spectra, exploits the backbone structural information determined by RDCs, and considers all possible side-chain rotamers. The complexity of the combinatorial search is reduced by using a dead-end elimination (DEE) algorithm, which prunes side-chain resonance assignments that are provably not part of the optimal solution. Then an A* search algorithm is employed to find a set of optimal side-chain resonance assignments that best fit the NMR data. These side-chain resonance assignments are then used to resolve the NOE assignment ambiguity and compute high-resolution protein structures. Tests on five proteins show that NASCA: assigns resonances for more than 90% of side-chain protons, and achieves about 80% correct assignments. The final structures computed using the NOE distance restraints assigned by NASCA: have backbone RMSD 0.8-1.5 Å from the reference structures determined by traditional NMR approaches.
Collapse
Affiliation(s)
- Jianyang Zeng
- Department of Computer Science, Duke University, Durham NC 27708
| | - Pei Zhou
- Department of Biochemistry, Duke University Medical Center, Durham NC 27710
| | - Bruce Randall Donald
- Department of Computer Science, Duke University, Durham NC 27708
- Department of Biochemistry, Duke University Medical Center, Durham NC 27710
| |
Collapse
|
37
|
Stroganov OV, Novikov FN, Zeifman AA, Stroylov VS, Chilov GG. TSAR, a new graph-theoretical approach to computational modeling of protein side-chain flexibility: Modeling of ionization properties of proteins. Proteins 2011; 79:2693-710. [DOI: 10.1002/prot.23099] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2010] [Revised: 05/16/2011] [Accepted: 05/27/2011] [Indexed: 11/09/2022]
|
38
|
Samish I, MacDermaid CM, Perez-Aguilar JM, Saven JG. Theoretical and Computational Protein Design. Annu Rev Phys Chem 2011; 62:129-49. [DOI: 10.1146/annurev-physchem-032210-103509] [Citation(s) in RCA: 119] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
| | | | | | - Jeffery G. Saven
- Department of Chemistry, University of Pennsylvania, Philadelphia, Pennsylvania 19104;
| |
Collapse
|
39
|
MacDonald JT, Barnes C, Kitney RI, Freemont PS, Stan GBV. Computational design approaches and tools for synthetic biology. Integr Biol (Camb) 2011; 3:97-108. [DOI: 10.1039/c0ib00077a] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
40
|
Schreier B, Höcker B. Engineering the enolase magnesium II binding site: implications for its evolution. Biochemistry 2010; 49:7582-9. [PMID: 20690637 DOI: 10.1021/bi100954f] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The glycolytic enzyme enolase catalyzes the reversible elimination of water from 2-phosphoglycerate (2-PGA) to form phosphoenolpyruvate (PEP). Two magnesium ions in the active site are thought to facilitate the reaction by activation of the C2 proton of 2-PGA and charge stabilization of the intermediate. The initial abstraction of a proton from a carboxylic acid is common to all members of the enolase superfamily, yet in all other known members of this superfamily, only one magnesium ion (MgI) per active site is sufficient to promote catalysis. We wanted to further investigate the importance of the second magnesium ion (MgII) for the catalytic mechanism of yeast enolase 1. Toward this end, we removed all MgII coordinating residues and replaced substrate-MgII interactions by introducing positively charged side chains. High-resolution crystal structures and activity assays show that the introduced positively charged side chains effectively prohibit MgII binding but fail to promote catalysis. We conclude that enolase is inactive without MgII, yet control mutants without additional positively charged side chains retain basal enolase activity through binding of magnesium to 2-PGA in an open active site without the help of MgII coordinating residues. Thus, we believe that ancestral enolase activity might have evolved in a member of the enolase superfamily that provides only the necessary catalytic residues and the binding site for MgI. Additionally, precatalytic binding of 2-PGA to the apo state of enolase was observed.
Collapse
Affiliation(s)
- Bettina Schreier
- Max Planck Institute for Developmental Biology, Spemannstrasse 35, 72076 Tübingen, Germany
| | | |
Collapse
|
41
|
Friedland GD, Kortemme T. Designing ensembles in conformational and sequence space to characterize and engineer proteins. Curr Opin Struct Biol 2010; 20:377-84. [DOI: 10.1016/j.sbi.2010.02.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2010] [Accepted: 02/19/2010] [Indexed: 11/16/2022]
|
42
|
Abstract
In recent years, there have been significant advances in the field of computational protein design including the successful computational design of enzymes based on backbone scaffolds from experimentally solved structures. It is likely that large-scale sampling of protein backbone conformations will become necessary as further progress is made on more complicated systems. Removing the constraint of having to use scaffolds based on known protein backbones is a potential method of solving the problem. With this application in mind, we describe a method to systematically construct a large number of de novo backbone structures from idealized topological forms in a top–down hierarchical approach. The structural properties of these novel backbone scaffolds were analyzed and compared with a set of high-resolution experimental structures from the protein data bank (PDB). It was found that the Ramachandran plot distribution and relative γ- and β-turn frequencies were similar to those found in the PDB. The de novo scaffolds were sequence designed with RosettaDesign, and the energy distributions and amino acid compositions were comparable with the results for redesigned experimentally solved backbones. Proteins 2010. © 2009 Wiley-Liss, Inc.
Collapse
Affiliation(s)
- James T MacDonald
- Division of Mathematical Biology, MRC National Institute for Medical Research, The Ridgeway, Mill Hill, London NW7 1AA
| | | | | | | |
Collapse
|
43
|
McAllister SR, Floudas CA. An improved hybrid global optimization method for protein tertiary structure prediction. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS 2010; 45:377-413. [PMID: 20357906 PMCID: PMC2847311 DOI: 10.1007/s10589-009-9277-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
First principles approaches to the protein structure prediction problem must search through an enormous conformational space to identify low-energy, near-native structures. In this paper, we describe the formulation of the tertiary structure prediction problem as a nonlinear constrained minimization problem, where the goal is to minimize the energy of a protein conformation subject to constraints on torsion angles and interatomic distances. The core of the proposed algorithm is a hybrid global optimization method that combines the benefits of the αBB deterministic global optimization approach with conformational space annealing. These global optimization techniques employ a local minimization strategy that combines torsion angle dynamics and rotamer optimization to identify and improve the selection of initial conformations and then applies a sequential quadratic programming approach to further minimize the energy of the protein conformations subject to constraints. The proposed algorithm demonstrates the ability to identify both lower energy protein structures, as well as larger ensembles of low-energy conformations.
Collapse
|
44
|
Abstract
The rational design of artificial enzymes, either by applying physico-chemical intuition of protein structure and function or with the aid of computational methods, is a promising area of research with the potential to tremendously impact medicine, industrial chemistry and energy production. Designed proteins also provide a powerful platform for dissecting enzyme mechanisms of natural systems. Artificial enzymes have come a long way from simple α-helical peptide catalysts to proteins that facilitate multistep chemical reactions designed by state-of-the-art computational methods. Looking forward, we examine strategies employed by natural enzymes that could be used to improve the speed and selectivity of artificial catalysts.
Collapse
|
45
|
Abstract
This paper discusses recent optimization approaches to the protein side-chain prediction problem, protein structural alignment, and molecular structure determination from X-ray diffraction measurements. The machinery employed to solve these problems has included algorithms from linear programming, dynamic programming, combinatorial optimization, and mixed-integer nonlinear programming. Many of these problems are purely continuous in nature. Yet, to this date, they have been approached mostly via combinatorial optimization algorithms that are applied to discrete approximations. The main purpose of the paper is to offer an introduction and motivate further systems approaches to these problems.
Collapse
Affiliation(s)
- Nikolaos V. Sahinidis
- Department of Chemical Engineering Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
46
|
Hong EJ, Lippow SM, Tidor B, Lozano-Pérez T. Rotamer optimization for protein design through MAP estimation and problem-size reduction. J Comput Chem 2009; 30:1923-45. [PMID: 19123203 DOI: 10.1002/jcc.21188] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The search for the global minimum energy conformation (GMEC) of protein side chains is an important computational challenge in protein structure prediction and design. Using rotamer models, the problem is formulated as a NP-hard optimization problem. Dead-end elimination (DEE) methods combined with systematic A* search (DEE/A*) has proven useful, but may not be strong enough as we attempt to solve protein design problems where a large number of similar rotamers is eligible and the network of interactions between residues is dense. In this work, we present an exact solution method, named BroMAP (branch-and-bound rotamer optimization using MAP estimation), for such protein design problems. The design goal of BroMAP is to be able to expand smaller search trees than conventional branch-and-bound methods while performing only a moderate amount of computation in each node, thereby reducing the total running time. To achieve that, BroMAP attempts reduction of the problem size within each node through DEE and elimination by lower bounds from approximate maximum-a-posteriori (MAP) estimation. The lower bounds are also exploited in branching and subproblem selection for fast discovery of strong upper bounds. Our computational results show that BroMAP tends to be faster than DEE/A* for large protein design cases. BroMAP also solved cases that were not solved by DEE/A* within the maximum allowed time, and did not incur significant disadvantage for cases where DEE/A* performed well. Therefore, BroMAP is particularly applicable to large protein design problems where DEE/A* struggles and can also substitute for DEE/A* in general GMEC search.
Collapse
Affiliation(s)
- Eun-Jong Hong
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | | | | | | |
Collapse
|
47
|
Fromer M, Yanover C. Accurate prediction for atomic-level protein design and its application in diversifying the near-optimal sequence space. Proteins 2009; 75:682-705. [PMID: 19003998 DOI: 10.1002/prot.22280] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The task of engineering a protein to assume a target three-dimensional structure is known as protein design. Computational search algorithms are devised to predict a minimal energy amino acid sequence for a particular structure. In practice, however, an ensemble of low-energy sequences is often sought. Primarily, this is performed because an individual predicted low-energy sequence may not necessarily fold to the target structure because of both inaccuracies in modeling protein energetics and the nonoptimal nature of search algorithms employed. Additionally, some low-energy sequences may be overly stable and thus lack the dynamic flexibility required for biological functionality. Furthermore, the investigation of low-energy sequence ensembles will provide crucial insights into the pseudo-physical energy force fields that have been derived to describe structural energetics for protein design. Significantly, numerous studies have predicted low-energy sequences, which were subsequently synthesized and demonstrated to fold to desired structures. However, the characterization of the sequence space defined by such energy functions as compatible with a target structure has not been performed in full detail. This issue is critical for protein design scientists to successfully continue using these force fields at an ever-increasing pace and scale. In this paper, we present a conceptually novel algorithm that rapidly predicts the set of lowest energy sequences for a given structure. Based on the theory of probabilistic graphical models, it performs efficient inspection and partitioning of the near-optimal sequence space, without making any assumptions of positional independence. We benchmark its performance on a diverse set of relevant protein design examples and show that it consistently yields sequences of lower energy than those derived from state-of-the-art techniques. Thus, we find that previously presented search techniques do not fully depict the low-energy space as precisely. Examination of the predicted ensembles indicates that, for each structure, the amino acid identity at a majority of positions must be chosen extremely selectively so as to not incur significant energetic penalties. We investigate this high degree of similarity and demonstrate how more diverse near-optimal sequences can be predicted in order to systematically overcome this bottleneck for computational design. Finally, we exploit this in-depth analysis of a collection of the lowest energy sequences to suggest an explanation for previously observed experimental design results. The novel methodologies introduced here accurately portray the sequence space compatible with a protein structure and further supply a scheme to yield heterogeneous low-energy sequences, thus providing a powerful instrument for future work on protein design.
Collapse
Affiliation(s)
- Menachem Fromer
- School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel.
| | | |
Collapse
|
48
|
Jha AN, Ananthasuresh GK, Vishveshwara S. A search for energy minimized sequences of proteins. PLoS One 2009; 4:e6684. [PMID: 19690619 PMCID: PMC2724685 DOI: 10.1371/journal.pone.0006684] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2009] [Accepted: 07/23/2009] [Indexed: 11/21/2022] Open
Abstract
In this paper, we present numerical evidence that supports the notion of minimization in the sequence space of proteins for a target conformation. We use the conformations of the real proteins in the Protein Data Bank (PDB) and present computationally efficient methods to identify the sequences with minimum energy. We use edge-weighted connectivity graph for ranking the residue sites with reduced amino acid alphabet and then use continuous optimization to obtain the energy-minimizing sequences. Our methods enable the computation of a lower bound as well as a tight upper bound for the energy of a given conformation. We validate our results by using three different inter-residue energy matrices for five proteins from protein data bank (PDB), and by comparing our energy-minimizing sequences with 80 million diverse sequences that are generated based on different considerations in each case. When we submitted some of our chosen energy-minimizing sequences to Basic Local Alignment Search Tool (BLAST), we obtained some sequences from non-redundant protein sequence database that are similar to ours with an E-value of the order of 10-7. In summary, we conclude that proteins show a trend towards minimizing energy in the sequence space but do not seem to adopt the global energy-minimizing sequence. The reason for this could be either that the existing energy matrices are not able to accurately represent the inter-residue interactions in the context of the protein environment or that Nature does not push the optimization in the sequence space, once it is able to perform the function.
Collapse
Affiliation(s)
- Anupam Nath Jha
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - G. K. Ananthasuresh
- Department of Mechanical Engineering, Indian Institute of Science, Bangalore, India
- * E-mail: (SV); (GKA)
| | - Saraswathi Vishveshwara
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- * E-mail: (SV); (GKA)
| |
Collapse
|
49
|
Suárez M, Jaramillo A. Challenges in the computational design of proteins. J R Soc Interface 2009; 6 Suppl 4:S477-91. [PMID: 19324680 PMCID: PMC2843960 DOI: 10.1098/rsif.2008.0508.focus] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2008] [Accepted: 02/04/2009] [Indexed: 11/12/2022] Open
Abstract
Protein design has many applications not only in biotechnology but also in basic science. It uses our current knowledge in structural biology to predict, by computer simulations, an amino acid sequence that would produce a protein with targeted properties. As in other examples of synthetic biology, this approach allows the testing of many hypotheses in biology. The recent development of automated computational methods to design proteins has enabled proteins to be designed that are very different from any known ones. Moreover, some of those methods mostly rely on a physical description of atomic interactions, which allows the designed sequences not to be biased towards known proteins. In this paper, we will describe the use of energy functions in computational protein design, the use of atomic models to evaluate the free energy in the unfolded and folded states, the exploration and optimization of amino acid sequences, the problem of negative design and the design of biomolecular function. We will also consider its use together with the experimental techniques such as directed evolution. We will end by discussing the challenges ahead in computational protein design and some of their future applications.
Collapse
Affiliation(s)
- María Suárez
- Laboratoire de Biochimie, Ecole Polytechnique, CNRS, 91128 Palaiseau Cedex, France
- Epigenomics Project, Genopole, Université d'Evry Val d'Essonne-Genopole-CNRS, Tour Evry2, Etage 10, Terrasses de l'Agora, 91034 Evry Cedex, France
| | - Alfonso Jaramillo
- Laboratoire de Biochimie, Ecole Polytechnique, CNRS, 91128 Palaiseau Cedex, France
- Epigenomics Project, Genopole, Université d'Evry Val d'Essonne-Genopole-CNRS, Tour Evry2, Etage 10, Terrasses de l'Agora, 91034 Evry Cedex, France
| |
Collapse
|
50
|
Van der Sloot AM, Kiel C, Serrano L, Stricher F. Protein design in biological networks: from manipulating the input to modifying the output. Protein Eng Des Sel 2009; 22:537-42. [DOI: 10.1093/protein/gzp032] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|