1
|
Ranaudo A, Giulini M, Pelissou Ayuso A, Bonvin AMJJ. Modeling Protein-Glycan Interactions with HADDOCK. J Chem Inf Model 2024; 64:7816-7825. [PMID: 39360946 PMCID: PMC11480977 DOI: 10.1021/acs.jcim.4c01372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 09/23/2024] [Accepted: 09/25/2024] [Indexed: 10/15/2024]
Abstract
The term glycan refers to a broad category of molecules composed of monosaccharide units linked to each other in a variety of ways, whose structural diversity is related to different functions in living organisms. Among others, glycans are recognized by proteins with the aim of carrying information and for signaling purposes. Determining the three-dimensional structures of protein-glycan complexes is essential both for the understanding of the mechanisms glycans are involved in and for applications such as drug design. In this context, molecular docking approaches are of undoubted importance as complementary approaches to experiments. In this study, we show how high ambiguity-driven DOCKing (HADDOCK) can be efficiently used for the prediction of protein-glycan complexes. Using a benchmark of 89 complexes, starting from their bound or unbound forms, and assuming some knowledge of the binding site on the protein, our protocol reaches a 70% and 40% top 5 success rate on bound and unbound data sets, respectively. We show that the main limiting factor is related to the complexity of the glycan to be modeled and the associated conformational flexibility.
Collapse
Affiliation(s)
- Anna Ranaudo
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza Della Scienza 1, Milan 20126, Italy
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Marco Giulini
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Angela Pelissou Ayuso
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Alexandre M. J. J. Bonvin
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| |
Collapse
|
2
|
Gamarra MD, Dieterle ME, Ortigosa J, Lannot JO, Blanco Capurro JI, Di Paola M, Radusky L, Duette G, Piuri M, Modenutti CP. Unveiling crucial amino acids in the carbohydrate recognition domain of a viral protein through a structural bioinformatic approach. Glycobiology 2024; 34:cwae068. [PMID: 39214076 DOI: 10.1093/glycob/cwae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 08/17/2024] [Accepted: 08/26/2024] [Indexed: 09/04/2024] Open
Abstract
Carbohydrate binding modules (CBMs) are protein domains that typically reside near catalytic domains, increasing substrate-protein proximity by constraining the conformational space of carbohydrates. Due to the flexibility and variability of glycans, the molecular details of how these protein regions recognize their target molecules are not always fully understood. Computational methods, including molecular docking and molecular dynamics simulations, have been employed to investigate lectin-carbohydrate interactions. In this study, we introduce a novel approach that integrates multiple computational techniques to identify the critical amino acids involved in the interaction between a CBM located at the tip of bacteriophage J-1's tail and its carbohydrate counterparts. Our results highlight three amino acids that play a significant role in binding, a finding we confirmed through in vitro experiments. By presenting this approach, we offer an intriguing alternative for pinpointing amino acids that contribute to protein-sugar interactions, leading to a more thorough comprehension of the molecular determinants of protein-carbohydrate interactions.
Collapse
Affiliation(s)
- Marcelo D Gamarra
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Maria Eugenia Dieterle
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, New York, NY 10461, United States
| | - Juan Ortigosa
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Jorge O Lannot
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Juan I Blanco Capurro
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Matias Di Paola
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Leandro Radusky
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Gabriel Duette
- The Westmead Institute for Medical Research, Centre for Virus Research, Westmead, NSW 2145, Australia
- Faculty of Medicine and Health, University of Sydney, Sydney, NSW 2050, Australia
| | - Mariana Piuri
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| | - Carlos P Modenutti
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires (FCEyN-UBA) e Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN) CONICET, Pabellón 2 de Ciudad Universitaria, Ciudad de Buenos Aires C1428EHA, Argentina
| |
Collapse
|
3
|
Nieto-Fabregat F, Lenza MP, Marseglia A, Di Carluccio C, Molinaro A, Silipo A, Marchetti R. Computational toolbox for the analysis of protein-glycan interactions. Beilstein J Org Chem 2024; 20:2084-2107. [PMID: 39189002 PMCID: PMC11346309 DOI: 10.3762/bjoc.20.180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 08/01/2024] [Indexed: 08/28/2024] Open
Abstract
Protein-glycan interactions play pivotal roles in numerous biological processes, ranging from cellular recognition to immune response modulation. Understanding the intricate details of these interactions is crucial for deciphering the molecular mechanisms underlying various physiological and pathological conditions. Computational techniques have emerged as powerful tools that can help in drawing, building and visualising complex biomolecules and provide insights into their dynamic behaviour at atomic and molecular levels. This review provides an overview of the main computational tools useful for studying biomolecular systems, particularly glycans, both in free state and in complex with proteins, also with reference to the principles, methodologies, and applications of all-atom molecular dynamics simulations. Herein, we focused on the programs that are generally employed for preparing protein and glycan input files to execute molecular dynamics simulations and analyse the corresponding results. The presented computational toolbox represents a valuable resource for researchers studying protein-glycan interactions and incorporates advanced computational methods for building, visualising and predicting protein/glycan structures, modelling protein-ligand complexes, and analyse MD outcomes. Moreover, selected case studies have been reported to highlight the importance of computational tools in studying protein-glycan systems, revealing the capability of these tools to provide valuable insights into the binding kinetics, energetics, and structural determinants that govern specific molecular interactions.
Collapse
Affiliation(s)
- Ferran Nieto-Fabregat
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Maria Pia Lenza
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Angela Marseglia
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Cristina Di Carluccio
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Antonio Molinaro
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Alba Silipo
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Roberta Marchetti
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| |
Collapse
|
4
|
Bibekar P, Krapp L, Peraro MD. PeSTo-Carbs: Geometric Deep Learning for Prediction of Protein-Carbohydrate Binding Interfaces. J Chem Theory Comput 2024; 20:2985-2991. [PMID: 38602504 PMCID: PMC11044267 DOI: 10.1021/acs.jctc.3c01145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 03/28/2024] [Accepted: 03/29/2024] [Indexed: 04/12/2024]
Abstract
The Protein Structure Transformer (PeSTo), a geometric transformer, has exhibited exceptional performance in predicting protein-protein binding interfaces and distinguishing interfaces with nucleic acids, lipids, small molecules, and ions. In this study, we introduce PeSTo-Carbs, an extension of PeSTo specifically engineered to predict protein-carbohydrate binding interfaces. We evaluate the performance of this approach using independent test sets and compare them with those of previous methods. Furthermore, we highlight the model's capability to specialize in predicting interfaces involving cyclodextrins, a biologically and pharmaceutically significant class of carbohydrates. Our method consistently achieves remarkable accuracy despite the scarcity of available structural data for cyclodextrins.
Collapse
Affiliation(s)
- Parth Bibekar
- Department
of Biological Sciences, Indian Institute
of Science Education and Research (IISER) Kolkata, Mohanpur 741246, India
- Institute
of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
| | - Lucien Krapp
- Institute
of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
- Swiss
Institute of Bioinformatics (SIB), Lausanne 1015, Switzerland
| | - Matteo Dal Peraro
- Institute
of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne 1015, Switzerland
- Swiss
Institute of Bioinformatics (SIB), Lausanne 1015, Switzerland
| |
Collapse
|
5
|
Anila S, Samsonov SA. Benchmarking Water Models in Molecular Dynamics of Protein-Glycosaminoglycan Complexes. J Chem Inf Model 2024; 64:1691-1703. [PMID: 38410841 PMCID: PMC10934818 DOI: 10.1021/acs.jcim.4c00030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 02/08/2024] [Accepted: 02/13/2024] [Indexed: 02/28/2024]
Abstract
Glycosaminoglycans (GAGs) made of repeating disaccharide units intricately engage with proteins, playing a crucial role in the spatial organization of the extracellular matrix (ECM) and the transduction of biological signals in cells to modulate a number of biochemical processes. Exploring protein-GAG interactions reveals several challenges for their analysis, namely, the highly charged and periodic nature of GAGs, their multipose binding, and the abundance of the interfacial water molecules in the protein-GAG complexes. Most of the studies on protein-GAG interactions are conducted using the TIP3P water model, and there are no data on the effect of various water models on the results obtained in molecular dynamics (MD) simulations of protein-GAG complexes. Hence, it is essential to perform a systematic analysis of different water models in MD simulations for these systems. In this work, we aim to evaluate the properties of the protein-GAG complexes in MD simulations using different explicit: TIP3P, SPC/E, TIP4P, TIP4PEw, OPC, and TIP5P and implicit: IGB = 1, 2, 5, 7, and 8 water models to find out which of them are best suited to study the dynamics of protein-GAG complexes. The FF14SB and GLYCAM06 force fields were used for the proteins and GAGs, respectively. The interactions of several GAG types, such as heparin, chondroitin sulfate, and hyaluronic acid with basic fibroblast growth factor, cathepsin K, and CD44 receptor, respectively, are investigated. The observed variations in different descriptors used to study the binding in these complexes emphasize the relevance of the choice of water models for the MD simulation of these complexes.
Collapse
Affiliation(s)
- Sebastian Anila
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
| | - Sergey A. Samsonov
- Faculty of Chemistry, University of Gdańsk, ul. Wita Stwosza 63, 80-308 Gdańsk, Poland
| |
Collapse
|
6
|
Krupa MA, Krupa P. Free-Docking and Template-Based Docking: Physics Versus Knowledge-Based Docking. Methods Mol Biol 2024; 2780:27-41. [PMID: 38987462 DOI: 10.1007/978-1-0716-3985-6_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Docking methods can be used to predict the orientations of two or more molecules with respect of each other using a plethora of various algorithms, which can be based on the physics of interactions or can use information from databases and templates. The usability of these approaches depends on the type and size of the molecules, whose relative orientation will be estimated. The two most important limitations are (i) the computational cost of the prediction and (ii) the availability of the structural information for similar complexes. In general, if there is enough information about similar systems, knowledge-based and template-based methods can significantly reduce the computational cost while providing high accuracy of the prediction. However, if the information about the system topology and interactions between its partners is scarce, physics-based methods are more reliable or even the only choice. In this chapter, knowledge-, template-, and physics-based methods will be compared and briefly discussed providing examples of their usability with a special emphasis on physics-based protein-protein, protein-peptide, and protein-fullerene docking in the UNRES coarse-grained model.
Collapse
Affiliation(s)
- Magdalena A Krupa
- Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland
| | - Paweł Krupa
- Institute of Physics, Polish Academy of Sciences, Warsaw, Poland.
| |
Collapse
|
7
|
Effect of ultrasonication on the protein–polysaccharide complexes: a review. JOURNAL OF FOOD MEASUREMENT AND CHARACTERIZATION 2022. [DOI: 10.1007/s11694-022-01567-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
8
|
Abstract
Glycoscience assembles all the scientific disciplines involved in studying various molecules and macromolecules containing carbohydrates and complex glycans. Such an ensemble involves one of the most extensive sets of molecules in quantity and occurrence since they occur in all microorganisms and higher organisms. Once the compositions and sequences of these molecules are established, the determination of their three-dimensional structural and dynamical features is a step toward understanding the molecular basis underlying their properties and functions. The range of the relevant computational methods capable of addressing such issues is anchored by the specificity of stereoelectronic effects from quantum chemistry to mesoscale modeling throughout molecular dynamics and mechanics and coarse-grained and docking calculations. The Review leads the reader through the detailed presentations of the applications of computational modeling. The illustrations cover carbohydrate-carbohydrate interactions, glycolipids, and N- and O-linked glycans, emphasizing their role in SARS-CoV-2. The presentation continues with the structure of polysaccharides in solution and solid-state and lipopolysaccharides in membranes. The full range of protein-carbohydrate interactions is presented, as exemplified by carbohydrate-active enzymes, transporters, lectins, antibodies, and glycosaminoglycan binding proteins. A final section features a list of 150 tools and databases to help address the many issues of structural glycobioinformatics.
Collapse
Affiliation(s)
- Serge Perez
- Centre de Recherche sur les Macromolecules Vegetales, University of Grenoble-Alpes, Centre National de la Recherche Scientifique, Grenoble F-38041, France
| | - Olga Makshakova
- FRC Kazan Scientific Center of Russian Academy of Sciences, Kazan Institute of Biochemistry and Biophysics, Kazan 420111, Russia
| |
Collapse
|
9
|
Systemic Lectin-Glycan Interaction of Pathogenic Enteric Bacteria in the Gastrointestinal Tract. Int J Mol Sci 2022; 23:ijms23031451. [PMID: 35163392 PMCID: PMC8835900 DOI: 10.3390/ijms23031451] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/20/2022] [Accepted: 01/25/2022] [Indexed: 01/27/2023] Open
Abstract
Microorganisms, such as bacteria, viruses, and fungi, and host cells, such as plants and animals, have carbohydrate chains and lectins that reciprocally recognize one another. In hosts, the defense system is activated upon non-self-pattern recognition of microbial pathogen-associated molecular patterns. These are present in Gram-negative and Gram-positive bacteria and fungi. Glycan-based PAMPs are bound to a class of lectins that are widely distributed among eukaryotes. The first step of bacterial infection in humans is the adhesion of the pathogen's lectin-like proteins to the outer membrane surfaces of host cells, which are composed of glycans. Microbes and hosts binding to each other specifically is of critical importance. The adhesion factors used between pathogens and hosts remain unknown; therefore, research is needed to identify these factors to prevent intestinal infection or treat it in its early stages. This review aims to present a vision for the prevention and treatment of infectious diseases by identifying the role of the host glycans in the immune response against pathogenic intestinal bacteria through studies on the lectin-glycan interaction.
Collapse
|
10
|
Nguyen TB, Pires DEV, Ascher DB. CSM-carbohydrate: protein-carbohydrate binding affinity prediction and docking scoring function. Brief Bioinform 2021; 23:6457169. [PMID: 34882232 DOI: 10.1093/bib/bbab512] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/06/2021] [Accepted: 11/08/2021] [Indexed: 12/29/2022] Open
Abstract
Protein-carbohydrate interactions are crucial for many cellular processes but can be challenging to biologically characterise. To improve our understanding and ability to model these molecular interactions, we used a carefully curated set of 370 protein-carbohydrate complexes with experimental structural and biophysical data in order to train and validate a new tool, cutoff scanning matrix (CSM)-carbohydrate, using machine learning algorithms to accurately predict their binding affinity and rank docking poses as a scoring function. Information on both protein and carbohydrate complementarity, in terms of shape and chemistry, was captured using graph-based structural signatures. Across both training and independent test sets, we achieved comparable Pearson's correlations of 0.72 under cross-validation [root mean square error (RMSE) of 1.58 Kcal/mol] and 0.67 on the independent test (RMSE of 1.72 Kcal/mol), providing confidence in the generalisability and robustness of the final model. Similar performance was obtained across mono-, di- and oligosaccharides, further highlighting the applicability of this approach to the study of larger complexes. We show CSM-carbohydrate significantly outperformed previous approaches and have implemented our method and make all data freely available through both a user-friendly web interface and application programming interface, to facilitate programmatic access at http://biosig.unimelb.edu.au/csm_carbohydrate/. We believe CSM-carbohydrate will be an invaluable tool for helping assess docking poses and the effects of mutations on protein-carbohydrate affinity, unravelling important aspects that drive binding recognition.
Collapse
Affiliation(s)
- Thanh Binh Nguyen
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia.,Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
11
|
Nance ML, Labonte JW, Adolf-Bryfogle J, Gray JJ. Development and Evaluation of GlycanDock: A Protein-Glycoligand Docking Refinement Algorithm in Rosetta. J Phys Chem B 2021; 125:10.1021/acs.jpcb.1c00910. [PMID: 34133179 PMCID: PMC8742512 DOI: 10.1021/acs.jpcb.1c00910] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Carbohydrate chains are ubiquitous in the complex molecular processes of life. These highly diverse chains are recognized by a variety of protein receptors, enabling glycans to regulate many biological functions. High-resolution structures of protein-glycoligand complexes reveal the atomic details necessary to understand this level of molecular recognition and inform application-focused scientific and engineering pursuits. When experimental challenges hinder high-throughput determination of quality structures, computational tools can, in principle, fill the gap. In this work, we introduce GlycanDock, a residue-centric protein-glycoligand docking refinement algorithm developed within the Rosetta macromolecular modeling and design software suite. We performed a benchmark docking assessment using a set of 109 experimentally determined protein-glycoligand complexes as well as 62 unbound protein structures. The GlycanDock algorithm can sample and discriminate among protein-glycoligand models of native-like structural accuracy with statistical reliability from starting structures of up to 7 Å root-mean-square deviation in the glycoligand ring atoms. We show that GlycanDock-refined models qualitatively replicated the known binding specificity of a bacterial carbohydrate-binding module. Finally, we present a protein-glycoligand docking pipeline for generating putative protein-glycoligand complexes when only the glycoligand sequence and unbound protein structure are known. In combination with other carbohydrate modeling tools, the GlycanDock docking refinement algorithm will accelerate research in the glycosciences.
Collapse
Affiliation(s)
- Morgan L. Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Jason W. Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Chemistry, Franklin & Marshall College, Lancaster, Pennsylvania 17603, United States
- Department of Chemistry, Gettysburg College, Gettysburg, Pennsylvania 17325, United States
| | - Jared Adolf-Bryfogle
- Protein Design Lab, Institute for Protein Innovation, Boston, Massachusetts 02115, United States
- Division of Hematology/Oncology, Boston Children’s Hospital, Boston, Massachusetts 02115, United States
- Department of Pediatrics, Harvard Medical School, Boston, Massachusetts 02115, United States
| | - Jeffrey J. Gray
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland 21218, United States
| |
Collapse
|
12
|
Künze G, Huster D, Samsonov SA. Investigation of the structure of regulatory proteins interacting with glycosaminoglycans by combining NMR spectroscopy and molecular modeling - the beginning of a wonderful friendship. Biol Chem 2021; 402:1337-1355. [PMID: 33882203 DOI: 10.1515/hsz-2021-0119] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 04/09/2021] [Indexed: 11/15/2022]
Abstract
The interaction of regulatory proteins with extracellular matrix or cell surface-anchored glycosaminoglycans (GAGs) plays important roles in molecular recognition, wound healing, growth, inflammation and many other processes. In spite of their high biological relevance, protein-GAG complexes are significantly underrepresented in structural databases because standard tools for structure determination experience difficulties in studying these complexes. Co-crystallization with subsequent X-ray analysis is hampered by the high flexibility of GAGs. NMR spectroscopy experiences difficulties related to the periodic nature of the GAGs and the sparse proton network between protein and GAG with distances that typically exceed the detection limit of nuclear Overhauser enhancement spectroscopy. In contrast, computer modeling tools have advanced over the last years delivering specific protein-GAG docking approaches successfully complemented with molecular dynamics (MD)-based analysis. Especially the combination of NMR spectroscopy in solution providing sparse structural constraints with molecular docking and MD simulations represents a useful synergy of forces to describe the structure of protein-GAG complexes. Here we review recent methodological progress in this field and bring up examples where the combination of new NMR methods along with cutting-edge modeling has yielded detailed structural information on complexes of highly relevant cytokines with GAGs.
Collapse
Affiliation(s)
- Georg Künze
- Center for Structural Biology, Vanderbilt University, 465 21st Ave S, 5140 MRB3, Nashville, TN37240, USA.,Department of Chemistry, Vanderbilt University, 7330 Stevenson Center, Station B 351822, Nashville, TN37235, USA.,Institute for Drug Discovery, University of Leipzig, Brüderstr. 34, D-04103Leipzig, Germany
| | - Daniel Huster
- Institute for Medical Physics and Biophysics, University of Leipzig, Härtelstr. 16-18, D-04107Leipzig, Germany
| | - Sergey A Samsonov
- Faculty of Chemistry, University of Gdańsk, Ul. Wita Stwosza 63, 80-308Gdańsk, Poland
| |
Collapse
|
13
|
Zhang S, Chen KY, Zou X. Carbohydrate-Protein Interactions: Advances and Challenges. COMMUNICATIONS IN INFORMATION AND SYSTEMS 2021; 21:147-163. [PMID: 34366717 DOI: 10.4310/cis.2021.v21.n1.a7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A carbohydrate, also called saccharide in biochemistry, is a biomolecule consisting of carbon (C), hydrogen (H) and oxygen (O) atoms. For example, sugars are low molecular-weight carbohydrates, and starches are high molecular-weight carbohydrates. Carbohydrates are the most abundant organic substances in nature and essential constituents of all living things. Protein-carbohydrate interactions play important roles in many biological processes, such as cell growth, differentiation, and aggregation. They also have broad applications in pharmaceutical drug design. In this review, we will summarize the characteristic features of protein-carbohydrate interactions and review the computational methods for structure prediction, energy calculations, and kinetic studies of protein-carbohydrate complexes. Finally, we will discuss the challenges in this field.
Collapse
Affiliation(s)
- Shuang Zhang
- Dalton Cardiovascular Research Center, Department of Physics and Astronomy, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Kyle Yu Chen
- Rock Bridge High School, 4303 South Providence Rd, Columbia, MO 65203, USA
| | - Xiaoqin Zou
- Dalton Cardiovascular Research Center, Department of Physics and Astronomy, Department of Biochemistry, Institute for Data Science and Informatics, University of Missouri, Columbia, MO 65211, USA
| |
Collapse
|
14
|
Siva Shanmugam NR, Jino Blessy J, Veluraja K, Gromiha MM. Prediction of protein-carbohydrate complex binding affinity using structural features. Brief Bioinform 2020; 22:6032626. [PMID: 33313775 DOI: 10.1093/bib/bbaa319] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/28/2020] [Accepted: 10/19/2020] [Indexed: 01/03/2023] Open
Abstract
Protein-carbohydrate interactions play a major role in several cellular and biological processes. Elucidating the factors influencing the binding affinity of protein-carbohydrate complexes and predicting their free energy of binding provide deep insights for understanding the recognition mechanism. In this work, we have collected the experimental binding affinity data for a set of 389 protein-carbohydrate complexes and derived several structure-based features such as contact potentials, interaction energy, number of binding residues and contacts between different types of atoms. Our analysis on the relationship between binding affinity and structural features revealed that the important factors depend on the type of the complex based on number of carbohydrate and protein chains. Specifically, binding site residues, accessible surface area, interactions between various atoms and energy contributions are important to understand the binding affinity. Further, we have developed multiple regression equations for predicting the binding affinity of protein-carbohydrate complexes belonging to six categories of protein-carbohydrate complexes. Our method showed an average correlation and mean absolute error of 0.731 and 1.149 kcal/mol, respectively, between experimental and predicted binding affinities on a jackknife test. We have developed a web server PCA-Pred, Protein-Carbohydrate Affinity Predictor, for predicting the binding affinity of protein-carbohydrate complexes. The web server is freely accessible at https://web.iitm.ac.in/bioinfo2/pcapred/. The web server is implemented using HTML and Python and supports recent versions of major browsers such as Chrome, Firefox, IE10 and Opera.
Collapse
Affiliation(s)
| | | | - K Veluraja
- Indian Institute of Science, Bangalore, India
| | | |
Collapse
|
15
|
Agostino M. Comprehensive analysis of carbohydrate-protein recognition in the Protein Data Bank. Carbohydr Res 2020; 498:108180. [PMID: 33096507 DOI: 10.1016/j.carres.2020.108180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 10/09/2020] [Accepted: 10/14/2020] [Indexed: 10/23/2022]
Abstract
Carbohydrate-protein interactions underpin wide-ranging aspects of biology. However, such interactions remain relatively unexplored in pharmaceutical and biotechnological applications, in part due to the challenges associated with their structural characterisation, both experimentally and computationally. Knowledge-based approaches have shown great success in the prediction of drug-protein and protein-protein interactions, although have not been comprehensively investigated for carbohydrate-protein interactions. In this work, carbohydrate-protein complexes from the Protein Data Bank were comprehensively obtained and analysed to identify patterns in how carbohydrate-protein interactions are mediated.
Collapse
Affiliation(s)
- Mark Agostino
- School of Pharmacy and Biomedical Sciences, Curtin Health Innovation Research Institute and Curtin Institute for Computation, Bentley, Australia.
| |
Collapse
|
16
|
Scherbinina SI, Toukach PV. Three-Dimensional Structures of Carbohydrates and Where to Find Them. Int J Mol Sci 2020; 21:E7702. [PMID: 33081008 PMCID: PMC7593929 DOI: 10.3390/ijms21207702] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023] Open
Abstract
Analysis and systematization of accumulated data on carbohydrate structural diversity is a subject of great interest for structural glycobiology. Despite being a challenging task, development of computational methods for efficient treatment and management of spatial (3D) structural features of carbohydrates breaks new ground in modern glycoscience. This review is dedicated to approaches of chemo- and glyco-informatics towards 3D structural data generation, deposition and processing in regard to carbohydrates and their derivatives. Databases, molecular modeling and experimental data validation services, and structure visualization facilities developed for last five years are reviewed.
Collapse
Affiliation(s)
- Sofya I. Scherbinina
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
- Higher Chemical College, D. Mendeleev University of Chemical Technology of Russia, Miusskaya Square 9, 125047 Moscow, Russia
| | - Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
| |
Collapse
|
17
|
Blanco Capurro JI, Di Paola M, Gamarra MD, Martí MA, Modenutti CP. An efficient use of X-ray information, homology modeling, molecular dynamics and knowledge-based docking techniques to predict protein-monosaccharide complexes. Glycobiology 2019; 29:124-136. [PMID: 30407518 DOI: 10.1093/glycob/cwy102] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2018] [Accepted: 11/06/2018] [Indexed: 01/13/2023] Open
Abstract
Unraveling the structure of lectin-carbohydrate complexes is vital for understanding key biological recognition processes and development of glycomimetic drugs. Molecular Docking application to predict them is challenging due to their low affinity, hydrophilic nature and ligand conformational diversity. In the last decade several strategies, such as the inclusion of glycan conformation specific scoring functions or our developed solvent-site biased method, have improved carbohydrate docking performance but significant challenges remain, in particular, those related to receptor conformational diversity. In the present work we have analyzed conventional and solvent-site biased autodock4 performance concerning receptor conformational diversity as derived from different crystal structures (apo and holo), Molecular Dynamics snapshots and Homology-based models, for 14 different lectin-monosaccharide complexes. Our results show that both conventional and biased docking yield accurate lectin-monosaccharide complexes, starting from either apo or homology-based structures, even when only moderate (45%) sequence identity templates are available. An essential element for success is a proper combination of a middle-sized (10-100 structures) conformational ensemble, derived either from Molecular dynamics or multiple homology model building. Consistent with our previous works, results show that solvent-site biased methods improve overall performance, but that results are still highly system dependent. Finally, our results also show that docking can select the correct receptor structure within the ensemble, underscoring the relevance of joint evaluation of both ligand pose and receptor conformation.
Collapse
Affiliation(s)
- Juan I Blanco Capurro
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), CONICET, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina
| | - Matias Di Paola
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), CONICET, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina
| | - Marcelo Daniel Gamarra
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), CONICET, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina
| | - Marcelo A Martí
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), CONICET, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina
| | - Carlos P Modenutti
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), CONICET, Ciudad Universitaria, Intendente Guiraldes 2160, Ciudad Autónoma de Buenos Aires, Argentina
| |
Collapse
|
18
|
Uciechowska-Kaczmarzyk U, Chauvot de Beauchene I, Samsonov SA. Docking software performance in protein-glycosaminoglycan systems. J Mol Graph Model 2019; 90:42-50. [PMID: 30959268 DOI: 10.1016/j.jmgm.2019.04.001] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2018] [Revised: 03/29/2019] [Accepted: 04/01/2019] [Indexed: 01/09/2023]
Abstract
We present a benchmarking study for protein-glycosaminoglycan systems with eight docking programs: Dock, rDock, ClusPro, PLANTS, HADDOCK, Hex, SwissDock and ATTRACT. We used a non-redundant representative dataset of 28 protein-glycosaminoglycan complexes with experimentally available structures, where a glycosaminoglycan ligand was longer than a trimer. Overall, the ligand binding poses could be correctly predicted in many cases by the tested docking programs, however the ranks of the docking poses are often poorly assigned. Our results suggest that Dock program performs best in terms of the pose placement, has the most suitable scoring function, and its performance did not depend on the ligand size. This suggests that the implementation of the electrostatics as well as the shape complementarity procedure in Dock are the most suitable for docking glycosaminoglycan ligands. We also analyzed how free energy patterns of the benchmarking complexes affect the performance of the evaluated docking software.
Collapse
Affiliation(s)
- Urszula Uciechowska-Kaczmarzyk
- Laboratory of Molecular Modeling, Department of Theoretical Chemistry, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308, Gdańsk, Poland
| | | | - Sergey A Samsonov
- Laboratory of Molecular Modeling, Department of Theoretical Chemistry, Faculty of Chemistry, University of Gdansk, Wita Stwosza 63, 80-308, Gdańsk, Poland.
| |
Collapse
|
19
|
Zhao H, Taherzadeh G, Zhou Y, Yang Y. Computational Prediction of Carbohydrate-Binding Proteins and Binding Sites. ACTA ACUST UNITED AC 2018; 94:e75. [PMID: 30106511 DOI: 10.1002/cpps.75] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Protein-carbohydrate interaction is essential for biological systems, and carbohydrate-binding proteins (CBPs) are important targets when designing antiviral and anticancer drugs. Due to the high cost and difficulty associated with experimental approaches, many computational methods have been developed as complementary approaches to predict CBPs or carbohydrate-binding sites. However, most of these computational methods are not publicly available. Here, we provide a comprehensive review of related studies and demonstrate our two recently developed bioinformatics methods. The method SPOT-CBP is a template-based method for detecting CBPs based on structure through structural homology search combined with a knowledge-based scoring function. This method can yield model complex structure in addition to accurate prediction of CBPs. Furthermore, it has been observed that similarly accurate predictions can be made using structures from homology modeling, which has significantly expanded its applicability. The other method, SPRINT-CBH, is a de novo approach that predicts binding residues directly from protein sequences by using sequence information and predicted structural properties. This approach does not need structurally similar templates and thus is not limited by the current database of known protein-carbohydrate complex structures. These two complementary methods are available at https://sparks-lab.org. © 2018 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Huiying Zhao
- Sun Yat-Sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China
| | - Ghazaleh Taherzadeh
- School of Information and Communication Technology, Griffith University, Gold Coast, Queensland, Australia
| | - Yaoqi Zhou
- School of Information and Communication Technology, Griffith University, Gold Coast, Queensland, Australia.,Institute for Glycomics, Griffith University, Gold Coast, Queensland, Australia
| | - Yuedong Yang
- School of Information and Communication Technology, Griffith University, Gold Coast, Queensland, Australia.,Institute for Glycomics, Griffith University, Gold Coast, Queensland, Australia.,School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
20
|
Abstract
Complex carbohydrates are ubiquitous in nature, and together with proteins and nucleic acids they comprise the building blocks of life. But unlike proteins and nucleic acids, carbohydrates form nonlinear polymers, and they are not characterized by robust secondary or tertiary structures but rather by distributions of well-defined conformational states. Their molecular flexibility means that oligosaccharides are often refractory to crystallization, and nuclear magnetic resonance (NMR) spectroscopy augmented by molecular dynamics (MD) simulation is the leading method for their characterization in solution. The biological importance of carbohydrate-protein interactions, in organismal development as well as in disease, places urgency on the creation of innovative experimental and theoretical methods that can predict the specificity of such interactions and quantify their strengths. Additionally, the emerging realization that protein glycosylation impacts protein function and immunogenicity places the ability to define the mechanisms by which glycosylation impacts these features at the forefront of carbohydrate modeling. This review will discuss the relevant theoretical approaches to studying the three-dimensional structures of this fascinating class of molecules and interactions, with reference to the relevant experimental data and techniques that are key for validation of the theoretical predictions.
Collapse
Affiliation(s)
- Robert J Woods
- Complex Carbohydrate Research Center and Department of Biochemistry and Molecular Biology , University of Georgia , 315 Riverbend Road , Athens , Georgia 30602 , United States
| |
Collapse
|
21
|
Lacetera A, Berbís MÁ, Nurisso A, Jiménez-Barbero J, Martín-Santamaría S. Computational Chemistry Tools in Glycobiology: Modelling of Carbohydrate–Protein Interactions. COMPUTATIONAL TOOLS FOR CHEMICAL BIOLOGY 2017. [DOI: 10.1039/9781788010139-00145] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Molecular modelling provides a major impact in the field of glycosciences, helping in the characterisation of the molecular basis of the recognition between lectins from pathogens and human glycoconjugates, and in the design of glycocompounds with anti-infectious properties. The conformational properties of oligosaccharides are complex, and therefore, the simulation of these properties is a challenging task. Indeed, the development of suitable force fields is required for the proper simulation of important problems in glycobiology, such as the interatomic interactions responsible for oligosaccharide and glycoprotein dynamics, including O-linkages in oligo- and polysaccharides, and N- and O-linkages in glycoproteins. The computational description of representative examples is discussed, herein, related to biologically active oligosaccharides and their interaction with lectins and other proteins, and the new routes open for the design of glycocompounds with promising biological activities.
Collapse
Affiliation(s)
- Alessandra Lacetera
- Center for Biological Research CIB-CSIC. Ramiro de Maeztu, 9 28040-Madrid Spain
| | - M. Álvaro Berbís
- Center for Biological Research CIB-CSIC. Ramiro de Maeztu, 9 28040-Madrid Spain
| | - Alessandra Nurisso
- School of Pharmaceutical Sciences University of Geneva, University of Lausanne, Rue Michel Servet 1 CH-1211 Geneva 4 Switzerland
| | | | | |
Collapse
|
22
|
Abstract
Many carbohydrate-binding proteins contain aromatic amino acid residues in their binding sites. These residues interact with carbohydrates in a stacking geometry via CH/π interactions. These interactions can be found in carbohydrate-binding proteins, including lectins, enzymes and carbohydrate transporters. Besides this, many non-protein aromatic molecules (natural as well as artificial) can bind saccharides using these interactions. Recent computational and experimental studies have shown that carbohydrate–aromatic CH/π interactions are dispersion interactions, tuned by electrostatics and partially stabilized by a hydrophobic effect in solvated systems.
Collapse
|
23
|
Babik S, Samsonov SA, Pisabarro MT. Computational drill down on FGF1-heparin interactions through methodological evaluation. Glycoconj J 2017; 34:427-440. [PMID: 27858202 PMCID: PMC5487771 DOI: 10.1007/s10719-016-9745-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Revised: 10/25/2016] [Accepted: 10/27/2016] [Indexed: 01/22/2023]
Abstract
Glycosaminoglycans (GAGs) exhibit a key role in cellular communication processes through interactions with target proteins of the extracellular matrix (ECM). The sandwich-like interaction established between Fibroblast growth factor (FGF) and heparin (HE) represents quite a peculiar protein-GAG-protein system, which has been both structurally and functionally intensively studied. The molecular recognition characteristics of this system have been exploited in various computational studies in order to deepen understanding of GAG-protein interactions. Here, we drill down on the interactions established in this peculiar macromolecular complex by analyzing the applicability of docking techniques and molecular dynamics (MD)-based approaches, and we dissect the molecular recognition properties exhibited by FGF towards a series of HE derivatives. We examine the sensitivity of MM-GBSA free energy calculations in terms of receptor conformational space sampling and changes in the ligand structures. Furthermore, we investigate its predictive power in combination with other computational methods, namely the well-established Autodock3 (AD3) and dynamic molecular docking (DMD), a targeted MD-based docking method specifically developed to account for flexibility and solvent in computer simulations of protein-GAG systems. Our results show that a site-mapping approach can be effectively combined with AD3 and DMD calculations to accurately reproduce available experimental data and, furthermore, to determine specific GAG recognition patterns. This study deepens our understanding of the applicability of available theoretical approaches to the investigation of molecular recognition in protein-GAG systems.
Collapse
Affiliation(s)
- Sándor Babik
- Structural Bioinformatics, BIOTEC TU Dresden, Dresden, 01307, Germany
| | - Sergey A Samsonov
- Structural Bioinformatics, BIOTEC TU Dresden, Dresden, 01307, Germany
| | | |
Collapse
|
24
|
Investigating carbohydrate based ligands for galectin-3 with docking and molecular dynamics studies. J Mol Graph Model 2017; 71:211-217. [DOI: 10.1016/j.jmgm.2016.10.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2016] [Revised: 10/22/2016] [Accepted: 10/29/2016] [Indexed: 11/21/2022]
|
25
|
Samsonov SA, Pisabarro MT. Computational analysis of interactions in structurally available protein-glycosaminoglycan complexes. Glycobiology 2016; 26:850-861. [PMID: 27496767 DOI: 10.1093/glycob/cww055] [Citation(s) in RCA: 58] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Accepted: 04/26/2016] [Indexed: 01/01/2023] Open
Abstract
Glycosaminoglycans represent a class of linear anionic periodic polysaccharides, which play a key role in a variety of biological processes in the extracellular matrix via interactions with their protein targets. Computationally, glycosaminoglycans are very challenging due to their high flexibility, periodicity and electrostatics-driven nature of the interactions with their protein counterparts. In this work, we carry out a detailed computational characterization of the interactions in protein-glycosaminoglycan complexes from the Protein Data Bank (PDB), which are split into two subsets accounting for their intrinsic nature: non-enzymatic-protein-glycosaminoglycan and enzyme-glycosaminoglycan complexes. We apply molecular dynamics to analyze the differences in these two subsets in terms of flexibility, retainment of the native interactions in the simulations, free energy components of binding and contributions of protein residue types to glycosaminoglycan binding. Furthermore, we systematically demonstrate that protein electrostatic potential calculations, previously found to be successful for glycosaminoglycan binding sites prediction for individual systems, are in general very useful for proposing protein surface regions as putative glycosaminoglycan binding sites, which can be further used for local docking calculations with these particular polysaccharides. Finally, the performance of six different docking programs (Autodock 3, Autodock Vina, MOE, eHiTS, FlexX and Glide), some of which proved to perform well for particular protein-glycosaminoglycan complexes in previous work, is evaluated on the complete protein-glycosaminoglycan data set from the PDB. This work contributes to widen our knowledge of protein-glycosaminoglycan molecular recognition and could be useful to steer a choice of the strategies to be applied in theoretical studies of these systems.
Collapse
Affiliation(s)
- Sergey A Samsonov
- Structural Bioinformatics, BIOTEC TU Dresden, Dresden 01307, Germany
| | | |
Collapse
|
26
|
Nivedha AK, Thieker DF, Makeneni S, Hu H, Woods RJ. Vina-Carb: Improving Glycosidic Angles during Carbohydrate Docking. J Chem Theory Comput 2016; 12:892-901. [PMID: 26744922 DOI: 10.1021/acs.jctc.5b00834] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Molecular docking programs are primarily designed to align rigid, drug-like fragments into the binding sites of macromolecules and frequently display poor performance when applied to flexible carbohydrate molecules. A critical source of flexibility within an oligosaccharide is the glycosidic linkages. Recently, Carbohydrate Intrinsic (CHI) energy functions were reported that attempt to quantify the glycosidic torsion angle preferences. In the present work, the CHI-energy functions have been incorporated into the AutoDock Vina (ADV) scoring function, subsequently termed Vina-Carb (VC). Two user-adjustable parameters have been introduced, namely, a CHI- energy weight term (chi_coeff) that affects the magnitude of the CHI-energy penalty and a CHI-cutoff term (chi_cutoff) that negates CHI-energy penalties below a specified value. A data set consisting of 101 protein-carbohydrate complexes and 29 apoprotein structures was used in the development and testing of VC, including antibodies, lectins, and carbohydrate binding modules. Accounting for the intramolecular energies of the glycosidic linkages in the oligosaccharides during docking led VC to produce acceptable structures within the top five ranked poses in 74% of the systems tested, compared to a success rate of 55% for ADV. An enzyme system was employed in order to illustrate the potential application of VC to proteins that may distort glycosidic linkages of carbohydrate ligands upon binding. VC represents a significant step toward accurately predicting the structures of protein-carbohydrate complexes. Furthermore, the described approach is conceptually applicable to any class of ligands that populate well-defined conformational states.
Collapse
Affiliation(s)
- Anita K Nivedha
- Complex Carbohydrate Research Center, University of Georgia , Athens, Georgia 30602, United States
| | - David F Thieker
- Complex Carbohydrate Research Center, University of Georgia , Athens, Georgia 30602, United States
| | - Spandana Makeneni
- Complex Carbohydrate Research Center, University of Georgia , Athens, Georgia 30602, United States
| | - Huimin Hu
- Complex Carbohydrate Research Center, University of Georgia , Athens, Georgia 30602, United States
| | - Robert J Woods
- Complex Carbohydrate Research Center, University of Georgia , Athens, Georgia 30602, United States
| |
Collapse
|
27
|
Agostino M, Yuriev E. Editorial: Structural and Computational Glycobiology - Immunity and Infection. Front Immunol 2015; 6:359. [PMID: 26236314 PMCID: PMC4500954 DOI: 10.3389/fimmu.2015.00359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2015] [Accepted: 07/01/2015] [Indexed: 11/28/2022] Open
Affiliation(s)
- Mark Agostino
- CHIRI Biosciences and Curtin Institute for Computation, School of Biomedical Sciences, Curtin University , Perth, WA , Australia ; Centre for Biomedical Research, Burnet Institute , Melbourne, VIC , Australia
| | - Elizabeth Yuriev
- Medicinal Chemistry, Monash Institute of Pharmaceutical Sciences, Monash University , Melbourne, VIC , Australia
| |
Collapse
|
28
|
Abstract
The article reviews the significant contributions to, and the present status of, applications of computational methods for the characterization and prediction of protein-carbohydrate interactions. After a presentation of the specific features of carbohydrate modeling, along with a brief description of the experimental data and general features of carbohydrate-protein interactions, the survey provides a thorough coverage of the available computational methods and tools. At the quantum-mechanical level, the use of both molecular orbitals and density-functional theory is critically assessed. These are followed by a presentation and critical evaluation of the applications of semiempirical and empirical methods: QM/MM, molecular dynamics, free-energy calculations, metadynamics, molecular robotics, and others. The usefulness of molecular docking in structural glycobiology is evaluated by considering recent docking- validation studies on a range of protein targets. The range of applications of these theoretical methods provides insights into the structural, energetic, and mechanistic facets that occur in the course of the recognition processes. Selected examples are provided to exemplify the usefulness and the present limitations of these computational methods in their ability to assist in elucidation of the structural basis underlying the diverse function and biological roles of carbohydrates in their dialogue with proteins. These test cases cover the field of both carbohydrate biosynthesis and glycosyltransferases, as well as glycoside hydrolases. The phenomenon of (macro)molecular recognition is illustrated for the interactions of carbohydrates with such proteins as lectins, monoclonal antibodies, GAG-binding proteins, porins, and viruses.
Collapse
Affiliation(s)
- Serge Pérez
- Department of Molecular Pharmacochemistry, CNRS, University Grenoble-Alpes, Grenoble, France.
| | - Igor Tvaroška
- Department of Chemistry, Slovak Academy of Sciences, Bratislava, Slovak Republic; Department of Chemistry, Faculty of Natural Sciences, Constantine The Philosopher University, Nitra, Slovak Republic.
| |
Collapse
|
29
|
Mishra SK, Calabró G, Loeffler HH, Michel J, Koča J. Evaluation of Selected Classical Force Fields for Alchemical Binding Free Energy Calculations of Protein-Carbohydrate Complexes. J Chem Theory Comput 2015; 11:3333-45. [PMID: 26575767 DOI: 10.1021/acs.jctc.5b00159] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Protein-carbohydrate recognition is crucial in many vital biological processes including host-pathogen recognition, cell-signaling, and catalysis. Accordingly, computational prediction of protein-carbohydrate binding free energies is of enormous interest for drug design. However, the accuracy of current force fields (FFs) for predicting binding free energies of protein-carbohydrate complexes is not well understood owing to technical challenges such as the highly polar nature of the complexes, anomerization, and conformational flexibility of carbohydrates. The present study evaluated the performance of alchemical predictions of binding free energies with the GAFF1.7/AM1-BCC and GLYCAM06j force fields for modeling protein-carbohydrate complexes. Mean unsigned errors of 1.1 ± 0.06 (GLYCAM06j) and 2.6 ± 0.08 (GAFF1.7/AM1-BCC) kcal·mol(-1) are achieved for a large data set of monosaccharide ligands for Ralstonia solanacearum lectin (RSL). The level of accuracy provided by GLYCAM06j is sufficient to discriminate potent, moderate, and weak binders, a goal that has been difficult to achieve through other scoring approaches. Accordingly, the protocols presented here could find useful applications in carbohydrate-based drug and vaccine developments.
Collapse
Affiliation(s)
- Sushil K Mishra
- Central European Institute of Technology (CEITEC) and National Centre for Biomolecular Research, Faculty of Science, Masaryk University , Kamenice-5, 625 00 Brno, Czech Republic
| | - Gaetano Calabró
- EaStCHEM School of Chemistry, Joseph Black Building , King's Buildings, Edinburgh EH9 3JJ, United Kingdom
| | - Hannes H Loeffler
- Scientific Computing Department, STFC Daresbury , Warrington, WA4 4AD, United Kingdom
| | - Julien Michel
- EaStCHEM School of Chemistry, Joseph Black Building , King's Buildings, Edinburgh EH9 3JJ, United Kingdom
| | - Jaroslav Koča
- Central European Institute of Technology (CEITEC) and National Centre for Biomolecular Research, Faculty of Science, Masaryk University , Kamenice-5, 625 00 Brno, Czech Republic
| |
Collapse
|
30
|
Force fields and scoring functions for carbohydrate simulation. Carbohydr Res 2015; 401:73-81. [DOI: 10.1016/j.carres.2014.10.028] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2014] [Revised: 10/28/2014] [Accepted: 10/30/2014] [Indexed: 12/31/2022]
|
31
|
Modenutti C, Gauto D, Radusky L, Blanco J, Turjanski A, Hajos S, Marti M. Using crystallographic water properties for the analysis and prediction of lectin-carbohydrate complex structures. Glycobiology 2014; 25:181-96. [DOI: 10.1093/glycob/cwu102] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
|
32
|
Eid S, Saleh N, Zalewski A, Vedani A. Exploring the free-energy landscape of carbohydrate-protein complexes: development and validation of scoring functions considering the binding-site topology. J Comput Aided Mol Des 2014; 28:1191-204. [PMID: 25205292 DOI: 10.1007/s10822-014-9794-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2014] [Accepted: 09/04/2014] [Indexed: 11/30/2022]
Abstract
Carbohydrates play a key role in a variety of physiological and pathological processes and, hence, represent a rich source for the development of novel therapeutic agents. Being able to predict binding mode and binding affinity is an essential, yet lacking, aspect of the structure-based design of carbohydrate-based ligands. We assembled a diverse data set comprising 273 carbohydrate-protein crystal structures with known binding affinity and evaluated the prediction accuracy of a large collection of well-established scoring and free-energy functions, as well as combinations thereof. Unfortunately, the tested functions were not capable of reproducing binding affinities in the studied complexes. To simplify the complex free-energy surface of carbohydrate-protein systems, we classified the studied proteins according to the topology and solvent exposure of the carbohydrate-binding site into five distinct categories. A free-energy model based on the proposed classification scheme reproduced binding affinities in the carbohydrate data set with an r(2) of 0.71 and root-mean-squared-error of 1.25 kcal/mol (N = 236). The improvement in model performance underlines the significance of the differences in the local micro-environments of carbohydrate-binding sites and demonstrates the usefulness of calibrating free-energy functions individually according to binding-site topology and solvent exposure.
Collapse
Affiliation(s)
- Sameh Eid
- Department of Pharmaceutical Sciences, University of Basel, Klingelbergstrasse 50, 4056, Basel, Switzerland,
| | | | | | | |
Collapse
|
33
|
Grant OC, Woods RJ. Recent advances in employing molecular modelling to determine the specificity of glycan-binding proteins. Curr Opin Struct Biol 2014; 28:47-55. [PMID: 25108191 DOI: 10.1016/j.sbi.2014.07.001] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Revised: 07/09/2014] [Accepted: 07/10/2014] [Indexed: 01/11/2023]
Abstract
Impressive improvements in docking performance can be achieved by applying energy bonuses to poses in which glycan hydroxyl groups occupy positions otherwise preferred by bound waters. In addition, inclusion of glycosidic conformational energies allows unlikely glycan conformations to be appropriately penalized. A method for predicting the binding specificity of glycan-binding proteins has been developed, which is based on grafting glycan branches onto a minimal binding determinant in the binding site. Grafting can be used either to screen virtual libraries of glycans, such as the known glycome, or to identify docked poses of minimal binding determinants that are consistent with specificity data. The reviewed advances allow accurate modelling of carbohydrate-protein 3D co-complexes, but challenges remain in ranking the affinity of congeners.
Collapse
Affiliation(s)
- Oliver C Grant
- Complex Carbohydrate Research Center, 315 Riverbend Road, University of Georgia, Athens, GA 30602, United States
| | - Robert J Woods
- Complex Carbohydrate Research Center, 315 Riverbend Road, University of Georgia, Athens, GA 30602, United States; School of Chemistry, University Road, National University of Ireland, Galway, Ireland.
| |
Collapse
|
34
|
Gauto DF, Petruk AA, Modenutti CP, Blanco JI, Di Lella S, Martí MA. Solvent structure improves docking prediction in lectin-carbohydrate complexes. Glycobiology 2012; 23:241-58. [PMID: 23089616 DOI: 10.1093/glycob/cws147] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Recognition and complex formation between proteins and carbohydrates is a key issue in many important biological processes. Determination of the three-dimensional structure of such complexes is thus most relevant, but particularly challenging because of their usually low binding affinity. In silico docking methods have a long-standing tradition in predicting protein-ligand complexes, and allow a potentially fast exploration of a number of possible protein-carbohydrate complex structures. However, determining which of these predicted complexes represents the correct structure is not always straightforward. In this work, we present a modification of the scoring function provided by AutoDock4, a widely used docking software, on the basis of analysis of the solvent structure adjacent to the protein surface, as derived from molecular dynamics simulations, that allows the definition and characterization of regions with higher water occupancy than the bulk solvent, called water sites. They mimic the interaction held between the carbohydrate -OH groups and the protein. We used this information for an improved docking method in relation to its capacity to correctly predict the protein-carbohydrate complexes for a number of tested proteins, whose ligands range in size from mono- to tetrasaccharide. Our results show that the presented method significantly improves the docking predictions. The resulting solvent-structure-biased docking protocol, therefore, appears as a powerful tool for the design and optimization of development of glycomimetic drugs, while providing new insights into protein-carbohydrate interactions. Moreover, the achieved improvement also underscores the relevance of the solvent structure to the protein carbohydrate recognition process.
Collapse
Affiliation(s)
- Diego F Gauto
- Departamento de Química Inorgánica, Analítica y Química Física, CONICET, Ciudad Universitaria, Buenos Aires, Argentina
| | | | | | | | | | | |
Collapse
|
35
|
Lütteke T. The use of glycoinformatics in glycochemistry. Beilstein J Org Chem 2012; 8:915-29. [PMID: 23015842 PMCID: PMC3388882 DOI: 10.3762/bjoc.8.104] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/29/2012] [Indexed: 01/10/2023] Open
Abstract
Glycoinformatics is a small but growing branch of bioinformatics and chemoinformatics. Various resources are now available that can be of use to glycobiologists, but also to chemists who work on the synthesis or analysis of carbohydrates. This article gives an overview of existing glyco-specific databases and tools, with a focus on their application to glycochemistry: Databases can provide information on candidate glycan structures for synthesis, or on glyco-enzymes that can be used to synthesize carbohydrates. Statistical analyses of glycan databases help to plan glycan synthesis experiments. 3D-Structural data of protein-carbohydrate complexes are used in targeted drug design, and tools to support glycan structure analysis aid with quality control. Specific problems of glycoinformatics compared to bioinformatics for genomics or proteomics, especially concerning integration and long-term maintenance of the existing glycan databases, are also discussed.
Collapse
Affiliation(s)
- Thomas Lütteke
- Justus-Liebig-University Gießen, Institute of Veterinary Physiology and Biochemistry, Frankfurter Str. 100, 35392 Gießen, Germany
| |
Collapse
|
36
|
Mishra SK, Adam J, Wimmerová M, Koča J. In silico mutagenesis and docking study of Ralstonia solanacearum RSL lectin: performance of docking software to predict saccharide binding. J Chem Inf Model 2012; 52:1250-61. [PMID: 22506916 DOI: 10.1021/ci200529n] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
In this study, in silico mutagenesis and docking in Ralstonia solanacearum lectin (RSL) were carried out, and the ability of several docking software programs to calculate binding affinity was evaluated. In silico mutation of six amino acid residues (Agr17, Glu28, Gly39, Ala40, Trp76, and Trp81) was done, and a total of 114 in silico mutants of RSL were docked with Me-α-L-fucoside. Our results show that polar residues Arg17 and Glu28, as well as nonpolar amino acids Trp76 and Trp81, are crucial for binding. Gly39 may also influence ligand binding because any mutations at this position lead to a change in the binding pocket shape. The Ala40 residue was found to be the most interesting residue for mutagenesis and can affect the selectivity and/or affinity. In general, the docking software used performs better for high affinity binders and fails to place the binding affinities in the correct order.
Collapse
Affiliation(s)
- Sushil Kumar Mishra
- Central European Institute of Technology, Masaryk University, Kamenice 5, 62500 Brno, Czech Republic
| | | | | | | |
Collapse
|
37
|
Cheng T, Li Q, Zhou Z, Wang Y, Bryant SH. Structure-based virtual screening for drug discovery: a problem-centric review. AAPS J 2012; 14:133-41. [PMID: 22281989 PMCID: PMC3282008 DOI: 10.1208/s12248-012-9322-0] [Citation(s) in RCA: 352] [Impact Index Per Article: 29.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2011] [Accepted: 01/04/2012] [Indexed: 11/30/2022] Open
Abstract
Structure-based virtual screening (SBVS) has been widely applied in early-stage drug discovery. From a problem-centric perspective, we reviewed the recent advances and applications in SBVS with a special focus on docking-based virtual screening. We emphasized the researchers' practical efforts in real projects by understanding the ligand-target binding interactions as a premise. We also highlighted the recent progress in developing target-biased scoring functions by optimizing current generic scoring functions toward certain target classes, as well as in developing novel ones by means of machine learning techniques.
Collapse
Affiliation(s)
- Tiejun Cheng
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894 USA
| | - Qingliang Li
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894 USA
| | - Zhigang Zhou
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894 USA
| | - Yanli Wang
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894 USA
| | - Stephen H. Bryant
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894 USA
| |
Collapse
|
38
|
Theillet FX, Frank M, Vulliez-Le Normand B, Simenel C, Hoos S, Chaffotte A, Bélot F, Guerreiro C, Nato F, Phalipon A, Mulard LA, Delepierre M. Dynamic aspects of antibody:oligosaccharide complexes characterized by molecular dynamics simulations and saturation transfer difference nuclear magnetic resonance. Glycobiology 2011; 21:1570-9. [PMID: 21610193 DOI: 10.1093/glycob/cwr059] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Carbohydrates are likely to maintain significant conformational flexibility in antibody (Ab):carbohydrate complexes. As demonstrated herein for the protective monoclonal Ab (mAb) F22-4 recognizing the Shigella flexneri 2a O-antigen (O-Ag) and numerous synthetic oligosaccharide fragments thereof, the combination of molecular dynamics simulations and nuclear magnetic resonance saturation transfer difference experiments, supported by physicochemical analysis, allows us to determine the binding epitope and its various contributions to affinity without using any modified oligosaccharides. Moreover, the methods used provide insights into ligand flexibility in the complex, thus enabling a better understanding of the Ab affinities observed for a representative set of synthetic O-Ag fragments. Additionally, these complementary pieces of information give evidence to the ability of the studied mAb to recognize internal as well as terminal epitopes of its cognate polysaccharide antigen. Hence, we show that an appropriate combination of computational and experimental methods provides a basis to explore carbohydrate functional mimicry and receptor binding. The strategy may facilitate the design of either ligands or carbohydrate recognition domains, according to needed improvements of the natural carbohydrate:receptor properties.
Collapse
|
39
|
Samsonov SA, Teyra J, Pisabarro MT. Docking glycosaminoglycans to proteins: analysis of solvent inclusion. J Comput Aided Mol Des 2011; 25:477-89. [PMID: 21597992 PMCID: PMC3107433 DOI: 10.1007/s10822-011-9433-1] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2011] [Accepted: 05/06/2011] [Indexed: 12/15/2022]
Abstract
Glycosaminoglycans (GAGs) are anionic polysaccharides, which participate in key processes in the extracellular matrix by interactions with protein targets. Due to their charged nature, accurate consideration of electrostatic and water-mediated interactions is indispensable for understanding GAGs binding properties. However, solvent is often overlooked in molecular recognition studies. Here we analyze the abundance of solvent in GAG-protein interfaces and investigate the challenges of adding explicit solvent in GAG-protein docking experiments. We observe PDB GAG-protein interfaces being significantly more hydrated than protein-protein interfaces. Furthermore, by applying molecular dynamics approaches we estimate that about half of GAG-protein interactions are water-mediated. With a dataset of eleven GAG-protein complexes we analyze how solvent inclusion affects Autodock 3, eHiTs, MOE and FlexX docking. We develop an approach to de novo place explicit solvent into the binding site prior to docking, which uses the GRID program to predict positions of waters and to locate possible areas of solvent displacement upon ligand binding. To investigate how solvent placement affects docking performance, we compare these results with those obtained by taking into account information about the solvent position in the crystal structure. In general, we observe that inclusion of solvent improves the results obtained with these methods. Our data show that Autodock 3 performs best, though it experiences difficulties to quantitatively reproduce experimental data on specificity of heparin/heparan sulfate disaccharides binding to IL-8. Our work highlights the current challenges of introducing solvent in protein-GAGs recognition studies, which is crucial for exploiting the full potential of these molecules for rational engineering.
Collapse
|
40
|
Gandhi NS, Mancera RL. Molecular Dynamics Simulations of CXCL-8 and Its Interactions with a Receptor Peptide, Heparin Fragments, and Sulfated Linked Cyclitols. J Chem Inf Model 2011; 51:335-58. [DOI: 10.1021/ci1003366] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Neha S. Gandhi
- Curtin Health Innovation Research Institute, Western Australian Biomedical Research Institute, ‡School of Biomedical Sciences, and §School of Pharmacy, Curtin University, GPO Box U1987, Perth WA 6845, Australia
| | - Ricardo L. Mancera
- Curtin Health Innovation Research Institute, Western Australian Biomedical Research Institute, ‡School of Biomedical Sciences, and §School of Pharmacy, Curtin University, GPO Box U1987, Perth WA 6845, Australia
| |
Collapse
|
41
|
Pyrkov TV, Ozerov IV, Blitskaia ED, Efremov RG. [Molecular docking: role of intermolecular contacts in formation of complexes of proteins with nucleotides and peptides]. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2010; 36:482-92. [PMID: 20823916 DOI: 10.1134/s1068162010040023] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Knowledge of 3D-structure of protein-ligand complex is a major prerequisite for understanding the functioning mechanism of cellular proteins and membrane receptors. This is also of a great help in rational drug design projects. In the present paper we briefly review the molecular docking approaches used to predict possible orientation of a ligand in the protein binding site. The recent trends to improve the accuracy and efficiency of docking algorithms are demonstrated with the results obtained in Laboratory of Biomolecular Modeling. Particular attention is paid to protein-ligand hydrophobic and stacking interactions responsible for molecular recognition of ligand fragments. Such type of interactions are not always adequately represented in scoring criteria of docking applications that leads to mismatch in 3D-structure complexes predictions. That is why further inquiry of methods to account for these interactions is now the area of active research.
Collapse
|
42
|
Woods RJ, Tessier MB. Computational glycoscience: characterizing the spatial and temporal properties of glycans and glycan-protein complexes. Curr Opin Struct Biol 2010; 20:575-83. [PMID: 20708922 DOI: 10.1016/j.sbi.2010.07.005] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2010] [Revised: 07/08/2010] [Accepted: 07/19/2010] [Indexed: 01/09/2023]
Abstract
Modern computational methods offer the tools to provide insight into the structural and dynamic properties of carbohydrate-protein complexes, beyond that provided by experimental structural biology. Dynamic properties such as the fluctuation of inter-molecular hydrogen bonds, the residency times of bound water molecules, side chain motions and ligand flexibility may be readily determined computationally. When taken with respect to the unliganded states, these calculations can also provide insight into the entropic and enthalpic changes in free energy associated with glycan binding. In addition, virtual ligand screening may be employed to predict the three dimensional (3D) structures of carbohydrate-protein complexes, given 3D structures for the components. In principle, the 3D structure of the protein may itself be derived by modeling, leading to the exciting--albeit high risk--realm of virtual structure prediction. This latter approach is appealing, given the difficulties associated with generating experimental 3D structures for some classes of glycan binding proteins; however, it is also the least robust. An unexpected outcome of the development of algorithms for modeling carbohydrate-protein interactions has been the discovery of errors in reported experimental 3D structures and a heightened awareness of the need for carbohydrate-specific computational tools for assisting in the refinement and curation of carbohydrate-containing crystal structures. Here we present a summary of the basic strategies associated with employing classical force field based modeling approaches to problems in glycoscience, with a focus on identifying typical pitfalls and limitations. This is not an exhaustive review of the current literature, but hopefully will provide a guide for the glycoscientist interested in modeling carbohydrates and carbohydrate-protein complexes, as well as the computational chemist contemplating such tasks.
Collapse
Affiliation(s)
- Robert J Woods
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA 30602, USA.
| | | |
Collapse
|
43
|
Frank M, Schloissnig S. Bioinformatics and molecular modeling in glycobiology. Cell Mol Life Sci 2010; 67:2749-72. [PMID: 20364395 PMCID: PMC2912727 DOI: 10.1007/s00018-010-0352-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 03/08/2010] [Accepted: 03/11/2010] [Indexed: 12/11/2022]
Abstract
The field of glycobiology is concerned with the study of the structure, properties, and biological functions of the family of biomolecules called carbohydrates. Bioinformatics for glycobiology is a particularly challenging field, because carbohydrates exhibit a high structural diversity and their chains are often branched. Significant improvements in experimental analytical methods over recent years have led to a tremendous increase in the amount of carbohydrate structure data generated. Consequently, the availability of databases and tools to store, retrieve and analyze these data in an efficient way is of fundamental importance to progress in glycobiology. In this review, the various graphical representations and sequence formats of carbohydrates are introduced, and an overview of newly developed databases, the latest developments in sequence alignment and data mining, and tools to support experimental glycan analysis are presented. Finally, the field of structural glycoinformatics and molecular modeling of carbohydrates, glycoproteins, and protein-carbohydrate interaction are reviewed.
Collapse
Affiliation(s)
- Martin Frank
- Molecular Structure Analysis Core Facility-W160, Deutsches Krebsforschungszentrum (German Cancer Research Centre), 69120 Heidelberg, Germany.
| | | |
Collapse
|
44
|
Fadda E, Woods RJ. Molecular simulations of carbohydrates and protein-carbohydrate interactions: motivation, issues and prospects. Drug Discov Today 2010; 15:596-609. [PMID: 20594934 DOI: 10.1016/j.drudis.2010.06.001] [Citation(s) in RCA: 148] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2009] [Revised: 04/20/2010] [Accepted: 06/01/2010] [Indexed: 11/16/2022]
Abstract
The characterization of the 3D structure of oligosaccharides, their conjugates and analogs is particularly challenging for traditional experimental methods. Molecular simulation methods provide a basis for interpreting sparse experimental data and for independently predicting conformational and dynamic properties of glycans. Here, we summarize and analyze the issues associated with modeling carbohydrates, with a detailed discussion of four of the most recently developed carbohydrate force fields, reviewed in terms of applicability to natural glycans, carbohydrate-protein complexes and the emerging area of glycomimetic drugs. In addition, we discuss prospectives and new applications of carbohydrate modeling in drug discovery.
Collapse
Affiliation(s)
- Elisa Fadda
- School of Chemistry, National University of Ireland, Galway, Ireland
| | | |
Collapse
|
45
|
Zhang G. Design andin silicoscreening of inhibitors of the cholera toxin. Expert Opin Drug Discov 2009; 4:923-38. [DOI: 10.1517/17460440903186118] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
46
|
Seifert MHJ. Robust optimization of scoring functions for a target class. J Comput Aided Mol Des 2009; 23:633-44. [DOI: 10.1007/s10822-009-9276-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2009] [Accepted: 04/18/2009] [Indexed: 10/20/2022]
|
47
|
Shridhar S, Chattopadhyay D, Yadav G. PLecDom: a program for identification and analysis of plant lectin domains. Nucleic Acids Res 2009; 37:W452-8. [PMID: 19474338 PMCID: PMC2703983 DOI: 10.1093/nar/gkp409] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
PLecDom is a program for detection of Plant Lectin Domains in a polypeptide or EST sequence, followed by a classification of the identified domains into known families. The web server is a collection of plant lectin domain families represented by alignments and profile Hidden Markov Models. PLecDom was developed after a rigorous analysis of evolutionary relationships between available sequences of lectin domains with known specificities. Users can test their sequences for potential lectin domains, catalog the identified domains into broad substrate classes, estimate the extent of divergence of new domains with existing homologs, extract domain boundaries and examine flanking sequences for further analysis. The high prediction accuracy of PLecDom combined with the ease with which it handles large scale input, enabled us to apply the program to protein and EST data from 48 plant genome-sequencing projects in various stages of completion. Our results represent a significant enrichment of the currently annotated plant lectins, and highlight potential targets for biochemical characterization. The search algorithm requires input in fasta format and is designed to process simultaneous connection requests from multiple users, such that huge sets of input sequences can be scanned in a matter of seconds. PLecDom is available at http://www.nipgr.res.in/plecdom.html.
Collapse
Affiliation(s)
- Smriti Shridhar
- Computational Biology Laboratory, National Institute of Plant Genome Research, Aruna Asaf Ali Marg, New Delhi 110067, India
| | | | | |
Collapse
|
48
|
Targeted scoring functions for virtual screening. Drug Discov Today 2009; 14:562-9. [PMID: 19508918 DOI: 10.1016/j.drudis.2009.03.013] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2008] [Revised: 03/20/2009] [Accepted: 03/24/2009] [Indexed: 11/24/2022]
Abstract
The benefit offered by virtual screening methods during the early drug discovery process is directly related to the predictivity of scoring functions that assess protein-ligand binding affinity. The scoring of protein-ligand complexes, however, is still a challenge: despite great efforts, a universal and accurate scoring method has not been developed up to now. Targeted scoring functions, in contrast, enhance virtual screening performance significantly. This review analyzes recent developments and future directions in the area of targeted scoring functions.
Collapse
|