1
|
Canner SW, Shanker S, Gray JJ. Structure-based neural network protein-carbohydrate interaction predictions at the residue level. FRONTIERS IN BIOINFORMATICS 2023; 3:1186531. [PMID: 37409346 PMCID: PMC10318439 DOI: 10.3389/fbinf.2023.1186531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 05/31/2023] [Indexed: 07/07/2023] Open
Abstract
Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate-binding sites on any given protein. Here, we present two deep learning (DL) models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predicts non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate-binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2-predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
Collapse
Affiliation(s)
- Samuel W. Canner
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States
| | - Sudhanshu Shanker
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States
| | - Jeffrey J. Gray
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States
| |
Collapse
|
2
|
Kassem S, McPhee SA, Berisha N, Ulijn RV. Emergence of Cooperative Glucose-Binding Networks in Adaptive Peptide Systems. J Am Chem Soc 2023; 145:9800-9807. [PMID: 37075194 DOI: 10.1021/jacs.3c01620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2023]
Abstract
Minimalistic peptide-based systems that bind sugars in water are challenging to design due to the weakness of interactions and required cooperative contributions from specific amino-acid side chains. Here, we used a bottom-up approach to create peptide-based adaptive glucose-binding networks by mixing glucose with selected sets of input dipeptides (up to 4) in the presence of an amidase to enable in situ reversible peptide elongation, forming mixtures of up to 16 dynamically interacting tetrapeptides. The choice of input dipeptides was based on amino-acid abundance in glucose-binding sites found in the protein data bank, with side chains that can support hydrogen bonding and CH-π interactions. Tetrapeptide sequence amplification patterns, determined through LC-MS analysis, served as a readout for collective interactions and led to the identification of optimized binding networks. Systematic variation of dipeptide input revealed the emergence of two networks of non-covalent hydrogen bonding and CH-π interactions that can co-exist, are cooperative and context-dependent. A cooperative binding mode was determined by studying the binding of the most amplified tetrapeptide (AWAD) with glucose in isolation. Overall, these results demonstrate that the bottom-up design of complex systems can recreate emergent behaviors driven by covalent and non-covalent self-organization that are not observed in reductionist designs and lead to the identification of system-level cooperative binding motifs.
Collapse
Affiliation(s)
- Salma Kassem
- Nanoscience Initiative at Advanced Science Research Center of the Graduate Center of the City University of New York, New York, New York 10031, United States
| | - Scott A McPhee
- Nanoscience Initiative at Advanced Science Research Center of the Graduate Center of the City University of New York, New York, New York 10031, United States
| | - Naxhije Berisha
- Nanoscience Initiative at Advanced Science Research Center of the Graduate Center of the City University of New York, New York, New York 10031, United States
- Ph.D. Programs in Biochemistry and Chemistry, The Graduate Center of the City University of New York, New York, New York 10016, United States
- Department of Pharmacology, Memorial Sloan Kettering Cancer Center, New York, New York 10065, United States
- Department of Chemistry Hunter College, City University of New York, New York, New York 10065, United States
| | - Rein V Ulijn
- Nanoscience Initiative at Advanced Science Research Center of the Graduate Center of the City University of New York, New York, New York 10031, United States
- Ph.D. Programs in Biochemistry and Chemistry, The Graduate Center of the City University of New York, New York, New York 10016, United States
- Department of Chemistry Hunter College, City University of New York, New York, New York 10065, United States
| |
Collapse
|
3
|
Canner SW, Shanker S, Gray JJ. Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.14.531382. [PMID: 36993750 PMCID: PMC10054975 DOI: 10.1101/2023.03.14.531382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate binding sites on any given protein. Here, we present two deep learning models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict carbohydrate binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2 predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
Collapse
Affiliation(s)
- Samuel W Canner
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States of America
| | - Sudhanshu Shanker
- Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| | - Jeffrey J Gray
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States of America
- Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| |
Collapse
|
4
|
Ma B, Wang R, Chen B, Liu W, Zhou S, Li X, Gong J, Shang S, Li Y, Xu D, Tan Z. Insights into the effect of protein glycosylation on carbohydrate substrate binding. Int J Biol Macromol 2023; 235:123833. [PMID: 36870654 DOI: 10.1016/j.ijbiomac.2023.123833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 02/20/2023] [Accepted: 02/21/2023] [Indexed: 03/06/2023]
Abstract
The role of glycosylation in the binding of glycoproteins to carbohydrate substrates has not been well understood. The present study addresses this knowledge gap by elucidating the links between the glycosylation patterns of a model glycoprotein, a Family 1 carbohydrate-binding module (TrCBM1), and the thermodynamic and structural properties of its binding to different carbohydrate substrates using isothermal titration calorimetry and computational simulation. The variations in glycosylation patterns cause a gradual transition of the binding to soluble cellohexaose from an entropy-driven process to an enthalpy-driven one, a trend closely correlated with the glycan-induced shift of the predominant binding force from hydrophobic interactions to hydrogen bonding. However, when binding to a large surface of solid cellulose, glycans on TrCBM1 have a more dispersed distribution and thus have less adverse impact on the hydrophobic interaction forces, leading to overall improved binding. Unexpectedly, our simulation results also suggest an evolutionary role of O-mannosylation in transforming the substrate binding features of TrCBM1 from those of type A CBMs to those of type B CBMs. Taken together, these findings provide new fundamental insights into the molecular basis of the role of glycosylation in protein-carbohydrate interactions and are expected to better facilitate further studies in this area.
Collapse
Affiliation(s)
- Bo Ma
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Ruihan Wang
- MOE Key Laboratory of Green Chemistry and Technology, College of Chemistry, Sichuan University, Sichuan 610064, China
| | - Baoquan Chen
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Wenqiang Liu
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Sen Zhou
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Xue Li
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Jinyuan Gong
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China
| | - Shiying Shang
- Center of Pharmaceutical Technology, School of Pharmaceutical Sciences, Tsinghua University, Beijing 100084, China
| | - Yaohao Li
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China.
| | - Dingguo Xu
- MOE Key Laboratory of Green Chemistry and Technology, College of Chemistry, Sichuan University, Sichuan 610064, China.
| | - Zhongping Tan
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Materia Medica, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100050, China.
| |
Collapse
|
5
|
Siva Shanmugam NR, Jino Blessy J, Veluraja K, Gromiha MM. Prediction of protein-carbohydrate complex binding affinity using structural features. Brief Bioinform 2020; 22:6032626. [PMID: 33313775 DOI: 10.1093/bib/bbaa319] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/28/2020] [Accepted: 10/19/2020] [Indexed: 01/03/2023] Open
Abstract
Protein-carbohydrate interactions play a major role in several cellular and biological processes. Elucidating the factors influencing the binding affinity of protein-carbohydrate complexes and predicting their free energy of binding provide deep insights for understanding the recognition mechanism. In this work, we have collected the experimental binding affinity data for a set of 389 protein-carbohydrate complexes and derived several structure-based features such as contact potentials, interaction energy, number of binding residues and contacts between different types of atoms. Our analysis on the relationship between binding affinity and structural features revealed that the important factors depend on the type of the complex based on number of carbohydrate and protein chains. Specifically, binding site residues, accessible surface area, interactions between various atoms and energy contributions are important to understand the binding affinity. Further, we have developed multiple regression equations for predicting the binding affinity of protein-carbohydrate complexes belonging to six categories of protein-carbohydrate complexes. Our method showed an average correlation and mean absolute error of 0.731 and 1.149 kcal/mol, respectively, between experimental and predicted binding affinities on a jackknife test. We have developed a web server PCA-Pred, Protein-Carbohydrate Affinity Predictor, for predicting the binding affinity of protein-carbohydrate complexes. The web server is freely accessible at https://web.iitm.ac.in/bioinfo2/pcapred/. The web server is implemented using HTML and Python and supports recent versions of major browsers such as Chrome, Firefox, IE10 and Opera.
Collapse
Affiliation(s)
| | | | - K Veluraja
- Indian Institute of Science, Bangalore, India
| | | |
Collapse
|
6
|
Nascimento KS, Andrade MLL, Silva IB, Domingues DL, Chicas LS, Silva MTL, Bringel PHSF, Marques GFO, Martins MGQ, Lóssio CF, Nascimento APM, Wolin IAV, Leal RB, Assreuy AMS, Cavada BS. Heterologous production of α-chain of Dioclea sclerocarpa lectin: Enhancing the biological effects of a wild-type lectin. Int J Biol Macromol 2020; 156:1-9. [PMID: 32275993 DOI: 10.1016/j.ijbiomac.2020.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2020] [Revised: 03/31/2020] [Accepted: 04/02/2020] [Indexed: 10/24/2022]
Abstract
Lectins from Diocleinae subtribe species (family Leguminosae) are of special interest since they present a wide spectrum of biological activities, despite their high structural similarity. During their synthesis in plant cells, these proteins undergo post-translational processing resulting in the formation of three chains (α, β, γ), which constitute the lectins' subunits. Furthermore, such wild-type proteins are presented as isolectins or with different combinations of these chains, which undermine their biotechnological potential. Thus, the present study aimed to produce a recombinant form of the lectin from Dioclea sclerocarpa seeds (DSL), exclusively constituted by α-chain. The recombinant DSL (rDSL) was successfully expressed in E. coli BL21 (DE3) and purified by affinity chromatography (Sephadex G-50), showing a final yield of 74 mg of protein per liter of culture medium and specificity for D-mannose, α-methyl-mannoside and melibiose, unlike the wild-type protein. rDSL presented an effective vasorelaxant effect in rat aortas up to 100% and also interacted with glioma cells C6 and U87. Our results demonstrated an efficient recombinant production of rDSL in a bacterial system that retained some biochemical properties of the wild-type protein, showing wider versatility in sugar specificities and better efficacy in its activity in the biological models evaluated in this work.
Collapse
Affiliation(s)
- Kyria S Nascimento
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil
| | - Maria L L Andrade
- Universidade Federal do Rio Grande do Norte, Escola Agrícola de Jundiaí, Distrito de Jundiaí, 59280000 Macaíba, Rio Grande do Norte, Brazil
| | - Ivanice B Silva
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil
| | - Daniel L Domingues
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil
| | - Larissa S Chicas
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil
| | - Mayara T L Silva
- Departamento de Bioquímica e Programa de Pós-graduação em Bioquímica, Universidade Federal de Santa Catarina, Campus Universitário, 88040900 Florianópolis, Santa Catarina, Brazil
| | - Pedro H S F Bringel
- Instituto Superior de Ciências Biomédicas, Universidade Estadual do Ceará, Campus do Itaperi, 60714903 Fortaleza, Ceará, Brazil
| | - Gabriela F O Marques
- Instituto Superior de Ciências Biomédicas, Universidade Estadual do Ceará, Campus do Itaperi, 60714903 Fortaleza, Ceará, Brazil
| | - Maria G Q Martins
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil; Centro Universitário INTA, Programa de pós-graduação em Biotecnologia, Sobral, Ceará, Brazil
| | - Claudia F Lóssio
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil
| | - Ana Paula M Nascimento
- Departamento de Bioquímica e Programa de Pós-graduação em Bioquímica, Universidade Federal de Santa Catarina, Campus Universitário, 88040900 Florianópolis, Santa Catarina, Brazil
| | - Ingrid A V Wolin
- Departamento de Bioquímica e Programa de Pós-graduação em Bioquímica, Universidade Federal de Santa Catarina, Campus Universitário, 88040900 Florianópolis, Santa Catarina, Brazil
| | - Rodrigo B Leal
- Departamento de Bioquímica e Programa de Pós-graduação em Bioquímica, Universidade Federal de Santa Catarina, Campus Universitário, 88040900 Florianópolis, Santa Catarina, Brazil
| | - Ana M S Assreuy
- Instituto Superior de Ciências Biomédicas, Universidade Estadual do Ceará, Campus do Itaperi, 60714903 Fortaleza, Ceará, Brazil.
| | - Benildo S Cavada
- Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Campus do Pici, 60440970 Fortaleza, Ceará, Brazil.
| |
Collapse
|
7
|
Catelani G, D'Andrea F, Guazzelli L, Griselli A, Testi N, Chiacchio MA, Legnani L, Toma L. Synthesis and conformational analysis of a simplified inositol-model of the Streptococcus pneumoniae 19F capsular polysaccharide repeating unit. Carbohydr Res 2017; 443-444:29-36. [PMID: 28324771 DOI: 10.1016/j.carres.2017.03.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Revised: 03/13/2017] [Accepted: 03/13/2017] [Indexed: 12/27/2022]
Abstract
Carbohydrate mimics have been studied for a long time as useful sugar substitutes, both in the investigation of biological events and in the treatment of sugar-related diseases. Here we report further evaluation of the capabilities of inositols as carbohydrate substitutes. The conformational features of an inositol-model of a simplified repeating unit corresponding to the capsular polysaccharide of Streptococcus pneumoniae 19F has been evaluated by computational analysis, and compared to the native repeating unit. The inositol mimic was synthesized, and its experimental spectroscopic data allowed for verification of the theoretical results.
Collapse
Affiliation(s)
- Giorgio Catelani
- Università di Pisa, Dipartimento di Farmacia, Via Bonanno 33, 56126 Pisa, Italy
| | - Felicia D'Andrea
- Università di Pisa, Dipartimento di Farmacia, Via Bonanno 33, 56126 Pisa, Italy
| | - Lorenzo Guazzelli
- Università di Pisa, Dipartimento di Farmacia, Via Bonanno 33, 56126 Pisa, Italy.
| | - Alessio Griselli
- Università di Pisa, Dipartimento di Farmacia, Via Bonanno 33, 56126 Pisa, Italy
| | - Nicola Testi
- Università di Pisa, Dipartimento di Farmacia, Via Bonanno 33, 56126 Pisa, Italy
| | - Maria Assunta Chiacchio
- Università di Catania, Dipartimento di Scienze del Farmaco, V.le A. Doria 6, 95125 Catania, Italy; Università di Pavia, Dipartimento di Chimica, Via Taramelli 12, 27100 Pavia, Italy
| | - Laura Legnani
- Università di Catania, Dipartimento di Scienze del Farmaco, V.le A. Doria 6, 95125 Catania, Italy; Università di Pavia, Dipartimento di Chimica, Via Taramelli 12, 27100 Pavia, Italy
| | - Lucio Toma
- Università di Pavia, Dipartimento di Chimica, Via Taramelli 12, 27100 Pavia, Italy
| |
Collapse
|