1
|
Yom A, Chiang A, Lewis NE. Boltzmann Model Predicts Glycan Structures from Lectin Binding. Anal Chem 2024; 96:8332-8341. [PMID: 38720429 PMCID: PMC11162346 DOI: 10.1021/acs.analchem.3c04992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2024]
Abstract
Glycans are complex oligosaccharides that are involved in many diseases and biological processes. Unfortunately, current methods for determining glycan composition and structure (glycan sequencing) are laborious and require a high level of expertise. Here, we assess the feasibility of sequencing glycans based on their lectin binding fingerprints. By training a Boltzmann model on lectin binding data, we predict the approximate structures of 88 ± 7% of N-glycans and 87 ± 13% of O-glycans in our test set. We show that our model generalizes well to the pharmaceutically relevant case of Chinese hamster ovary (CHO) cell glycans. We also analyze the motif specificity of a wide array of lectins and identify the most and least predictive lectins and glycan features. These results could help streamline glycoprotein research and be of use to anyone using lectins for glycobiology.
Collapse
Affiliation(s)
- Aria Yom
- Department of Physics, University of California, San Diego, California 92093, United States
| | - Austin Chiang
- Department of Pediatrics, University of California, San Diego, California 92093, United States
- Immunology Center of Georgia, Augusta University, Augusta, Georgia 30912, United States
- Department of Medicine, Augusta University, Augusta, Georgia 30912, United States
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, California 92093, United States
- Department of Bioengineering, University of California, San Diego, California 92093, United States
| |
Collapse
|
2
|
Yom A, Chiang A, Lewis NE. A Boltzmann model predicts glycan structures from lectin binding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.03.543532. [PMID: 37333412 PMCID: PMC10274649 DOI: 10.1101/2023.06.03.543532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Glycans are complex oligosaccharides involved in many diseases and biological processes. Unfortunately, current methods for determining glycan composition and structure (glycan sequencing) are laborious and require a high level of expertise. Here, we assess the feasibility of sequencing glycans based on their lectin binding fingerprints. By training a Boltzmann model on lectin binding data, we predict the approximate structures of 88 ± 7% of N-glycans and 87 ± 13% of O-glycans in our test set. We show that our model generalizes well to the pharmaceutically relevant case of Chinese Hamster Ovary (CHO) cell glycans. We also analyze the motif specificity of a wide array of lectins and identify the most and least predictive lectins and glycan features. These results could help streamline glycoprotein research and be of use to anyone using lectins for glycobiology.
Collapse
Affiliation(s)
- Aria Yom
- Department of Physics, University of California, San Diego. CA 92093, USA
| | - Austin Chiang
- Department of Pediatrics, University of California, San Diego. CA 92093, USA
- Immunology Center of Georgia, Augusta University, Augusta, GA 30912, USA
- Department of Medicine, Augusta University, Augusta, GA 30912, USA
| | - Nathan E Lewis
- Department of Bioengineering, University of California, San Diego. CA 92093, USA
- Department of Pediatrics, University of California, San Diego. CA 92093, USA
| |
Collapse
|
3
|
Akhtar A, Lata M, Sunsunwal S, Yadav A, Lnu K, Subramanian S, Ramya TNC. New carbohydrate binding domains identified by phage display based functional metagenomic screens of human gut microbiota. Commun Biol 2023; 6:371. [PMID: 37019943 PMCID: PMC10076258 DOI: 10.1038/s42003-023-04718-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 03/16/2023] [Indexed: 04/07/2023] Open
Abstract
Uncultured microbes represent a huge untapped biological resource of novel genes and gene products. Although recent genomic and metagenomic sequencing efforts have led to the identification of numerous genes that are homologous to existing annotated genes, there remains, yet, an enormous pool of unannotated genes that do not find significant sequence homology to existing annotated genes. Functional metagenomics offers a way to identify and annotate novel gene products. Here, we use functional metagenomics to mine novel carbohydrate binding domains that might aid human gut commensals in adherence, gut colonization, and metabolism of complex carbohydrates. We report the construction and functional screening of a metagenomic phage display library from healthy human fecal samples against dietary, microbial and host polysaccharides/glycoconjugates. We identify several protein sequences that do not find a hit to any known protein domain but are predicted to contain carbohydrate binding module-like folds. We heterologously express, purify and biochemically characterize some of these protein domains and demonstrate their carbohydrate-binding function. Our study reveals several previously unannotated carbohydrate-binding domains, including a levan binding domain and four complex N-glycan binding domains that might be useful for the labeling, visualization, and isolation of these glycans.
Collapse
Affiliation(s)
- Akil Akhtar
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
- Emory Vaccine Center, Emory University, Atlanta, GA, USA
| | - Madhu Lata
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
| | - Sonali Sunsunwal
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
| | - Amit Yadav
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
| | - Kajal Lnu
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
| | - Srikrishna Subramanian
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India
| | - T N C Ramya
- CSIR- Institute of Microbial Technology, Sector 39-A, Chandigarh, 160036, India.
- Academy of Scientific & Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India.
| |
Collapse
|
4
|
Reeves AE, Huang ML. Proximity labeling technologies to illuminate glycan-protein interactions. Curr Opin Chem Biol 2023; 72:102233. [PMID: 36493526 PMCID: PMC9870929 DOI: 10.1016/j.cbpa.2022.102233] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 11/01/2022] [Accepted: 11/03/2022] [Indexed: 12/12/2022]
Abstract
Glycosylation is a ubiquitous post-translational modification read by glycan-binding proteins (GBP) to encode important functions, but a robust understanding of these interactions and their consequences can be challenging to uncover. Glycan-GBP interactions are transient and weak, making them difficult to capture, and glycosylation is dynamic and heterogenous, necessitating study in native cellular environments to identify endogenous ligands. Proximity labeling, an experimental innovation that labels biomolecules close to a protein of interest, has recently emerged as a powerful strategy to overcome these limitations, allowing interactors to be tagged in cells for subsequent enrichment and identification by mass spectrometry-based proteomics. We will describe this nascent technique and discuss its applications in the last five years with different GBP classes, including Siglecs, galectins, and non-human lectins.
Collapse
Affiliation(s)
- Abigail E Reeves
- Skaggs Graduate School of Chemical and Biological Sciences, Scripps Research, 10550 N. Torrey Pines Rd., La Jolla, CA 92037, USA; Department of Molecular Medicine, Scripps Research, 10550 N. Torrey Pines Rd., La Jolla, CA 92037, USA
| | - Mia L Huang
- Skaggs Graduate School of Chemical and Biological Sciences, Scripps Research, 10550 N. Torrey Pines Rd., La Jolla, CA 92037, USA; Department of Molecular Medicine, Scripps Research, 10550 N. Torrey Pines Rd., La Jolla, CA 92037, USA.
| |
Collapse
|
5
|
VT68.2: An Antibody to Chondroitin Sulfate Proteoglycan 4 (CSPG4) Displays Reactivity against a Tumor-Associated Carbohydrate Antigen. Int J Mol Sci 2023; 24:ijms24032506. [PMID: 36768830 PMCID: PMC9917008 DOI: 10.3390/ijms24032506] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 01/15/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open
Abstract
The anti-CSPG4 monoclonal antibodies (mAbs) have shown anti-tumor activity and therapeutic potential for treating breast cancer. In addition, CSPG4 is a dominant tumor-associated antigen that is also involved in normal-tissue development in humans. Therefore, the potential for off-tumor activity remains a serious concern when targeting CSPG4 therapeutically. Previous work suggested that glycans contribute to the binding of specific anti-CSPG4 antibodies to tumor cells, but the specificity and importance of this contribution are unknown. In this study, the reactivity of anti-CSPG4 mAbs was characterized with a peptide mimetic of carbohydrate antigens expressed in breast cancer. ELISA, flow cytometry, and microarray assays were used to screen mAbs for their ability to bind to carbohydrate-mimicking peptides (CMPs), cancer cells, and glycans. The mAb VT68.2 displayed a distinctly strong binding to a CMP (P10s) and bound to triple-negative breast cancer cells. In addition, VT68.2 showed a higher affinity for N-linked glycans that contain terminal fucose and fucosylated lactosamines. The functional assays demonstrated that VT68.2 inhibited cancer cell migration. These results define the glycoform reactivity of an anti-CSPG4 antibody and may lead to the development of less toxic therapeutic approaches that target tumor-specific glyco-peptides.
Collapse
|
6
|
Bojar D, Meche L, Meng G, Eng W, Smith DF, Cummings RD, Mahal LK. A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities. ACS Chem Biol 2022; 17:2993-3012. [PMID: 35084820 PMCID: PMC9679999 DOI: 10.1021/acschembio.1c00689] [Citation(s) in RCA: 115] [Impact Index Per Article: 57.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Glycans are critical to every facet of biology and medicine, from viral infections to embryogenesis. Tools to study glycans are rapidly evolving; however, the majority of our knowledge is deeply dependent on binding by glycan binding proteins (e.g., lectins). The specificities of lectins, which are often naturally isolated proteins, have not been well-defined, making it difficult to leverage their full potential for glycan analysis. Herein, we use a combination of machine learning algorithms and expert annotation to define lectin specificity for this important probe set. Our analysis uses comprehensive glycan microarray analysis of commercially available lectins we obtained using version 5.0 of the Consortium for Functional Glycomics glycan microarray (CFGv5). This data set was made public in 2011. We report the creation of this data set and its use in large-scale evaluation of lectin-glycan binding behaviors. Our motif analysis was performed by integrating 68 manually defined glycan features with systematic probing of computational rules for significant binding motifs using mono- and disaccharides and linkages. Combining machine learning with manual annotation, we create a detailed interpretation of glycan-binding specificity for 57 unique lectins, categorized by their major binding motifs: mannose, complex-type N-glycan, O-glycan, fucose, sialic acid and sulfate, GlcNAc and chitin, Gal and LacNAc, and GalNAc. Our work provides fresh insights into the complex binding features of commercially available lectins in current use, providing a critical guide to these important reagents.
Collapse
Affiliation(s)
- Daniel Bojar
- Department
of Chemistry and Molecular Biology and Wallenberg Centre for Molecular
and Translational Medicine, University of
Gothenburg, Gothenburg, Sweden 405 30
| | - Lawrence Meche
- Biomedical
Chemistry Institute, Department of Chemistry, New York University, 100 Washington Square East, Room 1001, New
York, New York 10003, United States
| | - Guanmin Meng
- Department
of Chemistry, University of Alberta, Edmonton, Canada, T6G 2G2
| | - William Eng
- Biomedical
Chemistry Institute, Department of Chemistry, New York University, 100 Washington Square East, Room 1001, New
York, New York 10003, United States
| | - David F. Smith
- Department
of Biochemistry, Glycomics Center, School of Medicine, Emory University, Atlanta, Georgia 30322, United States
| | - Richard D. Cummings
- Department
of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, Massachusetts 02115, United States
| | - Lara K. Mahal
- Biomedical
Chemistry Institute, Department of Chemistry, New York University, 100 Washington Square East, Room 1001, New
York, New York 10003, United States,Department
of Chemistry, University of Alberta, Edmonton, Canada, T6G 2G2,E-mail:
| |
Collapse
|
7
|
Carpenter EJ, Seth S, Yue N, Greiner R, Derda R. GlyNet: a multi-task neural network for predicting protein-glycan interactions. Chem Sci 2022; 13:6669-6686. [PMID: 35756507 PMCID: PMC9172296 DOI: 10.1039/d1sc05681f] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 05/02/2022] [Indexed: 12/14/2022] Open
Abstract
Advances in diagnostics, therapeutics, vaccines, transfusion, and organ transplantation build on a fundamental understanding of glycan-protein interactions. To aid this, we developed GlyNet, a model that accurately predicts interactions (relative binding strengths) between mammalian glycans and 352 glycan-binding proteins, many at multiple concentrations. For each glycan input, our model produces 1257 outputs, each representing the relative interaction strength between the input glycan and a particular protein sample. GlyNet learns these continuous values using relative fluorescence units (RFUs) measured on 599 glycans in the Consortium for Functional Glycomics glycan arrays and extrapolates these to RFUs from additional, untested glycans. GlyNet's output of continuous values provides more detailed results than the standard binary classification models. After incorporating a simple threshold to transform such continuous outputs the resulting GlyNet classifier outperforms those standard classifiers. GlyNet is the first multi-output regression model for predicting protein-glycan interactions and serves as an important benchmark, facilitating development of quantitative computational glycobiology.
Collapse
Affiliation(s)
- Eric J Carpenter
- Department of Chemistry, University of Alberta Edmonton Alberta Canada
| | - Shaurya Seth
- Department of Chemistry, University of Alberta Edmonton Alberta Canada
| | - Noel Yue
- Department of Chemistry, University of Alberta Edmonton Alberta Canada
| | - Russell Greiner
- Department of Computing Science, University of Alberta Edmonton Alberta Canada
- Alberta Machine Intelligence Institute (AMII) Edmonton Alberta Canada
| | - Ratmir Derda
- Department of Chemistry, University of Alberta Edmonton Alberta Canada
| |
Collapse
|
8
|
Akune Y, Arpinar S, Silva LM, Palma AS, Tajadura-Ortega V, Aoki-Kinoshita KF, Ranzinger R, Liu Y, Feizi T. OUP accepted manuscript. Glycobiology 2022; 32:552-555. [PMID: 35352122 PMCID: PMC9191619 DOI: 10.1093/glycob/cwac018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 03/07/2022] [Accepted: 03/08/2022] [Indexed: 11/25/2022] Open
Abstract
Glycan microarrays are essential tools in glycobiology and are being widely used for assignment of glycan ligands in diverse glycan recognition systems. We have developed a new software, called Carbohydrate microArray Analysis and Reporting Tool (CarbArrayART), to address the need for a distributable application for glycan microarray data management. The main features of CarbArrayART include: (i) Storage of quantified array data from different array layouts with scan data and array-specific metadata, such as lists of arrayed glycans, array geometry, information on glycan-binding samples, and experimental protocols. (ii) Presentation of microarray data as charts, tables, and heatmaps derived from the average fluorescence intensity values that are calculated based on the imaging scan data and array geometry, as well as filtering and sorting functions according to monosaccharide content and glycan sequences. (iii) Data export for reporting in Word, PDF, and Excel formats, together with metadata that are compliant with the guidelines of MIRAGE (Minimum Information Required for A Glycomics Experiment). CarbArrayART is designed for routine use in recording, storage, and management of any slide-based glycan microarray experiment. In conjunction with the MIRAGE guidelines, CarbArrayART addresses issues that are critical for glycobiology, namely, clarity of data for evaluation of reproducibility and validity.
Collapse
Affiliation(s)
- Yukie Akune
- Corresponding author: The Glycosciences Laboratory, Burlington Danes Building Room 509, Du Cane Road, London W12 0NN, United Kingdom.
| | - Sena Arpinar
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - Lisete M Silva
- Glycosciences Laboratory, Department of Metabolism, Digestion and Reproduction, Imperial College, Du Cane Road, London W12 0NN, United Kingdom
- LAQV-REQUIMTE, Department of Chemistry, University of Aveiro, 3810-193 Aveiro, Portugal
| | - Angelina S Palma
- UCIBIO, Applied Molecular Biosciences Unit, Department of Chemistry, School of Science and Technology, NOVA University Lisbon, 2819-516 Caparica, Portugal
- Associate Laboratory i4HB-Institute for Health and Bioeconomy, School of Science and Technology, NOVA University Lisbon, Lisbon, 2819-516 Caparica, Portugal
| | - Virginia Tajadura-Ortega
- Glycosciences Laboratory, Department of Metabolism, Digestion and Reproduction, Imperial College, Du Cane Road, London W12 0NN, United Kingdom
| | - Kiyoko F Aoki-Kinoshita
- Glycan and Life Systems Integration Center (GaLSIC), Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Rd, Athens, GA 30602, United States
| | - Yan Liu
- Glycosciences Laboratory, Department of Metabolism, Digestion and Reproduction, Imperial College, Du Cane Road, London W12 0NN, United Kingdom
| | - Ten Feizi
- Glycosciences Laboratory, Department of Metabolism, Digestion and Reproduction, Imperial College, Du Cane Road, London W12 0NN, United Kingdom
| |
Collapse
|
9
|
Klamer ZL, Harris CM, Beirne JM, Kelly JE, Zhang J, Haab BB. OUP accepted manuscript. Glycobiology 2022; 32:679-690. [PMID: 35352123 PMCID: PMC9280547 DOI: 10.1093/glycob/cwac022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 03/24/2022] [Accepted: 03/24/2022] [Indexed: 11/12/2022] Open
Abstract
Glycan arrays continue to be the primary resource for determining the glycan-binding specificity of proteins. The volume and diversity of glycan-array data are increasing, but no common method and resource exist to analyze, integrate, and use the available data. To meet this need, we developed a resource of analyzed glycan-array data called CarboGrove. Using the ability to process and interpret data from any type of glycan array, we populated the database with the results from 35 types of glycan arrays, 13 glycan families, 5 experimental methods, and 19 laboratories or companies. In meta-analyses of glycan-binding proteins, we observed glycan-binding specificities that were not uncovered from single sources. In addition, we confirmed the ability to efficiently optimize selections of glycan-binding proteins to be used in experiments for discriminating between closely related motifs. Through descriptive reports and a programmatically accessible Application Programming Interface, CarboGrove yields unprecedented access to the wealth of glycan-array data being produced and powerful capabilities for both experimentalists and bioinformaticians.
Collapse
Affiliation(s)
- Zachary L Klamer
- Department of Cancer and Cell Biology, Van Andel Institute, 333 Bostwick Ave NE, Grand Rapids, MI 49503, United States
| | | | | | | | | | - Brian B Haab
- Corresponding author: Van Andel Institute, 333 Bostwick Ave NE, Grand Rapids, MI 49504, United States.
| |
Collapse
|