1
|
Canner SW, Schnaar RL, Gray JJ. Predictions from Deep Learning Propose Substantial Protein-Carbohydrate Interplay. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.07.641884. [PMID: 40161692 PMCID: PMC11952328 DOI: 10.1101/2025.03.07.641884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
It is a grand challenge to identify all the protein - carbohydrate interactions in an organism. Direct experiments would require extensive libraries of glycans to definitively distinguish binding from non-binding proteins. Computational screening of proteins for carbohydrate-binding provides an attractive and ultimately testable alternative. Recent computational techniques have focused primarily on which protein residues interact with carbohydrates or which carbohydrate species a protein binds to. Current estimates label 1.5 to 5% of proteins as carbohydrate-binding proteins; however, 50-70% of proteins are known to be glycosylated, suggesting a potential wealth of proteins that bind to carbohydrates. We therefore developed a novel dataset and neural network architecture, named Protein interaction of Carbohydrates Predictor (PiCAP), to predict whether a protein non-covalently binds to a carbohydrate. We trained PiCAP on a dataset of known carbohydrate binders, and we selected proteins that we identified as likely not to bind carbohydrates, including DNA-binding transcription factors, cytoskeletal components, selected antibodies, and selected small-molecule-binding proteins. PiCAP achieves a 90% balanced accuracy on protein-level predictions of carbohydrate binding/non-binding. Using the same dataset, we developed a model named Carbohydrate Protein Site Identifier 2 (CAPSIF2) to predict protein residues that interact non-covalently with carbohydrates. CAPSIF2 achieves a Dice coefficient of 0.57 on residue-level predictions on our independent test dataset, outcompeting all previous models for this task. To demonstrate the biological applicability of PiCAP and CAPSIF2, we investigated cell surface proteins of human neural cells and further predicted the likelihood of three proteomes, notably E. coli, M. musculus, and H. sapiens, to bind to carbohydrates. PiCAP predicts that approximately 35-40% of proteins in these proteomes bind carbohydrates, indicating a substantial interplay of protein-carbohydrate interactions for cellular functionality.
Collapse
Affiliation(s)
- Samuel W. Canner
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, United States
| | - Ronald L. Schnaar
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States
- Department of Neuroscience, Johns Hopkins University School of Medicine, Baltimore, Maryland, United States
| | - Jeffrey J. Gray
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, United States
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States
| |
Collapse
|
2
|
Lisacek F, Schnider B, Imberty A. Tools for structural lectinomics: From structures to lectomes. BBA ADVANCES 2025; 7:100154. [PMID: 40166736 PMCID: PMC11957679 DOI: 10.1016/j.bbadva.2025.100154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2024] [Revised: 02/24/2025] [Accepted: 03/05/2025] [Indexed: 04/02/2025] Open
Abstract
Lectins are ubiquitous proteins that interact with glycans in a variety of molecular processes and as such, also play a role in diseases, whether infectious, chronic or cancer-related. The systematic study of lectins is therefore essential, in particular for understanding cell-cell communication. Accumulated protein three-dimensional structural data in the past decades boosted advance in AI-based prediction and opened up new options to characterise lectins that are known to often be multimeric and multivalent. This article reviews the methods to obtain structures of lectins, the current data available for lectin 3D structures and their interactions, how this knowledge is used to classify these proteins and shows that the combination of an array of bioinformatics tools should make the prediction of binding specificity possible in a near future.
Collapse
Affiliation(s)
- Frédérique Lisacek
- SIB Swiss Institute of Bioinformatics CH-1227 Geneva, Switzerland
- Computer Science Department, UniGe CH-1227 Geneva, Switzerland
| | - Boris Schnider
- SIB Swiss Institute of Bioinformatics CH-1227 Geneva, Switzerland
- Computer Science Department, UniGe CH-1227 Geneva, Switzerland
| | - Anne Imberty
- Univ. Grenoble Alpes, CNRS, CERMAV 38000 Grenoble, France
| |
Collapse
|
3
|
Yang Y, Liu S, Li Z, Lai C, Wu H, Li Z, Xia W, Du Q, Huang L, Wang W, Wang X, Chen X. Identification and Optimization of more Efficient Olivetolic Acid Synthases. Cannabis Cannabinoid Res 2024; 9:1482-1491. [PMID: 38237126 PMCID: PMC11685297 DOI: 10.1089/can.2023.0226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2024] Open
Abstract
Introduction: Olivetolic acid (OLA) is a key intermediate in cannabidiol (CBD) synthesis, and cannabinoids are important neuroactive drugs. However, the catalytic activity of olivetolic acid synthase (OLS), the key enzyme involved in OLA biosynthesis, remains low and its catalytic mechanism is unclear. Materials and Methods: In this study, we conducted a scrupulous screening of the pivotal rate-limiting enzyme and analyzed its amino acid sites that are critical to enzyme activity as validated by experiments. Results: Through stringent enzyme screening, we pinpointed a highly active OLS sequence, OLS4. Then, we narrowed down three critical amino acid sites (I258, D198, E196) that significantly influence the OLS activity. Conclusions: Our findings laid the groundwork for the efficient biosynthesis of OLA, and thereby facilitate the biosynthesis of CBD.
Collapse
Affiliation(s)
- Yue Yang
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an, China
| | - Shimeng Liu
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Zihe Li
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an, China
| | - Changlong Lai
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Hao Wu
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Zhenzhu Li
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an, China
| | - Wenhao Xia
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an, China
| | - Qiuhui Du
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Lihui Huang
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Wen Wang
- School of Ecology and Environment, Northwestern Polytechnical University, Xi'an, China
| | - Xiao Wang
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| | - Xianqing Chen
- Jiaxing Synbiolab Biotechnology Co., Ltd., Jiaxing, Zhejiang Province, China
| |
Collapse
|
4
|
Ranaudo A, Giulini M, Pelissou Ayuso A, Bonvin AMJJ. Modeling Protein-Glycan Interactions with HADDOCK. J Chem Inf Model 2024; 64:7816-7825. [PMID: 39360946 PMCID: PMC11480977 DOI: 10.1021/acs.jcim.4c01372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2024] [Revised: 09/23/2024] [Accepted: 09/25/2024] [Indexed: 10/15/2024]
Abstract
The term glycan refers to a broad category of molecules composed of monosaccharide units linked to each other in a variety of ways, whose structural diversity is related to different functions in living organisms. Among others, glycans are recognized by proteins with the aim of carrying information and for signaling purposes. Determining the three-dimensional structures of protein-glycan complexes is essential both for the understanding of the mechanisms glycans are involved in and for applications such as drug design. In this context, molecular docking approaches are of undoubted importance as complementary approaches to experiments. In this study, we show how high ambiguity-driven DOCKing (HADDOCK) can be efficiently used for the prediction of protein-glycan complexes. Using a benchmark of 89 complexes, starting from their bound or unbound forms, and assuming some knowledge of the binding site on the protein, our protocol reaches a 70% and 40% top 5 success rate on bound and unbound data sets, respectively. We show that the main limiting factor is related to the complexity of the glycan to be modeled and the associated conformational flexibility.
Collapse
Affiliation(s)
- Anna Ranaudo
- Department
of Earth and Environmental Sciences, University
of Milano-Bicocca, Piazza Della Scienza 1, Milan 20126, Italy
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Marco Giulini
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Angela Pelissou Ayuso
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| | - Alexandre M. J. J. Bonvin
- Bijvoet
Centre for Biomolecular Research, Faculty of Science - Chemistry, Utrecht University, Padualaan 8, Utrecht 3584CH, The Netherlands
| |
Collapse
|
5
|
Nieto-Fabregat F, Lenza MP, Marseglia A, Di Carluccio C, Molinaro A, Silipo A, Marchetti R. Computational toolbox for the analysis of protein-glycan interactions. Beilstein J Org Chem 2024; 20:2084-2107. [PMID: 39189002 PMCID: PMC11346309 DOI: 10.3762/bjoc.20.180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Accepted: 08/01/2024] [Indexed: 08/28/2024] Open
Abstract
Protein-glycan interactions play pivotal roles in numerous biological processes, ranging from cellular recognition to immune response modulation. Understanding the intricate details of these interactions is crucial for deciphering the molecular mechanisms underlying various physiological and pathological conditions. Computational techniques have emerged as powerful tools that can help in drawing, building and visualising complex biomolecules and provide insights into their dynamic behaviour at atomic and molecular levels. This review provides an overview of the main computational tools useful for studying biomolecular systems, particularly glycans, both in free state and in complex with proteins, also with reference to the principles, methodologies, and applications of all-atom molecular dynamics simulations. Herein, we focused on the programs that are generally employed for preparing protein and glycan input files to execute molecular dynamics simulations and analyse the corresponding results. The presented computational toolbox represents a valuable resource for researchers studying protein-glycan interactions and incorporates advanced computational methods for building, visualising and predicting protein/glycan structures, modelling protein-ligand complexes, and analyse MD outcomes. Moreover, selected case studies have been reported to highlight the importance of computational tools in studying protein-glycan systems, revealing the capability of these tools to provide valuable insights into the binding kinetics, energetics, and structural determinants that govern specific molecular interactions.
Collapse
Affiliation(s)
- Ferran Nieto-Fabregat
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Maria Pia Lenza
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Angela Marseglia
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Cristina Di Carluccio
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Antonio Molinaro
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Alba Silipo
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| | - Roberta Marchetti
- Department of Chemical Sciences, University of Naples Federico II, Via Cinthia 4, 80126, Italy
| |
Collapse
|
6
|
Ha NX, Huong TT, Khanh PN, Hung NP, Loc VT, Ha VT, Quynh DT, Nghi DH, Hai PT, Scarlett CJ, Wessjohann LA, Cuong NM. In Vitro and in Silico Study of New Biscoumarin Glycosides from Paramignya trimera against Angiotensin-Converting Enzyme 2 (ACE-2) for Preventing SARS-CoV-2 Infection. Chem Pharm Bull (Tokyo) 2024; 72:574-583. [PMID: 38866495 DOI: 10.1248/cpb.c23-00844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
In Vietnam, the stems and roots of the Rutaceous plant Paramignya trimera (Oliv.) Burkill (known locally as "Xáo tam phân") are widely used to treat liver diseases such as viral hepatitis and acute and chronic cirrhosis. In an effort to search for Vietnamese natural compounds capable of inhibiting coronavirus based on molecular docking screening, two new dimeric coumarin glycosides, namely cis-paratrimerin B (1) and cis-paratrimerin A (2), and two previously identified coumarins, the trans-isomers paratrimerin B (3) and paratrimerin A (4), were isolated from the roots of P. trimera and tested for their anti-angiotensin-converting enzyme 2 (ACE-2) inhibitory properties in vitro. It was discovered that ACE-2 enzyme was inhibited by cis-paratrimerin B (1), cis-paratrimerin A (2), and trans-paratrimerin B (3), with IC50 values of 28.9, 68, and 77 µM, respectively. Docking simulations revealed that four biscoumarin glycosides had good binding energies (∆G values ranging from -10.6 to -14.7 kcal/mol) and mostly bound to the S1' subsite of the ACE-2 protein. The key interactions of these natural ligands include metal chelation with zinc ions and multiple H-bonds with Ser128, Glu145, His345, Lys363, Thr371, Glu406, and Tyr803. Our findings demonstrated that biscoumarin glycosides from P. trimera roots occur naturally in both cis- and trans-diastereomeric forms. The biscoumarin glycosides Lys363, Thr371, Glu406, and Tyr803. Our findings demonstrated that biscoumarin glycosides from P. trimera roots hold potential for further studies as natural ACE-2 inhibitors for preventing severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection.
Collapse
Affiliation(s)
- Nguyen Xuan Ha
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
- Graduate University of Science and Technology, Vietnam Academy of Science and Technology
| | - Tran Thu Huong
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Pham Ngoc Khanh
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
- Graduate University of Science and Technology, Vietnam Academy of Science and Technology
| | - Nguyen Phi Hung
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Vu Thanh Loc
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Vu Thi Ha
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Dang Thu Quynh
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Do Huu Nghi
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
| | - Pham The Hai
- University of Science and Technology of Hanoi, Vietnam Academy of Science and Technology
| | - Christopher J Scarlett
- School of Environmental & Life Sciences, College of Engineering, Science and Environment, The University of Newcastle
| | - Ludger A Wessjohann
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry (IPB)
| | - Nguyen Manh Cuong
- Institute of Natural Products Chemistry, Vietnam Academy of Science and Technology
- Graduate University of Science and Technology, Vietnam Academy of Science and Technology
| |
Collapse
|
7
|
Ertelt M, Mulligan VK, Maguire JB, Lyskov S, Moretti R, Schiffner T, Meiler J, Schoeder CT. Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins. PLoS Comput Biol 2024; 20:e1011939. [PMID: 38484014 PMCID: PMC10965067 DOI: 10.1371/journal.pcbi.1011939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 03/26/2024] [Accepted: 02/20/2024] [Indexed: 03/27/2024] Open
Abstract
Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta's protein engineering toolbox that allow for the rational design of PTMs.
Collapse
Affiliation(s)
- Moritz Ertelt
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, New York, New York, United States of America
| | - Jack B. Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Torben Schiffner
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
| | - Jens Meiler
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Clara T. Schoeder
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| |
Collapse
|
8
|
Zhao X, Li H, Zhang K, Huang SY. Iterative Knowledge-Based Scoring Function for Protein-Ligand Interactions by Considering Binding Affinity Information. J Phys Chem B 2023; 127:9021-9034. [PMID: 37822259 DOI: 10.1021/acs.jpcb.3c04421] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Scoring functions for protein-ligand interactions play a critical role in structure-based drug design. Owing to the good balance between general applicability and computational efficiency, knowledge-based scoring functions have obtained significant advancements and achieved many successes. Nevertheless, knowledge-based scoring functions face a challenge in utilizing the experimental affinity data and thus may not perform well in binding affinity prediction. Addressing the challenge, we have proposed an improved version of the iterative knowledge-based scoring function ITScore by considering binding affinity information, which is referred to as ITScoreAff, based on a large training set of 6216 protein-ligand complexes with both structures and affinity data. ITScoreAff was extensively evaluated and compared with ITScore, 33 traditional, and 6 machine learning scoring functions in terms of docking power, ranking power, and screening power on the independent CASF-2016 benchmark. It was shown that ITScoreAff obtained an overall better performance than the other 40 scoring functions and gave an average success rate of 85.3% in docking power, a correlation coefficient of 0.723 in scoring power, and an average rank correlation coefficient of 0.668 in ranking power. In addition, ITScoreAff also achieved the overall best screening power when the top 10% of the ranked database were considered. These results demonstrated the robustness of ITScoreAff and its improvement over existing scoring functions.
Collapse
Affiliation(s)
- Xuejun Zhao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Hao Li
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Keqiong Zhang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
9
|
Canner SW, Shanker S, Gray JJ. Structure-based neural network protein-carbohydrate interaction predictions at the residue level. FRONTIERS IN BIOINFORMATICS 2023; 3:1186531. [PMID: 37409346 PMCID: PMC10318439 DOI: 10.3389/fbinf.2023.1186531] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 05/31/2023] [Indexed: 07/07/2023] Open
Abstract
Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate-binding sites on any given protein. Here, we present two deep learning (DL) models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predicts non-covalent carbohydrate-binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate-binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2-predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
Collapse
Affiliation(s)
- Samuel W. Canner
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States
| | - Sudhanshu Shanker
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States
| | - Jeffrey J. Gray
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States
| |
Collapse
|
10
|
Canner SW, Shanker S, Gray JJ. Structure-Based Neural Network Protein-Carbohydrate Interaction Predictions at the Residue Level. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.14.531382. [PMID: 36993750 PMCID: PMC10054975 DOI: 10.1101/2023.03.14.531382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
Carbohydrates dynamically and transiently interact with proteins for cell-cell recognition, cellular differentiation, immune response, and many other cellular processes. Despite the molecular importance of these interactions, there are currently few reliable computational tools to predict potential carbohydrate binding sites on any given protein. Here, we present two deep learning models named CArbohydrate-Protein interaction Site IdentiFier (CAPSIF) that predict carbohydrate binding sites on proteins: (1) a 3D-UNet voxel-based neural network model (CAPSIF:V) and (2) an equivariant graph neural network model (CAPSIF:G). While both models outperform previous surrogate methods used for carbohydrate binding site prediction, CAPSIF:V performs better than CAPSIF:G, achieving test Dice scores of 0.597 and 0.543 and test set Matthews correlation coefficients (MCCs) of 0.599 and 0.538, respectively. We further tested CAPSIF:V on AlphaFold2-predicted protein structures. CAPSIF:V performed equivalently on both experimentally determined structures and AlphaFold2 predicted structures. Finally, we demonstrate how CAPSIF models can be used in conjunction with local glycan-docking protocols, such as GlycanDock, to predict bound protein-carbohydrate structures.
Collapse
Affiliation(s)
- Samuel W Canner
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States of America
| | - Sudhanshu Shanker
- Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| | - Jeffrey J Gray
- Program in Molecular Biophysics, The Johns Hopkins University, Baltimore, MD, United States of America
- Dept. of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| |
Collapse
|
11
|
Meumann N, Cabanes-Creus M, Ertelt M, Navarro RG, Lucifora J, Yuan Q, Nien-Huber K, Abdelrahman A, Vu XK, Zhang L, Franke AC, Schmithals C, Piiper A, Vogt A, Gonzalez-Carmona M, Frueh JT, Ullrich E, Meuleman P, Talbot SR, Odenthal M, Ott M, Seifried E, Schoeder CT, Schwäble J, Lisowski L, Büning H. Adeno-associated virus serotype 2 capsid variants for improved liver-directed gene therapy. Hepatology 2023; 77:802-815. [PMID: 35976053 PMCID: PMC9936986 DOI: 10.1002/hep.32733] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Revised: 07/29/2022] [Accepted: 08/07/2022] [Indexed: 12/08/2022]
Abstract
BACKGROUND AND AIMS Current liver-directed gene therapies look for adeno-associated virus (AAV) vectors with improved efficacy. With this background, capsid engineering is explored. Whereas shuffled capsid library screenings have resulted in potent liver targeting variants with one first vector in human clinical trials, modifying natural serotypes by peptide insertion has so far been less successful. Here, we now report on two capsid variants, MLIV.K and MLIV.A, both derived from a high-throughput in vivo AAV peptide display selection screen in mice. APPROACH AND RESULTS The variants transduce primary murine and human hepatocytes at comparable efficiencies, a valuable feature in clinical development, and show significantly improved liver transduction efficacy, thereby allowing a dose reduction, and outperform parental AAV2 and AAV8 in targeting human hepatocytes in humanized mice. The natural heparan sulfate proteoglycan binding ability is markedly reduced, a feature that correlates with improved hepatocyte transduction. A further property that might contribute to the improved transduction efficacy is the lower capsid melting temperature. Peptide insertion also caused a moderate change in sensitivity to human sera containing anti-AAV2 neutralizing antibodies, revealing the impact of epitopes located at the basis of the AAV capsid protrusions. CONCLUSIONS In conclusion, MLIV.K and MLIV.A are AAV peptide display variants selected in immunocompetent mice with improved hepatocyte tropism and transduction efficiency. Because these features are maintained across species, MLIV variants provide remarkable potential for translation of therapeutic approaches from mice to men.
Collapse
Affiliation(s)
- Nadja Meumann
- Institute of Experimental Hematology , Hannover Medical School , Hannover , Germany.,Center for Molecular Medicine Cologne , University of Cologne , Cologne , Germany
| | - Marti Cabanes-Creus
- Translational Vectorology Research Unit , Children's Medical Research Institute , The University of Sydney , Sydney , New South Wales , Australia
| | - Moritz Ertelt
- Institute for Drug Discovery , University Leipzig Medical School , Leipzig , Germany.,Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI , Dresden/Leipzig , Germany
| | - Renina Gale Navarro
- Translational Vectorology Research Unit , Children's Medical Research Institute , The University of Sydney , Sydney , New South Wales , Australia
| | - Julie Lucifora
- Cancer Research Center of Lyon , Institut National de la Santé et la Recherche Médicale , Lyon , France
| | - Qinggong Yuan
- Department of Gastroenterology, Hepatology, and Endocrinology , Hannover Medical School , Hannover , Germany.,Twincore Centre for Experimental and Clinical Infection Research , Hannover , Germany
| | - Karin Nien-Huber
- Institute for Transfusion Medicine and Immunohematology , Goethe University Hospital Medical School , German Red Cross Blood Donor Service , Frankfurt , Germany
| | - Ahmed Abdelrahman
- Institute for Transfusion Medicine and Immunohematology , Goethe University Hospital Medical School , German Red Cross Blood Donor Service , Frankfurt , Germany
| | - Xuan-Khang Vu
- Institute of Experimental Hematology , Hannover Medical School , Hannover , Germany
| | - Liang Zhang
- Center for Molecular Medicine Cologne , University of Cologne , Cologne , Germany.,Institute of Pathology , University Hospital Cologne , Cologne , Germany
| | - Ann-Christin Franke
- Institute of Experimental Hematology , Hannover Medical School , Hannover , Germany
| | - Christian Schmithals
- Department of Internal Medicine I , University Hospital Frankfurt , Frankfurt , Germany
| | - Albrecht Piiper
- Department of Internal Medicine I , University Hospital Frankfurt , Frankfurt , Germany
| | - Annabelle Vogt
- Department of Internal Medicine I , University Hospital Bonn , Bonn , Germany
| | | | - Jochen T Frueh
- Experimental Immunology , Children's University Hospital , Goethe University Frankfurt , Frankfurt am Main , Germany
| | - Evelyn Ullrich
- Experimental Immunology , Children's University Hospital , Goethe University Frankfurt , Frankfurt am Main , Germany
| | - Philip Meuleman
- Laboratory of Liver Infectious Diseases , Faculty of Medicine and Health Sciences , Ghent University , Ghent , Belgium
| | - Steven R Talbot
- Institute for Laboratory Animal Science , Hannover Medical School , Hannover , Germany
| | - Margarete Odenthal
- Center for Molecular Medicine Cologne , University of Cologne , Cologne , Germany.,Institute of Pathology , University Hospital Cologne , Cologne , Germany
| | - Michael Ott
- Department of Gastroenterology, Hepatology, and Endocrinology , Hannover Medical School , Hannover , Germany.,Twincore Centre for Experimental and Clinical Infection Research , Hannover , Germany
| | - Erhard Seifried
- Institute for Transfusion Medicine and Immunohematology , Goethe University Hospital Medical School , German Red Cross Blood Donor Service , Frankfurt , Germany
| | - Clara T Schoeder
- Institute for Drug Discovery , University Leipzig Medical School , Leipzig , Germany
| | - Joachim Schwäble
- Institute for Transfusion Medicine and Immunohematology , Goethe University Hospital Medical School , German Red Cross Blood Donor Service , Frankfurt , Germany
| | - Leszek Lisowski
- Translational Vectorology Research Unit , Children's Medical Research Institute , The University of Sydney , Sydney , New South Wales , Australia.,Military Institute of Medicine , Laboratory of Molecular Oncology and Innovative Therapies , Warsaw , Poland
| | - Hildegard Büning
- Institute of Experimental Hematology , Hannover Medical School , Hannover , Germany.,Center for Molecular Medicine Cologne , University of Cologne , Cologne , Germany
| |
Collapse
|
12
|
Abstract
Glycoscience assembles all the scientific disciplines involved in studying various molecules and macromolecules containing carbohydrates and complex glycans. Such an ensemble involves one of the most extensive sets of molecules in quantity and occurrence since they occur in all microorganisms and higher organisms. Once the compositions and sequences of these molecules are established, the determination of their three-dimensional structural and dynamical features is a step toward understanding the molecular basis underlying their properties and functions. The range of the relevant computational methods capable of addressing such issues is anchored by the specificity of stereoelectronic effects from quantum chemistry to mesoscale modeling throughout molecular dynamics and mechanics and coarse-grained and docking calculations. The Review leads the reader through the detailed presentations of the applications of computational modeling. The illustrations cover carbohydrate-carbohydrate interactions, glycolipids, and N- and O-linked glycans, emphasizing their role in SARS-CoV-2. The presentation continues with the structure of polysaccharides in solution and solid-state and lipopolysaccharides in membranes. The full range of protein-carbohydrate interactions is presented, as exemplified by carbohydrate-active enzymes, transporters, lectins, antibodies, and glycosaminoglycan binding proteins. A final section features a list of 150 tools and databases to help address the many issues of structural glycobioinformatics.
Collapse
Affiliation(s)
- Serge Perez
- Centre de Recherche sur les Macromolecules Vegetales, University of Grenoble-Alpes, Centre National de la Recherche Scientifique, Grenoble F-38041, France
| | - Olga Makshakova
- FRC Kazan Scientific Center of Russian Academy of Sciences, Kazan Institute of Biochemistry and Biophysics, Kazan 420111, Russia
| |
Collapse
|
13
|
Nguyen TB, Pires DEV, Ascher DB. CSM-carbohydrate: protein-carbohydrate binding affinity prediction and docking scoring function. Brief Bioinform 2021; 23:6457169. [PMID: 34882232 DOI: 10.1093/bib/bbab512] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 11/06/2021] [Accepted: 11/08/2021] [Indexed: 12/29/2022] Open
Abstract
Protein-carbohydrate interactions are crucial for many cellular processes but can be challenging to biologically characterise. To improve our understanding and ability to model these molecular interactions, we used a carefully curated set of 370 protein-carbohydrate complexes with experimental structural and biophysical data in order to train and validate a new tool, cutoff scanning matrix (CSM)-carbohydrate, using machine learning algorithms to accurately predict their binding affinity and rank docking poses as a scoring function. Information on both protein and carbohydrate complementarity, in terms of shape and chemistry, was captured using graph-based structural signatures. Across both training and independent test sets, we achieved comparable Pearson's correlations of 0.72 under cross-validation [root mean square error (RMSE) of 1.58 Kcal/mol] and 0.67 on the independent test (RMSE of 1.72 Kcal/mol), providing confidence in the generalisability and robustness of the final model. Similar performance was obtained across mono-, di- and oligosaccharides, further highlighting the applicability of this approach to the study of larger complexes. We show CSM-carbohydrate significantly outperformed previous approaches and have implemented our method and make all data freely available through both a user-friendly web interface and application programming interface, to facilitate programmatic access at http://biosig.unimelb.edu.au/csm_carbohydrate/. We believe CSM-carbohydrate will be an invaluable tool for helping assess docking poses and the effects of mutations on protein-carbohydrate affinity, unravelling important aspects that drive binding recognition.
Collapse
Affiliation(s)
- Thanh Binh Nguyen
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
| | - Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia.,Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
14
|
Koehler Leman J, Lyskov S, Lewis SM, Adolf-Bryfogle J, Alford RF, Barlow K, Ben-Aharon Z, Farrell D, Fell J, Hansen WA, Harmalkar A, Jeliazkov J, Kuenze G, Krys JD, Ljubetič A, Loshbaugh AL, Maguire J, Moretti R, Mulligan VK, Nance ML, Nguyen PT, Ó Conchúir S, Roy Burman SS, Samanta R, Smith ST, Teets F, Tiemann JKS, Watkins A, Woods H, Yachnin BJ, Bahl CD, Bailey-Kellogg C, Baker D, Das R, DiMaio F, Khare SD, Kortemme T, Labonte JW, Lindorff-Larsen K, Meiler J, Schief W, Schueler-Furman O, Siegel JB, Stein A, Yarov-Yarovoy V, Kuhlman B, Leaver-Fay A, Gront D, Gray JJ, Bonneau R. Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks. Nat Commun 2021; 12:6947. [PMID: 34845212 PMCID: PMC8630030 DOI: 10.1038/s41467-021-27222-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 11/02/2021] [Indexed: 01/14/2023] Open
Abstract
Each year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Steven M Lewis
- Cyrus Biotechnology, 1201 Second Ave, Suite 900, Seattle, WA, 98101, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kyle Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Ziv Ben-Aharon
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Daniel Farrell
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Jason Fell
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - William A Hansen
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Jeliazko Jeliazkov
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - Justyna D Krys
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Ajasja Ljubetič
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Phuong T Nguyen
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Shane Ó Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Shannon T Smith
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Frank Teets
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Johanna K S Tiemann
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Hope Woods
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Chemical and Physical Biology Program, Vanderbilt University, Nashville, TN, 37235, USA
| | - Brahm J Yachnin
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA, 02115, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02115, USA
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Sagar D Khare
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
- Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, 08904, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, 94158, USA
- Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, 94158, USA
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, 37235, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, 37235, USA
- Institute for Drug Discovery, Medical School, Leipzig University, 04103, Leipzig, Germany
| | - William Schief
- Department of Immunology and Microbiology, Scripps Research, La Jolla, CA, 92037, USA
- IAVI Neutralizing Antibody Center, Scripps Research, La Jolla, CA, 92037, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, Hebrew University, Hadassah Medical School, POB 12272, Jerusalem, 91120, Israel
| | - Justin B Siegel
- Genome Center, University of California, Davis, CA, 95616, USA
- Department of Biochemistry & Molecular Medicine, University of California, Davis, CA, 95616, USA
- Department of Chemistry, University of California, Davis, CA, 95616, USA
| | - Amelie Stein
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, DK-2200, Copenhagen N., Denmark
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, School of Medicine, University of California, Davis, CA, 95616, USA
| | - Brian Kuhlman
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Andrew Leaver-Fay
- Department of Bioochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, 10010, USA.
- Department of Biology, New York University, New York, NY, 10003, USA.
- Department of Computer Science, New York University, New York, NY, 10003, USA.
| |
Collapse
|