1
|
Zhao Y, Liang Y, Luo G, Li Y, Han X, Wen M. Sequence-Structure Analysis Unlocking the Potential Functional Application of the Local 3D Motifs of Plant-Derived Diterpene Synthases. Biomolecules 2024; 14:120. [PMID: 38254720 PMCID: PMC10813164 DOI: 10.3390/biom14010120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 12/31/2023] [Accepted: 01/11/2024] [Indexed: 01/24/2024] Open
Abstract
Plant-derived diterpene synthases (PdiTPSs) play a critical role in the formation of structurally and functionally diverse diterpenoids. However, the specificity or functional-related features of PdiTPSs are not well understood. For a more profound insight, we collected, constructed, and curated 199 functionally characterized PdiTPSs and their corresponding 3D structures. The complex correlations among their sequences, domains, structures, and corresponding products were comprehensively analyzed. Ultimately, our focus narrowed to the geometric arrangement of local structures. We found that local structural alignment can rapidly localize product-specific residues that have been validated by mutagenesis experiments. Based on the 3D motifs derived from the residues around the substrate, we successfully searched diterpene synthases (diTPSs) from the predicted terpene synthases and newly characterized PdiTPSs, suggesting that the identified 3D motifs can serve as distinctive signatures in diTPSs (I and II class). Local structural analysis revealed the PdiTPSs with more conserved amino acid residues show features unique to class I and class II, whereas those with fewer conserved amino acid residues typically exhibit product diversity and specificity. These results provide an attractive method for discovering novel or functionally equivalent enzymes and probing the product specificity in cases where enzyme characterization is limited.
Collapse
Affiliation(s)
- Yalan Zhao
- National Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650091, China; (Y.Z.); (Y.L.); (G.L.); (X.H.)
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Yupeng Liang
- National Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650091, China; (Y.Z.); (Y.L.); (G.L.); (X.H.)
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Gan Luo
- National Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650091, China; (Y.Z.); (Y.L.); (G.L.); (X.H.)
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Yi Li
- College of Mathematics and Computer Science, Dali University, Dali 671003, China
| | - Xiulin Han
- National Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650091, China; (Y.Z.); (Y.L.); (G.L.); (X.H.)
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
| | - Mengliang Wen
- National Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming 650091, China; (Y.Z.); (Y.L.); (G.L.); (X.H.)
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China
| |
Collapse
|
2
|
Sankar S, Preeti P, Ravikumar K, Kumar A, Prasad Y, Pal S, Rao DN, Savithri HS, Chandra N. Structural similarities between SAM and ATP recognition motifs and detection of ATP binding in a SAM binding DNA methyltransferase. Curr Res Struct Biol 2023; 6:100108. [PMID: 38106461 PMCID: PMC10724544 DOI: 10.1016/j.crstbi.2023.100108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 10/05/2023] [Accepted: 10/24/2023] [Indexed: 12/19/2023] Open
Abstract
S-adenosylmethionine (SAM) is a ubiquitous co-factor that serves as a donor for methylation reactions and additionally serves as a donor of other functional groups such as amino and ribosyl moieties in a variety of other biochemical reactions. Such versatility in function is enabled by the ability of SAM to be recognized by a wide variety of protein molecules that vary in their sequences and structural folds. To understand what gives rise to specific SAM binding in diverse proteins, we set out to study if there are any structural patterns at their binding sites. A comprehensive analysis of structures of the binding sites of SAM by all-pair comparison and clustering, indicated the presence of 4 different site-types, only one among them being well studied. For each site-type we decipher the common minimum principle involved in SAM recognition by diverse proteins and derive structural motifs that are characteristic of SAM binding. The presence of the structural motifs with precise three-dimensional arrangement of amino acids in SAM sites that appear to have evolved independently, indicates that these are winning arrangements of residues to bring about SAM recognition. Further, we find high similarity between one of the SAM site types and a well known ATP binding site type. We demonstrate using in vitro experiments that a known SAM binding protein, HpyAII.M1, a type 2 methyltransferase can bind and hydrolyse ATP. We find common structural motifs that explain this, further supported through site-directed mutagenesis. Observation of similar motifs for binding two of the most ubiquitous ligands in multiple protein families with diverse sequences and structural folds presents compelling evidence at the molecular level in favour of convergent evolution.
Collapse
Affiliation(s)
- Santhosh Sankar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Preeti Preeti
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Kavya Ravikumar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Amrendra Kumar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Yedu Prasad
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Sukriti Pal
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Desirazu N. Rao
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Handanahal S. Savithri
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
- Department of BioEngineering, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| |
Collapse
|
3
|
Eguida M, Rognan D. Estimating the Similarity between Protein Pockets. Int J Mol Sci 2022; 23:12462. [PMID: 36293316 PMCID: PMC9604425 DOI: 10.3390/ijms232012462] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 10/15/2022] [Accepted: 10/16/2022] [Indexed: 10/28/2023] Open
Abstract
With the exponential increase in publicly available protein structures, the comparison of protein binding sites naturally emerged as a scientific topic to explain observations or generate hypotheses for ligand design, notably to predict ligand selectivity for on- and off-targets, explain polypharmacology, and design target-focused libraries. The current review summarizes the state-of-the-art computational methods applied to pocket detection and comparison as well as structural druggability estimates. The major strengths and weaknesses of current pocket descriptors, alignment methods, and similarity search algorithms are presented. Lastly, an exhaustive survey of both retrospective and prospective applications in diverse medicinal chemistry scenarios illustrates the capability of the existing methods and the hurdle that still needs to be overcome for more accurate predictions.
Collapse
Affiliation(s)
| | - Didier Rognan
- Laboratoire d’Innovation Thérapeutique, UMR7200 CNRS-Université de Strasbourg, 67400 Illkirch, France
| |
Collapse
|
4
|
Sankar S, Chandran Sakthivel N, Chandra N. Fast Local Alignment of Protein Pockets (FLAPP): A System-Compiled Program for Large-Scale Binding Site Alignment. J Chem Inf Model 2022; 62:4810-4819. [PMID: 36122166 DOI: 10.1021/acs.jcim.2c00967] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Protein function is a direct consequence of its sequence, structure, and the arrangement at the binding site. Bioinformatics using sequence analysis is typically used to gain a first insight into protein function. Protein structures, on the other hand, provide a higher resolution platform into understanding functions. As the protein structural information is increasingly becoming available through experimental structure determination and through advances in computational methods for structure prediction, the opportunity to utilize these data is also increasing. Structural analysis of small molecule ligand binding sites in particular provides a direct and more accurate window to infer protein function. However, it remains a poorly utilized resource due to the huge computational cost of existing methods that make large-scale structural comparisons of binding sites prohibitive. Here, we present an algorithm called FLAPP that produces very rapid atomic level alignments. By combining clique matching in graphs and the power of modern CPU architectures, FLAPP aligns a typical pair of binding sites at ∼12.5 ms using a single CPU core, ∼1 ms using 12 cores on a standard desktop machine, and performs a PDB-wide scan in 1-2 min. We perform rigorous validation of the algorithm at multiple levels of complexity and show that FLAPP provides accurate alignments. We also present a case study involving vitamin B12 binding sites to showcase the usefulness of FLAPP for performing an exhaustive alignment-based PDB-wide scan. We expect that this tool will be invaluable to the scientific community to quickly align millions of site pairs on a normal desktop machine to gain insights into protein function and drug discovery for drug target and off-target identification and polypharmacology.
Collapse
Affiliation(s)
- Santhosh Sankar
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | | | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India.,BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, Karnataka, India
| |
Collapse
|
5
|
Shi W, Singha M, Pu L, Srivastava G, Ramanujam J, Brylinski M. GraphSite: Ligand Binding Site Classification with Deep Graph Learning. Biomolecules 2022; 12:biom12081053. [PMID: 36008947 PMCID: PMC9405584 DOI: 10.3390/biom12081053] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 07/18/2022] [Accepted: 07/20/2022] [Indexed: 12/10/2022] Open
Abstract
The binding of small organic molecules to protein targets is fundamental to a wide array of cellular functions. It is also routinely exploited to develop new therapeutic strategies against a variety of diseases. On that account, the ability to effectively detect and classify ligand binding sites in proteins is of paramount importance to modern structure-based drug discovery. These complex and non-trivial tasks require sophisticated algorithms from the field of artificial intelligence to achieve a high prediction accuracy. In this communication, we describe GraphSite, a deep learning-based method utilizing a graph representation of local protein structures and a state-of-the-art graph neural network to classify ligand binding sites. Using neural weighted message passing layers to effectively capture the structural, physicochemical, and evolutionary characteristics of binding pockets mitigates model overfitting and improves the classification accuracy. Indeed, comprehensive cross-validation benchmarks against a large dataset of binding pockets belonging to 14 diverse functional classes demonstrate that GraphSite yields the class-weighted F1-score of 81.7%, outperforming other approaches such as molecular docking and binding site matching. Further, it also generalizes well to unseen data with the F1-score of 70.7%, which is the expected performance in real-world applications. We also discuss new directions to improve and extend GraphSite in the future.
Collapse
Affiliation(s)
- Wentao Shi
- Division of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (W.S.); (J.R.)
| | - Manali Singha
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; (M.S.); (G.S.)
| | - Limeng Pu
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA 70803, USA;
| | - Gopal Srivastava
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; (M.S.); (G.S.)
| | - Jagannathan Ramanujam
- Division of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA 70803, USA; (W.S.); (J.R.)
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA 70803, USA;
| | - Michal Brylinski
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; (M.S.); (G.S.)
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA 70803, USA;
- Correspondence: ; Tel.: +1-(225)-578-2791; Fax: +1-(225)-578-2597
| |
Collapse
|
6
|
Mazola Y, Márquez Montesinos JCE, Ramírez D, Zúñiga L, Decher N, Ravens U, Yarov-Yarovoy V, González W. Common Structural Pattern for Flecainide Binding in Atrial-Selective Kv1.5 and Nav1.5 Channels: A Computational Approach. Pharmaceutics 2022; 14:pharmaceutics14071356. [PMID: 35890252 PMCID: PMC9318806 DOI: 10.3390/pharmaceutics14071356] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Revised: 06/20/2022] [Accepted: 06/20/2022] [Indexed: 02/04/2023] Open
Abstract
Atrial fibrillation (AF) is the most common cardiac arrhythmia. Its treatment includes antiarrhythmic drugs (AADs) to modulate the function of cardiac ion channels. However, AADs have been limited by proarrhythmic effects, non-cardiovascular toxicities as well as often modest antiarrhythmic efficacy. Theoretical models showed that a combined blockade of Nav1.5 (and its current, INa) and Kv1.5 (and its current, IKur) ion channels yield a synergistic anti-arrhythmic effect without alterations in ventricles. We focused on Kv1.5 and Nav1.5 to search for structural similarities in their binding site (BS) for flecainide (a common blocker and widely prescribed AAD) as a first step for prospective rational multi-target directed ligand (MTDL) design strategies. We present a computational workflow for a flecainide BS comparison in a flecainide-Kv1.5 docking model and a solved structure of the flecainide-Nav1.5 complex. The workflow includes docking, molecular dynamics, BS characterization and pattern matching. We identified a common structural pattern in flecainide BS for these channels. The latter belongs to the central cavity and consists of a hydrophobic patch and a polar region, involving residues from the S6 helix and P-loop. Since the rational MTDL design for AF is still incipient, our findings could advance multi-target atrial-selective strategies for AF treatment.
Collapse
Affiliation(s)
- Yuliet Mazola
- Center for Bioinformatics, Simulation and Modeling (CBSM), Universidad de Talca, Talca 3460000, Chile; (Y.M.); (J.C.E.M.M.)
| | - José C. E. Márquez Montesinos
- Center for Bioinformatics, Simulation and Modeling (CBSM), Universidad de Talca, Talca 3460000, Chile; (Y.M.); (J.C.E.M.M.)
| | - David Ramírez
- Departamento de Farmacología, Facultad de Ciencias Biológicas, Universidad de Concepción, Concepción 4030000, Chile;
| | - Leandro Zúñiga
- Escuela de Medicina, Centro de Investigaciones Médicas, Universidad de Talca, Talca 3460000, Chile;
| | - Niels Decher
- Institute for Physiology and Pathophysiology, Vegetative Physiology, Philipps-University of Marburg, 35043 Marburg, Germany;
| | - Ursula Ravens
- Institut für Experimentelle Kardiovaskuläre Medizin, Universitäts-Herzzentrum Freiburg Bad Krotzingen, 79110 Freiburg im Breisgau, Germany;
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, University of California, Davis, CA 95616, USA;
| | - Wendy González
- Center for Bioinformatics, Simulation and Modeling (CBSM), Universidad de Talca, Talca 3460000, Chile; (Y.M.); (J.C.E.M.M.)
- Millennium Nucleus of Ion Channels-Associated Diseases (MiNICAD), Talca 3530000, Chile
- Correspondence:
| |
Collapse
|
7
|
Valdés-Jiménez A, Jiménez-González D, Kiper AK, Rinné S, Decher N, González W, Reyes-Parada M, Núñez-Vivanco G. A New Strategy for Multitarget Drug Discovery/Repositioning Through the Identification of Similar 3D Amino Acid Patterns Among Proteins Structures: The Case of Tafluprost and its Effects on Cardiac Ion Channels. Front Pharmacol 2022; 13:855792. [PMID: 35370665 PMCID: PMC8971525 DOI: 10.3389/fphar.2022.855792] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2022] [Accepted: 02/21/2022] [Indexed: 01/01/2023] Open
Abstract
The identification of similar three-dimensional (3D) amino acid patterns among different proteins might be helpful to explain the polypharmacological profile of many currently used drugs. Also, it would be a reasonable first step for the design of novel multitarget compounds. Most of the current computational tools employed for this aim are limited to the comparisons among known binding sites, and do not consider several additional important 3D patterns such as allosteric sites or other conserved motifs. In the present work, we introduce Geomfinder2.0, which is a new and improved version of our previously described algorithm for the deep exploration and discovery of similar and druggable 3D patterns. As compared with the original version, substantial improvements that have been incorporated to our software allow: (i) to compare quaternary structures, (ii) to deal with a list of pairs of structures, (iii) to know how druggable is the zone where similar 3D patterns are detected and (iv) to significantly reduce the execution time. Thus, the new algorithm achieves up to 353x speedup as compared to the previous sequential version, allowing the exploration of a significant number of quaternary structures in a reasonable time. In order to illustrate the potential of the updated Geomfinder version, we show a case of use in which similar 3D patterns were detected in the cardiac ions channels NaV1.5 and TASK-1. These channels are quite different in terms of structure, sequence and function and both have been regarded as important targets for drugs aimed at treating atrial fibrillation. Finally, we describe the in vitro effects of tafluprost (a drug currently used to treat glaucoma, which was identified as a novel putative ligand of NaV1.5 and TASK-1) upon both ion channels’ activity and discuss its possible repositioning as a novel antiarrhythmic drug.
Collapse
Affiliation(s)
- Alejandro Valdés-Jiménez
- Center for Bioinformatics, Simulations and Modelling, Faculty of Engineering, University of Talca, Talca, Chile
- Computer Architecture Department, Universitat Politécnica de Catalunya, Barcelona, Spain
| | - Daniel Jiménez-González
- Computer Architecture Department, Universitat Politécnica de Catalunya, Barcelona, Spain
- Barcelona Supercomputing Center, Barcelona, Spain
| | - Aytug K. Kiper
- Institute for Physiology and Pathophysiology, Philipps-University Marburg, Marburg, Germany
| | - Susanne Rinné
- Institute for Physiology and Pathophysiology, Philipps-University Marburg, Marburg, Germany
| | - Niels Decher
- Institute for Physiology and Pathophysiology, Philipps-University Marburg, Marburg, Germany
| | - Wendy González
- Center for Bioinformatics, Simulations and Modelling, Faculty of Engineering, University of Talca, Talca, Chile
- Millennium Nucleus of Ion Channels-Associated Diseases (MiNICAD), Universidad de Talca, Talca, Chile
- *Correspondence: Wendy González, ; Miguel Reyes-Parada, ; Gabriel Núñez-Vivanco,
| | - Miguel Reyes-Parada
- Centro de Investigación Biomédica y Aplicada (CIBAP), Escuela de Medicina, Facultad de Ciencias Médicas, Universidad de Santiago de Chile, Santiago, Chile
- Facultad de Ciencias de la Salud, Universidad Autónoma de Chile, Talca, Chile
- *Correspondence: Wendy González, ; Miguel Reyes-Parada, ; Gabriel Núñez-Vivanco,
| | - Gabriel Núñez-Vivanco
- Departamento de Ciencias Naturales y Tecnología, Universidad de Aysén, Coyhaique, Chile
- *Correspondence: Wendy González, ; Miguel Reyes-Parada, ; Gabriel Núñez-Vivanco,
| |
Collapse
|
8
|
Shi W, Singha M, Srivastava G, Pu L, Ramanujam J, Brylinski M. Pocket2Drug: An Encoder-Decoder Deep Neural Network for the Target-Based Drug Design. Front Pharmacol 2022; 13:837715. [PMID: 35359869 PMCID: PMC8962739 DOI: 10.3389/fphar.2022.837715] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 02/10/2022] [Indexed: 11/13/2022] Open
Abstract
Computational modeling is an essential component of modern drug discovery. One of its most important applications is to select promising drug candidates for pharmacologically relevant target proteins. Because of continuing advances in structural biology, putative binding sites for small organic molecules are being discovered in numerous proteins linked to various diseases. These valuable data offer new opportunities to build efficient computational models predicting binding molecules for target sites through the application of data mining and machine learning. In particular, deep neural networks are powerful techniques capable of learning from complex data in order to make informed drug binding predictions. In this communication, we describe Pocket2Drug, a deep graph neural network model to predict binding molecules for a given a ligand binding site. This approach first learns the conditional probability distribution of small molecules from a large dataset of pocket structures with supervised training, followed by the sampling of drug candidates from the trained model. Comprehensive benchmarking simulations show that using Pocket2Drug significantly improves the chances of finding molecules binding to target pockets compared to traditional drug selection procedures. Specifically, known binders are generated for as many as 80.5% of targets present in the testing set consisting of dissimilar data from that used to train the deep graph neural network model. Overall, Pocket2Drug is a promising computational approach to inform the discovery of novel biopharmaceuticals.
Collapse
Affiliation(s)
- Wentao Shi
- Division of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA, United States
| | - Manali Singha
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, United States
| | - Gopal Srivastava
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, United States
| | - Limeng Pu
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, United States
| | - J. Ramanujam
- Division of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA, United States
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, United States
| | - Michal Brylinski
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, United States
- Center for Computation and Technology, Louisiana State University, Baton Rouge, LA, United States
- *Correspondence: Michal Brylinski,
| |
Collapse
|
9
|
Sankar S, Chandra N. SiteMotif: A graph-based algorithm for deriving structural motifs in Protein Ligand binding sites. PLoS Comput Biol 2022; 18:e1009901. [PMID: 35202398 PMCID: PMC8903255 DOI: 10.1371/journal.pcbi.1009901] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Revised: 03/08/2022] [Accepted: 02/07/2022] [Indexed: 12/03/2022] Open
Abstract
Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation. A large number of biological functions are orchestrated by proteins. The function of proteins is governed by its structure and its interacting ligand. However, it is known that not all residues are involved in ligand recognition. More specifically, residues that are located within 4.5 Å of ligand atoms are considered to be ’binding sites’. Here, we have developed an algorithm called SiteMotif that efficiently aligns multiple binding sites into a common frame. This process enables us to derive conservation among the binding site residues in a sequence order independent manner. The algorithm was validated extensively across five different levels and measured binding site similarities in each of them. Previous research has found multiple instances where different proteins have comparable binding sites and hence perform the same function. We present the ability of our method to detect such scenarios. Finally, As a use case, we applied SiteMotif to a set of glutathione binding proteins and derived a site based sequence motif characteristic of all glutathione binding proteins.
Collapse
Affiliation(s)
- Santhosh Sankar
- Department of Biochemistry, Indian Institute of Science, Bangalore, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, Karnataka, India
- BioSystems Science and Engineering, Indian Institute of Science, Bangalore, Karnataka, India
- * E-mail:
| |
Collapse
|
10
|
Charoute H, Elkarhat Z, Elkhattabi L, El Fahime E, Oukkache N, Rouba H, Barakat A. Computational screening of potential drugs against COVID-19 disease: the Neuropilin-1 receptor as molecular target. Virusdisease 2022; 33:23-31. [PMID: 35079600 PMCID: PMC8776366 DOI: 10.1007/s13337-021-00751-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Accepted: 11/01/2021] [Indexed: 12/28/2022] Open
Abstract
The transmembrane receptor Neuropilin-1 (NRP-1) was reported to serve as a host cell entry factor for the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causal agent of COVID-19 disease. Therefore, molecular compounds interfering with SARS-CoV-2 binding to NRP-1 seem to be potential candidates as new antiviral drugs. In this study, NRP-1 receptor was targeted using a library of 1167 compounds previously analyzed in COVID-19 related studies. The results show the effectiveness of Nafamostat, Y96, Selinexor, Ebastine and UGS, in binding to NRP-1 receptor, with docking scores lower than − 8.2 kcal/mol. These molecules interact with NRP-1 receptor key residues, which makes them promising drugs to pursue further biological assays to explore their potential use in the treatment of COVID-19.
Collapse
Affiliation(s)
- Hicham Charoute
- Research Unit of Epidemiology, Biostatistics and Bioinformatics, 1, Place Louis Pasteur, Institut Pasteur du Maroc, 20360 Casablanca, Morocco
- Laboratory of Genomics and Human Genetics, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Zouhair Elkarhat
- Laboratory of Genomics and Human Genetics, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Lamiae Elkhattabi
- Laboratory of Genomics and Human Genetics, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Elmostafa El Fahime
- Molecular Biology and Functional Genomics Platform, National Center for Scientific and Technical Research, Rabat, Morocco
| | - Naoual Oukkache
- Laboratory of Venoms and Toxins, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Hassan Rouba
- Laboratory of Genomics and Human Genetics, Institut Pasteur du Maroc, Casablanca, Morocco
| | - Abdelhamid Barakat
- Laboratory of Genomics and Human Genetics, Institut Pasteur du Maroc, Casablanca, Morocco
| |
Collapse
|
11
|
Bhadra A, Yeturu K. Site2Vec: a reference frame invariant algorithm for vector embedding of protein–ligand binding sites. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abad88] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Protein–ligand interactions are one of the fundamental types of molecular interactions in living systems. Ligands are small molecules that interact with protein molecules at specific regions on their surfaces called binding sites. Binding sites would also determine ADMET properties of a drug molecule. Tasks such as assessment of protein functional similarity and detection of side effects of drugs need identification of similar binding sites of disparate proteins across diverse pathways. To this end, methods for computing similarities between binding sites are still evolving and is an active area of research even today. Machine learning methods for similarity assessment require feature descriptors of binding sites. Traditional methods based on hand engineered motifs and atomic configurations are not scalable across several thousands of sites. In this regard, deep neural network algorithms are now deployed which can capture very complex input feature space. However, one fundamental challenge in applying deep learning to structures of binding sites is the input representation and the reference frame. We report here a novel algorithm, Site2Vec, that derives reference frame invariant vector embedding of a protein–ligand binding site. The method is based on pairwise distances between representative points and chemical compositions in terms of constituent amino acids of a site. The vector embedding serves as a locality sensitive hash function for proximity queries and determining similar sites. The method has been the top performer with more than 95% quality scores in extensive benchmarking studies carried over 10 data sets and against 23 other site comparison methods in the field. The algorithm serves for high throughput processing and has been evaluated for stability with respect to reference frame shifts, coordinate perturbations and residue mutations. We also provide the method as a standalone executable and a web service hosted at (http://services.iittp.ac.in/bioinfo/home).
Collapse
|
12
|
Naderi M, Lemoine JM, Govindaraj RG, Kana OZ, Feinstein WP, Brylinski M. Binding site matching in rational drug design: algorithms and applications. Brief Bioinform 2019; 20:2167-2184. [PMID: 30169563 PMCID: PMC6954434 DOI: 10.1093/bib/bby078] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 07/18/2018] [Accepted: 07/29/2018] [Indexed: 01/06/2023] Open
Abstract
Interactions between proteins and small molecules are critical for biological functions. These interactions often occur in small cavities within protein structures, known as ligand-binding pockets. Understanding the physicochemical qualities of binding pockets is essential to improve not only our basic knowledge of biological systems, but also drug development procedures. In order to quantify similarities among pockets in terms of their geometries and chemical properties, either bound ligands can be compared to one another or binding sites can be matched directly. Both perspectives routinely take advantage of computational methods including various techniques to represent and compare small molecules as well as local protein structures. In this review, we survey 12 tools widely used to match pockets. These methods are divided into five categories based on the algorithm implemented to construct binding-site alignments. In addition to the comprehensive analysis of their algorithms, test sets and the performance of each method are described. We also discuss general pharmacological applications of computational pocket matching in drug repurposing, polypharmacology and side effects. Reflecting on the importance of these techniques in drug discovery, in the end, we elaborate on the development of more accurate meta-predictors, the incorporation of protein flexibility and the integration of powerful artificial intelligence technologies such as deep learning.
Collapse
Affiliation(s)
- Misagh Naderi
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Jeffrey Mitchell Lemoine
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Division of Computer Science and Engineering, Louisiana State University, Baton Rouge, LA 70803, USA
| | | | - Omar Zade Kana
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Wei Pan Feinstein
- High-Performance Computing, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Michal Brylinski
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
- Center for Computation & Technology, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
13
|
Abstract
INTRODUCTION The success of binding site comparisons in drug discovery is based on the recognized fact that many different proteins have similar binding sites. Indeed, binding site comparisons have found many uses in drug development and have the potential to dramatically cut the cost and shorten the time necessary for the development of new drugs. Areas covered: The authors review recent methods for comparing protein binding sites and their use in drug repurposing and polypharmacology. They examine emerging fields including the use of binding site comparisons in precision medicine, the prediction of structured water molecules, the search for targets of natural compounds, and their application in the development of protein-based drugs by loop modeling and for comparison of RNA binding sites. Expert opinion: Binding site comparisons have produced many interesting results in drug development, but relatively little work has been done on protein-protein interaction sites, which are particularly relevant in view of the success of biological drugs. Growth of protein loop modeling for modulating biological drugs is anticipated. The fusion of currently distinct methods for the comparison of RNA and protein binding sites into a single comprehensive approach could allow the search for new selective ribosomal antibiotics and initiate pharmaceutical research into other nucleoproteins.
Collapse
Affiliation(s)
- Janez Konc
- a Theory Department , National Institute of Chemistry , Ljubljana , Slovenia.,b Faculty of Pharmacy , University of Ljubljana , Ljubljana , Slovenia.,c Faculty of Mathematics , Natural Sciences and Information Technologies, University of Primorska , Koper , Slovenia.,d Faculty of Chemistry and Chemical Technology , University of Maribor , Maribor , Slovenia
| |
Collapse
|
14
|
Bhagavat R, Sankar S, Srinivasan N, Chandra N. An Augmented Pocketome: Detection and Analysis of Small-Molecule Binding Pockets in Proteins of Known 3D Structure. Structure 2019. [PMID: 29514079 DOI: 10.1016/j.str.2018.02.001] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets.
Collapse
Affiliation(s)
- Raghu Bhagavat
- National Mathematics Initiative, Indian Institute of Science, Bangalore 560012, India
| | - Santhosh Sankar
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, India
| | - Narayanaswamy Srinivasan
- National Mathematics Initiative, Indian Institute of Science, Bangalore 560012, India; Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Nagasuma Chandra
- National Mathematics Initiative, Indian Institute of Science, Bangalore 560012, India; Department of Biochemistry, Indian Institute of Science, Bangalore 560012, India.
| |
Collapse
|
15
|
A genome-wide structure-based survey of nucleotide binding proteins in M. tuberculosis. Sci Rep 2017; 7:12489. [PMID: 28970579 PMCID: PMC5624866 DOI: 10.1038/s41598-017-12471-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2017] [Accepted: 09/04/2017] [Indexed: 01/09/2023] Open
Abstract
Nucleoside tri-phosphates (NTP) form an important class of small molecule ligands that participate in, and are essential to a large number of biological processes. Here, we seek to identify the NTP binding proteome (NTPome) in M. tuberculosis (M.tb), a deadly pathogen. Identifying the NTPome is useful not only for gaining functional insights of the individual proteins but also for identifying useful drug targets. From an earlier study, we had structural models of M.tb at a proteome scale from which a set of 13,858 small molecule binding pockets were identified. We use a set of NTP binding sub-structural motifs derived from a previous study and scan the M.tb pocketome, and find that 1,768 proteins or 43% of the proteome can theoretically bind NTP ligands. Using an experimental proteomics approach involving dye-ligand affinity chromatography, we confirm NTP binding to 47 different proteins, of which 4 are hypothetical proteins. Our analysis also provides the precise list of binding site residues in each case, and the probable ligand binding pose. As the list includes a number of known and potential drug targets, the identification of NTP binding can directly facilitate structure-based drug design of these targets.
Collapse
|
16
|
Bhagavat R, Srinivasan N, Chandra N. Deciphering common recognition principles of nucleoside mono/di and tri-phosphates binding in diverse proteins via structural matching of their binding sites. Proteins 2017; 85:1699-1712. [PMID: 28547747 DOI: 10.1002/prot.25328] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2017] [Revised: 05/04/2017] [Accepted: 05/20/2017] [Indexed: 12/14/2022]
Abstract
Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyze structures of 4677 NTP complexes available in Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represent a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs are also most contributing to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Raghu Bhagavat
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Narayanaswamy Srinivasan
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Molecular Biophysics Unit, National Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| |
Collapse
|
17
|
Mudgal R, Srinivasan N, Chandra N. Resolving protein structure-function-binding site relationships from a binding site similarity network perspective. Proteins 2017; 85:1319-1335. [PMID: 28342236 DOI: 10.1002/prot.25293] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Revised: 03/18/2017] [Accepted: 03/20/2017] [Indexed: 11/05/2022]
Abstract
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Richa Mudgal
- IISc Mathematics Initiative, Indian Institute of Science, Bangalore, Karnataka, 560 012, India
| | | | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, Karnataka, 560 012, India
| |
Collapse
|
18
|
Chakraborty C, Bandyopadhyay S, Agoramoorthy G. India's Computational Biology Growth and Challenges. Interdiscip Sci 2016; 8:263-76. [PMID: 27465042 DOI: 10.1007/s12539-016-0179-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Revised: 09/08/2015] [Accepted: 09/08/2015] [Indexed: 11/30/2022]
Abstract
India's computational science is growing swiftly due to the outburst of internet and information technology services. The bioinformatics sector of India has been transforming rapidly by creating a competitive position in global bioinformatics market. Bioinformatics is widely used across India to address a wide range of biological issues. Recently, computational researchers and biologists are collaborating in projects such as database development, sequence analysis, genomic prospects and algorithm generations. In this paper, we have presented the Indian computational biology scenario highlighting bioinformatics-related educational activities, manpower development, internet boom, service industry, research activities, conferences and trainings undertaken by the corporate and government sectors. Nonetheless, this new field of science faces lots of challenges.
Collapse
Affiliation(s)
- Chiranjib Chakraborty
- Department of Bio-informatics, School of Computer and Information Sciences, Galgotias University, Greater Noida, India
| | | | | |
Collapse
|
19
|
Bhosle A, Chandra N. Structural analysis of dihydrofolate reductases enables rationalization of antifolate binding affinities and suggests repurposing possibilities. FEBS J 2016; 283:1139-67. [PMID: 26797763 DOI: 10.1111/febs.13662] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2015] [Revised: 12/03/2015] [Accepted: 01/14/2016] [Indexed: 11/28/2022]
Abstract
Antifolates are competitive inhibitors of dihydrofolate reductase (DHFR), a conserved enzyme that is central to metabolism and widely targeted in pathogenic diseases, cancer and autoimmune disorders. Although most clinically used antifolates are known to be target specific, some display a fair degree of cross-reactivity with DHFRs from other species. A method that enables identification of determinants of affinity and specificity in target DHFRs from different species and provides guidelines for the design of antifolates is currently lacking. To address this, we first captured the potential druggable space of a DHFR in a substructure called the 'supersite' and classified supersites of DHFRs from 56 species into 16 'site-types' based on pairwise structural similarity. Analysis of supersites across these site-types revealed that DHFRs exhibit varying extents of dissimilarity at structurally equivalent positions in and around the binding site. We were able to explain the pattern of affinities towards chemically diverse antifolates exhibited by DHFRs of different site-types based on these structural differences. We then generated an antifolate-DHFR network by mapping known high-affinity antifolates to their respective supersites and used this to identify antifolates that can be repurposed based on similarity between supersites or antifolates. Thus, we identified 177 human-specific and 458 pathogen-specific antifolates, a large number of which are supported by available experimental data. Thus, in the light of the clinical importance of DHFR, we present a novel approach to identifying differences in the druggable space of DHFRs that can be utilized for rational design of antifolates.
Collapse
Affiliation(s)
- Amrisha Bhosle
- Department of Biochemistry, Indian Institute of Science, Bangalore, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, India
| |
Collapse
|
20
|
Shivashankar N, Patil S, Bhosle A, Chandra N, Natarajan V. MS3ALIGN: an efficient molecular surface aligner using the topology of surface curvature. BMC Bioinformatics 2016; 17:26. [PMID: 26753741 PMCID: PMC4710026 DOI: 10.1186/s12859-015-0874-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 12/15/2015] [Indexed: 11/17/2022] Open
Abstract
Background Aligning similar molecular structures is an important step in the process of bio-molecular structure and function analysis. Molecular surfaces are simple representations of molecular structure that are easily constructed from various forms of molecular data such as 3D atomic coordinates (PDB) and Electron Microscopy (EM) data. Methods We present a Multi-Scale Morse-Smale Molecular-Surface Alignment tool, MS3ALIGN, which aligns molecular surfaces based on significant protrusions on the molecular surface. The input is a pair of molecular surfaces represented as triangle meshes. A key advantage of MS3ALIGN is computational efficiency that is achieved because it processes only a few carefully chosen protrusions on the molecular surface. Furthermore, the alignments are partial in nature and therefore allows for inexact surfaces to be aligned. Results The method is evaluated in four settings. First, we establish performance using known alignments with varying overlap and noise values. Second, we compare the method with SurfComp, an existing surface alignment method. We show that we are able to determine alignments reported by SurfComp, as well as report relevant alignments not found by SurfComp. Third, we validate the ability of MS3ALIGN to determine alignments in the case of structurally dissimilar binding sites. Fourth, we demonstrate the ability of MS3ALIGN to align iso-surfaces derived from cryo-electron microscopy scans. Conclusions We have presented an algorithm that aligns Molecular Surfaces based on the topology of surface curvature. A webserver and standalone software implementation of the algorithm available at http://vgl.serc.iisc.ernet.in/ms3align. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0874-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nithin Shivashankar
- Department of Computer Science and Automation, Indian Institute of Science, Bangalore, 560012, India.
| | - Sonali Patil
- Department of Computer Science and Automation, Indian Institute of Science, Bangalore, 560012, India
| | - Amrisha Bhosle
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, India
| | - Vijay Natarajan
- Department of Computer Science and Automation, and Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore, 560012, India.
| |
Collapse
|
21
|
Bartolowits M, Davisson VJ. Considerations of Protein Subpockets in Fragment-Based Drug Design. Chem Biol Drug Des 2015; 87:5-20. [PMID: 26307335 DOI: 10.1111/cbdd.12631] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While the fragment-based drug design approach continues to gain importance, gaps in the tools and methods available in the identification and accurate utilization of protein subpockets have limited the scope. The importance of these features of small molecule-protein recognition is highlighted with several examples. A generalized solution for the identification of subpockets and corresponding chemical fragments remains elusive, but there are numerous advancements in methods that can be used in combination to address subpockets. Finally, additional examples of approaches that consider the relative importance of small-molecule co-dependence of protein conformations are highlighted to emphasize an increased significance of subpockets, especially at protein interfaces.
Collapse
Affiliation(s)
- Matthew Bartolowits
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| | - V Jo Davisson
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| |
Collapse
|
22
|
Metri R, Hariharaputran S, Ramakrishnan G, Anand P, Raghavender US, Ochoa-Montaño B, Higueruelo AP, Sowdhamini R, Chandra NR, Blundell TL, Srinivasan N. SInCRe-structural interactome computational resource for Mycobacterium tuberculosis. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav060. [PMID: 26130660 PMCID: PMC4485431 DOI: 10.1093/database/bav060] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 05/26/2015] [Indexed: 11/20/2022]
Abstract
We have developed an integrated database for Mycobacterium tuberculosis H37Rv (Mtb) that collates information on protein sequences, domain assignments, functional annotation and 3D structural information along with protein–protein and protein–small molecule interactions. SInCRe (Structural Interactome Computational Resource) is developed out of CamBan (Cambridge and Bangalore) collaboration. The motivation for development of this database is to provide an integrated platform to allow easily access and interpretation of data and results obtained by all the groups in CamBan in the field of Mtb informatics. In-house algorithms and databases developed independently by various academic groups in CamBan are used to generate Mtb-specific datasets and are integrated in this database to provide a structural dimension to studies on tuberculosis. The SInCRe database readily provides information on identification of functional domains, genome-scale modelling of structures of Mtb proteins and characterization of the small-molecule binding sites within Mtb. The resource also provides structure-based function annotation, information on small-molecule binders including FDA (Food and Drug Administration)-approved drugs, protein–protein interactions (PPIs) and natural compounds that bind to pathogen proteins potentially and result in weakening or elimination of host–pathogen protein–protein interactions. Together they provide prerequisites for identification of off-target binding. Database URL:http://proline.biochem.iisc.ernet.in/sincre
Collapse
Affiliation(s)
- Rahul Metri
- Department of Biochemistry and Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore, India
| | - Sridhar Hariharaputran
- Department of Biochemistry and National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary Road, Bangalore, India
| | - Gayatri Ramakrishnan
- Indian Institute of Science Mathematics Initiative, Indian Institute of Science, Bangalore, India, Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India, and
| | | | | | | | - Alicia P Higueruelo
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, UK
| | - Ramanathan Sowdhamini
- National Centre for Biological Sciences, TIFR, UAS-GKVK Campus, Bellary Road, Bangalore, India
| | | | - Tom L Blundell
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, UK
| | | |
Collapse
|
23
|
Silverstein TP, Kirk SR, Meyer SC, Holman KLM. Myoglobin structure and function: A multiweek biochemistry laboratory project. BIOCHEMISTRY AND MOLECULAR BIOLOGY EDUCATION : A BIMONTHLY PUBLICATION OF THE INTERNATIONAL UNION OF BIOCHEMISTRY AND MOLECULAR BIOLOGY 2015; 43:181-8. [PMID: 25726810 DOI: 10.1002/bmb.20845] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 11/02/2014] [Indexed: 05/25/2023]
Abstract
We have developed a multiweek laboratory project in which students isolate myoglobin and characterize its structure, function, and redox state. The important laboratory techniques covered in this project include size-exclusion chromatography, electrophoresis, spectrophotometric titration, and FTIR spectroscopy. Regarding protein structure, students work with computer modeling and visualization of myoglobin and its homologues, after which they spectroscopically characterize its thermal denaturation. Students also study protein function (ligand binding equilibrium) and are instructed on topics in data analysis (calibration curves, nonlinear vs. linear regression). This upper division biochemistry laboratory project is a challenging and rewarding one that not only exposes students to a wide variety of important biochemical laboratory techniques but also ties those techniques together to work with a single readily available and easily characterized protein, myoglobin.
Collapse
Affiliation(s)
| | - Sarah R Kirk
- Department of Chemistry, Willamette University, Salem, Oregon, 97301
| | - Scott C Meyer
- Department of Chemistry, Willamette University, Salem, Oregon, 97301
| | | |
Collapse
|
24
|
Anand P, Chandra N. Characterizing the pocketome of Mycobacterium tuberculosis and application in rationalizing polypharmacological target selection. Sci Rep 2014; 4:6356. [PMID: 25220818 PMCID: PMC5376175 DOI: 10.1038/srep06356] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 06/20/2014] [Indexed: 01/13/2023] Open
Abstract
Polypharmacology is beginning to emerge as an important concept in the field of drug discovery. However, there are no established approaches to either select appropriate target sets or design polypharmacological drugs. Here, we propose a structural-proteomics approach that utilizes the structural information of the binding sites at a genome-scale obtained through in-house algorithms to characterize the pocketome, yielding a list of ligands that can participate in various biochemical events in the mycobacterial cell. The pocket-type space is seen to be much larger than the sequence or fold-space, suggesting that variations at the site-level contribute significantly to functional repertoire of the organism. All-pair comparisons of binding sites within Mycobacterium tuberculosis (Mtb), pocket-similarity network construction and clustering result in identification of binding-site sets, each containing a group of similar binding sites, theoretically having a potential to interact with a common set of compounds. A polypharmacology index is formulated to rank targets by incorporating a measure of druggability and similarity to other pockets within the proteome. This study presents a rational approach to identify targets with polypharmacological potential along with possible drugs for repurposing, while simultaneously, obtaining clues on lead compounds for use in new drug-discovery pipelines.
Collapse
Affiliation(s)
- Praveen Anand
- Department of Biochemistry, Indian Institute of Science, Bangalore-560012, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore-560012, India
| |
Collapse
|
25
|
Anand P, Nagarajan D, Mukherjee S, Chandra N. PLIC: protein-ligand interaction clusters. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau029. [PMID: 24763918 PMCID: PMC3998096 DOI: 10.1093/database/bau029] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Most of the biological processes are governed through specific protein–ligand interactions. Discerning different components that contribute toward a favorable protein– ligand interaction could contribute significantly toward better understanding protein function, rationalizing drug design and obtaining design principles for protein engineering. The Protein Data Bank (PDB) currently hosts the structure of ∼68 000 protein–ligand complexes. Although several databases exist that classify proteins according to sequence and structure, a mere handful of them annotate and classify protein–ligand interactions and provide information on different attributes of molecular recognition. In this study, an exhaustive comparison of all the biologically relevant ligand-binding sites (84 846 sites) has been conducted using PocketMatch: a rapid, parallel, in-house algorithm. PocketMatch quantifies the similarity between binding sites based on structural descriptors and residue attributes. A similarity network was constructed using binding sites whose PocketMatch scores exceeded a high similarity threshold (0.80). The binding site similarity network was clustered into discrete sets of similar sites using the Markov clustering (MCL) algorithm. Furthermore, various computational tools have been used to study different attributes of interactions within the individual clusters. The attributes can be roughly divided into (i) binding site characteristics including pocket shape, nature of residues and interaction profiles with different kinds of atomic probes, (ii) atomic contacts consisting of various types of polar, hydrophobic and aromatic contacts along with binding site water molecules that could play crucial roles in protein–ligand interactions and (iii) binding energetics involved in interactions derived from scoring functions developed for docking. For each ligand-binding site in each protein in the PDB, site similarity information, clusters they belong to and description of site attributes are provided as a relational database—protein–ligand interaction clusters (PLIC). Database URL: http://proline.biochem.iisc.ernet.in/PLIC
Collapse
Affiliation(s)
- Praveen Anand
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India and IISc Mathematics Initiative, Indian Institute of Science, Banglaore 560012, Karnataka, India
| | | | | | | |
Collapse
|
26
|
Abstract
Efforts from the TB Structural Genomics Consortium together with those of tuberculosis structural biologists worldwide have led to the determination of about 350 structures, making up nearly a tenth of the pathogen's proteome. Given that knowledge of protein structures is essential to obtaining a high-resolution understanding of the underlying biology, it is desirable to have a structural view of the entire proteome. Indeed, structure prediction methods have advanced sufficiently to allow structural models of many more proteins to be built based on homology modeling and fold recognition strategies. By means of these approaches, structural models for about 2,877 proteins, making up nearly 70% of the Mycobacterium tuberculosis proteome, are available. Knowledge from bioinformatics has made significant inroads into an improved annotation of the M. tuberculosis genome and in the prediction of key protein players that interact in vital pathways, some of which are unique to the organism. Functional inferences have been made for a large number of proteins based on fold-function associations. More importantly, ligand-binding pockets of the proteins are identified and scanned against a large database, leading to binding site-based ligand associations and hence structure-based function annotation. Near proteome-wide structural models provide a global perspective of the fold distribution in the genome. New insights about the folds that predominate in the genome, as well as the fold combinations that make up multidomain proteins, are also obtained. This chapter describes the structural proteome, functional inferences drawn from it, and its applications in drug discovery.
Collapse
|
27
|
Andreini C, Cavallaro G, Rosato A, Valasatava Y. MetalS2: a tool for the structural alignment of minimal functional sites in metal-binding proteins and nucleic acids. J Chem Inf Model 2013; 53:3064-75. [PMID: 24117467 DOI: 10.1021/ci400459w] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
We developed a new software tool, MetalS(2), for the structural alignment of Minimal Functional Sites (MFSs) in metal-binding biological macromolecules. MFSs are 3D templates that describe the local environment around the metal(s) independently of the larger context of the macromolecular structure. Such local environment has a determinant role in tuning the chemical reactivity of the metal, ultimately contributing to the functional properties of the whole system. On our example data sets, MetalS(2) unveiled structural similarities that other programs for protein structure comparison do not consistently point out and overall identified a larger number of structurally similar MFSs. MetalS(2) supports the comparison of MFSs harboring different metals and/or with different nuclearity and is available both as a stand-alone program and a Web tool ( http://metalweb.cerm.unifi.it/tools/metals2/).
Collapse
Affiliation(s)
- Claudia Andreini
- Magnetic Resonance Center (CERM) - University of Florence , Via L. Sacconi 6, 50019 Sesto Fiorentino, Florence, Italy
| | | | | | | |
Collapse
|
28
|
Bhagavat R, Chandra N. Common recognition principles across diverse sequence and structural families of sialic acid binding proteins. Glycobiology 2013; 24:5-16. [PMID: 24043392 DOI: 10.1093/glycob/cwt063] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Sialic acids form a large family of 9-carbon monosaccharides and are integral components of glycoconjugates. They are known to bind to a wide range of receptors belonging to diverse sequence families and fold classes and are key mediators in a plethora of cellular processes. Thus, it is of great interest to understand the features that give rise to such a recognition capability. Structural analyses using a non-redundant data set of known sialic acid binding proteins was carried out, which included exhaustive binding site comparisons and site alignments using in-house algorithms, followed by clustering and tree computation, which has led to derivation of sialic acid recognition principles. Although the proteins in the data set belong to several sequence and structure families, their binding sites could be grouped into only six types. Structural comparison of the binding sites indicates that all sites contain one or more different combinations of key structural features over a common scaffold. The six binding site types thus serve as structural motifs for recognizing sialic acid. Scanning the motifs against a non-redundant set of binding sites from PDB indicated the motifs to be specific for sialic acid recognition. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. As an example analysis, a genome-wide scan for the motifs in structures of Mycobacterium tuberculosis proteome identified 17 hits that contain combinations of the features, suggesting a possible function of sialic acid binding by these proteins.
Collapse
Affiliation(s)
- Raghu Bhagavat
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, India
| | | |
Collapse
|
29
|
Csermely P, Korcsmáros T, Kiss HJM, London G, Nussinov R. Structure and dynamics of molecular networks: a novel paradigm of drug discovery: a comprehensive review. Pharmacol Ther 2013; 138:333-408. [PMID: 23384594 PMCID: PMC3647006 DOI: 10.1016/j.pharmthera.2013.01.016] [Citation(s) in RCA: 506] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Accepted: 01/22/2013] [Indexed: 02/02/2023]
Abstract
Despite considerable progress in genome- and proteome-based high-throughput screening methods and in rational drug design, the increase in approved drugs in the past decade did not match the increase of drug development costs. Network description and analysis not only give a systems-level understanding of drug action and disease complexity, but can also help to improve the efficiency of drug design. We give a comprehensive assessment of the analytical tools of network topology and dynamics. The state-of-the-art use of chemical similarity, protein structure, protein-protein interaction, signaling, genetic interaction and metabolic networks in the discovery of drug targets is summarized. We propose that network targeting follows two basic strategies. The "central hit strategy" selectively targets central nodes/edges of the flexible networks of infectious agents or cancer cells to kill them. The "network influence strategy" works against other diseases, where an efficient reconfiguration of rigid networks needs to be achieved by targeting the neighbors of central nodes/edges. It is shown how network techniques can help in the identification of single-target, edgetic, multi-target and allo-network drug target candidates. We review the recent boom in network methods helping hit identification, lead selection optimizing drug efficacy, as well as minimizing side-effects and drug toxicity. Successful network-based drug development strategies are shown through the examples of infections, cancer, metabolic diseases, neurodegenerative diseases and aging. Summarizing >1200 references we suggest an optimized protocol of network-aided drug development, and provide a list of systems-level hallmarks of drug quality. Finally, we highlight network-related drug development trends helping to achieve these hallmarks by a cohesive, global approach.
Collapse
Affiliation(s)
- Peter Csermely
- Department of Medical Chemistry, Semmelweis University, P.O. Box 260, H-1444 Budapest 8, Hungary.
| | | | | | | | | |
Collapse
|
30
|
Lee HS, Im W. Identification of ligand templates using local structure alignment for structure-based drug design. J Chem Inf Model 2012; 52:2784-95. [PMID: 22978550 DOI: 10.1021/ci300178e] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
With a rapid increase in the number of high-resolution protein-ligand structures, the known protein-ligand structures can be used to gain insight into ligand-binding modes in a target protein. On the basis of the fact that the structurally similar binding sites share information about their ligands, we have developed a local structure alignment tool, G-LoSA (graph-based local structure alignment). The known protein-ligand binding-site structure library is searched by G-LoSA to detect binding-site structures with similar geometry and physicochemical properties to a query binding-site structure regardless of sequence continuity and protein fold. Then, the ligands in the identified complexes are used as templates (i.e., template ligands) to predict/design a ligand for the target protein. The performance of G-LoSA is validated against 76 benchmark targets from the Astex diverse set. Using the currently available protein-ligand structure library, G-LoSA is able to identify a single template ligand (from a nonhomologous protein complex) that is highly similar to the target ligand in more than half of the benchmark targets. In addition, our benchmark analyses show that an assembly of structural fragments from multiple template ligands with partial similarity to the target ligand can be used to design novel ligand structures specific to the target protein. This study clearly indicates that a template-based ligand modeling has potential for de novo ligand design and can be a complementary approach to the receptor structure based methods.
Collapse
Affiliation(s)
- Hui Sun Lee
- Department of Molecular Biosciences and Center for Bioinformatics, The University of Kansas, 2030 Becker Drive, Lawrence, Kansas 66047, USA.
| | | |
Collapse
|
31
|
Wu MY, Dai DQ, Yan H. PRL-dock: Protein-ligand docking based on hydrogen bond matching and probabilistic relaxation labeling. Proteins 2012; 80:2137-53. [DOI: 10.1002/prot.24104] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2012] [Revised: 04/14/2012] [Accepted: 04/17/2012] [Indexed: 11/08/2022]
|
32
|
Structure-based computational analysis of protein binding sites for function and druggability prediction. J Biotechnol 2012; 159:123-34. [DOI: 10.1016/j.jbiotec.2011.12.005] [Citation(s) in RCA: 81] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2011] [Revised: 12/02/2011] [Accepted: 12/06/2011] [Indexed: 11/19/2022]
|
33
|
Abstract
A computational pipeline PocketAnnotate for functional annotation of proteins at the level of binding sites has been proposed in this study. The pipeline integrates three in-house algorithms for site-based function annotation: PocketDepth, for prediction of binding sites in protein structures; PocketMatch, for rapid comparison of binding sites and PocketAlign, to obtain detailed alignment between pair of binding sites. A novel scheme has been developed to rapidly generate a database of non-redundant binding sites. For a given input protein structure, putative ligand-binding sites are identified, matched in real time against the database and the query substructure aligned with the promising hits, to obtain a set of possible ligands that the given protein could bind to. The input can be either whole protein structures or merely the substructures corresponding to possible binding sites. Structure-based function annotation at the level of binding sites thus achieved could prove very useful for cases where no obvious functional inference can be obtained based purely on sequence or fold-level analyses. An attempt has also been made to analyse proteins of no known function from Protein Data Bank. PocketAnnotate would be a valuable tool for the scientific community and contribute towards structure-based functional inference. The web server can be freely accessed at http://proline.biochem.iisc.ernet.in/pocketannotate/.
Collapse
Affiliation(s)
- Praveen Anand
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, Karnataka, India
| | | | | |
Collapse
|