1
|
Li S, Cai C, Gong J, Liu X, Li H. A fast protein binding site comparison algorithm for proteome-wide protein function prediction and drug repurposing. Proteins 2021; 89:1541-1556. [PMID: 34245187 DOI: 10.1002/prot.26176] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Revised: 06/26/2021] [Accepted: 06/30/2021] [Indexed: 01/18/2023]
Abstract
The expansion of three-dimensional protein structures and enhanced computing power have significantly facilitated our understanding of protein sequence/structure/function relationships. A challenge in structural genomics is to predict the function of uncharacterized proteins. Protein function deconvolution based on global sequence or structural homology is impracticable when a protein relates to no other proteins with known function, and in such cases, functional relationships can be established by detecting their local ligand binding site similarity. Here, we introduce a sequence order-independent comparison algorithm, PocketShape, for structural proteome-wide exploration of protein functional site by fully considering the geometry of the backbones, orientation of the sidechains, and physiochemical properties of the pocket-lining residues. PocketShape is efficient in distinguishing similar from dissimilar ligand binding site pairs by retrieving 99.3% of the similar pairs while rejecting 100% of the dissimilar pairs on a dataset containing 1538 binding site pairs. This method successfully classifies 83 enzyme structures with diverse functions into 12 clusters, which is highly in accordance with the actual structural classification of proteins classification. PocketShape also achieves superior performances than other methods in protein profiling based on experimental data. Potential new applications for representative SARS-CoV-2 drugs Remdesivir and 11a are predicted. The high accuracy and time-efficient characteristics of PocketShape will undoubtedly make it a promising complementary tool for proteome-wide protein function inference and drug repurposing study.
Collapse
Affiliation(s)
- Shiliang Li
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Chaoqian Cai
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China.,School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China
| | - Jiayu Gong
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China.,School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China
| | - Xiaofeng Liu
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| | - Honglin Li
- State Key Laboratory of Bioreactor Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China.,School of Information Science and Engineering, East China University of Science and Technology, Shanghai, China.,Research and Development Department, Jiangzhong Pharmaceutical Co., Ltd., Nanchang, China
| |
Collapse
|
2
|
Borba JVVB, Silva AC, Lima MNN, Mendonca SS, Furnham N, Costa FTM, Andrade CH. Chemogenomics and bioinformatics approaches for prioritizing kinases as drug targets for neglected tropical diseases. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2020; 124:187-223. [PMID: 33632465 DOI: 10.1016/bs.apcsb.2020.10.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Neglected tropical diseases (NTDs) are a group of twenty-one diseases classified by the World Health Organization that prevail in regions with tropical and subtropical climate and affect more than one billion people. There is an urgent need to develop new and safer drugs for these diseases. Protein kinases are a potential class of targets for developing new drugs against NTDs, since they play crucial role in many biological processes, such as signaling pathways, regulating cellular communication, division, metabolism and death. Bioinformatics is a field that aims to organize large amounts of biological data as well as develop and use tools for understanding and analyze them in order to produce meaningful information in a biological manner. In combination with chemogenomics, which analyzes chemical-biological interactions to screen ligands against selected targets families, these approaches can be used to stablish a rational strategy for prioritizing new drug targets for NTDs. Here, we describe how bioinformatics and chemogenomics tools can help to identify protein kinases and their potential inhibitors for the development of new drugs for NTDs. We present a review of bioinformatics tools and techniques that can be used to define an organisms kinome for drug prioritization, drug and target repurposing, multi-quinase inhibition approachs and selectivity profiling. We also present some successful examples of the application of such approaches in recent case studies.
Collapse
Affiliation(s)
- Joyce Villa Verde Bastos Borba
- LabMol-Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO, Brazil; Laboratory of Tropical Diseases-Prof. Luiz Jacintho da Silva, Department of Genetics, Evolution and Bioagents, University of Campinas, Campinas, SP, Brazil
| | - Arthur Carvalho Silva
- LabMol-Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO, Brazil
| | - Marilia Nunes Nascimento Lima
- LabMol-Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO, Brazil
| | - Sabrina Silva Mendonca
- LabMol-Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO, Brazil
| | - Nicholas Furnham
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Fabio Trindade Maranhão Costa
- Laboratory of Tropical Diseases-Prof. Luiz Jacintho da Silva, Department of Genetics, Evolution and Bioagents, University of Campinas, Campinas, SP, Brazil
| | - Carolina Horta Andrade
- LabMol-Laboratory for Molecular Modeling and Drug Design, Faculty of Pharmacy, Federal University of Goiás, Goiânia, GO, Brazil; Department of Infection Biology, London School of Hygiene and Tropical Medicine, London, United Kingdom.
| |
Collapse
|
3
|
Da C, Zhang D, Stashko M, Vasileiadi E, Parker R, Minson KA, Huey MG, Huelse JM, Hunter D, Gilbert TSK, Norris-Drouin J, Miley M, Herring LE, Graves LM, DeRyckere D, Earp HS, Graham D, Frye SV, Wang X, Kireev D. Data-Driven Construction of Antitumor Agents with Controlled Polypharmacology. J Am Chem Soc 2019; 141:15700-15709. [PMID: 31497954 PMCID: PMC6894422 DOI: 10.1021/jacs.9b08660] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Controlling which particular members of a large protein family are targeted by a drug is key to achieving a desired therapeutic response. In this study, we report a rational data-driven strategy for achieving restricted polypharmacology in the design of antitumor agents selectively targeting the TYRO3, AXL, and MERTK (TAM) family tyrosine kinases. Our computational approach, based on the concept of fragments in structural environments (FRASE), distills relevant chemical information from structural and chemogenomic databases to assemble a three-dimensional inhibitor structure directly in the protein pocket. Target engagement by the inhibitors designed led to disruption of oncogenic phenotypes as demonstrated in enzymatic assays and in a panel of cancer cell lines, including acute lymphoblastic and myeloid leukemia (ALL/AML) and nonsmall cell lung cancer (NSCLC). Structural rationale underlying the approach was corroborated by X-ray crystallography. The lead compound demonstrated potent target inhibition in a pharmacodynamic study in leukemic mice.
Collapse
Affiliation(s)
- Chenxiao Da
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| | - Dehui Zhang
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| | - Michael Stashko
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| | - Eleana Vasileiadi
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Rebecca Parker
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Katherine A. Minson
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Madeline G. Huey
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Justus M. Huelse
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Debra Hunter
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
- Lineberger Comprehensive Cancer Center, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Thomas S. K. Gilbert
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Jacqueline Norris-Drouin
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| | - Michael Miley
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Laura E. Herring
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Lee M. Graves
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Deborah DeRyckere
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - H. Shelton Earp
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
- Lineberger Comprehensive Cancer Center, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Douglas Graham
- Aflac Cancer and Blood Disorders Center, Children’s Healthcare of Atlanta, and Department of Pediatrics, Emory University, Atlanta, GA 30322
| | - Stephen V. Frye
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
- Lineberger Comprehensive Cancer Center, Department of Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Xiaodong Wang
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| | - Dmitri Kireev
- Center for Integrative Chemical Biology and Drug Discovery, Division of Chemical Biology and Medicinal Chemistry, Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599-7363
| |
Collapse
|
4
|
Rabal O, Castellar A, Oyarzabal J. Novel pharmacological maps of protein lysine methyltransferases: key for target deorphanization. J Cheminform 2018; 10:32. [PMID: 30032331 PMCID: PMC6054832 DOI: 10.1186/s13321-018-0288-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 07/14/2018] [Indexed: 11/30/2022] Open
Abstract
Epigenetic therapies are being investigated for the treatment of cancer, cognitive disorders, metabolic alterations and autoinmune diseases. Among the different epigenetic target families, protein lysine methyltransferases (PKMTs), are especially interesting because it is believed that their inhibition may be highly specific at the functional level. Despite its relevance, there are currently known inhibitors against only 10 out of the 50 SET-domain containing members of the PKMT family. Accordingly, the identification of chemical probes for the validation of the therapeutic impact of epigenetic modulation is key. Moreover, little is known about the mechanisms that dictate their substrate specificity and ligand selectivity. Consequently, it is desirable to explore novel methods to characterize the pharmacological similarity of PKMTs, going beyond classical phylogenetic relationships. Such characterization would enable the prediction of ligand off-target effects caused by lack of ligand selectivity and the repurposing of known compounds against alternative targets. This is particularly relevant in the case of orphan targets with unreported inhibitors. Here, we first perform a systematic study of binding modes of cofactor and substrate bound ligands with all available SET domain-containing PKMTs. Protein ligand interaction fingerprints were applied to identify conserved hot spots and contact-specific residues across subfamilies at each binding site; a relevant analysis for guiding the design of novel, selective compounds. Then, a recently described methodology (GPCR-CoINPocket) that incorporates ligand contact information into classical alignment-based comparisons was applied to the entire family of 50 SET-containing proteins to devise pharmacological similarities between them. The main advantage of this approach is that it is not restricted to proteins for which crystallographic data with bound ligands is available. The resulting family organization from the separate analysis of both sites (cofactor and substrate) was retrospectively and prospectively validated. Of note, three hits (inhibition > 50% at 10 µM) were identified for the orphan NSD1.
Collapse
Affiliation(s)
- Obdulia Rabal
- Small Molecule Discovery Platform. Molecular Therapeutics Program, Center for Applied Medical Research, CIMA, University of Navarra, Pio XII, 55, 31008, Pamplona, Spain.
| | - Andrea Castellar
- Small Molecule Discovery Platform. Molecular Therapeutics Program, Center for Applied Medical Research, CIMA, University of Navarra, Pio XII, 55, 31008, Pamplona, Spain
| | - Julen Oyarzabal
- Small Molecule Discovery Platform. Molecular Therapeutics Program, Center for Applied Medical Research, CIMA, University of Navarra, Pio XII, 55, 31008, Pamplona, Spain.
| |
Collapse
|
5
|
Waldner BJ, Kraml J, Kahler U, Spinn A, Schauperl M, Podewitz M, Fuchs JE, Cruciani G, Liedl KR. Electrostatic recognition in substrate binding to serine proteases. J Mol Recognit 2018; 31:e2727. [PMID: 29785722 PMCID: PMC6175425 DOI: 10.1002/jmr.2727] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Revised: 04/11/2018] [Accepted: 04/11/2018] [Indexed: 12/16/2022]
Abstract
Serine proteases of the Chymotrypsin family are structurally very similar but have very different substrate preferences. This study investigates a set of 9 different proteases of this family comprising proteases that prefer substrates containing positively charged amino acids, negatively charged amino acids, and uncharged amino acids with varying degree of specificity. Here, we show that differences in electrostatic substrate preferences can be predicted reliably by electrostatic molecular interaction fields employing customized GRID probes. Thus, we are able to directly link protease structures to their electrostatic substrate preferences. Additionally, we present a new metric that measures similarities in substrate preferences focusing only on electrostatics. It efficiently compares these electrostatic substrate preferences between different proteases. This new metric can be interpreted as the electrostatic part of our previously developed substrate similarity metric. Consequently, we suggest, that substrate recognition in terms of electrostatics and shape complementarity are rather orthogonal aspects of substrate recognition. This is in line with a 2‐step mechanism of protein‐protein recognition suggested in the literature.
Collapse
Affiliation(s)
- Birgit J Waldner
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Johannes Kraml
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Ursula Kahler
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Alexander Spinn
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Michael Schauperl
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Maren Podewitz
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Julian E Fuchs
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Gabriele Cruciani
- Laboratory of Chemometrics, Department of Chemistry, University of Perugia, Perugia, Italy
| | - Klaus R Liedl
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| |
Collapse
|
6
|
Schneider P, Schneider G. Privileged Structures Revisited. Angew Chem Int Ed Engl 2017; 56:7971-7974. [PMID: 28558125 PMCID: PMC5502582 DOI: 10.1002/anie.201702816] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 04/16/2017] [Indexed: 12/13/2022]
Abstract
Privileged structures inspire compound library design in medicinal chemistry. We performed a comprehensive analysis of 1.4 million bioactive compounds, with the aim of assessing the prevalence of certain molecular frameworks. We used the Shannon entropy formalism to quantify the promiscuity of the most frequently observed atom scaffolds across the annotated target families. This analysis revealed an apparent inverse relationship between hydrogen-bond-acceptor count of a scaffold and its potential promiscuity. The results further suggest that chemically easily accessible scaffolds can serve as templates for the generation of bespoke compound libraries with differing degrees of multiple target engagement, and heterocyclic, sp3 -rich frameworks are particularly suited for target-focused library design. The outcome of our study enables us to place some of the many narratives surrounding the concept of privileged structures into a critical context.
Collapse
Affiliation(s)
- Petra Schneider
- Department of Chemistry and Applied BiosciencesSwiss Federal Institute of Technology (ETH)Vladimir-Prelog-Weg 48093ZurichSwitzerland
- inSili.com LLCSegantinisteig 38049ZurichSwitzerland
| | - Gisbert Schneider
- Department of Chemistry and Applied BiosciencesSwiss Federal Institute of Technology (ETH)Vladimir-Prelog-Weg 48093ZurichSwitzerland
| |
Collapse
|
7
|
Affiliation(s)
- Petra Schneider
- Departement Chemie und Angewandte Biowissenschaften; Eidgenössische Technische Hochschule (ETH); Vladimir-Prelog-Weg 4 8093 Zürich Schweiz
- inSili.com GmbH; Segantinisteig 3 8049 Zürich Schweiz
| | - Gisbert Schneider
- Departement Chemie und Angewandte Biowissenschaften; Eidgenössische Technische Hochschule (ETH); Vladimir-Prelog-Weg 4 8093 Zürich Schweiz
| |
Collapse
|
8
|
Krotzky T, Klebe G. Acceleration of Binding Site Comparisons by Graph Partitioning. Mol Inform 2015; 34:550-8. [PMID: 27490500 DOI: 10.1002/minf.201500028] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2015] [Accepted: 05/04/2015] [Indexed: 11/05/2022]
Abstract
The comparison of protein binding sites is a prominent task in computational chemistry and has been studied in many different ways. For the automatic detection and comparison of putative binding cavities the Cavbase system has been developed which uses a coarse-grained set of pseudocenters to represent the physicochemical properties of a binding site and employs a graph-based procedure to calculate similarities between two binding sites. However, the comparison of two graphs is computationally quite demanding which makes large-scale studies such as the rapid screening of entire databases hardly feasible. In a recent work, we proposed the method Local Cliques (LC) for the efficient comparison of Cavbase binding sites. It employs a clique heuristic to detect the maximum common subgraph of two binding sites and an extended graph model to additionally compare the shape of individual surface patches. In this study, we present an alternative to further accelerate the LC method by partitioning the binding-site graphs into disjoint components prior to their comparisons. The pseudocenter sets are split with regard to their assigned phyiscochemical type, which leads to seven much smaller graphs than the original one. Applying this approach on the same test scenarios as in the former comprehensive way results in a significant speed-up without sacrificing accuracy.
Collapse
Affiliation(s)
- Timo Krotzky
- Department of Pharmaceutical Chemistry, Philipps-Universität, 35032 Marburg, Germany
| | - Gerhard Klebe
- Department of Pharmaceutical Chemistry, Philipps-Universität, 35032 Marburg, Germany.
| |
Collapse
|
9
|
Harigua-Souiai E, Cortes-Ciriano I, Desdouits N, Malliavin TE, Guizani I, Nilges M, Blondel A, Bouvier G. Identification of binding sites and favorable ligand binding moieties by virtual screening and self-organizing map analysis. BMC Bioinformatics 2015; 16:93. [PMID: 25888251 PMCID: PMC4381396 DOI: 10.1186/s12859-015-0518-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Accepted: 02/24/2015] [Indexed: 11/24/2022] Open
Abstract
Background Identifying druggable cavities on a protein surface is a crucial step in structure based drug design. The cavities have to present suitable size and shape, as well as appropriate chemical complementarity with ligands. Results We present a novel cavity prediction method that analyzes results of virtual screening of specific ligands or fragment libraries by means of Self-Organizing Maps. We demonstrate the method with two thoroughly studied proteins where it successfully identified their active sites (AS) and relevant secondary binding sites (BS). Moreover, known active ligands mapped the AS better than inactive ones. Interestingly, docking a naive fragment library brought even more insight. We then systematically applied the method to the 102 targets from the DUD-E database, where it showed a 90% identification rate of the AS among the first three consensual clusters of the SOM, and in 82% of the cases as the first one. Further analysis by chemical decomposition of the fragments improved BS prediction. Chemical substructures that are representative of the active ligands preferentially mapped in the AS. Conclusion The new approach provides valuable information both on relevant BSs and on chemical features promoting bioactivity. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0518-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Emna Harigua-Souiai
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France. .,Laboratory of Molecular Epidemiology and Experimental Pathology - LR11IPT04, Institut Pasteur de Tunis, Université Tunis el Manar - Tunisia, 13, Place Pasteur, Tunis, 1002, Tunisia. .,University of Carthage, Faculty of sciences of Bizerte - Tunisia, Jarzouna, 7021, Tunisia.
| | - Isidro Cortes-Ciriano
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Nathan Desdouits
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Thérèse E Malliavin
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Ikram Guizani
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR11IPT04, Institut Pasteur de Tunis, Université Tunis el Manar - Tunisia, 13, Place Pasteur, Tunis, 1002, Tunisia.
| | - Michael Nilges
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Arnaud Blondel
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Guillaume Bouvier
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| |
Collapse
|
10
|
Cortés-Ciriano I, Ain QU, Subramanian V, Lenselink EB, Méndez-Lucio O, IJzerman AP, Wohlfahrt G, Prusis P, Malliavin TE, van Westen GJP, Bender A. Polypharmacology modelling using proteochemometrics (PCM): recent methodological developments, applications to target families, and future prospects. MEDCHEMCOMM 2015. [DOI: 10.1039/c4md00216d] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Proteochemometric (PCM) modelling is a computational method to model the bioactivity of multiple ligands against multiple related protein targets simultaneously.
Collapse
Affiliation(s)
- Isidro Cortés-Ciriano
- Unité de Bioinformatique Structurale
- Institut Pasteur and CNRS UMR 3825
- Structural Biology and Chemistry Department
- 75 724 Paris
- France
| | - Qurrat Ul Ain
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| | | | - Eelke B. Lenselink
- Division of Medicinal Chemistry
- Leiden Academic Centre for Drug Research
- Leiden
- The Netherlands
| | - Oscar Méndez-Lucio
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| | - Adriaan P. IJzerman
- Division of Medicinal Chemistry
- Leiden Academic Centre for Drug Research
- Leiden
- The Netherlands
| | - Gerd Wohlfahrt
- Computer-Aided Drug Design
- Orion Pharma
- FIN-02101 Espoo
- Finland
| | - Peteris Prusis
- Computer-Aided Drug Design
- Orion Pharma
- FIN-02101 Espoo
- Finland
| | - Thérèse E. Malliavin
- Unité de Bioinformatique Structurale
- Institut Pasteur and CNRS UMR 3825
- Structural Biology and Chemistry Department
- 75 724 Paris
- France
| | - Gerard J. P. van Westen
- European Molecular Biology Laboratory
- European Bioinformatics Institute
- Wellcome Trust Genome Campus
- Hinxton
- UK
| | - Andreas Bender
- Unilever Centre for Molecular Informatics
- Department of Chemistry
- CB2 1EW Cambridge
- UK
| |
Collapse
|
11
|
Krotzky T, Grunwald C, Egerland U, Klebe G. Large-scale mining for similar protein binding pockets: with RAPMAD retrieval on the fly becomes real. J Chem Inf Model 2014; 55:165-79. [PMID: 25474400 DOI: 10.1021/ci5005898] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Determination of structural similarities between protein binding pockets is an important challenge in in silico drug design. It can help to understand selectivity considerations, predict unexpected ligand cross-reactivity, and support the putative annotation of function to orphan proteins. To this end, Cavbase was developed as a tool for the automated detection, storage, and classification of putative protein binding sites. In this context, binding sites are characterized as sets of pseudocenters, which denote surface-exposed physicochemical properties, and can be used to enable mutual binding site comparisons. However, these comparisons tend to be computationally very demanding and often lead to very slow computations of the similarity measures. In this study, we propose RAPMAD (RApid Pocket MAtching using Distances), a new evaluation formalism for Cavbase entries that allows for ultrafast similarity comparisons. Protein binding sites are represented by sets of distance histograms that are both generated and compared with linear complexity. Attaining a speed of more than 20 000 comparisons per second, screenings across large data sets and even entire databases become easily feasible. We demonstrate the discriminative power and the short runtime by performing several classification and retrieval experiments. RAPMAD attains better success rates than the comparison formalism originally implemented into Cavbase or several alternative approaches developed in recent time, while requiring only a fraction of their runtime. The pratical use of our method is finally proven by a successful prospective virtual screening study that aims for the identification of novel inhibitors of the NMDA receptor.
Collapse
Affiliation(s)
- Timo Krotzky
- Department of Pharmaceutical Chemistry, Philipps-Universität Marburg , Marbacher Weg 6-10, 35032 Marburg, Germany
| | | | | | | |
Collapse
|
12
|
Krotzky T, Fober T, Hüllermeier E, Klebe G. Extended Graph-Based Models for Enhanced Similarity Search in Cavbase. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:878-890. [PMID: 26356860 DOI: 10.1109/tcbb.2014.2325020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
To calculate similarities between molecular structures, measures based on the maximum common subgraph are frequently applied. For the comparison of protein binding sites, these measures are not fully appropriate since graphs representing binding sites on a detailed atomic level tend to get very large. In combination with an NP-hard problem, a large graph leads to a computationally demanding task. Therefore, for the comparison of binding sites, a less detailed coarse graph model is used building upon so-called pseudocenters. Consistently, a loss of structural data is caused since many atoms are discarded and no information about the shape of the binding site is considered. This is usually resolved by performing subsequent calculations based on additional information. These steps are usually quite expensive, making the whole approach very slow. The main drawback of a graph-based model solely based on pseudocenters, however, is the loss of information about the shape of the protein surface. In this study, we propose a novel and efficient modeling formalism that does not increase the size of the graph model compared to the original approach, but leads to graphs containing considerably more information assigned to the nodes. More specifically, additional descriptors considering surface characteristics are extracted from the local surface and attributed to the pseudocenters stored in Cavbase. These properties are evaluated as additional node labels, which lead to a gain of information and allow for much faster but still very accurate comparisons between different structures.
Collapse
|
13
|
Fuchs JE, Liedl KR. Substrate sequences tell similar stories as binding cavities: commentary. J Chem Inf Model 2014; 53:3115-6. [PMID: 24359119 PMCID: PMC3871284 DOI: 10.1021/ci4005783] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Julian E Fuchs
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck , Innrain 80/82, A-6020 Innsbruck, Austria
| | | |
Collapse
|
14
|
Schomburg KT, Rarey M. Benchmark Data Sets for Structure-Based Computational Target Prediction. J Chem Inf Model 2014; 54:2261-74. [DOI: 10.1021/ci500131x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Karen T. Schomburg
- Center
for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- Center
for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
15
|
Asaoka R. Mapping glaucoma patients' 30-2 and 10-2 visual fields reveals clusters of test points damaged in the 10-2 grid that are not sampled in the sparse 30-2 grid. PLoS One 2014; 9:e98525. [PMID: 24950300 PMCID: PMC4064971 DOI: 10.1371/journal.pone.0098525] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 05/01/2014] [Indexed: 11/19/2022] Open
Abstract
Purpose To cluster test points in glaucoma patients' 30-2 and 10-2 visual field (VF) (Humphrey Field Analyzer: HFA, Carl Zeiss Meditec, Dublin, CA) in order to map the different regions damaged by the disease. Method This retrospective study included 128 eyes from 128 patients. 142 total deviation (TD) values (74 from the 30-2 VF and 68 from the 10-2 VF) were clustered using the ‘Hierarchical Ordered Partitioning And Collapsing Hybrid – Partitioning Around Medoids’ algorithm. The stability of the identified clusters was evaluated using bootstrapping. Results 65 sectors were identified in total: 38 sectors were located outside the 10-2 VF whereas 29 sectors were located inside the 10-2 VF (two sectors overlap in both grids). The mapping of many sectors appeared to follow the distribution of retinal nerve fiber bundles. The results of bootstrapping suggested clusters were stable whether they were outside or inside the 10-2 VF. Conclusion A considerable number of sectors were identified in the 10-2 VF area, despite the fact that clustering was carried out on all points in both the 30-2 VF and 10-2 VF simultaneously. These findings suggest that glaucomatous central VF deterioration cannot be picked up by the 30-2 test grid alone, because of poor spatial sampling; denser estimation of the central ten degrees, than offered by the 30-2 test grid alone, is needed. It may be beneficial to develop a new VF test grid that combines test points from 30-2 and 10-2 VFs – the results of this study could help to devise this test grid.
Collapse
Affiliation(s)
- Ryo Asaoka
- Department of Ophthalmology, The University of Tokyo Graduate School of medicine, Tokyo, Japan
- * E-mail:
| |
Collapse
|
16
|
Krotzky T, Klebe G. A new method for rapid comparison of protein binding pockets by capturing spatial distributions. J Cheminform 2014. [PMCID: PMC3980086 DOI: 10.1186/1758-2946-6-s1-p45] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
17
|
Fuchs JE, von Grafenstein S, Huber RG, Kramer C, Liedl KR. Substrate-driven mapping of the degradome by comparison of sequence logos. PLoS Comput Biol 2013; 9:e1003353. [PMID: 24244149 PMCID: PMC3828135 DOI: 10.1371/journal.pcbi.1003353] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2013] [Accepted: 10/05/2013] [Indexed: 12/27/2022] Open
Abstract
Sequence logos are frequently used to illustrate substrate preferences and specificity of proteases. Here, we employed the compiled substrates of the MEROPS database to introduce a novel metric for comparison of protease substrate preferences. The constructed similarity matrix of 62 proteases can be used to intuitively visualize similarities in protease substrate readout via principal component analysis and construction of protease specificity trees. Since our new metric is solely based on substrate data, we can engraft the protease tree including proteolytic enzymes of different evolutionary origin. Thereby, our analyses confirm pronounced overlaps in substrate recognition not only between proteases closely related on sequence basis but also between proteolytic enzymes of different evolutionary origin and catalytic type. To illustrate the applicability of our approach we analyze the distribution of targets of small molecules from the ChEMBL database in our substrate-based protease specificity trees. We observe a striking clustering of annotated targets in tree branches even though these grouped targets do not necessarily share similarity on protein sequence level. This highlights the value and applicability of knowledge acquired from peptide substrates in drug design of small molecules, e.g., for the prediction of off-target effects or drug repurposing. Consequently, our similarity metric allows to map the degradome and its associated drug target network via comparison of known substrate peptides. The substrate-driven view of protein-protein interfaces is not limited to the field of proteases but can be applied to any target class where a sufficient amount of known substrate data is available.
Collapse
Affiliation(s)
- Julian E. Fuchs
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Susanne von Grafenstein
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Roland G. Huber
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Christian Kramer
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
| | - Klaus R. Liedl
- Institute of General, Inorganic and Theoretical Chemistry, and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, Innsbruck, Austria
- * E-mail:
| |
Collapse
|