1
|
Remington JM, McKay KT, Beckage NB, Ferrell JB, Schneebeli ST, Li J. GPCRLigNet: rapid screening for GPCR active ligands using machine learning. J Comput Aided Mol Des 2023; 37:147-156. [PMID: 36840893 PMCID: PMC10379640 DOI: 10.1007/s10822-023-00497-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 02/03/2023] [Indexed: 02/26/2023]
Abstract
Molecules with bioactivity towards G protein-coupled receptors represent a subset of the vast space of small drug-like molecules. Here, we compare machine learning models, including dilated graph convolutional networks, that conduct binary classification to quickly identify molecules with activity towards G protein-coupled receptors. The models are trained and validated using a large set of over 600,000 active, inactive, and decoy compounds. The best performing machine learning model, dubbed GPCRLigNet, was a surprisingly simple feedforward dense neural network mapping from Morgan fingerprints to activity. Incorporation of GPCRLigNet into a high-throughput virtual screening workflow is demonstrated with molecular docking towards a particular G protein-coupled receptor, the pituitary adenylate cyclase-activating polypeptide receptor type 1. Through rigorous comparison of docking scores for molecules selected with and without using GPCRLigNet, we demonstrate an enrichment of potentially potent molecules using GPCRLigNet. This work provides a proof of principle that GPCRLigNet can effectively hone the chemical search space towards ligands with G protein-coupled receptor activity.
Collapse
Affiliation(s)
- Jacob M Remington
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA
| | - Kyle T McKay
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA
| | - Noah B Beckage
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA
| | - Jonathon B Ferrell
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA
| | - Severin T Schneebeli
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA.,Department of Industrial and Physical Pharmacy, Department of Chemistry, Purdue University, West Lafayette, IN, 47906, USA.,Department of Pathology, University of Vermont, Burlington, VT, 05405, USA
| | - Jianing Li
- Department of Chemistry, University of Vermont, Burlington, VT, 05405, USA. .,Department of Pathology, University of Vermont, Burlington, VT, 05405, USA. .,Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, IN, 47906, USA.
| |
Collapse
|
2
|
Binding site identification of G protein-coupled receptors through a 3D Zernike polynomials-based method: application to C. elegans olfactory receptors. J Comput Aided Mol Des 2022; 36:11-24. [PMID: 34977999 PMCID: PMC8831295 DOI: 10.1007/s10822-021-00434-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 11/18/2021] [Indexed: 11/01/2022]
Abstract
Studying the binding processes of G protein-coupled receptors (GPCRs) proteins is of particular interest both to better understand the molecular mechanisms that regulate the signaling between the extracellular and intracellular environment and for drug design purposes. In this study, we propose a new computational approach for the identification of the binding site for a specific ligand on a GPCR. The method is based on the Zernike polynomials and performs the ligand-GPCR association through a shape complementarity analysis of the local molecular surfaces. The method is parameter-free and it can distinguish, working on hundreds of experimentally GPCR-ligand complexes, binding pockets from randomly sampled regions on the receptor surface, obtaining an Area Under ROC curve of 0.77. Given its importance both as a model organism and in terms of applications, we thus investigated the olfactory receptors of the C. elegans, building a list of associations between 21 GPCRs belonging to its olfactory neurons and a set of possible ligands. Thus, we can not only carry out rapid and efficient screenings of drugs proposed for GPCRs, key targets in many pathologies, but also we laid the groundwork for computational mutagenesis processes, aimed at increasing or decreasing the binding affinity between ligands and receptors.
Collapse
|
3
|
Applicability Domain of Active Learning in Chemical Probe Identification: Convergence in Learning from Non-Specific Compounds and Decision Rule Clarification. Molecules 2019; 24:molecules24152716. [PMID: 31357419 PMCID: PMC6696588 DOI: 10.3390/molecules24152716] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 07/19/2019] [Accepted: 07/24/2019] [Indexed: 12/27/2022] Open
Abstract
Efficient identification of chemical probes for the manipulation and understanding of biological systems demands specificity for target proteins. Computational means to optimize candidate compound selection for experimental selectivity evaluation are being sought. The active learning virtual screening method has demonstrated the ability to efficiently converge on predictive models with reduced datasets, though its applicability domain to probe identification has yet to be determined. In this article, we challenge active learning’s ability to predict inhibitory bioactivity profiles of selective compounds when learning from chemogenomic features found in non-selective ligand-target pairs. Comparison of controls versus multiple molecule representations de-convolutes factors contributing to predictive capability. Experiments using the matrix metalloproteinase family demonstrate maximum probe bioactivity prediction achieved from only approximately 20% of non-probe bioactivity; this data volume is consistent with prior chemogenomic active learning studies despite the increased difficulty from chemical biology experimental settings used here. Feature weight analyses are combined with a custom visualization to unambiguously detail how active learning arrives at classification decisions, yielding clarified expectations for chemogenomic modeling. The results influence tactical decisions for computational probe design and discovery.
Collapse
|
4
|
Prediction of GPCR-Ligand Binding Using Machine Learning Algorithms. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2018; 2018:6565241. [PMID: 29666662 PMCID: PMC5831789 DOI: 10.1155/2018/6565241] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Revised: 11/27/2017] [Accepted: 12/18/2017] [Indexed: 01/18/2023]
Abstract
We propose a novel method that predicts binding of G-protein coupled receptors (GPCRs) and ligands. The proposed method uses hub and cycle structures of ligands and amino acid motif sequences of GPCRs, rather than the 3D structure of a receptor or similarity of receptors or ligands. The experimental results show that these new features can be effective in predicting GPCR-ligand binding (average area under the curve [AUC] of 0.944), because they are thought to include hidden properties of good ligand-receptor binding. Using the proposed method, we were able to identify novel ligand-GPCR bindings, some of which are supported by several studies.
Collapse
|
5
|
Schneider P, Schneider G. A Computational Method for Unveiling the Target Promiscuity of Pharmacologically Active Compounds. Angew Chem Int Ed Engl 2017; 56:11520-11524. [DOI: 10.1002/anie.201706376] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Revised: 07/07/2017] [Indexed: 12/17/2022]
Affiliation(s)
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences; Swiss Federal Institute of Technology (ETH); Vladimir-Prelog-Weg 4 8093 Zurich Switzerland
- inSili.com LLC; Segantinisteig 3 8049 Zurich Switzerland
| |
Collapse
|
6
|
Schneider P, Schneider G. A Computational Method for Unveiling the Target Promiscuity of Pharmacologically Active Compounds. Angew Chem Int Ed Engl 2017. [DOI: 10.1002/ange.201706376] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences; Swiss Federal Institute of Technology (ETH); Vladimir-Prelog-Weg 4 8093 Zurich Switzerland
- inSili.com LLC; Segantinisteig 3 8049 Zurich Switzerland
| |
Collapse
|
7
|
Abstract
Aim: Computational chemogenomics models the compound–protein interaction space, typically for drug discovery, where existing methods predominantly either incorporate increasing numbers of bioactivity samples or focus on specific subfamilies of proteins and ligands. As an alternative to modeling entire large datasets at once, active learning adaptively incorporates a minimum of informative examples for modeling, yielding compact but high quality models. Results/methodology: We assessed active learning for protein/target family-wide chemogenomic modeling by replicate experiment. Results demonstrate that small yet highly predictive models can be extracted from only 10–25% of large bioactivity datasets, irrespective of molecule descriptors used. Conclusion: Chemogenomic active learning identifies small subsets of ligand–target interactions in a large screening database that lead to knowledge discovery and highly predictive models.
Collapse
|
8
|
Schneider G, Schneider P. Macromolecular target prediction by self-organizing feature maps. Expert Opin Drug Discov 2016; 12:271-277. [DOI: 10.1080/17460441.2017.1274727] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Gisbert Schneider
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| | - Petra Schneider
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
- inSili.com LLC, Zurich, Switzerland
| |
Collapse
|
9
|
Gawehn E, Hiss JA, Schneider G. Deep Learning in Drug Discovery. Mol Inform 2015; 35:3-14. [PMID: 27491648 DOI: 10.1002/minf.201501008] [Citation(s) in RCA: 309] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 12/01/2015] [Indexed: 12/18/2022]
Abstract
Artificial neural networks had their first heyday in molecular informatics and drug discovery approximately two decades ago. Currently, we are witnessing renewed interest in adapting advanced neural network architectures for pharmaceutical research by borrowing from the field of "deep learning". Compared with some of the other life sciences, their application in drug discovery is still limited. Here, we provide an overview of this emerging field of molecular informatics, present the basic concepts of prominent deep learning methods and offer motivation to explore these techniques for their usefulness in computer-assisted drug discovery and design. We specifically emphasize deep neural networks, restricted Boltzmann machine networks and convolutional networks.
Collapse
Affiliation(s)
- Erik Gawehn
- Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, CH-8093 Zurich, Switzerland, Fax: +41 44 633 13 79, Tel: +41 44 633 74 38
| | - Jan A Hiss
- Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, CH-8093 Zurich, Switzerland, Fax: +41 44 633 13 79, Tel: +41 44 633 74 38
| | - Gisbert Schneider
- Swiss Federal Institute of Technology (ETH), Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, CH-8093 Zurich, Switzerland, Fax: +41 44 633 13 79, Tel: +41 44 633 74 38.
| |
Collapse
|
10
|
Koch CP, Perna AM, Weissmüller S, Bauer S, Pillong M, Baleeiro RB, Reutlinger M, Folkers G, Walden P, Wrede P, Hiss JA, Waibler Z, Schneider G. Exhaustive proteome mining for functional MHC-I ligands. ACS Chem Biol 2013; 8:1876-81. [PMID: 23772559 DOI: 10.1021/cb400252t] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
We present the development and application of a new machine-learning approach to exhaustively and reliably identify major histocompatibility complex class I (MHC-I) ligands among all 20(8) octapeptides and in genome-derived proteomes of Mus musculus , influenza A H3N8, and vesicular stomatitis virus (VSV). Focusing on murine H-2K(b), we identified potent octapeptides exhibiting direct MHC-I binding and stabilization on the surface of TAP-deficient RMA-S cells. Computationally identified VSV-derived peptides induced CD8(+) T-cell proliferation after VSV-infection of mice. The study demonstrates that high-level machine-learning models provide a unique access to rationally designed peptides and a promising approach toward "reverse vaccinology".
Collapse
Affiliation(s)
- Christian P. Koch
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| | - Anna M. Perna
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| | | | - Stefanie Bauer
- Paul-Ehrlich-Institut, Paul-Ehrlich-Str. 51-59, 63225
Langen, Germany
| | - Max Pillong
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| | - Renato B. Baleeiro
- Charité - Universitätsmedizin Berlin, Department of Dermatology, Venerology and Allergology, Charitéplatz 1, 10117 Berlin,
Germany
| | - Michael Reutlinger
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| | - Gerd Folkers
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
- Collegium Helveticum, Schmelzbergstr. 25, 8092 Zürich,
Switzerland
| | - Peter Walden
- Charité - Universitätsmedizin Berlin, Department of Dermatology, Venerology and Allergology, Charitéplatz 1, 10117 Berlin,
Germany
| | - Paul Wrede
- Charité - Universitätsmedizin
Berlin, Molecular Biology and Bioinformatics, Campus Benjamin Franklin,
Arnimallee 22, 14195 Berlin, Germany
| | - Jan A. Hiss
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| | - Zoe Waibler
- Paul-Ehrlich-Institut, Paul-Ehrlich-Str. 51-59, 63225
Langen, Germany
| | - Gisbert Schneider
- Department of Chemistry and
Applied Biosciences, Eidgenössische Technische Hochschule (ETH), Wolfgang-Pauli-Str. 10, 8093 Zürich,
Switzerland
| |
Collapse
|
11
|
Geppert T, Reisen F, Pillong M, Hähnke V, Tanrikulu Y, Koch CP, Perna AM, Perez TB, Schneider P, Schneider G. Virtual screening for compounds that mimic protein-protein interface epitopes. J Comput Chem 2011; 33:573-9. [DOI: 10.1002/jcc.22894] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2011] [Accepted: 10/03/2011] [Indexed: 12/22/2022]
|
12
|
Jäger N, Wisniewska JM, Hiss JA, Freier A, Losch FO, Walden P, Wrede P, Schneider G. Attractors in Sequence Space: Agent-Based Exploration of MHC I Binding Peptides. Mol Inform 2010; 29:65-74. [DOI: 10.1002/minf.200900008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2009] [Accepted: 12/10/2009] [Indexed: 11/10/2022]
|
13
|
The T1R2/T1R3 Sweet Receptor and TRPM5 Ion Channel. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2010; 91:151-208. [DOI: 10.1016/s1877-1173(10)91006-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
14
|
Tanrikulu Y, Proschak E, Werner T, Geppert T, Todoroff N, Klenner A, Kottke T, Sander K, Schneider E, Seifert R, Stark H, Clark T, Schneider G. Homology Model Adjustment and Ligand Screening with a Pseudoreceptor of the Human Histamine H4Receptor. ChemMedChem 2009; 4:820-7. [DOI: 10.1002/cmdc.200800443] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
15
|
Renner S, Hechenberger M, Noeske T, Böcker A, Jatzke C, Schmuker M, Parsons C, Weil T, Schneider G. Suche nach Wirkstoff-Grundgerüsten mit 3D-Pharmakophorhypothesen und Ensembles neuronaler Netze. Angew Chem Int Ed Engl 2007. [DOI: 10.1002/ange.200604125] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
16
|
Renner S, Hechenberger M, Noeske T, Böcker A, Jatzke C, Schmuker M, Parsons CG, Weil T, Schneider G. Searching for Drug Scaffolds with 3D Pharmacophores and Neural Network Ensembles. Angew Chem Int Ed Engl 2007; 46:5336-9. [PMID: 17604383 DOI: 10.1002/anie.200604125] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- Steffen Renner
- Chemical R&D, Medicinal Chemistry/Cheminformatics, Merz Pharmaceuticals GmbH, Eckenheimer Landstrasse 100, 60318 Frankfurt am Main, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Willett P. Enhancing the Effectiveness of Ligand-Based Virtual Screening Using Data Fusion. ACTA ACUST UNITED AC 2006. [DOI: 10.1002/qsar.200610084] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
18
|
Givehchi A, Bender A, Glen RC. Analysis of activity space by fragment fingerprints, 2D descriptors, and multitarget dependent transformation of 2D descriptors. J Chem Inf Model 2006; 46:1078-83. [PMID: 16711727 DOI: 10.1021/ci0500233] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The effect of multitarget dependent descriptor transformation on classification performance is explored in this work. To this end decision trees as well as neural net QSAR in combination with PLS were applied to predict the activity class of 5HT3 ligands, angiotensin converting enzyme inhibitors, 3-hydroxyl-3-methyl glutaryl coenzyme A reductase inhibitors, platelet activating factor antagonists, and thromboxane A2 antagonists. Physicochemical descriptors calculated by MOE and fragment-based descriptors (MOLPRINT 2D) were employed to generate descriptor vectors. In a subsequent step the physicochemical descriptor vectors were transformed to a lower dimensional space using multitarget dependent descriptor transformation. Cross-validation of the original physicochemical descriptors in combination with decision trees and neural net QSAR as well as cross-validation of PLS multitarget transformed descriptors with neural net QSAR were performed. For comparison this was repeated using fragment-based descriptors in combination with decision trees.
Collapse
Affiliation(s)
- Alireza Givehchi
- Institut für Organische Chemie und Chemische Biologie, Johann Wolfgang Goethe-Universität, Marie-Curie-Strasse 11, D-60439 Frankfurt, Germany.
| | | | | |
Collapse
|