1
|
Agyapong O, Miller WA, Wilson MD, Kwofie SK. Development of a proteochemometric-based support vector machine model for predicting bioactive molecules of tubulin receptors. Mol Divers 2021; 26:2231-2242. [PMID: 34626303 DOI: 10.1007/s11030-021-10329-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 09/23/2021] [Indexed: 11/26/2022]
Abstract
Microtubules are receiving enormous interest in drug discovery due to the important roles they play in cellular functions. Targeting tubulin polymerization presents an excellent opportunity for the development of anti-tubulin drugs. Drug resistance and high toxicity of currently used tubulin-binding agents have necessitated the pursuit of novel drug candidates with increased therapeutic potency. The design of novel drug candidates can be achieved using efficient computational techniques to support existing efforts. Proteochemometric (PCM) modeling is a computational technique that can be employed to elucidate the bioactivity relations between related targets and multiple ligands. We have developed a PCM-based Support Vector Machine (SVM) approach for predicting the bioactivity between tubulin receptors and small, drug-like molecules. The bioactivity datasets used for training the SVM algorithm were obtained from the Binding DB database. The SVM-based PCM model yielded a good overall predictive performance with an area under the curve (AUC) of 87%, Matthews correlation coefficient (MCC) of 72%, overall accuracy of 93%, and a classification error of 7%. The algorithm allows the prediction of the likelihood of new interactions based on confidence scores between the query datasets, comprising ligands in SMILES format and protein sequences of tubulin targets. The algorithm has been implemented as a web server known as TubPred, accessible via http://35.167.90.225:5000/ .
Collapse
Affiliation(s)
- Odame Agyapong
- Department of Biomedical Engineering, School of Engineering Sciences, College of Basic and Applied Sciences, University of Ghana, PMB LG 77, Legon, Accra, Ghana
- Department of Parasitology, Noguchi Memorial Institute for Medical Research (NMIMR), College of Health Sciences (CHS), University of Ghana, P.O. Box LG 581, Legon, Accra, Ghana
| | - Whelton A Miller
- Department of Medicine, Loyola University Medical Center, Maywood, IL, 60153, USA
- School of Engineering and Applied Science, Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Department of Molecular Pharmacology and Neuroscience, Loyola University Medical Center, Maywood, IL, 60153, USA
| | - Michael D Wilson
- Department of Parasitology, Noguchi Memorial Institute for Medical Research (NMIMR), College of Health Sciences (CHS), University of Ghana, P.O. Box LG 581, Legon, Accra, Ghana
- Department of Medicine, Loyola University Medical Center, Maywood, IL, 60153, USA
| | - Samuel K Kwofie
- Department of Biomedical Engineering, School of Engineering Sciences, College of Basic and Applied Sciences, University of Ghana, PMB LG 77, Legon, Accra, Ghana.
- West African Centre for Cell Biology of Infectious Pathogens, Department of Biochemistry, Cell and Molecular Biology, College of Basic and Applied Sciences, University of Ghana, Accra, Ghana.
| |
Collapse
|
2
|
He SB, Ben Hu, Kuang ZK, Wang D, Kong DX. Predicting Subtype Selectivity for Adenosine Receptor Ligands with Three-Dimensional Biologically Relevant Spectrum (BRS-3D). Sci Rep 2016; 6:36595. [PMID: 27812030 PMCID: PMC5095671 DOI: 10.1038/srep36595] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 10/18/2016] [Indexed: 02/02/2023] Open
Abstract
Adenosine receptors (ARs) are potential therapeutic targets for Parkinson’s disease, diabetes, pain, stroke and cancers. Prediction of subtype selectivity is therefore important from both therapeutic and mechanistic perspectives. In this paper, we introduced a shape similarity profile as molecular descriptor, namely three-dimensional biologically relevant spectrum (BRS-3D), for AR selectivity prediction. Pairwise regression and discrimination models were built with the support vector machine methods. The average determination coefficient (r2) of the regression models was 0.664 (for test sets). The 2B-3 (A2Bvs A3) model performed best with q2 = 0.769 for training sets (10-fold cross-validation), and r2 = 0.766, RMSE = 0.828 for test sets. The models’ robustness and stability were validated with 100 times resampling and 500 times Y-randomization. We compared the performance of BRS-3D with 3D descriptors calculated by MOE. BRS-3D performed as good as, or better than, MOE 3D descriptors. The performances of the discrimination models were also encouraging, with average accuracy (ACC) 0.912 and MCC 0.792 (test set). The 2A-3 (A2Avs A3) selectivity discrimination model (ACC = 0.882 and MCC = 0.715 for test set) outperformed an earlier reported one (ACC = 0.784). These results demonstrated that, through multiple conformation encoding, BRS-3D can be used as an effective molecular descriptor for AR subtype selectivity prediction.
Collapse
Affiliation(s)
- Song-Bing He
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China.,College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - Ben Hu
- Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Zheng-Kun Kuang
- Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| | - Dong Wang
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
| | - De-Xin Kong
- State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan 430070, China.,Agricultural Bioinformatics Key Laboratory of Hubei Province, College of Informatics, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
3
|
Ahmed HEA, Zayed MF, Ihmaid S. Molecular pharmacophore selectivity studies, virtual screening, and in silico ADMET analysis of GPCR antagonists. Med Chem Res 2015. [DOI: 10.1007/s00044-015-1389-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
4
|
Hu Y, Bajorath J. Follow up: Compound data sets and software tools for chemoinformatics and medicinal chemistry applications: update and data transfer. F1000Res 2014; 3:69. [PMID: 25520777 PMCID: PMC4264635 DOI: 10.12688/f1000research.3713.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/07/2014] [Indexed: 12/12/2022] Open
Abstract
In 2012, we reported 30 compound data sets and/or programs developed in our laboratory in a data article and made them freely available to the scientific community to support chemoinformatics and computational medicinal chemistry applications. These data sets and computational tools were provided for download from our website. Since publication of this data article, we have generated 13 new data sets with which we further extend our collection of publicly available data and tools. Due to changes in web servers and website architectures, data accessibility has recently been limited at times. Therefore, we have also transferred our data sets and tools to a public repository to ensure full and stable accessibility. To aid in data selection, we have classified the data sets according to scientific subject areas. Herein, we describe new data sets, introduce the data organization scheme, summarize the database content and provide detailed access information in ZENODO (doi: 10.5281/zenodo.8451 and doi:10.5281/zenodo.8455).
Collapse
Affiliation(s)
- Ye Hu
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms University, Bonn, D-53113, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms University, Bonn, D-53113, Germany
| |
Collapse
|
5
|
Hu Y, Bajorath J. Freely available compound data sets and software tools for chemoinformatics and computational medicinal chemistry applications. F1000Res 2012; 1:11. [PMID: 24358818 PMCID: PMC3782340 DOI: 10.12688/f1000research.1-11.v1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/07/2012] [Indexed: 01/22/2023] Open
Abstract
We have generated a number of compound data sets and programs for different types of applications in pharmaceutical research. These data sets and programs were originally designed for our research projects and are made publicly available. Without consulting original literature sources, it is difficult to understand specific features of data sets and software tools, basic ideas underlying their design, and applicability domains. Currently, 30 different entries are available for download from our website. In this data article, we provide an overview of the data and tools we make available and designate the areas of research for which they should be useful. For selected data sets and methods/programs, detailed descriptions are given. This article should help interested readers to select data and tools for specific computational investigations.
Collapse
Affiliation(s)
- Ye Hu
- Department of Life Science Informatics, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr, Bonn, D-53113, Germany
| | - Jurgen Bajorath
- Department of Life Science Informatics, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr, Bonn, D-53113, Germany
| |
Collapse
|
6
|
Ning X, Walters M, Karypisxy G. Improved Machine Learning Models for Predicting Selective Compounds. J Chem Inf Model 2011; 52:38-50. [DOI: 10.1021/ci200346b] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Xia Ning
- Department of Computer Science & Engineering and ‡College of Pharmacy, University of Minnesota, Twin Cities, Minneapolis, Minnesota 55455, United States
| | - Michael Walters
- Department of Computer Science & Engineering and ‡College of Pharmacy, University of Minnesota, Twin Cities, Minneapolis, Minnesota 55455, United States
| | - George Karypisxy
- Department of Computer Science & Engineering and ‡College of Pharmacy, University of Minnesota, Twin Cities, Minneapolis, Minnesota 55455, United States
| |
Collapse
|
7
|
Stumpfe D, Bajorath J. Applied Virtual Screening: Strategies, Recommendations, and Caveats. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2011. [DOI: 10.1002/9783527633326.ch11] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
8
|
Stumpfe D, Lounkine E, Bajorath J. Molecular test systems for computational selectivity studies and systematic analysis of compound selectivity profiles. Methods Mol Biol 2011; 672:503-15. [PMID: 20838982 DOI: 10.1007/978-1-60761-839-3_20] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
For chemical genetics and chemical biology, an important task is the identification of small molecules that are selective against individual targets and can be used as molecular probes for specific biological functions. To aid in the development of computational methods for selectivity analysis, molecular benchmark systems have been developed that capture compound selectivity data for pairs of targets. These molecular test systems are utilized for "selectivity searching" and the analysis of structure-selectivity relationships. Going beyond binary selectivity sets focusing on target pairs, a methodological framework, Molecular Formal Concept Analysis (MolFCA), is described for the definition and systematic mining of compound selectivity profiles.
Collapse
Affiliation(s)
- Dagmar Stumpfe
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Bonn, Germany
| | | | | |
Collapse
|
9
|
Geppert H, Vogt M, Bajorath J. Current trends in ligand-based virtual screening: molecular representations, data mining methods, new application areas, and performance evaluation. J Chem Inf Model 2010; 50:205-16. [PMID: 20088575 DOI: 10.1021/ci900419k] [Citation(s) in RCA: 231] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Hanna Geppert
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universitat, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | | | | |
Collapse
|
10
|
Wassermann AM, Vogt M, Bajorath J. Iterative Shannon Entropy - a Methodology to Quantify the Information Content of Value Range Dependent Data Distributions. Application to Descriptor and Compound Selectivity Profiling. Mol Inform 2010; 29:432-40. [PMID: 27463198 DOI: 10.1002/minf.201000029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2010] [Accepted: 04/09/2010] [Indexed: 11/06/2022]
Abstract
We introduce an entropy-based methodology, Iterative Shannon entropy (ISE), to quantify the information contained in molecular descriptors and compound selectivity data sets taking data spread directly into account. The method is applicable to determine the information content of any value range dependent data distribution. An analysis of descriptor information content has been carried out to explore alternative binning schemes for entropy calculation. Using this entropic measure we have profiled 153 compound selectivity data sets for combinations of 68 target proteins belonging to 10 target families. With the ISE measure, we aim to assign high information content to compound data sets that span a wide range of selectivity values and different selectivity relationships and hence correspond to more than one biological phenotype. Target families with high average entropy scores are identified. For members of these families, active compounds display highly differentiated selectivity profiles.
Collapse
Affiliation(s)
- Anne Mai Wassermann
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, D-53113 Bonn phone/fax: +49-228-2699-306/341
| | - Martin Vogt
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, D-53113 Bonn phone/fax: +49-228-2699-306/341
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, D-53113 Bonn phone/fax: +49-228-2699-306/341.
| |
Collapse
|
11
|
Ahmed HEA, Bajorath J. Methods for Computer-Aided Chemical Biology. Part 5: Rationalizing the Selectivity of Cathepsin Inhibitors on the Basis of Molecular Fragments and Topological Feature Distributions. Chem Biol Drug Des 2009; 74:129-41. [DOI: 10.1111/j.1747-0285.2009.00848.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
12
|
Vogt M, Nisius B, Bajorath J. Predicting the similarity search performance of fingerprints and their combination with molecular property descriptors using probabilistic and information theoretic modeling. Stat Anal Data Min 2009. [DOI: 10.1002/sam.10035] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
13
|
Geppert H, Humrich J, Stumpfe D, Gärtner T, Bajorath J. Ligand prediction from protein sequence and small molecule information using support vector machines and fingerprint descriptors. J Chem Inf Model 2009; 49:767-79. [PMID: 19309114 DOI: 10.1021/ci900004a] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Support vector machine (SVM) database search strategies are presented that aim at the identification of small molecule ligands for targets for which no ligand information is currently available. In pharmaceutical research and chemical biology, this situation is faced, for example, when studying orphan targets or newly identified members of protein families. To investigate methods for de novo ligand identification in the absence of known three-dimensional target structures or active molecules, we have focused on combining sequence and ligand information for closely and distantly related proteins. To provide a basis for these investigations, a set of 11 protease targets from different families was assembled together with more than 2000 inhibitors directed against individual proteases. We have compared SVM approaches that combine protein sequence and ligand information in different ways and utilize 2D fingerprints as ligand descriptors. These methodologies were applied to search for inhibitors of individual proteases not taken into account during learning. A target sequence-ligand kernel and, in particular, a linear combination of multiple target-directed SVMs consistently identified inhibitors with high accuracy including test cases where homology-based similarity searching using data fusion and conventional SVM ranking nearly or completely failed. The SVM linear combination and target-ligand kernel methods described herein are intuitive and straightforward to adopt for ligand prediction against other targets.
Collapse
Affiliation(s)
- Hanna Geppert
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universitat Bonn, Dahlmannstr. 2, D-53113 Bonn, Germany
| | | | | | | | | |
Collapse
|
14
|
Stumpfe D, Geppert H, Bajorath J. Analysis of structure-selectivity relationships through single- or dual step selectivity searching using 2D molecular fingerprints. Chem Cent J 2009. [DOI: 10.1186/1752-153x-3-s1-p4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
|
15
|
Wassermann AM, Geppert H, Bajorath J. Searching for target-selective compounds using different combinations of multiclass support vector machine ranking methods, kernel functions, and fingerprint descriptors. J Chem Inf Model 2009; 49:582-92. [PMID: 19249858 DOI: 10.1021/ci800441c] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The identification of small chemical compounds that are selective for a target protein over one or more closely related members of the same family is of high relevance for applications in chemical biology. Conventional 2D similarity searching using known selective molecules as templates has recently been found to preferentially detect selective over non-selective and inactive database compounds. To improve the initially observed search performance, we have attempted to use 2D fingerprints as descriptors for support vector machine (SVM)-based selectivity searching. Different from typically applied binary SVM compound classification, SVM analysis has been adapted here for multiclass predictions and compound ranking to distinguish between selective, active but non-selective, and inactive compounds. In systematic database search calculations, we tested combinations of four alternative SVM ranking schemes, four different kernel functions, and four fingerprints and were able to further improve selectivity search performance by effectively removing non-selective molecules from high ranking positions while retaining high recall of selective compounds.
Collapse
Affiliation(s)
- Anne Mai Wassermann
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| | | | | |
Collapse
|
16
|
Lounkine E, Stumpfe D, Bajorath J. Molecular Formal Concept Analysis for Compound Selectivity Profiling in Biologically Annotated Databases. J Chem Inf Model 2009; 49:1359-68. [DOI: 10.1021/ci900095v] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Eugen Lounkine
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Dagmar Stumpfe
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstrasse 2, D-53113 Bonn, Germany
| |
Collapse
|
17
|
Ahmed HEA, Geppert H, Stumpfe D, Lounkine E, Bajorath J. Methods for Computer-Aided Chemical Biology. Part 4: Selectivity Searching for Ion Channel Ligands and Mapping of Molecular Fragments as Selectivity Markers. Chem Biol Drug Des 2009; 73:273-82. [DOI: 10.1111/j.1747-0285.2009.00784.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
18
|
Stumpfe D, Frizler M, Sisay M, Batista J, Vogt I, Gütschow M, Bajorath J. Hit Expansion through Computational Selectivity Searching. ChemMedChem 2009; 4:52-4. [DOI: 10.1002/cmdc.200800304] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Dagmar Stumpfe
- Department of Life Science Informatics, Bonn–Aachen International Center for Information Technology, Rheinische Friedrich‐Wilhelms‐Universität Bonn, Dahlmannstr. 2, 53113 Bonn (Germany), Fax: (+49) 228‐2699‐341
| | - Maxim Frizler
- Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich‐Wilhelms‐Universität Bonn, An der Immenburg 4, 53121 Bonn (Germany)
| | - Mihiret T. Sisay
- Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich‐Wilhelms‐Universität Bonn, An der Immenburg 4, 53121 Bonn (Germany)
| | - José Batista
- Department of Life Science Informatics, Bonn–Aachen International Center for Information Technology, Rheinische Friedrich‐Wilhelms‐Universität Bonn, Dahlmannstr. 2, 53113 Bonn (Germany), Fax: (+49) 228‐2699‐341
| | - Ingo Vogt
- Department of Life Science Informatics, Bonn–Aachen International Center for Information Technology, Rheinische Friedrich‐Wilhelms‐Universität Bonn, Dahlmannstr. 2, 53113 Bonn (Germany), Fax: (+49) 228‐2699‐341
| | - Michael Gütschow
- Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich‐Wilhelms‐Universität Bonn, An der Immenburg 4, 53121 Bonn (Germany)
| | - Jürgen Bajorath
- Department of Life Science Informatics, Bonn–Aachen International Center for Information Technology, Rheinische Friedrich‐Wilhelms‐Universität Bonn, Dahlmannstr. 2, 53113 Bonn (Germany), Fax: (+49) 228‐2699‐341
| |
Collapse
|
19
|
Robert S, Raikhel NV, Hicks GR. Powerful partners: Arabidopsis and chemical genomics. THE ARABIDOPSIS BOOK 2009; 7:e0109. [PMID: 22303245 PMCID: PMC3243329 DOI: 10.1199/tab.0109] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Chemical genomics (i.e. genomics scale chemical genetics) approaches capitalize on the ability of low molecular mass molecules to modify biological processes. Such molecules are used to modify the activity of a protein or a pathway in a manner that it is tunable and reversible. Bioactive chemicals resulting from forward or reverse chemical screens can be useful in understanding and dissecting complex biological processes due to the essentially limitless variation in structure and activities inherent in chemical space. A major advantage of this approach as a powerful addition to conventional plant genetics is the fact that chemical genomics can address loss-of-function lethality and redundancy. Furthermore, the ability of chemicals to be added at will and to act quickly can permit the study of processes that are highly dynamic such as endomembrane trafficking. An important aspect of utilizing small molecules effectively is to characterize bioactive chemicals in detail including an understanding of structure-activity relationships and the identification of active and inactive analogs. Bioactive chemicals can be useful as reagents to probe biological pathways directly. However, the identification of cognate targets and their pathways is also informative and can be achieved by screens for genetic resistance or hypersensitivity in Arabidopsis thaliana or other organisms from which the results can be translated to plants. In addition, there are approaches utilizing "tagged" chemical libraries that possess reactive moieties permitting the immobilization of active compounds. This opens the possibility for biochemical purification of putative cognate targets. We will review approaches to screen for bioactive chemicals that affect biological processes in Arabidopsis and provide several examples of the power and challenges inherent in this new approach in plant biology.
Collapse
Affiliation(s)
- Stéphanie Robert
- Center for Plant Cell Biology & Department of Botany and Plant Sciences, University of California, Riverside, CA 92521
- Current address: VIB Department of Plant Systems Biology, University of Ghent, 9052 Ghent, Belgium
| | - Natasha V. Raikhel
- Center for Plant Cell Biology & Department of Botany and Plant Sciences, University of California, Riverside, CA 92521
| | - Glenn R. Hicks
- Center for Plant Cell Biology & Department of Botany and Plant Sciences, University of California, Riverside, CA 92521
- Address correspondence to
| |
Collapse
|
20
|
Bajorath J. Computational approaches in chemogenomics and chemical biology: current and future impact on drug discovery. Expert Opin Drug Discov 2008; 3:1371-6. [DOI: 10.1517/17460440802536496] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
21
|
Lounkine E, Auer J, Bajorath J. Formal concept analysis for the identification of molecular fragment combinations specific for active and highly potent compounds. J Med Chem 2008; 51:5342-8. [PMID: 18698757 DOI: 10.1021/jm800515r] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We introduce fragment formal concept analysis (FragFCA) to study complex relationships between fragments in active compounds taking potency information into account. Fragment combinations that are unique to active or highly potent compounds or that are shared by molecules having different or overlapping activity profiles are systematically identified using chemically intuitive queries of varying complexity. The methodology is applied to analyze fragment distributions in antagonists of seven G protein coupled receptor targets and identify signature fragments. Pairs or triplets of molecular fragments are found to be most specific for different activity profiles and compound potency levels. In addition, we demonstrate the ability of FragFCA to identify selective hits in high-throughput screening data sets.
Collapse
Affiliation(s)
- Eugen Lounkine
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | | | | |
Collapse
|
22
|
Stumpfe D, Geppert H, Bajorath J. Methods for Computer-Aided Chemical Biology. Part 3: Analysis of Structure–Selectivity Relationships through Single- or Dual-Step Selectivity Searching and Bayesian Classification. Chem Biol Drug Des 2008; 71:518-28. [DOI: 10.1111/j.1747-0285.2008.00670.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
23
|
Bajorath J. Computational analysis of ligand relationships within target families. Curr Opin Chem Biol 2008; 12:352-8. [PMID: 18312862 DOI: 10.1016/j.cbpa.2008.01.044] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2007] [Accepted: 01/31/2008] [Indexed: 11/16/2022]
Abstract
Computational tools for the large-scale analysis and prediction of ligand-target interactions and the identification of small molecules having different selectivity profiles within target protein families complement research in chemical genetics and chemogenomics. For computational analysis and design, such tasks require a departure from the traditional focus on single targets, hit identification, and lead optimization. Recently, studies have been reported that profile compounds in silico against arrays of targets or systematically map ligand-target space. In order to identify small molecular probes that are suitable for chemical genetics applications, molecular diversity needs to be viewed in a way that partly differs from principles guiding conventional library design.
Collapse
Affiliation(s)
- Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Bonn, Germany.
| |
Collapse
|
24
|
Vogt I, Ahmed HEA, Auer J, Bajorath J. Exploring structure-selectivity relationships of biogenic amine GPCR antagonists using similarity searching and dynamic compound mapping. Mol Divers 2008; 12:25-40. [PMID: 18317941 DOI: 10.1007/s11030-008-9071-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2007] [Accepted: 02/05/2008] [Indexed: 11/28/2022]
Abstract
We design and analyze compound selectivity sets of antagonists with differential selectivity against seven biogenic amine G-protein coupled receptors. The selectivity sets consist of a total of 267 antagonists and contain a spectrum of in part closely related molecular scaffolds. Each set represents a different selectivity profile. Using these com- pound sets, a systematic computational analysis of structure-selectivity relationships is carried out with different 2D similarity methods including fingerprints, recursive partitioning, clustering, and dynamic compound mapping. Screening calculations are performed in a background database containing nearly four million molecules. Fingerprint searching and compound mapping are found to enrich target-selective antagonists over family-selective ones. Dynamic compound mapping effectively discriminates database compounds from GPCR antagonists and consistently retains target-selective antagonists during the final dimension extension levels. Furthermore, the widely used MACCS key fingerprint displays a strong tendency to distinguish between target- and family-selective GPCR antagonists. Taken together, the results indicate that different types of 2D similarity methods are capable of distinguishing closely related molecules having different selectivity. The reported compound benchmark system is made freely available in order to enable selectivity-oriented analyses using other computational approaches.
Collapse
Affiliation(s)
- Ingo Vogt
- Department of Life Science Informatics, B-IT, LIMES Institute, Rheinische Friedrich-Wilhelms-Universität Bonn, Dahlmannstr. 2, 53113, Bonn, Germany
| | | | | | | |
Collapse
|