1
|
Whitehouse AJ, Sanchez-Martinez M, Salehi SM, Kurbatova N, Dean E. Open-Source Approach to GPU-Accelerated Substructure Search. J Chem Inf Model 2024. [PMID: 39225069 DOI: 10.1021/acs.jcim.4c00679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]
Abstract
Chemical substructure search is a critical task in medicinal chemistry and small-molecule drug discovery, enabling the retrieval of molecules from databases based on specific chemical features. While systems exist for this purpose, the challenge of efficient and swift searching persists, particularly as data storage migrates to the cloud, introducing new complexities. This study provides a comprehensive analysis of chemical substructure searches, showcasing the benefits of graphics processing unit-accelerated fingerprint screening. The research highlights strategies for optimizing performance, making significant advancements in substructure searching, a pivotal aspect of drug discovery and molecular research. The accessible and scalable nature of the proposed approach makes it a valuable resource for scientists aiming to enhance their substructure search capabilities.
Collapse
Affiliation(s)
- Andrew J Whitehouse
- Zifo Technologies Ltd, Office 7, 37-39 Shakespeare Street, Southport, Merseyside PR8 5AB, U.K
| | | | - Seyedeh Maryam Salehi
- Zifo Technologies Ltd, Office 7, 37-39 Shakespeare Street, Southport, Merseyside PR8 5AB, U.K
| | - Natalja Kurbatova
- Zifo Technologies Ltd, Office 7, 37-39 Shakespeare Street, Southport, Merseyside PR8 5AB, U.K
| | - Euan Dean
- Zifo Technologies Ltd, Office 7, 37-39 Shakespeare Street, Southport, Merseyside PR8 5AB, U.K
| |
Collapse
|
2
|
Fang L, Li J, Zhao M, Tan L, Lou JG. Single-step retrosynthesis prediction by leveraging commonly preserved substructures. Nat Commun 2023; 14:2446. [PMID: 37117216 PMCID: PMC10147675 DOI: 10.1038/s41467-023-37969-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 03/31/2023] [Indexed: 04/30/2023] Open
Abstract
Retrosynthesis analysis is an important task in organic chemistry with numerous industrial applications. Previously, machine learning approaches employing natural language processing techniques achieved promising results in this task by first representing reactant molecules as strings and subsequently predicting reactant molecules using text generation or machine translation models. Chemists cannot readily derive useful insights from traditional approaches that rely largely on atom-level decoding in the string representations, because human experts tend to interpret reactions by analyzing substructures that comprise a molecule. It is well-established that some substructures are stable and remain unchanged in reactions. In this paper, we developed a substructure-level decoding model, where commonly preserved portions of product molecules were automatically extracted with a fully data-driven approach. Our model achieves improvement over previously reported models, and we demonstrate that its performance can be boosted further by enhancing the accuracy of these substructures. Analyzing substructures extracted from our machine learning model can provide human experts with additional insights to assist decision-making in retrosynthesis analysis.
Collapse
Affiliation(s)
- Lei Fang
- Microsoft Research Asia, No.5 Dan Ling Street, Beijing, China.
| | - Junren Li
- College of Chemistry and Molecular Engineering, Peking University, No.5 Yiheyuan Road, Beijing, China
| | - Ming Zhao
- IPS, Waseda University, 2-7 Hibikino, Wakamatsu-ku, Kitakyushu-shi, Fukuoka, 808-0135, Japan
| | - Li Tan
- Mincui Therapeutix, No.1 Yongtaizhuang North Road, Beijing, China
| | - Jian-Guang Lou
- Microsoft Research Asia, No.5 Dan Ling Street, Beijing, China
| |
Collapse
|
3
|
Yang Y, Gao D, Xie X, Qin J, Li J, Lin H, Yan D, Deng K. DeepIDC: A Prediction Framework of Injectable Drug Combination Based on Heterogeneous Information and Deep Learning. Clin Pharmacokinet 2022; 61:1749-1759. [PMID: 36369328 DOI: 10.1007/s40262-022-01180-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/27/2022] [Indexed: 11/13/2022]
Abstract
BACKGROUND AND OBJECTIVE In clinical practice, injectable drug combination (IDC) usually provides good therapeutic effects for patients. Numerous clinical studies have directly indicated that inappropriate IDC generates adverse drug events (ADEs). The clinical application of injections is increasing, and many injections lack relevant combination information. It is still a significant need for experienced clinical pharmacists to participate in evidence-based drug decision making, monitor medication safety, and manage drug interactions. Meanwhile, a large number of injection pairs and dosage combinations limit exhaustive screening. Here, we present a prediction framework, called DeepIDC, that can expediently screen the feasibility of IDCs using heterogeneous information with deep learning. This is the first specific prediction framework to identify IDCs. METHODS Since the interaction between the injected drugs may occur in the direct physical and chemical reactions at the time of mixing or may be the indirect interaction of their drug targets and pathways, we used molecular fingerprints, drug-target associations, and drug-pathway associations to convert injections into a string of digital vectors. Then, based on these injection vectors, we combined a bidirectional long short-term memory and a feed-forward neural network to build a prediction model for accurate and instructive prediction of IDC. RESULTS In three realistic evaluation scenarios, DeepIDC has achieved ideal prediction results. Furthermore, compared with the other five machine-learning methods, the proposed predictor is more efficient and robust. Among the top 30 potential IDCs of each IDC class predicted by DeepIDC, we found that 9 cases were experimentally verified in the literature or available on Drug.com. CONCLUSION The information we extracted in vivo and in vitro can effectively characterize injectable drugs. DeepIDC developed based on deep learning algorithm provides a valuable unified framework for new IDC discovery, which can make up for the lack of IDC information and predict potential IDC events.
Collapse
Affiliation(s)
- Yuhe Yang
- College of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Dong Gao
- College of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Xueqin Xie
- College of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China
| | - Jiaan Qin
- Beijing Friendship Hospital, Capital Medical University, Beijing, China.,Beijing Institute of Clinical Pharmacy, Beijing, China
| | - Jian Li
- School of Basic Medical Science, Chengdu University, Chengdu, China
| | - Hao Lin
- College of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
| | - Dan Yan
- Beijing Friendship Hospital, Capital Medical University, Beijing, China. .,Beijing Institute of Clinical Pharmacy, Beijing, China.
| | - Kejun Deng
- College of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.
| |
Collapse
|
4
|
Sedykh AY, Shah RR, Kleinstreuer NC, Auerbach SS, Gombar VK. Saagar-A New, Extensible Set of Molecular Substructures for QSAR/QSPR and Read-Across Predictions. Chem Res Toxicol 2020; 34:634-640. [PMID: 33356152 DOI: 10.1021/acs.chemrestox.0c00464] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Molecular structure-based predictive models provide a proven alternative to costly and inefficient animal testing. However, due to a lack of interpretability of predictive models built with abstract molecular descriptors they have earned the notoriety of being black boxes. Interpretable models require interpretable descriptors to provide chemistry-backed predictive reasoning and facilitate intelligent molecular design. We developed a novel set of extensible chemistry-aware substructures, Saagar, to support interpretable predictive models and read-across protocols. Performance of Saagar in chemical characterization and search for structurally similar actives for read-across applications was compared with four publicly available fingerprint sets (MACCS (166), PubChem (881), ECFP4 (1024), ToxPrint (729)) in three benchmark sets (MUV, ULS, and Tox21) spanning ∼145 000 compounds and 78 molecular targets at 1%, 2%, 5%, and 10% false discovery rates. In 18 of the 20 comparisons, interpretable Saagar features performed better than the publicly available, but less interpretable and fixed-bit length, fingerprints. Examples are provided to show the enhanced capability of Saagar in extracting compounds with higher scaffold similarity. Saagar features are interpretable and efficiently characterize diverse chemical collections, thus making them a better choice for building interpretable predictive in silico models and read-across protocols.
Collapse
Affiliation(s)
| | - Ruchir R Shah
- Sciome LLC, Research Triangle Park, North Carolina 27709, United States
| | - Nicole C Kleinstreuer
- National Institute of Environmental Health Sciences (NIEHS), National Toxicology Program (NTP), Research Triangle Park, North Carolina 27709, United States
| | - Scott S Auerbach
- National Institute of Environmental Health Sciences (NIEHS), National Toxicology Program (NTP), Research Triangle Park, North Carolina 27709, United States
| | - Vijay K Gombar
- Sciome LLC, Research Triangle Park, North Carolina 27709, United States
| |
Collapse
|
5
|
Consensus-Based Pharmacophore Mapping for New Set of N-(disubstituted-phenyl)-3-hydroxyl-naphthalene-2-carboxamides. Int J Mol Sci 2020; 21:ijms21186583. [PMID: 32916824 PMCID: PMC7555178 DOI: 10.3390/ijms21186583] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Revised: 09/01/2020] [Accepted: 09/07/2020] [Indexed: 02/07/2023] Open
Abstract
A series of twenty-two novel N-(disubstituted-phenyl)-3-hydroxynaphthalene- 2-carboxamide derivatives was synthesized and characterized as potential antimicrobial agents. N-[3,5-bis(trifluoromethyl)phenyl]- and N-[2-chloro-5-(trifluoromethyl)phenyl]-3-hydroxy- naphthalene-2-carboxamide showed submicromolar (MICs 0.16–0.68 µM) activity against methicillin-resistant Staphylococcus aureus isolates. N-[3,5-bis(trifluoromethyl)phenyl]- and N-[4-bromo-3-(trifluoromethyl)phenyl]-3-hydroxynaphthalene-2-carboxamide revealed activity against M. tuberculosis (both MICs 10 µM) comparable with that of rifampicin. Synergistic activity was observed for the combinations of ciprofloxacin with N-[4-bromo-3-(trifluoromethyl)phenyl]- and N-(4-bromo-3-fluorophenyl)-3-hydroxynaphthalene-2-carboxamides against MRSA SA 630 isolate. The similarity-related property space assessment for the congeneric series of structurally related carboxamide derivatives was performed using the principal component analysis. Interestingly, different distribution of mono-halogenated carboxamide derivatives with the –CF3 substituent is accompanied by the increased activity profile. A symmetric matrix of Tanimoto coefficients indicated the structural dissimilarities of dichloro- and dimetoxy-substituted isomers from the remaining ones. Moreover, the quantitative sampling of similarity-related activity landscape provided a subtle picture of favorable and disallowed structural modifications that are valid for determining activity cliffs. Finally, the advanced method of neural network quantitative SAR was engaged to illustrate the key 3D steric/electronic/lipophilic features of the ligand-site composition by the systematic probing of the functional group.
Collapse
|
6
|
Morris EJ, Kawamura E, Gillespie JA, Balgi A, Kannan N, Muller WJ, Roberge M, Dedhar S. Stat3 regulates centrosome clustering in cancer cells via Stathmin/PLK1. Nat Commun 2017; 8:15289. [PMID: 28474672 PMCID: PMC5424153 DOI: 10.1038/ncomms15289] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2016] [Accepted: 03/14/2017] [Indexed: 12/17/2022] Open
Abstract
Cancer cells frequently have amplified centrosomes that must be clustered together to form a bipolar mitotic spindle, and targeting centrosome clustering is considered a promising therapeutic strategy. A high-content chemical screen for inhibitors of centrosome clustering identified Stattic, a Stat3 inhibitor. Stat3 depletion and inhibition in cancer cell lines and in tumours in vivo caused significant inhibition of centrosome clustering and viability. Here we describe a transcription-independent mechanism for Stat3-mediated centrosome clustering that involves Stathmin, a Stat3 interactor involved in microtubule depolymerization, and the mitotic kinase PLK1. Furthermore, PLK4-driven centrosome amplified breast tumour cells are highly sensitive to Stat3 inhibitors. We have identified an unexpected role of Stat3 in the regulation of centrosome clustering, and this role of Stat3 may be critical in identifying tumours that are sensitive to Stat3 inhibitors.
Collapse
Affiliation(s)
- Edward J. Morris
- Department of Integrative Oncology, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3
| | - Eiko Kawamura
- Department of Integrative Oncology, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3
| | - Jordan A. Gillespie
- Department of Integrative Oncology, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3
| | - Aruna Balgi
- Department of Biochemistry and Molecular Biology, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada V6E 4A2
| | - Nagarajan Kannan
- Terry Fox Laboratory, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3
| | - William J. Muller
- Department of Biochemistry, Rosalind and Morris Goodman Cancer Centre, McGill University, Montreal, Quebec, Canada H3A 1A3
| | - Michel Roberge
- Department of Biochemistry and Molecular Biology, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada V6E 4A2
| | - Shoukat Dedhar
- Department of Integrative Oncology, BC Cancer Research Centre, BC Cancer Agency, Vancouver, British Columbia, Canada V5Z 1L3
- Department of Biochemistry and Molecular Biology, Life Sciences Institute, University of British Columbia, Vancouver, British Columbia, Canada V6E 4A2
| |
Collapse
|
7
|
Singh PK, Negi A, Gupta PK, Chauhan M, Kumar R. Toxicophore exploration as a screening technology for drug design and discovery: techniques, scope and limitations. Arch Toxicol 2015; 90:1785-802. [PMID: 26341667 DOI: 10.1007/s00204-015-1587-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 08/13/2015] [Indexed: 01/11/2023]
Abstract
Toxicity is a common drawback of newly designed chemotherapeutic agents. With the exception of pharmacophore-induced toxicity (lack of selectivity at higher concentrations of a drug), the toxicity due to chemotherapeutic agents is based on the toxicophore moiety present in the drug. To date, methodologies implemented to determine toxicophores may be broadly classified into biological, bioanalytical and computational approaches. The biological approach involves analysis of bioactivated metabolites, whereas the computational approach involves a QSAR-based method, mapping techniques, an inverse docking technique and a few toxicophore identification/estimation tools. Being one of the major steps in drug discovery process, toxicophore identification has proven to be an essential screening step in drug design and development. The paper is first of its kind, attempting to cover and compare different methodologies employed in predicting and determining toxicophores with an emphasis on their scope and limitations. Such information may prove vital in the appropriate selection of methodology and can be used as screening technology by researchers to discover the toxicophoric potentials of their designed and synthesized moieties. Additionally, it can be utilized in the manipulation of molecules containing toxicophores in such a manner that their toxicities might be eliminated or removed.
Collapse
Affiliation(s)
- Pankaj Kumar Singh
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Arvind Negi
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Pawan Kumar Gupta
- Centre for Computational Sciences, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Monika Chauhan
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India
| | - Raj Kumar
- Laboratory for Drug Design and Synthesis, Centre for Pharmaceutical Sciences and Natural Products, School of Basic and Applied Sciences, Central University of Punjab, Bathinda, 151 001, India.
| |
Collapse
|
8
|
Kang H, Tang K, Liu Q, Sun Y, Huang Q, Zhu R, Gao J, Zhang D, Huang C, Cao Z. HIM-herbal ingredients in-vivo metabolism database. J Cheminform 2013; 5:28. [PMID: 23721660 PMCID: PMC3679852 DOI: 10.1186/1758-2946-5-28] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 05/28/2013] [Indexed: 12/20/2022] Open
Abstract
Background Herbal medicine has long been viewed as a valuable asset for potential new drug discovery and herbal ingredients’ metabolites, especially the in vivo metabolites were often found to gain better pharmacological, pharmacokinetic and even better safety profiles compared to their parent compounds. However, these herbal metabolite information is still scattered and waiting to be collected. Description HIM database manually collected so far the most comprehensive available in-vivo metabolism information for herbal active ingredients, as well as their corresponding bioactivity, organs and/or tissues distribution, toxicity, ADME and the clinical research profile. Currently HIM contains 361 ingredients and 1104 corresponding in-vivo metabolites from 673 reputable herbs. Tools of structural similarity, substructure search and Lipinski’s Rule of Five are also provided. Various links were made to PubChem, PubMed, TCM-ID (Traditional Chinese Medicine Information database) and HIT (Herbal ingredients’ targets databases). Conclusions A curated database HIM is set up for the in vivo metabolites information of the active ingredients for Chinese herbs, together with their corresponding bioactivity, toxicity and ADME profile. HIM is freely accessible to academic researchers at http://www.bioinformatics.org.cn/.
Collapse
Affiliation(s)
- Hong Kang
- School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Bentzien J, Muegge I, Hamner B, Thompson DC. Crowd computing: using competitive dynamics to develop and refine highly predictive models. Drug Discov Today 2013; 18:472-8. [PMID: 23337388 DOI: 10.1016/j.drudis.2013.01.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2012] [Revised: 12/20/2012] [Accepted: 01/03/2013] [Indexed: 12/16/2022]
Abstract
A recent application of a crowd computing platform to develop highly predictive in silico models for use in the drug discovery process is described. The platform, Kaggle™, exploits a competitive dynamic that results in model optimization as the competition unfolds. Here, this dynamic is described in detail and compared with more-conventional modeling strategies. The complete and full structure of the underlying dataset is disclosed and some thoughts as to the broader utility of such 'gamification' approaches to the field of modeling are offered.
Collapse
Affiliation(s)
- Jörg Bentzien
- Boehringer Ingelheim Pharmaceuticals, 900 Ridgebury Road, Ridgefield, CT 06877, USA
| | | | | | | |
Collapse
|
10
|
Hu Y, Bajorath J. Target Family-Directed Exploration of Scaffolds with Different SAR Profiles. J Chem Inf Model 2011; 51:3138-48. [DOI: 10.1021/ci200461w] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Ye Hu
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| |
Collapse
|
11
|
Hack MD, Rassokhin DN, Buyck C, Seierstad M, Skalkin A, ten Holte P, Jones TK, Mirzadegan T, Agrafiotis DK. Library Enhancement through the Wisdom of Crowds. J Chem Inf Model 2011; 51:3275-86. [DOI: 10.1021/ci200446y] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Michael D. Hack
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., 3210 Merryfield Row, San Diego, California 92121, United States
| | - Dmitrii N. Rassokhin
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Welsh & McKean Roads, Spring House, Pennsylvania 19477, United States
| | - Christophe Buyck
- Janssen Research & Development, Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Mark Seierstad
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., 3210 Merryfield Row, San Diego, California 92121, United States
| | - Andrew Skalkin
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Welsh & McKean Roads, Spring House, Pennsylvania 19477, United States
| | - Peter ten Holte
- Janssen Research & Development, Division of Janssen Pharmaceutica NV, Turnhoutseweg 30, B-2340 Beerse, Belgium
| | - Todd K. Jones
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., 3210 Merryfield Row, San Diego, California 92121, United States
- Todd Jones Consulting, San Diego, California
| | - Taraneh Mirzadegan
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., 3210 Merryfield Row, San Diego, California 92121, United States
| | - Dimitris K. Agrafiotis
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Welsh & McKean Roads, Spring House, Pennsylvania 19477, United States
| |
Collapse
|
12
|
Hu Y, Stumpfe D, Bajorath J. Lessons Learned from Molecular Scaffold Analysis. J Chem Inf Model 2011; 51:1742-53. [DOI: 10.1021/ci200179y] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Ye Hu
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| | - Dagmar Stumpfe
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| |
Collapse
|
13
|
Schnur DM, Beno BR, Tebben AJ, Cavallaro C. Methods for combinatorial and parallel library design. Methods Mol Biol 2011; 672:387-434. [PMID: 20838978 DOI: 10.1007/978-1-60761-839-3_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Diversity has historically played a critical role in design of combinatorial libraries, screening sets and corporate collections for lead discovery. Large library design dominated the field in the 1990s with methods ranging anywhere from purely arbitrary through property based reagent selection to product based approaches. In recent years, however, there has been a downward trend in library size. This was due to increased information about the desirable targets gleaned from the genomics revolution and to the ever growing availability of target protein structures from crystallography and homology modeling. Creation of libraries directed toward families of receptors such as GPCRs, kinases, nuclear hormone receptors, proteases, etc., replaced the generation of libraries based primarily on diversity while single target focused library design has remained an important objective. Concurrently, computing grids and cpu clusters have facilitated the development of structure based tools that screen hundreds of thousands of molecules. Smaller "smarter" combinatorial and focused parallel libraries replaced those early un-focused large libraries in the twenty-first century drug design paradigm. While diversity still plays a role in lead discovery, the focus of current library design methods has shifted to receptor based methods, scaffold hopping/bio-isostere searching, and a much needed emphasis on synthetic feasibility. Methods such as "privileged substructures based design" and pharmacophore based design still are important methods for parallel and small combinatorial library design. This chapter discusses some of the possible design methods and presents examples where they are available.
Collapse
Affiliation(s)
- Dora M Schnur
- Computer Aided Drug Design, Pharmaceutical Research Institute, Bristol-Myers Squibb Company, Princeton, NJ, USA
| | | | | | | |
Collapse
|
14
|
Hu Y, Bajorath J. Structural and Potency Relationships between Scaffolds of Compounds Active against Human Targets. ChemMedChem 2010; 5:1681-5. [DOI: 10.1002/cmdc.201000272] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
15
|
Hu Y, Bajorath J. Molecular Scaffolds with High Propensity to Form Multi-Target Activity Cliffs. J Chem Inf Model 2010; 50:500-10. [DOI: 10.1021/ci100059q] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Ye Hu
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| |
Collapse
|
16
|
Shapiro AB, Walkup GK, Keating TA. Correction for Interference by Test Samples in High-Throughput Assays. ACTA ACUST UNITED AC 2009; 14:1008-16. [DOI: 10.1177/1087057109341768] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In high-throughput biochemical assays performed in multiwell plates, the effect of test samples on the activity of the biochemical system is usually measured by optical means such as absorbance, fluorescence, luminescence, or scintillation counting. The test sample often causes detection interference when it remains in the well during the measurement. Interference may be due to light absorption, fluorescence quenching, sample fluorescence, chemical interaction of the sample with a detection reagent, or depression of the meniscus. A simple method is described that corrects for such interference well by well. The interference is measured in a separate artifact assay plate. An appropriate arithmetic correction is then applied to the measurement in the corresponding well of the activity assay plate. The correction procedure can be used for single-point screening or potency measurements on serial dilutions of test samples.
Collapse
|
17
|
Krüger F, Lounkine E, Bajorath J. Fragment Formal Concept Analysis Accurately Classifies Compounds with Closely Related Biological Activities. ChemMedChem 2009; 4:1174-81. [DOI: 10.1002/cmdc.200900035] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
18
|
Hu Y, Lounkine E, Bajorath J. Improving the Search Performance of Extended Connectivity Fingerprints through Activity-Oriented Feature Filtering and Application of a Bit-Density-Dependent Similarity Function. ChemMedChem 2009; 4:540-8. [DOI: 10.1002/cmdc.200800408] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
19
|
Rodríguez-Rodríguez C, Sánchez de Groot N, Rimola A, Alvarez-Larena A, Lloveras V, Vidal-Gancedo J, Ventura S, Vendrell J, Sodupe M, González-Duarte P. Design, selection, and characterization of thioflavin-based intercalation compounds with metal chelating properties for application in Alzheimer's disease. J Am Chem Soc 2009; 131:1436-51. [PMID: 19133767 DOI: 10.1021/ja806062g] [Citation(s) in RCA: 174] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Metal chelation is considered a rational therapeutic approach for interdicting Alzheimer's amyloid pathogenesis. At present, enhancing the targeting and efficacy of metal-ion chelating agents through ligand design is a main strategy in the development of the next generation of metal chelators. Inspired by the traditional dye Thioflavin-T, we have designed new multifunctional molecules that contain both amyloid binding and metal chelating properties. In silico techniques have enabled us to identify commercial compounds that enclose the designed molecular framework (M1), include potential antioxidant properties, facilitate the formation of iodine-labeled derivatives, and can be permeable through the blood-brain barrier. Iodination reactions of the selected compounds, 2-(2-hydroxyphenyl)benzoxazole (HBX), 2-(2-hydroxyphenyl)benzothiazole (HBT), and 2-(2-aminophenyl)-1H-benzimidazole (BM), have led to the corresponding iodinated derivatives HBXI, HBTI, and BMI, which have been characterized by X-ray diffraction. The chelating properties of the latter compounds toward Cu(II) and Zn(II) have been examined in the solid phase and in solution. The acidity constants of HBXI, HBTI, and BMI and the formation constants of the corresponding ML and ML2 complexes [M = Cu(II), Zn(II)] have been determined by UV-vis pH titrations. The calculated values for the overall formation constants for the ML2 complexes indicate the suitability of the HBXI, HBTI, and BMI ligands for sequestering Cu(II) and Zn(II) metal ions present in freshly prepared solutions of beta-amyloid (Abeta) peptide. This was confirmed by Abeta aggregation studies showing that these compounds are able to arrest the metal-promoted increase in amyloid fibril buildup. The fluorescence features of HBX, HBT, BM, and the corresponding iodinated derivatives, together with fluorescence microscopy studies on two types of pregrown fibrils, have shown that HBX and HBT compounds could behave as potential markers for the presence of amyloid fibrils, whereas HBXI and HBTI may be especially suitable for radioisotopic detection of Abeta deposits. Taken together, the results reported in this work show the potential of new multifunctional thioflavin-based chelating agents as Alzheimer's disease therapeutics.
Collapse
|
20
|
Gianti E, Sartori L. Identification and selection of "privileged fragments" suitable for primary screening. J Chem Inf Model 2009; 48:2129-39. [PMID: 18991373 DOI: 10.1021/ci800219h] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The use of small molecule libraries for fragment-based primary screening (FBS) is a well-known approach to identify protein binders in the low affinity range. However, the search, analysis, and selection of suitable screening fragments can be a lengthy process, because of the large number of compounds that must be analyzed for different levels of ring/substituents identification and submitted to selection/exclusion criteria based on their physicochemical properties. The purpose of the present work is to propose a strategy to identify substructures from databases of known drugs, which can be used as templates for the generation of libraries of "privileged fragments" that are able to provide high-quality hits. The entire process has been developed integrating Pipeline Pilot (Accelrys Inc., San Diego, CA; http://www.accelrys.com ) native components and user-defined molecular files containing ISIS-like substructure query features (Symyx, San Ramon, CA; http://www.symyx.com ). The method is effortless, easy to put in place, and fast enough to be iteratively applied to different sources of druglike compounds.
Collapse
Affiliation(s)
- Eleonora Gianti
- Computational Sciences Group, Department of Chemistry (Congenia s.r.l.), Genextra S.p.A., Milan MI 20100, Italy
| | | |
Collapse
|
21
|
Arévalo MJ, Kielland N, Masdeu C, Miguel M, Isambert N, Lavilla R. Multicomponent Access to Functionalized Mesoionic Structures Based on TFAA Activation of Isocyanides: Novel Domino Reactions. European J Org Chem 2009. [DOI: 10.1002/ejoc.200801084] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
22
|
Lounkine E, Bajorath J. Topological Fragment Index for the Analysis of Molecular Substructures and Their Topological Environment in Active Compounds. J Chem Inf Model 2008; 49:162-8. [DOI: 10.1021/ci8002599] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Eugen Lounkine
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology & Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrassse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology & Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrassse 2, D-53113 Bonn, Germany
| |
Collapse
|
23
|
Hu Y, Lounkine E, Batista J, Bajorath J. RelACCS-FP: A Structural Minimalist Approach to Fingerprint Design. Chem Biol Drug Des 2008; 72:341-9. [DOI: 10.1111/j.1747-0285.2008.00723.x] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
24
|
Crisman TJ, Sisay MT, Bajorath J. Ligand-Target Interaction-Based Weighting of Substructures for Virtual Screening. J Chem Inf Model 2008; 48:1955-64. [DOI: 10.1021/ci800229q] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Thomas J. Crisman
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany, and Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich-Wilhelms-Universität, An der Immenburg 4, D-53121 Bonn, Germany
| | - Mihiret T. Sisay
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany, and Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich-Wilhelms-Universität, An der Immenburg 4, D-53121 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany, and Pharmazeutisches Institut, Pharmazeutische Chemie I, Rheinische Friedrich-Wilhelms-Universität, An der Immenburg 4, D-53121 Bonn, Germany
| |
Collapse
|
25
|
Isambert N, Lavilla R. Heterocycles as Key Substrates in Multicomponent Reactions: The Fast Lane towards Molecular Complexity. Chemistry 2008; 14:8444-54. [DOI: 10.1002/chem.200800473] [Citation(s) in RCA: 289] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
26
|
Distribution of randomly generated activity class characteristic substructures in diverse active and database compounds. Mol Divers 2008; 12:77-83. [DOI: 10.1007/s11030-008-9078-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Accepted: 05/02/2008] [Indexed: 10/22/2022]
|
27
|
Lounkine E, Bajorath J. Core Trees and Consensus Fragment Sequences for Molecular Representation and Similarity Analysis. J Chem Inf Model 2008; 48:1161-6. [DOI: 10.1021/ci800020s] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Eugen Lounkine
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology & Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology & Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| |
Collapse
|
28
|
Batista J, Bajorath J. Similarity Searching using Compound Class-Specific Combinations of Substructures Found in Randomly Generated Molecular Fragment Populations. ChemMedChem 2008; 3:67-73. [DOI: 10.1002/cmdc.200700199] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
29
|
Lounkine E, Batista J, Bajorath J. Mapping of Activity-Specific Fragment Pathways Isolated from Random Fragment Populations Reveals the Formation of Coherent Molecular Cores. J Chem Inf Model 2007; 47:2133-9. [DOI: 10.1021/ci700251b] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Eugen Lounkine
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - José Batista
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| |
Collapse
|
30
|
Leitão A, Andricopulo AD, Montanari CA. In silico screening of HIV-1 non-nucleoside reverse transcriptase and protease inhibitors. Eur J Med Chem 2007; 43:1412-22. [PMID: 17954002 DOI: 10.1016/j.ejmech.2007.08.017] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2007] [Revised: 06/24/2007] [Accepted: 08/09/2007] [Indexed: 11/29/2022]
Abstract
Two targets, reverse transcriptase (RT) and protease from HIV-1, were used during the past two decades to the discovery of non-nucleoside reverse transcriptase inhibitors (NNRTI) and protease inhibitors (PI) that belong to the arsenal of the antiretroviral therapy. Herein these enzymes were chosen as templates for conducting a computer-aided ligand design. Ligand and structure-based drug designs were the starting points to select compounds from a database bearing more than five million compounds by means of cheminformatic tools. New promising lead structures are retrieved from the database, which are open to acquisition and test. Classes of molecules already described as NNRTI or PI in the literature also came out and were useful to prove the reliability of the workflow, and thus validating the work carried out so far.
Collapse
Affiliation(s)
- Andrei Leitão
- Núcleo de Estudos em Química Medicinal, NEQUIM, Universidade Federal de Minas Gerais, Campus Pampulha, Belo Horizonte, MG, Brazil
| | | | | |
Collapse
|
31
|
Talevi A, Prieto JJ, Bruno-Blanch LE, Castro EA. New similarity-based algorithm and its application to classification of anticonvulsant compounds. J Enzyme Inhib Med Chem 2007; 22:253-65. [PMID: 17674806 DOI: 10.1080/14756360701190170] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
A similarity-based algorithm based on a previously developed model is applied in the classification of two sets of anticonvulsant and non-anticonvulsant drugs. Each set is composed of a) anticonvulsant compounds that have shown moderate to high activity in the Maximal Electroshock Seizure (MES) test and b) drugs with other biological activities or poor activity in the MES test. The results from the analysis of variance (ANOVA) indicate that the proposed algorithm is able to differentiate anticonvulsant from non-anticonvulsant drugs. The proposed model may then be useful in the identification of new anticonvulsant agents through virtual screening of large virtual libraries of chemical structures.
Collapse
Affiliation(s)
- Alan Talevi
- Medicinal Chemistry, Department of Biological Sciences, Faculty of Exact Sciences, Universidad Nacional de La Plata (UNLP), B1900AVV, La Plata, Buenos Aires, Argentina.
| | | | | | | |
Collapse
|
32
|
Batista J, Bajorath J. Mining of Randomly Generated Molecular Fragment Populations Uncovers Activity-Specific Fragment Hierarchies. J Chem Inf Model 2007; 47:1405-13. [PMID: 17585755 DOI: 10.1021/ci700108q] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We introduce a methodology to analyze random molecular fragment populations and determine conditional probability relationships between fragments. Random fragment profiles are generated for an arbitrary set of molecules, and each observed fragment is assigned a frequency vector. An algorithm is designed to compare frequency vectors and derive dependencies of fragment occurrence. Using calculated dependency values, random fragment populations can be organized in graphs that capture their relationships and make it possible to map fragment pathways of biologically active molecules. For sets of molecules having similar activity, unique fragment signatures are identified. The analysis reveals that random fragment profiles contain compound class-specific information and provides evidence for the existence of activity-specific fragment hierarchies.
Collapse
Affiliation(s)
- José Batista
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | | |
Collapse
|
33
|
Ekins S, Mestres J, Testa B. In silico pharmacology for drug discovery: methods for virtual ligand screening and profiling. Br J Pharmacol 2007; 152:9-20. [PMID: 17549047 PMCID: PMC1978274 DOI: 10.1038/sj.bjp.0707305] [Citation(s) in RCA: 399] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Pharmacology over the past 100 years has had a rich tradition of scientists with the ability to form qualitative or semi-quantitative relations between molecular structure and activity in cerebro. To test these hypotheses they have consistently used traditional pharmacology tools such as in vivo and in vitro models. Increasingly over the last decade however we have seen that computational (in silico) methods have been developed and applied to pharmacology hypothesis development and testing. These in silico methods include databases, quantitative structure-activity relationships, pharmacophores, homology models and other molecular modeling approaches, machine learning, data mining, network analysis tools and data analysis tools that use a computer. In silico methods are primarily used alongside the generation of in vitro data both to create the model and to test it. Such models have seen frequent use in the discovery and optimization of novel molecules with affinity to a target, the clarification of absorption, distribution, metabolism, excretion and toxicity properties as well as physicochemical characterization. The aim of this review is to illustrate some of the in silico methods for pharmacology that are used in drug discovery. Further applications of these methods to specific targets and their limitations will be discussed in the second accompanying part of this review.
Collapse
Affiliation(s)
- S Ekins
- ACT LLC, 1 Penn Plaza, New York, NY 10119, USA.
| | | | | |
Collapse
|
34
|
Nagamine N, Sakakibara Y. Statistical prediction of protein chemical interactions based on chemical structure and mass spectrometry data. ACTA ACUST UNITED AC 2007; 23:2004-12. [PMID: 17510168 DOI: 10.1093/bioinformatics/btm266] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
MOTIVATION Prediction of interactions between proteins and chemical compounds is of great benefit in drug discovery processes. In this field, 3D structure-based methods such as docking analysis have been developed. However, the genomewide application of these methods is not really feasible as 3D structural information is limited in availability. RESULTS We describe a novel method for predicting protein-chemical interaction using SVM. We utilize very general protein data, i.e. amino acid sequences, and combine these with chemical structures and mass spectrometry (MS) data. MS data can be of great use in finding new chemical compounds in the future. We assessed the validity of our method in the dataset of the binding of existing drugs and found that more than 80% accuracy could be obtained. Furthermore, we conducted comprehensive target protein predictions for MDMA, and validated the biological significance of our method by successfully finding proteins relevant to its known functions. AVAILABILITY Available on request from the authors.
Collapse
Affiliation(s)
- Nobuyoshi Nagamine
- Department of Biosciences and Informatics, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Japan
| | | |
Collapse
|
35
|
Engels MFM, Gibbs AC, Jaeger EP, Verbinnen D, Lobanov VS, Agrafiotis DK. A Cluster-Based Strategy for Assessing the Overlap between Large Chemical Libraries and Its Application to a Recent Acquisition. J Chem Inf Model 2006; 46:2651-60. [PMID: 17125205 DOI: 10.1021/ci600219n] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report on the structural comparison of the corporate collections of Johnson & Johnson Pharmaceutical Research & Development (JNJPRD) and 3-Dimensional Pharmaceuticals (3DP), performed in the context of the recent acquisition of 3DP by JNJPRD. The main objective of the study was to assess the druglikeness of the 3DP library and the extent to which it enriched the chemical diversity of the JNJPRD corporate collection. The two databases, at the time of acquisition, collectively contained more than 1.1 million compounds with a clearly defined structural description. The analysis was based on a clustering approach and aimed at providing an intuitive quantitative estimate and visual representation of this enrichment. A novel hierarchical clustering algorithm called divisive k-means was employed in combination with Kelley's cluster-level selection method to partition the combined data set into clusters, and the diversity contribution of each library was evaluated as a function of the relative occupancy of these clusters. Typical 3DP chemotypes enriching the diversity of the JNJPRD collection were catalogued and visualized using a modified maximum common substructure algorithm. The joint collection of JNJPRD and 3DP compounds was also compared to other databases of known medicinally active or druglike compounds. The potential of the methodology for the analysis of very large chemical databases is discussed.
Collapse
Affiliation(s)
- Michael F M Engels
- Johnson and Johnson Pharmaceutical Research and Development, Division of Janssen Pharmaceutica, Turnhoutsweg 30, 2340 Beerse, Belgium.
| | | | | | | | | | | |
Collapse
|
36
|
Liao Q, Yao J, Yuan S. SVM approach for predicting LogP. Mol Divers 2006; 10:301-9. [PMID: 17031534 DOI: 10.1007/s11030-006-9036-2] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2005] [Accepted: 10/27/2005] [Indexed: 01/04/2023]
Abstract
The logarithm of the partition coefficient between n-octanol and water (logP) is an important parameter for drug discovery. Based upon the comparison of several prediction logP models, i.e. Support Vector Machines (SVM), Partial Least Squares (PLS) and Multiple Linear Regression (MLR), the authors reported SVM model is the best one in this paper.
Collapse
Affiliation(s)
- Quan Liao
- Department of Computer Chemistry and Chemoinformatics, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, Shanghai, China
| | | | | |
Collapse
|
37
|
Batista J, Godden JW, Bajorath J. Assessment of Molecular Similarity from the Analysis of Randomly Generated Structural Fragment Populations. J Chem Inf Model 2006; 46:1937-44. [PMID: 16995724 DOI: 10.1021/ci0601261] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A novel method termed MolBlaster is introduced for the evaluation of molecular similarity relationships on the basis of randomly generated fragment populations. Our motivation has been to develop a similarity method that does not depend on the use of predefined structural or property descriptors. Fragment profiles of molecules are generated by random deletion of bonds in connectivity tables and quantitatively compared using entropy-based metrics. In test calculations, MolBlaster accurately reproduced a structural key-based similarity ranking of druglike molecules.
Collapse
Affiliation(s)
- José Batista
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstrasse 2, D-53113 Bonn, Germany
| | | | | |
Collapse
|
38
|
Marsden BD, Sundstrom M, Knapp S. High-throughput structural characterisation of therapeutic protein targets. Expert Opin Drug Discov 2006; 1:123-36. [DOI: 10.1517/17460441.1.2.123] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
39
|
Godden JW, Stahura FL, Bajorath J. Anatomy of fingerprint search calculations on structurally diverse sets of active compounds. J Chem Inf Model 2006; 45:1812-9. [PMID: 16309288 DOI: 10.1021/ci050276w] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Similarity searching using molecular fingerprints is a widely used approach for the identification of novel hits. A fingerprint search involves many pairwise comparisons of bit string representations of known active molecules with those precomputed for database compounds. Bit string overlap, as evaluated by various similarity metrics, is used as a measure of molecular similarity. Results of a number of studies focusing on fingerprints suggest that it is difficult, if not impossible, to develop generally applicable search parameters and strategies, irrespective of the compound classes under investigation. Rather, more or less, each individual search problem requires an adjustment of calculation conditions. Thus, there is a need for diagnostic tools to analyze fingerprint-based similarity searching. We report an analysis of fingerprint search calculations on different sets of structurally diverse active compounds. Calculations on five biological activity classes were carried out with two fingerprints in two compound source databases, and the results were analyzed in histograms. Tanimoto coefficient (Tc) value ranges where active compounds were detected were compared to the distribution of Tc values in the database. The analysis revealed that compound class-specific effects strongly influenced the outcome of these fingerprint calculations. Among the five diverse compound sets studied, very different search results were obtained. The analysis described here can be applied to determine Tc intervals where scaffold hopping occurs. It can also be used to benchmark fingerprint calculations or estimate their probability of success.
Collapse
Affiliation(s)
- Jeffrey W Godden
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Görresstrasse 13, D-53113 Bonn, Germany
| | | | | |
Collapse
|
40
|
Eckert H, Vogt I, Bajorath J. Mapping Algorithms for Molecular Similarity Analysis and Ligand-Based Virtual Screening: Design of DynaMAD and Comparison with MAD and DMC. J Chem Inf Model 2006; 46:1623-34. [PMID: 16859294 DOI: 10.1021/ci060083o] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Here, we introduce the DynaMAD algorithm that is designed to map database compounds to combinations of activity-class-dependent descriptor value ranges in order to identify novel active molecules. The method combines and extends key features of two previously developed algorithms, MAD and DMC. These methods were first described as compound-mapping algorithms for large-scale virtual screening applications. DynaMAD and DMC operate in chemical spaces of stepwise increasing dimensionality. However, in contrast to DMC, which utilizes binary transformed descriptors, DynaMAD uses unmodified descriptor value distributions. The performance of these mapping methods was compared in detail in virtual screening trials on 24 different compound activity classes against a background of about 2 million database compounds. In these calculations, all three approaches produced results of considerable predictive value, and the enrichment of active molecules in small selection sets consisting of only about 20 or fewer database compounds emerged as a common feature. Furthermore, mapping methods were capable of recognizing remote molecular similarity relationships. Overall, DynaMAD performed better than MAD and DMC, producing average hit and recovery rates of 55% and 33%, respectively, over all 24 classes. Taken together, our findings suggest that dynamic compound mapping to combinations of activity-class-selective descriptor settings has significant potential for molecular similarity analysis and ligand-based virtual screening.
Collapse
Affiliation(s)
- Hanna Eckert
- Department of Life Science Informatics, B-IT, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, D-53113 Bonn, Germany
| | | | | |
Collapse
|
41
|
Perrin D, Frémaux C, Scheer A. Assay Development and Screening of a Serine/Threonine Kinase in an On-Chip Mode Using Caliper Nanofluidics Technology. ACTA ACUST UNITED AC 2006; 11:359-68. [PMID: 16751332 DOI: 10.1177/1087057106286653] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Kinases are key targets for drug discovery. In the field of screening in general and especially in the kinase area, because of considerations of efficiency and cost, radioactivity-based assays tend to be replaced by alternative, mostly fluorescence-based, assays. Today, the limiting factor is rarely the number of data points that can be obtained but rather the quality of the data, enzyme availability, and cost. In this article, the authors describe the development of an assay for a kinase screen based on the electrophoretic separation of fluorescent product and substrate using a Caliper-based nanofluidics environment in on-chip incubation mode. The authors present the results of screening a focused set of 32,000 compounds together with confirmation data obtained in a filtration assay. In addition, they have made a small-scale comparison between the on-chip and off-chip nanofluidics screening modes. In their hands, the screen in on-chip mode is characterized by high precision most likely due to the absence of liquid pipetting; an excellent confirmation rate (62%) in an independent assay format, namely, filtration; and good sensitivity. This study led to the identification of 4 novel chemical series of inhibitors.
Collapse
Affiliation(s)
- Dominique Perrin
- Molecular Screening and Cellular Pharmacology Department, Serono Pharmaceutical Research Institute, Geneva, Switzerland.
| | | | | |
Collapse
|
42
|
Liu J, Yang L. Effect of cholesterol on DMPC phospholipid membranes and QSAR model construction in membrane-interaction QSAR study through molecular dynamics simulation. Bioorg Med Chem 2006; 14:2225-34. [PMID: 16303310 DOI: 10.1016/j.bmc.2005.11.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2005] [Revised: 10/12/2005] [Accepted: 11/02/2005] [Indexed: 11/22/2022]
Abstract
In this study, both pure DMPC and DMPC/cholesterol mixed membrane monolayer were built to compare the physical-chemical properties and dynamics properties through molecular dynamics simulation and normal-mode analysis. The results show that the addition of cholesterol decreases the area of per molecule of membrane, increases the lipid amplitude motion, and changes the solute diffusion coefficient. It is also found that the addition of cholesterol greatly changes the solute-membrane 1,4-nonbonded interaction energy (deltaE14). MI-QSAR models were constructed based on solute-membrane interaction energy descriptors and other intramolecular descriptors. The results show that deltaE14 substitutes deltaE(HB) as the second important descriptor compared with the previous study. Final results suggest that short range solute-membrane interaction energy changes due to the uptake of the solute may play an important decision on permeability in DMPC/cholesterol membrane. A test set was applied to evaluate the predictivity of MI-QSAR models. The result suggests that the combination of F(H2O) and deltaE14 not only improves r2 and q2, but also greatly improves the model predictivity. Based on the combination of q2 and r(pre)2 values, a two-term model is better used to predict the solute permeability in this study (r2 = 0.859, q2 = 0.803, and r(pre)2 = 0.540). Due to the small sample both in training set and test set, more datasets are necessary to make a final decision about the model construction and prediction.
Collapse
Affiliation(s)
- Jianzhong Liu
- Department of Chemistry and Biochemistry, University of Delaware, Newark, Delaware 19716, USA.
| | | |
Collapse
|
43
|
Schnur DM, Hermsmeier MA, Tebben AJ. Are Target-Family-Privileged Substructures Truly Privileged? J Med Chem 2006; 49:2000-9. [PMID: 16539387 DOI: 10.1021/jm0502900] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
One of the early and effective approaches to G-coupled protein receptor target family library design was the analysis of a set of ligands for frequently occurring chemical moieties or substructures. Various methods ranging from frameworks analysis to pharmacophores have been employed to find these so-called target-family-privileged substructures. Although the use of these substructures is common practice in combinatorial library design and has produced leads, the methods used for finding them rarely verified their selectivity for the particular target family from which they were derived. The frequency of occurrence among ligands associated with a target receptor family is not a sufficient criterion for those substructures to receive the label of target-family-privileged substructure. This study explores the question of selectivity of ClassPharmer generated fragments for a series of target families: GPCRs, nuclear hormone receptors, serine proteases, protein kinases, and ligand-gated ion channels. In addition, a GPCR focused library and a random set of 10k compounds are examined in terms of their target-family-privileged-substructure composition. The results challenge the combinatorial chemistry concept of target-family-privileged substructures and suggest that many of these fragments may simply be drug-like or attractive for various receptors in accordance with the original definition of privileged substructures.
Collapse
Affiliation(s)
- Dora M Schnur
- Computer Aided Drug Design and Lead Discovery, Pharmaceutical Research Institute, Bristol-Myers Squibb Company, P.O. Box 5400, Princeton, New Jersey 08543-5400, USA.
| | | | | |
Collapse
|
44
|
Watanabe T, Schulz D, Morisseau C, Hammock BD. High-throughput pharmacokinetic method: cassette dosing in mice associated with minuscule serial bleedings and LC/MS/MS analysis. Anal Chim Acta 2006; 559:37-44. [PMID: 16636700 PMCID: PMC1447531 DOI: 10.1016/j.aca.2005.11.049] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
A method for pharmacokinetic studies using cassette dosing associated with serial bleeding in mice is described. PK profiles of four soluble epoxide hydrolase inhibitors were determined following oral, subcutaneous or intraperitoneal administration individually or in cassette dosing. Parent analyses were performed on only 5 microL of whole blood from serial bleeds (up to 10 per animal), by LC/MS/MS. An accuracy (88-100%) and precision (<10% RSD) were observed, leading to reliable datum points for PK calculation. PK profiles, T(max), C(max) and half-life values after cassette dosing were similar to the individual PK results. This method dramatically increases speed of data collection while dramatically reducing cost and animal usage. The results presented here clearly indicate that this proposed method could be applicable to high-throughput PK studies.
Collapse
Affiliation(s)
- Takaho Watanabe
- Department of Entomology and UCD Cancer Center, University of California, One Shields Avenue, Davis, CA 95616, USA
| | | | | | | |
Collapse
|
45
|
Quattropani A, Dorbais J, Covini D, Pittet PA, Colovray V, Thomas RJ, Coxhead R, Halazy S, Scheer A, Missotten M, Ayala G, Bradshaw C, De Raemy-Schenk AM, Nichols A, Cirillo R, Tos EG, Giachetti C, Golzio L, Marinelli P, Church DJ, Barberis C, Chollet A, Schwarz MK. Discovery and development of a new class of potent, selective, orally active oxytocin receptor antagonists. J Med Chem 2006; 48:7882-905. [PMID: 16302826 DOI: 10.1021/jm050645f] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We report a novel chemical class of potent oxytocin receptor antagonists showing a high degree of selectivity against the closely related vasopressin receptors (V1a, V1b, V2). An initial compound, 7, was shown to be active in an animal model of preterm labor when administered by the intravenous but not by the oral route. Stepwise SAR investigations around the different structural elements revealed one position, the arenesulfonyl moiety, to be amenable to structural changes. Consequently, this position was used to introduce a variety of substituents to improve the physicochemical properties. Some of the resulting analogues were found to be superior to 7 both in terms of potency in vitro and aqueous solubility, which translated into significantly improved efficacy in the animal model after intravenous and oral administration. The best compound, 73, potently inhibited oxytocin-induced uterine contractions in nonpregnant rats and reduced spontaneous uterine contractions in late-term pregnant rats.
Collapse
Affiliation(s)
- Anna Quattropani
- Serono Pharmaceutical Research Institute, Departments of Chemistry and Biochemical Pharmacology, 14 Chemin des Aulx, 1228 Plan-Les-Ouates, Geneva, Switzerland
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Affiliation(s)
- Xavier Barril
- Senior Scientist, Vernalis (R&D), Granta Park, Abington, Cambridge, UK
| | | |
Collapse
|
47
|
Xue L, Stahura FL, Bajorath J. Similarity search profiling reveals effects of fingerprint scaling in virtual screening. ACTA ACUST UNITED AC 2005; 44:2032-9. [PMID: 15554672 DOI: 10.1021/ci0400819] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Fingerprint scaling is a method to increase the performance of similarity search calculations. It is based on the detection of bit patterns in keyed fingerprints that are signatures of specific compound classes. Application of scaling factors to consensus bits that are mostly set on emphasizes signature bit patterns during similarity searching and has been shown to improve search results for different fingerprints. Similarity search profiling has recently been introduced as a method to analyze similarity search calculations. Profiles separately monitor correctly identified hits and other detected database compounds as a function of similarity threshold values and make it possible to estimate whether virtual screening calculations can be successful or to evaluate why they fail. This similarity search profile technique has been applied here to study fingerprint scaling in detail and better understand effects that are responsible for its performance. In particular, we have focused on the qualitative and quantitative analysis of similarity search profiles under scaling conditions. Therefore, we have carried out systematic similarity search calculations for 23 biological activity classes under scaling conditions over a wide range of scaling factors in a compound database containing approximately 1.3 million molecules and monitored these calculations in similarity search profiles. Analysis of these profiles confirmed increases in hit rates as a consequence of scaling and revealed that scaling influences similarity search calculations in different ways. Based on scaled similarity search profiles, compound sets could be divided into different categories. In a number of cases, increases in search performance under scaling conditions were due to a more significant relative increase in correctly identified hits than detected false-positives. This was also consistent with the finding that preferred similarity threshold values increased due to fingerprint scaling, which was well illustrated by similarity search profiling.
Collapse
Affiliation(s)
- Ling Xue
- Department of Computer-Aided Drug Discovery, Albany Molecular Research, Inc., AMRI Bothell Research Center, 18804 North Creek Parkway, Bothell, Washington 98011-8012, USA
| | | | | |
Collapse
|
48
|
Raymond JW, Kibbey CE. An Automated Method for Exploring Targeted Substructural Diversity within Sets of Chemical Structures. J Chem Inf Model 2005; 45:1195-204. [PMID: 16180896 DOI: 10.1021/ci0502247] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Practicing medicinal chemists tend to treat a lead compound as an assemblage of its substructural parts. By iteratively confining their synthetic efforts in a localized fashion, they are able to systematically investigate how minor changes in certain portions of the molecule effect the properties of interest in the logical expectation that the observed beneficial changes will be cumulative. One disadvantage to this approach arises when large amounts of structure data begin to accumulate which is often the case in recent times due to such developments as high-throughput screening, virtual screening, and combinatorial chemistry. How then does one interactively mine this diverse data consistent with the desired substructural template, so those desirable structural features can be discovered and interpreted, especially when they may not occur in the most active compounds due to structural deficiencies in other portions of the molecule? In this paper, we present an algorithm to automate this process that has historically been performed in an ad-hoc and manual fashion. Using the proposed method, significantly larger numbers of compounds can be analyzed in this fashion, potentially discovering useful structural feature combinations that would not have otherwise been detected due to the sheer scale of modern structural and biological data collections.
Collapse
Affiliation(s)
- John W Raymond
- Scientific Computing Group, Pfizer Global Research and Development, Ann Arbor Laboratories, 2800 Plymouth Road, Ann Arbor, Michigan 48105, USA.
| | | |
Collapse
|
49
|
Savchuk NP, Balakin KV, Tkachenko SE. Exploring the chemogenomic knowledge space with annotated chemical libraries. Curr Opin Chem Biol 2005; 8:412-7. [PMID: 15288252 DOI: 10.1016/j.cbpa.2004.06.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
The recent human genome initiatives have led to the discovery of a multitude of genes that are potentially associated with various pathologic conditions and, thus, have opened new horizons in drug discovery. Simultaneously, annotated chemical libraries have emerged as information-rich databases to integrate biological and chemical data. They can be useful for the discovery of new pharmaceutical leads, the validation of new biotargets and the determination of the structural basis of ligand selectivity within target families. Annotated libraries provide a strong information basis for computational design of target-directed combinatorial libraries, which are a key component of modern drug discovery. Today, the rational design of chemical libraries enhanced with chemogenomics data is a new area of progressive research.
Collapse
Affiliation(s)
- Nikolay P Savchuk
- Chemical Diversity Labs, Inc., 11558 Sorrento Valley Road, San Diego, California 92121, USA.
| | | | | |
Collapse
|
50
|
Carranco I, Díaz JL, Jiménez O, Vendrell M, Albericio F, Royo M, Lavilla R. Multicomponent Reactions with Dihydroazines: Efficient Synthesis of a Diverse Set of Pyrido-Fused Tetrahydroquinolines. ACTA ACUST UNITED AC 2004; 7:33-41. [PMID: 15638476 DOI: 10.1021/cc049877a] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A multicomponent assembly of pyrido-fused tetrahydroquinolines is accomplished in a one-pot process from the interaction of dihydroazines, aldehydes, and anilines. A rational screening of the different components and parameters of this reaction, such as the range of reactive starting materials, catalysts and reaction conditions (solvent range; thermal, high pressure- and microwave-promoted processes) is carried out. Optimized conditions allow an efficient preparation of pyrido-fused tetrahydroquinolines with good yields, bypassing the biomimetic NADH-like reductive pathway which is typical in the interaction of dihydropyridines with carbonyl compounds and amines. Furthermore, solid-supported versions of the process have been developed, which should facilitate the preparation of libraries.
Collapse
Affiliation(s)
- Inés Carranco
- Parc Científic de Barcelona, University of Barcelona, Josep Samitier 1-5, 08028 Barcelona, Spain
| | | | | | | | | | | | | |
Collapse
|