1
|
Dolciami D, Ziolek RM, Davies DW, Carter M, Mok NY, Sherhod R. Exploiting Vector Pattern Diversity of Molecular Scaffolds for Cheminformatics Tasks in Drug Discovery. J Chem Inf Model 2024; 64:1966-1974. [PMID: 38437714 DOI: 10.1021/acs.jcim.3c01674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
Chemical diversity is challenging to describe objectively. Despite this, various notions of chemical diversity are used throughout the medicinal chemistry optimization process in drug discovery. In this work, we show the usefulness of considering exploited vectors during different phases of the drug design process to provide a quantitative and objective description of chemical diversity. We have developed a concise and fast approach to enumerate and analyze the exploited vector patterns (EVPs) of molecular compound series, which can then be used in archetypal compound selection tasks, from hit matter identification to hit expansion and lead optimization. We first show that EVPs can be used to assess the progressibility of compounds in a fragment library design exercise. By considering EVPs, we then show how a set of compounds can be prioritized for hit expansion using EVP-based, customizable diversity sampling approaches, reducing the time taken and mitigating human biases. We also show that EVPs are a useful tool to analyze SAR data, offering the chance to uncover correlations between different vectors without predetermining the molecular scaffold structures. The codes used to perform these tasks are presented as easy-to-use Jupyter notebooks, which can be readily adapted for further related tasks.
Collapse
Affiliation(s)
| | | | | | | | - N Yi Mok
- BenevolentAI, 4-8 Maple Street, London W1T 5HD, U.K
| | | |
Collapse
|
2
|
Ma M, Zhang X, Zhou L, Han Z, Shi Y, Li J, Wu L, Xu Z, Zhu W. D3Rings: A Fast and Accurate Method for Ring System Identification and Deep Generation of Drug-Like Cyclic Compounds. J Chem Inf Model 2024; 64:724-736. [PMID: 38206320 DOI: 10.1021/acs.jcim.3c01657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2024]
Abstract
Continuous exploration of the chemical space of molecules to find ligands with high affinity and specificity for specific targets is an important topic in drug discovery. A focus on cyclic compounds, particularly natural compounds with diverse scaffolds, provides important insights into novel molecular structures for drug design. However, the complexity of their ring structures has hindered the applicability of widely accepted methods and software for the systematic identification and classification of cyclic compounds. Herein, we successfully developed a new method, D3Rings, to identify acyclic, monocyclic, spiro ring, fused and bridged ring, and cage ring compounds, as well as macrocyclic compounds. By using D3Rings, we completed the statistics of cyclic compounds in three different databases, e.g., ChEMBL, DrugBank, and COCONUT. The results demonstrated the richness of ring structures in natural products, especially spiro, macrocycles, and fused and bridged rings. Based on this, three deep generative models, namely, VAE, AAE, and CharRNN, were trained and used to construct two data sets similar to DrugBank and COCONUT but 10 times larger than them. The enlarged data sets were then used to explore the molecular chemical space, focusing on complex ring structures, for novel drug discovery and development. Docking experiments with the newly generated COCONUT-like data set against three SARS-CoV-2 target proteins revealed that an expanded compound database improves molecular docking results. Cyclic structures exhibited the best docking scores among the top-ranked docking molecules. These results suggest the importance of exploring the chemical space of structurally novel cyclic compounds and continuous expansion of the library of drug-like compounds to facilitate the discovery of potent ligands with high binding affinity to specific targets. D3Rings is now freely available at http://www.d3pharma.com/D3Rings/.
Collapse
Affiliation(s)
- Minfei Ma
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xinben Zhang
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
| | - Liping Zhou
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zijian Han
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yulong Shi
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jintian Li
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Leyun Wu
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhijian Xu
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Weiliang Zhu
- Stake Key Laboratory of Drug Research; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai 201203, China
- School of Pharmacy, University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
3
|
Alqahtani S. Improving on in-silico prediction of oral drug bioavailability. Expert Opin Drug Metab Toxicol 2023; 19:665-670. [PMID: 37728393 DOI: 10.1080/17425255.2023.2261366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 09/18/2023] [Indexed: 09/21/2023]
Abstract
INTRODUCTION Although significant development has been made in high-throughput screening of oral drug absorption and oral bioavailability, in silico prediction continues to play an important role in prediction of oral bioavailability and assisting in the proper selection of potential drug candidates. AREAS COVERED This review describes the improvements and latest modeling methods and algorithms available for the prediction of this important parameter. We performed a PubMed database search with a focus on the literature published in the last 15 years. EXPERT OPINION A tremendous efforts have been done in the past several years to develop reliable prediction tools that can provide accurate prediction for oral bioavailability. Several studies demonstrated new methodologies and techniques to develop either web-based in silico predictive tools or integrated PBPK models to predict oral bioavailability for new molecules. Improvements in the databases and the computational power will enhance the in silico prediction accuracy and reliability. Finally, introducing artificial intelligence to the drug development process will help improve the prediction tools.
Collapse
Affiliation(s)
- Saeed Alqahtani
- Department of Clinical Pharmacy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- Clinical Pharmacokinetics and Pharmacogenetics Unit, King Saud University Medical City, Riyadh, Saudi Arabia
| |
Collapse
|
4
|
Kim GU, Cho H, Lee JK, Lee JY, Tae J, Min SJ, Kang T, Cho YS. Stereoselective synthesis of 1,6-diazecanes by a tandem aza-Prins type dimerization and cyclization process. Chem Commun (Camb) 2022; 59:82-85. [PMID: 36475509 DOI: 10.1039/d2cc05133h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
We report the stereocontrolled synthesis of 1,6-diazecanes via a tandem aza-Prins type reaction of N-acyliminium ions with allylsilanes. It involves an aza-Prins type dimerization and cyclization in a single-step operation. This reaction represents the first example of 10-membered N-heterocycle synthesis using an aza-Prins reaction. Also, the interesting formation of an unusual tetracyclic compound through further cyclization of 1,6-diazecane and bicyclic compounds by the intramolecular cyclization of linear allylsilane are described. This tandem aza-Prins protocol provides a new synthetic strategy for the direct synthesis of medium-sized nitrogen heterocycles.
Collapse
Affiliation(s)
- Gyeong Un Kim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea. .,Department of Chemistry, Kyung Hee University, Seoul 02447, Republic of Korea
| | - Hyunmi Cho
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea. .,Department of Chemistry, Yonsei University, Seoul 03722, Republic of Korea
| | - Jae Kyun Lee
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea.
| | - Jae Yeol Lee
- Department of Chemistry, Kyung Hee University, Seoul 02447, Republic of Korea
| | - Jinsung Tae
- Department of Chemistry, Yonsei University, Seoul 03722, Republic of Korea
| | - Sun-Joon Min
- Department of Chemical & Molecular Engineering/Applied Chemistry, Center for Bionano Intelligence Education and Research, Hanyang University, Ansan 15588, Republic of Korea.
| | - Taek Kang
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea.
| | - Yong Seo Cho
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Republic of Korea.
| |
Collapse
|
5
|
Mahon N, Slater K, O'Brien J, Alvarez Y, Reynolds A, Kennedy B. Discovery and Development of the Quininib Series of Ocular Drugs. J Ocul Pharmacol Ther 2022; 38:33-42. [PMID: 35089801 DOI: 10.1089/jop.2021.0074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022] Open
Abstract
The quininib series is a novel collection of small-molecule drugs with antiangiogenic, antivascular permeability, anti-inflammatory, and antiproliferative activity. Quininib was initially identified as a drug hit during a random chemical library screen for determinants of developmental ocular angiogenesis in zebrafish. To enhance drug efficacy, novel quininib analogs were designed by applying medicinal chemistry approaches. The resulting quininib drug series has efficacy in in vitro and ex vivo models of angiogenesis utilizing human cell lines and tissues. In vivo, quininib drugs reduce pathological angiogenesis and retinal vascular permeability in rodent models. Quininib acts as a cysteinyl leukotriene (CysLT) receptor antagonist, revealing new roles of these G-protein-coupled receptors in developmental angiogenesis of the eye and unexpectedly in uveal melanoma (UM). The quininib series highlighted the potential of CysLT receptors as therapeutic targets for retinal vasculopathies (e.g., neovascular age-related macular degeneration, diabetic retinopathy, and diabetic macular edema) and ocular cancers (e.g., UM).
Collapse
Affiliation(s)
- Niamh Mahon
- UCD School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland.,UCD Conway Institute, University College Dublin, Dublin, Ireland
| | - Kayleigh Slater
- UCD School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland.,UCD Conway Institute, University College Dublin, Dublin, Ireland
| | - Justine O'Brien
- UCD School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland.,UCD Conway Institute, University College Dublin, Dublin, Ireland
| | - Yolanda Alvarez
- UCD School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland.,UCD Conway Institute, University College Dublin, Dublin, Ireland
| | - Alison Reynolds
- UCD Conway Institute, University College Dublin, Dublin, Ireland.,UCD School of Veterinary Medicine, Veterinary Sciences Center, University College Dublin, Dublin, Ireland
| | - Breandán Kennedy
- UCD School of Biomolecular and Biomedical Science, University College Dublin, Dublin, Ireland.,UCD Conway Institute, University College Dublin, Dublin, Ireland
| |
Collapse
|
6
|
Zabolotna Y, Volochnyuk DM, Ryabukhin SV, Horvath D, Gavrilenko KS, Marcou G, Moroz YS, Oksiuta O, Varnek A. A Close-up Look at the Chemical Space of Commercially Available Building Blocks for Medicinal Chemistry. J Chem Inf Model 2021; 62:2171-2185. [PMID: 34928600 DOI: 10.1021/acs.jcim.1c00811] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The ability to efficiently synthesize desired compounds can be a limiting factor for chemical space exploration in drug discovery. This ability is conditioned not only by the existence of well-studied synthetic protocols but also by the availability of corresponding reagents, so-called building blocks (BBs). In this work, we present a detailed analysis of the chemical space of 400 000 purchasable BBs. The chemical space was defined by corresponding synthons─fragments contributed to the final molecules upon reaction. They allow an analysis of BB physicochemical properties and diversity, unbiased by the leaving and protective groups in actual reagents. The main classes of BBs were analyzed in terms of their availability, rule-of-two-defined quality, and diversity. Available BBs were eventually compared to a reference set of biologically relevant synthons derived from ChEMBL fragmentation, in order to illustrate how well they cover the actual medicinal chemistry needs. This was performed on a newly constructed universal generative topographic map of synthon chemical space that enables visualization of both libraries and analysis of their overlapped and library-specific regions.
Collapse
Affiliation(s)
- Yuliana Zabolotna
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081, France
| | - Dmitriy M Volochnyuk
- Institute of Organic Chemistry, National Academy of Sciences of Ukraine, Murmanska Street 5, Kyiv 02660, Ukraine.,Enamine Ltd., 78 Chervonotkatska str., 02660 Kiev, Ukraine
| | - Sergey V Ryabukhin
- The Institute of High Technologies, Kyiv National Taras Shevchenko University, 64 Volodymyrska Street, Kyiv 01601, Ukraine.,Enamine Ltd., 78 Chervonotkatska str., 02660 Kiev, Ukraine
| | - Dragos Horvath
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081, France
| | - Konstantin S Gavrilenko
- Research-And-Education ChemBioCenter, National Taras Shevchenko University of Kyiv, Chervonotkatska str., 61, 03022 Kiev, Ukraine.,Enamine Ltd., 78 Chervonotkatska str., 02660 Kiev, Ukraine
| | - Gilles Marcou
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081, France
| | - Yurii S Moroz
- Research-And-Education ChemBioCenter, National Taras Shevchenko University of Kyiv, Chervonotkatska str., 61, 03022 Kiev, Ukraine.,Chemspace, Chervonotkatska Street 78, 02094 Kyiv, Ukraine
| | - Oleksandr Oksiuta
- Institute of Organic Chemistry, National Academy of Sciences of Ukraine, Murmanska Street 5, Kyiv 02660, Ukraine.,Chemspace, Chervonotkatska Street 78, 02094 Kyiv, Ukraine
| | - Alexandre Varnek
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081, France.,Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Kita-ku, 001-0021 Sapporo, Japan
| |
Collapse
|
7
|
Yang L, Yang G, Bing Z, Tian Y, Niu Y, Huang L, Yang L. Transformer-Based Generative Model Accelerating the Development of Novel BRAF Inhibitors. ACS OMEGA 2021; 6:33864-33873. [PMID: 34926933 PMCID: PMC8674994 DOI: 10.1021/acsomega.1c05145] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/08/2021] [Indexed: 06/14/2023]
Abstract
The de novo drug design based on SMILES format is a typical sequence-processing problem. Previous methods based on recurrent neural network (RNN) exhibit limitation in capturing long-range dependency, resulting in a high invalid percentage in generated molecules. Recent studies have shown the potential of Transformer architecture to increase the capacity of handling sequence data. In this work, the encoder module in the Transformer is used to build a generative model. First, we train a Transformer-encoder-based generative model to learn the grammatical rules of known drug molecules and a predictive model to predict the activity of the molecules. Subsequently, transfer learning and reinforcement learning were used to fine-tune and optimize the generative model, respectively, to design new molecules with desirable activity. Compared with previous RNN-based methods, our method has improved the percentage of generating chemically valid molecules (from 95.6 to 98.2%), the structural diversity of the generated molecules, and the feasibility of molecular synthesis. The pipeline is validated by designing inhibitors against the human BRAF protein. Molecular docking and binding mode analysis showed that our method can generate small molecules with higher activity than those carrying ligands in the crystal structure and have similar interaction sites with these ligands, which can provide new ideas and suggestions for pharmaceutical chemists.
Collapse
Affiliation(s)
- Lijuan Yang
- Institute
of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- School
of Physics and Technology, Lanzhou University, Lanzhou 730000, China
- School
of Physics, University of Chinese Academy
of Sciences, Beijing 100049, China
- Advanced
Energy Science and Technology, Guangdong
Laboratory, Huizhou 516000, China
| | - Guanghui Yang
- Institute
of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Advanced
Energy Science and Technology, Guangdong
Laboratory, Huizhou 516000, China
| | - Zhitong Bing
- Institute
of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Advanced
Energy Science and Technology, Guangdong
Laboratory, Huizhou 516000, China
| | - Yuan Tian
- Institute
of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- School
of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China
| | - Yuzhen Niu
- Shandong
Provincial Research Center for Bioinformatic Engineering and Technique,
School of Life Sciences, Shandong University
of Technology, Zibo 255000, China
| | - Liang Huang
- School
of Physics and Technology, Lanzhou University, Lanzhou 730000, China
| | - Lei Yang
- Institute
of Modern Physics, Chinese Academy of Sciences, Lanzhou 730000, China
- Advanced
Energy Science and Technology, Guangdong
Laboratory, Huizhou 516000, China
| |
Collapse
|
8
|
Leguy J, Glavatskikh M, Cauchy T, Da Mota B. Scalable estimator of the diversity for de novo molecular generation resulting in a more robust QM dataset (OD9) and a more efficient molecular optimization. J Cheminform 2021; 13:76. [PMID: 34600576 PMCID: PMC8487551 DOI: 10.1186/s13321-021-00554-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Accepted: 09/15/2021] [Indexed: 01/21/2023] Open
Abstract
Chemical diversity is one of the key term when dealing with machine learning and molecular generation. This is particularly true for quantum chemical datasets. The composition of which should be done meticulously since the calculation is highly time demanding. Previously we have seen that the most known quantum chemical dataset QM9 lacks chemical diversity. As a consequence, ML models trained on QM9 showed generalizability shortcomings. In this paper we would like to present (i) a fast and generic method to evaluate chemical diversity, (ii) a new quantum chemical dataset of 435k molecules, OD9, that includes QM9 and new molecules generated with a diversity objective, (iii) an analysis of the diversity impact on unconstrained and goal-directed molecular generation on the example of QED optimization. Our innovative approach makes it possible to individually estimate the impact of a solution to the diversity of a set, allowing for effective incremental evaluation. In the first application, we will see how the diversity constraint allows us to generate more than a million of molecules that would efficiently complete the reference datasets. The compounds were calculated with DFT thanks to a collaborative effort through the QuChemPedIA@home BOINC project. With regard to goal-directed molecular generation, getting a high QED score is not complicated, but adding a little diversity can cut the number of calls to the evaluation function by a factor of ten.
Collapse
Affiliation(s)
- Jules Leguy
- Univ Angers, LERIA, SFR MATHSTIC, 49000, Angers, France
| | - Marta Glavatskikh
- Univ Angers, LERIA, SFR MATHSTIC, 49000, Angers, France.,Univ Angers, CNRS, MOLTECH-ANJOU, SFR MATRIX, 49000, Angers, France
| | - Thomas Cauchy
- Univ Angers, CNRS, MOLTECH-ANJOU, SFR MATRIX, 49000, Angers, France.
| | - Benoit Da Mota
- Univ Angers, LERIA, SFR MATHSTIC, 49000, Angers, France.
| |
Collapse
|
9
|
A chemoinformatic analysis of atoms, scaffolds and functional groups in natural products. PHYSICAL SCIENCES REVIEWS 2021. [DOI: 10.1515/psr-2019-0096] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
In the quest to know why natural products (NPs) have often been considered as privileged scaffolds for drug discovery purposes, many investigations into the differences between NPs and synthetic compounds have been carried out. Several attempts to answer this question have led to the investigation of the atomic composition, scaffolds and functional groups (FGs) of NPs, in comparison with synthetic drugs analysis. This chapter briefly describes an atomic enumeration method for chemical libraries that has been applied for the analysis of NP libraries, followed by a description of the main differences between NPs of marine and terrestrial origin in terms of their general physicochemical properties, most common scaffolds and “drug-likeness” properties. The last parts of the work describe an analysis of scaffolds and FGs common in NP libraries, focusing on huge NP databases, e.g. those in the Dictionary of Natural Products (DNP), NPs from cyanobacteria and the largest chemical class of NP – terpenoids.
Collapse
|
10
|
Discovery of Novel Chemical Series of OXA-48 β-Lactamase Inhibitors by High-Throughput Screening. Pharmaceuticals (Basel) 2021; 14:ph14070612. [PMID: 34202402 PMCID: PMC8308845 DOI: 10.3390/ph14070612] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/21/2021] [Accepted: 06/23/2021] [Indexed: 11/17/2022] Open
Abstract
The major cause of bacterial resistance to β-lactams is the production of hydrolytic β-lactamase enzymes. Nowadays, the combination of β-lactam antibiotics with β-lactamase inhibitors (BLIs) is the main strategy for overcoming such issues. Nevertheless, particularly challenging β-lactamases, such as OXA-48, pose the need for novel and effective treatments. Herein, we describe the screening of a proprietary compound collection against Klebsiella pneumoniae OXA-48, leading to the identification of several chemotypes, like the 4-ideneamino-4H-1,2,4-triazole (SC_2) and pyrazolo[3,4-b]pyridine (SC_7) cores as potential inhibitors. Importantly, the most potent representative of the latter series (ID2, AC50 = 0.99 μM) inhibited OXA-48 via a reversible and competitive mechanism of action, as demonstrated by biochemical and X-ray studies; furthermore, it slightly improved imipenem’s activity in Escherichia coli ATCC BAA-2523 β-lactam resistant strain. Also, ID2 showed good solubility and no sign of toxicity up to the highest tested concentration, resulting in a promising starting point for further optimization programs toward novel and effective non-β-lactam BLIs.
Collapse
|
11
|
Egieyeh S, Malan SF, Christoffels A. Cheminformatics techniques in antimalarial drug discovery and development from natural products 2: Molecular scaffold and machine learning approaches. PHYSICAL SCIENCES REVIEWS 2021. [DOI: 10.1515/psr-2019-0029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
A large number of natural products, especially those used in ethnomedicine of malaria, have shown varying in-vitro antiplasmodial activities. Cheminformatics involves the organization, integration, curation, standardization, simulation, mining and transformation of pharmacology data (compounds and bioactivity) into knowledge that can drive rational and viable drug development decisions. This chapter will review the application of two cheminformatics techniques (including molecular scaffold analysis and bioactivity predictive modeling via Machine learning) to natural products with in-vitro and in-vivo antiplasmodial activities in order to facilitate their development into antimalarial drug candidates and design of new potential antimalarial compounds.
Collapse
Affiliation(s)
- Samuel Egieyeh
- School of Pharmacy , University of the Western Cape Faculty of Natural Science , Belville , South Africa
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| | - Sarel F. Malan
- School of Pharmacy , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| | - Alan Christoffels
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute , University of the Western Cape Faculty of Natural Science , Belville , South Africa
| |
Collapse
|
12
|
Sakai M, Nagayasu K, Shibui N, Andoh C, Takayama K, Shirakawa H, Kaneko S. Prediction of pharmacological activities from chemical structures with graph convolutional neural networks. Sci Rep 2021; 11:525. [PMID: 33436854 PMCID: PMC7803991 DOI: 10.1038/s41598-020-80113-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 12/17/2020] [Indexed: 01/29/2023] Open
Abstract
Many therapeutic drugs are compounds that can be represented by simple chemical structures, which contain important determinants of affinity at the site of action. Recently, graph convolutional neural network (GCN) models have exhibited excellent results in classifying the activity of such compounds. For models that make quantitative predictions of activity, more complex information has been utilized, such as the three-dimensional structures of compounds and the amino acid sequences of their respective target proteins. As another approach, we hypothesized that if sufficient experimental data were available and there were enough nodes in hidden layers, a simple compound representation would quantitatively predict activity with satisfactory accuracy. In this study, we report that GCN models constructed solely from the two-dimensional structural information of compounds demonstrated a high degree of activity predictability against 127 diverse targets from the ChEMBL database. Using the information entropy as a metric, we also show that the structural diversity had less effect on the prediction performance. Finally, we report that virtual screening using the constructed model identified a new serotonin transporter inhibitor with activity comparable to that of a marketed drug in vitro and exhibited antidepressant effects in behavioural studies.
Collapse
Affiliation(s)
- Miyuki Sakai
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan ,Medical Database Ltd., 2-5-5 Sumitomoshibadaimon building, Shibadaimon, Minato-ku, Tokyo, 105-0012 Japan
| | - Kazuki Nagayasu
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Norihiro Shibui
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Chihiro Andoh
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Kaito Takayama
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Hisashi Shirakawa
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Shuji Kaneko
- grid.258799.80000 0004 0372 2033Department of Molecular Pharmacology, Graduate School of Pharmaceutical Sciences, Kyoto University, 46-29 Yoshida-Shimoadachi-cho, Sakyo-ku, Kyoto, 606-8501 Japan
| |
Collapse
|
13
|
Zabolotna Y, Lin A, Horvath D, Marcou G, Volochnyuk DM, Varnek A. Chemography: Searching for Hidden Treasures. J Chem Inf Model 2020; 61:179-188. [PMID: 33334102 DOI: 10.1021/acs.jcim.0c00936] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
The days when medicinal chemistry was limited to a few series of compounds of therapeutic interest are long gone. Nowadays, no human may succeed to acquire a complete overview of more than a billion existing or feasible compounds within which the potential "blockbuster drugs" are well hidden and yet only a few mouse clicks away. To reach these "hidden treasures", we adapted the generative topographic mapping method to enable efficient navigation through the chemical space, from a global overview to a structural pattern detection, covering, for the first time, the complete ZINC library of purchasable compounds, relative to 1.6 million biologically relevant ChEMBL molecules. About 40 000 hierarchical maps of the chemical space were constructed. Structural motifs inherent to only one library were identified. Roughly 20 000 off-market ChEMBL compound families represent incentives to enrich commercial catalogs. Alternatively, 125 000 ZINC-specific compound classes, absent in structure-activity bases, are novel paths to explore in medicinal chemistry. The complete list of these chemotypes can be downloaded using the link https://forms.gle/B6bUJj82t9EfmttV6.
Collapse
Affiliation(s)
- Yuliana Zabolotna
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081 France
| | - Arkadii Lin
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081 France
| | - Dragos Horvath
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081 France
| | - Gilles Marcou
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081 France
| | - Dmitriy M Volochnyuk
- Institute of Organic Chemistry National Academy of Sciences of Ukraine, Murmanska Street 5, Kyiv 02660, Ukraine.,Enamine Ltd., Chervonotkatska Street 78, Kyiv 02094, Ukraine
| | - Alexandre Varnek
- University of Strasbourg, Laboratoire de Chemoinformatique, 4, rue B. Pascal, Strasbourg 67081 France
| |
Collapse
|
14
|
Wu F, Zhou Y, Li L, Shen X, Chen G, Wang X, Liang X, Tan M, Huang Z. Computational Approaches in Preclinical Studies on Drug Discovery and Development. Front Chem 2020; 8:726. [PMID: 33062633 PMCID: PMC7517894 DOI: 10.3389/fchem.2020.00726] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Accepted: 07/14/2020] [Indexed: 12/11/2022] Open
Abstract
Because undesirable pharmacokinetics and toxicity are significant reasons for the failure of drug development in the costly late stage, it has been widely recognized that drug ADMET properties should be considered as early as possible to reduce failure rates in the clinical phase of drug discovery. Concurrently, drug recalls have become increasingly common in recent years, prompting pharmaceutical companies to increase attention toward the safety evaluation of preclinical drugs. In vitro and in vivo drug evaluation techniques are currently more mature in preclinical applications, but these technologies are costly. In recent years, with the rapid development of computer science, in silico technology has been widely used to evaluate the relevant properties of drugs in the preclinical stage and has produced many software programs and in silico models, further promoting the study of ADMET in vitro. In this review, we first introduce the two ADMET prediction categories (molecular modeling and data modeling). Then, we perform a systematic classification and description of the databases and software commonly used for ADMET prediction. We focus on some widely studied ADMT properties as well as PBPK simulation, and we list some applications that are related to the prediction categories and web tools. Finally, we discuss challenges and limitations in the preclinical area and propose some suggestions and prospects for the future.
Collapse
Affiliation(s)
- Fengxu Wu
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, China
| | - Yuquan Zhou
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Langhui Li
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Xianhuan Shen
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Ganying Chen
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Xiaoqing Wang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Xianyang Liang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Mengyuan Tan
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Zunnan Huang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
- Marine Biomedical Research Institute of Guangdong Zhanjiang, Zhanjiang, China
| |
Collapse
|
15
|
Sharma R. Data science-driven analyses of drugs inducing hypertension as an adverse effect. Mol Divers 2020; 25:801-810. [PMID: 32415493 DOI: 10.1007/s11030-020-10059-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2019] [Accepted: 02/21/2020] [Indexed: 12/15/2022]
Abstract
The utilization of approved medication is a requisite to combat certain diseases for health; however, the undesirable adverse effects (AEs) due to medication are generally unavoidable. Hypertension is one of such AEs resulting from approved medication in which blood pressure in the arteries gets elevated and is a risk factor for several diseases including heart and kidney failure. HTs are the approved drugs that can cause hypertension as an AE. Here, the goal of the study is to investigate the structural and functional diversities of HTs. In our quest to unravel the structural parameters of the HTs, a systematic analysis of the HTs having a different number and type of ring systems was conducted. The cellular component, molecular function and biological processes adopted by the gene products were analyzed. Moreover, our systematically done analysis suggests that all the target families are active in a common pathway, that is, nerve transmission. A comparison of the selected structural and functional aspect of HTs with anti-hypertensives suggests that HTs follow certain structural and functional features in spite of many possibilities. Our study provides a promising methodology that considers the influence of structural diversity of AE causing agents on a functional perspective for precursory clinical decision making. This could be extended to explore the structural and functional trends that are adopted by agents causing certain diseases or AEs.
Collapse
Affiliation(s)
- Reetu Sharma
- CSIR-Indian Institute of Chemical Technology, Tarnaka, Hyderabad, 500007, India.
| |
Collapse
|
16
|
Yang J, Wang D, Jia C, Wang M, Hao G, Yang G. Freely Accessible Chemical Database Resources of Compounds for In Silico Drug Discovery. Curr Med Chem 2020; 26:7581-7597. [PMID: 29737247 DOI: 10.2174/0929867325666180508100436] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 01/26/2018] [Accepted: 04/18/2018] [Indexed: 11/22/2022]
Abstract
BACKGROUND In silico drug discovery has been proved to be a solidly established key component in early drug discovery. However, this task is hampered by the limitation of quantity and quality of compound databases for screening. In order to overcome these obstacles, freely accessible database resources of compounds have bloomed in recent years. Nevertheless, how to choose appropriate tools to treat these freely accessible databases is crucial. To the best of our knowledge, this is the first systematic review on this issue. OBJECTIVE The existed advantages and drawbacks of chemical databases were analyzed and summarized based on the collected six categories of freely accessible chemical databases from literature in this review. RESULTS Suggestions on how and in which conditions the usage of these databases could be reasonable were provided. Tools and procedures for building 3D structure chemical libraries were also introduced. CONCLUSION In this review, we described the freely accessible chemical database resources for in silico drug discovery. In particular, the chemical information for building chemical database appears as attractive resources for drug design to alleviate experimental pressure.
Collapse
Affiliation(s)
- JingFang Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Di Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Chenyang Jia
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Mengyao Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - GeFei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - GuangFu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,Collaborative Innovation Center of Chemical Science and Engineering, Tianjin 300072, China
| |
Collapse
|
17
|
Yang Y, Zhang Y, Hua Y, Chen X, Fan Y, Wang Y, Liang L, Deng C, Lu T, Chen Y, Liu H. In Silico Design and Analysis of a Kinase-Focused Combinatorial Library Considering Diversity and Quality. J Chem Inf Model 2020; 60:92-107. [PMID: 31886658 DOI: 10.1021/acs.jcim.9b00841] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
A structurally diverse, high-quality, and kinase-focused database plays a critical role in finding hits or leads in kinase drug discovery. Here, we propose a workflow for designing a virtual kinase-focused combinatorial library using existing structures. Based on the analysis of known protein kinase inhibitors (PKIs), detailed fragment optimization, fragment selection, fragment linking, and a molecular filtering scheme were defined. Quick recognition of core fragments that can possibly form dual hydrogen bonds with the hinge region of the ATP-pocket was proposed. Furthermore, three diversity and four quality metrics were chosen for compound library analysis, which can be applied to databases with over 30 million structures. Compared with 13 commercial libraries, our protocol demonstrates a special advantage in terms of good skeleton diversity, acceptable fingerprint diversity, balanced scaffold distribution, and high quality, which can work well not only on existing PKIs, but also on four chosen commercial libraries. Overall, the strategy can greatly facilitate the expansion of a desirable chemical space for kinase drug discovery.
Collapse
Affiliation(s)
- Yan Yang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yanmin Zhang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yi Hua
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Xingye Chen
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yuanrong Fan
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Yuchen Wang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Li Liang
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Chenglong Deng
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Tao Lu
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China.,State Key Laboratory of Natural Medicines , China Pharmaceutical University , 24 Tongjiaxiang , Nanjing 210009 , China
| | - Yadong Chen
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| | - Haichun Liu
- Laboratory of Molecular Design and Drug Discovery, School of Science , China Pharmaceutical University , 639 Longmian Avenue , Nanjing 211198 , China
| |
Collapse
|
18
|
Cluster analysis to identify prominent patterns of anti-hypertensives: A three-tiered unsupervised learning approach. INFORMATICS IN MEDICINE UNLOCKED 2020. [DOI: 10.1016/j.imu.2020.100303] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
|
19
|
Glavatskikh M, Leguy J, Hunault G, Cauchy T, Da Mota B. Dataset's chemical diversity limits the generalizability of machine learning predictions. J Cheminform 2019; 11:69. [PMID: 33430991 PMCID: PMC6852905 DOI: 10.1186/s13321-019-0391-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Accepted: 10/28/2019] [Indexed: 01/18/2023] Open
Abstract
The QM9 dataset has become the golden standard for Machine Learning (ML) predictions of various chemical properties. QM9 is based on the GDB, which is a combinatorial exploration of the chemical space. ML molecular predictions have been recently published with an accuracy on par with Density Functional Theory calculations. Such ML models need to be tested and generalized on real data. PC9, a new QM9 equivalent dataset (only H, C, N, O and F and up to 9 “heavy” atoms) of the PubChemQC project is presented in this article. A statistical study of bonding distances and chemical functions shows that this new dataset encompasses more chemical diversity. Kernel Ridge Regression, Elastic Net and the Neural Network model provided by SchNet have been used on both datasets. The overall accuracy in energy prediction is higher for the QM9 subset. However, a model trained on PC9 shows a stronger ability to predict energies of the other dataset. ![]()
Collapse
Affiliation(s)
- Marta Glavatskikh
- LERIA, University of Angers, 2 Bd Lavoisier, 49045, Angers, France.,Laboratoire MOLTECH-Anjou, UMR CNRS 6200, SFR MATRIX, UNIV Angers, 2 Bd Lavoisier, 49045, Angers, France
| | - Jules Leguy
- LERIA, University of Angers, 2 Bd Lavoisier, 49045, Angers, France
| | - Gilles Hunault
- LERIA, University of Angers, 2 Bd Lavoisier, 49045, Angers, France.,HIFIH, EA 3859, Institut de Biologie en Santé PBH-IRIS, CHU, University of Angers, 4, Rue Larrey, 49933, Angers, France
| | - Thomas Cauchy
- Laboratoire MOLTECH-Anjou, UMR CNRS 6200, SFR MATRIX, UNIV Angers, 2 Bd Lavoisier, 49045, Angers, France.
| | - Benoit Da Mota
- LERIA, University of Angers, 2 Bd Lavoisier, 49045, Angers, France
| |
Collapse
|
20
|
Naveja JJ, Pilón-Jiménez BA, Bajorath J, Medina-Franco JL. A general approach for retrosynthetic molecular core analysis. J Cheminform 2019; 11:61. [PMID: 33430974 PMCID: PMC6760108 DOI: 10.1186/s13321-019-0380-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 08/04/2019] [Indexed: 11/13/2022] Open
Abstract
Scaffold analysis of compound data sets has reemerged as a chemically interpretable alternative to machine learning for chemical space and structure–activity relationships analysis. In this context, analog series-based scaffolds (ASBS) are synthetically relevant core structures that represent individual series of analogs. As an extension to ASBS, we herein introduce the development of a general conceptual framework that considers all putative cores of molecules in a compound data set, thus softening the often applied “single molecule–single scaffold” correspondence. A putative core is here defined as any substructure of a molecule complying with two basic rules: (a) the size of the core is a significant proportion of the whole molecule size and (b) the substructure can be reached from the original molecule through a succession of retrosynthesis rules. Thereafter, a bipartite network consisting of molecules and cores can be constructed for a database of chemical structures. Compounds linked to the same cores are considered analogs. We present case studies illustrating the potential of the general framework. The applications range from inter- and intra-core diversity analysis of compound data sets, structure–property relationships, and identification of analog series and ASBS. The molecule–core network herein presented is a general methodology with multiple applications in scaffold analysis. New statistical methods are envisioned that will be able to draw quantitative conclusions from these data. The code to use the method presented in this work is freely available as an additional file. Follow-up applications include analog searching and core structure–property relationships analyses.![]()
Collapse
Affiliation(s)
- J Jesús Naveja
- PECEM, School of Medicine, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico. .,Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico.
| | - B Angélica Pilón-Jiménez
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Endenicher Allee 19c, 53115, Bonn, Germany
| | - José L Medina-Franco
- Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de México, Avenida Universidad 3000, 04510, Mexico City, Mexico.
| |
Collapse
|
21
|
Volochnyuk DM, Ryabukhin SV, Moroz YS, Savych O, Chuprina A, Horvath D, Zabolotna Y, Varnek A, Judd DB. Evolution of commercially available compounds for HTS. Drug Discov Today 2019; 24:390-402. [DOI: 10.1016/j.drudis.2018.10.016] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Revised: 10/02/2018] [Accepted: 10/30/2018] [Indexed: 12/17/2022]
|
22
|
Shang J, Hu B, Wang J, Zhu F, Kang Y, Li D, Sun H, Kong DX, Hou T. Cheminformatic Insight into the Differences between Terrestrial and Marine Originated Natural Products. J Chem Inf Model 2018; 58:1182-1193. [PMID: 29792805 DOI: 10.1021/acs.jcim.8b00125] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
This is a new golden age for drug discovery based on natural products derived from both marine and terrestrial sources. Herein, a straightforward but important question is "what are the major structural differences between marine natural products (MNPs) and terrestrial natural products (TNPs)?" To answer this question, we analyzed the important physicochemical properties, structural features, and drug-likeness of the two types of natural products and discussed their differences from the perspective of evolution. In general, MNPs have lower solubility and are often larger than TNPs. On average, particularly from the perspective of unique fragments and scaffolds, MNPs usually possess more long chains and large rings, especially 8- to 10-membered rings. MNPs also have more nitrogen atoms and halogens, notably bromines, and fewer oxygen atoms, suggesting that MNPs may be synthesized by more diverse biosynthetic pathways than TNPs. Analysis of the frequently occurring Murcko frameworks in MNPs and TNPS also reveals a striking difference between MNPs and TNPs. The scaffolds of the former tend to be longer and often contain ester bonds connected to 10-membered rings, while the scaffolds of the latter tend to be shorter and often bear more stable ring systems and bond types. Besides, the prediction from the naïve Bayesian drug-likeness classification model suggests that most compounds in MNPs and TNPs are drug-like, although MNPs are slightly more drug-like than TNPs. We believe that MNPs and TNPs with novel drug-like scaffolds have great potential to be drug leads or drug candidates in drug discovery campaigns.
Collapse
Affiliation(s)
- Jun Shang
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China.,State Key Laboratory of Agricultural Microbiology and Agricultural Bioinformatics, Key Laboratory of Hubei Province, College of Informatics , Huazhong Agricultural University , Wuhan 430070 , China.,State Key Lab of CAD&CG , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| | - Ben Hu
- State Key Laboratory of Agricultural Microbiology and Agricultural Bioinformatics, Key Laboratory of Hubei Province, College of Informatics , Huazhong Agricultural University , Wuhan 430070 , China
| | - Junmei Wang
- Department of Pharmaceutical Sciences , University of Pittsburgh , Pittsburgh , Pennsylvania 15261 , United States
| | - Feng Zhu
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| | - Yu Kang
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| | - Dan Li
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| | - Huiyong Sun
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| | - De-Xin Kong
- State Key Laboratory of Agricultural Microbiology and Agricultural Bioinformatics, Key Laboratory of Hubei Province, College of Informatics , Huazhong Agricultural University , Wuhan 430070 , China
| | - Tingjun Hou
- College of Pharmaceutical Sciences , Zhejiang University , Hangzhou , Zhejiang 310058 , China.,State Key Lab of CAD&CG , Zhejiang University , Hangzhou , Zhejiang 310058 , China
| |
Collapse
|