1
|
Arab I, Egghe K, Laukens K, Chen K, Barakat K, Bittremieux W. Benchmarking of Small Molecule Feature Representations for hERG, Nav1.5, and Cav1.2 Cardiotoxicity Prediction. J Chem Inf Model 2024; 64:2515-2527. [PMID: 37870574 DOI: 10.1021/acs.jcim.3c01301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2023]
Abstract
In the field of drug discovery, there is a substantial challenge in seeking out chemical structures that possess desirable pharmacological, toxicological, and pharmacokinetic properties. Complications arise when drugs interfere with the functioning of cardiac ion channels, leading to serious cardiovascular consequences. The discontinuation and removal of numerous approved drugs from the market or at late development stages in the pipeline due to such inhibitory effects further highlight the urgency of addressing this issue. Consequently, the early prediction of potential blockers targeting cardiac ion channels during the drug discovery process is of paramount importance. This study introduces a deep learning framework that computationally determines the cardiotoxicity associated with the voltage-gated potassium channel (hERG), the voltage-gated calcium channel (Cav1.2), and the voltage-gated sodium channel (Nav1.5) for drug candidates. The predictive capabilities of three feature representations─molecular fingerprints, descriptors, and graph-based numerical representations─are rigorously benchmarked. Additionally, a novel training and evaluation data set framework is presented, enabling predictive model training of drug off-target cardiotoxicity using a comprehensive and large curated data set covering these three cardiac ion channels. To facilitate these predictions, a robust and comprehensive small molecule cardiotoxicity prediction tool named CToxPred has been developed. It is made available as open source under the permissive MIT license at https://github.com/issararab/CToxPred.
Collapse
Affiliation(s)
- Issar Arab
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| | - Kristof Egghe
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
| | - Kris Laukens
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| | - Ke Chen
- Chair for Theoretical Chemistry, Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747 Garching, Germany
| | - Khaled Barakat
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, Alberta 8613, Canada
| | - Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| |
Collapse
|
2
|
Lindley S, Lu Y, Shukla D. The Experimentalist's Guide to Machine Learning for Small Molecule Design. ACS APPLIED BIO MATERIALS 2024; 7:657-684. [PMID: 37535819 PMCID: PMC10880109 DOI: 10.1021/acsabm.3c00054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 07/17/2023] [Indexed: 08/05/2023]
Abstract
Initially part of the field of artificial intelligence, machine learning (ML) has become a booming research area since branching out into its own field in the 1990s. After three decades of refinement, ML algorithms have accelerated scientific developments across a variety of research topics. The field of small molecule design is no exception, and an increasing number of researchers are applying ML techniques in their pursuit of discovering, generating, and optimizing small molecule compounds. The goal of this review is to provide simple, yet descriptive, explanations of some of the most commonly utilized ML algorithms in the field of small molecule design along with those that are highly applicable to an experimentally focused audience. The algorithms discussed here span across three ML paradigms: supervised learning, unsupervised learning, and ensemble methods. Examples from the published literature will be provided for each algorithm. Some common pitfalls of applying ML to biological and chemical data sets will also be explained, alongside a brief summary of a few more advanced paradigms, including reinforcement learning and semi-supervised learning.
Collapse
Affiliation(s)
- Sarah
E. Lindley
- Department
of Bioengineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| | - Yiyang Lu
- Department
of Chemical and Biomolecular Engineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| | - Diwakar Shukla
- Department
of Bioengineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Department
of Chemical and Biomolecular Engineering, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Center
for Biophysics & Computational Biology, University of Illinois, Urbana−Champaign, Illinois 61801, United States
- Department
of Plant Biology, University of Illinois, Urbana−Champaign, Illinois 61801, United States
| |
Collapse
|
3
|
Wiley AM, Yang J, Madhani R, Nath A, Totah RA. Investigating the association between CYP2J2 inhibitors and QT prolongation: a literature review. Drug Metab Rev 2024; 56:145-163. [PMID: 38478383 DOI: 10.1080/03602532.2024.2329928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 03/06/2024] [Indexed: 03/21/2024]
Abstract
Drug withdrawal post-marketing due to cardiotoxicity is a major concern for drug developers, regulatory agencies, and patients. One common mechanism of cardiotoxicity is through inhibition of cardiac ion channels, leading to prolongation of the QT interval and sometimes fatal arrythmias. Recently, oxylipin signaling compounds have been shown to bind to and alter ion channel function, and disruption in their cardiac levels may contribute to QT prolongation. Cytochrome P450 2J2 (CYP2J2) is the predominant CYP isoform expressed in cardiomyocytes, where it oxidizes arachidonic acid to cardioprotective epoxyeicosatrienoic acids (EETs). In addition to roles in vasodilation and angiogenesis, EETs bind to and activate various ion channels. CYP2J2 inhibition can lower EET levels and decrease their ability to preserve cardiac rhythm. In this review, we investigated the ability of known CYP inhibitors to cause QT prolongation using Certara's Drug Interaction Database. We discovered that among the multiple CYP isozymes, CYP2J2 inhibitors were more likely to also be QT-prolonging drugs (by approximately 2-fold). We explored potential binding interactions between these inhibitors and CYP2J2 using molecular docking and identified four amino acid residues (Phe61, Ala223, Asn231, and Leu402) predicted to interact with QT-prolonging drugs. The four residues are located near the opening of egress channel 2, highlighting the potential importance of this channel in CYP2J2 binding and inhibition. These findings suggest that if a drug inhibits CYP2J2 and interacts with one of these four residues, then it may have a higher risk of QT prolongation and more preclinical studies are warranted to assess cardiovascular safety.
Collapse
Affiliation(s)
- Alexandra M Wiley
- Department of Medicinal Chemistry, University of WA School of Pharmacy, Seattle, WA, USA
| | - Jade Yang
- Department of Medicinal Chemistry, University of WA School of Pharmacy, Seattle, WA, USA
| | - Rivcka Madhani
- Department of Medicinal Chemistry, University of WA School of Pharmacy, Seattle, WA, USA
| | - Abhinav Nath
- Department of Medicinal Chemistry, University of WA School of Pharmacy, Seattle, WA, USA
| | - Rheem A Totah
- Department of Medicinal Chemistry, University of WA School of Pharmacy, Seattle, WA, USA
| |
Collapse
|
4
|
Lynch C, Sakamuru S, Ooka M, Huang R, Klumpp-Thomas C, Shinn P, Gerhold D, Rossoshek A, Michael S, Casey W, Santillo MF, Fitzpatrick S, Thomas RS, Simeonov A, Xia M. High-Throughput Screening to Advance In Vitro Toxicology: Accomplishments, Challenges, and Future Directions. Annu Rev Pharmacol Toxicol 2024; 64:191-209. [PMID: 37506331 PMCID: PMC10822017 DOI: 10.1146/annurev-pharmtox-112122-104310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2023]
Abstract
Traditionally, chemical toxicity is determined by in vivo animal studies, which are low throughput, expensive, and sometimes fail to predict compound toxicity in humans. Due to the increasing number of chemicals in use and the high rate of drug candidate failure due to toxicity, it is imperative to develop in vitro, high-throughput screening methods to determine toxicity. The Tox21 program, a unique research consortium of federal public health agencies, was established to address and identify toxicity concerns in a high-throughput, concentration-responsive manner using a battery of in vitro assays. In this article, we review the advancements in high-throughput robotic screening methodology and informatics processes to enable the generation of toxicological data, and their impact on the field; further, we discuss the future of assessing environmental toxicity utilizing efficient and scalable methods that better represent the corresponding biological and toxicodynamic processes in humans.
Collapse
Affiliation(s)
- Caitlin Lynch
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Srilatha Sakamuru
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Masato Ooka
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Ruili Huang
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Carleen Klumpp-Thomas
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Paul Shinn
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - David Gerhold
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Anna Rossoshek
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Sam Michael
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Warren Casey
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, USA
| | - Michael F Santillo
- Division of Toxicology, Office of Applied Research and Safety Assessment, Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, Laurel, Maryland, USA
| | - Suzanne Fitzpatrick
- Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, Maryland, USA
| | - Russell S Thomas
- Center for Computational Toxicology and Exposure, Office of Research and Development, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
| | - Anton Simeonov
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| | - Menghang Xia
- National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA; ,
| |
Collapse
|
5
|
Vemula D, Jayasurya P, Sushmitha V, Kumar YN, Bhandari V. CADD, AI and ML in drug discovery: A comprehensive review. Eur J Pharm Sci 2023; 181:106324. [PMID: 36347444 DOI: 10.1016/j.ejps.2022.106324] [Citation(s) in RCA: 31] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 10/26/2022] [Accepted: 11/03/2022] [Indexed: 11/06/2022]
Abstract
Computer-aided drug design (CADD) is an emerging field that has drawn a lot of interest because of its potential to expedite and lower the cost of the drug development process. Drug discovery research is expensive and time-consuming, and it frequently took 10-15 years for a drug to be commercially available. CADD has significantly impacted this area of research. Further, the combination of CADD with Artificial Intelligence (AI), Machine Learning (ML), and Deep Learning (DL) technologies to handle enormous amounts of biological data has reduced the time and cost associated with the drug development process. This review will discuss how CADD, AI, ML, and DL approaches help identify drug candidates and various other steps of the drug discovery process. It will also provide a detailed overview of the different in silico tools used and how these approaches interact.
Collapse
Affiliation(s)
- Divya Vemula
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | - Perka Jayasurya
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | - Varthiya Sushmitha
- National Institute of Pharmaceutical Education and Research- Hyderabad, India
| | | | - Vasundhra Bhandari
- National Institute of Pharmaceutical Education and Research- Hyderabad, India.
| |
Collapse
|
6
|
Melnikov F, Anger LT, Hasselgren C. Toward Quantitative Models in Safety Assessment: A Case Study to Show Impact of Dose-Response Inference on hERG Inhibition Models. Int J Mol Sci 2022; 24:ijms24010635. [PMID: 36614078 PMCID: PMC9820331 DOI: 10.3390/ijms24010635] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 12/23/2022] [Accepted: 12/24/2022] [Indexed: 12/31/2022] Open
Abstract
Due to challenges with historical data and the diversity of assay formats, in silico models for safety-related endpoints are often based on discretized data instead of the data on a natural continuous scale. Models for discretized endpoints have limitations in usage and interpretation that can impact compound design. Here, we present a consistent data inference approach, exemplified on two data sets of Ether-à-go-go-Related Gene (hERG) K+ inhibition data, for dose-response and screening experiments that are generally applicable for in vitro assays. hERG inhibition has been associated with severe cardiac effects and is one of the more prominent safety targets assessed in drug development, using a wide array of in vitro and in silico screening methods. In this study, the IC50 for hERG inhibition is estimated from diverse historical proprietary data. The IC50 derived from a two-point proprietary screening data set demonstrated high correlation (R = 0.98, MAE = 0.08) with IC50s derived from six-point dose-response curves. Similar IC50 estimation accuracy was obtained on a public thallium flux assay data set (R = 0.90, MAE = 0.2). The IC50 data were used to develop a robust quantitative model. The model's MAE (0.47) and R2 (0.46) were on par with literature statistics and approached assay reproducibility. Using a continuous model has high value for pharmaceutical projects, as it enables rank ordering of compounds and evaluation of compounds against project-specific inhibition thresholds. This data inference approach can be widely applicable to assays with quantitative readouts and has the potential to impact experimental design and improve model performance, interpretation, and acceptance across many standard safety endpoints.
Collapse
|
7
|
Delre P, Lavado GJ, Lamanna G, Saviano M, Roncaglioni A, Benfenati E, Mangiatordi GF, Gadaleta D. Ligand-based prediction of hERG-mediated cardiotoxicity based on the integration of different machine learning techniques. Front Pharmacol 2022; 13:951083. [PMID: 36133824 PMCID: PMC9483173 DOI: 10.3389/fphar.2022.951083] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 07/20/2022] [Indexed: 11/13/2022] Open
Abstract
Drug-induced cardiotoxicity is a common side effect of drugs in clinical use or under postmarket surveillance and is commonly due to off-target interactions with the cardiac human-ether-a-go-go-related (hERG) potassium channel. Therefore, prioritizing drug candidates based on their hERG blocking potential is a mandatory step in the early preclinical stage of a drug discovery program. Herein, we trained and properly validated 30 ligand-based classifiers of hERG-related cardiotoxicity based on 7,963 curated compounds extracted by the freely accessible repository ChEMBL (version 25). Different machine learning algorithms were tested, namely, random forest, K-nearest neighbors, gradient boosting, extreme gradient boosting, multilayer perceptron, and support vector machine. The application of 1) the best practices for data curation, 2) the feature selection method VSURF, and 3) the synthetic minority oversampling technique (SMOTE) to properly handle the unbalanced data, allowed for the development of highly predictive models (BAMAX = 0.91, AUCMAX = 0.95). Remarkably, the undertaken temporal validation approach not only supported the predictivity of the herein presented classifiers but also suggested their ability to outperform those models commonly used in the literature. From a more methodological point of view, the study put forward a new computational workflow, freely available in the GitHub repository (https://github.com/PDelre93/hERG-QSAR), as valuable for building highly predictive models of hERG-mediated cardiotoxicity.
Collapse
Affiliation(s)
- Pietro Delre
- CNR—Institute of Crystallography, Bari, Italy
- Chemistry Department, University of Bari “Aldo Moro”, Bari, Italy
| | - Giovanna J. Lavado
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Giuseppe Lamanna
- CNR—Institute of Crystallography, Bari, Italy
- Chemistry Department, University of Bari “Aldo Moro”, Bari, Italy
| | | | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Giuseppe Felice Mangiatordi
- CNR—Institute of Crystallography, Bari, Italy
- *Correspondence: Giuseppe Felice Mangiatordi, ; Domenico Gadaleta,
| | - Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
- *Correspondence: Giuseppe Felice Mangiatordi, ; Domenico Gadaleta,
| |
Collapse
|
8
|
Shan M, Jiang C, Qin L, Cheng G. A Review of Computational Methods in Predicting hERG Channel Blockers. ChemistrySelect 2022. [DOI: 10.1002/slct.202201221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mengyi Shan
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- QuanMin RenZheng (HangZhou) Technology Co. Ltd. China
| | - Lu‐Ping Qin
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Gang Cheng
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
9
|
Wang Y, Michael S, Yang SM, Huang R, Cruz-Gutierrez K, Zhang Y, Zhao J, Xia M, Shinn P, Sun H. Retro Drug Design: From Target Properties to Molecular Structures. J Chem Inf Model 2022; 62:2659-2669. [PMID: 35653613 PMCID: PMC9198977 DOI: 10.1021/acs.jcim.2c00123] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
![]()
To
deliver more therapeutics to more patients more quickly and
economically is the ultimate goal of pharmaceutical researchers. The
advent and rapid development of artificial intelligence (AI), in combination
with other powerful computational methods in drug discovery, makes
this goal more practical than ever before. Here, we describe a new
strategy, retro drug design, or RDD, to create novel small-molecule
drugs from scratch to meet multiple predefined requirements, including
biological activity against a drug target and optimal range of physicochemical
and ADMET properties. The molecular structure was represented by an
atom typing based molecular descriptor system, optATP, which was further
transformed to the space of loading vectors from principal component
analysis. Traditional predictive models were trained over experimental
data for the target properties using optATP and shallow machine learning
methods. The Monte Carlo sampling algorithm was then utilized to find
the solutions in the space of loading vectors that have the target
properties. Finally, a deep learning model was employed to decode
molecular structures from the solutions. To test the feasibility of
the algorithm, we challenged RDD to generate novel kinase inhibitors
from random numbers with five different ADMET properties optimized
at the same time. The best Tanimoto similarity score between the generated
valid structures and the available 4,314 kinase inhibitors was <
0.50, indicating a high extent of novelty of the generated compounds.
From the 3,040 structures that met all six target properties, 20 were
selected for synthesis and experimental measurement of inhibition
activity over 97 representative kinases and the ADMET properties.
Fifteen and eight compounds were determined to be hits or strong hits,
respectively. Five of the six strong kinase inhibitors have excellent
experimental ADMET properties. The results presented in this paper
illustrate that RDD has the potential to significantly improve the
current drug discovery process.
Collapse
Affiliation(s)
- Yuhong Wang
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Sam Michael
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Shyh-Ming Yang
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Ruili Huang
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Kennie Cruz-Gutierrez
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Yaqing Zhang
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Jinghua Zhao
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Menghang Xia
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Paul Shinn
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Hongmao Sun
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| |
Collapse
|
10
|
Zhang X, Mao J, Wei M, Qi Y, Zhang JZH. HergSPred: Accurate Classification of hERG Blockers/Nonblockers with Machine-Learning Models. J Chem Inf Model 2022; 62:1830-1839. [DOI: 10.1021/acs.jcim.2c00256] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Xudong Zhang
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University at Shanghai, Shanghai 200062, China
| | - Jun Mao
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University at Shanghai, Shanghai 200062, China
| | - Min Wei
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University at Shanghai, Shanghai 200062, China
| | - Yifei Qi
- Department of Medicinal Chemistry, School of Pharmacy, Fudan University, Shanghai 201203, China
| | - John Z. H. Zhang
- Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University at Shanghai, Shanghai 200062, China
- CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
- NYU-ECNU Center for Computational Chemistry at NYU, Shanghai 200062, China
| |
Collapse
|
11
|
Ngan DK, Xu T, Xia M, Zheng W, Huang R. Repurposing drugs as COVID-19 therapies: a toxicity evaluation. Drug Discov Today 2022; 27:1983-1993. [PMID: 35395401 PMCID: PMC8983078 DOI: 10.1016/j.drudis.2022.04.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 02/17/2022] [Accepted: 04/01/2022] [Indexed: 12/24/2022]
Abstract
Drug repurposing is an appealing method to address the Coronavirus 2019 (COVID-19) pandemic because of the low cost and efficiency. We analyzed our in-house database of approved drug screens and compared their activity profiles with results from a severe acute respiratory syndrome-coronavirus 2 (SARS-CoV-2) cytopathic effect (CPE) assay. The activity profiles of the human ether-à-go-go-related gene (hERG), phospholipidosis (PLD), and many cytotoxicity screens were found significantly correlated with anti-SARS-CoV-2 activity. hERG inhibition is a nonspecific off-target effect that has contributed to promiscuous drug interactions, whereas drug-induced PLD is an undesirable effect linked to hERG blockers. Thus, this study identifies preferred drug candidates as well as chemical structures that should be avoided because of their potential to induce toxicity. Lastly, we highlight the hERG liability of anti-SARS-CoV-2 drugs currently enrolled in clinical trials.
Collapse
Affiliation(s)
- Deborah K Ngan
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
| | - Tuan Xu
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
| | - Menghang Xia
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
| | - Wei Zheng
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA
| | - Ruili Huang
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, MD 20850, USA.
| |
Collapse
|
12
|
Krishna S, Borrel A, Huang R, Zhao J, Xia M, Kleinstreuer N. High-Throughput Chemical Screening and Structure-Based Models to Predict hERG Inhibition. BIOLOGY 2022; 11:biology11020209. [PMID: 35205076 PMCID: PMC8869358 DOI: 10.3390/biology11020209] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2021] [Revised: 01/18/2022] [Accepted: 01/21/2022] [Indexed: 12/23/2022]
Abstract
Simple Summary Cardiovascular disease is the leading cause of death for people of most ethnicities in the United States. The human ether-a-go-go-related gene (hERG) potassium channel plays a pivotal role in cardiac rhythm regulation, and cardiotoxicity associated with hERG inhibition by drug molecules and environmental chemicals is a major public health concern. An evaluation of the effect of environmental chemicals on hERG channel function can help inform the potential public health risks of these compounds. To assess the cardiotoxic effect of diverse drugs and environmental compounds, the Tox21 federal research program has screened a collection of 9667 chemicals for inhibitory activity against the hERG channel. A set of molecular descriptors covering physicochemical and structural properties of chemicals, self-organizing maps, and hierarchical clustering were applied to characterize the chemicals inhibiting hERG. Machine learning approaches were applied to build robust statistical models that can predict the probability of any new chemical to cause cardiotoxicity via this mechanism. Abstract Chemical inhibition of the human ether-a -go-go-related gene (hERG) potassium channel leads to a prolonged QT interval that can contribute to severe cardiotoxicity. The adverse effects of hERG inhibition are one of the principal causes of drug attrition in clinical and pre-clinical development. Preliminary studies have demonstrated that a wide range of environmental chemicals and toxicants may also inhibit the hERG channel and contribute to the pathophysiology of cardiovascular (CV) diseases. As part of the US federal Tox21 program, the National Center for Advancing Translational Science (NCATS) applied a quantitative high throughput screening (qHTS) approach to screen the Tox21 library of 10,000 compounds (~7871 unique chemicals) at 14 concentrations in triplicate to identify chemicals perturbing hERG activity in the U2OS cell line thallium flux assay platform. The qHTS cell-based thallium influx assay provided a robust and reliable dataset to evaluate the ability of thousands of drugs and environmental chemicals to inhibit hERG channel protein, and the use of chemical structure-based clustering and chemotype enrichment analysis facilitated the identification of molecular features that are likely responsible for the observed hERG activity. We employed several machine-learning approaches to develop QSAR prediction models for the assessment of hERG liabilities for drug-like and environmental chemicals. The training set was compiled by integrating hERG bioactivity data from the ChEMBL database with the Tox21 qHTS thallium flux assay data. The best results were obtained with the random forest method (~92.6% balanced accuracy). The data and scripts used to generate hERG prediction models are provided in an open-access format as key in vitro and in silico tools that can be applied in a translational toxicology pipeline for drug development and environmental chemical screening.
Collapse
Affiliation(s)
- Shagun Krishna
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences (NIEHS), Research Triangle, NC 27560, USA;
| | | | - Ruili Huang
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), Bethesda, MD 20892-4874, USA; (R.H.); (J.Z.); (M.X.)
| | - Jinghua Zhao
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), Bethesda, MD 20892-4874, USA; (R.H.); (J.Z.); (M.X.)
| | - Menghang Xia
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences (NCATS), Bethesda, MD 20892-4874, USA; (R.H.); (J.Z.); (M.X.)
| | - Nicole Kleinstreuer
- Division of the National Toxicology Program, National Institute of Environmental Health Sciences (NIEHS), Research Triangle, NC 27560, USA;
- Correspondence: ; Tel.: +1-984-287-3150
| |
Collapse
|
13
|
Shan M, Jiang C, Chen J, Qin LP, Qin JJ, Cheng G. Predicting hERG channel blockers with directed message passing neural networks. RSC Adv 2022; 12:3423-3430. [PMID: 35425351 PMCID: PMC8979305 DOI: 10.1039/d1ra07956e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 12/13/2021] [Indexed: 11/30/2022] Open
Abstract
Compounds with human ether-à-go-go related gene (hERG) blockade activity may cause severe cardiotoxicity. Assessing the hERG liability in the early stages of the drug discovery process is important, and the in silico methods for predicting hERG channel blockers are actively pursued. In the present study, the directed message passing neural network (D-MPNN) was applied to construct classification models for identifying hERG blockers based on diverse datasets. Several descriptors and fingerprints were tested along with the D-MPNN model. Among all these combinations, D-MPNN with the moe206 descriptors generated from MOE (D-MPNN + moe206) showed significantly improved performances. The AUC-ROC values of the D-MPNN + moe206 model reached 0.956 ± 0.005 under random split and 0.922 ± 0.015 under scaffold split on Cai's hERG dataset, respectively. Moreover, the comparisons between our models and several recently reported machine learning models were made based on various datasets. Our results indicated that the D-MPNN + moe206 model is among the best classification models. Overall, the excellent performance of the DMPNN + moe206 model achieved in this study highlights its potential application in the discovery of novel and effective hERG blockers. Compounds with human ether-à-go-go related gene (hERG) blockade activity may cause severe cardiotoxicity.![]()
Collapse
Affiliation(s)
- Mengyi Shan
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China .,Hangzhou Jingchun Trading Co., Ltd. China
| | - Jing Chen
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China .,College of Pharmaceutical Sciences, Zhejiang University Hangzhou Zhejiang 310058 PR China
| | - Lu-Ping Qin
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Jiang-Jiang Qin
- The Cancer Hospital of the University of Chinese Academy of Sciences, Zhejiang Cancer Hospital, Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences Hangzhou 310022 China
| | - Gang Cheng
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
14
|
Rishton GM, Look GC, Ni ZJ, Zhang J, Wang Y, Huang Y, Wu X, Izzo NJ, LaBarbera KM, Limegrover CS, Rehak C, Yurko R, Catalano SM. Discovery of Investigational Drug CT1812, an Antagonist of the Sigma-2 Receptor Complex for Alzheimer's Disease. ACS Med Chem Lett 2021; 12:1389-1395. [PMID: 34531947 DOI: 10.1021/acsmedchemlett.1c00048] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Accepted: 08/03/2021] [Indexed: 02/08/2023] Open
Abstract
An unbiased phenotypic neuronal assay was developed to measure the synaptotoxic effects of soluble Aβ oligomers. A collection of CNS druglike small molecules prepared by conditioned extraction was screened. Compounds that prevented and reversed synaptotoxic effects of Aβ oligomers in neurons were discovered to bind to the sigma-2 receptor complex. Select development compounds displaced receptor-bound Aβ oligomers, rescued synapses, and restored cognitive function in transgenic hAPP Swe/Ldn mice. Our first-in-class orally administered small molecule investigational drug 7 (CT1812) has been advanced to Phase II clinical studies for Alzheimer's disease.
Collapse
Affiliation(s)
- Gilbert M. Rishton
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Gary C. Look
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Zhi-Jie Ni
- Acme Bioscience, Inc., 3941 East Bayshore Road, Palo Alto, California 94303, United States
| | - Jason Zhang
- Acme Bioscience, Inc., 3941 East Bayshore Road, Palo Alto, California 94303, United States
| | - Yingcai Wang
- Acme Bioscience, Inc., 3941 East Bayshore Road, Palo Alto, California 94303, United States
| | - Yaodong Huang
- Acme Bioscience, Inc., 3941 East Bayshore Road, Palo Alto, California 94303, United States
| | - Xiaodong Wu
- Acme Bioscience, Inc., 3941 East Bayshore Road, Palo Alto, California 94303, United States
| | - Nicholas J. Izzo
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Kelsie M LaBarbera
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Colleen S. Limegrover
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Courtney Rehak
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Raymond Yurko
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| | - Susan M. Catalano
- Cognition Therapeutics, 2403 Sidney Street, Suite 261, Pittsburgh, Pennsylvania 15203, United States
| |
Collapse
|
15
|
Rácz A, Bajusz D, Miranda-Quintana RA, Héberger K. Machine learning models for classification tasks related to drug safety. Mol Divers 2021; 25:1409-1424. [PMID: 34110577 PMCID: PMC8342376 DOI: 10.1007/s11030-021-10239-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 05/27/2021] [Indexed: 12/23/2022]
Abstract
In this review, we outline the current trends in the field of machine learning-driven classification studies related to ADME (absorption, distribution, metabolism and excretion) and toxicity endpoints from the past six years (2015-2021). The study focuses only on classification models with large datasets (i.e. more than a thousand compounds). A comprehensive literature search and meta-analysis was carried out for nine different targets: hERG-mediated cardiotoxicity, blood-brain barrier penetration, permeability glycoprotein (P-gp) substrate/inhibitor, cytochrome P450 enzyme family, acute oral toxicity, mutagenicity, carcinogenicity, respiratory toxicity and irritation/corrosion. The comparison of the best classification models was targeted to reveal the differences between machine learning algorithms and modeling types, endpoint-specific performances, dataset sizes and the different validation protocols. Based on the evaluation of the data, we can say that tree-based algorithms are (still) dominating the field, with consensus modeling being an increasing trend in drug safety predictions. Although one can already find classification models with great performances to hERG-mediated cardiotoxicity and the isoenzymes of the cytochrome P450 enzyme family, these targets are still central to ADMET-related research efforts.
Collapse
Affiliation(s)
- Anita Rácz
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary.
| | - Dávid Bajusz
- Medicinal Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary
| | | | - Károly Héberger
- Plasma Chemistry Research Group, Research Centre for Natural Sciences, Magyar tudósok krt. 2, Budapest, 1117, Hungary.
| |
Collapse
|
16
|
Wang Y, Michael S, Huang R, Zhao J, Recabo K, Bougie D, Shu Q, Shinn P, Sun H. Retro Drug Design: From Target Properties to Molecular Structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 34013260 PMCID: PMC8132216 DOI: 10.1101/2021.05.11.442656] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
To generate drug molecules of desired properties with computational methods is the holy grail in pharmaceutical research. Here we describe an AI strategy, retro drug design, or RDD, to generate novel small molecule drugs from scratch to meet predefined requirements, including but not limited to biological activity against a drug target, and optimal range of physicochemical and ADMET properties. Traditional predictive models were first trained over experimental data for the target properties, using an atom typing based molecular descriptor system, ATP. Monte Carlo sampling algorithm was then utilized to find the solutions in the ATP space defined by the target properties, and the deep learning model of Seq2Seq was employed to decode molecular structures from the solutions. To test feasibility of the algorithm, we challenged RDD to generate novel drugs that can activate μ opioid receptor (MOR) and penetrate blood brain barrier (BBB). Starting from vectors of random numbers, RDD generated 180,000 chemical structures, of which 78% were chemically valid. About 42,000 (31%) of the valid structures fell into the property space defined by MOR activity and BBB permeability. Out of the 42,000 structures, only 267 chemicals were commercially available, indicating a high extent of novelty of the AI-generated compounds. We purchased and assayed 96 compounds, and 25 of which were found to be MOR agonists. These compounds also have excellent BBB scores. The results presented in this paper illustrate that RDD has potential to revolutionize the current drug discovery process and create novel structures with multiple desired properties, including biological functions and ADMET properties. Availability of an AI-enabled fast track in drug discovery is essential to cope with emergent public health threat, such as pandemic of COVID-19.
Collapse
|
17
|
Siramshetty VB, Nguyen DT, Martinez NJ, Southall NT, Simeonov A, Zakharov AV. Critical Assessment of Artificial Intelligence Methods for Prediction of hERG Channel Inhibition in the "Big Data" Era. J Chem Inf Model 2020; 60:6007-6019. [PMID: 33259212 DOI: 10.1021/acs.jcim.0c00884] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The rise of novel artificial intelligence (AI) methods necessitates their benchmarking against classical machine learning for a typical drug-discovery project. Inhibition of the potassium ion channel, whose alpha subunit is encoded by the human ether-à-go-go-related gene (hERG), leads to a prolonged QT interval of the cardiac action potential and is a significant safety pharmacology target for the development of new medicines. Several computational approaches have been employed to develop prediction models for the assessment of hERG liabilities of small molecules including recent work using deep learning methods. Here, we perform a comprehensive comparison of hERG effect prediction models based on classical approaches (random forests and gradient boosting) and modern AI methods [deep neural networks (DNNs) and recurrent neural networks (RNNs)]. The training set (∼9000 compounds) was compiled by integrating the hERG bioactivity data from the ChEMBL database with experimental data generated from an in-house, high-throughput thallium flux assay. We utilized different molecular descriptors including the latent descriptors, which are real-value continuous vectors derived from chemical autoencoders trained on a large chemical space (>1.5 million compounds). The models were prospectively validated on ∼840 in-house compounds screened in the same thallium flux assay. The best results were obtained with the XGBoost method and RDKit descriptors. The comparison of models based only on latent descriptors revealed that the DNNs performed significantly better than the classical methods. The RNNs that operate on SMILES provided the highest model sensitivity. The best models were merged into a consensus model that offered superior performance compared to reference models from academic and commercial domains. Furthermore, we shed light on the potential of AI methods to exploit the big data in chemistry and generate novel chemical representations useful in predictive modeling and tailoring a new chemical space.
Collapse
Affiliation(s)
- Vishal B Siramshetty
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Dac-Trung Nguyen
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Natalia J Martinez
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Noel T Southall
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Anton Simeonov
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| | - Alexey V Zakharov
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Drive, Rockville, Maryland 20850, United States
| |
Collapse
|
18
|
Sun H, Wang Y, Cheff DM, Hall MD, Shen M. Predictive models for estimating cytotoxicity on the basis of chemical structures. Bioorg Med Chem 2020; 28:115422. [PMID: 32234277 DOI: 10.1016/j.bmc.2020.115422] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 02/28/2020] [Accepted: 03/02/2020] [Indexed: 12/19/2022]
Abstract
Cytotoxicity is a critical property in determining the fate of a small molecule in the drug discovery pipeline. Cytotoxic compounds are identified and triaged in both target-based and cell-based phenotypic approaches due to their off-target toxicity or on-target and on-mechanism toxicity for oncology and neurodegenerative targets. It is critical that chemical-induced cytotoxicity be reliably predicted before drug candidates advance to the late stage of development, or more ideally, before compounds are synthesized. In this study, we assessed the cell-based cytotoxicity of nearly 10,000 compounds in NCATS annotated libraries against four 'normal' cell lines (HEK 293, NIH 3T3, CRL-7250 and HaCat) using CellTiter-Glo (CTG) technology and constructed highly predictive models to estimate cytotoxicity from chemical structures. There are 5,241 non-redundant compounds having unambiguous activities in the four different cell lines, among which 11.8% compounds exhibited cytotoxicity in two or more cell lines and are thus labelled cytotoxic. The support vector classification (SVC) models trained with 80% randomly selected molecules achieved the area under the receiver operating characteristic curve (AUC-ROC) of 0.88 on average for the remaining 20% compounds in the test sets in 10 repeating experiments. Application of under-sampling rebalancing method further improved the averaged AUC-ROC to 0.90. Analysis of structural features shared by cytotoxic compounds may offer medicinal chemists heuristic design ideas to eliminate undesirable cytotoxicity. The profiling of cytotoxicity of drug-like molecules with annotated primary mechanism of action (MOA) will inform on the roles played by different targets or pathways in cellular viability. The predictive models for cytotoxicity (accessible at https://tripod.nih.gov/web_adme/cytotox.html) provide the scientific community a fast yet reliable way to prioritize molecules with little or no cytotoxicity for downstream development.
Collapse
Affiliation(s)
- Hongmao Sun
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, United States.
| | - Yuhong Wang
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, United States
| | - Dorian M Cheff
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, United States
| | - Matthew D Hall
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, United States
| | - Min Shen
- National Center for Advancing Translational Sciences (NCATS), 9800 Medical Center Dr., Rockville, MD 20850, United States.
| |
Collapse
|
19
|
Wang Y, Huang L, Jiang S, Wang Y, Zou J, Fu H, Yang S. Capsule Networks Showed Excellent Performance in the Classification of hERG Blockers/Nonblockers. Front Pharmacol 2020; 10:1631. [PMID: 32063849 PMCID: PMC6997788 DOI: 10.3389/fphar.2019.01631] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 12/13/2019] [Indexed: 02/05/2023] Open
Abstract
Capsule networks (CapsNets), a new class of deep neural network architectures proposed recently by Hinton et al., have shown a great performance in many fields, particularly in image recognition and natural language processing. However, CapsNets have not yet been applied to drug discovery-related studies. As the first attempt, we in this investigation adopted CapsNets to develop classification models of hERG blockers/nonblockers; drugs with hERG blockade activity are thought to have a potential risk of cardiotoxicity. Two capsule network architectures were established: convolution-capsule network (Conv-CapsNet) and restricted Boltzmann machine-capsule networks (RBM-CapsNet), in which convolution and a restricted Boltzmann machine (RBM) were used as feature extractors, respectively. Two prediction models of hERG blockers/nonblockers were then developed by Conv-CapsNet and RBM-CapsNet with the Doddareddy's training set composed of 2,389 compounds. The established models showed excellent performance in an independent test set comprising 255 compounds, with prediction accuracies of 91.8 and 92.2% for Conv-CapsNet and RBM-CapsNet models, respectively. Various comparisons were also made between our models and those developed by other machine learning methods including deep belief network (DBN), convolutional neural network (CNN), multilayer perceptron (MLP), support vector machine (SVM), k-nearest neighbors (kNN), logistic regression (LR), and LightGBM, and with different training sets. All the results showed that the models by Conv-CapsNet and RBM-CapsNet are among the best classification models. Overall, the excellent performance of capsule networks achieved in this investigation highlights their potential in drug discovery-related studies.
Collapse
Affiliation(s)
- Yiwei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
- College of Preclinical Medicine, Southwest Medical University, Luzhou, China
| | - Lei Huang
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
- Basic Teaching Department, Sichuan College of Architectural Technology, Deyang, China
| | - Siwen Jiang
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Yifei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Jun Zou
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| | - Hongguang Fu
- School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, China
| | - Shengyong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
20
|
|
21
|
Molecular Docking Guided Grid-Independent Descriptor Analysis to Probe the Impact of Water Molecules on Conformational Changes of hERG Inhibitors in Drug Trapping Phenomenon. Int J Mol Sci 2019; 20:ijms20143385. [PMID: 31295848 PMCID: PMC6678931 DOI: 10.3390/ijms20143385] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Revised: 07/04/2019] [Accepted: 07/07/2019] [Indexed: 12/17/2022] Open
Abstract
Human ether a-go-go related gene (hERG) or KV11.1 potassium channels mediate the rapid delayed rectifier current (IKr) in cardiac myocytes. Drug-induced inhibition of hERG channels has been implicated in the development of acquired long QT syndrome type (aLQTS) and fatal arrhythmias. Several marketed drugs have been withdrawn for this reason. Therefore, there is considerable interest in developing better tests for predicting drugs which can block the hERG channel. The drug-binding pocket in hERG channels, which lies below the selectivity filter, normally contains K+ ions and water molecules. In this study, we test the hypothesis that these water molecules impact drug binding to hERG. We developed 3D QSAR models based on alignment independent descriptors (GRIND) using docked ligands in open and closed conformations of hERG in the presence (solvated) and absence (non-solvated) of water molecules. The ligand–protein interaction fingerprints (PLIF) scheme was used to summarize and compare the interactions. All models delineated similar 3D hERG binding features, however, small deviations of about ~0.4 Å were observed between important hotspots of molecular interaction fields (MIFs) between solvated and non-solvated hERG models. These small changes in conformations do not affect the performance and predictive power of the model to any significant extent. The model that exhibits the best statistical values was attained with a cryo_EM structure of the hERG channel in open state without water. This model also showed the best R2 of 0.58 and 0.51 for the internal and external validation test sets respectively. Our results suggest that the inclusion of water molecules during the docking process has little effect on conformations and this conformational change does not impact the predictive ability of the 3D QSAR models.
Collapse
|
22
|
Mayr F, Vieider C, Temml V, Stuppner H, Schuster D. Open-Access Activity Prediction Tools for Natural Products. Case Study: hERG Blockers. PROGRESS IN THE CHEMISTRY OF ORGANIC NATURAL PRODUCTS 2019; 110:177-238. [PMID: 31621014 DOI: 10.1007/978-3-030-14632-0_6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Interference with the hERG potassium ion channel may cause cardiac arrhythmia and can even lead to death. Over the last few decades, several drugs, already on the market, and many more investigational drugs in various development stages, have had to be discontinued because of their hERG-associated toxicity. To recognize potential hERG activity in the early stages of drug development, a wide array of computational tools, based on different principles, such as 3D QSAR, 2D and 3D similarity, and machine learning, have been developed and are reviewed in this chapter. The various available prediction tools Similarity Ensemble Approach, SuperPred, SwissTargetPrediction, HitPick, admetSAR, PASSonline, Pred-hERG, and VirtualToxLab™ were used to screen a dataset of known hERG synthetic and natural product actives and inactives to quantify and compare their predictive power. This contribution will allow the reader to evaluate the suitability of these computational methods for their own related projects. There is an unmet need for natural product-specific prediction tools in this field.
Collapse
Affiliation(s)
- Fabian Mayr
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Christian Vieider
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Veronika Temml
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
| | - Hermann Stuppner
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
| | - Daniela Schuster
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria.
- Department of Pharmaceutical and Medicinal Chemistry, Institute of Pharmacy, Paracelsus Medical University Salzburg, Salzburg, Austria.
| |
Collapse
|
23
|
Maltarollo VG, Kronenberger T, Espinoza GZ, Oliveira PR, Honorio KM. Advances with support vector machines for novel drug discovery. Expert Opin Drug Discov 2018; 14:23-33. [PMID: 30488731 DOI: 10.1080/17460441.2019.1549033] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
INTRODUCTION Novel drug discovery remains an enormous challenge, with various computer-aided drug design (CADD) approaches having been widely employed for this purpose. CADD, specifically the commonly used support vector machines (SVMs), can employ machine learning techniques. SVMs and their variations offer numerous drug discovery applications, which range from the classification of substances (as active or inactive) to the construction of regression models and the ranking/virtual screening of databased compounds. Areas covered: Herein, the authors consider some of the applications of SVMs in medicinal chemistry, illustrating their main advantages and disadvantages, as well as trends in their utilization, via the available published literature. The aim of this review is to provide an up-to-date review of the recent applications of SVMs in drug discovery as described by the literature, thereby highlighting their strengths, weaknesses, and future challenges. Expert opinion: Techniques based on SVMs are considered as powerful approaches in early drug discovery. The ability of SVMs to classify active or inactive compounds has enabled the prioritization of substances for virtual screening. Indeed, one of the main advantages of SVMs is related to their potential in the analysis of nonlinear problems. However, despite successes in employing SVMs, the challenges of improving accuracy remain.
Collapse
Affiliation(s)
- Vinicius Gonçalves Maltarollo
- a Departamento de Produtos Farmacêuticos, Faculdade de Farmácia , Universidade Federal de Minas Gerais , Belo Horizonte , Brazil
| | - Thales Kronenberger
- b Department of Internal Medicine VIII , University Hospital of Tübingen , Tübingen , Germany
| | - Gabriel Zarzana Espinoza
- c Escola de Artes, Ciências e Humanidades , Universidade de São Paulo (USP) , São Paulo , Brazil
| | - Patricia Rufino Oliveira
- c Escola de Artes, Ciências e Humanidades , Universidade de São Paulo (USP) , São Paulo , Brazil
| | - Kathia Maria Honorio
- c Escola de Artes, Ciências e Humanidades , Universidade de São Paulo (USP) , São Paulo , Brazil.,d Centro de Ciências Naturais e Humanas , Universidade Federal do ABC , Santo André , Brazil
| |
Collapse
|
24
|
Munawar S, Windley MJ, Tse EG, Todd MH, Hill AP, Vandenberg JI, Jabeen I. Experimentally Validated Pharmacoinformatics Approach to Predict hERG Inhibition Potential of New Chemical Entities. Front Pharmacol 2018; 9:1035. [PMID: 30333745 PMCID: PMC6176658 DOI: 10.3389/fphar.2018.01035] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 08/27/2018] [Indexed: 12/17/2022] Open
Abstract
The hERG (human ether-a-go-go-related gene) encoded potassium ion (K+) channel plays a major role in cardiac repolarization. Drug-induced blockade of hERG has been a major cause of potentially lethal ventricular tachycardia termed Torsades de Pointes (TdPs). Therefore, we presented a pharmacoinformatics strategy using combined ligand and structure based models for the prediction of hERG inhibition potential (IC50) of new chemical entities (NCEs) during early stages of drug design and development. Integrated GRid-INdependent Descriptor (GRIND) models, and lipophilic efficiency (LipE), ligand efficiency (LE) guided template selection for the structure based pharmacophore models have been used for virtual screening and subsequent hERG activity (pIC50) prediction of identified hits. Finally selected two hits were experimentally evaluated for hERG inhibition potential (pIC50) using whole cell patch clamp assay. Overall, our results demonstrate a difference of less than ±1.6 log unit between experimentally determined and predicted hERG inhibition potential (IC50) of the selected hits. This revealed predictive ability and robustness of our models and could help in correctly rank the potency order (lower μM to higher nM range) against hERG.
Collapse
Affiliation(s)
- Saba Munawar
- Research Center for Modeling and Simulation, National University of Science and Technology, Islamabad, Pakistan.,Victor Chang Cardiac Research Institute, Sydney, NSW, Australia
| | | | - Edwin G Tse
- School of Chemistry, The University of Sydney, Sydney, NSW, Australia
| | - Matthew H Todd
- School of Chemistry, The University of Sydney, Sydney, NSW, Australia
| | - Adam P Hill
- Victor Chang Cardiac Research Institute, Sydney, NSW, Australia
| | | | - Ishrat Jabeen
- Research Center for Modeling and Simulation, National University of Science and Technology, Islamabad, Pakistan
| |
Collapse
|
25
|
Sato T, Yuki H, Ogura K, Honma T. Construction of an integrated database for hERG blocking small molecules. PLoS One 2018; 13:e0199348. [PMID: 29979714 PMCID: PMC6034787 DOI: 10.1371/journal.pone.0199348] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Accepted: 06/06/2018] [Indexed: 11/19/2022] Open
Abstract
The inhibition of the hERG potassium channel is closely related to the prolonged QT interval, and thus assessing this risk could greatly facilitate the development of therapeutic compounds and the withdrawal of hazardous marketed drugs. The recent increase in SAR information about hERG inhibitors in public databases has led to many successful applications of machine learning techniques to predict hERG inhibition. However, most of these reports constructed their prediction models based on only one SAR database because the differences in the data format and ontology hindered the integration of the databases. In this study, we curated the hERG-related data in ChEMBL, PubChem, GOSTAR, and hERGCentral, and integrated them into the largest database about hERG inhibition by small molecules. Assessment of structural diversity using Murcko frameworks revealed that the integrated database contains more than twice as many chemical scaffolds for hERG inhibitors than any of the individual databases, and covers 18.2% of the Murcko framework-based chemical space occupied by the compounds in ChEMBL. The database provides the most comprehensive information about hERG inhibitors and will be useful to design safer compounds for drug discovery. The database is freely available at http://drugdesign.riken.jp/hERGdb/.
Collapse
Affiliation(s)
- Tomohiro Sato
- Center for Life Science Technologies, RIKEN, Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, Japan
| | - Hitomi Yuki
- Center for Life Science Technologies, RIKEN, Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, Japan
| | - Keiji Ogura
- Center for Life Science Technologies, RIKEN, Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, Japan
| | - Teruki Honma
- Center for Life Science Technologies, RIKEN, Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa, Japan
| |
Collapse
|
26
|
Siramshetty VB, Chen Q, Devarakonda P, Preissner R. The Catch-22 of Predicting hERG Blockade Using Publicly Accessible Bioactivity Data. J Chem Inf Model 2018; 58:1224-1233. [PMID: 29772901 DOI: 10.1021/acs.jcim.8b00150] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Drug-induced inhibition of the human ether-à-go-go-related gene (hERG)-encoded potassium ion channels can lead to fatal cardiotoxicity. Several marketed drugs and promising drug candidates were recalled because of this concern. Diverse modeling methods ranging from molecular similarity assessment to quantitative structure-activity relationship analysis employing machine learning techniques have been applied to data sets of varying size and composition (number of blockers and nonblockers). In this study, we highlight the challenges involved in the development of a robust classifier for predicting the hERG end point using bioactivity data extracted from the public domain. To this end, three different modeling methods, nearest neighbors, random forests, and support vector machines, were employed to develop predictive models using different molecular descriptors, activity thresholds, and training set compositions. Our models demonstrated superior performance in external validations in comparison with those reported in the previous studies from which the data sets were extracted. The choice of descriptors had little influence on the model performance, with minor exceptions. The criteria used to filter bioactivity data, the activity threshold settings used to separate blockers from nonblockers, and the structural diversity of blockers in training data set were found to be the crucial indicators of model performance. Training sets based on a binary threshold of 1 μM/10 μM to separate blockers (IC50/ Ki ≤ 1 μM) from nonblockers (IC50/ Ki > 10 μM) provided superior performance in comparison with those defined using a single threshold (1 μM or 10 μM). A major limitation in using the public domain hERG activity data is the abundance of blockers in comparison with nonblockers at usual activity thresholds, since not many studies report the latter.
Collapse
Affiliation(s)
- Vishal B Siramshetty
- Structural Bioinformatics Group , Charité - University Medicine Berlin , 10115 Berlin , Germany.,BB3R - Berlin Brandenburg 3R Graduate School , Freie Universität Berlin , 14195 Berlin , Germany
| | - Qiaofeng Chen
- Structural Bioinformatics Group , Charité - University Medicine Berlin , 10115 Berlin , Germany.,China Scholarship Council (CSC) , Beijing 100044 , China
| | - Prashanth Devarakonda
- Structural Bioinformatics Group , Charité - University Medicine Berlin , 10115 Berlin , Germany
| | - Robert Preissner
- Structural Bioinformatics Group , Charité - University Medicine Berlin , 10115 Berlin , Germany.,BB3R - Berlin Brandenburg 3R Graduate School , Freie Universität Berlin , 14195 Berlin , Germany
| |
Collapse
|