1
|
Chen Z, Li N, Zhang P, Li Y, Li X. CardioDPi: An explainable deep-learning model for identifying cardiotoxic chemicals targeting hERG, Cav1.2, and Nav1.5 channels. JOURNAL OF HAZARDOUS MATERIALS 2024; 474:134724. [PMID: 38805819 DOI: 10.1016/j.jhazmat.2024.134724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 05/08/2024] [Accepted: 05/22/2024] [Indexed: 05/30/2024]
Abstract
The cardiotoxic effects of various pollutants have been a growing concern in environmental and material science. These effects encompass arrhythmias, myocardial injury, cardiac insufficiency, and pericardial inflammation. Compounds such as organic solvents and air pollutants disrupt the potassium, sodium, and calcium ion channels cardiac cell membranes, leading to the dysregulation of cardiac function. However, current cardiotoxicity models have disadvantages of incomplete data, ion channels, interpretability issues, and inability of toxic structure visualization. Herein, an interpretable deep-learning model known as CardioDPi was developed, which is capable of discriminating cardiotoxicity induced by the human Ether-à-go-go-related gene (hERG) channel, sodium channel (Na_v1.5), and calcium channel (Ca_v1.5) blockade. External validation yielded promising area under the ROC curve (AUC) values of 0.89, 0.89, and 0.94 for the hERG, Na_v1.5, and Ca_v1.5 channels, respectively. The CardioDPi can be freely accessed on the web server CardioDPipredictor (http://cardiodpi.sapredictor.cn/). Furthermore, the structural characteristics of cardiotoxic compounds were analyzed and structural alerts (SAs) can be extracted using the user-friendly CardioDPi-SAdetector web service (http://cardiosa.sapredictor.cn/). CardioDPi is a valuable tool for identifying cardiotoxic chemicals that are environmental and health risks. Moreover, the SA system provides essential insights for mode-of-action studies concerning cardiotoxic compounds.
Collapse
Affiliation(s)
- Zhaoyang Chen
- Department of Clinical Pharmacy, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Shandong Engineering and Technology Research Center for Pediatric Drug Development, Shandong Medicine and Health Key Laboratory of Clinical Pharmacy, Jinan 250014, China
| | - Na Li
- Department of Clinical Pharmacy, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Shandong Engineering and Technology Research Center for Pediatric Drug Development, Shandong Medicine and Health Key Laboratory of Clinical Pharmacy, Jinan 250014, China
| | - Pei Zhang
- Department of Clinical Pharmacy, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Shandong Engineering and Technology Research Center for Pediatric Drug Development, Shandong Medicine and Health Key Laboratory of Clinical Pharmacy, Jinan 250014, China
| | - Yan Li
- Department of Clinical Pharmacy, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Shandong Engineering and Technology Research Center for Pediatric Drug Development, Shandong Medicine and Health Key Laboratory of Clinical Pharmacy, Jinan 250014, China
| | - Xiao Li
- Department of Clinical Pharmacy, The First Affiliated Hospital of Shandong First Medical University & Shandong Provincial Qianfoshan Hospital, Shandong Engineering and Technology Research Center for Pediatric Drug Development, Shandong Medicine and Health Key Laboratory of Clinical Pharmacy, Jinan 250014, China.
| |
Collapse
|
2
|
Wang T, Du Z, Zhuo L, Fu X, Zou Q, Yao X. MultiCBlo: Enhancing predictions of compound-induced inhibition of cardiac ion channels with advanced multimodal learning. Int J Biol Macromol 2024:133825. [PMID: 39002900 DOI: 10.1016/j.ijbiomac.2024.133825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Revised: 07/09/2024] [Accepted: 07/10/2024] [Indexed: 07/15/2024]
Abstract
Predicting compound-induced inhibition of cardiac ion channels is crucial and challenging, significantly impacting cardiac drug efficacy and safety assessments. Despite the development of various computational methods for compound-induced inhibition prediction in cardiac ion channels, their performance remains limited. Most methods struggle to fuse multi-source data, relying solely on specific dataset training, leading to poor accuracy and generalization. We introduce MultiCBlo, a model that fuses multimodal information through a progressive learning approach, designed to predict compound-induced inhibition of cardiac ion channels with high accuracy. MultiCBlo employs progressive multimodal information fusion technology to integrate the compound's SMILES sequence, graph structure, and fingerprint, enhancing its representation. This is the first application of progressive multimodal learning for predicting compound-induced inhibition of cardiac ion channels, to our knowledge. The objective of this study was to predict the compound-induced inhibition of three major cardiac ion channels: hERG, Cav1.2, and Nav1.5. The results indicate that MultiCBlo significantly outperforms current models in predicting compound-induced inhibition of cardiac ion channels. We hope that MultiCBlo will facilitate cardiac drug development and reduce compound toxicity risks. Code and data are accessible at: https://github.com/taowang11/MultiCBlo. The online prediction platform is freely accessible at: https://huggingface.co/spaces/wtttt/PCICB.
Collapse
Affiliation(s)
- Tao Wang
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325027 Wenzhou, China
| | - Zhenya Du
- Guangzhou Xinhua University, 510520 Guangzhou, China
| | - Linlin Zhuo
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, 325027 Wenzhou, China.
| | - Xiangzheng Fu
- College of Computer Science and Electronic Engineering, Hunan University, 410012 Changsha, China.
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, 611730 Chengdu, China
| | - Xiaojun Yao
- Faculty of Applied Sciences, Macao Polytechnic University, 999078 Macao, China.
| |
Collapse
|
3
|
Liu J, Khan MKH, Guo W, Dong F, Ge W, Zhang C, Gong P, Patterson TA, Hong H. Machine learning and deep learning approaches for enhanced prediction of hERG blockade: a comprehensive QSAR modeling study. Expert Opin Drug Metab Toxicol 2024:1-20. [PMID: 38968091 DOI: 10.1080/17425255.2024.2377593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 06/26/2024] [Indexed: 07/07/2024]
Abstract
BACKGROUND Cardiotoxicity is a major cause of drug withdrawal. The hERG channel, regulating ion flow, is pivotal for heart and nervous system function. Its blockade is a concern in drug development. Predicting hERG blockade is essential for identifying cardiac safety issues. Various QSAR models exist, but their performance varies. Ongoing improvements show promise, necessitating continued efforts to enhance accuracy using emerging deep learning algorithms in predicting potential hERG blockade. STUDY DESIGN AND METHOD Using a large training dataset, six individual QSAR models were developed. Additionally, three ensemble models were constructed. All models were evaluated using 10-fold cross-validations and two external datasets. RESULTS The 10-fold cross-validations resulted in Mathews correlation coefficient (MCC) values from 0.682 to 0.730, surpassing the best-reported model on the same dataset (0.689). External validations yielded MCC values from 0.520 to 0.715 for the first dataset, exceeding those of previously reported models (0-0.599). For the second dataset, MCC values fell between 0.025 and 0.215, aligning with those of reported models (0.112-0.220). CONCLUSIONS The developed models can assist the pharmaceutical industry and regulatory agencies in predicting hERG blockage activity, thereby enhancing safety assessments and reducing the risk of adverse cardiac events associated with new drug candidates.
Collapse
Affiliation(s)
- Jie Liu
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Wenjing Guo
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Fan Dong
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Weigong Ge
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Chaoyang Zhang
- School of Computing Sciences and Computer Engineering, University of Southern Mississippi, Hattiesburg, MS, USA
| | - Ping Gong
- Environmental Laboratory, US Army Engineer Research and Development Center, Vicksburg, MS, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| | - Huixiao Hong
- National Center for Toxicological Research, US Food & Drug Administration, Jefferson, AR, USA
| |
Collapse
|
4
|
Amorim AM, Piochi LF, Gaspar AT, Preto A, Rosário-Ferreira N, Moreira IS. Advancing Drug Safety in Drug Development: Bridging Computational Predictions for Enhanced Toxicity Prediction. Chem Res Toxicol 2024; 37:827-849. [PMID: 38758610 PMCID: PMC11187637 DOI: 10.1021/acs.chemrestox.3c00352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 04/29/2024] [Accepted: 05/07/2024] [Indexed: 05/19/2024]
Abstract
The attrition rate of drugs in clinical trials is generally quite high, with estimates suggesting that approximately 90% of drugs fail to make it through the process. The identification of unexpected toxicity issues during preclinical stages is a significant factor contributing to this high rate of failure. These issues can have a major impact on the success of a drug and must be carefully considered throughout the development process. These late-stage rejections or withdrawals of drug candidates significantly increase the costs associated with drug development, particularly when toxicity is detected during clinical trials or after market release. Understanding drug-biological target interactions is essential for evaluating compound toxicity and safety, as well as predicting therapeutic effects and potential off-target effects that could lead to toxicity. This will enable scientists to predict and assess the safety profiles of drug candidates more accurately. Evaluation of toxicity and safety is a critical aspect of drug development, and biomolecules, particularly proteins, play vital roles in complex biological networks and often serve as targets for various chemicals. Therefore, a better understanding of these interactions is crucial for the advancement of drug development. The development of computational methods for evaluating protein-ligand interactions and predicting toxicity is emerging as a promising approach that adheres to the 3Rs principles (replace, reduce, and refine) and has garnered significant attention in recent years. In this review, we present a thorough examination of the latest breakthroughs in drug toxicity prediction, highlighting the significance of drug-target binding affinity in anticipating and mitigating possible adverse effects. In doing so, we aim to contribute to the development of more effective and secure drugs.
Collapse
Affiliation(s)
- Ana M.
B. Amorim
- Department
of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- PhD
Programme in Biosciences, Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- PURR.AI,
Rua Pedro Nunes, IPN Incubadora, Ed C, 3030-199 Coimbra, Portugal
| | - Luiz F. Piochi
- Department
of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - Ana T. Gaspar
- Department
of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - António
J. Preto
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- PhD Programme
in Experimental Biology and Biomedicine, Institute for Interdisciplinary
Research (IIIUC), University of Coimbra, Casa Costa Alemão, 3030-789 Coimbra, Portugal
| | - Nícia Rosário-Ferreira
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| | - Irina S. Moreira
- Department
of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CNC-UC—Center
for Neuroscience and Cell Biology, University
of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
- CIBB—Centre
for Innovative Biomedicine and Biotechnology, University of Coimbra, Calçada Martim de Freitas, 3000-456 Coimbra, Portugal
| |
Collapse
|
5
|
Sanches IH, Braga RC, Alves VM, Andrade CH. Enhancing hERG Risk Assessment with Interpretable Classificatory and Regression Models. Chem Res Toxicol 2024; 37:910-922. [PMID: 38781421 PMCID: PMC11187631 DOI: 10.1021/acs.chemrestox.3c00400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 04/22/2024] [Accepted: 05/14/2024] [Indexed: 05/25/2024]
Abstract
The human Ether-à-go-go-Related Gene (hERG) is a transmembrane protein that regulates cardiac action potential, and its inhibition can induce a potentially deadly cardiac syndrome. In vitro tests help identify hERG blockers at early stages; however, the high cost motivates searching for alternative, cost-effective methods. The primary goal of this study was to enhance the Pred-hERG tool for predicting hERG blockage. To achieve this, we developed new QSAR models that incorporated additional data, updated existing classificatory and multiclassificatory models, and introduced new regression models. Notably, we integrated SHAP (SHapley Additive exPlanations) values to offer a visual interpretation of these models. Utilizing the latest data from ChEMBL v30, encompassing over 14,364 compounds with hERG data, our binary and multiclassification models outperformed both the previous iteration of Pred-hERG and all publicly available models. Notably, the new version of our tool introduces a regression model for predicting hERG activity (pIC50). The optimal model demonstrated an R2 of 0.61 and an RMSE of 0.48, surpassing the only available regression model in the literature. Pred-hERG 5.0 now offers users a swift, reliable, and user-friendly platform for the early assessment of chemically induced cardiotoxicity through hERG blockage. The tool provides versatile outcomes, including (i) classificatory predictions of hERG blockage with prediction reliability, (ii) multiclassificatory predictions of hERG blockage with reliability, (iii) regression predictions with estimated pIC50 values, and (iv) probability maps illustrating the contribution of chemical fragments for each prediction. Furthermore, we implemented explainable AI analysis (XAI) to visualize SHAP values, providing insights into the contribution of each feature to binary classification predictions. A consensus prediction calculated based on the predictions of the three developed models is also present to assist the user's decision-making process. Pred-hERG 5.0 has been designed to be user-friendly, making it accessible to users without computational or programming expertise. The tool is freely available at http://predherg.labmol.com.br.
Collapse
Affiliation(s)
- Igor H. Sanches
- Laboratory
for Molecular Modeling and Drug Design (LabMol), Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for the Research and Advancement in Fragments and Molecular Targets
(CRAFT), School of Pharmaceutical Sciences at Ribeirao Preto, University of São Paulo, Ribeirão Preto, SP 05508-220, Brazil
| | | | - Vinicius M. Alves
- University
of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Carolina Horta Andrade
- Laboratory
for Molecular Modeling and Drug Design (LabMol), Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for the Research and Advancement in Fragments and Molecular Targets
(CRAFT), School of Pharmaceutical Sciences at Ribeirao Preto, University of São Paulo, Ribeirão Preto, SP 05508-220, Brazil
| |
Collapse
|
6
|
Arab I, Egghe K, Laukens K, Chen K, Barakat K, Bittremieux W. Benchmarking of Small Molecule Feature Representations for hERG, Nav1.5, and Cav1.2 Cardiotoxicity Prediction. J Chem Inf Model 2024; 64:2515-2527. [PMID: 37870574 DOI: 10.1021/acs.jcim.3c01301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2023]
Abstract
In the field of drug discovery, there is a substantial challenge in seeking out chemical structures that possess desirable pharmacological, toxicological, and pharmacokinetic properties. Complications arise when drugs interfere with the functioning of cardiac ion channels, leading to serious cardiovascular consequences. The discontinuation and removal of numerous approved drugs from the market or at late development stages in the pipeline due to such inhibitory effects further highlight the urgency of addressing this issue. Consequently, the early prediction of potential blockers targeting cardiac ion channels during the drug discovery process is of paramount importance. This study introduces a deep learning framework that computationally determines the cardiotoxicity associated with the voltage-gated potassium channel (hERG), the voltage-gated calcium channel (Cav1.2), and the voltage-gated sodium channel (Nav1.5) for drug candidates. The predictive capabilities of three feature representations─molecular fingerprints, descriptors, and graph-based numerical representations─are rigorously benchmarked. Additionally, a novel training and evaluation data set framework is presented, enabling predictive model training of drug off-target cardiotoxicity using a comprehensive and large curated data set covering these three cardiac ion channels. To facilitate these predictions, a robust and comprehensive small molecule cardiotoxicity prediction tool named CToxPred has been developed. It is made available as open source under the permissive MIT license at https://github.com/issararab/CToxPred.
Collapse
Affiliation(s)
- Issar Arab
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| | - Kristof Egghe
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
| | - Kris Laukens
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| | - Ke Chen
- Chair for Theoretical Chemistry, Catalysis Research Center, Technische Universität München, Lichtenbergstraße 4, D-85747 Garching, Germany
| | - Khaled Barakat
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, Alberta 8613, Canada
| | - Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium
- Biomedical Informatics Network Antwerpen (Biomina), 2020 Antwerp, Belgium
| |
Collapse
|
7
|
Vinh T, Nguyen L, Trinh QH, Nguyen-Vo TH, Nguyen BP. Predicting Cardiotoxicity of Molecules Using Attention-Based Graph Neural Networks. J Chem Inf Model 2024; 64:1816-1827. [PMID: 38438914 DOI: 10.1021/acs.jcim.3c01286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2024]
Abstract
In drug discovery, the search for new and effective medications is often hindered by concerns about toxicity. Numerous promising molecules fail to pass the later phases of drug development due to strict toxicity assessments. This challenge significantly increases the cost, time, and human effort needed to discover new therapeutic molecules. Additionally, a considerable number of drugs already on the market have been withdrawn or re-evaluated because of their unwanted side effects. Among the various types of toxicity, drug-induced heart damage is a severe adverse effect commonly associated with several medications, especially those used in cancer treatments. Although a number of computational approaches have been proposed to identify the cardiotoxicity of molecules, the performance and interpretability of the existing approaches are limited. In our study, we proposed a more effective computational framework to predict the cardiotoxicity of molecules using an attention-based graph neural network. Experimental results indicated that the proposed framework outperformed the other methods. The stability of the model was also confirmed by our experiments. To assist researchers in evaluating the cardiotoxicity of molecules, we have developed an easy-to-use online web server that incorporates our model.
Collapse
Affiliation(s)
- Tuan Vinh
- Department of Chemistry, Emory University, 201 Dowman Drive, Atlanta, Georgia 30322-1007, United States
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
- School of Innovation, Design and Technology, Wellington Institute of Technology, 21 Kensington Avenue, Lower Hutt 5012, New Zealand
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Kelburn Parade, Wellington 6012, New Zealand
| |
Collapse
|
8
|
Ma M, Lei X. A deep learning framework for predicting molecular property based on multi-type features fusion. Comput Biol Med 2024; 169:107911. [PMID: 38160501 DOI: 10.1016/j.compbiomed.2023.107911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 12/18/2023] [Accepted: 12/24/2023] [Indexed: 01/03/2024]
Abstract
Extracting expressive molecular features is essential for molecular property prediction. Sequence-based representation is a common representation of molecules, which ignores the structure information of molecules. While molecular graph representation has a weak ability in expressing the 3D structure. In this article, we try to make use of the advantages of different type representations simultaneously for molecular property prediction. Thus, we propose a fusion model named DLF-MFF, which integrates the multi-type molecular features. Specifically, we first extract four different types of features from molecular fingerprints, 2D molecular graph, 3D molecular graph and molecular image. Then, in order to learn molecular features individually, we use four essential deep learning frameworks, which correspond to four distinct molecular representations. The final molecular representation is created by integrating the four feature vectors and feeding them into prediction layer to predict molecular property. We compare DLF-MFF with 7 state-of-the-art methods on 6 benchmark datasets consisting of multiple molecular properties, the experimental results show that DLF-MFF achieves state-of-the-art performance on 6 benchmark datasets. Moreover, DLF-MFF is applied to identify potential anti-SARS-CoV-2 inhibitor from 2500 drugs. We predict probability of each drug being inferred as a 3CL protease inhibitor and also calculate the binding affinity scores between each drug and 3CL protease. The results show that DLF-MFF product better performance in the identification of anti-SARS-CoV-2 inhibitor. This work is expected to offer novel research perspectives for accurate prediction of molecular properties and provide valuable insights into drug repurposing for COVID-19.
Collapse
Affiliation(s)
- Mei Ma
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China; School of Mathematics and Statistics, Qinghai Normal University, Qinghai, 810000, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xi'an, 710119, China.
| |
Collapse
|
9
|
Wang D, Jin J, Li Z, Wang Y, Fan M, Liang S, Su R, Wei L. StructuralDPPIV: a novel deep learning model based on atom structure for predicting dipeptidyl peptidase-IV inhibitory peptides. Bioinformatics 2024; 40:btae057. [PMID: 38305458 PMCID: PMC10904144 DOI: 10.1093/bioinformatics/btae057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 12/07/2023] [Accepted: 01/30/2024] [Indexed: 02/03/2024] Open
Abstract
MOTIVATION Diabetes is a chronic metabolic disorder that has been a major cause of blindness, kidney failure, heart attacks, stroke, and lower limb amputation across the world. To alleviate the impact of diabetes, researchers have developed the next generation of anti-diabetic drugs, known as dipeptidyl peptidase IV inhibitory peptides (DPP-IV-IPs). However, the discovery of these promising drugs has been restricted due to the lack of effective peptide-mining tools. RESULTS Here, we presented StructuralDPPIV, a deep learning model designed for DPP-IV-IP identification, which takes advantage of both molecular graph features in amino acid and sequence information. Experimental results on the independent test dataset and two wet experiment datasets show that our model outperforms the other state-of-art methods. Moreover, to better study what StructuralDPPIV learns, we used CAM technology and perturbation experiment to analyze our model, which yielded interpretable insights into the reasoning behind prediction results. AVAILABILITY AND IMPLEMENTATION The project code is available at https://github.com/WeiLab-BioChem/Structural-DPP-IV.
Collapse
Affiliation(s)
- Ding Wang
- School of Software, Shandong University, Jinan 250101, China
| | - Junru Jin
- School of Software, Shandong University, Jinan 250101, China
| | - Zhongshen Li
- School of Software, Shandong University, Jinan 250101, China
| | - Yu Wang
- School of Software, Shandong University, Jinan 250101, China
| | - Mushuang Fan
- School of Software, Shandong University, Jinan 250101, China
| | - Sirui Liang
- School of Software, Shandong University, Jinan 250101, China
| | - Ran Su
- College of Intelligence and Computing, Tianjin University, Tianjin 300350, China
| | - Leyi Wei
- Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China
- Joint SDU-NTU Centre for Artificial Intelligence Research (C-FAIR), Shandong University, Jinan 250101, China
| |
Collapse
|
10
|
Viganò EL, Ballabio D, Roncaglioni A. Artificial Intelligence and Machine Learning Methods to Evaluate Cardiotoxicity following the Adverse Outcome Pathway Frameworks. TOXICS 2024; 12:87. [PMID: 38276722 PMCID: PMC10820364 DOI: 10.3390/toxics12010087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 01/15/2024] [Accepted: 01/17/2024] [Indexed: 01/27/2024]
Abstract
Cardiovascular disease is a leading global cause of mortality. The potential cardiotoxic effects of chemicals from different classes, such as environmental contaminants, pesticides, and drugs can significantly contribute to effects on health. The same chemical can induce cardiotoxicity in different ways, following various Adverse Outcome Pathways (AOPs). In addition, the potential synergistic effects between chemicals further complicate the issue. In silico methods have become essential for tackling the problem from different perspectives, reducing the need for traditional in vivo testing, and saving valuable resources in terms of time and money. Artificial intelligence (AI) and machine learning (ML) are among today's advanced approaches for evaluating chemical hazards. They can serve, for instance, as a first-tier component of Integrated Approaches to Testing and Assessment (IATA). This study employed ML and AI to assess interactions between chemicals and specific biological targets within the AOP networks for cardiotoxicity, starting with molecular initiating events (MIEs) and progressing through key events (KEs). We explored methods to encode chemical information in a suitable way for ML and AI. We started with commonly used approaches in Quantitative Structure-Activity Relationship (QSAR) methods, such as molecular descriptors and different types of fingerprint. We then increased the complexity of encoders, incorporating graph-based methods, auto-encoders, and character embeddings employed in neural language processing. We also developed a multimodal neural network architecture, capable of considering the complementary nature of different chemical representations simultaneously. The potential of this approach, compared to more conventional architectures designed to handle a single encoder, becomes apparent when the amount of data increases.
Collapse
Affiliation(s)
- Edoardo Luca Viganò
- Laboratory of Environmental Toxicology and Chemistry, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCSS, 20156 Milan, Italy;
| | - Davide Ballabio
- Milano Chemometrics and QSAR Research Group, Department of Earth and Environmental Sciences, University of Milano-Bicocca, 20126 Milan, Italy;
| | - Alessandra Roncaglioni
- Laboratory of Environmental Toxicology and Chemistry, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCSS, 20156 Milan, Italy;
| |
Collapse
|
11
|
Lv Q, Zhou F, Liu X, Zhi L. Artificial intelligence in small molecule drug discovery from 2018 to 2023: Does it really work? Bioorg Chem 2023; 141:106894. [PMID: 37776682 DOI: 10.1016/j.bioorg.2023.106894] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/24/2023] [Accepted: 09/25/2023] [Indexed: 10/02/2023]
Abstract
Utilizing artificial intelligence (AI) in drug design represents an advanced approach for identifying targets and developing new drugs. Integrating AI techniques significantly reduces the workload involved in drug development and enhances the efficiency of early-stage drug discovery. This review aims to present a comprehensive overview of the utilization of AI methods in the field of small drug design, with a specific focus on four key areas: protein structure prediction, molecular virtual screening, molecular design, and absorption, distribution, metabolism, excretion, and toxicity (ADMET) prediction. Additionally, the role and limitations of AI in drug development are explored, and the impact of AI on decision-making processes is studied. It is important to note that while AI can bring numerous benefits to the early stage of drug development, the direction and quality of decision-making should still be emphasized, as AI should be considered as a tool rather than a decisive factor.
Collapse
Affiliation(s)
- Qi Lv
- School of Pharmacy, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, PR China
| | - Feilong Zhou
- School of Pharmacy, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, PR China
| | - Xinhua Liu
- School of Pharmacy, Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, PR China.
| | - Liping Zhi
- School of Health Management, Anhui Medical University Hefei, 230032, PR China.
| |
Collapse
|
12
|
Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, Patterson TA, Hong H. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood) 2023; 248:1952-1973. [PMID: 38057999 PMCID: PMC10798180 DOI: 10.1177/15353702231209421] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
The ever-increasing number of chemicals has raised public concerns due to their adverse effects on human health and the environment. To protect public health and the environment, it is critical to assess the toxicity of these chemicals. Traditional in vitro and in vivo toxicity assays are complicated, costly, and time-consuming and may face ethical issues. These constraints raise the need for alternative methods for assessing the toxicity of chemicals. Recently, due to the advancement of machine learning algorithms and the increase in computational power, many toxicity prediction models have been developed using various machine learning and deep learning algorithms such as support vector machine, random forest, k-nearest neighbors, ensemble learning, and deep neural network. This review summarizes the machine learning- and deep learning-based toxicity prediction models developed in recent years. Support vector machine and random forest are the most popular machine learning algorithms, and hepatotoxicity, cardiotoxicity, and carcinogenicity are the frequently modeled toxicity endpoints in predictive toxicology. It is known that datasets impact model performance. The quality of datasets used in the development of toxicity prediction models using machine learning and deep learning is vital to the performance of the developed models. The different toxicity assignments for the same chemicals among different datasets of the same type of toxicity have been observed, indicating benchmarking datasets is needed for developing reliable toxicity prediction models using machine learning and deep learning algorithms. This review provides insights into current machine learning models in predictive toxicology, which are expected to promote the development and application of toxicity prediction models in the future.
Collapse
Affiliation(s)
- Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Meng Song
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zoe Li
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| |
Collapse
|
13
|
Kim T, Chung KC, Park H. Derivation of Highly Predictive 3D-QSAR Models for hERG Channel Blockers Based on the Quantum Artificial Neural Network Algorithm. Pharmaceuticals (Basel) 2023; 16:1509. [PMID: 38004375 PMCID: PMC10675541 DOI: 10.3390/ph16111509] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 10/14/2023] [Accepted: 10/20/2023] [Indexed: 11/26/2023] Open
Abstract
The hERG potassium channel serves as an annexed target for drug discovery because the associated off-target inhibitory activity may cause serious cardiotoxicity. Quantitative structure-activity relationship (QSAR) models were developed to predict inhibitory activities against the hERG potassium channel, utilizing the three-dimensional (3D) distribution of quantum mechanical electrostatic potential (ESP) as the molecular descriptor. To prepare the optimal atomic coordinates of dataset molecules, pairwise 3D structural alignments were carried out in order for the quantum mechanical cross correlation between the template and other molecules to be maximized. This alignment method stands out from the common atom-by-atom matching technique, as it can handle structurally diverse molecules as effectively as chemical derivatives that share an identical scaffold. The alignment problem prevalent in 3D-QSAR methods was ameliorated substantially by dividing the dataset molecules into seven subsets, each of which contained molecules with similar molecular weights. Using an artificial neural network algorithm to find the functional relationship between the quantum mechanical ESP descriptors and the experimental hERG inhibitory activities, highly predictive 3D-QSAR models were derived for all seven molecular subsets to the extent that the squared correlation coefficients exceeded 0.79. Given their simplicity in model development and strong predictability, the 3D-QSAR models developed in this study are expected to function as an effective virtual screening tool for assessing the potential cardiotoxicity of drug candidate molecules.
Collapse
Affiliation(s)
| | - Kee-Choo Chung
- Department of Bioscience and Biotechnology, Sejong University, 209 Neungdong-ro, Kwangjin-gu, Seoul 05006, Republic of Korea;
| | - Hwangseo Park
- Department of Bioscience and Biotechnology, Sejong University, 209 Neungdong-ro, Kwangjin-gu, Seoul 05006, Republic of Korea;
| |
Collapse
|
14
|
Chen Y, Yu X, Li W, Tang Y, Liu G. In silico prediction of hERG blockers using machine learning and deep learning approaches. J Appl Toxicol 2023; 43:1462-1475. [PMID: 37093028 DOI: 10.1002/jat.4477] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 04/04/2023] [Accepted: 04/19/2023] [Indexed: 04/25/2023]
Abstract
The human ether-à-go-go-related gene (hERG) is associated with drug cardiotoxicity. If the hERG channel is blocked, it will lead to prolonged QT interval and cause sudden death in severe cases. Therefore, it is important to evaluate the hERG-blocking property of compounds in early drug discovery. In this study, a dataset containing 4556 compounds with IC50 values determined by patch clamp techniques on mammalian lineage cells was collected, and hERG blockers and non-blockers were distinguished according to three single thresholds and two binary thresholds. Four machine learning (ML) algorithms combining four molecular fingerprints and molecular descriptors as well as graph convolutional neural networks (GCNs) were used to construct a series of binary classification models. The results showed that the best models varied for different thresholds. The ML models implemented by support vector machine and random forest performed well based on Morgan fingerprints and molecular descriptors, with AUCs ranging from 0.884 to 0.950. GCN showed superior prediction performance with AUCs above 0.952, which might be related to its direct extraction of molecular features from the original input. Meanwhile, the classification of binary threshold was better than that of single threshold, which could provide us with a more accurate prediction of hERG blockers. At last, the applicability domain for the model was defined, and seven structural alerts that might generate hERG blockage were identified by information gain and substructure frequency analysis. Our work would be beneficial for identifying hERG blockers in chemicals.
Collapse
Affiliation(s)
- Yuanting Chen
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Xinxin Yu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China
| |
Collapse
|
15
|
Liu X, Yang H, Ai C, Ding Y, Guo F, Tang J. MVML-MPI: Multi-View Multi-Label Learning for Metabolic Pathway Inference. Brief Bioinform 2023; 24:bbad393. [PMID: 37930024 DOI: 10.1093/bib/bbad393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 09/20/2023] [Accepted: 10/11/2023] [Indexed: 11/07/2023] Open
Abstract
Development of robust and effective strategies for synthesizing new compounds, drug targeting and constructing GEnome-scale Metabolic models (GEMs) requires a deep understanding of the underlying biological processes. A critical step in achieving this goal is accurately identifying the categories of pathways in which a compound participated. However, current machine learning-based methods often overlook the multifaceted nature of compounds, resulting in inaccurate pathway predictions. Therefore, we present a novel framework on Multi-View Multi-Label Learning for Metabolic Pathway Inference, hereby named MVML-MPI. First, MVML-MPI learns the distinct compound representations in parallel with corresponding compound encoders to fully extract features. Subsequently, we propose an attention-based mechanism that offers a fusion module to complement these multi-view representations. As a result, MVML-MPI accurately represents and effectively captures the complex relationship between compounds and metabolic pathways and distinguishes itself from current machine learning-based methods. In experiments conducted on the Kyoto Encyclopedia of Genes and Genomes pathways dataset, MVML-MPI outperformed state-of-the-art methods, demonstrating the superiority of MVML-MPI and its potential to utilize the field of metabolic pathway design, which can aid in optimizing drug-like compounds and facilitating the development of GEMs. The code and data underlying this article are freely available at https://github.com/guofei-tju/MVML-MPI. Contact: jtang@cse.sc.edu, guofei@csu.edu.com or wuxi_dyj@csj.uestc.edu.cn.
Collapse
Affiliation(s)
- Xiaoyi Liu
- Computer Science and Engineering, University of South Carolina, Columbia 29208, USA
| | - Hongpeng Yang
- Computer Science and Engineering, University of South Carolina, Columbia 29208, USA
| | - Chengwei Ai
- Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Yijie Ding
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324000, China
| | - Fei Guo
- Computer Science and Engineering, Central South University, Changsha 410083, China
| | - Jijun Tang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Nanshan 518055, China
| |
Collapse
|
16
|
Ylipää E, Chavan S, Bånkestad M, Broberg J, Glinghammar B, Norinder U, Cotgreave I. hERG-toxicity prediction using traditional machine learning and advanced deep learning techniques. Curr Res Toxicol 2023; 5:100121. [PMID: 37701072 PMCID: PMC10493507 DOI: 10.1016/j.crtox.2023.100121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 08/24/2023] [Accepted: 08/30/2023] [Indexed: 09/14/2023] Open
Abstract
The rise of artificial intelligence (AI) based algorithms has gained a lot of interest in the pharmaceutical development field. Our study demonstrates utilization of traditional machine learning techniques such as random forest (RF), support-vector machine (SVM), extreme gradient boosting (XGBoost), deep neural network (DNN) as well as advanced deep learning techniques like gated recurrent unit-based DNN (GRU-DNN) and graph neural network (GNN), towards predicting human ether-á-go-go related gene (hERG) derived toxicity. Using the largest hERG dataset derived to date, we have utilized 203,853 and 87,366 compounds for training and testing the models, respectively. The results show that GNN, SVM, XGBoost, DNN, RF, and GRU-DNN all performed well, with validation set AUC ROC scores equals 0.96, 0.95, 0.95, 0.94, 0.94 and 0.94, respectively. The GNN was found to be the top performing model based on predictive power and generalizability. The GNN technique is free of any feature engineering steps while having a minimal human intervention. The GNN approach may serve as a basis for comprehensive automation in predictive toxicology. We believe that the models presented here may serve as a promising tool, both for academic institutes as well as pharmaceutical industries, in predicting hERG-liability in new molecular structures.
Collapse
Affiliation(s)
- Erik Ylipää
- Computer Systems Unit, Research Institutes of Sweden RISE, Kista 164 40, Sweden
| | - Swapnil Chavan
- Unit of Chemical and Pharmaceutical Toxicology, Research Institutes of Sweden RISE, Södertalje 151 36, Sweden
| | - Maria Bånkestad
- Computer Systems Unit, Research Institutes of Sweden RISE, Kista 164 40, Sweden
| | - Johan Broberg
- Computer Systems Unit, Research Institutes of Sweden RISE, Kista 164 40, Sweden
| | - Björn Glinghammar
- Preclinical Development & Translational Medicine, Swedish Orphan Biovitrum AB, Solna 171 65, Sweden
| | - Ulf Norinder
- Department of Computer and Systems Sciences, Stockholm University, Kista 164 07, Sweden
| | - Ian Cotgreave
- Unit of Chemical and Pharmaceutical Toxicology, Research Institutes of Sweden RISE, Södertalje 151 36, Sweden
| |
Collapse
|
17
|
Seo YJ, Kim E, Oh IS, Hyun JY, Song JH, Lim HJ, Park SJ. Intramolecular cyclization of N-cyano sulfoximines by N-CN bond activation. RSC Adv 2023; 13:24445-24449. [PMID: 37583669 PMCID: PMC10424563 DOI: 10.1039/d3ra04208a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Accepted: 08/04/2023] [Indexed: 08/17/2023] Open
Abstract
Metal-free halogenated anhydrides promote the intramolecular cyclization of N-cyano sulfoximines. Trifluoro- or trichloroacetic anhydride (TFAA or TCAA, respectively) activate the N-cyano groups of N-cyano sulfoximines, leading to the intramolecular cyclization of 2-benzamide-N-cyano sulfoximines 1. This method results in excellent yields of thiadiazinone 1-oxides 2. A full intramolecular cyclization pattern was suggested by (i) labeling experiments with 13C, (ii) isolating of N-trifluoroacetyl sulfoximine 1ac, and (iii) confirming the generation of the intermediate 1ad by LC/MS analysis.
Collapse
Affiliation(s)
- Ye Ji Seo
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Pharmaceutical Chemistry, University of Science & Technology Daejeon 34113 Republic of Korea
| | - Eunsil Kim
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Department of Chemistry, Sogang University 35 Baekbeom-ro, Mapo-gu Seoul 04107 Republic of Korea
| | - In Seok Oh
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Department of Chemistry, Sogang University 35 Baekbeom-ro, Mapo-gu Seoul 04107 Republic of Korea
| | - Ji Young Hyun
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Pharmaceutical Chemistry, University of Science & Technology Daejeon 34113 Republic of Korea
| | - Ji Ho Song
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Pharmaceutical Chemistry, University of Science & Technology Daejeon 34113 Republic of Korea
| | - Hwan Jung Lim
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Pharmaceutical Chemistry, University of Science & Technology Daejeon 34113 Republic of Korea
| | - Seong Jun Park
- Department of Drug Discovery, Korea Research Institute of Chemical Technology (KRICT) 141 Gajeong-ro, Yuseong-gu Daejeon 34114 Republic of Korea +82 42 860 7160 +82 42 860 7175
- Pharmaceutical Chemistry, University of Science & Technology Daejeon 34113 Republic of Korea
| |
Collapse
|
18
|
Jang WD, Jang J, Song JS, Ahn S, Oh KS. PredPS: Attention-based graph neural network for predicting stability of compounds in human plasma. Comput Struct Biotechnol J 2023; 21:3532-3539. [PMID: 37484492 PMCID: PMC10362732 DOI: 10.1016/j.csbj.2023.07.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/02/2023] [Accepted: 07/05/2023] [Indexed: 07/25/2023] Open
Abstract
Stability of compounds in the human plasma is crucial for maintaining sufficient systemic drug exposure and considered an essential factor in the early stages of drug discovery and development. The rapid degradation of compounds in the plasma can result in poor in vivo efficacy. Currently, there are no open-source software programs for predicting human plasma stability. In this study, we developed an attention-based graph neural network, PredPS to predict the plasma stability of compounds in human plasma using in-house and open-source datasets. The PredPS outperformed the two machine learning and two deep learning algorithms that were used for comparison indicating its stability-predicting efficiency. PredPS achieved an area under the receiver operating characteristic curve of 90.1%, accuracy of 83.5%, sensitivity of 82.3%, and specificity of 84.6% when evaluated using 5-fold cross-validation. In the early stages of drug discovery, PredPS could be a helpful method for predicting the human plasma stability of compounds. Saving time and money can be accomplished by adopting an in silico-based plasma stability prediction model at the high-throughput screening stage. The source code for PredPS is available at https://bitbucket.org/krict-ai/predps and the PredPS web server is available at https://predps.netlify.app.
Collapse
Affiliation(s)
- Woo Dae Jang
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, Daejeon 34114, Republic of Korea
| | - Jidon Jang
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, Daejeon 34114, Republic of Korea
| | - Jin Sook Song
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, Daejeon 34114, Republic of Korea
| | - Sunjoo Ahn
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, Daejeon 34114, Republic of Korea
- Department of Medicinal and Pharmaceutical Chemistry, University of Science and Technology, Daejeon 34129, Republic of Korea
| | - Kwang-Seok Oh
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, Daejeon 34114, Republic of Korea
- Department of Medicinal and Pharmaceutical Chemistry, University of Science and Technology, Daejeon 34129, Republic of Korea
| |
Collapse
|
19
|
Wang H, Zhu G, Izu LT, Chen-Izu Y, Ono N, Altaf-Ul-Amin MD, Kanaya S, Huang M. On QSAR-based cardiotoxicity modeling with the expressiveness-enhanced graph learning model and dual-threshold scheme. Front Physiol 2023; 14:1156286. [PMID: 37228825 PMCID: PMC10203956 DOI: 10.3389/fphys.2023.1156286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 04/05/2023] [Indexed: 05/27/2023] Open
Abstract
Introduction: Given the direct association with malignant ventricular arrhythmias, cardiotoxicity is a major concern in drug design. In the past decades, computational models based on the quantitative structure-activity relationship have been proposed to screen out cardiotoxic compounds and have shown promising results. The combination of molecular fingerprint and the machine learning model shows stable performance for a wide spectrum of problems; however, not long after the advent of the graph neural network (GNN) deep learning model and its variant (e.g., graph transformer), it has become the principal way of quantitative structure-activity relationship-based modeling for its high flexibility in feature extraction and decision rule generation. Despite all these progresses, the expressiveness (the ability of a program to identify non-isomorphic graph structures) of the GNN model is bounded by the WL isomorphism test, and a suitable thresholding scheme that relates directly to the sensitivity and credibility of a model is still an open question. Methods: In this research, we further improved the expressiveness of the GNN model by introducing the substructure-aware bias by the graph subgraph transformer network model. Moreover, to propose the most appropriate thresholding scheme, a comprehensive comparison of the thresholding schemes was conducted. Results: Based on these improvements, the best model attains performance with 90.4% precision, 90.4% recall, and 90.5% F1-score with a dual-threshold scheme (active: < 1 μ M ; non-active: > 30 μ M ). The improved pipeline (graph subgraph transformer network model and thresholding scheme) also shows its advantages in terms of the activity cliff problem and model interpretability.
Collapse
Affiliation(s)
- Huijia Wang
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| | - Guangxian Zhu
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| | - Leighton T. Izu
- Department of Pharmacology, University of California, Davis, CA, United States
| | - Ye Chen-Izu
- Department of Biomedical Engineering, University of California, Davis, CA, United States
| | - Naoaki Ono
- Data Science Center, Nara Institute of Science and Technology, Ikoma, Japan
| | - MD Altaf-Ul-Amin
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| | - Shigehiko Kanaya
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| | - Ming Huang
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| |
Collapse
|
20
|
Wu Z, Wang J, Du H, Jiang D, Kang Y, Li D, Pan P, Deng Y, Cao D, Hsieh CY, Hou T. Chemistry-intuitive explanation of graph neural networks for molecular property prediction with substructure masking. Nat Commun 2023; 14:2585. [PMID: 37142585 PMCID: PMC10160109 DOI: 10.1038/s41467-023-38192-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Accepted: 04/12/2023] [Indexed: 05/06/2023] Open
Abstract
Graph neural networks (GNNs) have been widely used in molecular property prediction, but explaining their black-box predictions is still a challenge. Most existing explanation methods for GNNs in chemistry focus on attributing model predictions to individual nodes, edges or fragments that are not necessarily derived from a chemically meaningful segmentation of molecules. To address this challenge, we propose a method named substructure mask explanation (SME). SME is based on well-established molecular segmentation methods and provides an interpretation that aligns with the understanding of chemists. We apply SME to elucidate how GNNs learn to predict aqueous solubility, genotoxicity, cardiotoxicity and blood-brain barrier permeation for small molecules. SME provides interpretation that is consistent with the understanding of chemists, alerts them to unreliable performance, and guides them in structural optimization for target properties. Hence, we believe that SME empowers chemists to confidently mine structure-activity relationship (SAR) from reliable GNNs through a transparent inspection on how GNNs pick up useful signals when learning from data.
Collapse
Affiliation(s)
- Zhenxing Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018, Zhejiang, P.R. China
| | - Jike Wang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018, Zhejiang, P.R. China
- National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University, Wuhan, 430072, Hubei, P.R. China
| | - Hongyan Du
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018, Zhejiang, P.R. China
| | - Dejun Jiang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018, Zhejiang, P.R. China
| | - Yu Kang
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
| | - Dan Li
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
| | - Peichen Pan
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co., Ltd, Hangzhou, 310018, Zhejiang, P.R. China
| | - Dongsheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, Changsha, 410004, Hunan, P.R. China.
| | - Chang-Yu Hsieh
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China.
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, P.R. China.
| |
Collapse
|
21
|
AlRawashdeh S, Chandrasekaran S, Barakat KH. Structural analysis of hERG channel blockers and the implications for drug design. J Mol Graph Model 2023; 120:108405. [PMID: 36680816 DOI: 10.1016/j.jmgm.2023.108405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 12/26/2022] [Accepted: 01/09/2023] [Indexed: 01/13/2023]
Abstract
The repolarizing current (Ikr) produced by the hERG potassium channel forms a major component of the cardiac action potential and blocking this current by small molecule drugs can lead to life-threatening cardiotoxicity. Understanding the mechanisms of drug-mediated hERG inhibition is essential to develop a second generation of safe drugs, with minimal cardiotoxic effects. Although various computational tools and drug design guidelines have been developed to avoid binding of drugs to the hERG pore domain, there are many other aspects that are still open for investigation. This includes the use computational modelling to study the implications of hERG mutations on hERG structure and trafficking, the interactions of hERG with hERG chaperone proteins and with membrane-soluble molecules, the mechanisms of drugs that inhibit hERG trafficking and drugs that rescue hERG mutations. The plethora of available experimental data regarding all these aspects can guide the construction of much needed robust computational structural models to study these mechanisms for the rational design of safe drugs.
Collapse
Affiliation(s)
- Sara AlRawashdeh
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, Canada
| | | | - Khaled H Barakat
- Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, AB, Canada.
| |
Collapse
|
22
|
Tran TTV, Surya Wibowo A, Tayara H, Chong KT. Artificial Intelligence in Drug Toxicity Prediction: Recent Advances, Challenges, and Future Perspectives. J Chem Inf Model 2023; 63:2628-2643. [PMID: 37125780 DOI: 10.1021/acs.jcim.3c00200] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Toxicity prediction is a critical step in the drug discovery process that helps identify and prioritize compounds with the greatest potential for safe and effective use in humans, while also reducing the risk of costly late-stage failures. It is estimated that over 30% of drug candidates are discarded owing to toxicity. Recently, artificial intelligence (AI) has been used to improve drug toxicity prediction as it provides more accurate and efficient methods for identifying the potentially toxic effects of new compounds before they are tested in human clinical trials, thus saving time and money. In this review, we present an overview of recent advances in AI-based drug toxicity prediction, including the use of various machine learning algorithms and deep learning architectures, of six major toxicity properties and Tox21 assay end points. Additionally, we provide a list of public data sources and useful toxicity prediction tools for the research community and highlight the challenges that must be addressed to enhance model performance. Finally, we discuss future perspectives for AI-based drug toxicity prediction. This review can aid researchers in understanding toxicity prediction and pave the way for new methods of drug discovery.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University - Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Agung Surya Wibowo
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Department of Electrical Engineering, Telkom University, Bandung 40257, Indonesia
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
23
|
Yang W, Ouyang Q, Zhu Z, Wu Y, Fan M, Liao Y, Guo X, Xu Z, Zhang X, Zhang Y, Hu N, Zhang D. A biosensing system employing nonlinear dynamic analysis-assisted neural network for drug-induced cardiotoxicity assessment. Biosens Bioelectron 2023; 222:114923. [PMID: 36455375 DOI: 10.1016/j.bios.2022.114923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 11/16/2022] [Indexed: 11/18/2022]
Abstract
Preclinical investigation of drug-induced cardiotoxicity is of importance for drug development. To evaluate such cardiotoxicity, in vitro high-throughput interdigitated electrode-based recording of cardiomyocytes mechanical beating is widely used. To automatically analyze the features from the beating signals for drug-induced cardiotoxicity assessment, artificial neural network analysis is conventionally employed and signals are segmented into cycles and feature points are located in the cycles. However, signal segmentation and location of feature points for different signal shapes require design of specific algorithms. Consequently, this may lower the efficiency of research and the applications of such algorithms in signals with different morphologies are limited. Here, we present a biosensing system that employs nonlinear dynamic analysis-assisted neural network (NDANN) to avoid the signal segmentation process and directly extract features from beating signal time series. By processing beating time series with fixed time duration to avoid the signal segmentation process, this NDANN-based biosensing system can identify drug-induced cardiotoxicity with accuracy over 0.99. The individual drugs were classified with high accuracies over 0.94 and drug-induced cardiotoxicity levels were accurately predicted. We also evaluated the generalization performance of the NDANN-based biosensing system in assessing drug-induced cardiotoxicity through an independent dataset. This system achieved accuracy of 0.85-0.95 for different drug concentrations in identification of drug-induced cardiotoxicity. This result demonstrates that our NDANN-based biosensing system has the capacity of screening newly developed drugs, which is crucial in practical applications. This NDANN-based biosensing system can work as a new screening platform for drug-induced cardiotoxicity and improve the efficiency of bio-signal processing.
Collapse
Affiliation(s)
- Wenjian Yang
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Qiangqiang Ouyang
- First Affiliated Hospital, Sun Yat-Sen University, Guangzhou, 510006, China
| | - Zhijing Zhu
- Key Laboratory of Novel Target and Drug Study for Neural Repair of Zhejiang Province, School of Medicine, School of Computer & Computing Science, Zhejiang University City College, Hangzhou, 310015, China; School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou, 310058, China
| | - Yue Wu
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China.
| | - Minzhi Fan
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Yuheng Liao
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Xinyu Guo
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Zhongyuan Xu
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Xiaoyu Zhang
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Yunshan Zhang
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China
| | - Ning Hu
- ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311200, China; Stoddart Institute of Molecular Science, Department of Chemistry, Zhejiang University, Hangzhou, 310058, China
| | - Diming Zhang
- Research Center for Intelligent Sensing Systems, Zhejiang Laboratory, Hangzhou, 311100, China.
| |
Collapse
|
24
|
Wang T, Sun J, Zhao Q. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med 2023; 153:106464. [PMID: 36584603 DOI: 10.1016/j.compbiomed.2022.106464] [Citation(s) in RCA: 105] [Impact Index Per Article: 105.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 12/12/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022]
Abstract
Human ether-a-go-go-related gene (hERG) channel blockade by small molecules is a big concern during drug development in the pharmaceutical industry. Failure or inhibition of hERG channel activity caused by drug molecules can lead to prolonging QT interval, which will result in serious cardiotoxicity. Thus, evaluating the hERG blocking activity of all these small molecular compounds is technically challenging, and the relevant procedures are expensive and time-consuming. In this study, we develop a novel deep learning predictive model named DMFGAM for predicting hERG blockers. In order to characterize the molecule more comprehensively, we first consider the fusion of multiple molecular fingerprint features to characterize its final molecular fingerprint features. Then, we use the multi-head attention mechanism to extract the molecular graph features. Both molecular fingerprint features and molecular graph features are fused as the final features of the compounds to make the feature expression of compounds more comprehensive. Finally, the molecules are classified into hERG blockers or hERG non-blockers through the fully connected neural network. We conduct 5-fold cross-validation experiment to evaluate the performance of DMFGAM, and verify the robustness of DMFGAM on external validation datasets. We believe DMFGAM can serve as a powerful tool to predict hERG channel blockers in the early stages of drug discovery and development.
Collapse
Affiliation(s)
- Tianyi Wang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China
| | - Jianqiang Sun
- School of Automation and Electrical Engineering, Linyi University, Linyi, 276000, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
| |
Collapse
|
25
|
Iftkhar S, de Sá AGC, Velloso JPL, Aljarf R, Pires DEV, Ascher DB. cardioToxCSM: A Web Server for Predicting Cardiotoxicity of Small Molecules. J Chem Inf Model 2022; 62:4827-4836. [PMID: 36219164 DOI: 10.1021/acs.jcim.2c00822] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The design of novel, safe, and effective drugs to treat human diseases is a challenging venture, with toxicity being one of the main sources of attrition at later stages of development. Failure due to toxicity incurs a significant increase in costs and time to market, with multiple drugs being withdrawn from the market due to their adverse effects. Cardiotoxicity, for instance, was responsible for the failure of drugs such as fenspiride, propoxyphene, and valdecoxib. While significant effort has been dedicated to mitigate this issue by developing computational approaches that aim to identify molecules likely to be toxic, including quantitative structure-activity relationship models and machine learning methods, current approaches present limited performance and interpretability. To overcome these, we propose a new web-based computational method, cardioToxCSM, which can predict six types of cardiac toxicity outcomes, including arrhythmia, cardiac failure, heart block, hERG toxicity, hypertension, and myocardial infarction, efficiently and accurately. cardioToxCSM was developed using the concept of graph-based signatures, molecular descriptors, toxicophore matchings, and molecular fingerprints, leveraging explainable machine learning, and was validated internally via different cross validation schemes and externally via low-redundancy blind sets. The models presented robust performances with areas under ROC curves of up to 0.898 on 5-fold cross-validation, consistent with metrics on blind tests. Additionally, our models provide interpretation of the predictions by identifying whether substructures that are commonly enriched in toxic compounds were present. We believe cardioToxCSM will provide valuable insight into the potential cardiotoxicity of small molecules early on drug screening efforts. The method is made freely available as a web server at https://biosig.lab.uq.edu.au/cardiotoxcsm.
Collapse
Affiliation(s)
- Saba Iftkhar
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia 4072, Queensland, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia
| | - Alex G C de Sá
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia 4072, Queensland, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia.,Baker Department of Cardiometabolic Health, Melbourne Medical School, University of Melbourne, Parkville 3010, Victoria, Australia
| | - João P L Velloso
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia 4072, Queensland, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia
| | - Raghad Aljarf
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia.,Baker Department of Cardiometabolic Health, Melbourne Medical School, University of Melbourne, Parkville 3010, Victoria, Australia
| | - Douglas E V Pires
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Parkville 3052, Victoria, Australia
| | - David B Ascher
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia 4072, Queensland, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Parkville 3052, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne 3004, Victoria, Australia.,Baker Department of Cardiometabolic Health, Melbourne Medical School, University of Melbourne, Parkville 3010, Victoria, Australia
| |
Collapse
|
26
|
Deep learning methods for molecular representation and property prediction. Drug Discov Today 2022; 27:103373. [PMID: 36167282 DOI: 10.1016/j.drudis.2022.103373] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 08/22/2022] [Accepted: 09/21/2022] [Indexed: 01/11/2023]
Abstract
With advances in artificial intelligence (AI) methods, computer-aided drug design (CADD) has developed rapidly in recent years. Effective molecular representation and accurate property prediction are crucial tasks in CADD workflows. In this review, we summarize contemporary applications of deep learning (DL) methods for molecular representation and property prediction. We categorize DL methods according to the format of molecular data (1D, 2D, and 3D). In addition, we discuss some common DL models, such as ensemble learning and transfer learning, and analyze the interpretability methods for these models. We also highlight the challenges and opportunities of DL methods for molecular representation and property prediction.
Collapse
|
27
|
Potter LE, Burgess CR. The melanin-concentrating hormone system as a target for the treatment of sleep disorders. Front Neurosci 2022; 16:952275. [PMID: 36177357 PMCID: PMC9513178 DOI: 10.3389/fnins.2022.952275] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 08/24/2022] [Indexed: 11/13/2022] Open
Abstract
Given the widespread prevalence of sleep disorders and their impacts on health, it is critical that researchers continue to identify and evaluate novel avenues of treatment. Recently the melanin-concentrating hormone (MCH) system has attracted commercial and scientific interest as a potential target of pharmacotherapy for sleep disorders. This interest emerges from basic scientific research demonstrating a role for MCH in regulating sleep, and particularly REM sleep. In addition to this role in sleep regulation, the MCH system and the MCH receptor 1 (MCHR1) have been implicated in a wide variety of other physiological functions and behaviors, including feeding/metabolism, reward, anxiety, depression, and learning. The basic research literature on sleep and the MCH system, and the history of MCH drug development, provide cause for both skepticism and cautious optimism about the prospects of MCH-targeting drugs in sleep disorders. Extensive efforts have focused on developing MCHR1 antagonists for use in obesity, however, few of these drugs have advanced to clinical trials, and none have gained regulatory approval. Additional basic research will be needed to fully characterize the MCH system’s role in sleep regulation, for example, to fully differentiate between MCH-neuron and peptide/receptor-mediated functions. Additionally, a number of issues relating to drug design will continue to pose a practical challenge for novel pharmacotherapies targeting the MCH system.
Collapse
Affiliation(s)
- Liam E. Potter
- Department of Molecular and Integrative Physiology, Michigan Medicine, Ann Arbor, MI, United States
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, United States
- *Correspondence: Liam E. Potter,
| | - Christian R. Burgess
- Department of Molecular and Integrative Physiology, Michigan Medicine, Ann Arbor, MI, United States
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, MI, United States
- Christian R. Burgess,
| |
Collapse
|
28
|
Delre P, Lavado GJ, Lamanna G, Saviano M, Roncaglioni A, Benfenati E, Mangiatordi GF, Gadaleta D. Ligand-based prediction of hERG-mediated cardiotoxicity based on the integration of different machine learning techniques. Front Pharmacol 2022; 13:951083. [PMID: 36133824 PMCID: PMC9483173 DOI: 10.3389/fphar.2022.951083] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 07/20/2022] [Indexed: 11/13/2022] Open
Abstract
Drug-induced cardiotoxicity is a common side effect of drugs in clinical use or under postmarket surveillance and is commonly due to off-target interactions with the cardiac human-ether-a-go-go-related (hERG) potassium channel. Therefore, prioritizing drug candidates based on their hERG blocking potential is a mandatory step in the early preclinical stage of a drug discovery program. Herein, we trained and properly validated 30 ligand-based classifiers of hERG-related cardiotoxicity based on 7,963 curated compounds extracted by the freely accessible repository ChEMBL (version 25). Different machine learning algorithms were tested, namely, random forest, K-nearest neighbors, gradient boosting, extreme gradient boosting, multilayer perceptron, and support vector machine. The application of 1) the best practices for data curation, 2) the feature selection method VSURF, and 3) the synthetic minority oversampling technique (SMOTE) to properly handle the unbalanced data, allowed for the development of highly predictive models (BAMAX = 0.91, AUCMAX = 0.95). Remarkably, the undertaken temporal validation approach not only supported the predictivity of the herein presented classifiers but also suggested their ability to outperform those models commonly used in the literature. From a more methodological point of view, the study put forward a new computational workflow, freely available in the GitHub repository (https://github.com/PDelre93/hERG-QSAR), as valuable for building highly predictive models of hERG-mediated cardiotoxicity.
Collapse
Affiliation(s)
- Pietro Delre
- CNR—Institute of Crystallography, Bari, Italy
- Chemistry Department, University of Bari “Aldo Moro”, Bari, Italy
| | - Giovanna J. Lavado
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Giuseppe Lamanna
- CNR—Institute of Crystallography, Bari, Italy
- Chemistry Department, University of Bari “Aldo Moro”, Bari, Italy
| | | | - Alessandra Roncaglioni
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Emilio Benfenati
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
| | - Giuseppe Felice Mangiatordi
- CNR—Institute of Crystallography, Bari, Italy
- *Correspondence: Giuseppe Felice Mangiatordi, ; Domenico Gadaleta,
| | - Domenico Gadaleta
- Laboratory of Environmental Chemistry and Toxicology, Department of Environmental Health Sciences, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Milan, Italy
- *Correspondence: Giuseppe Felice Mangiatordi, ; Domenico Gadaleta,
| |
Collapse
|
29
|
Goel H, Yu W, MacKerell AD. hERG Blockade Prediction by Combining Site Identification by Ligand Competitive Saturation and Physicochemical Properties. CHEMISTRY (BASEL, SWITZERLAND) 2022; 4:630-646. [PMID: 36712295 PMCID: PMC9881610 DOI: 10.3390/chemistry4030045] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Human ether-a-go-go-related gene (hERG) potassium channel is well-known contributor to drug-induced cardiotoxicity and therefore an extremely important target when performing safety assessments of drug candidates. Ligand-based approaches in connection with quantitative structure active relationships (QSAR) analyses have been developed to predict hERG toxicity. Availability of the recent published cryogenic electron microscopy (cryo-EM) structure for the hERG channel opened the prospect for using structure-based simulation and docking approaches for hERG drug liability predictions. In recent time, the idea of combining structure- and ligand-based approaches for modeling hERG drug liability has gained momentum offering improvements in predictability when compared to ligand-based QSAR practices alone. The present article demonstrates uniting the structure-based SILCS (site-identification by ligand competitive saturation) approach in conjunction with physicochemical properties to develop predictive models for hERG blockade. This combination leads to improved model predictability based on Pearson's R and percent correct (represents rank-ordering of ligands) metric for different validation sets of hERG blockers involving diverse chemical scaffold and wide range of pIC50 values. The inclusion of the SILCS structure-based approach allows determination of the hERG region to which compounds bind and the contribution of different chemical moieties in the compounds to blockade, thereby facilitating the rational ligand design to minimize hERG liability.
Collapse
Affiliation(s)
- Himanshu Goel
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20 Penn St. Baltimore, MD 21201, United States
| | - Wenbo Yu
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20 Penn St. Baltimore, MD 21201, United States
| | - Alexander D. MacKerell
- Computer Aided Drug Design Center, Department of Pharmaceutical Sciences, University of Maryland School of Pharmacy, 20 Penn St. Baltimore, MD 21201, United States
| |
Collapse
|
30
|
Shan M, Jiang C, Qin L, Cheng G. A Review of Computational Methods in Predicting hERG Channel Blockers. ChemistrySelect 2022. [DOI: 10.1002/slct.202201221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mengyi Shan
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- QuanMin RenZheng (HangZhou) Technology Co. Ltd. China
| | - Lu‐Ping Qin
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Gang Cheng
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
31
|
Zhang Y, Wong G, Mann G, Muller S, Yang JYH. SurvBenchmark: comprehensive benchmarking study of survival analysis methods using both omics data and clinical data. Gigascience 2022; 11:6652188. [PMID: 35906887 PMCID: PMC9338425 DOI: 10.1093/gigascience/giac071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 05/16/2022] [Accepted: 06/22/2022] [Indexed: 11/24/2022] Open
Abstract
Survival analysis is a branch of statistics that deals with both the tracking of time and the survival status simultaneously as the dependent response. Current comparisons of survival model performance mostly center on clinical data with classic statistical survival models, with prediction accuracy often serving as the sole metric of model performance. Moreover, survival analysis approaches for censored omics data have not been thoroughly investigated. The common approach is to binarize the survival time and perform a classification analysis. Here, we develop a benchmarking design, SurvBenchmark, that evaluates a diverse collection of survival models for both clinical and omics data sets. SurvBenchmark not only focuses on classical approaches such as the Cox model but also evaluates state-of-the-art machine learning survival models. All approaches were assessed using multiple performance metrics; these include model predictability, stability, flexibility, and computational issues. Our systematic comparison design with 320 comparisons (20 methods over 16 data sets) shows that the performances of survival models vary in practice over real-world data sets and over the choice of the evaluation metric. In particular, we highlight that using multiple performance metrics is critical in providing a balanced assessment of various models. The results in our study will provide practical guidelines for translational scientists and clinicians, as well as define possible areas of investigation in both survival technique and benchmarking strategies.
Collapse
Affiliation(s)
- Yunwei Zhang
- School of Mathematics and Statistics, The University of Sydney, Sydney 2006, Australia.,Charles Perkins Centre, The University of Sydney, Sydney 2006, Australia
| | - Germaine Wong
- Sydney School of Public Health, The University of Sydney, NSW, Sydney 2006, Australia.,Centre for Kidney Research, Kids Research Institute, The Children's Hospital at Westmead, NSW, 2145, Sydney, Australia.,Centre for Transplant and Renal Research, Westmead Hospital, NSW, 2145, Sydney, Australia
| | - Graham Mann
- John Curtin School of Medical Research, Australian National University, Canberra 2601, Australia.,Melanoma Institute Australia, North Sydney, NSW 2065, Australia
| | - Samuel Muller
- School of Mathematics and Statistics, The University of Sydney, Sydney 2006, Australia.,Department of Mathematics and Statistics, Macquarie University, Sydney 2109, Australia
| | - Jean Y H Yang
- School of Mathematics and Statistics, The University of Sydney, Sydney 2006, Australia.,Charles Perkins Centre, The University of Sydney, Sydney 2006, Australia.,Laboratory of Data Discovery for Health Limited (D24H), Science Park, Hong Kong SAR, China
| |
Collapse
|
32
|
CDI Exerts Anti-Tumor Effects by Blocking the FoxM1-DNA Interaction. Biomedicines 2022; 10:biomedicines10071671. [PMID: 35884976 PMCID: PMC9313426 DOI: 10.3390/biomedicines10071671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 06/30/2022] [Accepted: 07/06/2022] [Indexed: 12/03/2022] Open
Abstract
The Forkhead box protein M1 (FoxM1) is an appealing target for anti-cancer therapeutics as this cell proliferation-associated transcription factor is overexpressed in most human cancers. FoxM1 is involved in tumor invasion, angiogenesis, and metastasis. To discover novel inhibitors that disrupt the FoxM1-DNA interaction, we identified CDI, a small molecule that inhibits the FoxM1–DNA interaction. CDI was identified through an assay based on the time-resolved fluorescence energy transfer response of a labeled consensus oligonucleotide that was bound to a recombinant FoxM1-dsDNA binding domain (FoxM1-DBD) protein and exhibited potent inhibitory activity against FoxM1-DNA interaction. CDI suppressed cell proliferation and induced apoptosis in MDA-MB-231 cells obtained from a breast cancer patient. Furthermore, it decreased not only the mRNA and protein expression of FoxM1 but also that of downstream targets such as CDC25b. Additionally, global transcript profiling of MDA-MB-231 cells by RNA-Seq showed that CDI decreases the expression of FoxM1-regulated genes. The docking and MD simulation results indicated that CDI likely binds to the DNA interaction site of FoxM1-DBD and inhibits the function of FoxM1-DBD. These results of CDI being a possible effective inhibitor of FoxM1-DNA interaction will encourage its usage in pharmaceutical applications.
Collapse
|
33
|
Kim H, Park M, Lee I, Nam H. BayeshERG: a robust, reliable and interpretable deep learning model for predicting hERG channel blockers. Brief Bioinform 2022; 23:6609519. [PMID: 35709752 DOI: 10.1093/bib/bbac211] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2022] [Revised: 04/19/2022] [Accepted: 05/06/2022] [Indexed: 11/13/2022] Open
Abstract
Unintended inhibition of the human ether-à-go-go-related gene (hERG) ion channel by small molecules leads to severe cardiotoxicity. Thus, hERG channel blockage is a significant concern in the development of new drugs. Several computational models have been developed to predict hERG channel blockage, including deep learning models; however, they lack robustness, reliability and interpretability. Here, we developed a graph-based Bayesian deep learning model for hERG channel blocker prediction, named BayeshERG, which has robust predictive power, high reliability and high resolution of interpretability. First, we applied transfer learning with 300 000 large data in initial pre-training to increase the predictive performance. Second, we implemented a Bayesian neural network with Monte Carlo dropout to calibrate the uncertainty of the prediction. Third, we utilized global multihead attentive pooling to augment the high resolution of structural interpretability for the hERG channel blockers and nonblockers. We conducted both internal and external validations for stringent evaluation; in particular, we benchmarked most of the publicly available hERG channel blocker prediction models. We showed that our proposed model outperformed predictive performance and uncertainty calibration performance. Furthermore, we found that our model learned to focus on the essential substructures of hERG channel blockers via an attention mechanism. Finally, we validated the prediction results of our model by conducting in vitro experiments and confirmed its high validity. In summary, BayeshERG could serve as a versatile tool for discovering hERG channel blockers and helping maximize the possibility of successful drug discovery. The data and source code are available at our GitHub repository (https://github.com/GIST-CSBL/BayeshERG).
Collapse
Affiliation(s)
- Hyunho Kim
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Buk-gu, Gwangju, 61005, Republic of Korea
| | - Minsu Park
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Buk-gu, Gwangju, 61005, Republic of Korea
| | - Ingoo Lee
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Buk-gu, Gwangju, 61005, Republic of Korea
| | - Hojung Nam
- School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), Buk-gu, Gwangju, 61005, Republic of Korea
| |
Collapse
|
34
|
Identification and New Indication of Melanin-Concentrating Hormone Receptor 1 (MCHR1) Antagonist Derived from Machine Learning and Transcriptome-Based Drug Repositioning Approaches. Int J Mol Sci 2022; 23:ijms23073807. [PMID: 35409167 PMCID: PMC8998904 DOI: 10.3390/ijms23073807] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 03/28/2022] [Accepted: 03/28/2022] [Indexed: 01/02/2023] Open
Abstract
Melanin-concentrating hormone receptor 1 (MCHR1) has been a target for appetite suppressants, which are helpful in treating obesity. However, it is challenging to develop an MCHR1 antagonist because its binding site is similar to that of the human Ether-à-go-go-Related Gene (hERG) channel, whose inhibition may cause cardiotoxicity. Most drugs developed as MCHR1 antagonists have failed in clinical development due to cardiotoxicity caused by hERG inhibition. Machine learning-based prediction models can overcome these difficulties and provide new opportunities for drug discovery. In this study, we identified KRX-104130 with potent MCHR1 antagonistic activity and no cardiotoxicity through virtual screening using two MCHR1 binding affinity prediction models and an hERG-induced cardiotoxicity prediction model. In addition, we explored other possibilities for expanding the new indications for KRX-104130 using a transcriptome-based drug repositioning approach. KRX-104130 increased the expression of low-density lipoprotein receptor (LDLR), which induced cholesterol reduction in the gene expression analysis. This was confirmed by comparison with gene expression in a nonalcoholic steatohepatitis (NASH) patient group. In a NASH mouse model, the administration of KRX-104130 showed a protective effect by reducing hepatic lipid accumulation, liver injury, and histopathological changes, indicating a promising prospect for the therapeutic effect of NASH as a new indication for MCHR1 antagonists.
Collapse
|
35
|
Liu M, Sun ZL, Zeng Z, Lam KM. MGF6mARice: prediction of DNA N6-methyladenine sites in rice by exploiting molecular graph feature and residual block. Brief Bioinform 2022; 23:6553606. [PMID: 35325050 DOI: 10.1093/bib/bbac082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 02/13/2022] [Accepted: 02/16/2022] [Indexed: 11/12/2022] Open
Abstract
DNA N6-methyladenine (6mA) is produced by the N6 position of the adenine being methylated, which occurs at the molecular level, and is involved in numerous vital biological processes in the rice genome. Given the shortcomings of biological experiments, researchers have developed many computational methods to predict 6mA sites and achieved good performance. However, the existing methods do not consider the occurrence mechanism of 6mA to extract features from the molecular structure. In this paper, a novel deep learning method is proposed by devising DNA molecular graph feature and residual block structure for 6mA sites prediction in rice, named MGF6mARice. Firstly, the DNA sequence is changed into a simplified molecular input line entry system (SMILES) format, which reflects chemical molecular structure. Secondly, for the molecular structure data, we construct the DNA molecular graph feature based on the principle of graph convolutional network. Then, the residual block is designed to extract higher level, distinguishable features from molecular graph features. Finally, the prediction module is used to obtain the result of whether it is a 6mA site. By means of 10-fold cross-validation, MGF6mARice outperforms the state-of-the-art approaches. Multiple experiments have shown that the molecular graph feature and residual block can promote the performance of MGF6mARice in 6mA prediction. To the best of our knowledge, it is the first time to derive a feature of DNA sequence by considering the chemical molecular structure. We hope that MGF6mARice will be helpful for researchers to analyze 6mA sites in rice.
Collapse
Affiliation(s)
- Mengya Liu
- School of Computer Science and Technology, Anhui University, Hefei, 230601, China
| | - Zhan-Li Sun
- School of Artificial Intelligence, Anhui University, Hefei, 230601, China
| | - Zhigang Zeng
- School of Artificial Intelligence and Automation, Huazhong University of Science and Technology, Wuhan, 430074, China
| | - Kin-Man Lam
- Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong, China
| |
Collapse
|
36
|
Staszak M, Staszak K, Wieszczycka K, Bajek A, Roszkowski K, Tylkowski B. Machine learning in drug design: Use of artificial intelligence to explore the chemical structure–biological activity relationship. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1568] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Maciej Staszak
- Institute of Technology and Chemical Engineering Poznan University of Technology Poznan Poland
| | - Katarzyna Staszak
- Institute of Technology and Chemical Engineering Poznan University of Technology Poznan Poland
| | - Karolina Wieszczycka
- Institute of Technology and Chemical Engineering Poznan University of Technology Poznan Poland
| | - Anna Bajek
- Department of Tissue Engineering Collegium Medicum, Nicolaus Copernicus University Bydgoszcz Poland
| | - Krzysztof Roszkowski
- Department of Oncology Collegium Medicum Nicolaus Copernicus University Bydgoszcz Poland
| | - Bartosz Tylkowski
- Department of Chemical Engineering University Rovira i Virgili Tarragona Spain
- Eurecat, Centre Tecnològic de Catalunya Chemical Technologies Unit Tarragona Spain
| |
Collapse
|
37
|
Shan M, Jiang C, Chen J, Qin LP, Qin JJ, Cheng G. Predicting hERG channel blockers with directed message passing neural networks. RSC Adv 2022; 12:3423-3430. [PMID: 35425351 PMCID: PMC8979305 DOI: 10.1039/d1ra07956e] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2021] [Accepted: 12/13/2021] [Indexed: 11/30/2022] Open
Abstract
Compounds with human ether-à-go-go related gene (hERG) blockade activity may cause severe cardiotoxicity. Assessing the hERG liability in the early stages of the drug discovery process is important, and the in silico methods for predicting hERG channel blockers are actively pursued. In the present study, the directed message passing neural network (D-MPNN) was applied to construct classification models for identifying hERG blockers based on diverse datasets. Several descriptors and fingerprints were tested along with the D-MPNN model. Among all these combinations, D-MPNN with the moe206 descriptors generated from MOE (D-MPNN + moe206) showed significantly improved performances. The AUC-ROC values of the D-MPNN + moe206 model reached 0.956 ± 0.005 under random split and 0.922 ± 0.015 under scaffold split on Cai's hERG dataset, respectively. Moreover, the comparisons between our models and several recently reported machine learning models were made based on various datasets. Our results indicated that the D-MPNN + moe206 model is among the best classification models. Overall, the excellent performance of the DMPNN + moe206 model achieved in this study highlights its potential application in the discovery of novel and effective hERG blockers. Compounds with human ether-à-go-go related gene (hERG) blockade activity may cause severe cardiotoxicity.![]()
Collapse
Affiliation(s)
- Mengyi Shan
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China .,Hangzhou Jingchun Trading Co., Ltd. China
| | - Jing Chen
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China .,College of Pharmaceutical Sciences, Zhejiang University Hangzhou Zhejiang 310058 PR China
| | - Lu-Ping Qin
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Jiang-Jiang Qin
- The Cancer Hospital of the University of Chinese Academy of Sciences, Zhejiang Cancer Hospital, Institute of Basic Medicine and Cancer (IBMC), Chinese Academy of Sciences Hangzhou 310022 China
| | - Gang Cheng
- College of Pharmaceutical Sciences, Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
38
|
Ryu JY, Lee JH, Lee BH, Song JS, Ahn S, Oh KS. PredMS: a random forest model for predicting metabolic stability of drug candidates in human liver microsomes. Bioinformatics 2022; 38:364-368. [PMID: 34515778 DOI: 10.1093/bioinformatics/btab547] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Revised: 07/22/2021] [Accepted: 09/08/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Poor metabolic stability leads to drug development failure. Therefore, it is essential to evaluate the metabolic stability of small compounds for successful drug discovery and development. However, evaluating metabolic stability in vitro and in vivo is expensive, time-consuming and laborious. In addition, only a few free software programs are available for metabolic stability data and prediction. Therefore, in this study, we aimed to develop a prediction model that predicts the metabolic stability of small compounds. RESULTS We developed a computational model, PredMS, which predicts the metabolic stability of small compounds as stable or unstable in human liver microsomes. PredMS is based on a random forest model using an in-house database of metabolic stability data of 1917 compounds. To validate the prediction performance of PredMS, we generated external test data of 61 compounds. PredMS achieved an accuracy of 0.74, Matthew's correlation coefficient of 0.48, sensitivity of 0.70, specificity of 0.86, positive predictive value of 0.94 and negative predictive value of 0.46 on the external test dataset. PredMS will be a useful tool to predict the metabolic stability of small compounds in the early stages of drug discovery and development. AVAILABILITY AND IMPLEMENTATION The source code for PredMS is available at https://bitbucket.org/krictai/predms, and the PredMS web server is available at https://predms.netlify.app. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Jae Yong Ryu
- Department of Biotechnology, Duksung Women's University, Seoul 01369, Republic of Korea
| | - Jeong Hyun Lee
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, 34114 Daejeon, Republic of Korea
| | - Byung Ho Lee
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, 34114 Daejeon, Republic of Korea
| | - Jin Sook Song
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, 34114 Daejeon, Republic of Korea
| | - Sunjoo Ahn
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, 34114 Daejeon, Republic of Korea.,Department of Medicinal and Pharmaceutical Chemistry, University of Science and Technology, Daejeon 34129, Republic of Korea
| | - Kwang-Seok Oh
- Data Convergence Drug Research Center, Korea Research Institute of Chemical Technology, 34114 Daejeon, Republic of Korea.,Department of Medicinal and Pharmaceutical Chemistry, University of Science and Technology, Daejeon 34129, Republic of Korea
| |
Collapse
|
39
|
Wang Y, Wang B, Jiang J, Guo J, Lai J, Lian XY, Wu J. Multitask CapsNet: An Imbalanced Data Deep Learning Method for Predicting Toxicants. ACS OMEGA 2021; 6:26545-26555. [PMID: 34661009 PMCID: PMC8515573 DOI: 10.1021/acsomega.1c03842] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 09/14/2021] [Indexed: 05/17/2023]
Abstract
Drug development has a high failure rate, with safety properties constituting a considerable challenge. To reduce risk, in silico tools, including various machine learning methods, have been applied for toxicity prediction. However, these approaches often confront a serious problem: the training data sets are usually biased (imbalanced positive and negative samples), which would result in model training difficulty and unsatisfactory prediction accuracy. Multitask networks obtained significantly better predictive accuracies than single-task methods, and capsule neural networks showed excellent performance in sparse data sets in previous studies. In this study, we developed a new multitask framework based on a capsule neural network (multitask CapsNet) to measure 12 different toxic effects simultaneously. We found that multitask CapsNet excelled in toxicity prediction and outperformed many other computational approaches using the multitask strategy. Only after training on biased data sets did multitask CapsNet achieve significantly improved prediction accuracy on the Tox21 Data Challenge, which gave the largest ratio of highest accuracy (8/12) among compared models. Our model gave a prediction accuracy of 96.6% for the target NR.PPAR.gamma, whose ratio of negative to positive samples was up to 36:1. These results suggested that multitask CapsNet could overcome the bias problems and would provide a novel, accurate, and efficient approach for predicting the toxicities of compounds.
Collapse
Affiliation(s)
- Yiwei Wang
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Binyou Wang
- School
of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Jie Jiang
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Jianmin Guo
- School
of Preclinical Medicine, Southwest Medical
University, Luzhou 646000, China
| | - Jia Lai
- School
of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Xiao-Yuan Lian
- School
of Pharmacy, Zhejiang University, Hangzhou 310011, China
| | - Jianming Wu
- Key
Laboratory of Medical Electrophysiology, Ministry of Education of
China, Medical Key Laboratory for Drug Discovery and Druggability
Evaluation of Sichuan Province, Luzhou Key
Laboratory of Activity Screening and Druggability Evaluation for Chinese
Materia Medica, Luzhou 646000, China
| |
Collapse
|
40
|
Creanza TM, Delre P, Ancona N, Lentini G, Saviano M, Mangiatordi GF. Structure-Based Prediction of hERG-Related Cardiotoxicity: A Benchmark Study. J Chem Inf Model 2021; 61:4758-4770. [PMID: 34506150 PMCID: PMC9282647 DOI: 10.1021/acs.jcim.1c00744] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
![]()
Drug-induced blockade of the human
ether-à-go-go-related
gene (hERG) channel is today considered the main
cause of cardiotoxicity in postmarketing surveillance. Hence, several
ligand-based approaches were developed in the last years and are currently
employed in the early stages of a drug discovery process for in silico cardiac safety assessment of drug candidates.
Herein, we present the first structure-based classifiers able to discern hERG binders from nonbinders. LASSO regularized support
vector machines were applied to integrate docking scores and protein–ligand
interaction fingerprints. A total of 396 models were trained and validated
based on: (i) high-quality experimental bioactivity information returned
by 8337 curated compounds extracted from ChEMBL (version 25) and (ii)
structural predictor data. Molecular docking simulations were performed
using GLIDE and GOLD software programs and four different hERG structural models, namely, the recently published structures
obtained by cryoelectron microscopy (PDB codes: 5VA1 and 7CN1) and
two published homology models selected for comparison. Interestingly,
some classifiers return performances comparable to ligand-based models
in terms of area under the ROC curve (AUCMAX = 0.86 ±
0.01) and negative predictive values (NPVMAX = 0.81 ±
0.01), thus putting forward the herein proposed computational workflow
as a valuable tool for predicting hERG-related cardiotoxicity
without the limitations of ligand-based models, typically affected
by low interpretability and a limited applicability domain. From a
methodological point of view, our study represents the first example
of a successful integration of docking scores and protein–ligand
interaction fingerprints (IFs) through a support vector machine (SVM)
LASSO regularized strategy. Finally, the study highlights the importance
of using hERG structural models accounting for ligand-induced
fit effects and allowed us to select the best-performing protein conformation
(made available in the Supporting Information, SI) to be employed
for a reliable structure-based prediction of hERG-related cardiotoxicity.
Collapse
Affiliation(s)
- Teresa Maria Creanza
- CNR-Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, Via Amendola 122/o, 70126 Bari, Italy
| | - Pietro Delre
- Chemistry Department, University of Bari "Aldo Moro", via E. Orabona, 4, I-70125 Bari, Italy.,CNR-Institute of Crystallography, Via Amendola 122/o, 70126 Bari, Italy
| | - Nicola Ancona
- CNR-Institute of Intelligent Industrial Technologies and Systems for Advanced Manufacturing, Via Amendola 122/o, 70126 Bari, Italy
| | - Giovanni Lentini
- Department of Pharmacy-Pharmaceutical Sciences, University of Bari "Aldo Moro", via E. Orabona, 4, I-70125 Bari, Italy
| | - Michele Saviano
- CNR-Institute of Crystallography, Via Amendola 122/o, 70126 Bari, Italy
| | | |
Collapse
|
41
|
Lee KH, Fant AD, Guo J, Guan A, Jung J, Kudaibergenova M, Miranda WE, Ku T, Cao J, Wacker S, Duff HJ, Newman AH, Noskov SY, Shi L. Toward Reducing hERG Affinities for DAT Inhibitors with a Combined Machine Learning and Molecular Modeling Approach. J Chem Inf Model 2021; 61:4266-4279. [PMID: 34420294 DOI: 10.1021/acs.jcim.1c00856] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Psychostimulant drugs, such as cocaine, inhibit dopamine reuptake via blockading the dopamine transporter (DAT), which is the primary mechanism underpinning their abuse. Atypical DAT inhibitors are dissimilar to cocaine and can block cocaine- or methamphetamine-induced behaviors, supporting their development as part of a treatment regimen for psychostimulant use disorders. When developing these atypical DAT inhibitors as medications, it is necessary to avoid off-target binding that can produce unwanted side effects or toxicities. In particular, the blockade of a potassium channel, human ether-a-go-go (hERG), can lead to potentially lethal ventricular tachycardia. In this study, we established a counter screening platform for DAT and against hERG binding by combining machine learning-based quantitative structure-activity relationship (QSAR) modeling, experimental validation, and molecular modeling and simulations. Our results show that the available data are adequate to establish robust QSAR models, as validated by chemical synthesis and pharmacological evaluation of a validation set of DAT inhibitors. Furthermore, the QSAR models based on subsets of the data according to experimental approaches used have predictive power as well, which opens the door to target specific functional states of a protein. Complementarily, our molecular modeling and simulations identified the structural elements responsible for a pair of DAT inhibitors having opposite binding affinity trends at DAT and hERG, which can be leveraged for rational optimization of lead atypical DAT inhibitors with desired pharmacological properties.
Collapse
Affiliation(s)
- Kuo Hao Lee
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Andrew D Fant
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Jiqing Guo
- Libin Cardiovascular Institute of Alberta, Cumming School of Medicine, University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Andy Guan
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Joslyn Jung
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Mary Kudaibergenova
- Centre for Molecular Simulation, Department of Biological Sciences, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Williams E Miranda
- Centre for Molecular Simulation, Department of Biological Sciences, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Therese Ku
- Medicinal Chemistry Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Jianjing Cao
- Medicinal Chemistry Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Soren Wacker
- Libin Cardiovascular Institute of Alberta, Cumming School of Medicine, University of Calgary, Calgary, Alberta T2N 4N1, Canada.,Centre for Molecular Simulation, Department of Biological Sciences, University of Calgary, Calgary, Alberta T2N 1N4, Canada.,Achlys Inc., 7-126 Li Ka Shing Center for Health and Innovation, Edmonton, Alberta T6G 2E1, Canada
| | - Henry J Duff
- Libin Cardiovascular Institute of Alberta, Cumming School of Medicine, University of Calgary, Calgary, Alberta T2N 4N1, Canada
| | - Amy Hauck Newman
- Medicinal Chemistry Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| | - Sergei Y Noskov
- Centre for Molecular Simulation, Department of Biological Sciences, University of Calgary, Calgary, Alberta T2N 1N4, Canada
| | - Lei Shi
- Computational Chemistry and Molecular Biophysics Section, Molecular Targets and Medications Discovery Branch, National Institute on Drug Abuse-Intramural Research Program, National Institutes of Health, Baltimore, Maryland 21224, United States
| |
Collapse
|
42
|
Coffin AB, Boney R, Hill J, Tian C, Steyger PS. Detecting Novel Ototoxins and Potentiation of Ototoxicity by Disease Settings. Front Neurol 2021; 12:725566. [PMID: 34489859 PMCID: PMC8418111 DOI: 10.3389/fneur.2021.725566] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 07/22/2021] [Indexed: 12/24/2022] Open
Abstract
Over 100 drugs and chemicals are associated with permanent hearing loss, tinnitus, and vestibular deficits, collectively known as ototoxicity. The ototoxic potential of drugs is rarely assessed in pre-clinical drug development or during clinical trials, so this debilitating side-effect is often discovered as patients begin to report hearing loss. Furthermore, drug-induced ototoxicity in adults, and particularly in elderly patients, may go unrecognized due to hearing loss from a variety of etiologies because of a lack of baseline assessments immediately prior to novel therapeutic treatment. During the current pandemic, there is an intense effort to identify new drugs or repurpose FDA-approved drugs to treat COVID-19. Several potential COVID-19 therapeutics are known ototoxins, including chloroquine (CQ) and lopinavir-ritonavir, demonstrating the necessity to identify ototoxic potential in existing and novel medicines. Furthermore, several factors are emerging as potentiators of ototoxicity, such as inflammation (a hallmark of COVID-19), genetic polymorphisms, and ototoxic synergy with co-therapeutics, increasing the necessity to evaluate a drug's potential to induce ototoxicity under varying conditions. Here, we review the potential of COVID-19 therapies to induce ototoxicity and factors that may compound their ototoxic effects. We then discuss two models for rapidly detecting the potential for ototoxicity: mammalian auditory cell lines and the larval zebrafish lateral line. These models offer considerable value for pre-clinical drug development, including development of COVID-19 therapies. Finally, we show the validity of in silico screening for ototoxic potential using a computational model that compares structural similarity of compounds of interest with a database of known ototoxins and non-ototoxins. Preclinical screening at in silico, in vitro, and in vivo levels can provide an earlier indication of the potential for ototoxicity and identify the subset of candidate therapeutics for treating COVID-19 that need to be monitored for ototoxicity as for other widely-used clinical therapeutics, like aminoglycosides and cisplatin.
Collapse
Affiliation(s)
| | | | - Jordan Hill
- Washington State University Vancouver, Vancouver, WA, United States
| | - Cong Tian
- Department of Biomedical Sciences, School of Medicine, Creighton University, Omaha, NE, United States
| | - Peter S. Steyger
- Department of Biomedical Sciences, School of Medicine, Creighton University, Omaha, NE, United States
- National Center for Rehabilitative Auditory Research, Portland, OR, United States
| |
Collapse
|
43
|
Karim A, Lee M, Balle T, Sattar A. CardioTox net: a robust predictor for hERG channel blockade based on deep learning meta-feature ensembles. J Cheminform 2021; 13:60. [PMID: 34399849 PMCID: PMC8365955 DOI: 10.1186/s13321-021-00541-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 08/05/2021] [Indexed: 11/10/2022] Open
Abstract
MOTIVATION Ether-a-go-go-related gene (hERG) channel blockade by small molecules is a big concern during drug development in the pharmaceutical industry. Blockade of hERG channels may cause prolonged QT intervals that potentially could lead to cardiotoxicity. Various in-silico techniques including deep learning models are widely used to screen out small molecules with potential hERG related toxicity. Most of the published deep learning methods utilize a single type of features which might restrict their performance. Methods based on more than one type of features such as DeepHIT struggle with the aggregation of extracted information. DeepHIT shows better performance when evaluated against one or two accuracy metrics such as negative predictive value (NPV) and sensitivity (SEN) but struggle when evaluated against others such as Matthew correlation coefficient (MCC), accuracy (ACC), positive predictive value (PPV) and specificity (SPE). Therefore, there is a need for a method that can efficiently aggregate information gathered from models based on different chemical representations and boost hERG toxicity prediction over a range of performance metrics. RESULTS In this paper, we propose a deep learning framework based on step-wise training to predict hERG channel blocking activity of small molecules. Our approach utilizes five individual deep learning base models with their respective base features and a separate neural network to combine the outputs of the five base models. By using three external independent test sets with potency activity of IC50 at a threshold of 10 [Formula: see text]m, our method achieves better performance for a combination of classification metrics. We also investigate the effective aggregation of chemical information extracted for robust hERG activity prediction. In summary, CardioTox net can serve as a robust tool for screening small molecules for hERG channel blockade in drug discovery pipelines and performs better than previously reported methods on a range of classification metrics.
Collapse
Affiliation(s)
- Abdul Karim
- School of Information Communication Technology, Griffith University, 4111 Nathan, Brisbane, Australia
| | - Matthew Lee
- School of Information Communication Technology, Griffith University, 4111 Nathan, Brisbane, Australia
| | - Thomas Balle
- Sydney Pharmacy School, Faculty of Medicine and Health, The University of Sydney, 2006 Sydney, Australia
- Brain and Mind Centre, The University of Sydney, 2050 Sydney, Australia
| | - Abdul Sattar
- Institute of Integrated and Intelligent Systems, Griffith University, 4111 Nathan, Brisbane, Australia
| |
Collapse
|
44
|
Aghasafari P, Yang PC, Kernik DC, Sakamoto K, Kanda Y, Kurokawa J, Vorobyov I, Clancy CE. A deep learning algorithm to translate and classify cardiac electrophysiology. eLife 2021; 10:68335. [PMID: 34212860 PMCID: PMC8282335 DOI: 10.7554/elife.68335] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Accepted: 06/29/2021] [Indexed: 01/15/2023] Open
Abstract
The development of induced pluripotent stem cell-derived cardiomyocytes (iPSC-CMs) has been a critical in vitro advance in the study of patient-specific physiology, pathophysiology, and pharmacology. We designed a new deep learning multitask network approach intended to address the low throughput, high variability, and immature phenotype of the iPSC-CM platform. The rationale for combining translation and classification tasks is because the most likely application of the deep learning technology we describe here is to translate iPSC-CMs following application of a perturbation. The deep learning network was trained using simulated action potential (AP) data and applied to classify cells into the drug-free and drugged categories and to predict the impact of electrophysiological perturbation across the continuum of aging from the immature iPSC-CMs to the adult ventricular myocytes. The phase of the AP extremely sensitive to perturbation due to a steep rise of the membrane resistance was found to contain the key information required for successful network multitasking. We also demonstrated successful translation of both experimental and simulated iPSC-CM AP data validating our network by prediction of experimental drug-induced effects on adult cardiomyocyte APs by the latter.
Collapse
Affiliation(s)
- Parya Aghasafari
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, United States
| | - Pei-Chi Yang
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, United States
| | - Divya C Kernik
- Washington University in St. Louis, St. Louis, United States
| | - Kazuho Sakamoto
- Department of Bio-Informational Pharmacology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Yasunari Kanda
- Division of Pharmacology, National Institute of Health Sciences, Kanagawa, Japan
| | - Junko Kurokawa
- Department of Bio-Informational Pharmacology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Igor Vorobyov
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, United States.,Department of Pharmacology, University of California, Davis, Davis, United States
| | - Colleen E Clancy
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, United States
| |
Collapse
|
45
|
Jeon J, Kang S, Kim HU. Predicting biochemical and physiological effects of natural products from molecular structures using machine learning. Nat Prod Rep 2021; 38:1954-1966. [PMID: 34047331 DOI: 10.1039/d1np00016k] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Covering: 2016 to 2021Discovery of novel natural products has been greatly facilitated by advances in genome sequencing, genome mining and analytical techniques. As a result, the volume of data for natural products has increased over the years, which started to serve as ingredients for developing machine learning models. In the past few years, a number of machine learning models have been developed to examine various aspects of a molecule by effectively processing its molecular structure. Understanding of the biological effects of natural products can benefit from such machine learning approaches. In this context, this Highlight reviews recent studies on machine learning models developed to infer various biological effects of molecules. A particular attention is paid to molecular featurization, or computational representation of a molecular structure, which is an essential process during the development of a machine learning model. Technical challenges associated with the use of machine learning for natural products are further discussed.
Collapse
Affiliation(s)
- Junhyeok Jeon
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea.
| | - Seongmo Kang
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea.
| | - Hyun Uk Kim
- Department of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea. and KAIST Institute for Artificial Intelligence, KAIST, Daejeon 34141, Republic of Korea and BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, Daejeon 34141, Republic of Korea
| |
Collapse
|
46
|
Karim A, Riahi V, Mishra A, Newton MAH, Dehzangi A, Balle T, Sattar A. Quantitative Toxicity Prediction via Meta Ensembling of Multitask Deep Learning Models. ACS OMEGA 2021; 6:12306-12317. [PMID: 34056383 PMCID: PMC8154128 DOI: 10.1021/acsomega.1c01247] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 04/22/2021] [Indexed: 05/17/2023]
Abstract
Toxicity prediction using quantitative structure-activity relationship has achieved significant progress in recent years. However, most existing machine learning methods in toxicity prediction utilize only one type of feature representation and one type of neural network, which essentially restricts their performance. Moreover, methods that use more than one type of feature representation struggle with the aggregation of information captured within the features since they use predetermined aggregation formulas. In this paper, we propose a deep learning framework for quantitative toxicity prediction using five individual base deep learning models and their own base feature representations. We then propose to adopt a meta ensemble approach using another separate deep learning model to perform aggregation of the outputs of the individual base deep learning models. We train our deep learning models in a weighted multitask fashion combining four quantitative toxicity data sets of LD50, IGC50, LC50, and LC50-DM and minimizing the root-mean-square errors. Compared to the current state-of-the-art toxicity prediction method TopTox on LD50, IGC50, and LC50-DM, that is, three out of four data sets, our method, respectively, obtains 5.46, 16.67, and 6.34% better root-mean-square errors, 6.41, 11.80, and 12.16% better mean absolute errors, and 5.21, 7.36, and 2.54% better coefficients of determination. We named our method QuantitativeTox, and our implementation is available from the GitHub repository https://github.com/Abdulk084/QuantitativeTox.
Collapse
Affiliation(s)
- Abdul Karim
- School
of Information Communication Technology, Griffith University, Nathan, Brisbane 4111, Australia
| | - Vahid Riahi
- School
of Information Communication Technology, Griffith University, Nathan, Brisbane 4111, Australia
| | - Avinash Mishra
- Department
of Chemical Engineering, Indian Institute
of Technology, Hauz Khas 110016, New Delhi, India
| | - M. A. Hakim Newton
- Institute
of Integrated and Intelligent Systems, Griffith
University, Nathan, Brisbane 4111, Australia
| | - Abdollah Dehzangi
- Department
of Computer Science, Rutgers University
Camden, Camden 08102, New Jersey, United States
- Center
for Computational and Integrative Biology, Rutgers University Camden, Camden 08102, New Jersey, United States
| | - Thomas Balle
- Sydney Pharmacy
School, Faculty of Medicine and Health, The University of Sydney, Camperdown 2006, New South Wales, Australia
- Brain
and Mind Centre, The University of Sydney, Camperdown 2006, New South Wales, Australia
| | - Abdul Sattar
- Institute
of Integrated and Intelligent Systems, Griffith
University, Nathan, Brisbane 4111, Australia
| |
Collapse
|
47
|
Zhao L, Dong Q, Luo C, Wu Y, Bu D, Qi X, Luo Y, Zhao Y. DeepOmix: A scalable and interpretable multi-omics deep learning framework and application in cancer survival analysis. Comput Struct Biotechnol J 2021; 19:2719-2725. [PMID: 34093987 PMCID: PMC8131983 DOI: 10.1016/j.csbj.2021.04.067] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 04/26/2021] [Accepted: 04/27/2021] [Indexed: 01/23/2023] Open
Abstract
Integrative analysis of multi-omics data can elucidate valuable insights into complex molecular mechanisms for various diseases. However, due to their different modalities and high dimension, utilizing and integrating different types of omics data suffers from great challenges. There is an urgent need to develop a powerful method to improve survival prediction and detect functional gene modules from multi-omics data. To deal with these problems, we present DeepOmix (a scalable and interpretable multi-Omics Deep learning framework and application in cancer survival analysis), a flexible, scalable, and interpretable method for extracting relationships between the clinical survival time and multi-omics data based on a deep learning framework. DeepOmix enables the non-linear combination of variables from different omics datasets and incorporates prior biological information defined by users (such as signaling pathways and tissue networks). Benchmark experiments demonstrate that DeepOmix outperforms the other five cutting-edge prediction methods. Besides, Lower Grade Glioma (LGG) is taken as the case study to perform the prognosis prediction and illustrate the functional module nodes which are associated with the prognostic result in the prediction model.
Collapse
Affiliation(s)
- Lianhe Zhao
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qiongye Dong
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Chunlong Luo
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yang Wu
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Dechao Bu
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China
| | - Xiaoning Qi
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yufan Luo
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yi Zhao
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China.,Hwa Mei Hospital, University of Chinese Academy of Sciences, Ningbo 315000, China
| |
Collapse
|
48
|
Zhu S, Wu M, Huang Z, An J. Trends in application of advancing computational approaches in GPCR ligand discovery. Exp Biol Med (Maywood) 2021; 246:1011-1024. [PMID: 33641446 PMCID: PMC8113737 DOI: 10.1177/1535370221993422] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
G protein-coupled receptors (GPCRs) comprise the most important superfamily of protein targets in current ligand discovery and drug development. GPCRs are integral membrane proteins that play key roles in various cellular signaling processes. Therefore, GPCR signaling pathways are closely associated with numerous diseases, including cancer and several neurological, immunological, and hematological disorders. Computer-aided drug design (CADD) can expedite the process of GPCR drug discovery and potentially reduce the actual cost of research and development. Increasing knowledge of biological structures, as well as improvements on computer power and algorithms, have led to unprecedented use of CADD for the discovery of novel GPCR modulators. Similarly, machine learning approaches are now widely applied in various fields of drug target research. This review briefly summarizes the application of rising CADD methodologies, as well as novel machine learning techniques, in GPCR structural studies and bioligand discovery in the past few years. Recent novel computational strategies and feasible workflows are updated, and representative cases addressing challenging issues on olfactory receptors, biased agonism, and drug-induced cardiotoxic effects are highlighted to provide insights into future GPCR drug discovery.
Collapse
Affiliation(s)
- Siyu Zhu
- Division of Infectious Diseases and Global Public Health, Department of Medicine, School of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
- Ciechanover Institute of Precision and Regenerative Medicine, School of Life and Health Sciences, Chinese University of Hong Kong, Shenzhen 518172, China
| | - Meixian Wu
- Division of Infectious Diseases and Global Public Health, Department of Medicine, School of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
| | - Ziwei Huang
- Division of Infectious Diseases and Global Public Health, Department of Medicine, School of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
- Ciechanover Institute of Precision and Regenerative Medicine, School of Life and Health Sciences, Chinese University of Hong Kong, Shenzhen 518172, China
- School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Jing An
- Division of Infectious Diseases and Global Public Health, Department of Medicine, School of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
| |
Collapse
|