1
|
Tran TTV, Tayara H, Chong KT. AMPred-CNN: Ames mutagenicity prediction model based on convolutional neural networks. Comput Biol Med 2024; 176:108560. [PMID: 38754218 DOI: 10.1016/j.compbiomed.2024.108560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Revised: 04/15/2024] [Accepted: 05/05/2024] [Indexed: 05/18/2024]
Abstract
Mutagenicity assessment plays a pivotal role in the safety evaluation of chemicals, pharmaceuticals, and environmental compounds. In recent years, the development of robust computational models for predicting chemical mutagenicity has gained significant attention, driven by the need for efficient and cost-effective toxicity assessments. In this paper, we proposed AMPred-CNN, an innovative Ames mutagenicity prediction model based on Convolutional Neural Networks (CNNs), uniquely employing molecular structures as images to leverage CNNs' powerful feature extraction capabilities. The study employs the widely used benchmark mutagenicity dataset from Hansen et al. for model development and evaluation. Comparative analyses with traditional ML models on different molecular features reveal substantial performance enhancements. AMPred-CNN outshines these models, demonstrating superior accuracy, AUC, F1 score, MCC, sensitivity, and specificity on the test set. Notably, AMPred-CNN is further benchmarked against seven recent ML and DL models, consistently showcasing superior performance with an impressive AUC of 0.954. Our study highlights the effectiveness of CNNs in advancing mutagenicity prediction, paving the way for broader applications in toxicology and drug development.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea; Faculty of Information Technology, An Giang University, Long Xuyen 880000, Viet Nam; Vietnam National University-Ho Chi Minh City, Ho Chi Minh 700000, Viet Nam.
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea.
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea; Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea.
| |
Collapse
|
2
|
Sousa GHM, Gomes RA, de Oliveira EO, Trossini GHG. Machine learning methods applied for the prediction of biological activities of triple reuptake inhibitors. J Biomol Struct Dyn 2023; 41:10277-10286. [PMID: 36546689 DOI: 10.1080/07391102.2022.2154269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 11/25/2022] [Indexed: 12/24/2022]
Abstract
Major depressive disorder (MDD) is characterized by a series of disabling symptoms like anhedonia, depressed mood, lack of motivation for daily tasks and self-extermination thoughts. The monoamine deficiency hypothesis states that depression is mainly caused by a deficiency of monoamine at the synaptic cleft. Thus, major efforts have been made to develop drugs that inhibit serotonin (SERT), norepinephrine (NET) and dopamine (DAT) transporters and increase the availability of these monoamines. Current gold standard treatment of MDD uses drugs that target one or more monoamine transporters. Triple reuptake inhibitors (TRIs) can target SERT, NET, and DAT simultaneously, and are believed to have the potential to be early onset antidepressants. Quantitative structure-activity relationship models were developed using machine learning algorithms in order to predict biological activities of a series of triple reuptake inhibitor compounds that showed in vitro inhibitory activity against multiple targets. The results, using mostly interpretable descriptors, showed that the internal and external predictive ability of the models are adequate, particularly of the DAT and NET by Random Forest and Support Vector Machine models. The current work shows that models developed from relatively simple, chemically interpretable descriptors can predict the activity of TRIs with similar structure in the applicability domain using ML methods.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
| | - Renan Augusto Gomes
- Faculdade de Ciências Farmacêuticas, Universidade de São Paulo, São Paulo, SP, Brazil
| | | | | |
Collapse
|
3
|
Tran TTV, Surya Wibowo A, Tayara H, Chong KT. Artificial Intelligence in Drug Toxicity Prediction: Recent Advances, Challenges, and Future Perspectives. J Chem Inf Model 2023; 63:2628-2643. [PMID: 37125780 DOI: 10.1021/acs.jcim.3c00200] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Toxicity prediction is a critical step in the drug discovery process that helps identify and prioritize compounds with the greatest potential for safe and effective use in humans, while also reducing the risk of costly late-stage failures. It is estimated that over 30% of drug candidates are discarded owing to toxicity. Recently, artificial intelligence (AI) has been used to improve drug toxicity prediction as it provides more accurate and efficient methods for identifying the potentially toxic effects of new compounds before they are tested in human clinical trials, thus saving time and money. In this review, we present an overview of recent advances in AI-based drug toxicity prediction, including the use of various machine learning algorithms and deep learning architectures, of six major toxicity properties and Tox21 assay end points. Additionally, we provide a list of public data sources and useful toxicity prediction tools for the research community and highlight the challenges that must be addressed to enhance model performance. Finally, we discuss future perspectives for AI-based drug toxicity prediction. This review can aid researchers in understanding toxicity prediction and pave the way for new methods of drug discovery.
Collapse
Affiliation(s)
- Thi Tuyet Van Tran
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Faculty of Information Technology, An Giang University, Long Xuyen 880000, Vietnam
- Vietnam National University - Ho Chi Minh City, Ho Chi Minh 700000, Vietnam
| | - Agung Surya Wibowo
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, Republic of Korea
- Department of Electrical Engineering, Telkom University, Bandung 40257, Indonesia
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju 54896, Republic of Korea
| | - Kil To Chong
- Advances Electronics and Information Research Center, Jeonbuk National University, Jeonju 54896, Republic of Korea
| |
Collapse
|
4
|
Lemée P, Fessard V, Habauzit D. Prioritization of mycotoxins based on mutagenicity and carcinogenicity evaluation using combined in silico QSAR methods. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2023; 323:121284. [PMID: 36804886 DOI: 10.1016/j.envpol.2023.121284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 02/01/2023] [Accepted: 02/12/2023] [Indexed: 06/18/2023]
Abstract
Mycotoxins and their metabolites are a family of compounds that contains a great diversity of both structure and biological properties. Information on their toxicity is spread within several databases and in scientific literature. Due to the number of molecules and their structure diversity, the cost and time required for hazard evaluation of each compound is unrealistic. In that purpose, new approach methodologies (NAMs) can be applied to evaluate such large set of molecules. Among them, quantitative structure-activity relationship (QSAR) in silico models could be useful to predict the mutagenic and carcinogenic properties of mycotoxins. First, a complete list of 904 mycotoxins and metabolites was built. Then, some known mycotoxins were used to determine the best QSAR tools for mutagenicity and carcinogenicity predictions. The best tool was further applied to the whole list of 904 mycotoxins. At the end, 95 mycotoxins were identified as both mutagen and carcinogen and should be prioritized for further evaluation.
Collapse
Affiliation(s)
- Pierre Lemée
- ANSES (French Agency for Food, Environmental and Occupational Health & Safety), Toxicology of Contaminants Unit, Fougères, France
| | - Valérie Fessard
- ANSES (French Agency for Food, Environmental and Occupational Health & Safety), Toxicology of Contaminants Unit, Fougères, France
| | - Denis Habauzit
- ANSES (French Agency for Food, Environmental and Occupational Health & Safety), Toxicology of Contaminants Unit, Fougères, France.
| |
Collapse
|
5
|
Lou C, Yang H, Deng H, Huang M, Li W, Liu G, Lee PW, Tang Y. Chemical rules for optimization of chemical mutagenicity via matched molecular pairs analysis and machine learning methods. J Cheminform 2023; 15:35. [PMID: 36941726 PMCID: PMC10029263 DOI: 10.1186/s13321-023-00707-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 03/06/2023] [Indexed: 03/23/2023] Open
Abstract
Chemical mutagenicity is a serious issue that needs to be addressed in early drug discovery. Over a long period of time, medicinal chemists have manually summarized a series of empirical rules for the optimization of chemical mutagenicity. However, given the rising amount of data, it is getting more difficult for medicinal chemists to identify more comprehensive chemical rules behind the biochemical data. Herein, we integrated a large Ames mutagenicity data set with 8576 compounds to derive mutagenicity transformation rules for reversing Ames mutagenicity via matched molecular pairs analysis. A well-trained consensus model with a reasonable applicability domain was constructed, which showed favorable performance in the external validation set with an accuracy of 0.815. The model was used to assess the generalizability and validity of these mutagenicity transformation rules. The results demonstrated that these rules were of great value and could provide inspiration for the structural modifications of compounds with potential mutagenic effects. We also found that the local chemical environment of the attachment points of rules was critical for successful transformation. To facilitate the use of these mutagenicity transformation rules, we integrated them into ADMETopt2 ( http://lmmd.ecust.edu.cn/admetsar2/admetopt2/ ), a free web server for optimization of chemical ADMET properties. The above-mentioned approach would be extended to the optimization of other toxicity endpoints.
Collapse
Affiliation(s)
- Chaofeng Lou
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Hongbin Yang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Hua Deng
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Mengting Huang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Weihua Li
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Guixia Liu
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Philip W Lee
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China
| | - Yun Tang
- Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism, School of Pharmacy, East China University of Science and Technology, Shanghai, 200237, China.
| |
Collapse
|
6
|
Cavasotto CN, Scardino V. Machine Learning Toxicity Prediction: Latest Advances by Toxicity End Point. ACS OMEGA 2022; 7:47536-47546. [PMID: 36591139 PMCID: PMC9798519 DOI: 10.1021/acsomega.2c05693] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 11/28/2022] [Indexed: 05/29/2023]
Abstract
Machine learning (ML) models to predict the toxicity of small molecules have garnered great attention and have become widely used in recent years. Computational toxicity prediction is particularly advantageous in the early stages of drug discovery in order to filter out molecules with high probability of failing in clinical trials. This has been helped by the increase in the number of large toxicology databases available. However, being an area of recent application, a greater understanding of the scope and applicability of ML methods is still necessary. There are various kinds of toxic end points that have been predicted in silico. Acute oral toxicity, hepatotoxicity, cardiotoxicity, mutagenicity, and the 12 Tox21 data end points are among the most commonly investigated. Machine learning methods exhibit different performances on different data sets due to dissimilar complexity, class distributions, or chemical space covered, which makes it hard to compare the performance of algorithms over different toxic end points. The general pipeline to predict toxicity using ML has already been analyzed in various reviews. In this contribution, we focus on the recent progress in the area and the outstanding challenges, making a detailed description of the state-of-the-art models implemented for each toxic end point. The type of molecular representation, the algorithm, and the evaluation metric used in each research work are explained and analyzed. A detailed description of end points that are usually predicted, their clinical relevance, the available databases, and the challenges they bring to the field are also highlighted.
Collapse
Affiliation(s)
- Claudio N. Cavasotto
- Computational
Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones
en Medicina Traslacional (IIMT), CONICET-Universidad
Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Facultad
de Ciencias Biomédicas, Facultad de Ingenierá, Universidad Austral, Pilar, B1630FHB Buenos
Aires, Argentina
| | - Valeria Scardino
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Meton
AI, Inc., Wilmington, Delaware 19801, United
States
| |
Collapse
|
7
|
Martínez MJ, Sabando MV, Soto AJ, Roca C, Requena-Triguero C, Campillo NE, Páez JA, Ponzoni I. Multitask Deep Neural Networks for Ames Mutagenicity Prediction. J Chem Inf Model 2022; 62:6342-6351. [PMID: 36066065 DOI: 10.1021/acs.jcim.2c00532] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The Ames mutagenicity test constitutes the most frequently used assay to estimate the mutagenic potential of drug candidates. While this test employs experimental results using various strains of Salmonella typhimurium, the vast majority of the published in silico models for predicting mutagenicity do not take into account the test results of the individual experiments conducted for each strain. Instead, such QSAR models are generally trained employing overall labels (i.e., mutagenic and nonmutagenic). Recently, neural-based models combined with multitask learning strategies have yielded interesting results in different domains, given their capabilities to model multitarget functions. In this scenario, we propose a novel neural-based QSAR model to predict mutagenicity that leverages experimental results from different strains involved in the Ames test by means of a multitask learning approach. To the best of our knowledge, the modeling strategy hereby proposed has not been applied to model Ames mutagenicity previously. The results yielded by our model surpass those obtained by single-task modeling strategies, such as models that predict the overall Ames label or ensemble models built from individual strains. For reproducibility and accessibility purposes, all source code and datasets used in our experiments are publicly available.
Collapse
Affiliation(s)
- María Jimena Martínez
- ISISTAN (CONICET - UNCPBA) Campus Universitario - Paraje Arroyo Seco, 7000, Tandil, Argentina
| | - María Virginia Sabando
- Institute for Computer Science and Engineering, UNS-CONICET, 8000, Bahía Blanca, Argentina.,Department of Computer Science and Engineering, Universidad Nacional del Sur, 8000, Bahía Blanca, Argentina
| | - Axel J Soto
- Institute for Computer Science and Engineering, UNS-CONICET, 8000, Bahía Blanca, Argentina.,Department of Computer Science and Engineering, Universidad Nacional del Sur, 8000, Bahía Blanca, Argentina
| | - Carlos Roca
- CIB Margarita Salas (CSIC) Ramiro de Maeztu, 9. 28740, Madrid, Spain
| | | | - Nuria E Campillo
- CIB Margarita Salas (CSIC) Ramiro de Maeztu, 9. 28740, Madrid, Spain.,Instituto de Ciencias Matemáticas (CSIC), Nicolás Cabrera, no13-15, Campus de Cantoblanco, UAM, CP 28049, Madrid, Spain
| | - Juan A Páez
- Instituto de Química Médica. Consejo Superior de Investigaciones Científicas (CSIC), Juan de la Cierva 3, 28006, Madrid, Spain
| | - Ignacio Ponzoni
- Institute for Computer Science and Engineering, UNS-CONICET, 8000, Bahía Blanca, Argentina.,Department of Computer Science and Engineering, Universidad Nacional del Sur, 8000, Bahía Blanca, Argentina
| |
Collapse
|