1
|
Le NQK, Tran TX, Nguyen PA, Ho TT, Nguyen VN. Recent progress in machine learning approaches for predicting carcinogenicity in drug development. Expert Opin Drug Metab Toxicol 2024; 20:621-628. [PMID: 38742542 DOI: 10.1080/17425255.2024.2356162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 05/13/2024] [Indexed: 05/16/2024]
Abstract
INTRODUCTION This review explores the transformative impact of machine learning (ML) on carcinogenicity prediction within drug development. It discusses the historical context and recent advancements, emphasizing the significance of ML methodologies in overcoming challenges related to data interpretation, ethical considerations, and regulatory acceptance. AREAS COVERED The review comprehensively examines the integration of ML, deep learning, and diverse artificial intelligence (AI) approaches in various aspects of drug development safety assessments. It explores applications ranging from early-phase compound screening to clinical trial optimization, highlighting the versatility of ML in enhancing predictive accuracy and efficiency. EXPERT OPINION Through the analysis of traditional approaches such as in vivo rodent bioassays and in vitro assays, the review underscores the limitations and resource intensity associated with these methods. It provides expert insights into how ML offers innovative solutions to address these challenges, revolutionizing safety assessments in drug development.
Collapse
Affiliation(s)
- Nguyen Quoc Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
- AIBioMed Research Group, Taipei Medical University, Taipei, Taiwan
- Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei, Taiwan
- Translational Imaging Research Center, Taipei Medical University Hospital, Taipei, Taiwan
| | - Thi-Xuan Tran
- University of Economics and Business Administration, Thai Nguyen University, Thai Nguyen, Vietnam
| | - Phung-Anh Nguyen
- Clinical Data Center, Office of Data Science, Taipei Medical University, Taipei, Vietnam
- Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei Medical University, Taipei, Vietnam
| | - Trang-Thi Ho
- Department of Computer Science and Information Engineering, TamKang University, New Taipei, Taiwan
| | - Van-Nui Nguyen
- University of Information and Communication Technology, Thai Nguyen University, Thai Nguyen, Vietnam
| |
Collapse
|
2
|
Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, Patterson TA, Hong H. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood) 2023; 248:1952-1973. [PMID: 38057999 PMCID: PMC10798180 DOI: 10.1177/15353702231209421] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
The ever-increasing number of chemicals has raised public concerns due to their adverse effects on human health and the environment. To protect public health and the environment, it is critical to assess the toxicity of these chemicals. Traditional in vitro and in vivo toxicity assays are complicated, costly, and time-consuming and may face ethical issues. These constraints raise the need for alternative methods for assessing the toxicity of chemicals. Recently, due to the advancement of machine learning algorithms and the increase in computational power, many toxicity prediction models have been developed using various machine learning and deep learning algorithms such as support vector machine, random forest, k-nearest neighbors, ensemble learning, and deep neural network. This review summarizes the machine learning- and deep learning-based toxicity prediction models developed in recent years. Support vector machine and random forest are the most popular machine learning algorithms, and hepatotoxicity, cardiotoxicity, and carcinogenicity are the frequently modeled toxicity endpoints in predictive toxicology. It is known that datasets impact model performance. The quality of datasets used in the development of toxicity prediction models using machine learning and deep learning is vital to the performance of the developed models. The different toxicity assignments for the same chemicals among different datasets of the same type of toxicity have been observed, indicating benchmarking datasets is needed for developing reliable toxicity prediction models using machine learning and deep learning algorithms. This review provides insights into current machine learning models in predictive toxicology, which are expected to promote the development and application of toxicity prediction models in the future.
Collapse
Affiliation(s)
- Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Meng Song
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zoe Li
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| |
Collapse
|
3
|
Singh S, Karthikeyan C, Moorthy NSHN. Classification analysis of fatty acid synthase inhibitors using multialgorithms on topological descriptors and structural fingerprints. Chem Biol Drug Des 2023; 101:395-407. [PMID: 36065591 DOI: 10.1111/cbdd.14138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 08/21/2022] [Accepted: 08/28/2022] [Indexed: 01/14/2023]
Abstract
Fatty acid synthase (FASN) is one of the enzymes required for fatty acid biosynthesis and is expressed as low or absent in most normal cells/tissues. However, this enzyme is upregulated in various cancer cells; hence, it can act as an important target to design and develop novel FASN inhibitors for cancer therapy. In the present investigation, a series of structurally diverse compounds that possessed FASN inhibitory activities were subjected to classification analysis using different algorithms such as support vector machine, decision tree, Naïve Bayes and random forest. The physicochemical descriptors and MACCS fingerprints were calculated using PaDEL software, and the WEKA software was utilized for the classification model building. The statistical parameters/confusion matrix calculated from the analysis revealed that the selected models have significant predictive performances. The results showed that the topological properties of the molecules are the main determinant for the activity classification. The key descriptors comprised of hydrogen bonding groups, especially acceptor (nHBAcc, minHBint9, minHBint5 and nwHBa), charge on the topological surface of the molecules (JGI10 & GGI2), ionization potential (GATS5i and GATS1i) and branching and distance between the groups (ETA_Eta_B_RC) are significantly contributed in the classification models. Further, the presence of heteroatoms (MACCSFP82, MACCSFP93 and MACCSFP131), especially nitrogen atom(s) and hydrogen bond acceptor groups (N-N group, NC(=O)N, N-C(=O)), actively contributed to the inhibitory activities. The results concluded that the topological polar properties concentrated in a specific region have significant FASN inhibitory activity. Hence, these results shall be used to develop novel molecules with increased FASN inhibitory activity.
Collapse
Affiliation(s)
- Shailendra Singh
- Cancept Therapeutics Laboratory, Department of Pharmacy, Indira Gandhi National Tribal University, Amarkantak, India
| | - Chandrabose Karthikeyan
- Cancept Therapeutics Laboratory, Department of Pharmacy, Indira Gandhi National Tribal University, Amarkantak, India
| | | |
Collapse
|
4
|
Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space. Sci Rep 2021; 11:7628. [PMID: 33828175 PMCID: PMC8027643 DOI: 10.1038/s41598-021-87134-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2020] [Accepted: 03/24/2021] [Indexed: 02/01/2023] Open
Abstract
Cell-penetrating peptides (CPPs) are naturally able to cross the lipid bilayer membrane that protects cells. These peptides share common structural and physicochemical properties and show different pharmaceutical applications, among which drug delivery is the most important. Due to their ability to cross the membranes by pulling high-molecular-weight polar molecules, they are termed Trojan horses. In this study, we proposed a machine learning (ML)-based framework named BChemRF-CPPred (beyond chemical rules-based framework for CPP prediction) that uses an artificial neural network, a support vector machine, and a Gaussian process classifier to differentiate CPPs from non-CPPs, using structure- and sequence-based descriptors extracted from PDB and FASTA formats. The performance of our algorithm was evaluated by tenfold cross-validation and compared with those of previously reported prediction tools using an independent dataset. The BChemRF-CPPred satisfactorily identified CPP-like structures using natural and synthetic modified peptide libraries and also obtained better performance than those of previously reported ML-based algorithms, reaching the independent test accuracy of 90.66% (AUC = 0.9365) for PDB, and an accuracy of 86.5% (AUC = 0.9216) for FASTA input. Moreover, our analyses of the CPP chemical space demonstrated that these peptides break some molecular rules related to the prediction of permeability of therapeutic molecules in cell membranes. This is the first comprehensive analysis to predict synthetic and natural CPP structures and to evaluate their chemical space using an ML-based framework. Our algorithm is freely available for academic use at http://comptools.linc.ufpa.br/BChemRF-CPPred .
Collapse
|
5
|
Jiao Z, Hu P, Xu H, Wang Q. Machine Learning and Deep Learning in Chemical Health and Safety: A Systematic Review of Techniques and Applications. ACS CHEMICAL HEALTH & SAFETY 2020. [DOI: 10.1021/acs.chas.0c00075] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Zeren Jiao
- Mary Kay O’Connor Process Safety Center, Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843-3122, United States
| | - Pingfan Hu
- Mary Kay O’Connor Process Safety Center, Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843-3122, United States
| | - Hongfei Xu
- Mary Kay O’Connor Process Safety Center, Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843-3122, United States
| | - Qingsheng Wang
- Mary Kay O’Connor Process Safety Center, Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, Texas 77843-3122, United States
| |
Collapse
|
6
|
Active learning efficiently converges on rational limits of toxicity prediction and identifies patterns for molecule design. ACTA ACUST UNITED AC 2020. [DOI: 10.1016/j.comtox.2020.100129] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
7
|
Wang YW, Huang L, Jiang SW, Li K, Zou J, Yang SY. CapsCarcino: A novel sparse data deep learning tool for predicting carcinogens. Food Chem Toxicol 2020; 135:110921. [PMID: 31669597 DOI: 10.1016/j.fct.2019.110921] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 09/21/2019] [Accepted: 10/23/2019] [Indexed: 12/11/2022]
Affiliation(s)
- Yi-Wei Wang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, PR China; College of Preclinical Medicine, Southwest Medical University, Luzhou, Sichuan, 646000, PR China
| | - Lei Huang
- School of Computer Science & Engineer, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, PR China; Basic Teaching Department, Sichuan College of Architectural Technology, Deyang, Sichuan, 61800, PR China
| | - Si-Wen Jiang
- School of Computer Science & Engineer, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, PR China
| | - Kan Li
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, PR China
| | - Jun Zou
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, PR China.
| | - Sheng-Yong Yang
- State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, PR China.
| |
Collapse
|