1
|
Meng J, Zhang L, He Z, Hu M, Liu J, Bao W, Tian Q, Feng H, Liu H. Development of a machine learning-based target-specific scoring function for structure-based binding affinity prediction for human dihydroorotate dehydrogenase inhibitors. J Comput Chem 2025; 46:e27510. [PMID: 39325045 DOI: 10.1002/jcc.27510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 08/21/2024] [Accepted: 09/11/2024] [Indexed: 09/27/2024]
Abstract
Human dihydroorotate dehydrogenase (hDHODH) is a flavin mononucleotide-dependent enzyme that can limit de novo pyrimidine synthesis, making it a therapeutic target for diseases such as autoimmune disorders and cancer. In this study, using the docking structures of complexes generated by AutoDock Vina, we integrate interaction features and ligand features, and employ support vector regression to develop a target-specific scoring function for hDHODH (TSSF-hDHODH). The Pearson correlation coefficient values of TSSF-hDHODH in the cross-validation and external validation are 0.86 and 0.74, respectively, both of which are far superior to those of classic scoring function AutoDock Vina and random forest (RF) based generic scoring function RF-Score. TSSF-hDHODH is further used for the virtual screening of potential inhibitors in the FDA-Approved & Pharmacopeia Drug Library. In conjunction with the results from molecular dynamics simulations, crizotinib is identified as a candidate for subsequent structural optimization. This study can be useful for the discovery of hDHODH inhibitors and the development of scoring functions for additional targets.
Collapse
Affiliation(s)
- Jinhui Meng
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
- Liaoning Provincial Key Laboratory of Computational Simulation and Information Processing of Biomacromolecules, Liaoning University, Shenyang, Liaoning, China
- Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, Liaoning, China
| | - Zhe He
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Mengfeng Hu
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Jinhan Liu
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Wenzhuo Bao
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Qifeng Tian
- School of Life Science, Liaoning University, Shenyang, Liaoning, China
| | - Huawei Feng
- School of Pharmacy, Liaoning University, Shenyang, Liaoning, China
| | - Hongsheng Liu
- Liaoning Provincial Key Laboratory of Computational Simulation and Information Processing of Biomacromolecules, Liaoning University, Shenyang, Liaoning, China
- Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, Liaoning, China
- School of Pharmacy, Liaoning University, Shenyang, Liaoning, China
| |
Collapse
|
2
|
Meng L, Zhou B, Liu H, Chen Y, Yuan R, Chen Z, Luo S, Chen H. Advancing toxicity studies of per- and poly-fluoroalkyl substances (pfass) through machine learning: Models, mechanisms, and future directions. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 946:174201. [PMID: 38936709 DOI: 10.1016/j.scitotenv.2024.174201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 06/17/2024] [Accepted: 06/20/2024] [Indexed: 06/29/2024]
Abstract
Perfluorinated and perfluoroalkyl substances (PFASs), encompassing a vast array of isomeric chemicals, are recognized as typical emerging contaminants with direct or potential impacts on human health and the ecological environment. With the complex and elusive toxicological profiles of PFASs, machine learning (ML) has been increasingly employed in their toxicity studies due to its proficiency in prediction and data analytics. This integration is poised to become a predominant trend in environmental toxicology, propelled by the swift advancements in computational technology. This review diligently examines the literature to encapsulate the varied objectives of employing ML in the toxicity studies of PFASs: (1) Utilizing ML to establish Quantitative Structure-Activity Relationship (QSAR) models for PFASs with diverse toxicity endpoints, facilitating the targeted toxicity prediction of unidentified PFASs; (2) Investigating and substantiating the Adverse Outcome Pathway (AOP) through the synergy of ML and traditional toxicological methods, with this refining the toxicity assessment framework for PFASs; (3) Dissecting and elucidating the features of established ML models to advance Open Research into the toxicity of PFASs, with a primary focus on determinants and mechanisms. The discourse extends to an in-depth examination of ML studies, segregating findings based on their distinct application trajectories. Given that ML represents a nascent paradigm within PFASs research, this review delineates the collective challenges encountered in the ML-mediated study of PFAS toxicity and proffers strategic guidance for ensuing investigations.
Collapse
Affiliation(s)
- Lingxuan Meng
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Beihai Zhou
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Haijun Liu
- School of Resources and Environment, Anqing Normal University, Anqing, China.
| | - Yuefang Chen
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China.
| | - Rongfang Yuan
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Zhongbing Chen
- Faculty of Environmental Sciences, Czech University of Life Sciences Prague, Kamýcká 129, 16500 Praha-Suchdol, Czech Republic.
| | - Shuai Luo
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China
| | - Huilun Chen
- Beijing Key Laboratory of Resource-oriented Treatment of Industrial Pollutants, School of Energy and Environmental Engineering, University of Science and Technology Beijing, Beijing 100083, China.
| |
Collapse
|
3
|
Sanches IH, Braga RC, Alves VM, Andrade CH. Enhancing hERG Risk Assessment with Interpretable Classificatory and Regression Models. Chem Res Toxicol 2024; 37:910-922. [PMID: 38781421 PMCID: PMC11187631 DOI: 10.1021/acs.chemrestox.3c00400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 04/22/2024] [Accepted: 05/14/2024] [Indexed: 05/25/2024]
Abstract
The human Ether-à-go-go-Related Gene (hERG) is a transmembrane protein that regulates cardiac action potential, and its inhibition can induce a potentially deadly cardiac syndrome. In vitro tests help identify hERG blockers at early stages; however, the high cost motivates searching for alternative, cost-effective methods. The primary goal of this study was to enhance the Pred-hERG tool for predicting hERG blockage. To achieve this, we developed new QSAR models that incorporated additional data, updated existing classificatory and multiclassificatory models, and introduced new regression models. Notably, we integrated SHAP (SHapley Additive exPlanations) values to offer a visual interpretation of these models. Utilizing the latest data from ChEMBL v30, encompassing over 14,364 compounds with hERG data, our binary and multiclassification models outperformed both the previous iteration of Pred-hERG and all publicly available models. Notably, the new version of our tool introduces a regression model for predicting hERG activity (pIC50). The optimal model demonstrated an R2 of 0.61 and an RMSE of 0.48, surpassing the only available regression model in the literature. Pred-hERG 5.0 now offers users a swift, reliable, and user-friendly platform for the early assessment of chemically induced cardiotoxicity through hERG blockage. The tool provides versatile outcomes, including (i) classificatory predictions of hERG blockage with prediction reliability, (ii) multiclassificatory predictions of hERG blockage with reliability, (iii) regression predictions with estimated pIC50 values, and (iv) probability maps illustrating the contribution of chemical fragments for each prediction. Furthermore, we implemented explainable AI analysis (XAI) to visualize SHAP values, providing insights into the contribution of each feature to binary classification predictions. A consensus prediction calculated based on the predictions of the three developed models is also present to assist the user's decision-making process. Pred-hERG 5.0 has been designed to be user-friendly, making it accessible to users without computational or programming expertise. The tool is freely available at http://predherg.labmol.com.br.
Collapse
Affiliation(s)
- Igor H. Sanches
- Laboratory
for Molecular Modeling and Drug Design (LabMol), Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for the Research and Advancement in Fragments and Molecular Targets
(CRAFT), School of Pharmaceutical Sciences at Ribeirao Preto, University of São Paulo, Ribeirão Preto, SP 05508-220, Brazil
| | | | - Vinicius M. Alves
- University
of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Carolina Horta Andrade
- Laboratory
for Molecular Modeling and Drug Design (LabMol), Faculty of Pharmacy, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, GO 74690-900, Brazil
- Center
for the Research and Advancement in Fragments and Molecular Targets
(CRAFT), School of Pharmaceutical Sciences at Ribeirao Preto, University of São Paulo, Ribeirão Preto, SP 05508-220, Brazil
| |
Collapse
|
4
|
Lin H, Wu H, Li H, Song A, Yin W. The essential role of GSTP1 I105V polymorphism in the prediction of CDNB metabolism and toxicity: In silico and in vitro insights. Toxicol In Vitro 2023; 90:105601. [PMID: 37031912 DOI: 10.1016/j.tiv.2023.105601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 04/02/2023] [Accepted: 04/04/2023] [Indexed: 04/11/2023]
Abstract
Humans are continuously exposed to toxic chemicals such as nitro-chlorobenzene (CDNB) through occupation, water, and even the air we breathe. Due to the severe toxicity caused by the high electrophilicity of CDNB, occupational and environmental exposure to CDNB can produce toxic effects that ultimately lead to cell damage. CDNB can be eliminated from organisms by binding to GSH, the catalytic product of glutathione S-transferase P1 (GSTP1). Therefore, GSTP1 plays an important role in the detoxification of CDNB. However, subtle variations in GSTP1 can result in single nucleotide polymorphisms (SNPs). Indeed, the correlation between the clinical outcome of the disease and certain genotypes of GSTP1 has been extensively studied, however, their impact on the metabolic detoxification of toxicants such as CDNB remains to be elucidated. Among the various SNPs of GSTP1, I105V has a significant effect on the catalytic activity of GSTP1. In this paper, a GSTP1 I105V polymorphism model was successfully established, and its effect on CDNB metabolism and toxicity was studied by computer analysis including molecular docking and molecular dynamics simulation. The result demonstrated that the binding capacity of CDNB decreases with the I105V mutation of GSTP1(p < 0.001), indicating the changes in its detoxification efficacy in CDNB-induced cell damage. Organisms expressing GSTP1 V105 are more susceptible to cell damage caused by CDNB than individuals expressing GSTP1 I105 (p < 0.001). In sum, the data in this study provide prospective insights into the mechanism and capacity of CDNB detoxification in the GSTP1 allele, extending the CDNB-mediated toxicological profile. In addition, the heterogeneity of the GSTP1 allele should be included in toxicological studies of individuals exposed to CDNB.
Collapse
Affiliation(s)
- Hao Lin
- The State Key Lab of Pharmaceutical Biotechnology, College of Life Sciences, Nanjing University, Nanjing, China
| | - Han Wu
- The State Key Lab of Pharmaceutical Biotechnology, College of Life Sciences, Nanjing University, Nanjing, China
| | - Hengda Li
- The State Key Lab of Pharmaceutical Biotechnology, College of Life Sciences, Nanjing University, Nanjing, China
| | - Aoqi Song
- Department of Pharmacy, Shanghai Fourth People's Hospital, School of Medicine, Tongji University, Shanghai, China.
| | - Wu Yin
- The State Key Lab of Pharmaceutical Biotechnology, College of Life Sciences, Nanjing University, Nanjing, China.
| |
Collapse
|
5
|
Vittorio S, Lunghini F, Pedretti A, Vistoli G, Beccari AR. Ensemble of structure and ligand-based classification models for hERG liability profiling. Front Pharmacol 2023; 14:1148670. [PMID: 37033661 PMCID: PMC10076575 DOI: 10.3389/fphar.2023.1148670] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 03/13/2023] [Indexed: 04/11/2023] Open
Abstract
Drug-induced cardiotoxicity represents one of the most critical safety concerns in the early stages of drug development. The blockade of the human ether-à-go-go-related potassium channel (hERG) is the most frequent cause of cardiotoxicity, as it is associated to long QT syndrome which can lead to fatal arrhythmias. Therefore, assessing hERG liability of new drugs candidates is crucial to avoid undesired cardiotoxic effects. In this scenario, computational approaches have emerged as useful tools for the development of predictive models able to identify potential hERG blockers. In the last years, several efforts have been addressed to generate ligand-based (LB) models due to the lack of experimental structural information about hERG channel. However, these methods rely on the structural features of the molecules used to generate the model and often fail in correctly predicting new chemical scaffolds. Recently, the 3D structure of hERG channel has been experimentally solved enabling the use of structure-based (SB) strategies which may overcome the limitations of the LB approaches. In this study, we compared the performances achieved by both LB and SB classifiers for hERG-related cardiotoxicity developed by using Random Forest algorithm and employing a training set containing 12789 hERG binders. The SB models were trained on a set of scoring functions computed by docking and rescoring calculations, while the LB classifiers were built on a set of physicochemical descriptors and fingerprints. Furthermore, models combining the LB and SB features were developed as well. All the generated models were internally validated by ten-fold cross-validation on the TS and further verified on an external test set. The former revealed that the best performance was achieved by the LB model, while the model combining the LB and the SB attributes displayed the best results when applied on the external test set highlighting the usefulness of the integration of LB and SB features in correctly predicting unseen molecules. Overall, our predictive models showed satisfactory performances providing new useful tools to filter out potential cardiotoxic drug candidates in the early phase of drug discovery.
Collapse
Affiliation(s)
- Serena Vittorio
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Milano, Italy
| | | | - Alessandro Pedretti
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Milano, Italy
| | - Giulio Vistoli
- Dipartimento di Scienze Farmaceutiche, Università Degli Studi di Milano, Milano, Italy
| | | |
Collapse
|
6
|
Shan M, Jiang C, Qin L, Cheng G. A Review of Computational Methods in Predicting hERG Channel Blockers. ChemistrySelect 2022. [DOI: 10.1002/slct.202201221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mengyi Shan
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Chen Jiang
- QuanMin RenZheng (HangZhou) Technology Co. Ltd. China
| | - Lu‐Ping Qin
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| | - Gang Cheng
- School of Pharmaceutical Sciences Zhejiang Chinese Medical University Hangzhou 310053 People's Republic of China
| |
Collapse
|
7
|
Li S, Wang L, Meng J, Zhao Q, Zhang L, Liu H. De Novo design of potential inhibitors against SARS-CoV-2 Mpro. Comput Biol Med 2022; 147:105728. [PMID: 35763931 PMCID: PMC9197785 DOI: 10.1016/j.compbiomed.2022.105728] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 05/31/2022] [Accepted: 06/11/2022] [Indexed: 11/16/2022]
Abstract
The impact of the ravages of COVID-19 on people's lives is obvious, and the development of novel potential inhibitors against SARS-CoV-2 main protease (Mpro), which has been validated as a potential target for drug design, is urgently needed. This study developed a model named MproI-GEN, which can be used for the de novo design of potential Mpro inhibitors (MproIs) based on deep learning. The model was mainly composed of long-short term memory modules, and the last layer was re-trained with transfer learning. The validity (0.9248), novelty (0.9668), and uniqueness (0.0652) of the designed potential MproI library (PMproIL) were evaluated, and the results showed that MproI-GEN could be used to design structurally novel and reasonable molecules. Additionally, PMproIL was filtered based on machine learning models and molecular docking. After filtering, the potential MproIs were verified with molecular dynamics simulations to evaluate the binding stability levels of these MproIs and SARS-CoV-2 Mpro, thereby illustrating the inhibitory effects of the potential MproIs against Mpro. Two potential MproIs were proposed in this study. This study provides not only new possibilities for the development of COVID-19 drugs but also a complete pipeline for the discovery of novel lead compounds.
Collapse
Affiliation(s)
- Shimeng Li
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Lianxin Wang
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Jinhui Meng
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, 114051, China.
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang, 110036, China; Shenyang Key Laboratory of Computer Simulating and Information Processing of Bio-macromolecules, Shenyang, 110036, China.
| | - Hongsheng Liu
- Shenyang Key Laboratory of Computer Simulating and Information Processing of Bio-macromolecules, Shenyang, 110036, China; Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Liaoning Province, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Shenyang, 110036, China; School of Pharmaceutical Sciences, Liaoning University, Shenyang, 110036, China.
| |
Collapse
|
8
|
Ding W, Nan Y, Wu J, Han C, Xin X, Li S, Liu H, Zhang L. Combining multi-dimensional molecular fingerprints to predict the hERG cardiotoxicity of compounds. Comput Biol Med 2022; 144:105390. [DOI: 10.1016/j.compbiomed.2022.105390] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 03/06/2022] [Accepted: 03/07/2022] [Indexed: 01/28/2023]
|