1
|
Feitosa FL, F Cabral V, Sanches IH, Silva-Mendonca S, Borba JVVB, Braga RC, Andrade CH. Cyto-Safe: A Machine Learning Tool for Early Identification of Cytotoxic Compounds in Drug Discovery. J Chem Inf Model 2024; 64:9056-9062. [PMID: 39661446 DOI: 10.1021/acs.jcim.4c01811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2024]
Abstract
Cytotoxicity is essential in drug discovery, enabling early evaluation of toxic compounds during screenings to minimize toxicological risks. In vitro assays support high-throughput screening, allowing for efficient detection of toxic substances while considerably reducing the need for animal testing. Additionally, AI-based Quantitative Structure-Activity Relationship (AI-QSAR) models enhance early stage predictions by assessing the cytotoxic potential of molecular structures, which helps prioritize low-risk compounds for further validation. We present a freely accessible web application designed for identifying potential cytotoxic compounds utilizing QSAR models. This application utilizes machine learning techniques and is built on a data set of approximately 90,000 compounds, evaluated against two cell lines, 3T3 and HEK 293. Users can interact with the app by inputting a SMILES representation, uploading CSV or SDF files, or sketching molecules. The output includes a binary prediction for each cell line, a confidence percentage, and an explainable AI (XAI) analysis. Cyto-Safe web-app version 1.0 is available at http://insightai.labmol.com.br/.
Collapse
Affiliation(s)
- Francisco L Feitosa
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| | - Victoria F Cabral
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| | - Igor H Sanches
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| | - Sabrina Silva-Mendonca
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| | - Joyce V V B Borba
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| | | | - Carolina Horta Andrade
- Laboratory for Molecular Modeling and Drug Design (LabMol), Faculdade de Farmácia, Universidade Federal de Goiás, Goiânia, Goiás 74605-220, Brazil
- Center for the Research and Advancement in Fragments and molecular Targets (CRAFT), School of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, Ribeirão Preto, São Paulo 05508-220, Brazil
- Center for Excellence in Artificial Intelligence (CEIA), Institute of Informatics, Universidade Federal de Goiás, Goiânia, Goiás 74605-170, Brazil
| |
Collapse
|
2
|
Krasnov L, Tatarin S, Smirnov D, Bezzubov S. IrCytoToxDB: a dataset of iridium(III) complexes cytotoxicities against various cell lines. Sci Data 2024; 11:870. [PMID: 39127782 PMCID: PMC11316804 DOI: 10.1038/s41597-024-03735-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Accepted: 08/02/2024] [Indexed: 08/12/2024] Open
Abstract
Iridium(III) complexes nowadays became rising stars in various health-related applications. Thus, there is a necessity to assess cytotoxicity of the synthesized molecules against cancer/normal cell lines. In this report, we present a dataset of 2694 experimental cytotoxicity values of 803 iridium complexes against 127 different cell lines. We specify the experimental conditions and provide representation of the complexes molecules in machine-readable format. The dataset provides a starting point for exploration of new iridium-based cellular probes and opens new possibilities for predictions of toxicities and data-driven generation of new organometallic anticancer agents.
Collapse
Affiliation(s)
- Lev Krasnov
- N.S. Kurnakov Institute of General and Inorganic Chemistry, Russian Academy of Sciences, Leninskii pr. 31, Moscow, 119991, Russia
| | - Sergei Tatarin
- N.S. Kurnakov Institute of General and Inorganic Chemistry, Russian Academy of Sciences, Leninskii pr. 31, Moscow, 119991, Russia
| | - Daniil Smirnov
- N.S. Kurnakov Institute of General and Inorganic Chemistry, Russian Academy of Sciences, Leninskii pr. 31, Moscow, 119991, Russia
| | - Stanislav Bezzubov
- N.S. Kurnakov Institute of General and Inorganic Chemistry, Russian Academy of Sciences, Leninskii pr. 31, Moscow, 119991, Russia.
| |
Collapse
|
3
|
Yang X, Sun J, Jin B, Lu Y, Cheng J, Jiang J, Zhao Q, Shuai J. Multi-task aquatic toxicity prediction model based on multi-level features fusion. J Adv Res 2024:S2090-1232(24)00226-1. [PMID: 38844122 DOI: 10.1016/j.jare.2024.06.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 05/21/2024] [Accepted: 06/02/2024] [Indexed: 06/09/2024] Open
Abstract
INTRODUCTION With the escalating menace of organic compounds in environmental pollution imperiling the survival of aquatic organisms, the investigation of organic compound toxicity across diverse aquatic species assumes paramount significance for environmental protection. Understanding how different species respond to these compounds helps assess the potential ecological impact of pollution on aquatic ecosystems as a whole. Compared with traditional experimental methods, deep learning methods have higher accuracy in predicting aquatic toxicity, faster data processing speed and better generalization ability. OBJECTIVES This article presents ATFPGT-multi, an advanced multi-task deep neural network prediction model for organic toxicity. METHODS The model integrates molecular fingerprints and molecule graphs to characterize molecules, enabling the simultaneous prediction of acute toxicity for the same organic compound across four distinct fish species. Furthermore, to validate the advantages of multi-task learning, we independently construct prediction models, named ATFPGT-single, for each fish species. We employ cross-validation in our experiments to assess the performance and generalization ability of ATFPGT-multi. RESULTS The experimental results indicate, first, that ATFPGT-multi outperforms ATFPGT-single on four fish datasets with AUC improvements of 9.8%, 4%, 4.8%, and 8.2%, respectively, demonstrating the superiority of multi-task learning over single-task learning. Furthermore, in comparison with previous algorithms, ATFPGT-multi outperforms comparative methods, emphasizing that our approach exhibits higher accuracy and reliability in predicting aquatic toxicity. Moreover, ATFPGT-multi utilizes attention scores to identify molecular fragments associated with fish toxicity in organic molecules, as demonstrated by two organic molecule examples in the main text, demonstrating the interpretability of ATFPGT-multi. CONCLUSION In summary, ATFPGT-multi provides important support and reference for the further development of aquatic toxicity assessment. All of codes and datasets are freely available online at https://github.com/zhaoqi106/ATFPGT-multi.
Collapse
Affiliation(s)
- Xin Yang
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China; Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jianqiang Sun
- School of Information Science and Engineering, Linyi University, Linyi 276000, China
| | - Bingyu Jin
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China
| | - Yuer Lu
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jinyan Cheng
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China
| | - Jiaju Jiang
- College of Life Sciences, Sichuan University, Chengdu 610064, China
| | - Qi Zhao
- School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China.
| | - Jianwei Shuai
- Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou 325001, China; Oujiang Laboratory (Zhejiang Lab for Regenerative Medicine, Vision and Brain Health), Wenzhou 325001, China.
| |
Collapse
|
4
|
Liu H, Chen P, Hu B, Wang S, Wang H, Luan J, Wang J, Lin B, Cheng M. FaissMolLib: An efficient and easy deployable tool for ligand-based virtual screening. Comput Biol Chem 2024; 110:108057. [PMID: 38581840 DOI: 10.1016/j.compbiolchem.2024.108057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 03/06/2024] [Accepted: 03/20/2024] [Indexed: 04/08/2024]
Abstract
Virtual screening-based molecular similarity and fingerprint are crucial in drug design, target prediction, and ADMET prediction, aiding in identifying potential hits and optimizing lead compounds. However, challenges such as lack of comprehensive open-source molecular fingerprint databases and efficient search methods for virtual screening are prevalent. To address these issues, we introduce FaissMolLib, an open-source virtual screening tool that integrates 2.8 million compounds from ChEMBL and ZINC databases. Notably, FaissMolLib employs the highly efficient Faiss search algorithm, outperforming the Tanimoto algorithm in identifying similar molecules with its tighter clustering in scatter plots and lower mean, standard deviation, and variance in key molecular properties. This feature enables FaissMolLib to screen 2.8 million compounds in just 0.05 seconds, offering researchers an efficient, easily deployable solution for virtual screening on laptops and building unique compound databases. This significant advancement holds great potential for accelerating drug discovery efforts and enhancing chemical data analysis. FaissMolLib is freely available at http://liuhaihan.gnway.cc:80. The code and dataset of FaissMolLib are freely available at https://github.com/Superhaihan/FiassMolLib.
Collapse
Affiliation(s)
- Haihan Liu
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Peiying Chen
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Baichun Hu
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Shizun Wang
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Hanxun Wang
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Jiasi Luan
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Medical Devices, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China
| | - Jian Wang
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China.
| | - Bin Lin
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China.
| | - Maosheng Cheng
- Key Laboratory of Structure-Based Drug Design &Discovery of Ministry of Education, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; Key Laboratory of Intelligent Drug Design and New Drug Discovery of Liaoning Province, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China; School of Pharmaceutical Engineering, Shenyang Pharmaceutical University, Shenyang 110016, People's Republic of China.
| |
Collapse
|
5
|
Srivastava S, Jain P. Computational Approaches: A New Frontier in Cancer Research. Comb Chem High Throughput Screen 2024; 27:1861-1876. [PMID: 38031782 DOI: 10.2174/0113862073265604231106112203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 09/08/2023] [Accepted: 09/21/2023] [Indexed: 12/01/2023]
Abstract
Cancer is a broad category of disease that can start in virtually any organ or tissue of the body when aberrant cells assault surrounding organs and proliferate uncontrollably. According to the most recent statistics, cancer will be the cause of 10 million deaths worldwide in 2020, accounting for one death out of every six worldwide. The typical approach used in anti-cancer research is highly time-consuming and expensive, and the outcomes are not particularly encouraging. Computational techniques have been employed in anti-cancer research to advance our understanding. Recent years have seen a significant and exceptional impact on anticancer research due to the rapid development of computational tools for novel drug discovery, drug design, genetic studies, genome characterization, cancer imaging and detection, radiotherapy, cancer metabolomics, and novel therapeutic approaches. In this paper, we examined the various subfields of contemporary computational techniques, including molecular docking, artificial intelligence, bioinformatics, virtual screening, and QSAR, and their applications in the study of cancer.
Collapse
Affiliation(s)
- Shubham Srivastava
- Department of Pharmacy, IIMT College of Pharmacy, Uttar Pradesh, 201310, India
| | - Pushpendra Jain
- Department of Pharmacy, IIMT College of Pharmacy, Uttar Pradesh, 201310, India
| |
Collapse
|
6
|
Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, Patterson TA, Hong H. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood) 2023; 248:1952-1973. [PMID: 38057999 PMCID: PMC10798180 DOI: 10.1177/15353702231209421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023] Open
Abstract
The ever-increasing number of chemicals has raised public concerns due to their adverse effects on human health and the environment. To protect public health and the environment, it is critical to assess the toxicity of these chemicals. Traditional in vitro and in vivo toxicity assays are complicated, costly, and time-consuming and may face ethical issues. These constraints raise the need for alternative methods for assessing the toxicity of chemicals. Recently, due to the advancement of machine learning algorithms and the increase in computational power, many toxicity prediction models have been developed using various machine learning and deep learning algorithms such as support vector machine, random forest, k-nearest neighbors, ensemble learning, and deep neural network. This review summarizes the machine learning- and deep learning-based toxicity prediction models developed in recent years. Support vector machine and random forest are the most popular machine learning algorithms, and hepatotoxicity, cardiotoxicity, and carcinogenicity are the frequently modeled toxicity endpoints in predictive toxicology. It is known that datasets impact model performance. The quality of datasets used in the development of toxicity prediction models using machine learning and deep learning is vital to the performance of the developed models. The different toxicity assignments for the same chemicals among different datasets of the same type of toxicity have been observed, indicating benchmarking datasets is needed for developing reliable toxicity prediction models using machine learning and deep learning algorithms. This review provides insights into current machine learning models in predictive toxicology, which are expected to promote the development and application of toxicity prediction models in the future.
Collapse
Affiliation(s)
- Wenjing Guo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Jie Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Fan Dong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Meng Song
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zoe Li
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Md Kamrul Hasan Khan
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Tucker A Patterson
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA
| |
Collapse
|
7
|
Paes de Almeida V, Tolouei SEL, Minteguiaga M, Chaves DSDA, Heiden G, Khan SI, Trott J, Wang M, Dellacassa E, Raman V, Farago PV, Khan IA, Gasparotto Junior A, Manfron J. Chemical Profiles and Cytotoxic Activities of Essential Oils from Six Species of Baccharis Subgenus Coridifoliae (Asteraceae). Chem Biodivers 2023; 20:e202300862. [PMID: 37647349 DOI: 10.1002/cbdv.202300862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 08/29/2023] [Accepted: 08/30/2023] [Indexed: 09/01/2023]
Abstract
Several Baccharis species are popularly known in traditional medicine as "carquejas", "vassouras", "ervas-santas" and "mio-mios", and are used as anti-inflammatories, digestives, and diuretics. This study aimed to investigate the chemical compositions and cytotoxic activities of essential oils (EOs) of six Baccharis species belonging to subgenus Coridifoliae, namely B. albilanosa, B. coridifolia, B. erigeroides, B. napaea, B. ochracea, and B. pluricapitulata. GC/MS analyses of the EOs showed that the oxygenated sesquiterpenes spathulenol (7.32-38.22 %) and caryophyllene oxide (10.83-16.75 %) were the major components for all the species. The EOs of almost all species were cytotoxic against cancer (BT-549, KB, SK-MEL and SK-OV-3) and normal kidney (VERO and LLC-PK1) cell lines, whereas B. erigeroides EO showed cytotoxicity only against LLC-PK1. This article augments the current knowledge about the chemical-biological properties of Baccharis subgenus Coridifoliae and discusses the therapeutic potentials of these economically unexploited plants.
Collapse
Affiliation(s)
- Valter Paes de Almeida
- Postgraduate Program in Pharmaceutical Sciences, State University of Ponta Grossa, Ponta Grossa, Parana, Brazil
| | - Sara Emília Lima Tolouei
- Departamento de Farmacologia, Federal University of Parana, Curitiba, Parana, Brazil
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - Manuel Minteguiaga
- Espacio de Ciencia y Tecnología Química, Centro Universitario Regional Noreste, Universidad de la República, Tacuarembó, Uruguay
- Laboratorio de Biotecnología de Aromas, Facultad de Química, Universidad de la República, Montevideo, Uruguay
| | | | - Gustavo Heiden
- Embrapa Clima Temperado, Pelotas, Rio Grande do Sul, Brazil
| | - Shabana Iqrar Khan
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - John Trott
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - Mei Wang
- Natural Products Utilization Research Unit, Agricultural Research Service, United States Department of Agriculture, University, Mississippi, USA
| | - Eduardo Dellacassa
- Laboratorio de Biotecnología de Aromas, Facultad de Química, Universidad de la República, Montevideo, Uruguay
| | - Vijayasankar Raman
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - Paulo Vitor Farago
- Postgraduate Program in Pharmaceutical Sciences, State University of Ponta Grossa, Ponta Grossa, Parana, Brazil
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - Ikhlas Ahmed Khan
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| | - Arquimedes Gasparotto Junior
- Departamento de Farmacologia, Federal University of Parana, Curitiba, Parana, Brazil
- Laboratório de Farmacologia Cardiovascular, Faculty of Health Sciences, Federal University of Grande Dourados, Dourados, Mato Grosso do Sul, Brazil
| | - Jane Manfron
- Postgraduate Program in Pharmaceutical Sciences, State University of Ponta Grossa, Ponta Grossa, Parana, Brazil
- National Center for Natural Products Research, University of Mississippi, University, Mississippi, USA
| |
Collapse
|
8
|
Zamisa SJ, Adeleke AA, Devnarain N, Rhman MA, Owira PMO, Omondi B. The link between relative stability constant of DNA- and BSA-chromenopyrimidine complexes and cytotoxicity towards human breast cancer cells (MCF-7). RSC Adv 2023; 13:21820-21837. [PMID: 37475760 PMCID: PMC10354499 DOI: 10.1039/d3ra01741a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 07/11/2023] [Indexed: 07/22/2023] Open
Abstract
In this study, we synthesized and characterized ten chromenopyrimidine derivatives using analytical and spectroscopic methods. Studies on DNA and albumin binding affinity, as well as cytotoxicity tests on human breast cancer (MCF-7) cells, of the chromenopyrimidines, were conducted. The natural logarithm of the relative stability constant of DNA- and BSA-chromenopyrimidine complexes [ln(KDNA/KBSA)] was used as a criterion for selecting compounds for cytotoxicity studies. We found that ln(KDNA/KBSA) was inversely related to IC50 values of the compounds in MCF-7 cells. The antiproliferative effects of the compounds were found to induce apoptosis in MCF-7 cells, which is a desired mechanism of cell death. Correlations between the DNA and albumin binding affinities of chromenopyrimidines were established. We propose that this relationship approach can, for a given set of compounds, assist in predicting the cytotoxicity of potential drug candidates towards MCF-7 cells based on their experimentally determined CT-DNA and BSA binding affinities.
Collapse
Affiliation(s)
- Sizwe J Zamisa
- School of Chemistry and Physics, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| | - Adesola A Adeleke
- School of Chemistry and Physics, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| | - Nikita Devnarain
- Molecular and Clinical Pharmacology Research Laboratory, Department of Pharmacology, Discipline of Pharmaceutical Science, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| | - Mahasin Abdel Rhman
- Molecular and Clinical Pharmacology Research Laboratory, Department of Pharmacology, Discipline of Pharmaceutical Science, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| | - Peter M O Owira
- Molecular and Clinical Pharmacology Research Laboratory, Department of Pharmacology, Discipline of Pharmaceutical Science, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| | - Bernard Omondi
- School of Chemistry and Physics, University of KwaZulu-Natal Private Bag X54001 Durban 4000 South Africa
| |
Collapse
|
9
|
Li M, Zeng M, Zhang H, Chen H, Guan L. Biological Activity Predictions of Ligands Based on Hybrid Molecular Fingerprinting and Ensemble Learning. ACS OMEGA 2023; 8:5561-5570. [PMID: 36816680 PMCID: PMC9933080 DOI: 10.1021/acsomega.2c06944] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 12/23/2022] [Indexed: 06/18/2023]
Abstract
The biological activity predictions of ligands are an important research direction, which can improve the efficiency and success probability of drug screening. However, the traditional prediction method has the disadvantages of complex modeling and low screening efficiency. Machine learning is considered an important research direction to solve these traditional method problems in the near future. This paper proposes a machine learning model with high predictive accuracy and stable prediction ability, namely, the back propagation neural network cross-support vector regression model (BPCSVR). By comparing multiple molecular descriptors, MACCS fingerprint and ECFP6 fingerprint were selected as inputs, and the stable prediction ability of the model was improved by integrating multiple models and correcting similar samples. We used leave-one-out cross-validation on 3038 samples from six data sets. The coefficient of determination, root mean square error, and absolute error were used as the evaluation parameters. After comparing the multiclass models, the results show that the BPCSVR model has stable prediction ability in different data sets, and the prediction accuracy is higher than other comparison models.
Collapse
|
10
|
Xu X, Wang C, Gui B, Yuan X, Li C, Zhao Y, Martyniuk CJ, Su L. Application of machine learning to predict the inhibitory activity of organic chemicals on thyroid stimulating hormone receptor. ENVIRONMENTAL RESEARCH 2022; 212:113175. [PMID: 35351457 DOI: 10.1016/j.envres.2022.113175] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2021] [Revised: 03/04/2022] [Accepted: 03/22/2022] [Indexed: 06/14/2023]
Abstract
With the promotion of carbon neutrality, it is also important to synchronously promote the assessment and sustainable management of chemicals so as to protect public health. Humans and animals are possibly exposed to endocrine disruptors that have inhibitory effects on thyroid stimulating hormone receptor (TSHR). As such, it is important to identify chemicals that inhibit TSHR and to develop models to predict their inhibitory activity. In this study, 5952 compounds derived from a cyclic adenosine monophosphate (cAMP) analysis, a key signaling pathway in thyrocytes, were used to establish a binary classification model comparing methods that included random forest (RF), extreme gradient boosting (XGB), and logistic regression (LR). The prediction model based on RF showed the highest identification accuracy for revealing chemicals that may inhibit TSHR. For the RF model, recall was calculated at 0.89, balance accuracy was 0.85, and its receiver operating characteristic (ROC) curve-area under (AUC) was 0.92, indicating that the model had very high predictive capacity. The lowest CDocker energy (CE) and CDocker interaction energy (CIE) for chemicals and TSHR were determined and were subsequently introduced into the predictive model as descriptors. A regression model, extreme gradient boosting-Regression (XGBR), was successfully established yielding an R2 = 0.65 to predict inhibitory activity for active compounds. Parameters that included dissociation characteristics, molecular structure, and binding energy were all key factors in the predictive model. We demonstrate that QSAR models are useful approaches, not only for identifying chemicals that inhibit TSHR, but for predicting inhibitory activity of active compounds.
Collapse
Affiliation(s)
- Xiaotian Xu
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Chen Wang
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Bingxin Gui
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Xiangyi Yuan
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Chao Li
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Yuanhui Zhao
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China
| | - Christopher J Martyniuk
- Center for Environmental and Human Toxicology, Department of Physiological Sciences, College of Veterinary Medicine, UF Genetics Institute, Interdisciplinary Program in Biomedical Sciences Neuroscience, University of Florida, Gainesville, FL, 32611, USA
| | - Limin Su
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, 130117, PR China.
| |
Collapse
|
11
|
Yang P, Henle EA, Fern XZ, Simon CM. Classifying the toxicity of pesticides to honey bees via support vector machines with random walk graph kernels. J Chem Phys 2022; 157:034102. [DOI: 10.1063/5.0090573] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Pesticides benefit agriculture by increasing crop yield, quality, and security. However, pesticides may inadvertently harm bees, which are valuable as pollinators. Thus, candidate pesticides in development pipelines must be assessed for toxicity to bees. Leveraging a dataset of 382 molecules with toxicity labels from honey bee exposure experiments, we train a support vector machine (SVM) to predict the toxicity of pesticides to honey bees. We compare two representations of the pesticide molecules: (i) a random walk feature vector listing counts of length- L walks on the molecular graph with each vertex- and edge-label sequence and (ii) the Molecular ACCess System (MACCS) structural key fingerprint (FP), a bit vector indicating the presence/absence of a list of pre-defined subgraph patterns in the molecular graph. We explicitly construct the MACCS FPs but rely on the fixed-length- L random walk graph kernel (RWGK) in place of the dot product for the random walk representation. The L-RWGK-SVM achieves an accuracy, precision, recall, and F1 score (mean over 2000 runs) of 0.81, 0.68, 0.71, and 0.69, respectively, on the test data set—with L = 4 being the mode optimal walk length. The MACCS-FP-SVM performs on par/marginally better than the L-RWGK-SVM, lends more interpretability, but varies more in performance. We interpret the MACCS-FP-SVM by illuminating which subgraph patterns in the molecules tend to strongly push them toward the toxic/non-toxic side of the separating hyperplane.
Collapse
Affiliation(s)
- Ping Yang
- School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon 97331, USA
| | - E. Adrian Henle
- School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon 97331, USA
| | - Xiaoli Z. Fern
- School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, Oregon 97331, USA
| | - Cory M. Simon
- School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon 97331, USA
| |
Collapse
|
12
|
Ranasinghe JC, Jain A, Wu W, Zhang K, Wang Z, Huang S. Engineered 2D materials for optical bioimaging and path toward therapy and tissue engineering. JOURNAL OF MATERIALS RESEARCH 2022; 37:1689-1713. [PMID: 35615304 PMCID: PMC9122553 DOI: 10.1557/s43578-022-00591-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 05/04/2022] [Indexed: 06/15/2023]
Abstract
Two-dimensional (2D) layered materials as a new class of nanomaterial are characterized by a list of exotic properties. These layered materials are investigated widely in several biomedical applications. A comprehensive understanding of the state-of-the-art developments of 2D materials designed for multiple nanoplatforms will aid researchers in various fields to broaden the scope of biomedical applications. Here, we review the advances in 2D material-based biomedical applications. First, we introduce the classification and properties of 2D materials. Next, we summarize surface and structural engineering methods of 2D materials where we discuss surface functionalization, defect, and strain engineering, and creating heterostructures based on layered materials for biomedical applications. After that, we discuss different biomedical applications. Then, we briefly introduced the emerging role of machine learning (ML) as a technological advancement to boost biomedical platforms. Finally, the current challenges, opportunities, and prospects on 2D materials in biomedical applications are discussed. Graphical abstract
Collapse
Affiliation(s)
- Jeewan C. Ranasinghe
- Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| | - Arpit Jain
- Department of Materials Science and Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| | - Wenjing Wu
- Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| | - Kunyan Zhang
- Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| | - Ziyang Wang
- Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| | - Shengxi Huang
- Department of Electrical Engineering, The Pennsylvania State University, University Park, PA 16802 USA
| |
Collapse
|
13
|
Lim GP, Soon CF, Ma NL, Morsin M, Nayan N, Ahmad MK, Tee KS. Cytotoxicity of MXene-based nanomaterials for biomedical applications: A mini review. ENVIRONMENTAL RESEARCH 2021; 201:111592. [PMID: 34175291 DOI: 10.1016/j.envres.2021.111592] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 05/15/2021] [Accepted: 06/21/2021] [Indexed: 06/13/2023]
Abstract
MXene based nanomaterial is an uprising two-dimensional material gaining tremendous scientific attentions due to its versatile properties for the applications in electronic devices, power generation, sensors, drug delivery, and biomedicine. However, the cytotoxic effects of MXene still remained a huge concern. Therefore, stringent analysis of biocompatibility of MXene is an essential requirement before introduction to human physiological system. Several in vitro and in vivo toxicological studies have been reported to investigate the interactions between MXenes with living organisms such as microbes, mammalian cells and animal models. The biological response and cytotoxicity reported were dependent on the physicochemical properties of MXene. The biocompatibility and cytotoxicity of MXene were dependent on size, dose, and surface coating. This review demystifies the in vitro and in vivo biocompatibility studies associated with MXene. Various methods proposed to mitigate the cytotoxicity of MXene for in vivo applications were revealed. The machine learning methods were developed to predict the cytotoxicity of experimentally synthesized MXene compounds. Finally, we also discussed the current research gaps of applying MXenes in biomedical interventions.
Collapse
Affiliation(s)
- Gim Pao Lim
- Biosensor and Bioengineering Lab, Microelectronics and Nanotechnology-Shamsuddin Research Center, Institute for Integrated Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia
| | - Chin Fhong Soon
- Biosensor and Bioengineering Lab, Microelectronics and Nanotechnology-Shamsuddin Research Center, Institute for Integrated Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia.
| | - Nyuk Ling Ma
- Faculty of Science and Marine Environment, Universiti Malaysia Terengganu, 21030, Kuala Nerus, Terengganu, Malaysia
| | - Marlia Morsin
- Biosensor and Bioengineering Lab, Microelectronics and Nanotechnology-Shamsuddin Research Center, Institute for Integrated Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia
| | - Nafarizal Nayan
- Biosensor and Bioengineering Lab, Microelectronics and Nanotechnology-Shamsuddin Research Center, Institute for Integrated Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia
| | - Mohd Khairul Ahmad
- Biosensor and Bioengineering Lab, Microelectronics and Nanotechnology-Shamsuddin Research Center, Institute for Integrated Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia
| | - Kian Sek Tee
- Faculty of Electrical and Electronic Engineering, Universiti Tun Hussein Onn Malaysia, 86400, Parit Raja, Batu Pahat, Johor, Malaysia
| |
Collapse
|
14
|
Bai G, Wu T, Zhao L, Wang X, Li S, Ni X. CBDPS 1.0: A Python GUI Application for Machine Learning Models to Predict Bitter-Tasting Children's Oral Medicines. Chem Pharm Bull (Tokyo) 2021; 69:989-994. [PMID: 34421065 DOI: 10.1248/cpb.c20-00866] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Bitter tastes are innately aversive and are thought to help protect animals from consuming poisons. Children are extremely sensitive to drug tastes, and their compliance is especially poor with bitter medicine. Therefore, judging whether a drug is bitter and adopting flavor correction and taste-masking strategies are key to solving the problem of drug compliance in children. Although various machine learning models for bitterness and sweetness prediction have been reported in the literature, no learning model or bitterness database for children's medication has yet been reported. In this study, we trained four different machine learning models to predict bitterness. The goal of this study was to develop and validate a machine learning model called the "Children's Bitter Drug Prediction System" (CBDPS) based on Tkinter, which predicts the bitterness of a medicine based on its chemical structure. Users can enter the Simplified Molecular-Input Line-Entry System (SMILES) formula for a single compound or multiple compounds, and CBDPS will predict the bitterness of children's medicines made from those XGBoost-Molecular ACCess System (XgBoost-MACCS) model yielded an accuracy of 88% under cross-validation.
Collapse
Affiliation(s)
- Guoliang Bai
- Clinical Research Center, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health
| | - Tiantian Wu
- College of Economics and Management, Nanjing University of Aeronautics and Astronautics
| | - Libo Zhao
- Clinical Research Center, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health
| | - Xiaoling Wang
- Clinical Research Center, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health
| | - Shan Li
- College of Economics and Management, Nanjing University of Aeronautics and Astronautics
| | - Xin Ni
- Clinical Research Center, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health
| |
Collapse
|
15
|
Huang Y, Wang J, Wang S, Xu X, Qin W, Wen Y, Zhao YH, Martyniuk CJ. Discrimination of active and inactive substances in cytotoxicity based on Tox21 10K compound library: Structure alert and mode of action. Toxicology 2021; 462:152948. [PMID: 34530041 DOI: 10.1016/j.tox.2021.152948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/28/2021] [Accepted: 09/08/2021] [Indexed: 10/20/2022]
Abstract
In vitro cytotoxicity assay is an ideal alternative method for the in vivo toxicity in the risk assessment of pollutants in environment. However, modes of action (MOAs) of cytotoxicity have not been investigated for a wide range of compounds. In this paper, binomial and recursive partitioning analysis were carried out between the cytotoxicity and molecular descriptors for 8981 compounds. The results showed that cytotoxicity is strongly related to the chemical hydrophobicity and excess molar refraction, indicating the bio-uptake and chemical-receptor interaction through π and n electron pair play important roles in the cytotoxicity. The decision tree derived from recursive partitioning analysis revealed that the studied compounds could be divided into 25 groups and their structural characteristics could be used as structure alert to identify active and inactive compounds in cytotoxicity. The descriptors used in the decision tree revealed that chemical ionization and bioavailability could affect the cytotoxicity for ionizable and highly hydrophobic compounds. Comparison of MOAs based on Verhaar's classification scheme showed that many inert or less inert compounds were inactive substance, and many reactive or specifically-acting compounds were active substances in the cytotoxicity. In vitro toxicity assay instead of in vivo toxicity assay can be used in the environmental hazard and risk assessment of organic pollutants. The descriptors used in the binomial equation and decision tree reveal that chemical hydrophobicity, ionization and solubility play very important roles for identification of active and inactive compounds. The results obtained in this paper are valuable for understanding the modes of action in cytotoxicity and in vivo-in vitro toxicity relationship.
Collapse
Affiliation(s)
- Ying Huang
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China
| | - Jia Wang
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China
| | - Shuo Wang
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China
| | - Xiaotian Xu
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China
| | - Weichao Qin
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China
| | - Yang Wen
- Key Laboratory of Environmental Materials and Pollution Control, The Education Department of Jilin Province, School of Environmental Science and Engineering, Jilin Normal University, Siping, Jilin 136000, PR China.
| | - Yuan H Zhao
- State Environmental Protection Key Laboratory of Wetland Ecology and Vegetation Restoration, School of Environment, Northeast Normal University, Changchun, Jilin 130117, PR China.
| | - Christopher J Martyniuk
- Center for Environmental and Human Toxicology, Department of Physiological Sciences, College of Veterinary Medicine, UF Genetics Institute, Interdisciplinary Program in Biomedical Sciences Neuroscience, University of Florida, Gainesville, FL, 32611, USA
| |
Collapse
|
16
|
A Novel Machine Learning Model to Predict the Photo-Degradation Performance of Different Photocatalysts on a Variety of Water Contaminants. Catalysts 2021. [DOI: 10.3390/catal11091107] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This paper describes an innovative machine learning (ML) model to predict the performance of different metal oxide photocatalysts on a wide range of contaminants. The molecular structures of metal oxide photocatalysts are encoded with a crystal graph convolution neural network (CGCNN). The structure of organic compounds is encoded via digital molecular fingerprints (MF). The encoded features of the photocatalysts and contaminants are input to an artificial neural network (ANN), named as CGCNN-MF-ANN model. The CGCNN-MF-ANN model has achieved a very good prediction of the photocatalytic degradation rate constants by different photocatalysts over a wide range of organic contaminants. The effects of the data training strategy on the ML model performance are compared. The effects of different factors on photocatalytic degradation performance are further evaluated by feature importance analyses. Examples are illustrated on the use of this novel ML model for optimal photocatalyst selection and for assessing other types of photocatalysts for different environmental applications.
Collapse
|
17
|
MohammadiPeyhani H, Chiappino-Pepe A, Haddadi K, Hafner J, Hadadi N, Hatzimanikatis V. NICEdrug.ch, a workflow for rational drug design and systems-level analysis of drug metabolism. eLife 2021; 10:e65543. [PMID: 34340747 PMCID: PMC8331181 DOI: 10.7554/elife.65543] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 07/07/2021] [Indexed: 12/30/2022] Open
Abstract
The discovery of a drug requires over a decade of intensive research and financial investments - and still has a high risk of failure. To reduce this burden, we developed the NICEdrug.ch resource, which incorporates 250,000 bioactive molecules, and studied their enzymatic metabolic targets, fate, and toxicity. NICEdrug.ch includes a unique fingerprint that identifies reactive similarities between drug-drug and drug-metabolite pairs. We validated the application, scope, and performance of NICEdrug.ch over similar methods in the field on golden standard datasets describing drugs and metabolites sharing reactivity, drug toxicities, and drug targets. We use NICEdrug.ch to evaluate inhibition and toxicity by the anticancer drug 5-fluorouracil, and suggest avenues to alleviate its side effects. We propose shikimate 3-phosphate for targeting liver-stage malaria with minimal impact on the human host cell. Finally, NICEdrug.ch suggests over 1300 candidate drugs and food molecules to target COVID-19 and explains their inhibitory mechanism for further experimental screening. The NICEdrug.ch database is accessible online to systematically identify the reactivity of small molecules and druggable enzymes with practical applications in lead discovery and drug repurposing.
Collapse
Affiliation(s)
- Homa MohammadiPeyhani
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Anush Chiappino-Pepe
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Kiandokht Haddadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Jasmin Hafner
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Noushin Hadadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| |
Collapse
|
18
|
Seal S, Yang H, Vollmers L, Bender A. Comparison of Cellular Morphological Descriptors and Molecular Fingerprints for the Prediction of Cytotoxicity- and Proliferation-Related Assays. Chem Res Toxicol 2021; 34:422-437. [PMID: 33522793 DOI: 10.1021/acs.chemrestox.0c00303] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Cell morphology features, such as those from the Cell Painting assay, can be generated at relatively low costs and represent versatile biological descriptors of a system and thereby compound response. In this study, we explored cell morphology descriptors and molecular fingerprints, separately and in combination, for the prediction of cytotoxicity- and proliferation-related in vitro assay endpoints. We selected 135 compounds from the MoleculeNet ToxCast benchmark data set which were annotated with Cell Painting readouts, where the relatively small size of the data set is due to the overlap of required annotations. We trained Random Forest classification models using nested cross-validation and Cell Painting descriptors, Morgan and ErG fingerprints, and their combinations. While using leave-one-cluster-out cross-validation (with clusters based on physicochemical descriptors), models using Cell Painting descriptors achieved higher average performance over all assays (Balanced Accuracy of 0.65, Matthews Correlation Coefficient of 0.28, and AUC-ROC of 0.71) compared to models using ErG fingerprints (BA 0.55, MCC 0.09, and AUC-ROC 0.60) and Morgan fingerprints alone (BA 0.54, MCC 0.06, and AUC-ROC 0.56). While using random shuffle splits, the combination of Cell Painting descriptors with ErG and Morgan fingerprints further improved balanced accuracy on average by 8.9% (in 9 out of 12 assays) and 23.4% (in 8 out of 12 assays) compared to using only ErG and Morgan fingerprints, respectively. Regarding feature importance, Cell Painting descriptors related to nuclei texture, granularity of cells, and cytoplasm as well as cell neighbors and radial distributions were identified to be most contributing, which is plausible given the endpoint considered. We conclude that cell morphological descriptors contain complementary information to molecular fingerprints which can be used to improve the performance of predictive cytotoxicity models, in particular in areas of novel structural space.
Collapse
Affiliation(s)
- Srijit Seal
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Hongbin Yang
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Luis Vollmers
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
19
|
Feng H, Zhang L, Li S, Liu L, Yang T, Yang P, Zhao J, Arkin IT, Liu H. Predicting the reproductive toxicity of chemicals using ensemble learning methods and molecular fingerprints. Toxicol Lett 2021; 340:4-14. [PMID: 33421549 DOI: 10.1016/j.toxlet.2021.01.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2020] [Revised: 10/29/2020] [Accepted: 01/03/2021] [Indexed: 12/20/2022]
Abstract
Reproductive toxicity endpoints are a significant safety concern in the assessment of the adverse effects of chemicals in drug discovery. Computational models that can accurately predict a chemical's toxic potential are increasingly pursued to replace traditional animal experiments. Thus, ensemble learning models were built to predict the reproductive toxicity of compounds. Our ensemble models were developed using support vector machine, random forest, and extreme gradient boosting methods and 9 molecular fingerprints calculated for a dataset containing 1823 chemicals. The best prediction performance was achieved by the Ensemble-Top12 model, with an accuracy (ACC) of 86.33 %, a sensitivity (SEN) of 82.02 %, a specificity (SPE) of 90.19 %, and an area under the receiver operating characteristic curve (AUC) of 0.937 in 5-fold cross-validation and ACC, SEN, SPE, and AUC values of 84.38 %, 86.90 %, 90.67 %, and 0.920, respectively, in external validation. We also defined the applicability domain (AD) of the ensemble model by calculating the Tanimoto distance of the training set. Compared with models in existing literature, our ensemble model achieves relatively high ACC, SPE and AUC values. We also identified several fingerprint features related to chemical reproductive toxicity. Considering the performance of model, we recommend using the Ensemble-Top12 model to predict reproductive toxicity in early drug development.
Collapse
Affiliation(s)
- Huawei Feng
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang, 110036, China; Technology Innovation Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China
| | - Shimeng Li
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Lili Liu
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Tianzhou Yang
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Pengyu Yang
- School of Information, Liaoning University, Shenyang, 110036, China
| | - Jian Zhao
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Isaiah Tuvia Arkin
- Department of Biological Chemistry, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat-Ram, Jerusalem, 91904, Israel
| | - Hongsheng Liu
- Technology Innovation Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China; School of Pharmaceutical Science, Liaoning University, Shenyang, 110036, China.
| |
Collapse
|
20
|
Djawad YA, Kiely J, Luxton R. Classification of the mechanism of toxicity as applied to human cell line ECV304. Comput Methods Biomech Biomed Engin 2020; 24:933-944. [PMID: 33356573 DOI: 10.1080/10255842.2020.1861255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
The objective of this study was to identify the pattern of cytotoxicity testing of the human cell line ECV304 using three techniques of an ensemble learning algorithm (bagging, boosting and stacking). The study of cell morphology of ECV304 cell line was conducted using impedimetric measurement. Three types of toxins were applied to the ECV304 cell line namely 1 mM hydrogen peroxide (H2O2), 5% dimethyl sulfoxide and 10 μg Saponin. The measurement was conducted using electrodes and lock-in amplifier to detect impedance changes during cytotoxicity testing within a frequency range 200 and 830 kHz. The results were analysed, processed and extracted using detrended fluctuation analysis to obtain characteristics and features of the cells when exposed to the each of the toxins. Three ensemble algorithms applied showed slightly different results on the performance for classifying the data set from the feature extraction that was performed. However, the results show that the cell reaction to the toxins could be classified.
Collapse
Affiliation(s)
- Yasser Abd Djawad
- Department of Electronics, Universitas Negeri Makassar, Makassar, Indonesia
| | | | | |
Collapse
|
21
|
Constructing bi-plots for random forest: Tutorial. Anal Chim Acta 2020; 1131:146-155. [PMID: 32928475 DOI: 10.1016/j.aca.2020.06.043] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 06/15/2020] [Accepted: 06/16/2020] [Indexed: 01/29/2023]
Abstract
Current technological developments have allowed for a significant increase and availability of data. Consequently, this has opened enormous opportunities for the machine learning and data science field, translating into the development of new algorithms in a wide range of applications in medical, biomedical, daily-life, and national security areas. Ensemble techniques are among the pillars of the machine learning field, and they can be defined as approaches in which multiple, complex, independent/uncorrelated, predictive models are subsequently combined by either averaging or voting to yield a higher model performance. Random forest (RF), a popular ensemble method, has been successfully applied in various domains due to its ability to build predictive models with high certainty and little necessity of model optimization. RF provides both a predictive model and an estimation of the variable importance. However, the estimation of the variable importance is based on thousands of trees, and therefore, it does not specify which variable is important for which sample group. The present study demonstrates an approach based on the pseudo-sample principle that allows for construction of bi-plots (i.e. spin plots) associated with RF models. The pseudo-sample principle for RF. is explained and demonstrated by using two simulated datasets, and three different types of real data, which include political sciences, food chemistry and the human microbiome data. The pseudo-sample bi-plots, associated with RF and its unsupervised version, allow for a versatile visualization of multivariate models, and the variable importance and the relation among them.
Collapse
|
22
|
Marchwiany ME, Birowska M, Popielski M, Majewski JA, Jastrzębska AM. Surface-Related Features Responsible for Cytotoxic Behavior of MXenes Layered Materials Predicted with Machine Learning Approach. MATERIALS (BASEL, SWITZERLAND) 2020; 13:E3083. [PMID: 32664304 PMCID: PMC7412046 DOI: 10.3390/ma13143083] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 07/06/2020] [Accepted: 07/08/2020] [Indexed: 12/16/2022]
Abstract
To speed up the implementation of the two-dimensional materials in the development of potential biomedical applications, the toxicological aspects toward human health need to be addressed. Due to time-consuming and expensive analysis, only part of the continuously expanding family of 2D materials can be tested in vitro. The machine learning methods can be used-by extracting new insights from available biological data sets, and provide further guidance for experimental studies. This study identifies the most relevant highly surface-specific features that might be responsible for cytotoxic behavior of 2D materials, especially MXenes. In particular, two factors, namely, the presence of transition metal oxides and lithium atoms on the surface, are identified as cytotoxicity-generating features. The developed machine learning model succeeds in predicting toxicity for other 2D MXenes, previously not tested in vitro, and hence, is able to complement the existing knowledge coming from in vitro studies. Thus, we claim that it might be one of the solutions for reducing the number of toxicological studies needed, and allows for minimizing failures in future biological applications.
Collapse
Affiliation(s)
- Maciej E. Marchwiany
- Interdisciplinary Centre for Mathematical and Computational Modelling (ICM), University of Warsaw, Pawińskiego 5a, 02-106 Warsaw, Poland;
| | - Magdalena Birowska
- Faculty of Physics, University of Warsaw, Pasteura 5, 00-092 Warsaw, Poland; (M.P.); (J.A.M.)
| | - Mariusz Popielski
- Faculty of Physics, University of Warsaw, Pasteura 5, 00-092 Warsaw, Poland; (M.P.); (J.A.M.)
| | - Jacek A. Majewski
- Faculty of Physics, University of Warsaw, Pasteura 5, 00-092 Warsaw, Poland; (M.P.); (J.A.M.)
| | - Agnieszka M. Jastrzębska
- Faculty of Materials Science and Engineering, Warsaw University of Technology, Wołoska 141, 02-507 Warsaw, Poland;
| |
Collapse
|
23
|
Pirzada RH, Javaid N, Choi S. The Roles of the NLRP3 Inflammasome in Neurodegenerative and Metabolic Diseases and in Relevant Advanced Therapeutic Interventions. Genes (Basel) 2020; 11:E131. [PMID: 32012695 PMCID: PMC7074480 DOI: 10.3390/genes11020131] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2019] [Revised: 01/22/2020] [Accepted: 01/22/2020] [Indexed: 02/07/2023] Open
Abstract
Inflammasomes are intracellular multiprotein complexes in the cytoplasm that regulate inflammation activation in the innate immune system in response to pathogens and to host self-derived molecules. Recent advances greatly improved our understanding of the activation of nucleotide-binding oligomerization domain-like receptor (NLR) family pyrin domain containing 3 (NLRP3) inflammasomes at the molecular level. The NLRP3 belongs to the subfamily of NLRP which activates caspase 1, thus causing the production of proinflammatory cytokines (interleukin 1β and interleukin 18) and pyroptosis. This inflammasome is involved in multiple neurodegenerative and metabolic disorders including Alzheimer's disease, multiple sclerosis, type 2 diabetes mellitus, and gout. Therefore, therapeutic targeting to the NLRP3 inflammasome complex is a promising way to treat these diseases. Recent research advances paved the way toward drug research and development using a variety of machine learning-based and artificial intelligence-based approaches. These state-of-the-art approaches will lead to the discovery of better drugs after the training of such a system.
Collapse
Affiliation(s)
| | | | - Sangdun Choi
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, Korea; (R.H.P.); (N.J.)
| |
Collapse
|
24
|
Advancing Drug Discovery via Artificial Intelligence. Trends Pharmacol Sci 2019; 40:592-604. [DOI: 10.1016/j.tips.2019.06.004] [Citation(s) in RCA: 164] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 05/23/2019] [Accepted: 06/11/2019] [Indexed: 01/15/2023]
|