1
|
Khezri R, Jamaleddin Shahtaheri S, Khezri E, Niknam Shahrak M, Khadem M. In-silico green toxicology approach toward discovering safer ligands for development of safe-by-design metal-organic frameworks. Toxicol Mech Methods 2024; 34:821-832. [PMID: 38725267 DOI: 10.1080/15376516.2024.2353364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Accepted: 04/23/2024] [Indexed: 05/25/2024]
Abstract
A vast variety of chemical compounds have been fabricated and commercialized, they not only result in industrial exposure during manufacturing and usage, but also have environmental impacts throughout their whole life cycle. Consequently, attempts to assess the risk of chemicals in terms of toxicology have never ceased. In-silico toxicology, also known as predictive toxicology, has advanced significantly over the last decade as a result of the drawbacks of experimental investigations. In this study, ProTox-III was applied to predict the toxicity of the ligands used for metal-organic framework (MOF) design and synthesis. Initially, 35 ligands, that have been frequently utilized for MOF synthesis and fabrication, were selected. Subsequently, canonical simplified molecular-input line-entry system (SMILES) of ligands were extracted from the PUBCHEM database and inserted into the ProTox-III online server. Ultimately, webserver outputs including LD50 and the probability of toxicological endpoints (cytotoxicity, carcinogenicity, mutagenicity, immunotoxicity, and ecotoxicity) were obtained and organized. According to retrieved LD50 data, the safest ligand was 5-hydroxyisophthalic. In contrast, the most hazardous ligand was 5-chlorobenzimidazole, with an LD50 of 8 mg/kg. Among evaluated endpoints, ecotoxicity was the most active and was detected in several imidazolate ligands. This data can open new horizons in design and development of green MOFs.
Collapse
Affiliation(s)
- Reyhane Khezri
- Department of Occupational Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Seyed Jamaleddin Shahtaheri
- Department of Occupational Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| | - Elahe Khezri
- Student Research Committee, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mahdi Niknam Shahrak
- Department of Chemical Engineering, Faculty of Advanced Technologies, Quchan University of Technology, Quchan, Iran
| | - Monireh Khadem
- Department of Occupational Health Engineering, School of Public Health, Tehran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
2
|
Chong SG, Ismail IS, Chong CM, Mad Nasir N, Saleh Hodin NA. 1H NMR-metabolomics studies on acute toxicity effect of lead in adult zebrafish ( Danio rerio) model. Drug Chem Toxicol 2024; 47:573-586. [PMID: 38726945 DOI: 10.1080/01480545.2024.2346751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 04/09/2024] [Accepted: 04/18/2024] [Indexed: 09/04/2024]
Abstract
Zebrafish (Danio rerio) is ideal for studying the effects of toxins like lead or plumbum (Pb) which persist in the environment and harm body systems when absorbed. Increasing Pb concentration could result in a higher mortality rate and alteration of behavior and metabolism. The present study evaluates the acute toxicity effect of Pb on metabolome and behavior in adult zebrafish. The zebrafish were exposed to various Pb concentrations ranging from 0 to 30 mg/L for different periods (24, 48, and 72 h) before the fish samples were subjected to Nuclear Magnetic Resonance (NMR)-multivariate data analysis (MVDA) with additional support from behavioral assessment. The behavior of zebrafish was significantly altered after Pb inducement and the differential metabolites increased in low (5 mg/L) while decreased in high (10 mg/L) Pb concentrations. An ideal Pb induction could be achieved by 5 mg/L concentration in 24 h, which induced significant metabolite changes without irreversible damage. Continuing research on the effects of lead toxicity is crucial to develop effective prevention and treatment strategies.
Collapse
Affiliation(s)
- Siok-Geok Chong
- Natural Medicines and Products Research Laboratory (NaturMeds), Institute of Bioscience, Universiti Putra Malaysia (UPM), Serdang, Malaysia
| | - Intan Safinar Ismail
- Natural Medicines and Products Research Laboratory (NaturMeds), Institute of Bioscience, Universiti Putra Malaysia (UPM), Serdang, Malaysia
- Department of Chemistry, Faculty of Science, Universiti Putra Malaysia (UPM), Serdang, Malaysia
| | - Chou-Min Chong
- Department of Aquaculture, Faculty of Agricultural Sciences, Universiti Putra Malaysia (UPM), Serdang, Malaysia
| | - Nadiah Mad Nasir
- Department of Chemistry, Faculty of Science, Universiti Putra Malaysia (UPM), Serdang, Malaysia
| | - Nur Atikah Saleh Hodin
- Natural Medicines and Products Research Laboratory (NaturMeds), Institute of Bioscience, Universiti Putra Malaysia (UPM), Serdang, Malaysia
| |
Collapse
|
3
|
Li XL, Zhang JQ, Shen XJ, Zhang Y, Guo DA. Overview and limitations of database in global traditional medicines: A narrative review. Acta Pharmacol Sin 2024:10.1038/s41401-024-01353-1. [PMID: 39095509 DOI: 10.1038/s41401-024-01353-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 07/02/2024] [Indexed: 08/04/2024] Open
Abstract
The study of traditional medicine has garnered significant interest, resulting in various research areas including chemical composition analysis, pharmacological research, clinical application, and quality control. The abundance of available data has made databases increasingly essential for researchers to manage the vast amount of information and explore new drugs. In this article we provide a comprehensive overview and summary of 182 databases that are relevant to traditional medicine research, including 73 databases for chemical component analysis, 70 for pharmacology research, and 39 for clinical application and quality control from published literature (2000-2023). The review categorizes the databases by functionality, offering detailed information on websites and capacities to facilitate easier access. Moreover, this article outlines the primary function of each database, supplemented by case studies to aid in database selection. A practical test was conducted on 68 frequently used databases using keywords and functionalities, resulting in the identification of highlighted databases. This review serves as a reference for traditional medicine researchers to choose appropriate databases and also provides insights and considerations for the function and content design of future databases.
Collapse
Affiliation(s)
- Xiao-Lan Li
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Jian-Qing Zhang
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
| | - Xuan-Jing Shen
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yu Zhang
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - De-An Guo
- Shanghai Research Center for Modernization of Traditional Chinese Medicine, National Engineering Research Center of TCM Standardization Technology, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, 201203, China.
- University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
4
|
Banerjee P, Kemmler E, Dunkel M, Preissner R. ProTox 3.0: a webserver for the prediction of toxicity of chemicals. Nucleic Acids Res 2024; 52:W513-W520. [PMID: 38647086 PMCID: PMC11223834 DOI: 10.1093/nar/gkae303] [Citation(s) in RCA: 29] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 03/26/2024] [Accepted: 04/09/2024] [Indexed: 04/25/2024] Open
Abstract
Interaction with chemicals, present in drugs, food, environments, and consumer goods, is an integral part of our everyday life. However, depending on the amount and duration, such interactions can also result in adverse effects. With the increase in computational methods, the in silico methods can offer significant benefits to both regulatory needs and requirements for risk assessments and the pharmaceutical industry to assess the safety profile of a chemical. Here, we present ProTox 3.0, which incorporates molecular similarity and machine-learning models for the prediction of 61 toxicity endpoints such as acute toxicity, organ toxicity, clinical toxicity, molecular-initiating events (MOE), adverse outcomes (Tox21) pathways, several other toxicological endpoints and toxicity off-targets. All the ProTox 3.0 models are validated on independent external sets and have shown strong performance. ProTox envisages itself as a complete, freely available computational platform for in silico toxicity prediction for toxicologists, regulatory agencies, computational chemists, and medicinal chemists. The ProTox 3.0 webserver is free and open to all users, and there is no login requirement and can be accessed via https://tox.charite.de. The web server takes a 2D chemical structure as input and reports the toxicological profile of the compound for each endpoint with a confidence score and overall toxicity radar plot and network plot.
Collapse
Affiliation(s)
- Priyanka Banerjee
- Institute for Physiology & Science-IT, Charité – University Medicine Berlin, 10115 Berlin, Germany
- Member of the KFO 339: Food Allergy and Tolerance (Food@), Clinical Research Unit funded by the German Research Foundation, Berlin, Germany
| | - Emanuel Kemmler
- Institute for Physiology & Science-IT, Charité – University Medicine Berlin, 10115 Berlin, Germany
- Member of the KFO 339: Food Allergy and Tolerance (Food@), Clinical Research Unit funded by the German Research Foundation, Berlin, Germany
| | - Mathias Dunkel
- Institute for Physiology & Science-IT, Charité – University Medicine Berlin, 10115 Berlin, Germany
| | - Robert Preissner
- Institute for Physiology & Science-IT, Charité – University Medicine Berlin, 10115 Berlin, Germany
| |
Collapse
|
5
|
Cavaco T, Faria JMS. Phytochemical Volatiles as Potential Bionematicides with Safer Ecotoxicological Properties. TOXICS 2024; 12:406. [PMID: 38922086 PMCID: PMC11209200 DOI: 10.3390/toxics12060406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 05/27/2024] [Accepted: 05/31/2024] [Indexed: 06/27/2024]
Abstract
The management of plant-parasitic nematodes (PPNs) still relies on traditional nematicides that threaten the environment and human health. Novel solutions are urgently needed for PPN pest management that are effective while safeguarding non-target organisms. Volatile phytochemicals belong to a structurally diverse group of bioactive metabolites that are believed to hold safer environmental characteristics than synthetic pesticides. Nonetheless, not many studies have analysed the potential environmental benefits of shifting to these novel bionematicides. In the present study, 20 phytochemical volatiles with reported nematicidal activity were compared to traditional pesticides using specific parameters of environmental and human health safety available on applied online databases and predicted in silico through specialised software. Overall, the reviewed nematicidal phytochemicals were reportedly less toxic than synthetic nematicides. They were predicted to disperse to the air and soil environmental compartments and were reported to have a lower toxicity on aquatic organisms. On the contrary, the synthetic nematicides were reportedly toxic to aquatic organisms while showing a predicted high affinity to the water environmental compartment. As alternatives, β-keto or fatty acid derivatives, e.g., aliphatic alcohols or ketones, showed more adequate properties. This study highlights the importance of complementing studies on nematicidal activity with a risk assessment-based analysis to allow for a faster selection of nematicidal phytochemical volatiles and to leverage the development and implementation of bionematicides.
Collapse
Affiliation(s)
- Tomás Cavaco
- Instituto Nacional de Investigação Agrária e Veterinária (INIAV, I.P.), Quinta do Marquês, 2780-159 Oeiras, Portugal;
- Instituto Superior de Agronomia (ISA), Universidade de Lisboa, 1349-107 Lisboa, Portugal
| | - Jorge M. S. Faria
- Instituto Nacional de Investigação Agrária e Veterinária (INIAV, I.P.), Quinta do Marquês, 2780-159 Oeiras, Portugal;
- GREEN-IT Bioresources for Sustainability, Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa (ITQB NOVA), Av. da República, 2780-157 Oeiras, Portugal
| |
Collapse
|
6
|
Mir SA, Nayak B, Khan A, Khan MI, Eldakhakhny BM, Arif DO. An exploration of binding of Hesperidin, Rutin, and Thymoquinone to acetylcholinesterase enzyme using multi-level computational approaches. J Biomol Struct Dyn 2023:1-15. [PMID: 37811769 DOI: 10.1080/07391102.2023.2265492] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Accepted: 09/24/2023] [Indexed: 10/10/2023]
Abstract
Alzheimer's disease, an intricate neurological disorder, is impacting an ever-increasing number of individuals globally, particularly among the aging population. For several decades phytochemicals were used as Ayurveda to treat both communicable and non-communicable diseases. Acetylcholinesterase (AChE) is a widely chosen therapeutic target for the development of early prevention and effective management of neurodegenerative diseases. The primary objective of the present study was to investigate the binding potential between Rutin Thymoquinone, Hesperidin and the FDA-approved drug Donepezil with AChE. Additionally, a comparative analysis was conducted. These phytochemicals were docked with the binding site of the AChE experimental complex. The molecular dockings demonstrated that the Hesperidinh showed a better binding affinity of -22.0631 kcal/mol. The ADME/T investigations revealed that the selected phytochemicals are non-toxic and drug-like candidates. Molecular dynamics simulations were implemented to determine the conformational changes of Rutin, hesperidin, Thymoquinone, and Donepezil complexed with AChE. Hesperidin and Donepezil were more stable than Rutin, Thymoquinone complexed with AChE. Next, essential dynamics and defining the secondary structure of protein were to determine the conformational changes in AChE complexed with selected phytochemicals during simulations. Overall, the MD Simulations demonstrated that all complexes in this study achieved stability until 100 ns of the simulation period was performed thrice. The structural analysis of AChE was done using multiple search engines to explore the molecular functions, biological processes, and pathways in which AChE proteins are involved and to identify potential drug targets for various diseases. This present study concludes that Hesperidin was found to be a more potent AChE inhibitors than Rutin, and further experiments are required to determine the effectivity of Hesperidin against neurodegenerative diseases.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Showkat Ahmad Mir
- School of Life Sciences, Sambalpur University, Jyoti Vihar, Odisha, India
| | - Binata Nayak
- School of Life Sciences, Sambalpur University, Jyoti Vihar, Odisha, India
| | - Andleeb Khan
- Department of Biosciences, Faculty of Science, Integral University, Lucknow, India
| | - Mohammad Imran Khan
- Department of Biochemistry, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia
- Centre for Artificial Intelligence in Precision Medicines, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Basmah M Eldakhakhny
- Department of Clinical Biochemistry, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Deema O Arif
- Faculty of Medicine, Ibn Sina National College, Jeddah, Saudi Arabia
| |
Collapse
|
7
|
Alqahtani S. Improving on in-silico prediction of oral drug bioavailability. Expert Opin Drug Metab Toxicol 2023; 19:665-670. [PMID: 37728393 DOI: 10.1080/17425255.2023.2261366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 09/18/2023] [Indexed: 09/21/2023]
Abstract
INTRODUCTION Although significant development has been made in high-throughput screening of oral drug absorption and oral bioavailability, in silico prediction continues to play an important role in prediction of oral bioavailability and assisting in the proper selection of potential drug candidates. AREAS COVERED This review describes the improvements and latest modeling methods and algorithms available for the prediction of this important parameter. We performed a PubMed database search with a focus on the literature published in the last 15 years. EXPERT OPINION A tremendous efforts have been done in the past several years to develop reliable prediction tools that can provide accurate prediction for oral bioavailability. Several studies demonstrated new methodologies and techniques to develop either web-based in silico predictive tools or integrated PBPK models to predict oral bioavailability for new molecules. Improvements in the databases and the computational power will enhance the in silico prediction accuracy and reliability. Finally, introducing artificial intelligence to the drug development process will help improve the prediction tools.
Collapse
Affiliation(s)
- Saeed Alqahtani
- Department of Clinical Pharmacy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia
- Clinical Pharmacokinetics and Pharmacogenetics Unit, King Saud University Medical City, Riyadh, Saudi Arabia
| |
Collapse
|
8
|
Molecular modeling and simulations of some antiviral drugs, benzylisoquinoline alkaloid, and coumarin molecules to investigate the effects on Mpro main viral protease inhibition. Biochem Biophys Rep 2023; 34:101459. [PMID: 36987522 PMCID: PMC10037929 DOI: 10.1016/j.bbrep.2023.101459] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2023] [Revised: 03/14/2023] [Accepted: 03/15/2023] [Indexed: 03/26/2023] Open
Abstract
Background SARS-CoV-2 is a deadly viral disease and uncounted deaths occurs since its first appearance in the year 2019. The antiviral drugs, benzylisoquinoline alkaloids, and coumarin molecules were searched using different online engines for drug repurposing with SARS-CoV-2 and to investigate the effects on main viral protease (Mpro) upon their bindings. Methods A database composed of antiviral drugs, benzylisoquinoline alkaloids, and Coumarin molecules was screened through a molecular docking strategy to uncover the interactions of collected molecules with SARS-CoV-2 Mpro. Further, molecular dynamics simulations (MDS) were implemented for 100 ns to calculate the stability of the best complexed molecular scaffold with Mpro. The conformations of the simulated complexes were investigated by using principal component analysis (PCA) and Gibbs energy landscape (FEL) and DSSP together. Next, free binding energy (ΔGbind) was calculated using the mmpbsa method. Results Molecular docking simulations demonstrate 17 molecules exhibited better binding affinity out of 99 molecules present in the database with the viral protease Mpro, followed ADMET properties and were documented. The Coumarin-EM04 molecular scaffold exhibited interactions with catalytical dyad HIS41, CYS145, and neighboring amino acids SER165 and GLN189 in the catalytical site. The crucial factor RMSD was calculated to determine the orientations of Coumarin-EM04. The Coumarin-EM04 complexed with Mpro was found stable in the binding site during MDS. Furthermore, the free energy binding ΔGbind of Coumarin-EM04 was found to be −187.471 ± 2.230 kJ/mol, and for Remdesivir ΔGbind was −171.926 ± 2.237 kJ/mol with SARS-CoV-2 Mpro. Conclusion In this study, we identify potent molecules that exhibit interactions with catalytical dyad HIS41 and CYS145 amino acids and unravel Coumarin-EM04 exhibited ΔGbind higher than Remdesivir against Mpro and thus may serve better antiviral agent against SARS-CoV-2.
Collapse
|
9
|
Petroleum Hydrocarbon Catabolic Pathways as Targets for Metabolic Engineering Strategies for Enhanced Bioremediation of Crude-Oil-Contaminated Environments. FERMENTATION-BASEL 2023. [DOI: 10.3390/fermentation9020196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Anthropogenic activities and industrial effluents are the major sources of petroleum hydrocarbon contamination in different environments. Microbe-based remediation techniques are known to be effective, inexpensive, and environmentally safe. In this review, the metabolic-target-specific pathway engineering processes used for improving the bioremediation of hydrocarbon-contaminated environments have been described. The microbiomes are characterised using environmental genomics approaches that can provide a means to determine the unique structural, functional, and metabolic pathways used by the microbial community for the degradation of contaminants. The bacterial metabolism of aromatic hydrocarbons has been explained via peripheral pathways by the catabolic actions of enzymes, such as dehydrogenases, hydrolases, oxygenases, and isomerases. We proposed that by using microbiome engineering techniques, specific pathways in an environment can be detected and manipulated as targets. Using the combination of metabolic engineering with synthetic biology, systemic biology, and evolutionary engineering approaches, highly efficient microbial strains may be utilised to facilitate the target-dependent bioprocessing and degradation of petroleum hydrocarbons. Moreover, the use of CRISPR-cas and genetic engineering methods for editing metabolic genes and modifying degradation pathways leads to the selection of recombinants that have improved degradation abilities. The idea of growing metabolically engineered microbial communities, which play a crucial role in breaking down a range of pollutants, has also been explained. However, the limitations of the in-situ implementation of genetically modified organisms pose a challenge that needs to be addressed in future research.
Collapse
|
10
|
Wu L, Yan B, Han J, Li R, Xiao J, He S, Bo X. TOXRIC: a comprehensive database of toxicological data and benchmarks. Nucleic Acids Res 2023; 51:D1432-D1445. [PMID: 36400569 PMCID: PMC9825425 DOI: 10.1093/nar/gkac1074] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 10/10/2022] [Accepted: 10/26/2022] [Indexed: 11/20/2022] Open
Abstract
The toxic effects of compounds on environment, humans, and other organisms have been a major focus of many research areas, including drug discovery and ecological research. Identifying the potential toxicity in the early stage of compound/drug discovery is critical. The rapid development of computational methods for evaluating various toxicity categories has increased the need for comprehensive and system-level collection of toxicological data, associated attributes, and benchmarks. To contribute toward this goal, we proposed TOXRIC (https://toxric.bioinforai.tech/), a database with comprehensive toxicological data, standardized attribute data, practical benchmarks, informative visualization of molecular representations, and an intuitive function interface. The data stored in TOXRIC contains 113 372 compounds, 13 toxicity categories, 1474 toxicity endpoints covering in vivo/in vitro endpoints and 39 feature types, covering structural, target, transcriptome, metabolic data, and other descriptors. All the curated datasets of endpoints and features can be retrieved, downloaded and directly used as output or input to Machine Learning (ML)-based prediction models. In addition to serving as a data repository, TOXRIC also provides visualization of benchmarks and molecular representations for all endpoint datasets. Based on these results, researchers can better understand and select optimal feature types, molecular representations, and baseline algorithms for each endpoint prediction task. We believe that the rich information on compound toxicology, ML-ready datasets, benchmarks and molecular representation distribution can greatly facilitate toxicological investigations, interpretation of toxicological mechanisms, compound/drug discovery and the development of computational methods.
Collapse
Affiliation(s)
- Lianlian Wu
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China
| | - Bowei Yan
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Institute of Biomedical Sciences, Human Phenome Institute, Fudan University, Shanghai 200433, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing 102206, China
| | - Junshan Han
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
| | - Ruijiang Li
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
| | - Jian Xiao
- Department of Pharmacy, Xiangya Hospital, Central South University, Changsha 410008, Hunan, China
- Institute for Rational and Safe Medication Practices, National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha 410008, Hunan, China
| | - Song He
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
| | - Xiaochen Bo
- Department of Bioinformatics, Institute of Health Service and Transfusion Medicine, Beijing 100850, China
- Academy of Medical Engineering and Translational Medicine, Tianjin University, Tianjin 300072, China
| |
Collapse
|
11
|
Cavasotto CN, Scardino V. Machine Learning Toxicity Prediction: Latest Advances by Toxicity End Point. ACS OMEGA 2022; 7:47536-47546. [PMID: 36591139 PMCID: PMC9798519 DOI: 10.1021/acsomega.2c05693] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 11/28/2022] [Indexed: 05/29/2023]
Abstract
Machine learning (ML) models to predict the toxicity of small molecules have garnered great attention and have become widely used in recent years. Computational toxicity prediction is particularly advantageous in the early stages of drug discovery in order to filter out molecules with high probability of failing in clinical trials. This has been helped by the increase in the number of large toxicology databases available. However, being an area of recent application, a greater understanding of the scope and applicability of ML methods is still necessary. There are various kinds of toxic end points that have been predicted in silico. Acute oral toxicity, hepatotoxicity, cardiotoxicity, mutagenicity, and the 12 Tox21 data end points are among the most commonly investigated. Machine learning methods exhibit different performances on different data sets due to dissimilar complexity, class distributions, or chemical space covered, which makes it hard to compare the performance of algorithms over different toxic end points. The general pipeline to predict toxicity using ML has already been analyzed in various reviews. In this contribution, we focus on the recent progress in the area and the outstanding challenges, making a detailed description of the state-of-the-art models implemented for each toxic end point. The type of molecular representation, the algorithm, and the evaluation metric used in each research work are explained and analyzed. A detailed description of end points that are usually predicted, their clinical relevance, the available databases, and the challenges they bring to the field are also highlighted.
Collapse
Affiliation(s)
- Claudio N. Cavasotto
- Computational
Drug Design and Biomedical Informatics Laboratory, Instituto de Investigaciones
en Medicina Traslacional (IIMT), CONICET-Universidad
Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Facultad
de Ciencias Biomédicas, Facultad de Ingenierá, Universidad Austral, Pilar, B1630FHB Buenos
Aires, Argentina
| | - Valeria Scardino
- Austral
Institute for Applied Artificial Intelligence, Universidad Austral, Pilar, B1629AHJ Buenos Aires, Argentina
- Meton
AI, Inc., Wilmington, Delaware 19801, United
States
| |
Collapse
|
12
|
MohammadiPeyhani H, Chiappino-Pepe A, Haddadi K, Hafner J, Hadadi N, Hatzimanikatis V. NICEdrug.ch, a workflow for rational drug design and systems-level analysis of drug metabolism. eLife 2021; 10:e65543. [PMID: 34340747 PMCID: PMC8331181 DOI: 10.7554/elife.65543] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 07/07/2021] [Indexed: 12/30/2022] Open
Abstract
The discovery of a drug requires over a decade of intensive research and financial investments - and still has a high risk of failure. To reduce this burden, we developed the NICEdrug.ch resource, which incorporates 250,000 bioactive molecules, and studied their enzymatic metabolic targets, fate, and toxicity. NICEdrug.ch includes a unique fingerprint that identifies reactive similarities between drug-drug and drug-metabolite pairs. We validated the application, scope, and performance of NICEdrug.ch over similar methods in the field on golden standard datasets describing drugs and metabolites sharing reactivity, drug toxicities, and drug targets. We use NICEdrug.ch to evaluate inhibition and toxicity by the anticancer drug 5-fluorouracil, and suggest avenues to alleviate its side effects. We propose shikimate 3-phosphate for targeting liver-stage malaria with minimal impact on the human host cell. Finally, NICEdrug.ch suggests over 1300 candidate drugs and food molecules to target COVID-19 and explains their inhibitory mechanism for further experimental screening. The NICEdrug.ch database is accessible online to systematically identify the reactivity of small molecules and druggable enzymes with practical applications in lead discovery and drug repurposing.
Collapse
Affiliation(s)
- Homa MohammadiPeyhani
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Anush Chiappino-Pepe
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Kiandokht Haddadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Jasmin Hafner
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Noushin Hadadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| |
Collapse
|
13
|
Wu F, Zhou Y, Li L, Shen X, Chen G, Wang X, Liang X, Tan M, Huang Z. Computational Approaches in Preclinical Studies on Drug Discovery and Development. Front Chem 2020; 8:726. [PMID: 33062633 PMCID: PMC7517894 DOI: 10.3389/fchem.2020.00726] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2020] [Accepted: 07/14/2020] [Indexed: 12/11/2022] Open
Abstract
Because undesirable pharmacokinetics and toxicity are significant reasons for the failure of drug development in the costly late stage, it has been widely recognized that drug ADMET properties should be considered as early as possible to reduce failure rates in the clinical phase of drug discovery. Concurrently, drug recalls have become increasingly common in recent years, prompting pharmaceutical companies to increase attention toward the safety evaluation of preclinical drugs. In vitro and in vivo drug evaluation techniques are currently more mature in preclinical applications, but these technologies are costly. In recent years, with the rapid development of computer science, in silico technology has been widely used to evaluate the relevant properties of drugs in the preclinical stage and has produced many software programs and in silico models, further promoting the study of ADMET in vitro. In this review, we first introduce the two ADMET prediction categories (molecular modeling and data modeling). Then, we perform a systematic classification and description of the databases and software commonly used for ADMET prediction. We focus on some widely studied ADMT properties as well as PBPK simulation, and we list some applications that are related to the prediction categories and web tools. Finally, we discuss challenges and limitations in the preclinical area and propose some suggestions and prospects for the future.
Collapse
Affiliation(s)
- Fengxu Wu
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, China
| | - Yuquan Zhou
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Langhui Li
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Xianhuan Shen
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Ganying Chen
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Xiaoqing Wang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Xianyang Liang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- The Second School of Clinical Medicine, Guangdong Medical University, Dongguan, China
| | - Mengyuan Tan
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
| | - Zunnan Huang
- Key Laboratory of Big Data Mining and Precision Drug Design of Guangdong Medical University, Research Platform Service Management Center, Dongguan, China
- Key Laboratory for Research and Development of Natural Drugs of Guangdong Province, School of Pharmacy, Guangdong Medical University, Dongguan, China
- Marine Biomedical Research Institute of Guangdong Zhanjiang, Zhanjiang, China
| |
Collapse
|
14
|
Yang J, Wang D, Jia C, Wang M, Hao G, Yang G. Freely Accessible Chemical Database Resources of Compounds for In Silico Drug Discovery. Curr Med Chem 2020; 26:7581-7597. [PMID: 29737247 DOI: 10.2174/0929867325666180508100436] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Revised: 01/26/2018] [Accepted: 04/18/2018] [Indexed: 11/22/2022]
Abstract
BACKGROUND In silico drug discovery has been proved to be a solidly established key component in early drug discovery. However, this task is hampered by the limitation of quantity and quality of compound databases for screening. In order to overcome these obstacles, freely accessible database resources of compounds have bloomed in recent years. Nevertheless, how to choose appropriate tools to treat these freely accessible databases is crucial. To the best of our knowledge, this is the first systematic review on this issue. OBJECTIVE The existed advantages and drawbacks of chemical databases were analyzed and summarized based on the collected six categories of freely accessible chemical databases from literature in this review. RESULTS Suggestions on how and in which conditions the usage of these databases could be reasonable were provided. Tools and procedures for building 3D structure chemical libraries were also introduced. CONCLUSION In this review, we described the freely accessible chemical database resources for in silico drug discovery. In particular, the chemical information for building chemical database appears as attractive resources for drug design to alleviate experimental pressure.
Collapse
Affiliation(s)
- JingFang Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Di Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Chenyang Jia
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - Mengyao Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - GeFei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China
| | - GuangFu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan 430079, China.,Collaborative Innovation Center of Chemical Science and Engineering, Tianjin 300072, China
| |
Collapse
|
15
|
Vo AH, Van Vleet TR, Gupta RR, Liguori MJ, Rao MS. An Overview of Machine Learning and Big Data for Drug Toxicity Evaluation. Chem Res Toxicol 2019; 33:20-37. [DOI: 10.1021/acs.chemrestox.9b00227] [Citation(s) in RCA: 55] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Andy H. Vo
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Terry R. Van Vleet
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Rishi R. Gupta
- Information Research, Research and Development, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Michael J. Liguori
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Mohan S. Rao
- Department of Preclinical Safety, AbbVie, 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| |
Collapse
|
16
|
Banerjee P, Eckert AO, Schrey AK, Preissner R. ProTox-II: a webserver for the prediction of toxicity of chemicals. Nucleic Acids Res 2019; 46:W257-W263. [PMID: 29718510 PMCID: PMC6031011 DOI: 10.1093/nar/gky318] [Citation(s) in RCA: 1196] [Impact Index Per Article: 239.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 04/26/2018] [Indexed: 01/06/2023] Open
Abstract
Advancement in the field of computational research has made it possible for the in silico methods to offer significant benefits to both regulatory needs and requirements for risk assessments, and pharmaceutical industry to assess the safety profile of a chemical. Here, we present ProTox-II that incorporates molecular similarity, pharmacophores, fragment propensities and machine-learning models for the prediction of various toxicity endpoints; such as acute toxicity, hepatotoxicity, cytotoxicity, carcinogenicity, mutagenicity, immunotoxicity, adverse outcomes pathways (Tox21) and toxicity targets. The predictive models are built on data from both in vitro assays (e.g. Tox21 assays, Ames bacterial mutation assays, hepG2 cytotoxicity assays, Immunotoxicity assays) and in vivo cases (e.g. carcinogenicity, hepatotoxicity). The models have been validated on independent external sets and have shown strong performance. ProTox-II provides a freely available webserver for in silico toxicity prediction for toxicologists, regulatory agencies, computational and medicinal chemists, and all users without login at http://tox.charite.de/protox_II. The webserver takes a two-dimensional chemical structure as an input and reports the possible toxicity profile of the chemical for 33 models with confidence scores, and an overall toxicity radar chart along with three most similar compounds with known acute toxicity.
Collapse
Affiliation(s)
- Priyanka Banerjee
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Andreas O Eckert
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Anna K Schrey
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Robert Preissner
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany.,BB3R - Berlin Brandenburg 3R Graduate School, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
17
|
Qian T, Zhu S, Hoshida Y. Use of big data in drug development for precision medicine: an update. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2019; 4:189-200. [PMID: 31286058 PMCID: PMC6613936 DOI: 10.1080/23808993.2019.1617632] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 05/08/2019] [Indexed: 02/08/2023]
Abstract
INTRODUCTION Big-data-driven drug development resources and methodologies have been evolving with ever-expanding data from large-scale biological experiments, clinical trials, and medical records from participants in data collection initiatives. The enrichment of biological- and clinical-context-specific large-scale data has enabled computational inference more relevant to real-world biomedical research, particularly identification of therapeutic targets and drugs for specific diseases and clinical scenarios. AREAS COVERED Here we overview recent progresses made in the fields: new big-data-driven approach to therapeutic target discovery, candidate drug prioritization, inference of clinical toxicity, and machine-learning methods in drug discovery. EXPERT OPINION In the near future, much larger volumes and complex datasets for precision medicine will be generated, e.g., individual and longitudinal multi-omic, and direct-to-consumer datasets. Closer collaborations between experts with different backgrounds would also be required to better translate analytic results into prognosis and treatment in the clinical practice. Meanwhile, cloud computing with protected patient privacy would become more routine analytic practice to fill the gaps within data integration along with the advent of big-data. To conclude, integration of multitudes of data generated for each individual along with techniques tailored for big-data analytics may eventually enable us to achieve precision medicine.
Collapse
Affiliation(s)
- Tongqi Qian
- Department of Genetics and Genomic Sciences and Icahn
Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount
Sinai, New York, NY, USA
| | - Shijia Zhu
- Liver Tumor Translational Research Program, Simmons
Comprehensive Cancer Center, Division of Digestive and Liver Diseases, Department of
Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX
75390, USA
| | - Yujin Hoshida
- Liver Tumor Translational Research Program, Simmons
Comprehensive Cancer Center, Division of Digestive and Liver Diseases, Department of
Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX
75390, USA
| |
Collapse
|
18
|
Tang W, Chen J, Wang Z, Xie H, Hong H. Deep learning for predicting toxicity of chemicals: a mini review. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS 2019; 36:252-271. [PMID: 30821199 DOI: 10.1080/10590501.2018.1537563] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Humans and wildlife inhabit a world with panoply of natural and synthetic chemicals. Alarmingly, only a limited number of chemicals have undergone comprehensive toxicological evaluation due to limitations of traditional toxicity testing. High-throughput screening assays provide a higher-speed alternative for conventional toxicity testing. Advancement of high-throughput bioassay technology has greatly increased chemical toxicity data volumes in the past decade, pushing toxicology research into a "big data" era. However, traditional data analysis methods fail to effectively process large data volumes, presenting both a challenge and an opportunity for toxicologists. Deep learning, a machine learning method leveraging deep neural networks (DNNs), is a proven useful tool for building quantitative structure-activity relationship (QSAR) models for toxicity prediction utilizing these new large datasets. In this mini review, a brief technical background on DNNs is provided, and the current state of chemical toxicity prediction models built with DNNs is reviewed. In addition, relevant toxicity data sources are summarized, possible limitations are discussed, and perspectives on DNN utilization in chemical toxicity prediction are given.
Collapse
Affiliation(s)
- Weihao Tang
- a Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology , Dalian University of Technology , Dalian , China
| | - Jingwen Chen
- a Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology , Dalian University of Technology , Dalian , China
| | - Zhongyu Wang
- a Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology , Dalian University of Technology , Dalian , China
| | - Hongbin Xie
- a Key Laboratory of Industrial Ecology and Environmental Engineering (MOE), School of Environmental Science and Technology , Dalian University of Technology , Dalian , China
| | - Huixiao Hong
- b National Center for Toxicological Research , U.S. Food and Drug Administration , Jefferson , Arkansas , USA
| |
Collapse
|
19
|
Pu L, Naderi M, Liu T, Wu HC, Mukhopadhyay S, Brylinski M. eToxPred: a machine learning-based approach to estimate the toxicity of drug candidates. BMC Pharmacol Toxicol 2019; 20:2. [PMID: 30621790 PMCID: PMC6325674 DOI: 10.1186/s40360-018-0282-6] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2018] [Accepted: 12/26/2018] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The efficiency of drug development defined as a number of successfully launched new pharmaceuticals normalized by financial investments has significantly declined. Nonetheless, recent advances in high-throughput experimental techniques and computational modeling promise reductions in the costs and development times required to bring new drugs to market. The prediction of toxicity of drug candidates is one of the important components of modern drug discovery. RESULTS In this work, we describe eToxPred, a new approach to reliably estimate the toxicity and synthetic accessibility of small organic compounds. eToxPred employs machine learning algorithms trained on molecular fingerprints to evaluate drug candidates. The performance is assessed against multiple datasets containing known drugs, potentially hazardous chemicals, natural products, and synthetic bioactive compounds. Encouragingly, eToxPred predicts the synthetic accessibility with the mean square error of only 4% and the toxicity with the accuracy of as high as 72%. CONCLUSIONS eToxPred can be incorporated into protocols to construct custom libraries for virtual screening in order to filter out those drug candidates that are potentially toxic or would be difficult to synthesize. It is freely available as a stand-alone software at https://github.com/pulimeng/etoxpred .
Collapse
Affiliation(s)
- Limeng Pu
- Division of Electrical & Computer Engineering, Louisiana State University, Baton Rouge, LA, 70803, USA
| | - Misagh Naderi
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, 70803, USA
| | - Tairan Liu
- Department of Mechanical Engineering, Louisiana State University, Baton Rouge, LA, 70803, USA
| | - Hsiao-Chun Wu
- Division of Electrical & Computer Engineering, Louisiana State University, Baton Rouge, LA, 70803, USA
| | - Supratik Mukhopadhyay
- Department of Computer Science, Louisiana State University, Baton Rouge, LA, 70803, USA
| | - Michal Brylinski
- Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, 70803, USA. .,Center for Computation & Technology, Louisiana State University, Baton Rouge, LA, 70803, USA.
| |
Collapse
|
20
|
Mode-of-Action-Guided, Molecular Modeling-Based Toxicity Prediction: A Novel Approach for In Silico Predictive Toxicology. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2019. [DOI: 10.1007/978-3-030-16443-0_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
21
|
Li Y, Idakwo G, Thangapandian S, Chen M, Hong H, Zhang C, Gong P. Target-specific toxicity knowledgebase (TsTKb): a novel toolkit for in silico predictive toxicology. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS 2018; 36:219-236. [PMID: 30426823 DOI: 10.1080/10590501.2018.1537148] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
As the number of man-made chemicals increases at an unprecedented pace, efforts of quickly screening and accurately evaluating their potential adverse biological effects have been hampered by prohibitively high costs of in vivo/vitro toxicity testing. While it is unrealistic and unnecessary to test every uncharacterized chemical, it remains a major challenge to develop alternative in silico tools with high reliability and precision in toxicity prediction. To address this urgent need, we have developed a novel mode-of-action-guided, molecular modeling-based, and machine learning-enabled modeling approach for in silico chemical toxicity prediction. Here we introduce the core element of this approach, Target-specific Toxicity Knowledgebase (TsTKb), which consists of two main components: Chemical Mode of Action (ChemMoA) database and a suite of prediction model libraries.
Collapse
Affiliation(s)
- Yan Li
- a Bennett Aerospace Inc. , Cary , NC , USA
| | - Gabriel Idakwo
- b School of Computing Science and Computer Engineering , University of Southern Mississippi , Hattiesburg , MS , USA
| | - Sundar Thangapandian
- c Environmental Laboratory , US Army Engineer Research and Development Center , Vicksburg , MS , USA
| | - Minjun Chen
- d Division of Bioinformatics and Biostatistics , National Center for Toxicological Research, US Food and Drug Administration , Jefferson , AR , USA
| | - Huixiao Hong
- d Division of Bioinformatics and Biostatistics , National Center for Toxicological Research, US Food and Drug Administration , Jefferson , AR , USA
| | - Chaoyang Zhang
- b School of Computing Science and Computer Engineering , University of Southern Mississippi , Hattiesburg , MS , USA
| | - Ping Gong
- c Environmental Laboratory , US Army Engineer Research and Development Center , Vicksburg , MS , USA
| |
Collapse
|
22
|
Van Vleet TR, Liguori MJ, Lynch JJ, Rao M, Warder S. Screening Strategies and Methods for Better Off-Target Liability Prediction and Identification of Small-Molecule Pharmaceuticals. SLAS DISCOVERY 2018; 24:1-24. [PMID: 30196745 DOI: 10.1177/2472555218799713] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Pharmaceutical discovery and development is a long and expensive process that, unfortunately, still results in a low success rate, with drug safety continuing to be a major impedance. Improved safety screening strategies and methods are needed to more effectively fill this critical gap. Recent advances in informatics are now making it possible to manage bigger data sets and integrate multiple sources of screening data in a manner that can potentially improve the selection of higher-quality drug candidates. Integrated screening paradigms have become the norm in Pharma, both in discovery screening and in the identification of off-target toxicity mechanisms during later-stage development. Furthermore, advances in computational methods are making in silico screens more relevant and suggest that they may represent a feasible option for augmenting the current screening paradigm. This paper outlines several fundamental methods of the current drug screening processes across Pharma and emerging techniques/technologies that promise to improve molecule selection. In addition, the authors discuss integrated screening strategies and provide examples of advanced screening paradigms.
Collapse
Affiliation(s)
- Terry R Van Vleet
- 1 Department of Investigative Toxicology and Pathology, AbbVie, N Chicago, IL, USA
| | - Michael J Liguori
- 1 Department of Investigative Toxicology and Pathology, AbbVie, N Chicago, IL, USA
| | - James J Lynch
- 2 Department of Integrated Science and Technology, AbbVie, N Chicago, IL, USA
| | - Mohan Rao
- 1 Department of Investigative Toxicology and Pathology, AbbVie, N Chicago, IL, USA
| | - Scott Warder
- 3 Department of Target Enabling Science and Technology, AbbVie, N Chicago, IL, USA
| |
Collapse
|
23
|
Wu Y, Wang G. Machine Learning Based Toxicity Prediction: From Chemical Structural Description to Transcriptome Analysis. Int J Mol Sci 2018; 19:E2358. [PMID: 30103448 PMCID: PMC6121588 DOI: 10.3390/ijms19082358] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 07/31/2018] [Accepted: 08/08/2018] [Indexed: 02/07/2023] Open
Abstract
Toxicity prediction is very important to public health. Among its many applications, toxicity prediction is essential to reduce the cost and labor of a drug's preclinical and clinical trials, because a lot of drug evaluations (cellular, animal, and clinical) can be spared due to the predicted toxicity. In the era of Big Data and artificial intelligence, toxicity prediction can benefit from machine learning, which has been widely used in many fields such as natural language processing, speech recognition, image recognition, computational chemistry, and bioinformatics, with excellent performance. In this article, we review machine learning methods that have been applied to toxicity prediction, including deep learning, random forests, k-nearest neighbors, and support vector machines. We also discuss the input parameter to the machine learning algorithm, especially its shift from chemical structural description only to that combined with human transcriptome data analysis, which can greatly enhance prediction accuracy.
Collapse
Affiliation(s)
- Yunyi Wu
- Department of Biology, Guangdong Provincial Key Laboratory of Cell Microenviroment and Disease Research, Southern University of Science and Technology, Shenzhen 518055, China.
| | - Guanyu Wang
- Department of Biology, Guangdong Provincial Key Laboratory of Cell Microenviroment and Disease Research, Southern University of Science and Technology, Shenzhen 518055, China.
| |
Collapse
|
24
|
Günthardt BF, Hollender J, Hungerbühler K, Scheringer M, Bucheli TD. Comprehensive Toxic Plants-Phytotoxins Database and Its Application in Assessing Aquatic Micropollution Potential. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2018; 66:7577-7588. [PMID: 29944838 DOI: 10.1021/acs.jafc.8b01639] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
The production of toxic plant secondary metabolites (phytotoxins) for defense is a widespread phenomenon in the plant kingdom and is even present in agricultural crops. These phytotoxins may have similar characteristics to anthropogenic micropollutants in terms of persistence and toxicity. However, they are only rarely included in environmental risk assessments, partly because a systematic overview of phytotoxins is missing. Here, we present a newly developed, freely available database, Toxic Plants-PhytoToxins (TPPT), containing 1586 phytotoxins of potential ecotoxicological relevance in Central Europe linked to 844 plant species. Our database summarizes phytotoxin patterns in plant species and provides detailed biological and chemical information as well as in silico estimated properties. Using the database, we evaluated phytotoxins regarding occurrence, approximated from the frequencies of Swiss plant species; environmental behavior based on aquatic persistence and mobility; and toxicity. The assessment showed that over 34% of all phytotoxins are potential aquatic micropollutants and should be included in environmental investigations.
Collapse
Affiliation(s)
- Barbara F Günthardt
- Environmental Analytics , Agroscope , Reckenholzstrasse 191 , 8046 Zürich , Switzerland
- Institute of Biogeochemistry and Pollutant Dynamics , ETH Zurich , Universitätsstrasse 16 , 8092 Zürich , Switzerland
| | - Juliane Hollender
- Institute of Biogeochemistry and Pollutant Dynamics , ETH Zurich , Universitätsstrasse 16 , 8092 Zürich , Switzerland
- Swiss Federal Institute of Aquatic Science and Technology (Eawag) , Überlandstrasse 133 , 8600 Dübendorf , Switzerland
| | - Konrad Hungerbühler
- Institute for Chemical and Bioengineering , ETH Zurich , Wolfgang-Pauli-Strasse 10 , 8093 Zürich , Switzerland
| | - Martin Scheringer
- Institute for Chemical and Bioengineering , ETH Zurich , Wolfgang-Pauli-Strasse 10 , 8093 Zürich , Switzerland
- RECETOX , Masaryk University , Kamenice 753/5 , 625 00 Brno , Czech Republic
| | - Thomas D Bucheli
- Environmental Analytics , Agroscope , Reckenholzstrasse 191 , 8046 Zürich , Switzerland
| |
Collapse
|
25
|
Banerjee P, Eckert AO, Schrey AK, Preissner R. ProTox-II: a webserver for the prediction of toxicity of chemicals. Nucleic Acids Res 2018. [PMID: 29718510 DOI: 10.1093/nar/gky318%jnucleicacidsresearch] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Abstract
Advancement in the field of computational research has made it possible for the in silico methods to offer significant benefits to both regulatory needs and requirements for risk assessments, and pharmaceutical industry to assess the safety profile of a chemical. Here, we present ProTox-II that incorporates molecular similarity, pharmacophores, fragment propensities and machine-learning models for the prediction of various toxicity endpoints; such as acute toxicity, hepatotoxicity, cytotoxicity, carcinogenicity, mutagenicity, immunotoxicity, adverse outcomes pathways (Tox21) and toxicity targets. The predictive models are built on data from both in vitro assays (e.g. Tox21 assays, Ames bacterial mutation assays, hepG2 cytotoxicity assays, Immunotoxicity assays) and in vivo cases (e.g. carcinogenicity, hepatotoxicity). The models have been validated on independent external sets and have shown strong performance. ProTox-II provides a freely available webserver for in silico toxicity prediction for toxicologists, regulatory agencies, computational and medicinal chemists, and all users without login at http://tox.charite.de/protox_II. The webserver takes a two-dimensional chemical structure as an input and reports the possible toxicity profile of the chemical for 33 models with confidence scores, and an overall toxicity radar chart along with three most similar compounds with known acute toxicity.
Collapse
Affiliation(s)
- Priyanka Banerjee
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Andreas O Eckert
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Anna K Schrey
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany
| | - Robert Preissner
- Structural Bioinformatics Group, Institute for Physiology & ECRC, Charité - University Medicine Berlin, 10115 Berlin, Germany.,BB3R - Berlin Brandenburg 3R Graduate School, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
26
|
Identification of Estrogen Receptor α Antagonists from Natural Products via In Vitro and In Silico Approaches. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2018; 2018:6040149. [PMID: 29861831 PMCID: PMC5971309 DOI: 10.1155/2018/6040149] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 04/01/2018] [Accepted: 04/11/2018] [Indexed: 12/14/2022]
Abstract
Estrogen receptor α (ERα) is a successful target for ER-positive breast cancer and also reported to be relevant in many other diseases. Selective estrogen receptor modulators (SERMs) make a good therapeutic effect in clinic. Because of the drug resistance and side effects of current SERMs, the discovery of new SERMs is given more and more attention. Virtual screening is a validated method to high effectively to identify novel bioactive small molecules. Ligand-based machine learning methods and structure-based molecular docking were first performed for identification of ERα antagonist from in-house natural product library. Naive Bayesian and recursive partitioning models with two kinds of descriptors were built and validated based on training set, test set, and external test set and then were utilized for distinction of active and inactive compounds. Totally, 162 compounds were predicted as ER antagonists and were further evaluated by molecular docking. According to docking score, we selected 8 representative compounds for both ERα competitor assay and luciferase reporter gene assay. Genistein, daidzein, phloretin, ellagic acid, ursolic acid, (-)-epigallocatechin-3-gallate, kaempferol, and naringenin exhibited different levels for antagonistic activity against ERα. These studies validated the feasibility of machine learning methods for predicting bioactivities of ligands and provided better insight into the natural products acting as estrogen receptor modulator, which are important lead compounds for future new drug design.
Collapse
|
27
|
Yang H, Sun L, Li W, Liu G, Tang Y. In Silico Prediction of Chemical Toxicity for Drug Design Using Machine Learning Methods and Structural Alerts. Front Chem 2018; 6:30. [PMID: 29515993 PMCID: PMC5826228 DOI: 10.3389/fchem.2018.00030] [Citation(s) in RCA: 108] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2018] [Accepted: 02/05/2018] [Indexed: 12/17/2022] Open
Abstract
During drug development, safety is always the most important issue, including a variety of toxicities and adverse drug effects, which should be evaluated in preclinical and clinical trial phases. This review article at first simply introduced the computational methods used in prediction of chemical toxicity for drug design, including machine learning methods and structural alerts. Machine learning methods have been widely applied in qualitative classification and quantitative regression studies, while structural alerts can be regarded as a complementary tool for lead optimization. The emphasis of this article was put on the recent progress of predictive models built for various toxicities. Available databases and web servers were also provided. Though the methods and models are very helpful for drug design, there are still some challenges and limitations to be improved for drug safety assessment in the future.
Collapse
Affiliation(s)
| | | | | | | | - Yun Tang
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai, China
| |
Collapse
|
28
|
Canault B, Bourg S, Vayer P, Bonnet P. Comprehensive Network Map of ADME-Tox Databases. Mol Inform 2017; 36. [DOI: 10.1002/minf.201700029] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 06/14/2017] [Indexed: 01/04/2023]
Affiliation(s)
- Baptiste Canault
- Institut de Chimie Organique et Analytique (ICOA); Université d'Orléans et CNRS; UMR7311, BP 6759 45067 Orléans France
| | - Stéphane Bourg
- Institut de Chimie Organique et Analytique (ICOA); Université d'Orléans et CNRS; UMR7311, BP 6759 45067 Orléans France
| | - Philippe Vayer
- Technologie Servier; 25-27 rue Eugène Vignat, BP 11749 45007 Orléans cedex 1 France
| | - Pascal Bonnet
- Institut de Chimie Organique et Analytique (ICOA); Université d'Orléans et CNRS; UMR7311, BP 6759 45067 Orléans France
| |
Collapse
|
29
|
Backman TWH, Evans DS, Girke T. Large-scale bioactivity analysis of the small-molecule assayed proteome. PLoS One 2017; 12:e0171413. [PMID: 28178331 PMCID: PMC5298297 DOI: 10.1371/journal.pone.0171413] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2016] [Accepted: 01/20/2017] [Indexed: 12/12/2022] Open
Abstract
This study presents an analysis of the small molecule bioactivity profiles across large quantities of diverse protein families represented in PubChem BioAssay. We compared the bioactivity profiles of FDA approved drugs to non-FDA approved compounds, and report several distinct patterns characteristic of the approved drugs. We found that a large fraction of the previously reported higher target promiscuity among FDA approved compounds, compared to non-FDA approved bioactives, was frequently due to cross-reactivity within rather than across protein families. We identified 804 potentially novel protein target candidates for FDA approved drugs, as well as 901 potentially novel target candidates with active non-FDA approved compounds, but no FDA approved drugs with activity against these targets. We also identified 486348 potentially novel compounds active against the same targets as FDA approved drugs, as well as 153402 potentially novel compounds active against targets without active FDA approved drugs. By quantifying the agreement among replicated screens, we estimated that more than half of these novel outcomes are reproducible. Using biclustering, we identified many dense clusters of FDA approved drugs with enriched activity against a common set of protein targets. We also report the distribution of compound promiscuity using a Bayesian statistical model, and report the sensitivity and specificity of two common methods for identifying promiscuous compounds. Aggregator assays exhibited greater accuracy in identifying highly promiscuous compounds, while PAINS substructures were able to identify a much larger set of "middle range" promiscuous compounds. Additionally, we report a large number of promiscuous compounds not identified as aggregators or PAINS. In summary, the results of this study represent a rich reference for selecting novel drug and target protein candidates, as well as for eliminating candidate compounds with unselective activities.
Collapse
Affiliation(s)
- Tyler William H. Backman
- Department of Bioengineering, University of California Riverside, Riverside, California, United States of America
- Institute for Integrative Genome Biology, University of California Riverside, Riverside, California, United States of America
| | - Daniel S. Evans
- California Pacific Medical Center Research Institute, San Francisco, California, United States of America
| | - Thomas Girke
- Institute for Integrative Genome Biology, University of California Riverside, Riverside, California, United States of America
- * E-mail:
| |
Collapse
|
30
|
Abstract
![]()
Despite a large and
rapidly growing body of small molecule bioactivity
screens available in the public domain, systematic leverage of the
data to assess target druggability and compound selectivity has been
confounded by a lack of suitable cross-target analysis software. We
have developed bioassayR, a computational tool that enables simultaneous
analysis of thousands of bioassay experiments performed over a diverse
set of compounds and biological targets. Unique features include support
for large-scale cross-target analyses of both public and custom bioassays,
generation of high throughput screening fingerprints (HTSFPs), and
an optional preloaded database that provides access to a substantial
portion of publicly available bioactivity data. bioassayR is implemented
as an open-source R/Bioconductor package available from https://bioconductor.org/packages/bioassayR/.
Collapse
Affiliation(s)
- Tyler William H Backman
- Institute for Integrative Genome Biology, University of California, Riverside , Riverside, California 92521, United States
| | - Thomas Girke
- Institute for Integrative Genome Biology, University of California, Riverside , Riverside, California 92521, United States
| |
Collapse
|
31
|
De Sotto R, Medriano C, Cho Y, Seok KS, Park Y, Kim S. Significance of metabolite extraction method for evaluating sulfamethazine toxicity in adult zebrafish using metabolomics. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2016; 127:127-134. [PMID: 26827276 DOI: 10.1016/j.ecoenv.2016.01.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2015] [Revised: 01/12/2016] [Accepted: 01/13/2016] [Indexed: 06/05/2023]
Abstract
Recently, environmental metabolomics has been introduced as a next generation environmental toxicity method which helps in evaluating toxicity of bioactive compounds to non-target organisms. In general, efficient metabolite extraction from target cells is one of the keys to success to better understand the effects of toxic substances to organisms. In this regard, the aim of this study is (1) to compare two sample extraction methods in terms of abundance and quality of metabolites and (2) investigate how this could lead to difference in data interpretation using pathway analysis. For this purpose, the antibiotic sulfamethazine and zebrafish (Danio rerio) were selected as model toxic substance and target organism, respectively. The zebrafish was exposed to four different sulfamethazine concentrations (0, 10, 30, and 50mg/L) for 72h. Metabolites were extracted using two different methods (Bligh and Dyer and solid-phase extraction). A total of 13,538 and 12,469 features were detected using quadrupole time-of-flight liquid chromatography mass spectrometry (QTOF LC-MS). Of these metabolites, 4278 (Bligh and Dyer) and 332 (solid phase extraction) were found to be significant after false discovery rate adjustment at a significance threshold of 0.01. Metlin and KEGG pathway analysis showed comprehensive information from fish samples extracted using Bligh and Dyer compared to solid phase extraction. This study shows that proper selection of sample extraction method is critically important for interpreting and analyzing the toxicity data of organisms when metabolomics is applied.
Collapse
Affiliation(s)
- Ryan De Sotto
- Bio Monitoring Laboratory, Program in Environmental Technology and Policy, Korea University Sejong Campus, 2511 Sejong-ro, Sejong City, Chungnam 339-700, South Korea
| | - Carl Medriano
- Metabolomics Laboratory, College of Pharmacy, Korea University Sejong Campus, 2511 Sejong-ro, Sejong City, Chungnam 339-700, South Korea
| | - Yunchul Cho
- Department of Environmental Engineering, Daejeon University, 62 Daehak-Ro, Dong-Gu, Daejeon 300-716, South Korea
| | - Kwang-Seol Seok
- National Institute of Environmental Research, Environmental Research Complex, Incheon 404-708, Korea
| | - Youngja Park
- Metabolomics Laboratory, College of Pharmacy, Korea University Sejong Campus, 2511 Sejong-ro, Sejong City, Chungnam 339-700, South Korea.
| | - Sungpyo Kim
- Bio Monitoring Laboratory, Program in Environmental Technology and Policy, Korea University Sejong Campus, 2511 Sejong-ro, Sejong City, Chungnam 339-700, South Korea.
| |
Collapse
|
32
|
Warr WA. Many InChIs and quite some feat. J Comput Aided Mol Des 2015; 29:681-94. [PMID: 26081259 DOI: 10.1007/s10822-015-9854-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 06/10/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Wendy A Warr
- Wendy Warr & Associates, Holmes Chapel, Crewe, Cheshire, CW4 7HZ, UK,
| |
Collapse
|
33
|
Ioannidis D, Papadopoulos GE, Anastassopoulos G, Kortsaris A, Anagnostopoulos K. Structural properties and interaction energies affecting drug design. An approach combining molecular simulations, statistics, interaction energies and neural networks. Comput Biol Chem 2015; 56:7-12. [DOI: 10.1016/j.compbiolchem.2015.02.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2014] [Revised: 02/17/2015] [Accepted: 02/22/2015] [Indexed: 11/26/2022]
|
34
|
Xie T, Song S, Li S, Ouyang L, Xia L, Huang J. Review of natural product databases. Cell Prolif 2015; 48:398-404. [PMID: 26009974 DOI: 10.1111/cpr.12190] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Accepted: 03/14/2015] [Indexed: 02/05/2023] Open
Abstract
OBJECTIVES Many natural products have pharmacological or biological activities that can be of therapeutic benefit in treating diseases, and are also an important source of inspiration for development of potential novel drugs. The past few decades have witnessed extensive study of natural products for their promising prospects in application of medicinal chemistry, molecular biology and pharmaceutical sciences. MATERIALS AND METHODS Natural product databases have provided systematic collection of information concerning natural products and their derivatives, including structure, source and mechanisms of action, which significantly support modern drug discovery. RESULTS Currently, a considerable number of natural product databases, such as TCM Database@Taiwan, TCMID, CEMTDD, SuperToxic and SuperNatural, have been developed, providing data such as integrated medicinal herbs, ingredients, 2D/3D structures of the compounds, related target proteins, relevant diseases, and metabolic toxicity and more. CONCLUSIONS We focus on an analytical overview of current natural product databases, and further discuss the good, bad or imperfection of current ones, in the hope of better integrating existing relevant outcomes, thus providing new routes for future drug discovery.
Collapse
Affiliation(s)
- Tao Xie
- State Key Laboratory of Biotherapy/Collaborative Innovation Center of Biotherapy, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Sicheng Song
- State Key Laboratory of Biotherapy/Collaborative Innovation Center of Biotherapy, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Sijia Li
- State Key Laboratory of Stomatology, West China Hospital of Stomatology, Sichuan University, Chengdu, 610041, China
| | - Liang Ouyang
- State Key Laboratory of Biotherapy/Collaborative Innovation Center of Biotherapy, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Lin Xia
- Department of Gastrointestinal Surgery, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Jian Huang
- School of Traditional Chinese Materia Medica, Shenyang Pharmaceutical University, Shenyang, 110016, China
| |
Collapse
|
35
|
Dimova D, Stumpfe D, Bajorath J. Identification of orthologous target pairs with shared active compounds and comparison of organism-specific activity patterns. Chem Biol Drug Des 2015; 86:1105-14. [PMID: 25931211 DOI: 10.1111/cbdd.12578] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2015] [Revised: 04/16/2015] [Accepted: 04/27/2015] [Indexed: 12/18/2022]
Abstract
A systematic search for active small molecules shared by orthologous targets was carried out, leading to the identification of 803 compound-based orthologous target pairs covering a total of 938 orthologues, 358 unique targets and 98 organisms. Many orthologous target pairs were found to have substantial compound coverage, enabling the introduction of an orthologous target pairs classification including 'organism cliffs' and 'potency-retaining' pairs. A total of 158 orthologous target pairs involving human orthologues were identified, which were typically associated with drug discovery-relevant targets, organism combinations and compound data. Orthologous target pairs with human orthologues included 83 potency-retaining orthologous target pairs covering a variety of targets and organisms. On the basis of these orthologous target pairs, the compound search was further extended and 1149 potent compounds were identified that only had reported activities for non-human orthologues of 48 therapeutic targets, but not their human counterparts, hence providing a large pool of candidate compounds for further evaluation. The complete set of orthologous target pairs identified in our analysis, the orthologous target pairs classification including associated data and all candidate compounds are made freely available.
Collapse
Affiliation(s)
- Dilyana Dimova
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, Bonn, D-53113, Germany
| | - Dagmar Stumpfe
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, Bonn, D-53113, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, Bonn, D-53113, Germany
| |
Collapse
|
36
|
Pires DEV, Blundell TL, Ascher DB. pkCSM: Predicting Small-Molecule Pharmacokinetic and Toxicity Properties Using Graph-Based Signatures. J Med Chem 2015; 58:4066-72. [PMID: 25860834 PMCID: PMC4434528 DOI: 10.1021/acs.jmedchem.5b00104] [Citation(s) in RCA: 2076] [Impact Index Per Article: 230.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
![]()
Drug development has a high attrition
rate, with poor pharmacokinetic
and safety properties a significant hurdle. Computational approaches
may help minimize these risks. We have developed a novel approach
(pkCSM) which uses graph-based signatures to develop predictive models
of central ADMET properties for drug development. pkCSM performs as
well or better than current methods. A freely accessible web server
(http://structure.bioc.cam.ac.uk/pkcsm), which retains
no information submitted to it, provides an integrated platform to
rapidly evaluate pharmacokinetic and toxicity properties.
Collapse
Affiliation(s)
- Douglas E V Pires
- †Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Sanger Building, Cambridge, Cambridgshire CB2 1GA, U.K.,‡Centro de Pesquisas René Rachou, Fundação Oswaldo Cruz, Belo Horizonte 30190-002, Brazil
| | - Tom L Blundell
- †Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Sanger Building, Cambridge, Cambridgshire CB2 1GA, U.K
| | - David B Ascher
- †Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Sanger Building, Cambridge, Cambridgshire CB2 1GA, U.K
| |
Collapse
|
37
|
Abstract
Fragment-based drug design has proved itself as a powerful technique for increasing the sampling and diversity of chemical space and enabling the design of novel leads and compounds. Computational techniques for identifying fragments, binding sites and particularly for linking, growing, and evolving fragments play a significant role in the process. Information from ADME studies and clustering property information in the form of toxicophores and chemotypes can play a significant role in aiding the design of novel, selective fragments with good activity profiles.
Collapse
Affiliation(s)
- Rachelle J Bienstock
- Independent Researcher and Consultant, 300 Pitch Pine Lane, Chapel Hill, NC, 27514, USA,
| |
Collapse
|
38
|
Jeong E, He N, Park H, Song M, Kim N, Lee S, Yoon S. MACE: mutation-oriented profiling of chemical response and gene expression in cancers. ACTA ACUST UNITED AC 2014; 31:1508-14. [PMID: 25536965 DOI: 10.1093/bioinformatics/btu835] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Accepted: 12/14/2014] [Indexed: 12/21/2022]
Abstract
SUMMARY The mutational status of specific cancer lineages can affect the sensitivity to or resistance against cancer drugs. The MACE database provides web-based interactive tools for interpreting large chemical screening and gene expression datasets of cancer cell lines in terms of mutation and lineage categories. GI50 data of chemicals against individual NCI60 cell lines were normalized and organized to statistically identify mutation- or lineage-specific chemical responses. Similarly, DNA microarray data on NCI60 cell lines were processed to analyze mutation- or lineage-specific gene expression signatures. A combined analysis of GI50 and gene expression data to find potential associations between chemicals and genes is also a capability of this system. This database will provide extensive, systematic information to identify lineage- or mutation-specific anticancer agents and related gene targets. AVAILABILITY AND IMPLEMENTATION The MACE web database is available at http://mace.sookmyung.ac.kr/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online. CONTACT yoonsj@sookmyung.ac.kr.
Collapse
Affiliation(s)
- Euna Jeong
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Ningning He
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Hyerin Park
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Mee Song
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Nayoung Kim
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Seongjoon Lee
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| | - Sukjoon Yoon
- Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea Center for Advanced Bioinformatics and Systems Medicine and Department of Biological Sciences, Sookmyung Women's University, Seoul 140-742, Republic of Korea
| |
Collapse
|
39
|
Bucheli TD. Phytotoxins: environmental micropollutants of concern? ENVIRONMENTAL SCIENCE & TECHNOLOGY 2014; 48:13027-13033. [PMID: 25325883 DOI: 10.1021/es504342w] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Natural toxins such as mycotoxins or phytotoxins (bioactive compounds from fungi and plants, respectively) have been widely studied in food and feed, where they are stated to out-compete synthetic chemicals in their overall human and animal toxicological risk. A similar perception and awareness is yet largely missing for environmental safety. This article attempts to raise concern in this regard, by providing (circumstantial) evidence that phytotoxins in particular can be emitted into the environment, where they may contribute to the complex mixture of organic micropollutants. Exposures can be orders-of-magnitude higher in anthropogenically managed/affected (agro-)ecosystems than in the pristine environment.
Collapse
Affiliation(s)
- Thomas D Bucheli
- Agroscope Institute for Sustainability Sciences ISS , CH-8046 Zurich, Switzerland
| |
Collapse
|
40
|
Drwal MN, Banerjee P, Dunkel M, Wettig MR, Preissner R. ProTox: a web server for the in silico prediction of rodent oral toxicity. Nucleic Acids Res 2014; 42:W53-8. [PMID: 24838562 PMCID: PMC4086068 DOI: 10.1093/nar/gku401] [Citation(s) in RCA: 324] [Impact Index Per Article: 32.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Animal trials are currently the major method for determining the possible toxic effects of drug candidates and cosmetics. In silico prediction methods represent an alternative approach and aim to rationalize the preclinical drug development, thus enabling the reduction of the associated time, costs and animal experiments. Here, we present ProTox, a web server for the prediction of rodent oral toxicity. The prediction method is based on the analysis of the similarity of compounds with known median lethal doses (LD50) and incorporates the identification of toxic fragments, therefore representing a novel approach in toxicity prediction. In addition, the web server includes an indication of possible toxicity targets which is based on an in-house collection of protein–ligand-based pharmacophore models (‘toxicophores’) for targets associated with adverse drug reactions. The ProTox web server is open to all users and can be accessed without registration at: http://tox.charite.de/tox. The only requirement for the prediction is the two-dimensional structure of the input compounds. All ProTox methods have been evaluated based on a diverse external validation set and displayed strong performance (sensitivity, specificity and precision of 76, 95 and 75%, respectively) and superiority over other toxicity prediction tools, indicating their possible applicability for other compound classes.
Collapse
Affiliation(s)
- Malgorzata N Drwal
- Structural Bioinformatics Group, Institute for Physiology, Charité - University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany
| | - Priyanka Banerjee
- Structural Bioinformatics Group, Institute for Physiology, Charité - University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany Graduate School of Computational Systems Biology, Humboldt-Universität zu Berlin, Invalidenstraße 42, 10115 Berlin, Germany
| | - Mathias Dunkel
- Structural Bioinformatics Group, Institute for Physiology, Charité - University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany
| | - Martin R Wettig
- Structural Bioinformatics Group, Institute for Physiology, Charité - University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany
| | - Robert Preissner
- Structural Bioinformatics Group, Institute for Physiology, Charité - University Medicine Berlin, Lindenberger Weg 80, 13125 Berlin, Germany German Cancer Consortium (DKTK), Im Neuenheimer Feld 280, 69120 Heidelberg, Germany
| |
Collapse
|
41
|
Arora PK, Bae H. Integration of bioinformatics to biodegradation. Biol Proced Online 2014; 16:8. [PMID: 24808763 PMCID: PMC4012781 DOI: 10.1186/1480-9222-16-8] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 04/19/2014] [Indexed: 12/22/2022] Open
Abstract
Bioinformatics and biodegradation are two primary scientific fields in applied microbiology and biotechnology. The present review describes development of various bioinformatics tools that may be applied in the field of biodegradation. Several databases, including the University of Minnesota Biocatalysis/Biodegradation database (UM-BBD), a database of biodegradative oxygenases (OxDBase), Biodegradation Network-Molecular Biology Database (Bionemo) MetaCyc, and BioCyc have been developed to enable access to information related to biochemistry and genetics of microbial degradation. In addition, several bioinformatics tools for predicting toxicity and biodegradation of chemicals have been developed. Furthermore, the whole genomes of several potential degrading bacteria have been sequenced and annotated using bioinformatics tools.
Collapse
Affiliation(s)
- Pankaj Kumar Arora
- School of Biotechnology, Yeungnam University, Gyeongsan 712-749, Republic of Korea
| | - Hanhong Bae
- School of Biotechnology, Yeungnam University, Gyeongsan 712-749, Republic of Korea
| |
Collapse
|
42
|
Minkiewicz P, Miciński J, Darewicz M, Bucholska J. Biological and Chemical Databases for Research into the Composition of Animal Source Foods. FOOD REVIEWS INTERNATIONAL 2013. [DOI: 10.1080/87559129.2013.818011] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
43
|
Villoutreix BO, Lagorce D, Labbé CM, Sperandio O, Miteva MA. One hundred thousand mouse clicks down the road: selected online resources supporting drug discovery collected over a decade. Drug Discov Today 2013; 18:1081-9. [PMID: 23831439 DOI: 10.1016/j.drudis.2013.06.013] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2013] [Revised: 06/18/2013] [Accepted: 06/26/2013] [Indexed: 12/17/2022]
Abstract
Online resources enabling and supporting drug discovery have blossomed during the past ten years. However, drug hunters commonly find themselves overwhelmed by the proliferation of these computer-based resources. Ten years ago, we, the authors of this review, felt that a comprehensive list of in silico resources relating to drug discovery was needed. Especially because the internet provides a wealth of inspiring tools that, if fully exploited, could greatly assist the process. We present here a compilation of online tools and databases collected over the past decade. The tools were essentially found through literature and internet searches and, currently, our list contains over 1500 URLs. We also briefly highlight some recently reported services and comment about ongoing and future efforts in the field.
Collapse
Affiliation(s)
- Bruno O Villoutreix
- Université Paris Diderot, Sorbonne Paris Cité, Inserm UMR-S 973, Molécules Thérapeutiques In Silico, 39 rue Helene Brion, 75013 Paris, France.
| | | | | | | | | |
Collapse
|
44
|
iPPI-DB: a manually curated and interactive database of small non-peptide inhibitors of protein-protein interactions. Drug Discov Today 2013; 18:958-68. [PMID: 23688585 DOI: 10.1016/j.drudis.2013.05.003] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Revised: 05/06/2013] [Accepted: 05/10/2013] [Indexed: 01/05/2023]
Abstract
The development of small molecule drugs targeting protein-protein interactions (PPI) represents a major challenge, in part owing to the misunderstanding of the PPI chemical space. To this end, we have manually collected the structures, the physicochemical and pharmacological profiles of 1650 PPI inhibitors across 13 families of PPI targets in a database named iPPI-DB. To access iPPI-DB, we propose a user-friendly web application (www.ippidb.cdithem.fr) with customizable queries and intuitive visualizing functionalities for associated properties of the compounds. This could assist scientists to design the next generation of PPI drugs. In this review, we describe iPPI-DB in the context of other low molecular weight molecule databases.
Collapse
|
45
|
Abstract
“Big” molecules such as proteins and genes still continue to capture the imagination of most biologists, biochemists and bioinformaticians. “Small” molecules, on the other hand, are the molecules that most biologists, biochemists and bioinformaticians prefer to ignore. However, it is becoming increasingly apparent that small molecules such as amino acids, lipids and sugars play a far more important role in all aspects of disease etiology and disease treatment than we realized. This particular chapter focuses on an emerging field of bioinformatics called “chemical bioinformatics” – a discipline that has evolved to help address the blended chemical and molecular biological needs of toxicogenomics, pharmacogenomics, metabolomics and systems biology. In the following pages we will cover several topics related to chemical bioinformatics. First, a brief overview of some of the most important or useful chemical bioinformatic resources will be given. Second, a more detailed overview will be given on those particular resources that allow researchers to connect small molecules to diseases. This section will focus on describing a number of recently developed databases or knowledgebases that explicitly relate small molecules – either as the treatment, symptom or cause – to disease. Finally a short discussion will be provided on newly emerging software tools that exploit these databases as a means to discover new biomarkers or even new treatments for disease.
Collapse
Affiliation(s)
- David S Wishart
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada.
| |
Collapse
|
46
|
Cheng F, Li W, Zhou Y, Shen J, Wu Z, Liu G, Lee PW, Tang Y. admetSAR: a comprehensive source and free tool for assessment of chemical ADMET properties. J Chem Inf Model 2012; 52:3099-105. [PMID: 23092397 DOI: 10.1021/ci300367a] [Citation(s) in RCA: 1149] [Impact Index Per Article: 95.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Absorption, distribution, metabolism, excretion, and toxicity (ADMET) properties play key roles in the discovery/development of drugs, pesticides, food additives, consumer products, and industrial chemicals. This information is especially useful when to conduct environmental and human hazard assessment. The most critical rate limiting step in the chemical safety assessment workflow is the availability of high quality data. This paper describes an ADMET structure-activity relationship database, abbreviated as admetSAR. It is an open source, text and structure searchable, and continually updated database that collects, curates, and manages available ADMET-associated properties data from the published literature. In admetSAR, over 210,000 ADMET annotated data points for more than 96,000 unique compounds with 45 kinds of ADMET-associated properties, proteins, species, or organisms have been carefully curated from a large number of diverse literatures. The database provides a user-friendly interface to query a specific chemical profile, using either CAS registry number, common name, or structure similarity. In addition, the database includes 22 qualitative classification and 5 quantitative regression models with highly predictive accuracy, allowing to estimate ecological/mammalian ADMET properties for novel chemicals. AdmetSAR is accessible free of charge at http://www.admetexp.org.
Collapse
Affiliation(s)
- Feixiong Cheng
- Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China
| | | | | | | | | | | | | | | |
Collapse
|
47
|
DESTAF: a database of text-mined associations for reproductive toxins potentially affecting human fertility. Reprod Toxicol 2011; 33:99-105. [PMID: 22198179 DOI: 10.1016/j.reprotox.2011.12.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2011] [Revised: 11/14/2011] [Accepted: 12/09/2011] [Indexed: 11/22/2022]
Abstract
The Dragon Exploration System for Toxicants and Fertility (DESTAF) is a publicly available resource which enables researchers to efficiently explore both known and potentially novel information and associations in the field of reproductive toxicology. To create DESTAF we used data from the literature (including over 10500 PubMed abstracts), several publicly available biomedical repositories, and specialized, curated dictionaries. DESTAF has an interface designed to facilitate rapid assessment of the key associations between relevant concepts, allowing for a more in-depth exploration of information based on different gene/protein-, enzyme/metabolite-, toxin/chemical-, disease- or anatomically centric perspectives. As a special feature, DESTAF allows for the creation and initial testing of potentially new association hypotheses that suggest links between biological entities identified through the database. DESTAF, along with a PDF manual, can be found at http://cbrc.kaust.edu.sa/destaf. It is free to academic and non-commercial users and will be updated quarterly.
Collapse
|
48
|
Griffith SD, Quest DJ, Brettin TS, Cottingham RW. Scenario driven data modelling: a method for integrating diverse sources of data and data streams. BMC Bioinformatics 2011; 12 Suppl 10:S17. [PMID: 22165854 PMCID: PMC3236839 DOI: 10.1186/1471-2105-12-s10-s17] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Background Biology is rapidly becoming a data intensive, data-driven science. It is essential that data is represented and connected in ways that best represent its full conceptual content and allows both automated integration and data driven decision-making. Recent advancements in distributed multi-relational directed graphs, implemented in the form of the Semantic Web make it possible to deal with complicated heterogeneous data in new and interesting ways. Results This paper presents a new approach, scenario driven data modelling (SDDM), that integrates multi-relational directed graphs with data streams. SDDM can be applied to virtually any data integration challenge with widely divergent types of data and data streams. In this work, we explored integrating genetics data with reports from traditional media. SDDM was applied to the New Delhi metallo-beta-lactamase gene (NDM-1), an emerging global health threat. The SDDM process constructed a scenario, created a RDF multi-relational directed graph that linked diverse types of data to the Semantic Web, implemented RDF conversion tools (RDFizers) to bring content into the Sematic Web, identified data streams and analytical routines to analyse those streams, and identified user requirements and graph traversals to meet end-user requirements. Conclusions We provided an example where SDDM was applied to a complex data integration challenge. The process created a model of the emerging NDM-1 health threat, identified and filled gaps in that model, and constructed reliable software that monitored data streams based on the scenario derived multi-relational directed graph. The SDDM process significantly reduced the software requirements phase by letting the scenario and resulting multi-relational directed graph define what is possible and then set the scope of the user requirements. Approaches like SDDM will be critical to the future of data intensive, data-driven science because they automate the process of converting massive data streams into usable knowledge.
Collapse
|
49
|
Willighagen EL, Jeliazkova N, Hardy B, Grafström RC, Spjuth O. Computational toxicology using the OpenTox application programming interface and Bioclipse. BMC Res Notes 2011; 4:487. [PMID: 22075173 PMCID: PMC3264531 DOI: 10.1186/1756-0500-4-487] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Accepted: 11/10/2011] [Indexed: 11/10/2022] Open
Abstract
Background Toxicity is a complex phenomenon involving the potential adverse effect on a range of biological functions. Predicting toxicity involves using a combination of experimental data (endpoints) and computational methods to generate a set of predictive models. Such models rely strongly on being able to integrate information from many sources. The required integration of biological and chemical information sources requires, however, a common language to express our knowledge ontologically, and interoperating services to build reliable predictive toxicology applications. Findings This article describes progress in extending the integrative bio- and cheminformatics platform Bioclipse to interoperate with OpenTox, a semantic web framework which supports open data exchange and toxicology model building. The Bioclipse workbench environment enables functionality from OpenTox web services and easy access to OpenTox resources for evaluating toxicity properties of query molecules. Relevant cases and interfaces based on ten neurotoxins are described to demonstrate the capabilities provided to the user. The integration takes advantage of semantic web technologies, thereby providing an open and simplifying communication standard. Additionally, the use of ontologies ensures proper interoperation and reliable integration of toxicity information from both experimental and computational sources. Conclusions A novel computational toxicity assessment platform was generated from integration of two open science platforms related to toxicology: Bioclipse, that combines a rich scriptable and graphical workbench environment for integration of diverse sets of information sources, and OpenTox, a platform for interoperable toxicology data and computational services. The combination provides improved reliability and operability for handling large data sets by the use of the Open Standards from the OpenTox Application Programming Interface. This enables simultaneous access to a variety of distributed predictive toxicology databases, and algorithm and model resources, taking advantage of the Bioclipse workbench handling the technical layers.
Collapse
Affiliation(s)
- Egon L Willighagen
- Department of Pharmaceutical Bioinformatics, Uppsala University, Uppsala, Sweden.
| | | | | | | | | |
Collapse
|
50
|
Maddah F, Soeria-Atmadja D, Malm P, Gustafsson M, Hammerling U. Interrogating health-related public databases from a food toxicology perspective: Computational analysis of scoring data. Food Chem Toxicol 2011; 49:2830-40. [DOI: 10.1016/j.fct.2011.08.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Revised: 07/22/2011] [Accepted: 08/03/2011] [Indexed: 11/28/2022]
|