1
|
Amano Y, Yamane M, Honda H. RAID: Regression Analysis–Based Inductive DNA Microarray for Precise Read-Across. Front Pharmacol 2022; 13:879907. [PMID: 35935858 PMCID: PMC9354856 DOI: 10.3389/fphar.2022.879907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Accepted: 05/30/2022] [Indexed: 12/02/2022] Open
Abstract
Chemical structure-based read-across represents a promising method for chemical toxicity evaluation without the need for animal testing; however, a chemical structure is not necessarily related to toxicity. Therefore, in vitro studies were often used for read-across reliability refinement; however, their external validity has been hindered by the gap between in vitro and in vivo conditions. Thus, we developed a virtual DNA microarray, regression analysis–based inductive DNA microarray (RAID), which quantitatively predicts in vivo gene expression profiles based on the chemical structure and/or in vitro transcriptome data. For each gene, elastic-net models were constructed using chemical descriptors and in vitro transcriptome data to predict in vivo data from in vitro data (in vitro to in vivo extrapolation; IVIVE). In feature selection, useful genes for assessing the quantitative structure–activity relationship (QSAR) and IVIVE were identified. Predicted transcriptome data derived from the RAID system reflected the in vivo gene expression profiles of characteristic hepatotoxic substances. Moreover, gene ontology and pathway analysis indicated that nuclear receptor-mediated xenobiotic response and metabolic activation are related to these gene expressions. The identified IVIVE-related genes were associated with fatty acid, xenobiotic, and drug metabolisms, indicating that in vitro studies were effective in evaluating these key events. Furthermore, validation studies revealed that chemical substances associated with these key events could be detected as hepatotoxic biosimilar substances. These results indicated that the RAID system could represent an alternative screening test for a repeated-dose toxicity test and toxicogenomics analyses. Our technology provides a critical solution for IVIVE-based read-across by considering the mode of action and chemical structures.
Collapse
|
2
|
Rathman J, Yang C, Ribeiro JV, Mostrag A, Thakkar S, Tong W, Hobocienski B, Sacher O, Magdziarz T, Bienfait B. Development of a Battery of In Silico Prediction Tools for Drug-Induced Liver Injury from the Vantage Point of Translational Safety Assessment. Chem Res Toxicol 2020; 34:601-615. [PMID: 33356149 DOI: 10.1021/acs.chemrestox.0c00423] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Drug-induced liver injury (DILI) remains a challenge when translating knowledge from the preclinical stage to human use cases. Attempts to model human DILI directly based on the information from drug labels have had some success; however, the approach falls short of providing insights or addressing uncertainty due to the difficulty of decoupling the idiosyncratic nature of human DILI outcomes. Our approach in this comparative analysis is to leverage existing preclinical and clinical data as well as information on metabolism to better translate mammalian to human DILI. The human DILI knowledge base from the United States Food and Drug Administration (U.S. FDA) National Center for Toxicology Research contains 1036 pharmaceuticals from diverse therapeutic categories. A human DILI training set of 305 oral marketed drugs was prepared and a binary classification scheme applied. The second knowledge base consists of mammalian repeated dose toxicity with liver toxicity data from various regulatory sources. Within this knowledge base, we identified 278 pharmaceuticals containing 198 marketed or withdrawn oral drugs with data from the U.S. FDA new drug application and 98 active pharmaceutical ingredients from ToxCast. From this collection, a set of 225 oral drugs was prepared as the mammalian hepatotoxicity training set with particular end points of pathology findings in the liver and bile duct. Both human and mammalian data sets were processed using various learning algorithms, including artificial intelligence approaches. The external validations for both models were comparable to the training statistics. These data sets were also used to extract species-differentiating chemotypes that differentiate DILI effects on humans from mammals. A systematic workflow was devised to predict human DILI and provide mechanistic insights. For a given query molecule, both human and mammalian models are run. If the predictions are discordant, both metabolites and parents are investigated for quantitative structure-activity relationship and species-differentiating chemotypes. Their results are combined using the Dempster-Shafer decision theory to yield a final outcome prediction for human DILI with estimated uncertainty. Finally, these tools are implementable within an in silico platform for systematic evaluation.
Collapse
Affiliation(s)
- James Rathman
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany.,Chemical and Biomolecular Engineering, The Ohio State University, Columbus, Ohio 43210, United States
| | - Chihae Yang
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - J Vinicius Ribeiro
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - Aleksandra Mostrag
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - Shraddha Thakkar
- National Center for Toxicology Research, United States Food and Drug Administration, Jefferson, Arkansas 72079, United States
| | - Weida Tong
- National Center for Toxicology Research, United States Food and Drug Administration, Jefferson, Arkansas 72079, United States
| | - Bryan Hobocienski
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - Oliver Sacher
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - Tomasz Magdziarz
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| | - Bruno Bienfait
- Molecular Networks GmbH - Computerchemie (MN-AM), 90411 Nurnberg, Germany
| |
Collapse
|
3
|
Li T, Tong W, Roberts R, Liu Z, Thakkar S. Deep Learning on High-Throughput Transcriptomics to Predict Drug-Induced Liver Injury. Front Bioeng Biotechnol 2020; 8:562677. [PMID: 33330410 PMCID: PMC7728858 DOI: 10.3389/fbioe.2020.562677] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Accepted: 11/05/2020] [Indexed: 12/14/2022] Open
Abstract
Drug-induced liver injury (DILI) is one of the most cited reasons for the high drug attrition rate and drug withdrawal from the market. The accumulated large amount of high throughput transcriptomic profiles and advances in deep learning provide an unprecedented opportunity to improve the suboptimal performance of DILI prediction. In this study, we developed an eight-layer Deep Neural Network (DNN) model for DILI prediction using transcriptomic profiles of human cell lines (LINCS L1000 dataset) with the current largest binary DILI annotation data [i.e., DILI severity and toxicity (DILIst)]. The developed models were evaluated by Monte Carlo cross-validation (MCCV), permutation test, and an independent validation (IV) set. The developed DNN model achieved the area under the receiver operating characteristic curve (AUC) of 0.802 and 0.798, and balanced accuracy of 0.741 and 0.721 for training and an IV set, respectively, outperforming the conventional machine learning algorithms, including K-nearest neighbors (KNN), Support Vector Machine (SVM), and Random Forest (RF). Moreover, the developed DNN model provided a more balanced sensitivity of 0.839 and specificity of 0.603. Besides, we found the developed DNN model had a superior predictive performance for oncology drugs. Also, the functional and network analysis of genes driving the predictions revealed their relevance to the underlying mechanisms of DILI. The proposed DNN model could be a promising tool for early detection of DILI potential in the pre-clinical setting.
Collapse
Affiliation(s)
- Ting Li
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States.,Joint Bioinformatics Program, University of Arkansas at Little Rock and University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Ruth Roberts
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States.,ApconiX Ltd., Alderley Edge, United Kingdom.,Department of Biosciences, University of Birmingham, Birmingham, United Kingdom
| | - Zhichao Liu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| | - Shraddha Thakkar
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, United States
| |
Collapse
|
4
|
Liu Y, Jing R, Wen Z, Li M. Narrowing the Gap Between In Vitro and In Vivo Genetic Profiles by Deconvoluting Toxicogenomic Data In Silico. Front Pharmacol 2020; 10:1489. [PMID: 31992983 PMCID: PMC6964707 DOI: 10.3389/fphar.2019.01489] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2019] [Accepted: 11/18/2019] [Indexed: 01/09/2023] Open
Abstract
Toxicogenomics (TGx) is a powerful method to evaluate toxicity and is widely used in both in vivo and in vitro assays. For in vivo TGx, reduction, refinement, and replacement represent the unremitting pursuit of live-animal tests, but in vitro assays, as alternatives, usually demonstrate poor correlation with real in vivo assays. In living subjects, in addition to drug effects, inner-environmental reactions also affect genetic variation, and these two factors are further jointly reflected in gene abundance. Thus, finding a strategy to factorize inner-environmental factor from in vivo assays based on gene expression levels and to further utilize in vitro data to better simulate in vivo data is needed. We proposed a strategy based on post-modified non-negative matrix factorization, which can estimate the gene expression profiles and contents of major factors in samples. The applicability of the strategy was first verified, and the strategy was then utilized to simulate in vivo data by correcting in vitro data. The similarities between real in vivo data and simulated data (single-dose 0.72, repeat-doses 0.75) were higher than those observed when directly comparing real in vivo data with in vitro data (single-dose 0.56, repeat-doses 0.70). Moreover, by keeping environment-related factor, a simulation can always be generated by using in vitro data to provide potential substitutions for in vivo TGx and to reduce the launch of live-animal tests.
Collapse
Affiliation(s)
- Yuan Liu
- College of Chemistry, Sichuan University, Chengdu, China
| | - Runyu Jing
- College of Cybersecurity, Sichuan University, Chengdu, China
| | - Zhining Wen
- College of Chemistry, Sichuan University, Chengdu, China
| | - Menglong Li
- College of Chemistry, Sichuan University, Chengdu, China
| |
Collapse
|
5
|
Liu Z, Zhu L, Thakkar S, Roberts R, Tong W. Can Transcriptomic Profiles from Cancer Cell Lines Be Used for Toxicity Assessment? Chem Res Toxicol 2019; 33:271-280. [DOI: 10.1021/acs.chemrestox.9b00288] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Affiliation(s)
- Zhichao Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| | - Liyuan Zhu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| | - Shraddha Thakkar
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| | - Ruth Roberts
- ApconiX, BioHub at Alderley Park, Alderley Edge SK10 4TG, U.K
- University of Birmingham, Edgbaston, Birmingham B15 2TT, U.K
| | - Weida Tong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, United States
| |
Collapse
|
6
|
Thakkar S, Li T, Liu Z, Wu L, Roberts R, Tong W. Drug-induced liver injury severity and toxicity (DILIst): binary classification of 1279 drugs by human hepatotoxicity. Drug Discov Today 2019; 25:201-208. [PMID: 31669330 DOI: 10.1016/j.drudis.2019.09.022] [Citation(s) in RCA: 59] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 09/22/2019] [Accepted: 09/26/2019] [Indexed: 12/12/2022]
Abstract
Drug-induced liver injury (DILI) is of significant concern to drug development and regulatory review because of the limited success with existing preclinical models. For developing alternative methods, a large drug list is needed with known DILI severity and toxicity. We augmented the DILIrank data set [annotated using US Food and Drug Administration (FDA) drug labeling)] with four literature datasets (N >350 drugs) to generate the largest drug list with DILI classification, called DILIst (DILI severity and toxicity). DILIst comprises 1279 drugs, of which 768 were DILI positives (increase of 65% from DILIrank), whereas 511 were DILI negatives (increase of 65%). The investigation of DILI positive-negative distribution across various therapeutic categories revealed the most and least frequent DILI categories. Thus, we consider DILIst to be an invaluable resource for the community to improve DILI research.
Collapse
Affiliation(s)
- Shraddha Thakkar
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR 72079, USA
| | - Ting Li
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR 72079, USA
| | - Zhichao Liu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR 72079, USA
| | - Leihong Wu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR 72079, USA
| | - Ruth Roberts
- ApconiX Ltd, Alderley Park, Alderley Edge, SK10 4TG, UK; University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR 72079, USA.
| |
Collapse
|
7
|
Liu Z, Huang R, Roberts R, Tong W. Toxicogenomics: A 2020 Vision. Trends Pharmacol Sci 2019; 40:92-103. [PMID: 30594306 PMCID: PMC9988209 DOI: 10.1016/j.tips.2018.12.001] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Revised: 11/23/2018] [Accepted: 12/04/2018] [Indexed: 12/29/2022]
Abstract
Toxicogenomics (TGx) has contributed significantly to toxicology and now has great potential to support moves towards animal-free approaches in regulatory decision making. Here, we discuss in vitro TGx systems and their potential impact on risk assessment. We raise awareness of the rapid advancement of genomics technologies, which generates novel genomics features essential for enhanced risk assessment. We specifically emphasize the importance of reproducibility in utilizing TGx in the regulatory setting. We also highlight the role of machine learning (particularly deep learning) in developing TGx-based predictive models. Lastly, we touch on the topics of how TGx approaches could facilitate adverse outcome pathways (AOP) development and enhance read-across strategies to further regulatory application. Finally, we summarize current efforts to develop TGx for risk assessment and set out remaining challenges.
Collapse
Affiliation(s)
- Zhichao Liu
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, USA.
| | - Ruili Huang
- National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland 20850, USA
| | - Ruth Roberts
- ApconiX, BioHub at Alderley Park, Alderley Edge, SK10 4TG, UK; University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| | - Weida Tong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, Arkansas 72079, USA.
| |
Collapse
|
8
|
Taškova K, Fontaine JF, Mrowka R, Andrade-Navarro MA. Literature optimized integration of gene expression for organ-specific evaluation of toxicogenomics datasets. PLoS One 2019; 14:e0210467. [PMID: 30640953 PMCID: PMC6331104 DOI: 10.1371/journal.pone.0210467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Accepted: 12/24/2018] [Indexed: 11/30/2022] Open
Abstract
The study of drug toxicity in human organs is complicated by their complex inter-relations and by the obvious difficulty to testing drug effects on biologically relevant material. Animal models and human cell cultures offer alternatives for systematic and large-scale profiling of drug effects on gene expression level, as typically found in the so-called toxicogenomics datasets. However, the complexity of these data, which includes variable drug doses, time points, and experimental setups, makes it difficult to choose and integrate the data, and to evaluate the appropriateness of one or another model system to study drug toxicity (of particular drugs) of particular human organs. Here, we define a protocol to integrate drug-wise rankings of gene expression changes in toxicogenomics data, which we apply to the TG-GATEs dataset, to prioritize genes for association to drug toxicity in liver or kidney. Contrast of the results with sets of known human genes associated to drug toxicity in the literature allows to compare different rank aggregation approaches for the task at hand. Collectively, ranks from multiple models point to genes not previously associated to toxicity, notably, the PCNA clamp associated factor (PCLAF), and genes regulated by the master regulator of the antioxidant response NFE2L2, such as NQO1 and SRXN1. In addition, comparing gene ranks from different models allowed us to evaluate striking differences in terms of toxicity-associated genes between human and rat hepatocytes or between rat liver and rat hepatocytes. We interpret these results to point to the different molecular functions associated to organ toxicity that are best described by each model. We conclude that the expected production of toxicogenomics panels with larger numbers of drugs and models, in combination with the ongoing increase of the experimental literature in organ toxicity, will lead to increasingly better associations of genes for organism toxicity.
Collapse
Affiliation(s)
| | | | - Ralf Mrowka
- Experimentelle Nephrologie, Universitätsklinikum Jena, KIM III, Jena, Germany
| | | |
Collapse
|