1
|
van Dijk R, Arevalo J, Babadi M, Carpenter AE, Singh S. Capturing cell heterogeneity in representations of cell populations for image-based profiling using contrastive learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.14.567038. [PMID: 39131344 PMCID: PMC11312468 DOI: 10.1101/2023.11.14.567038] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
Image-based cell profiling is a powerful tool that compares perturbed cell populations by measuring thousands of single-cell features and summarizing them into profiles. Typically a sample is represented by averaging across cells, but this fails to capture the heterogeneity within cell populations. We introduce CytoSummaryNet: a Deep Sets-based approach that improves mechanism of action prediction by 30-68% in mean average precision compared to average profiling on a public dataset. CytoSummaryNet uses self-supervised contrastive learning in a multiple-instance learning framework, providing an easier-to-apply method for aggregating single-cell feature data than previously published strategies. Interpretability analysis suggests that the model achieves this improvement by downweighting small mitotic cells or those with debris and prioritizing large uncrowded cells. The approach requires only perturbation labels for training, which are readily available in all cell profiling datasets. CytoSummaryNet offers a straightforward post-processing step for single-cell profiles that can significantly boost retrieval performance on image-based profiling datasets.
Collapse
|
2
|
Odje F, Meijer D, von Coburg E, van der Hooft JJJ, Dunst S, Medema MH, Volkamer A. Unleashing the potential of cell painting assays for compound activities and hazards prediction. FRONTIERS IN TOXICOLOGY 2024; 6:1401036. [PMID: 39086553 PMCID: PMC11288911 DOI: 10.3389/ftox.2024.1401036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Accepted: 06/14/2024] [Indexed: 08/02/2024] Open
Abstract
The cell painting (CP) assay has emerged as a potent imaging-based high-throughput phenotypic profiling (HTPP) tool that provides comprehensive input data for in silico prediction of compound activities and potential hazards in drug discovery and toxicology. CP enables the rapid, multiplexed investigation of various molecular mechanisms for thousands of compounds at the single-cell level. The resulting large volumes of image data provide great opportunities but also pose challenges to image and data analysis routines as well as property prediction models. This review addresses the integration of CP-based phenotypic data together with or in substitute of structural information from compounds into machine (ML) and deep learning (DL) models to predict compound activities for various human-relevant disease endpoints and to identify the underlying modes-of-action (MoA) while avoiding unnecessary animal testing. The successful application of CP in combination with powerful ML/DL models promises further advances in understanding compound responses of cells guiding therapeutic development and risk assessment. Therefore, this review highlights the importance of unlocking the potential of CP assays when combined with molecular fingerprints for compound evaluation and discusses the current challenges that are associated with this approach.
Collapse
Affiliation(s)
- Floriane Odje
- Data Driven Drug Design, Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| | - David Meijer
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| | - Elena von Coburg
- Department Experimental Toxicology and ZEBET, German Federal Institute for Risk Assessment (BfR), German Centre for the Protection of Laboratory Animals (Bf3R), Berlin, Germany
| | | | - Sebastian Dunst
- Department Experimental Toxicology and ZEBET, German Federal Institute for Risk Assessment (BfR), German Centre for the Protection of Laboratory Animals (Bf3R), Berlin, Germany
| | - Marnix H. Medema
- Bioinformatics Group, Wageningen University, Wageningen, Netherlands
| | - Andrea Volkamer
- Data Driven Drug Design, Center for Bioinformatics, Saarland University, Saarbrücken, Germany
| |
Collapse
|
3
|
Rezaei Adariani S, Agne D, Koska S, Burhop A, Seitz C, Warmers J, Janning P, Metz M, Pahl A, Sievers S, Waldmann H, Ziegler S. Detection of a Mitochondrial Fragmentation and Integrated Stress Response Using the Cell Painting Assay. J Med Chem 2024. [PMID: 39018123 DOI: 10.1021/acs.jmedchem.4c01183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/19/2024]
Abstract
Mitochondria are cellular powerhouses and are crucial for cell function. However, they are vulnerable to internal and external perturbagens that may impair mitochondrial function and eventually lead to cell death. In particular, small molecules may impact mitochondrial function, and therefore, their influence on mitochondrial homeostasis is at best assessed early on in the characterization of biologically active small molecules and drug discovery. We demonstrate that unbiased morphological profiling by means of the cell painting assay (CPA) can detect mitochondrial stress coupled with the induction of an integrated stress response. This activity is common for compounds addressing different targets, is not shared by direct inhibitors of the electron transport chain, and enables prediction of mitochondrial stress induction for small molecules that are profiled using CPA.
Collapse
Affiliation(s)
- Soheila Rezaei Adariani
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
- Faculty of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Strasse 6, 44227 Dortmund, Germany
| | - Daya Agne
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Sandra Koska
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Annina Burhop
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Carina Seitz
- Compound Management and Screening Center, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Jens Warmers
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
- Faculty of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Strasse 6, 44227 Dortmund, Germany
| | - Petra Janning
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Malte Metz
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Axel Pahl
- Compound Management and Screening Center, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Sonja Sievers
- Compound Management and Screening Center, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Herbert Waldmann
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
- Faculty of Chemistry and Chemical Biology, Technical University Dortmund, Otto-Hahn-Strasse 6, 44227 Dortmund, Germany
| | - Slava Ziegler
- Department of Chemical Biology, Max Planck Institute of Molecular Physiology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| |
Collapse
|
4
|
Seal S, Williams D, Hosseini-Gerami L, Mahale M, Carpenter AE, Spjuth O, Bender A. Improved Detection of Drug-Induced Liver Injury by Integrating Predicted In Vivo and In Vitro Data. Chem Res Toxicol 2024. [PMID: 38981058 DOI: 10.1021/acs.chemrestox.4c00015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/11/2024]
Abstract
Drug-induced liver injury (DILI) has been a significant challenge in drug discovery, often leading to clinical trial failures and necessitating drug withdrawals. Over the last decade, the existing suite of in vitro proxy-DILI assays has generally improved at identifying compounds with hepatotoxicity. However, there is considerable interest in enhancing the in silico prediction of DILI because it allows for evaluating large sets of compounds more quickly and cost-effectively, particularly in the early stages of projects. In this study, we aim to study ML models for DILI prediction that first predict nine proxy-DILI labels and then use them as features in addition to chemical structural features to predict DILI. The features include in vitro (e.g., mitochondrial toxicity, bile salt export pump inhibition) data, in vivo (e.g., preclinical rat hepatotoxicity studies) data, pharmacokinetic parameters of maximum concentration, structural fingerprints, and physicochemical parameters. We trained DILI-prediction models on 888 compounds from the DILI data set (composed of DILIst and DILIrank) and tested them on a held-out external test set of 223 compounds from the DILI data set. The best model, DILIPredictor, attained an AUC-PR of 0.79. This model enabled the detection of the top 25 toxic compounds (2.68 LR+, positive likelihood ratio) compared to models using only structural features (1.65 LR+ score). Using feature interpretation from DILIPredictor, we identified the chemical substructures causing DILI and differentiated cases of DILI caused by compounds in animals but not in humans. For example, DILIPredictor correctly recognized 2-butoxyethanol as nontoxic in humans despite its hepatotoxicity in mice models. Overall, the DILIPredictor model improves the detection of compounds causing DILI with an improved differentiation between animal and human sensitivity and the potential for mechanism evaluation. DILIPredictor required only chemical structures as input for prediction and is publicly available at https://broad.io/DILIPredictor for use via web interface and with all code available for download.
Collapse
Affiliation(s)
- Srijit Seal
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Rd, Cambridge CB2 1EW, United Kingdom
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, United States
| | - Dominic Williams
- Safety Innovation, Clinical Pharmacology and Safety Sciences, AstraZeneca, Cambridge CB4 0FZ, United Kingdom
- Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge CB4 0FZ, United Kingdom
| | - Layla Hosseini-Gerami
- Ignota Laboratories, County Hall, Westminster Bridge Rd, London SE1 7PB, United Kingdom
| | - Manas Mahale
- Bombay College of Pharmacy Kalina Santacruz (E), Mumbai 400 098, India
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02141, United States
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, Uppsala SE-75124, Sweden
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Rd, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
5
|
Liu G, Seal S, Arevalo J, Liang Z, Carpenter AE, Jiang M, Singh S. Learning Molecular Representation in a Cell. ARXIV 2024:arXiv:2406.12056v2. [PMID: 38947938 PMCID: PMC11213146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Predicting drug efficacy and safety in vivo requires information on biological responses (e.g., cell morphology and gene expression) to small molecule perturbations. However, current molecular representation learning methods do not provide a comprehensive view of cell states under these perturbations and struggle to remove noise, hindering model generalization. We introduce the Information Alignment (InfoAlign) approach to learn molecular representations through the information bottleneck method in cells. We integrate molecules and cellular response data as nodes into a context graph, connecting them with weighted edges based on chemical, biological, and computational criteria. For each molecule in a training batch, InfoAlign optimizes the encoder's latent representation with a minimality objective to discard redundant structural information. A sufficiency objective decodes the representation to align with different feature spaces from the molecule's neighborhood in the context graph. We demonstrate that the proposed sufficiency objective for alignment is tighter than existing encoder-based contrastive methods. Empirically, we validate representations from InfoAlign in two downstream tasks: molecular property prediction against up to 19 baseline methods across four datasets, plus zero-shot molecule-morphology matching.
Collapse
|
6
|
Alijagic A, Sinisalu L, Duberg D, Kotlyar O, Scherbak N, Engwall M, Orešič M, Hyötyläinen T. Metabolic and phenotypic changes induced by PFAS exposure in two human hepatocyte cell models. ENVIRONMENT INTERNATIONAL 2024; 190:108820. [PMID: 38906088 DOI: 10.1016/j.envint.2024.108820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/13/2024] [Accepted: 06/13/2024] [Indexed: 06/23/2024]
Abstract
PFAS are ubiquitous industrial chemicals with known adverse health effects, particularly on the liver. The liver, being a vital metabolic organ, is susceptible to PFAS-induced metabolic dysregulation, leading to conditions such as hepatotoxicity and metabolic disturbances. In this study, we investigated the phenotypic and metabolic responses of PFAS exposure using two hepatocyte models, HepG2 (male cell line) and HepaRG (female cell line), aiming to define phenotypic alterations, and metabolic disturbances at the metabolite and pathway levels. The PFAS mixture composition was selected based on epidemiological data, covering a broad concentration spectrum observed in diverse human populations. Phenotypic profiling by Cell Painting assay disclosed predominant effects of PFAS exposure on mitochondrial structure and function in both cell models as well as effects on F-actin, Golgi apparatus, and plasma membrane-associated measures. We employed comprehensive metabolic characterization using liquid chromatography combined with high-resolution mass spectrometry (LC-HRMS). We observed dose-dependent changes in the metabolic profiles, particularly in lipid, steroid, amino acid and sugar and carbohydrate metabolism in both cells as well as in cell media, with HepaRG cell line showing a stronger metabolic response. In cells, most of the bile acids, acylcarnitines and free fatty acids showed downregulation, while medium-chain fatty acids and carnosine were upregulated, while the cell media showed different response especially in relation to the bile acids in HepaRG cell media. Importantly, we observed also nonmonotonic response for several phenotypic features and metabolites. On the pathway level, PFAS exposure was also associated with pathways indicating oxidative stress and inflammatory responses. Taken together, our findings on PFAS-induced phenotypic and metabolic disruptions in hepatocytes shed light on potential mechanisms contributing to the broader comprehension of PFAS-related health risks.
Collapse
Affiliation(s)
- Andi Alijagic
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden; Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, SE-701 82 Örebro, Sweden
| | - Lisanna Sinisalu
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Daniel Duberg
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Oleksandr Kotlyar
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden; Centre for Applied Autonomous Sensor Systems (AASS), Mobile Robotics and Olfaction Lab (MRO), Örebro University, SE-701 82 Örebro, Sweden
| | - Nikolai Scherbak
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Magnus Engwall
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden
| | - Matej Orešič
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, SE-701 82 Örebro, Sweden; Turku Centre for Biotechnology, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland; Department of Life Technologies, University of Turku, FI-20014 Turku, Finland
| | - Tuulia Hyötyläinen
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, SE-701 82 Örebro, Sweden.
| |
Collapse
|
7
|
Seal S, Williams DP, Hosseini-Gerami L, Mahale M, Carpenter AE, Spjuth O, Bender A. Improved Detection of Drug-Induced Liver Injury by Integrating Predicted in vivo and in vitro Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.10.575128. [PMID: 38895462 PMCID: PMC11185581 DOI: 10.1101/2024.01.10.575128] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Drug-induced liver injury (DILI) has been significant challenge in drug discovery, often leading to clinical trial failures and necessitating drug withdrawals. The existing suite of in vitro proxy-DILI assays is generally effective at identifying compounds with hepatotoxicity. However, there is considerable interest in enhancing in silico prediction of DILI because it allows for the evaluation of large sets of compounds more quickly and cost-effectively, particularly in the early stages of projects. In this study, we aim to study ML models for DILI prediction that first predicts nine proxy-DILI labels and then uses them as features in addition to chemical structural features to predict DILI. The features include in vitro (e.g., mitochondrial toxicity, bile salt export pump inhibition) data, in vivo (e.g., preclinical rat hepatotoxicity studies) data, pharmacokinetic parameters of maximum concentration, structural fingerprints, and physicochemical parameters. We trained DILI-prediction models on 888 compounds from the DILIst dataset and tested on a held-out external test set of 223 compounds from DILIst dataset. The best model, DILIPredictor, attained an AUC-ROC of 0.79. This model enabled the detection of top 25 toxic compounds compared to models using only structural features (2.68 LR+ score). Using feature interpretation from DILIPredictor, we were able to identify the chemical substructures causing DILI as well as differentiate cases DILI is caused by compounds in animals but not in humans. For example, DILIPredictor correctly recognized 2-butoxyethanol as non-toxic in humans despite its hepatotoxicity in mice models. Overall, the DILIPredictor model improves the detection of compounds causing DILI with an improved differentiation between animal and human sensitivity as well as the potential for mechanism evaluation. DILIPredictor is publicly available at https://broad.io/DILIPredictor for use via web interface and with all code available for download and local implementation via https://pypi.org/project/dilipred/.
Collapse
Affiliation(s)
- Srijit Seal
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Rd, CB2 1EW, Cambridge, United Kingdom
- Imaging Platform, Broad Institute of MIT and Harvard, US
| | - Dominic P. Williams
- Safety Innovation, Clinical Pharmacology and Safety Sciences, AstraZeneca, Cambridge CB4 0FZ, United Kingdom
- Quantitative Biology, Discovery Sciences, R&D, AstraZeneca, Cambridge CB4 0FZ, United Kingdom
| | | | - Manas Mahale
- Bombay College of Pharmacy Kalina Santacruz (E), Mumbai 400 098, India
| | | | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, SE-75124, Uppsala, Sweden
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Rd, CB2 1EW, Cambridge, United Kingdom
| |
Collapse
|
8
|
Chung E, Wen X, Jia X, Ciallella HL, Aleksunes LM, Zhu H. Hybrid non-animal modeling: A mechanistic approach to predict chemical hepatotoxicity. JOURNAL OF HAZARDOUS MATERIALS 2024; 471:134297. [PMID: 38677119 DOI: 10.1016/j.jhazmat.2024.134297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Revised: 04/10/2024] [Accepted: 04/11/2024] [Indexed: 04/29/2024]
Abstract
Developing mechanistic non-animal testing methods based on the adverse outcome pathway (AOP) framework must incorporate molecular and cellular key events associated with target toxicity. Using data from an in vitro assay and chemical structures, we aimed to create a hybrid model to predict hepatotoxicants. We first curated a reference dataset of 869 compounds for hepatotoxicity modeling. Then, we profiled them against PubChem for existing in vitro toxicity data. Of the 2560 resulting assays, we selected the mitochondrial membrane potential (MMP) assay, a high-throughput screening (HTS) tool that can test chemical disruptors for mitochondrial function. Machine learning was applied to develop quantitative structure-activity relationship (QSAR) models with 2536 compounds tested in the MMP assay for screening new compounds. The MMP assay results, including QSAR model outputs, yielded hepatotoxicity predictions for reference set compounds with a Correct Classification Ratio (CCR) of 0.59. The predictivity improved by including 37 structural alerts (CCR = 0.8). We validated our model by testing 37 reference set compounds in human HepG2 hepatoma cells, and reliably predicting them for hepatotoxicity (CCR = 0.79). This study introduces a novel AOP modeling strategy that combines public HTS data, computational modeling, and experimental testing to predict chemical hepatotoxicity.
Collapse
Affiliation(s)
- Elena Chung
- Department of Chemistry and Biochemistry, Rowan University, NJ, USA; Center for Biomedical Informatics and Genomics, Tulane University, New Orleans, LA, USA
| | - Xia Wen
- Department of Pharmacology and Toxicology, Rutgers University, Piscataway, NJ, USA
| | - Xuelian Jia
- Department of Chemistry and Biochemistry, Rowan University, NJ, USA; Center for Biomedical Informatics and Genomics, Tulane University, New Orleans, LA, USA
| | - Heather L Ciallella
- Department of Toxicology, Cuyahoga County Medical Examiner's Office, Cleveland, OH, USA
| | - Lauren M Aleksunes
- Department of Pharmacology and Toxicology, Rutgers University, Piscataway, NJ, USA
| | - Hao Zhu
- Department of Chemistry and Biochemistry, Rowan University, NJ, USA; Center for Biomedical Informatics and Genomics, Tulane University, New Orleans, LA, USA.
| |
Collapse
|
9
|
Le Du-Carrée J, Palacios CK, Rotander A, Larsson M, Alijagic A, Kotlyar O, Engwall M, Sjöberg V, Keiter SH, Almeda R. Cocktail effects of tire wear particles leachates on diverse biological models: A multilevel analysis. JOURNAL OF HAZARDOUS MATERIALS 2024; 471:134401. [PMID: 38678714 DOI: 10.1016/j.jhazmat.2024.134401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Revised: 04/03/2024] [Accepted: 04/23/2024] [Indexed: 05/01/2024]
Abstract
Tire wear particles (TWP) stand out as a major contributor to microplastic pollution, yet their environmental impact remains inadequately understood. This study delves into the cocktail effects of TWP leachates, employing molecular, cellular, and organismal assessments on diverse biological models. Extracted in artificial seawater and analyzed for metals and organic compounds, TWP leachates revealed the presence of polyaromatic hydrocarbons and 4-tert-octylphenol. Exposure to TWP leachates (1.5 to 1000 mg peq L-1) inhibited algae growth and induced zebrafish embryotoxicity, pigment alterations, and behavioral changes. Cell painting uncovered pro-apoptotic changes, while mechanism-specific gene-reporter assays highlighted endocrine-disrupting potential, particularly antiandrogenic effects. Although heavy metals like zinc have been suggested as major players in TWP leachate toxicity, this study emphasizes water-leachable organic compounds as the primary causative agents of observed acute toxicity. The findings underscore the need to reduce TWP pollution in aquatic systems and enhance regulations governing highly toxic tire additives.
Collapse
Affiliation(s)
- Jessy Le Du-Carrée
- University of Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain.
| | - Clara Kempkens Palacios
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Anna Rotander
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Maria Larsson
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Andi Alijagic
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden; Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, SE-701 82 Örebro, Sweden; Faculty of Medicine and Health, School of Medical Sciences, Örebro University, SE-701 82 Örebro, Sweden
| | - Oleksandr Kotlyar
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden; Centre for Applied Autonomous Sensor Systems (AASS), Mobile Robotics and Olfaction Lab (MRO), Örebro University, SE-701 82 Örebro, Sweden
| | - Magnus Engwall
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Viktor Sjöberg
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Steffen H Keiter
- Man-Technology-Environment Research Center (MTM), Biology, Örebro University, SE-701 82 Örebro, Sweden
| | - Rodrigo Almeda
- University of Las Palmas de Gran Canaria: Las Palmas de Gran Canaria, Spain
| |
Collapse
|
10
|
Tang Q, Ratnayake R, Seabra G, Jiang Z, Fang R, Cui L, Ding Y, Kahveci T, Bian J, Li C, Luesch H, Li Y. Morphological profiling for drug discovery in the era of deep learning. Brief Bioinform 2024; 25:bbae284. [PMID: 38886164 PMCID: PMC11182685 DOI: 10.1093/bib/bbae284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Revised: 05/13/2024] [Accepted: 06/03/2024] [Indexed: 06/20/2024] Open
Abstract
Morphological profiling is a valuable tool in phenotypic drug discovery. The advent of high-throughput automated imaging has enabled the capturing of a wide range of morphological features of cells or organisms in response to perturbations at the single-cell resolution. Concurrently, significant advances in machine learning and deep learning, especially in computer vision, have led to substantial improvements in analyzing large-scale high-content images at high throughput. These efforts have facilitated understanding of compound mechanism of action, drug repurposing, characterization of cell morphodynamics under perturbation, and ultimately contributing to the development of novel therapeutics. In this review, we provide a comprehensive overview of the recent advances in the field of morphological profiling. We summarize the image profiling analysis workflow, survey a broad spectrum of analysis strategies encompassing feature engineering- and deep learning-based approaches, and introduce publicly available benchmark datasets. We place a particular emphasis on the application of deep learning in this pipeline, covering cell segmentation, image representation learning, and multimodal learning. Additionally, we illuminate the application of morphological profiling in phenotypic drug discovery and highlight potential challenges and opportunities in this field.
Collapse
Affiliation(s)
- Qiaosi Tang
- Calico Life Sciences, South San Francisco, CA 94080, United States
| | - Ranjala Ratnayake
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Gustavo Seabra
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Zhe Jiang
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, FL 32611, United States
| | - Ruogu Fang
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, FL 32611, United States
- J. Crayton Pruitt Family Department of Biomedical Engineering, Herbert Wertheim College of Engineering, University of Florida, Gainesville, FL 32611, United States
| | - Lina Cui
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Yousong Ding
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Tamer Kahveci
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, FL 32611, United States
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, FL 32611, United States
| | - Chenglong Li
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Hendrik Luesch
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
| | - Yanjun Li
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, United States
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, FL 32611, United States
| |
Collapse
|
11
|
Seal S, Trapotsi MA, Spjuth O, Singh S, Carreras-Puigvert J, Greene N, Bender A, Carpenter AE. A Decade in a Systematic Review: The Evolution and Impact of Cell Painting. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.04.592531. [PMID: 38766203 PMCID: PMC11100607 DOI: 10.1101/2024.05.04.592531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
High-content image-based assays have fueled significant discoveries in the life sciences in the past decade (2013-2023), including novel insights into disease etiology, mechanism of action, new therapeutics, and toxicology predictions. Here, we systematically review the substantial methodological advancements and applications of Cell Painting. Advancements include improvements in the Cell Painting protocol, assay adaptations for different types of perturbations and applications, and improved methodologies for feature extraction, quality control, and batch effect correction. Moreover, machine learning methods recently surpassed classical approaches in their ability to extract biologically useful information from Cell Painting images. Cell Painting data have been used alone or in combination with other - omics data to decipher the mechanism of action of a compound, its toxicity profile, and many other biological effects. Overall, key methodological advances have expanded Cell Painting's ability to capture cellular responses to various perturbations. Future advances will likely lie in advancing computational and experimental techniques, developing new publicly available datasets, and integrating them with other high-content data types.
Collapse
Affiliation(s)
- Srijit Seal
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW, Cambridge, United Kingdom
| | - Maria-Anna Trapotsi
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 1 Francis Crick Avenue, Cambridge, CB2 0AA, United Kingdom
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, SE-75124, Uppsala, Sweden
| | - Shantanu Singh
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 1 Francis Crick Avenue, Cambridge, CB2 0AA, United Kingdom
| | - Jordi Carreras-Puigvert
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, SE-75124, Uppsala, Sweden
| | - Nigel Greene
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 35 Gatehouse Drive, Waltham, MA 02451, USA
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW, Cambridge, United Kingdom
| | - Anne E. Carpenter
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States
| |
Collapse
|
12
|
Seal S, Trapotsi MA, Spjuth O, Singh S, Carreras-Puigvert J, Greene N, Bender A, Carpenter AE. A Decade in a Systematic Review: The Evolution and Impact of Cell Painting. ARXIV 2024:arXiv:2405.02767v1. [PMID: 38745696 PMCID: PMC11092692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
High-content image-based assays have fueled significant discoveries in the life sciences in the past decade (2013-2023), including novel insights into disease etiology, mechanism of action, new therapeutics, and toxicology predictions. Here, we systematically review the substantial methodological advancements and applications of Cell Painting. Advancements include improvements in the Cell Painting protocol, assay adaptations for different types of perturbations and applications, and improved methodologies for feature extraction, quality control, and batch effect correction. Moreover, machine learning methods recently surpassed classical approaches in their ability to extract biologically useful information from Cell Painting images. Cell Painting data have been used alone or in combination with other -omics data to decipher the mechanism of action of a compound, its toxicity profile, and many other biological effects. Overall, key methodological advances have expanded Cell Painting's ability to capture cellular responses to various perturbations. Future advances will likely lie in advancing computational and experimental techniques, developing new publicly available datasets, and integrating them with other high-content data types.
Collapse
Affiliation(s)
- Srijit Seal
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW, Cambridge, United Kingdom
| | - Maria-Anna Trapotsi
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 1 Francis Crick Avenue, Cambridge, CB2 0AA, United Kingdom
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, SE-75124, Uppsala, Sweden
| | - Shantanu Singh
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 1 Francis Crick Avenue, Cambridge, CB2 0AA, United Kingdom
| | - Jordi Carreras-Puigvert
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box 591, SE-75124, Uppsala, Sweden
| | - Nigel Greene
- Imaging and Data Analytics, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, 35 Gatehouse Drive, Waltham, MA 02451, USA
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, CB2 1EW, Cambridge, United Kingdom
| | - Anne E. Carpenter
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, United States
| |
Collapse
|
13
|
Fredin Haslum J, Lardeau CH, Karlsson J, Turkki R, Leuchowius KJ, Smith K, Müllers E. Cell Painting-based bioactivity prediction boosts high-throughput screening hit-rates and compound diversity. Nat Commun 2024; 15:3470. [PMID: 38658534 PMCID: PMC11043326 DOI: 10.1038/s41467-024-47171-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 03/22/2024] [Indexed: 04/26/2024] Open
Abstract
Identifying active compounds for a target is a time- and resource-intensive task in early drug discovery. Accurate bioactivity prediction using morphological profiles could streamline the process, enabling smaller, more focused compound screens. We investigate the potential of deep learning on unrefined single-concentration activity readouts and Cell Painting data, to predict compound activity across 140 diverse assays. We observe an average ROC-AUC of 0.744 ± 0.108 with 62% of assays achieving ≥0.7, 30% ≥0.8, and 7% ≥0.9. In many cases, the high prediction performance can be achieved using only brightfield images instead of multichannel fluorescence images. A comprehensive analysis shows that Cell Painting-based bioactivity prediction is robust across assay types, technologies, and target classes, with cell-based assays and kinase targets being particularly well-suited for prediction. Experimental validation confirms the enrichment of active compounds. Our findings indicate that models trained on Cell Painting data, combined with a small set of single-concentration data points, can reliably predict the activity of a compound library across diverse targets and assays while maintaining high hit rates and scaffold diversity. This approach has the potential to reduce the size of screening campaigns, saving time and resources, and enabling primary screening with more complex assays.
Collapse
Affiliation(s)
- Johan Fredin Haslum
- KTH Royal Institute of Technology, Stockholm, Sweden
- Science for Life Laboratory, Stockholm, Sweden
- Research and Early Development, Cardiovascular, Renal and Metabolism (CVRM), BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden
| | | | - Johan Karlsson
- Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | - Riku Turkki
- Discovery Sciences, R&D, AstraZeneca, Gothenburg, Sweden
| | | | - Kevin Smith
- KTH Royal Institute of Technology, Stockholm, Sweden
- Science for Life Laboratory, Stockholm, Sweden
| | - Erik Müllers
- Research and Early Development, Cardiovascular, Renal and Metabolism (CVRM), BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden.
| |
Collapse
|
14
|
Godinez WJ, Trifonov V, Fang B, Kuzu G, Pei L, Guiguemde WA, Martin EJ, King FJ, Jenkins JL, Skewes-Cox P. Compound Activity Prediction with Dose-Dependent Transcriptomic Profiles and Deep Learning. J Chem Inf Model 2024; 64:2695-2704. [PMID: 38293736 DOI: 10.1021/acs.jcim.3c01855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2024]
Abstract
Predicting compound activity in assays is a long-standing challenge in drug discovery. Computational models based on compound-induced gene expression signatures from a single profiling assay have shown promise toward predicting compound activity in other, seemingly unrelated, assays. Applications of such models include predicting mechanisms-of-action (MoA) for phenotypic hits, identifying off-target activities, and identifying polypharmacologies. Here, we introduce transcriptomics-to-activity transformer (TAT) models that leverage gene expression profiles observed over compound treatment at multiple concentrations to predict the compound activity in other biochemical or cellular assays. We built TAT models based on gene expression data from a RASL-seq assay to predict the activity of 2692 compounds in 262 dose-response assays. We obtained useful models for 51% of the assays, as determined through a realistic held-out set. Prospectively, we experimentally validated the activity predictions of a TAT model in a malaria inhibition assay. With a 63% hit rate, TAT successfully identified several submicromolar malaria inhibitors. Our results thus demonstrate the potential of transcriptomic responses over compound concentration and the TAT modeling framework as a cost-efficient way to identify the bioactivities of promising compounds across many assays.
Collapse
Affiliation(s)
- William J Godinez
- Novartis Institutes for BioMedical Research, Emeryville, California 94608, United States
| | - Vladimir Trifonov
- Novartis Institutes for BioMedical Research, San Diego, California 92121, United States
| | - Bin Fang
- Novartis Institutes for BioMedical Research, San Diego, California 92121, United States
| | - Guray Kuzu
- Novartis Institutes for BioMedical Research, San Diego, California 92121, United States
| | - Luying Pei
- Novartis Institutes for BioMedical Research, Emeryville, California 94608, United States
| | - W Armand Guiguemde
- Novartis Institutes for BioMedical Research, Emeryville, California 94608, United States
| | - Eric J Martin
- Novartis Institutes for BioMedical Research, Emeryville, California 94608, United States
| | - Frederick J King
- Novartis Institutes for BioMedical Research, San Diego, California 92121, United States
| | - Jeremy L Jenkins
- Novartis Institutes for BioMedical Research, Cambridge, Massachusetts 02139, United States
| | - Peter Skewes-Cox
- Novartis Institutes for BioMedical Research, Emeryville, California 94608, United States
| |
Collapse
|
15
|
Lee J, König M, Braun G, Escher BI. Water Quality Monitoring with the Multiplexed Assay MitoOxTox for Mitochondrial Toxicity, Oxidative Stress Response, and Cytotoxicity in AREc32 Cells. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2024; 58:5716-5726. [PMID: 38503264 PMCID: PMC10993414 DOI: 10.1021/acs.est.3c09844] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 03/06/2024] [Accepted: 03/06/2024] [Indexed: 03/21/2024]
Abstract
Mitochondria play a key role in the energy production of cells, but their function can be disturbed by environmental toxicants. We developed a cell-based mitochondrial toxicity assay for environmental chemicals and their mixtures extracted from water samples. The reporter gene cell line AREc32, which is frequently used to quantify the cytotoxicity and oxidative stress response of water samples, was multiplexed with an endpoint of mitochondrial toxicity. The disruption of the mitochondrial membrane potential (MMP) was quantified by high-content imaging and compared to measured cytotoxicity, predicted baseline toxicity, and activation of the oxidative stress response. Mitochondrial complex I inhibitors showed highly specific effects on the MMP, with minor effects on cell viability. Uncouplers showed a wide distribution of specificity on the MMP, often accompanied by specific cytotoxicity (enhanced over baseline toxicity). Mitochondrial toxicity and the oxidative stress response were not directly associated. The multiplexed assay was applied to water samples ranging from wastewater treatment plant (WWTP) influent and effluent and surface water to drinking and bottled water from various European countries. Specific effects on MMP were observed for the WWTP influent and effluent. This new MitoOxTox assay is an important complement for existing in vitro test batteries for water quality testing and has potential for applications in human biomonitoring.
Collapse
Affiliation(s)
- Jungeun Lee
- Department
of Cell Toxicology, UFZ—Helmholtz
Centre for Environmental Research, 04318 Leipzig, Germany
| | - Maria König
- Department
of Cell Toxicology, UFZ—Helmholtz
Centre for Environmental Research, 04318 Leipzig, Germany
| | - Georg Braun
- Department
of Cell Toxicology, UFZ—Helmholtz
Centre for Environmental Research, 04318 Leipzig, Germany
| | - Beate I. Escher
- Department
of Cell Toxicology, UFZ—Helmholtz
Centre for Environmental Research, 04318 Leipzig, Germany
- Environmental
Toxicology, Department of Geosciences, Eberhard
Karls University, Schnarrenbergstr.
94-96, 72076 Tübingen, Germany
| |
Collapse
|
16
|
Sharma R, Saghapour E, Chen JY. An NLP-based technique to extract meaningful features from drug SMILES. iScience 2024; 27:109127. [PMID: 38455979 PMCID: PMC10918220 DOI: 10.1016/j.isci.2024.109127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 09/30/2023] [Accepted: 02/01/2024] [Indexed: 03/09/2024] Open
Abstract
NLP is a well-established field in ML for developing language models that capture the sequence of words in a sentence. Similarly, drug molecule structures can also be represented as sequences using the SMILES notation. However, unlike natural language texts, special characters in drug SMILES have specific meanings and cannot be ignored. We introduce a novel NLP-based method that extracts interpretable sequences and essential features from drug SMILES notation using N-grams. Our method compares these features to Morgan fingerprint bit-vectors using UMAP-based embedding, and we validate its effectiveness through two personalized drug screening (PSD) case studies. Our NLP-based features are sparse and, when combined with gene expressions and disease phenotype features, produce better ML models for PSD. This approach provides a new way to analyze drug molecule structures represented as SMILES notation, which can help accelerate drug discovery efforts. We have also made our method accessible through a Python library.
Collapse
Affiliation(s)
- Rahul Sharma
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Ehsan Saghapour
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| | - Jake Y. Chen
- Informatics Institute, School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA
| |
Collapse
|
17
|
Seal S, Carreras-Puigvert J, Singh S, Carpenter AE, Spjuth O, Bender A. From pixels to phenotypes: Integrating image-based profiling with cell health data as BioMorph features improves interpretability. Mol Biol Cell 2024; 35:mr2. [PMID: 38170589 PMCID: PMC10916876 DOI: 10.1091/mbc.e23-08-0298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 12/07/2023] [Accepted: 12/22/2023] [Indexed: 01/05/2024] Open
Abstract
Cell Painting assays generate morphological profiles that are versatile descriptors of biological systems and have been used to predict in vitro and in vivo drug effects. However, Cell Painting features extracted from classical software such as CellProfiler are based on statistical calculations and often not readily biologically interpretable. In this study, we propose a new feature space, which we call BioMorph, that maps these Cell Painting features with readouts from comprehensive Cell Health assays. We validated that the resulting BioMorph space effectively connected compounds not only with the morphological features associated with their bioactivity but with deeper insights into phenotypic characteristics and cellular processes associated with the given bioactivity. The BioMorph space revealed the mechanism of action for individual compounds, including dual-acting compounds such as emetine, an inhibitor of both protein synthesis and DNA replication. Overall, BioMorph space offers a biologically relevant way to interpret the cell morphological features derived using software such as CellProfiler and to generate hypotheses for experimental validation.
Collapse
Affiliation(s)
- Srijit Seal
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge MA 02142
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| | - Jordi Carreras-Puigvert
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, 752 37 Uppsala, Sweden
| | - Shantanu Singh
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge MA 02142
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge MA 02142
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, 752 37 Uppsala, Sweden
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
18
|
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, Carpenter AE. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA Drug-Induced Cardiotoxicity Rank. J Chem Inf Model 2024; 64:1172-1186. [PMID: 38300851 PMCID: PMC10900289 DOI: 10.1021/acs.jcim.3c01834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/11/2024] [Accepted: 01/16/2024] [Indexed: 02/03/2024]
Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of chemical and biological data to predict cardiotoxicity, using the recently released DICTrank data set from the United States FDA. We found that such data, including protein targets, especially those related to ion channels (e.g., hERG), physicochemical properties (e.g., electrotopological state), and peak concentration in plasma offer strong predictive ability for DICT. Compounds annotated with mechanisms of action such as cyclooxygenase inhibition could distinguish between most-concern and no-concern DICT. Cell Painting features for ER stress discerned most-concern cardiotoxic from nontoxic compounds. Models based on physicochemical properties provided substantial predictive accuracy (AUCPR = 0.93). With the availability of omics data in the future, using biological data promises enhanced predictability and deeper mechanistic insights, paving the way for safer drug development. All models from this study are available at https://broad.io/DICTrank_Predictor.
Collapse
Affiliation(s)
- Srijit Seal
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Ola Spjuth
- Department
of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Box
591, SE-75124 Uppsala, Sweden
| | - Layla Hosseini-Gerami
- Ignota
Labs, The Bradfield Centre, Cambridge Science Park, County Hall, Westminster Bridge Road, Cambridge CB4 0GA, U.K.
| | - Miguel García-Ortegón
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Shantanu Singh
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| | - Andreas Bender
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Anne E. Carpenter
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| |
Collapse
|
19
|
Horne R, Wilson-Godber J, González Díaz A, Brotzakis ZF, Seal S, Gregory RC, Possenti A, Chia S, Vendruscolo M. Using Generative Modeling to Endow with Potency Initially Inert Compounds with Good Bioavailability and Low Toxicity. J Chem Inf Model 2024; 64:590-596. [PMID: 38261763 PMCID: PMC10865343 DOI: 10.1021/acs.jcim.3c01777] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2023] [Revised: 12/10/2023] [Accepted: 12/12/2023] [Indexed: 01/25/2024]
Abstract
In the early stages of drug development, large chemical libraries are typically screened to identify compounds of promising potency against the chosen targets. Often, however, the resulting hit compounds tend to have poor drug metabolism and pharmacokinetics (DMPK), with negative developability features that may be difficult to eliminate. Therefore, starting the drug discovery process with a "null library", compounds that have highly desirable DMPK properties but no potency against the chosen targets, could be advantageous. Here, we explore the opportunities offered by machine learning to realize this strategy in the case of the inhibition of α-synuclein aggregation, a process associated with Parkinson's disease. We apply MolDQN, a generative machine learning method, to build an inhibitory activity against α-synuclein aggregation into an initial inactive compound with good DMPK properties. Our results illustrate how generative modeling can be used to endow initially inert compounds with desirable developability properties.
Collapse
Affiliation(s)
- Robert
I. Horne
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Jared Wilson-Godber
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Alicia González Díaz
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Z. Faidon Brotzakis
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Srijit Seal
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
- Imaging
Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States
| | - Rebecca C. Gregory
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Andrea Possenti
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| | - Sean Chia
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
- Bioprocessing
Technology Institute, Agency for Science, Technology and Research (A*STAR), 138668 Singapore, Singapore
| | - Michele Vendruscolo
- Centre
for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United
Kingdom
| |
Collapse
|
20
|
Alijagic A, Kotlyar O, Larsson M, Salihovic S, Hedbrant A, Eriksson U, Karlsson P, Persson A, Scherbak N, Färnlund K, Engwall M, Särndahl E. Immunotoxic, genotoxic, and endocrine disrupting impacts of polyamide microplastic particles and chemicals. ENVIRONMENT INTERNATIONAL 2024; 183:108412. [PMID: 38183898 DOI: 10.1016/j.envint.2023.108412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 12/06/2023] [Accepted: 12/28/2023] [Indexed: 01/08/2024]
Abstract
Due to their exceptional properties and cost effectiveness, polyamides or nylons have emerged as widely used materials, revolutionizing diverse industries, including industrial 3D printing or additive manufacturing (AM). Powder-based AM technologies employ tonnes of polyamide microplastics to produce complex components every year. However, the lack of comprehensive toxicity assessment of particulate polyamides and polyamide-associated chemicals, especially in the light of the global microplastics crisis, calls for urgent action. This study investigated the physicochemical properties of polyamide-12 microplastics used in AM, and assessed a number of toxicity endpoints focusing on inflammation, immunometabolism, genotoxicity, aryl hydrocarbon receptor (AhR) activation, endocrine disruption, and cell morphology. Specifically, microplastics examination by means of field emission scanning electron microscopy revealed that work flow reuse of material created a fraction of smaller particles with an average size of 1-5 µm, a size range readily available for uptake by human cells. Moreover, chemical analysis by means of gas chromatography high-resolution mass spectrometry detected several polyamide-associated chemicals including starting material, plasticizer, thermal stabilizer/antioxidant, and migrating slip additive. Even if polyamide particles and chemicals did not induce an acute inflammatory response, repeated and prolonged exposure of human primary macrophages disclosed a steady increase in the levels of proinflammatory chemokine Interleukin-8 (IL-8/CXCL-8). Moreover, targeted metabolomics disclosed that polyamide particles modulated the kynurenine pathway and some of its key metabolites. The p53-responsive luciferase reporter gene assay showed that particles per se were able to activate p53, being indicative of a genotoxic stress. Polyamide-associated chemicals triggered moderate activation of AhR and elicited anti-androgenic activity. Finally, a high-throughput and non-targeted morphological profiling by Cell Painting assay outlined major sites of bioactivity of polyamide-associated chemicals and indicated putative mechanisms of toxicity in the cells. These findings reveal that the increasing use of polyamide microplastics may pose a potential health risk for the exposed individuals, and it merits more attention.
Collapse
Affiliation(s)
- Andi Alijagic
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden; Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden.
| | - Oleksandr Kotlyar
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden; Centre for Applied Autonomous Sensor Systems (AASS), Mobile Robotics and Olfaction Lab (MRO), Örebro University, SE-701 82 Örebro, Sweden
| | - Maria Larsson
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden
| | - Samira Salihovic
- Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden
| | - Alexander Hedbrant
- Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden
| | - Ulrika Eriksson
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden
| | - Patrik Karlsson
- Department of Mechanical Engineering, Örebro University, Örebro SE-701 82, Sweden
| | - Alexander Persson
- Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden
| | - Nikolai Scherbak
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden
| | | | - Magnus Engwall
- Man-Technology-Environment Research Center (MTM), Örebro University, Örebro SE-701 82, Sweden
| | - Eva Särndahl
- Inflammatory Response and Infection Susceptibility Centre (iRiSC), Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden; School of Medical Sciences, Faculty of Medicine and Health, Örebro University, Örebro SE-701 82, Sweden
| |
Collapse
|
21
|
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, Carpenter AE. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA DICTrank. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.15.562398. [PMID: 37905146 PMCID: PMC10614794 DOI: 10.1101/2023.10.15.562398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of various chemical and biological data to predict cardiotoxicity, using the recently released Drug-Induced Cardiotoxicity Rank (DICTrank) dataset from the United States FDA. We analyzed a diverse set of data sources, including physicochemical properties, annotated mechanisms of action (MOA), Cell Painting, Gene Expression, and more, to identify indications of cardiotoxicity. We found that such data, including protein targets, especially those related to ion channels (such as hERG), physicochemical properties (such as electrotopological state) as well as peak concentration in plasma offer strong predictive ability as well as valuable insights into DICT. We also found compounds annotated with particular mechanisms of action, such as cyclooxygenase inhibition, could distinguish between most-concern and no-concern DICT compounds. Cell Painting features related to ER stress discern the most-concern cardiotoxic compounds from non-toxic compounds. While models based on physicochemical properties currently provide substantial predictive accuracy (AUCPR = 0.93), this study also underscores the potential benefits of incorporating more comprehensive biological data in future DICT predictive models. With the availability of - omics data in the future, using biological data promises enhanced predictability and delivers deeper mechanistic insights, paving the way for safer therapeutic drug development. All models and data used in this study are publicly released at https://broad.io/DICTrank_Predictor.
Collapse
Affiliation(s)
- Srijit Seal
- Imaging Platform, Broad Institute of MIT and Harvard, US
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences, Uppsala University, Sweden
| | | | | | - Shantanu Singh
- Imaging Platform, Broad Institute of MIT and Harvard, US
| | | | | |
Collapse
|
22
|
Way GP, Sailem H, Shave S, Kasprowicz R, Carragher NO. Evolution and impact of high content imaging. SLAS DISCOVERY : ADVANCING LIFE SCIENCES R & D 2023; 28:292-305. [PMID: 37666456 DOI: 10.1016/j.slasd.2023.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 08/09/2023] [Accepted: 08/29/2023] [Indexed: 09/06/2023]
Abstract
The field of high content imaging has steadily evolved and expanded substantially across many industry and academic research institutions since it was first described in the early 1990's. High content imaging refers to the automated acquisition and analysis of microscopic images from a variety of biological sample types. Integration of high content imaging microscopes with multiwell plate handling robotics enables high content imaging to be performed at scale and support medium- to high-throughput screening of pharmacological, genetic and diverse environmental perturbations upon complex biological systems ranging from 2D cell cultures to 3D tissue organoids to small model organisms. In this perspective article the authors provide a collective view on the following key discussion points relevant to the evolution of high content imaging: • Evolution and impact of high content imaging: An academic perspective • Evolution and impact of high content imaging: An industry perspective • Evolution of high content image analysis • Evolution of high content data analysis pipelines towards multiparametric and phenotypic profiling applications • The role of data integration and multiomics • The role and evolution of image data repositories and sharing standards • Future perspective of high content imaging hardware and software.
Collapse
Affiliation(s)
- Gregory P Way
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Heba Sailem
- School of Cancer and Pharmaceutical Sciences, King's College London, UK
| | - Steven Shave
- GlaxoSmithKline Medicines Research Centre, Gunnels Wood Rd, Stevenage SG1 2NY, UK; Edinburgh Cancer Research, Cancer Research UK Scotland Centre, Institute of Genetics and Cancer, University of Edinburgh, UK
| | - Richard Kasprowicz
- GlaxoSmithKline Medicines Research Centre, Gunnels Wood Rd, Stevenage SG1 2NY, UK
| | - Neil O Carragher
- Edinburgh Cancer Research, Cancer Research UK Scotland Centre, Institute of Genetics and Cancer, University of Edinburgh, UK.
| |
Collapse
|
23
|
Lejal V, Cerisier N, Rouquié D, Taboureau O. Assessment of Drug-Induced Liver Injury through Cell Morphology and Gene Expression Analysis. Chem Res Toxicol 2023; 36:1456-1470. [PMID: 37652439 PMCID: PMC10523580 DOI: 10.1021/acs.chemrestox.2c00381] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Indexed: 09/02/2023]
Abstract
Drug-induced liver injury (DILI) is a significant concern in drug development, often leading to drug withdrawal. Although many studies aim to identify biomarkers and gene/pathway signatures related to liver toxicity and aim to predict DILI compounds, this remains a challenge in drug discovery. With a strong development of high-content screening/imaging (HCS/HCI) for phenotypic screening, we explored the morphological cell perturbations induced by DILI compounds. In the first step, cell morphological signatures were associated with two datasets of DILI chemicals (DILIRank and eTox). The mechanisms of action were then analyzed for chemicals having transcriptomics data and sharing similar morphological perturbations. Signaling pathways associated with liver toxicity (cell cycle, cell growth, apoptosis, ...) were then captured, and a hypothetical relation between cell morphological perturbations and gene deregulation was illustrated within our analysis. Finally, using the cell morphological signatures, machine learning approaches were developed to predict chemicals with a potential risk of DILI. Some models showed relevant performance with validation set balanced accuracies between 0.645 and 0.739. Overall, our findings demonstrate the utility of combining HCI with transcriptomics data to identify the morphological and gene expression signatures related to DILI chemicals. Moreover, our protocol could be extended to other toxicity end points, offering a promising avenue for comprehensive toxicity assessment in drug discovery.
Collapse
Affiliation(s)
- Vanille Lejal
- Université
Paris Cité, Inserm U1133, CNRS
UMR 8251, 75013, Paris, France
| | - Natacha Cerisier
- Université
Paris Cité, Inserm U1133, CNRS
UMR 8251, 75013, Paris, France
| | - David Rouquié
- Bayer
SAS, Bayer Crop Science, 355 rue Dostoïevski, CS 90153, 06906 Valbonne, Sophia-Antipolis, France
- Université
Côte d’Azur 3IA Interdisciplinary Institute in Artificial Intelligence, 06103 Nice Cedex, France
| | - Olivier Taboureau
- Université
Paris Cité, Inserm U1133, CNRS
UMR 8251, 75013, Paris, France
| |
Collapse
|
24
|
Sada Del Real K, Rubio A. Discovering the mechanism of action of drugs with a sparse explainable network. EBioMedicine 2023; 95:104767. [PMID: 37633093 PMCID: PMC10474372 DOI: 10.1016/j.ebiom.2023.104767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 07/31/2023] [Accepted: 08/08/2023] [Indexed: 08/28/2023] Open
Abstract
BACKGROUND Although Deep Neural Networks (DDNs) have been successful in predicting the efficacy of cancer drugs, the lack of explainability in their decision-making process is a significant challenge. Previous research proposed mimicking the Gene Ontology structure to allow for interpretation of each neuron in the network. However, these previous approaches require huge amount of GPU resources and hinder its extension to genome-wide models. METHODS We developed SparseGO, a sparse and interpretable neural network, for predicting drug response in cancer cell lines and their Mechanism of Action (MoA). To ensure model generalization, we trained it on multiple datasets and evaluated its performance using three cross-validation schemes. Its efficiency allows it to be used with gene expression. In addition, SparseGO integrates an eXplainable Artificial Intelligence (XAI) technique, DeepLIFT, with Support Vector Machines to computationally discover the MoA of drugs. FINDINGS SparseGO's sparse implementation significantly reduced GPU memory usage and training speed compared to other methods, allowing it to process gene expression instead of mutations as input data. SparseGO using expression improved the accuracy and enabled its use on drug repositioning. Furthermore, gene expression allows the prediction of MoA using 265 drugs to train it. It was validated on understudied drugs such as parbendazole and PD153035. INTERPRETATION SparseGO is an effective XAI method for predicting, but more importantly, understanding drug response. FUNDING The Accelerator Award Programme funded by Cancer Research UK [C355/A26819], Fundación Científica de la AECC and Fondazione AIRC, Project PIBA_2020_1_0055 funded by the Basque Government and the Synlethal Project (RETOS Investigacion, Spanish Government).
Collapse
Affiliation(s)
- Katyna Sada Del Real
- Departamento de Ingeniería Biomédica y Ciencias, TECNUN, Universidad de Navarra, San Sebastián 20018, Spain
| | - Angel Rubio
- Departamento de Ingeniería Biomédica y Ciencias, TECNUN, Universidad de Navarra, San Sebastián 20018, Spain; Instituto de Ciencia de Datos e Inteligencia Artificial (DATAI), Universidad de Navarra, Pamplona 31080, Spain.
| |
Collapse
|
25
|
Pahl A, Schölermann B, Lampe P, Rusch M, Dow M, Hedberg C, Nelson A, Sievers S, Waldmann H, Ziegler S. Morphological subprofile analysis for bioactivity annotation of small molecules. Cell Chem Biol 2023:S2451-9456(23)00159-9. [PMID: 37385259 DOI: 10.1016/j.chembiol.2023.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 03/21/2023] [Accepted: 06/02/2023] [Indexed: 07/01/2023]
Abstract
Fast prediction of the mode of action (MoA) for bioactive compounds would immensely foster bioactivity annotation in compound collections and may early on reveal off-targets in chemical biology research and drug discovery. Morphological profiling, e.g., using the Cell Painting assay, offers a fast, unbiased assessment of compound activity on various targets in one experiment. However, due to incomplete bioactivity annotation and unknown activities of reference compounds, prediction of bioactivity is not straightforward. Here we introduce the concept of subprofile analysis to map the MoA for both, reference and unexplored compounds. We defined MoA clusters and extracted cluster subprofiles that contain only a subset of morphological features. Subprofile analysis allows for the assignment of compounds to, currently, twelve targets or MoA. This approach enables rapid bioactivity annotation of compounds and will be extended to further clusters in the future.
Collapse
Affiliation(s)
- Axel Pahl
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany.
| | - Beate Schölermann
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Philipp Lampe
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Marion Rusch
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Mark Dow
- School of Chemistry and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, UK
| | - Christian Hedberg
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Adam Nelson
- School of Chemistry and Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, UK
| | - Sonja Sievers
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany
| | - Herbert Waldmann
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany; Technical University Dortmund, Faculty of Chemistry and Chemical Biology, Otto-Hahn-Strasse 6, 44227 Dortmund, Germany
| | - Slava Ziegler
- Max Planck Institute of Molecular Physiology, Department of Chemical Biology, Otto-Hahn-Strasse 11, 44227 Dortmund, Germany.
| |
Collapse
|
26
|
Herman D, Kańduła MM, Freitas LGA, van Dongen C, Le Van T, Mesens N, Jaensch S, Gustin E, Micholt L, Lardeau CH, Varsakelis C, Reumers J, Zoffmann S, Will Y, Peeters PJ, Ceulemans H. Leveraging Cell Painting Images to Expand the Applicability Domain and Actively Improve Deep Learning Quantitative Structure-Activity Relationship Models. Chem Res Toxicol 2023. [PMID: 37327474 DOI: 10.1021/acs.chemrestox.2c00404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The search for chemical hit material is a lengthy and increasingly expensive drug discovery process. To improve it, ligand-based quantitative structure-activity relationship models have been broadly applied to optimize primary and secondary compound properties. Although these models can be deployed as early as the stage of molecule design, they have a limited applicability domain─if the structures of interest differ substantially from the chemical space on which the model was trained, a reliable prediction will not be possible. Image-informed ligand-based models partly solve this shortcoming by focusing on the phenotype of a cell caused by small molecules, rather than on their structure. While this enables chemical diversity expansion, it limits the application to compounds physically available and imaged. Here, we employ an active learning approach to capitalize on both of these methods' strengths and boost the model performance of a mitochondrial toxicity assay (Glu/Gal). Specifically, we used a phenotypic Cell Painting screen to build a chemistry-independent model and adopted the results as the main factor in selecting compounds for experimental testing. With the additional Glu/Gal annotation for selected compounds we were able to dramatically improve the chemistry-informed ligand-based model with respect to the increased recognition of compounds from a 10% broader chemical space.
Collapse
Affiliation(s)
- Dorota Herman
- In-Silico Discovery, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Maciej M Kańduła
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Lorena G A Freitas
- In-Silico Discovery, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | | | - Thanh Le Van
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Natalie Mesens
- Predictive, Investigative and Translational Toxicology, PSTS, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Steffen Jaensch
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Emmanuel Gustin
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Liesbeth Micholt
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Charles-Hugues Lardeau
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Christos Varsakelis
- In-Silico Discovery, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Joke Reumers
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Sannah Zoffmann
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Yvonne Will
- Predictive, Investigative and Translational Toxicology, PSTS, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Pieter J Peeters
- Discovery Technology and Molecular Pharmacology, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| | - Hugo Ceulemans
- In-Silico Discovery, Janssen Research & Development, Pharmaceutical Companies of Johnson & Johnson, Beerse B-2340, Belgium
| |
Collapse
|
27
|
Seal S, Yang H, Trapotsi MA, Singh S, Carreras-Puigvert J, Spjuth O, Bender A. Merging bioactivity predictions from cell morphology and chemical fingerprint models using similarity to training data. J Cheminform 2023; 15:56. [PMID: 37268960 DOI: 10.1186/s13321-023-00723-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Accepted: 04/20/2023] [Indexed: 06/04/2023] Open
Abstract
The applicability domain of machine learning models trained on structural fingerprints for the prediction of biological endpoints is often limited by the lack of diversity of chemical space of the training data. In this work, we developed similarity-based merger models which combined the outputs of individual models trained on cell morphology (based on Cell Painting) and chemical structure (based on chemical fingerprints) and the structural and morphological similarities of the compounds in the test dataset to compounds in the training dataset. We applied these similarity-based merger models using logistic regression models on the predictions and similarities as features and predicted assay hit calls of 177 assays from ChEMBL, PubChem and the Broad Institute (where the required Cell Painting annotations were available). We found that the similarity-based merger models outperformed other models with an additional 20% assays (79 out of 177 assays) with an AUC > 0.70 compared with 65 out of 177 assays using structural models and 50 out of 177 assays using Cell Painting models. Our results demonstrated that similarity-based merger models combining structure and cell morphology models can more accurately predict a wide range of biological assay outcomes and further expanded the applicability domain by better extrapolating to new structural and morphology spaces.
Collapse
Affiliation(s)
- Srijit Seal
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Hongbin Yang
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Maria-Anna Trapotsi
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Satvik Singh
- Department of Applied Mathematics and Theoretical Physics (DAMTP), University of Cambridge, Cambridge, UK
| | - Jordi Carreras-Puigvert
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Ola Spjuth
- Department of Pharmaceutical Biosciences and Science for Life Laboratory, Uppsala University, Uppsala, Sweden.
| | - Andreas Bender
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
28
|
Pruteanu LL, Bender A. Using Transcriptomics and Cell Morphology Data in Drug Discovery: The Long Road to Practice. ACS Med Chem Lett 2023; 14:386-395. [PMID: 37077392 PMCID: PMC10107910 DOI: 10.1021/acsmedchemlett.3c00015] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2023] [Accepted: 03/10/2023] [Indexed: 04/21/2023] Open
Abstract
Gene expression and cell morphology data are high-dimensional biological readouts of much recent interest for drug discovery. They are able to describe biological systems in different states (e.g., healthy and diseased), as well as biological systems before and after compound treatment, and they are hence useful for matching both spaces (e.g., for drug repurposing) as well as for characterizing compounds with respect to efficacy and safety endpoints. This Microperspective describes recent advances in this direction with a focus on applied drug discovery and drug repurposing, as well as outlining what else is needed to advance further, with a particular focus on better understanding the applicability domain of readouts and their relevance for decision making, which is currently often still unclear.
Collapse
Affiliation(s)
- Lavinia-Lorena Pruteanu
- Department
of Chemistry and Biology, North University
Center at Baia Mare, Technical University of Cluj-Napoca, Victoriei 76, 430122 Baia Mare, Romania
- Research
Center for Functional Genomics, Biomedicine, and Translational Medicine, “Iuliu Haţieganu” University
of Medicine and Pharmacy, 400337 Cluj-Napoca, Romania
| | - Andreas Bender
- Centre
for Molecular Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
29
|
Nyffeler J, Willis C, Harris FR, Foster MJ, Chambers B, Culbreth M, Brockway RE, Davidson-Fritz S, Dawson D, Shah I, Friedman KP, Chang D, Everett LJ, Wambaugh JF, Patlewicz G, Harrill JA. Application of cell painting for chemical hazard evaluation in support of screening-level chemical assessments. Toxicol Appl Pharmacol 2023; 468:116513. [PMID: 37044265 DOI: 10.1016/j.taap.2023.116513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 04/03/2023] [Accepted: 04/08/2023] [Indexed: 04/14/2023]
Abstract
'Cell Painting' is an imaging-based high-throughput phenotypic profiling (HTPP) method in which cultured cells are fluorescently labeled to visualize subcellular structures (i.e., nucleus, nucleoli, endoplasmic reticulum, cytoskeleton, Golgi apparatus / plasma membrane and mitochondria) and to quantify morphological changes in response to chemicals or other perturbagens. HTPP is a high-throughput and cost-effective bioactivity screening method that detects effects associated with many different molecular mechanisms in an untargeted manner, enabling rapid in vitro hazard assessment for thousands of chemicals. Here, 1201 chemicals from the ToxCast library were screened in concentration-response up to ~100 μM in human U-2 OS cells using HTPP. A phenotype altering concentration (PAC) was estimated for chemicals active in the tested range. PACs tended to be higher than lower bound potency values estimated from a broad collection of targeted high-throughput assays, but lower than the threshold for cytotoxicity. In vitro to in vivo extrapolation (IVIVE) was used to estimate administered equivalent doses (AEDs) based on PACs for comparison to human exposure predictions. AEDs for 18/412 chemicals overlapped with predicted human exposures. Phenotypic profile information was also leveraged to identify putative mechanisms of action and group chemicals. Of 58 known nuclear receptor modulators, only glucocorticoids and retinoids produced characteristic profiles; and both receptor types are expressed in U-2 OS cells. Thirteen chemicals with profile similarity to glucocorticoids were tested in a secondary screen and one chemical, pyrene, was confirmed by an orthogonal gene expression assay as a novel putative GR modulating chemical. Most active chemicals demonstrated profiles not associated with a known mechanism-of-action. However, many structurally related chemicals produced similar profiles, with exceptions such as diniconazole, whose profile differed from other active conazoles. Overall, the present study demonstrates how HTPP can be applied in screening-level chemical assessments through a series of examples and brief case studies.
Collapse
Affiliation(s)
- Jo Nyffeler
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America; Oak Ridge Institute for Science and Education (ORISE) Postdoctoral Fellow, Oak Ridge, TN 37831, United States of America
| | - Clinton Willis
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Felix R Harris
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America; Oak Ridge Associated Universities (ORAU) National Student Services Contractor, Oak Ridge, TN 37831, United States of America
| | - M J Foster
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America; Oak Ridge Associated Universities (ORAU) National Student Services Contractor, Oak Ridge, TN 37831, United States of America
| | - Bryant Chambers
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Megan Culbreth
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Richard E Brockway
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America; Oak Ridge Associated Universities (ORAU) National Student Services Contractor, Oak Ridge, TN 37831, United States of America
| | - Sarah Davidson-Fritz
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Daniel Dawson
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Imran Shah
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Katie Paul Friedman
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Dan Chang
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Logan J Everett
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - John F Wambaugh
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Grace Patlewicz
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America
| | - Joshua A Harrill
- Center for Computational Toxicology & Exposure, Office of Research and Development, US Environmental Protection Agency, Durham, NC 27711, United States of America.
| |
Collapse
|
30
|
Moshkov N, Becker T, Yang K, Horvath P, Dancik V, Wagner BK, Clemons PA, Singh S, Carpenter AE, Caicedo JC. Predicting compound activity from phenotypic profiles and chemical structures. Nat Commun 2023; 14:1967. [PMID: 37031208 PMCID: PMC10082762 DOI: 10.1038/s41467-023-37570-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Accepted: 03/23/2023] [Indexed: 04/10/2023] Open
Abstract
Predicting assay results for compounds virtually using chemical structures and phenotypic profiles has the potential to reduce the time and resources of screens for drug discovery. Here, we evaluate the relative strength of three high-throughput data sources-chemical structures, imaging (Cell Painting), and gene-expression profiles (L1000)-to predict compound bioactivity using a historical collection of 16,170 compounds tested in 270 assays for a total of 585,439 readouts. All three data modalities can predict compound activity for 6-10% of assays, and in combination they predict 21% of assays with high accuracy, which is a 2 to 3 times higher success rate than using a single modality alone. In practice, the accuracy of predictors could be lower and still be useful, increasing the assays that can be predicted from 37% with chemical structures alone up to 64% when combined with phenotypic data. Our study shows that unbiased phenotypic profiling can be leveraged to enhance compound bioactivity prediction to accelerate the early stages of the drug-discovery process.
Collapse
Affiliation(s)
- Nikita Moshkov
- Broad Institute of MIT and Harvard, Cambridge, USA
- Biological Research Centre, Szeged, Hungary
| | - Tim Becker
- Broad Institute of MIT and Harvard, Cambridge, USA
| | | | | | - Vlado Dancik
- Broad Institute of MIT and Harvard, Cambridge, USA
| | | | | | | | | | | |
Collapse
|
31
|
Cerisier N, Dafniet B, Badel A, Taboureau O. Linking chemicals, genes and morphological perturbations to diseases. Toxicol Appl Pharmacol 2023; 461:116407. [PMID: 36736439 DOI: 10.1016/j.taap.2023.116407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 01/13/2023] [Accepted: 01/28/2023] [Indexed: 02/05/2023]
Abstract
The progress in image-based high-content screening technology has facilitated high-throughput phenotypic profiling notably the quantification of cell morphology perturbation by chemicals. However, understanding the mechanism of action of a chemical and linking it to cell morphology and phenotypes remains a challenge in drug discovery. In this study, we intended to integrate molecules that induced transcriptomic perturbations and cellular morphological changes into a biological network in order to assess chemical-phenotypic relationships in humans. Such a network was enriched with existing disease information to suggest molecular and cellular profiles leading to phenotypes. Two datasets were used for this study. Firstly, we used the "Cell Painting morphological profiling assay" dataset, composed of 30,000 compounds tested on human osteosarcoma cells (named U2OS). Secondly, we used the "L1000 mRNA profiling assay" dataset, a collection of transcriptional expression data from cultured human cells treated with approximately 20,000 bioactive small molecules from the Library of Integrated Network-based Cellular Signatures (LINCS). Furthermore, pathways, gene ontology terms and disease enrichments were performed on the transcriptomics data. Overall, our study makes it possible to develop a biological network combining chemical-gene-pathway-morphological perturbation and disease relationships. It contains an ensemble of 9989 chemicals, 732 significant morphological features and 12,328 genes. Through diverse examples, we demonstrated that some drugs shared similar genes, pathways and morphological profiles that, taken together, could help in deciphering chemical-phenotype observations.
Collapse
Affiliation(s)
- Natacha Cerisier
- Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75006 Paris, France
| | - Bryan Dafniet
- Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75006 Paris, France
| | - Anne Badel
- Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75006 Paris, France
| | - Olivier Taboureau
- Université Paris Cité, INSERM U1133, CNRS UMR 8251, 75006 Paris, France.
| |
Collapse
|
32
|
Using chemical and biological data to predict drug toxicity. SLAS DISCOVERY : ADVANCING LIFE SCIENCES R & D 2023; 28:53-64. [PMID: 36639032 DOI: 10.1016/j.slasd.2022.12.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/19/2022] [Accepted: 12/31/2022] [Indexed: 01/12/2023]
Abstract
Various sources of information can be used to better understand and predict compound activity and safety-related endpoints, including biological data such as gene expression and cell morphology. In this review, we first introduce types of chemical, in vitro and in vivo information that can be used to describe compounds and adverse effects. We then explore how compound descriptors based on chemical structure or biological perturbation response can be used to predict safety-related endpoints, and how especially biological data can help us to better understand adverse effects mechanistically. Overall, the described applications demonstrate how large-scale biological information presents new opportunities to anticipate and understand the biological effects of compounds, and how this can support predictive toxicology and drug discovery projects.
Collapse
|