1
|
Lo MCK, Siu DMD, Lee KCM, Wong JSJ, Yeung MCF, Hsin MKY, Ho JCM, Tsia KK. Information-Distilled Generative Label-Free Morphological Profiling Encodes Cellular Heterogeneity. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2307591. [PMID: 38864546 PMCID: PMC11304271 DOI: 10.1002/advs.202307591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 05/17/2024] [Indexed: 06/13/2024]
Abstract
Image-based cytometry faces challenges due to technical variations arising from different experimental batches and conditions, such as differences in instrument configurations or image acquisition protocols, impeding genuine biological interpretation of cell morphology. Existing solutions, often necessitating extensive pre-existing data knowledge or control samples across batches, have proved limited, especially with complex cell image data. To overcome this, "Cyto-Morphology Adversarial Distillation" (CytoMAD), a self-supervised multi-task learning strategy that distills biologically relevant cellular morphological information from batch variations, is introduced to enable integrated analysis across multiple data batches without complex data assumptions or extensive manual annotation. Unique to CytoMAD is its "morphology distillation", symbiotically paired with deep-learning image-contrast translation-offering additional interpretable insights into label-free cell morphology. The versatile efficacy of CytoMAD is demonstrated in augmenting the power of biophysical imaging cytometry. It allows integrated label-free classification of human lung cancer cell types and accurately recapitulates their progressive drug responses, even when trained without the drug concentration information. CytoMAD also allows joint analysis of tumor biophysical cellular heterogeneity, linked to epithelial-mesenchymal plasticity, that standard fluorescence markers overlook. CytoMAD can substantiate the wide adoption of biophysical cytometry for cost-effective diagnosis and screening.
Collapse
Affiliation(s)
- Michelle C. K. Lo
- Department of Electrical and Electronic EngineeringThe University of Hong KongHong Kong000000Hong Kong
- Advanced Biomedical Instrumentation CentreHong Kong Science Park, New TerritoriesHong Kong000000Hong Kong
| | - Dickson M. D. Siu
- Department of Electrical and Electronic EngineeringThe University of Hong KongHong Kong000000Hong Kong
- Advanced Biomedical Instrumentation CentreHong Kong Science Park, New TerritoriesHong Kong000000Hong Kong
| | - Kelvin C. M. Lee
- Department of Electrical and Electronic EngineeringThe University of Hong KongHong Kong000000Hong Kong
- Advanced Biomedical Instrumentation CentreHong Kong Science Park, New TerritoriesHong Kong000000Hong Kong
| | - Justin S. J. Wong
- Conzeb LimitedHong Kong Science Park, New TerritoriesHong Kong000000Hong Kong
| | - Maximus C. F. Yeung
- Department of Pathology, Li Ka Shing Faculty of MedicineThe University of Hong KongPokfulam RoadHong Kong000000Hong Kong
| | - Michael K. Y. Hsin
- Department of Surgery, Li Ka Shing Faculty of MedicineThe University of Hong KongPokfulam RoadHong Kong000000Hong Kong
| | - James C. M. Ho
- Department of Medicine, Li Ka Shing Faculty of MedicineThe University of Hong KongPokfulam RoadHong Kong000000Hong Kong
| | - Kevin K. Tsia
- Department of Electrical and Electronic EngineeringThe University of Hong KongHong Kong000000Hong Kong
- Advanced Biomedical Instrumentation CentreHong Kong Science Park, New TerritoriesHong Kong000000Hong Kong
| |
Collapse
|
2
|
Liu X, Shi L, Zhao Z, Shu J, Min W. VIBRANT: spectral profiling for single-cell drug responses. Nat Methods 2024; 21:501-511. [PMID: 38374266 PMCID: PMC11214684 DOI: 10.1038/s41592-024-02185-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Accepted: 01/16/2024] [Indexed: 02/21/2024]
Abstract
High-content cell profiling has proven invaluable for single-cell phenotyping in response to chemical perturbations. However, methods with improved throughput, information content and affordability are still needed. We present a new high-content spectral profiling method named vibrational painting (VIBRANT), integrating mid-infrared vibrational imaging, multiplexed vibrational probes and an optimized data analysis pipeline for measuring single-cell drug responses. Three infrared-active vibrational probes were designed to measure distinct essential metabolic activities in human cancer cells. More than 20,000 single-cell drug responses were collected, corresponding to 23 drug treatments. The resulting spectral profile is highly sensitive to phenotypic changes under drug perturbation. Using this property, we built a machine learning classifier to accurately predict drug mechanism of action at single-cell level with minimal batch effects. We further designed an algorithm to discover drug candidates with new mechanisms of action and evaluate drug combinations. Overall, VIBRANT has demonstrated great potential across multiple areas of phenotypic screening.
Collapse
Affiliation(s)
- Xinwen Liu
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Lixue Shi
- Department of Chemistry, Columbia University, New York, NY, USA
- Shanghai Xuhui Central Hospital, Zhongshan-Xuhui Hospital, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism, Institutes of Biomedical Sciences, Shanghai Medical College, Fudan University, Shanghai, China
| | - Zhilun Zhao
- Department of Chemistry, Columbia University, New York, NY, USA
| | - Jian Shu
- Cutaneous Biology Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Wei Min
- Department of Chemistry, Columbia University, New York, NY, USA.
- Department of Biomedical Engineering, Columbia University, New York, NY, USA.
| |
Collapse
|
3
|
Moshkov N, Bornholdt M, Benoit S, Smith M, McQuin C, Goodman A, Senft RA, Han Y, Babadi M, Horvath P, Cimini BA, Carpenter AE, Singh S, Caicedo JC. Learning representations for image-based profiling of perturbations. Nat Commun 2024; 15:1594. [PMID: 38383513 PMCID: PMC10881515 DOI: 10.1038/s41467-024-45999-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 02/07/2024] [Indexed: 02/23/2024] Open
Abstract
Measuring the phenotypic effect of treatments on cells through imaging assays is an efficient and powerful way of studying cell biology, and requires computational methods for transforming images into quantitative data. Here, we present an improved strategy for learning representations of treatment effects from high-throughput imaging, following a causal interpretation. We use weakly supervised learning for modeling associations between images and treatments, and show that it encodes both confounding factors and phenotypic features in the learned representation. To facilitate their separation, we constructed a large training dataset with images from five different studies to maximize experimental diversity, following insights from our causal analysis. Training a model with this dataset successfully improves downstream performance, and produces a reusable convolutional network for image-based profiling, which we call Cell Painting CNN. We evaluated our strategy on three publicly available Cell Painting datasets, and observed that the Cell Painting CNN improves performance in downstream analysis up to 30% with respect to classical features, while also being more computationally efficient.
Collapse
Affiliation(s)
- Nikita Moshkov
- HUN-REN Biological Research Centre, 62 Temesvári krt, Szeged, 6726, Hungary
| | - Michael Bornholdt
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Santiago Benoit
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
- Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA, 15213, USA
| | - Matthew Smith
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
- Harvard College, 86 Brattle Street Cambridge, Cambridge, MA, 02138, USA
| | - Claire McQuin
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Allen Goodman
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Rebecca A Senft
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Yu Han
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Mehrtash Babadi
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Peter Horvath
- HUN-REN Biological Research Centre, 62 Temesvári krt, Szeged, 6726, Hungary
| | - Beth A Cimini
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Anne E Carpenter
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Shantanu Singh
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA
| | - Juan C Caicedo
- Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA, 02141, USA.
- Morgridge Institute for Research, 330 N Orchard St, Madison, WI, 53715, USA.
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, 1300 University Ave, Madison, WI, 53706, USA.
| |
Collapse
|
4
|
Yang Y, Wang K, Lu Z, Wang T, Wang X. Cytomulate: accurate and efficient simulation of CyTOF data. Genome Biol 2023; 24:262. [PMID: 37974276 PMCID: PMC10652542 DOI: 10.1186/s13059-023-03099-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 10/24/2023] [Indexed: 11/19/2023] Open
Abstract
Recently, many analysis tools have been devised to offer insights into data generated via cytometry by time-of-flight (CyTOF). However, objective evaluations of these methods remain absent as most evaluations are conducted against real data where the ground truth is generally unknown. In this paper, we develop Cytomulate, a reproducible and accurate simulation algorithm of CyTOF data, which could serve as a foundation for future method development and evaluation. We demonstrate that Cytomulate can capture various characteristics of CyTOF data and is superior in learning overall data distributions than single-cell RNA-seq-oriented methods such as scDesign2, Splatter, and generative models like LAMBDA.
Collapse
Affiliation(s)
- Yuqiu Yang
- Department of Statistics and Data Science, Southern Methodist University, Dallas, TX, 75275, USA
- Quantitative Biomedical Research Center, Peter O'Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Kaiwen Wang
- Department of Statistics and Data Science, Southern Methodist University, Dallas, TX, 75275, USA
| | - Zeyu Lu
- Department of Statistics and Data Science, Southern Methodist University, Dallas, TX, 75275, USA
- Quantitative Biomedical Research Center, Peter O'Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Tao Wang
- Quantitative Biomedical Research Center, Peter O'Donnell Jr. School of Public Health, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
- Center for the Genetics of Host Defense, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
| | - Xinlei Wang
- Department of Statistics and Data Science, Southern Methodist University, Dallas, TX, 75275, USA.
- Department of Mathematics, University of Texas at Arlington, Arlington, 76019, USA.
- Center for Data Science Research and Education, College of Science, University of Texas at Arlington, Arlington, 76019, USA.
| |
Collapse
|
5
|
Janssens R, Zhang X, Kauffmann A, de Weck A, Durand EY. Fully unsupervised deep mode of action learning for phenotyping high-content cellular images. Bioinformatics 2021; 37:4548-4555. [PMID: 34240099 DOI: 10.1093/bioinformatics/btab497] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2020] [Revised: 06/15/2021] [Accepted: 07/02/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The identification and discovery of phenotypes from high content screening (HCS) images is a challenging task. Earlier works use image analysis pipelines to extract biological features, supervised training methods or generate features with neural networks pretrained on non-cellular images. We introduce a novel unsupervised deep learning algorithm to cluster cellular images with similar Mode-of-Action (MOA) together using only the images' pixel intensity values as input. It corrects for batch effect during training. Importantly, our method does not require the extraction of cell candidates and works from the entire images directly. RESULTS The method achieves competitive results on the labelled subset of the BBBC021 dataset with an accuracy of 97.09% for correctly classifying the MOA by nearest neighbors matching. Importantly, we can train our approach on unannotated datasets. Therefore, our method can discover novel MOAs and annotate unlabelled compounds. The ability to train end-to-end on the full resolution images makes our method easy to apply and allows it to further distinguish treatments by their effect on proliferation. AVAILABILITY Our code is available at https://github.com/Novartis/UMM-Discovery. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rens Janssens
- Novartis Institutes for BioMedical Research Inc, Basel, Switzerland
| | - Xian Zhang
- Novartis Institutes for BioMedical Research Inc, Basel, Switzerland
| | - Audrey Kauffmann
- Novartis Institutes for BioMedical Research Inc, Basel, Switzerland
| | - Antoine de Weck
- Novartis Institutes for BioMedical Research Inc, Basel, Switzerland
| | - Eric Y Durand
- Novartis Institutes for BioMedical Research Inc, Basel, Switzerland
| |
Collapse
|
6
|
Pratapa A, Doron M, Caicedo JC. Image-based cell phenotyping with deep learning. Curr Opin Chem Biol 2021; 65:9-17. [PMID: 34023800 DOI: 10.1016/j.cbpa.2021.04.001] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 04/10/2021] [Indexed: 12/25/2022]
Abstract
A cell's phenotype is the culmination of several cellular processes through a complex network of molecular interactions that ultimately result in a unique morphological signature. Visual cell phenotyping is the characterization and quantification of these observable cellular traits in images. Recently, cellular phenotyping has undergone a massive overhaul in terms of scale, resolution, and throughput, which is attributable to advances across electronic, optical, and chemical technologies for imaging cells. Coupled with the rapid acceleration of deep learning-based computational tools, these advances have opened up new avenues for innovation across a wide variety of high-throughput cell biology applications. Here, we review applications wherein deep learning is powering the recognition, profiling, and prediction of visual phenotypes to answer important biological questions. As the complexity and scale of imaging assays increase, deep learning offers computational solutions to elucidate the details of previously unexplored cellular phenotypes.
Collapse
|