1
|
Gross B, Dauvin A, Cabeli V, Kmetzsch V, El Khoury J, Dissez G, Ouardini K, Grouard S, Davi A, Loeb R, Esposito C, Hulot L, Ghermi R, Blum M, Darhi Y, Durand EY, Romagnoni A. Robust evaluation of deep learning-based representation methods for survival and gene essentiality prediction on bulk RNA-seq data. Sci Rep 2024; 14:17064. [PMID: 39048590 PMCID: PMC11269749 DOI: 10.1038/s41598-024-67023-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Accepted: 07/08/2024] [Indexed: 07/27/2024] Open
Abstract
Deep learning (DL) has shown potential to provide powerful representations of bulk RNA-seq data in cancer research. However, there is no consensus regarding the impact of design choices of DL approaches on the performance of the learned representation, including the model architecture, the training methodology and the various hyperparameters. To address this problem, we evaluate the performance of various design choices of DL representation learning methods using TCGA and DepMap pan-cancer datasets and assess their predictive power for survival and gene essentiality predictions. We demonstrate that baseline methods achieve comparable or superior performance compared to more complex models on survival predictions tasks. DL representation methods, however, are the most efficient to predict the gene essentiality of cell lines. We show that auto-encoders (AE) are consistently improved by techniques such as masking and multi-head training. Our results suggest that the impact of DL representations and of pretraining are highly task- and architecture-dependent, highlighting the need for adopting rigorous evaluation guidelines. These guidelines for robust evaluation are implemented in a pipeline made available to the research community.
Collapse
|
2
|
Ly ST, Lin B, Vo HQ, Maric D, Roysam B, Nguyen HV. Cellular data extraction from multiplexed brain imaging data using self-supervised Dual-loss Adaptive Masked Autoencoder. Artif Intell Med 2024; 151:102828. [PMID: 38564879 DOI: 10.1016/j.artmed.2024.102828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 02/14/2024] [Accepted: 02/26/2024] [Indexed: 04/04/2024]
Abstract
Reliable large-scale cell detection and segmentation is the fundamental first step to understanding biological processes in the brain. The ability to phenotype cells at scale can accelerate preclinical drug evaluation and system-level brain histology studies. The impressive advances in deep learning offer a practical solution to cell image detection and segmentation. Unfortunately, categorizing cells and delineating their boundaries for training deep networks is an expensive process that requires skilled biologists. This paper presents a novel self-supervised Dual-Loss Adaptive Masked Autoencoder (DAMA) for learning rich features from multiplexed immunofluorescence brain images. DAMA's objective function minimizes the conditional entropy in pixel-level reconstruction and feature-level regression. Unlike existing self-supervised learning methods based on a random image masking strategy, DAMA employs a novel adaptive mask sampling strategy to maximize mutual information and effectively learn brain cell data. To the best of our knowledge, this is the first effort to develop a self-supervised learning method for multiplexed immunofluorescence brain images. Our extensive experiments demonstrate that DAMA features enable superior cell detection, segmentation, and classification performance without requiring many annotations. In addition, to examine the generalizability of DAMA, we also experimented on TissueNet, a multiplexed imaging dataset comprised of two-channel fluorescence images from six distinct tissue types, captured using six different imaging platforms. Our code is publicly available at https://github.com/hula-ai/DAMA.
Collapse
Affiliation(s)
- Son T Ly
- Department of Electrical and Computer Engineering, University of Houston, TX 77204, USA.
| | - Bai Lin
- Department of Electrical and Computer Engineering, University of Houston, TX 77204, USA
| | - Hung Q Vo
- Department of Electrical and Computer Engineering, University of Houston, TX 77204, USA
| | - Dragan Maric
- National Institute of Neurological Disorders and Stroke, Bethesda, MD 20892, USA
| | - Badrinath Roysam
- Department of Electrical and Computer Engineering, University of Houston, TX 77204, USA
| | - Hien V Nguyen
- Department of Electrical and Computer Engineering, University of Houston, TX 77204, USA.
| |
Collapse
|
3
|
Shen X, Li X. Deep-learning methods for unveiling large-scale single-cell transcriptomes. Cancer Biol Med 2024; 20:j.issn.2095-3941.2023.0436. [PMID: 38318925 PMCID: PMC10845931 DOI: 10.20892/j.issn.2095-3941.2023.0436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 12/20/2023] [Indexed: 02/07/2024] Open
Affiliation(s)
- Xilin Shen
- Tianjin Cancer Institute, Key Laboratory of Cancer Prevention and Therapy, Tianjin’s Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute & Hospital, Tianjin Medical University, Tianjin 300060, China
| | - Xiangchun Li
- Tianjin Cancer Institute, Key Laboratory of Cancer Prevention and Therapy, Tianjin’s Clinical Research Center for Cancer, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute & Hospital, Tianjin Medical University, Tianjin 300060, China
| |
Collapse
|
4
|
A single-cell analysis reveals tumor heterogeneity and immune environment of acral melanoma. Nat Commun 2022; 13:7250. [PMID: 36433984 PMCID: PMC9700682 DOI: 10.1038/s41467-022-34877-3] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2021] [Accepted: 11/10/2022] [Indexed: 11/26/2022] Open
Abstract
Acral melanoma is a dismal subtype of melanoma occurring in glabrous acral skin, and has a higher incidence in East Asians. We perform single-cell RNA sequencing for 63,394 cells obtained from 5 acral and 3 cutaneous melanoma samples to investigate tumor heterogeneity and immune environment. We define 5 orthogonal functional cell clusters that are involved in TGF-beta signaling, Type I interferon, Wnt signaling, Cell cycle, and Cholesterol efflux signaling. Signatures of enriched TGF-beta, Type I interferon, and cholesterol efflux signaling are significantly associated with good prognosis of melanoma. Compared with cutaneous melanoma, acral melanoma samples have significantly severe immunosuppressive state including depletion of cytotoxic CD8+ T cells, enrichment of Treg cells, and exhausted CD8+ T cells. PD1 and TIM-3 have higher expression in the exhaustive CD8+ T cells of acral melanoma. Key findings are verified in two independent validation sets. This study contributes to our better understanding of acral melanoma.
Collapse
|
5
|
Li Y, Wu X, Yang P, Jiang G, Luo Y. Machine Learning for Lung Cancer Diagnosis, Treatment, and Prognosis. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:850-866. [PMID: 36462630 PMCID: PMC10025752 DOI: 10.1016/j.gpb.2022.11.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 10/03/2022] [Accepted: 11/17/2022] [Indexed: 12/03/2022]
Abstract
The recent development of imaging and sequencing technologies enables systematic advances in the clinical study of lung cancer. Meanwhile, the human mind is limited in effectively handling and fully utilizing the accumulation of such enormous amounts of data. Machine learning-based approaches play a critical role in integrating and analyzing these large and complex datasets, which have extensively characterized lung cancer through the use of different perspectives from these accrued data. In this review, we provide an overview of machine learning-based approaches that strengthen the varying aspects of lung cancer diagnosis and therapy, including early detection, auxiliary diagnosis, prognosis prediction, and immunotherapy practice. Moreover, we highlight the challenges and opportunities for future applications of machine learning in lung cancer.
Collapse
Affiliation(s)
- Yawei Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Xin Wu
- Department of Medicine, University of Illinois at Chicago, Chicago, IL 60612, USA
| | - Ping Yang
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN 55905 / Scottsdale, AZ 85259, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN 55905, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA.
| |
Collapse
|