1
|
Zhao J, Wang Y, Feng C, Yin M, Gao Y, Wei L, Song C, Ai B, Wang Q, Zhang J, Zhu J, Li C. SCInter: A comprehensive single-cell transcriptome integration database for human and mouse. Comput Struct Biotechnol J 2024; 23:77-86. [PMID: 38125297 PMCID: PMC10731004 DOI: 10.1016/j.csbj.2023.11.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 11/12/2023] [Accepted: 11/13/2023] [Indexed: 12/23/2023] Open
Abstract
Single-cell RNA sequencing (scRNA-seq), which profiles gene expression at the cellular level, has effectively explored cell heterogeneity and reconstructed developmental trajectories. With the increasing research on diseases and biological processes, scRNA-seq datasets are accumulating rapidly, highlighting the urgent need for collecting and processing these data to support comprehensive and effective annotation and analysis. Here, we have developed a comprehensive Single-Cell transcriptome integration database for human and mouse (SCInter, https://bio.liclab.net/SCInter/index.php), which aims to provide a manually curated database that supports the provision of gene expression profiles across various cell types at the sample level. The current version of SCInter includes 115 integrated datasets and 1016 samples, covering nearly 150 tissues/cell lines. It contains 8016,646 cell markers in 457 identified cell types. SCInter enabled comprehensive analysis of cataloged single-cell data encompassing quality control (QC), clustering, cell markers, multi-method cell type automatic annotation, predicting cell differentiation trajectories and so on. At the same time, SCInter provided a user-friendly interface to query, browse, analyze and visualize each integrated dataset and single cell sample, along with comprehensive QC reports and processing results. It will facilitate the identification of cell type in different cell subpopulations and explore developmental trajectories, enhancing the study of cell heterogeneity in the fields of immunology and oncology.
Collapse
Affiliation(s)
- Jun Zhao
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Yuezhu Wang
- School of Artificial Intelligence, Jilin University, Changchun 130012, China
| | - Chenchen Feng
- School of Computer, University of South China, Hengyang, Hunan 421001, China
| | - Mingxue Yin
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Yu Gao
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Ling Wei
- Institute of Medical Innovation and Research, Peking University Third Hospital, Beijing 100191, China
- Cancer Center, Peking University Third Hospital, Beijing 100191, China
| | - Chao Song
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan, 421001, China
| | - Bo Ai
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Qiuyu Wang
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Jian Zhang
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Jiang Zhu
- School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing, 163319, China
| | - Chunquan Li
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan 421001, China
- Hunan Provincial Key Laboratory of Multi-omics And Artificial Intelligence of Cardiovascular Diseases, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Cell Biology and Genetics, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| |
Collapse
|
2
|
Casey C, Fullard JF, Sleator RD. Unravelling the genetic basis of Schizophrenia. Gene 2024; 902:148198. [PMID: 38266791 DOI: 10.1016/j.gene.2024.148198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/07/2023] [Accepted: 01/19/2024] [Indexed: 01/26/2024]
Abstract
Neuronal development is a highly regulated mechanism that is central to organismal function in animals. In humans, disruptions to this process can lead to a range of neurodevelopmental phenotypes, including Schizophrenia (SCZ). SCZ has a significant genetic component, whereby an individual with an SCZ affected family member is eight times more likely to develop the disease than someone with no family history of SCZ. By examining a combination of genomic, transcriptomic and epigenomic datasets, large-scale 'omics' studies aim to delineate the relationship between genetic variation and abnormal cellular activity in the SCZ brain. Herein, we provide a brief overview of some of the key omics methods currently being used in SCZ research, including RNA-seq, the assay for transposase-accessible chromatin with high-throughput sequencing (ATAC-seq) and high-throughput chromosome conformation capture (3C) approaches (e.g., Hi-C), as well as single-cell/nuclei iterations of these methods. We also discuss how these techniques are being employed to further our understanding of the genetic basis of SCZ, and to identify associated molecular pathways, biomarkers, and candidate drug targets.
Collapse
Affiliation(s)
- Clara Casey
- Department of Biological Sciences, Munster Technological University, Bishopstown, Cork, Ireland; Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States; Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States
| | - John F Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States; Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States; Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, United States
| | - Roy D Sleator
- Department of Biological Sciences, Munster Technological University, Bishopstown, Cork, Ireland.
| |
Collapse
|
3
|
Song X, Wei J, Li Y, Zhu W, Cai Z, Li K, Wei J, Lu J, Pan W, Li M. An integrative pan-cancer analysis of the molecular characteristics of dietary restriction in tumour microenvironment. EBioMedicine 2024; 102:105078. [PMID: 38507875 PMCID: PMC10965464 DOI: 10.1016/j.ebiom.2024.105078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 02/28/2024] [Accepted: 03/07/2024] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND Dietary restriction (DR), a general term for dieting, has been demonstrated as an effective intervention in reducing the occurrence of cancers. Molecular activities associated with DR are crucial in mediating its anti-cancer effects, yet a comprehensive exploration of the landscape of these activities at the pan-cancer level is still lacking. METHODS We proposed a computational approach for quantifying DR-related molecular activities and delineating the landscape of these activities across 33 cancer types and 30 normal tissues within 27,320 samples. We thoroughly examined the associations between DR-related molecular activities and various factors, including the tumour microenvironment, immunological phenotypes, genomic features, and clinical prognosis. Meanwhile, we identified two DR genes that show potential as prognostic predictors in hepatocellular carcinoma and verified them by immunohistochemical assays in 90 patients. FINDINGS We found that DR-related molecular activities showed a close association with tumour immunity and hold potential for predicting immunotherapy responses in various cancers. Importantly, a higher level of DR-related molecular activities is associated with improved overall survival and cancer-specific survival. FZD1 and G6PD are two DR genes that serve as biomarkers for predicting the prognosis of patients with hepatocellular carcinoma. INTERPRETATION This study presents a robust link between DR-related molecular activities and tumour immunity across multiple cancer types. Our research could open the path for further investigation of DR-related molecular processes in cancer treatment. FUNDING National Natural Science Foundation of China (Grant No. 82000628) and the Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine Foundation of Guangdong Province (Grant No. 2023LSYS001).
Collapse
Affiliation(s)
- Xiaoyi Song
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Jiaxing Wei
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Yang Li
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Wen Zhu
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Zhiyuan Cai
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Kunwei Li
- Department of Radiology, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Jingyue Wei
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Jieyu Lu
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Wanping Pan
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China
| | - Man Li
- Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Guangdong-Hong Kong-Macao University Joint Laboratory of Interventional Medicine, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Biobank, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China; Department of Information Technology and Data Center, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai, 519000, China.
| |
Collapse
|
4
|
Yi F, Cohen T, Zimmerman N, Dündar F, Zumbo P, Eltilib R, Brophy EJ, Arkin H, Feucht J, Gormally MV, Hackett CS, Kropp KN, Etxeberria I, Chandran SS, Park JH, Hsu KC, Sadelain M, Betel D, Klebanoff CA. CAR-engineered lymphocyte persistence is governed by a FAS ligand/FAS auto-regulatory circuit. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.26.582108. [PMID: 38464085 PMCID: PMC10925151 DOI: 10.1101/2024.02.26.582108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Chimeric antigen receptor (CAR)-engineered T and NK cells can cause durable remission of B-cell malignancies; however, limited persistence restrains the full potential of these therapies in many patients. The FAS ligand (FAS-L)/FAS pathway governs naturally-occurring lymphocyte homeostasis, yet knowledge of which cells express FAS-L in patients and whether these sources compromise CAR persistence remains incomplete. Here, we constructed a single-cell atlas of diverse cancer types to identify cellular subsets expressing FASLG, the gene encoding FAS-L. We discovered that FASLG is limited primarily to endogenous T cells, NK cells, and CAR-T cells while tumor and stromal cells express minimal FASLG. To establish whether CAR-T/NK cell survival is regulated through FAS-L, we performed competitive fitness assays using lymphocytes modified with or without a FAS dominant negative receptor (ΔFAS). Following adoptive transfer, ΔFAS-expressing CAR-T and CAR-NK cells became enriched across multiple tissues, a phenomenon that mechanistically was reverted through FASLG knockout. By contrast, FASLG was dispensable for CAR-mediated tumor killing. In multiple models, ΔFAS co-expression by CAR-T and CAR-NK enhanced antitumor efficacy compared with CAR cells alone. Together, these findings reveal that CAR-engineered lymphocyte persistence is governed by a FAS-L/FAS auto-regulatory circuit.
Collapse
Affiliation(s)
- Fei Yi
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Tal Cohen
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
- Department of Pediatric Hematology/Oncology, Weill Cornell Medical College, New York, NY, USA
| | - Natalie Zimmerman
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Friederike Dündar
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
- Applied Bioinformatics Core, Weill Cornell Medicine, New York, NY, USA
| | - Paul Zumbo
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
- Applied Bioinformatics Core, Weill Cornell Medicine, New York, NY, USA
| | - Razan Eltilib
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Erica J. Brophy
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Hannah Arkin
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Judith Feucht
- Center for Cell Engineering, MSKCC, New York, NY, USA
- Cluster of Excellence iFIT, University Children’s Hospital Tübingen, Tübingen, Germany
| | - Michael V. Gormally
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
- Cell Therapy Service, Department of Medicine, MSKCC, New York, NY, USA
| | | | - Korbinian N. Kropp
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Inaki Etxeberria
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Smita S. Chandran
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
| | - Jae H. Park
- Center for Cell Engineering, MSKCC, New York, NY, USA
- Cell Therapy Service, Department of Medicine, MSKCC, New York, NY, USA
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Katharine C. Hsu
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
- Center for Cell Engineering, MSKCC, New York, NY, USA
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
| | - Michel Sadelain
- Center for Cell Engineering, MSKCC, New York, NY, USA
- Department of Immunology, Sloan Kettering Institute, MSKCC, New York, NY, USA
| | - Doron Betel
- Applied Bioinformatics Core, Weill Cornell Medicine, New York, NY, USA
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
| | - Christopher A. Klebanoff
- Human Oncology and Pathogenesis Program (HOPP), Memorial Sloan Kettering Cancer Center (MSKCC), New York, NY, USA
- Center for Cell Engineering, MSKCC, New York, NY, USA
- Cell Therapy Service, Department of Medicine, MSKCC, New York, NY, USA
- Department of Medicine, Weill Cornell Medicine, New York, NY, USA
- Parker Institute for Cancer Immunotherapy, New York, NY, USA
| |
Collapse
|
5
|
Zhong L, Wang F, Liu D, Kuang W, Ji N, Li J, Zeng X, Li T, Dan H, Chen Q. Single-cell transcriptomics dissects premalignant progression in proliferative verrucous leukoplakia. Oral Dis 2024; 30:172-186. [PMID: 35950708 DOI: 10.1111/odi.14347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2022] [Revised: 07/19/2022] [Accepted: 08/05/2022] [Indexed: 11/28/2022]
Abstract
OBJECTIVE Proliferative verrucous leukoplakia (PVL) is characterized by a spectrum of clinicopathological features and a high risk of malignant transformation. In this study, we aimed to delineate the dynamic changes in molecular signature during PVL progression and identify the potential cell subtypes that play a key role in the premalignant evolution of PVL. METHODS We performed single-cell RNA sequencing on three biopsy samples from a large PVL lesion. These samples exhibited a histopathological continuum of PVL progression. RESULTS By analyzing the transcriptome profiles of 27,611 cells from these samples, we identified ten major cell lineages and revealed that cellular remodeling occurred during the progression of PVL lesions, including epithelial, stromal, and immune cells. Epithelial cells are shifted to tumorigenic states and secretory patterns at the premalignant stage. Immune cells showed growing immunosuppressive phenotypes during PVL progression. Remarkably, two novel cell subtypes INSR+ endothelial cells and ASPN+ fibroblasts, were discovered and may play vital roles in microenvironment remodeling, such as angiogenesis and stromal fibrosis, which are closely involved in malignant transformation. CONCLUSION Our work is the first to depict the cellular landscape of PVL and speculate that disease progression may be driven by functional remodeling of multiple cell subtypes.
Collapse
Affiliation(s)
- Liang Zhong
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Fei Wang
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Dan Liu
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Wenjing Kuang
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Ning Ji
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Jing Li
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Xin Zeng
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Taiwen Li
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Hongxia Dan
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| | - Qianming Chen
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Chinese Academy of Medical Sciences Research Unit of Oral Carcinogenesis and Management, West China Hospital of Stomatology, Sichuan University, Chengdu, China
| |
Collapse
|
6
|
Lu C, Wei Y, Abbas M, Agula H, Wang E, Meng Z, Zhang R. Application of Single-Cell Assay for Transposase-Accessible Chromatin with High Throughput Sequencing in Plant Science: Advances, Technical Challenges, and Prospects. Int J Mol Sci 2024; 25:1479. [PMID: 38338756 PMCID: PMC10855595 DOI: 10.3390/ijms25031479] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 01/16/2024] [Accepted: 01/23/2024] [Indexed: 02/12/2024] Open
Abstract
The Single-cell Assay for Transposase-Accessible Chromatin with high throughput sequencing (scATAC-seq) has gained increasing popularity in recent years, allowing for chromatin accessibility to be deciphered and gene regulatory networks (GRNs) to be inferred at single-cell resolution. This cutting-edge technology now enables the genome-wide profiling of chromatin accessibility at the cellular level and the capturing of cell-type-specific cis-regulatory elements (CREs) that are masked by cellular heterogeneity in bulk assays. Additionally, it can also facilitate the identification of rare and new cell types based on differences in chromatin accessibility and the charting of cellular developmental trajectories within lineage-related cell clusters. Due to technical challenges and limitations, the data generated from scATAC-seq exhibit unique features, often characterized by high sparsity and noise, even within the same cell type. To address these challenges, various bioinformatic tools have been developed. Furthermore, the application of scATAC-seq in plant science is still in its infancy, with most research focusing on root tissues and model plant species. In this review, we provide an overview of recent progress in scATAC-seq and its application across various fields. We first conduct scATAC-seq in plant science. Next, we highlight the current challenges of scATAC-seq in plant science and major strategies for cell type annotation. Finally, we outline several future directions to exploit scATAC-seq technologies to address critical challenges in plant science, ranging from plant ENCODE(The Encyclopedia of DNA Elements) project construction to GRN inference, to deepen our understanding of the roles of CREs in plant biology.
Collapse
Affiliation(s)
- Chao Lu
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (C.L.); (Y.W.)
- Key Laboratory of Herbage & Endemic Crop Biology, Ministry of Education, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Yunxiao Wei
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (C.L.); (Y.W.)
| | - Mubashir Abbas
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (C.L.); (Y.W.)
| | - Hasi Agula
- Key Laboratory of Herbage & Endemic Crop Biology, Ministry of Education, School of Life Sciences, Inner Mongolia University, Hohhot 010070, China
| | - Edwin Wang
- Cumming School of Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada
| | - Zhigang Meng
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (C.L.); (Y.W.)
| | - Rui Zhang
- Biotechnology Research Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (C.L.); (Y.W.)
| |
Collapse
|
7
|
Guo ZH, Wang YB, Wang S, Zhang Q, Huang DS. scCorrector: a robust method for integrating multi-study single-cell data. Brief Bioinform 2024; 25:bbad525. [PMID: 38271483 PMCID: PMC10810333 DOI: 10.1093/bib/bbad525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/12/2023] [Accepted: 12/19/2023] [Indexed: 01/27/2024] Open
Abstract
The advent of single-cell sequencing technologies has revolutionized cell biology studies. However, integrative analyses of diverse single-cell data face serious challenges, including technological noise, sample heterogeneity, and different modalities and species. To address these problems, we propose scCorrector, a variational autoencoder-based model that can integrate single-cell data from different studies and map them into a common space. Specifically, we designed a Study Specific Adaptive Normalization for each study in decoder to implement these features. scCorrector substantially achieves competitive and robust performance compared with state-of-the-art methods and brings novel insights under various circumstances (e.g. various batches, multi-omics, cross-species, and development stages). In addition, the integration of single-cell data and spatial data makes it possible to transfer information between different studies, which greatly expand the narrow range of genes covered by MERFISH technology. In summary, scCorrector can efficiently integrate multi-study single-cell datasets, thereby providing broad opportunities to tackle challenges emerging from noisy resources.
Collapse
Affiliation(s)
- Zhen-Hao Guo
- College of Electronics and Information Engineering, Tongji University, Shanghai 200000, China
| | - Yan-Bin Wang
- College of Computer Science and Technology, Zhejiang University 310027, China
| | - Siguo Wang
- Eastern Institute for Advanced Study, Eastern Institute of Technology, Tongxin Road No.568, Ningbo, Zhejiang 315201, China
| | - Qinhu Zhang
- Eastern Institute for Advanced Study, Eastern Institute of Technology, Tongxin Road No.568, Ningbo, Zhejiang 315201, China
| | - De-Shuang Huang
- Eastern Institute for Advanced Study, Eastern Institute of Technology, Tongxin Road No.568, Ningbo, Zhejiang 315201, China
| |
Collapse
|
8
|
Guo C, Yang X, Li L. Pyroptosis-Related Gene Signature Predicts Prognosis and Response to Immunotherapy and Medication in Pediatric and Young Adult Osteosarcoma Patients. J Inflamm Res 2024; 17:417-445. [PMID: 38269108 PMCID: PMC10807455 DOI: 10.2147/jir.s440425] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 12/21/2023] [Indexed: 01/26/2024] Open
Abstract
Purpose Pyroptosis, a new form of inflammatory programmed cell death, has recently gained attention. However, the impact of the expression levels of pyroptosis-related genes (PRGs) on the overall survival (OS) of osteosarcoma patients remains unclear. This study aims to investigate the impact of the expression levels of PRGs on the OS of pediatric and young adult patients with osteosarcoma. Patients and Methods Transcriptome matrix datasets of normal muscle or skeletal tissues from the Genotype-Tissue Expression (GTEx) project and osteosarcoma specimen the National Cancer Institute's (NCI) Therapeutically Applicable Research to Generate Effective Treatments (TARGET) database were used to identify pyroptosis-related genes (PRGs) associated with prognosis. The National Center for Biotechnology Information's (NCBI) GSE21257 dataset was employed to validate the predictive value of the pyroptosis-related signature (PRS). Additionally, reverse transcription polymerase chain reaction (RT-qPCR) experiment was performed in normal and osteosarcoma cell lines. Results The study identified 18 differentially expressed PRGs (DEPRGs) between normal muscle or skeletal tissues and tumor samples. Multiple machine learning techniques were used to select PRGs, resulting in the identification of four hub PRGs. A PRS-score was calculated for each sample based on the expression of these four hub PRGs, and samples were categorized into low and high PRS-score level groups. It was confirmed that metastatic status and PRS-score level are independent prognostic predictors. A nomogram model for predicting OS of osteosarcoma patients was constructed. Single-cell RNA-sequencing data display the expression patterns of the hub PRGs. RT-qPCR data results were found to be consistent with the differential expression analysis performed on TARGET and GTEx samples. Conclusion The study developed a novel pyroptosis-related gene signature that can stratify pediatric and young adult osteosarcoma patients into different risk groups, thus predicting their response to immunotherapy and chemotherapy.
Collapse
Affiliation(s)
- Chaofan Guo
- Department of Orthopedics, Shanxi Provincial People’s Hospital, Taiyuan, Shanxi Province, People’s Republic of China
- Department of Spine Surgery, Fifth Hospital of Shanxi Medical University, Taiyuan, Shanxi Province, People’s Republic of China
| | - Xin Yang
- Department of Neurosurgery, Chongqing Fourth People’s Hospital, Chongqing, People’s Republic of China
| | - Lijun Li
- Department of Orthopedics, Shanxi Provincial People’s Hospital, Taiyuan, Shanxi Province, People’s Republic of China
- Department of Spine Surgery, Fifth Hospital of Shanxi Medical University, Taiyuan, Shanxi Province, People’s Republic of China
| |
Collapse
|
9
|
Jiang Y, Hu Z, Lynch AW, Jiang J, Zhu A, Zhang Y, Xie Y, Li R, Zhou N, Meyer CA, Cejas P, Brown M, Long HW, Qiu X. scATAnno: Automated Cell Type Annotation for single-cell ATAC Sequencing Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.01.543296. [PMID: 37333088 PMCID: PMC10274707 DOI: 10.1101/2023.06.01.543296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
The recent advances in single-cell epigenomic techniques have created a growing demand for scATAC-seq analysis. One key task is to determine cell types based on epigenetic profiling. We introduce scATAnno, a workflow designed to automatically annotate scATAC-seq data using large-scale scATAC-seq reference atlases. This workflow can generate scATAC-seq reference atlases from publicly available datasets, and enable accurate cell type annotation by integrating query data with reference atlases, without the aid of scRNA-seq profiling. To enhance annotation accuracy, we have incorporated KNN-based and weighted distance-based uncertainty scores to effectively detect unknown cell populations within the query data. We showcase the utility of scATAnno across multiple datasets, including peripheral blood mononuclear cell (PBMC), basal cell carcinoma (BCC) and Triple Negative Breast Cancer (TNBC), and demonstrate that scATAnno accurately annotates cell types across conditions. Overall, scATAnno is a powerful tool for cell type annotation in scATAC-seq data and can aid in the interpretation of new scATAC-seq datasets in complex biological systems.
Collapse
|
10
|
Wang X, Duan M, Li J, Ma A, Xin G, Xu D, Li Z, Liu B, Ma Q. MarsGT: Multi-omics analysis for rare population inference using single-cell graph transformer. Nat Commun 2024; 15:338. [PMID: 38184630 PMCID: PMC10771517 DOI: 10.1038/s41467-023-44570-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 12/14/2023] [Indexed: 01/08/2024] Open
Abstract
Rare cell populations are key in neoplastic progression and therapeutic response, offering potential intervention targets. However, their computational identification and analysis often lag behind major cell types. To fill this gap, we introduce MarsGT: Multi-omics Analysis for Rare population inference using a Single-cell Graph Transformer. It identifies rare cell populations using a probability-based heterogeneous graph transformer on single-cell multi-omics data. MarsGT outperforms existing tools in identifying rare cells across 550 simulated and four real human datasets. In mouse retina data, it reveals unique subpopulations of rare bipolar cells and a Müller glia cell subpopulation. In human lymph node data, MarsGT detects an intermediate B cell population potentially acting as lymphoma precursors. In human melanoma data, it identifies a rare MAIT-like population impacted by a high IFN-I response and reveals the mechanism of immunotherapy. Hence, MarsGT offers biological insights and suggests potential strategies for early detection and therapeutic intervention of disease.
Collapse
Affiliation(s)
- Xiaoying Wang
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Maoteng Duan
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Gang Xin
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China.
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
11
|
Li X, Sun Z, Ma J, Yang M, Cao H, Jiao G. Identification of TNFRSF21 as an inhibitory factor of osteosarcoma based on a necroptosis-related prognostic gene signature and molecular experiments. Cancer Cell Int 2024; 24:14. [PMID: 38184626 PMCID: PMC10770912 DOI: 10.1186/s12935-023-03198-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 12/26/2023] [Indexed: 01/08/2024] Open
Abstract
BACKGROUND Osteosarcoma is one of the most common malignant bone tumors with bad prognosis. Necroptosis is a form of programmed cell death. Recent studies showed that targeting necroptosis was a new promising approach for tumor therapy. This study aimed to establish a necroptosis-related gene signature to evaluated prognosis and explore the relationship between necroptosis and osteosarcoma. METHODS Data from The Cancer Genome Atlas was used for developing the signature and the derived necroptosis score (NS). Data from Gene Expression Omnibus served as validation. Principal component analysis (PCA), Cox regression, receiver operating characteristic (ROC) curves and Kaplan-Meier survival analysis were used to assess the performance of signature. The association between the NS and osteosarcoma was analyzed via gene set enrichment analysis, gene set variation analysis and Pearson test. Single-cell data was used for further exploration. Among the genes that constituted the signature, the role of TNFRSF21 in osteosarcoma was unclear. Molecular experiments were used to explore TNFRSF21 function. RESULTS Our data revealed that lower NS indicated more active necroptosis in osteosarcoma. Patients with lower NS had a better prognosis. PCA and ROC curves demonstrated NS was effective to predict prognosis. NS was negatively associated with immune infiltration levels and tumor microenvironment scores and positively associated with tumor purity and stemness index. Single-cell data showed necroptosis heterogeneity in osteosarcoma. The cell communication pattern of malignant cells with high NS was positively correlated with tumor progression. The expression of TNFRSF21 was down-regulated in osteosarcoma cell lines. Overexpression of TNFRSF21 inhibited proliferation and motility of osteosarcoma cells. Mechanically, TNFRSF21 upregulated the phosphorylation levels of RIPK1, RIPK3 and MLKL to promote necroptosis in osteosarcoma. CONCLUSIONS The necroptosis prognostic signature and NS established in this study could be used as an independent prognostic factor, TNFRSF21 may be a necroptosis target in osteosarcoma therapy.
Collapse
Affiliation(s)
- Xiang Li
- Department of Orthopedics, Qilu Hospital of Shandong University, No.107, Wenhuaxi Road, Lixia District, Jinan, 250000, Shandong Province, China
- Cheeloo College of Medicine, Shandong University, Jinan, Shandong Province, China
| | - Zhenqian Sun
- Department of Orthopedics, Qilu Hospital of Shandong University, No.107, Wenhuaxi Road, Lixia District, Jinan, 250000, Shandong Province, China
- Cheeloo College of Medicine, Shandong University, Jinan, Shandong Province, China
| | - Jinlong Ma
- Department of Orthopedics, Qilu Hospital of Shandong University, No.107, Wenhuaxi Road, Lixia District, Jinan, 250000, Shandong Province, China
- Cheeloo College of Medicine, Shandong University, Jinan, Shandong Province, China
| | - Miaomiao Yang
- Department of Oncology, Yantai Yuhuangding Hospital, Yantai, Shandong Province, China
| | - Hongxin Cao
- Department of Medical Oncology, Qilu Hospital of Shandong University, Jinan, Shandong Province, China
- Key Laboratory of Chemical Biology (Ministry of Education), Institute of Biochemical and Biotechnological Drug, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan, Shandong Province, China
| | - Guangjun Jiao
- Department of Orthopedics, Qilu Hospital of Shandong University, No.107, Wenhuaxi Road, Lixia District, Jinan, 250000, Shandong Province, China.
- Cheeloo College of Medicine, Shandong University, Jinan, Shandong Province, China.
| |
Collapse
|
12
|
Huang X, Song C, Zhang G, Li Y, Zhao Y, Zhang Q, Zhang Y, Fan S, Zhao J, Xie L, Li C. scGRN: a comprehensive single-cell gene regulatory network platform of human and mouse. Nucleic Acids Res 2024; 52:D293-D303. [PMID: 37889053 PMCID: PMC10767939 DOI: 10.1093/nar/gkad885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/19/2023] [Accepted: 10/12/2023] [Indexed: 10/28/2023] Open
Abstract
Gene regulatory networks (GRNs) are interpretable graph models encompassing the regulatory interactions between transcription factors (TFs) and their downstream target genes. Making sense of the topology and dynamics of GRNs is fundamental to interpreting the mechanisms of disease etiology and translating corresponding findings into novel therapies. Recent advances in single-cell multi-omics techniques have prompted the computational inference of GRNs from single-cell transcriptomic and epigenomic data at an unprecedented resolution. Here, we present scGRN (https://bio.liclab.net/scGRN/), a comprehensive single-cell multi-omics gene regulatory network platform of human and mouse. The current version of scGRN catalogs 237 051 cell type-specific GRNs (62 999 692 TF-target gene pairs), covering 160 tissues/cell lines and 1324 single-cell samples. scGRN is the first resource documenting large-scale cell type-specific GRN information of diverse human and mouse conditions inferred from single-cell multi-omics data. We have implemented multiple online tools for effective GRN analysis, including differential TF-target network analysis, TF enrichment analysis, and pathway downstream analysis. We also provided details about TF binding to promoters, super-enhancers and typical enhancers of target genes in GRNs. Taken together, scGRN is an integrative and useful platform for searching, browsing, analyzing, visualizing and downloading GRNs of interest, enabling insight into the differences in regulatory mechanisms across diverse conditions.
Collapse
Affiliation(s)
- Xuemei Huang
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Chao Song
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Department of Cardiology, Hengyang Medical School, University of South China, Hengyang, China
| | - Guorui Zhang
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Ye Li
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Yu Zhao
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Qinyi Zhang
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Yuexin Zhang
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Institute of Cardiovascular Disease, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Shifan Fan
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Jun Zhao
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Liyuan Xie
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| | - Chunquan Li
- The First Affiliated Hospital & MOE Key Lab of Rare Pediatric Diseases, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Key Laboratory of Multi-omics and Artificial Intelligence of Cardiovascular Diseases & College of Basic Medical Sciences, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- School of Computer, University of South China, Hengyang, Hunan, 421001, China
- Hunan Provincial Maternal and Child Health Care Hospital, National Health Commission Key Laboratory of Birth Defect Research and Prevention, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
- The First Affiliated Hospital, Cardiovascular Lab of Big Data and Imaging Artificial Intelligence, Hengyang Medical School, University of South China, Hengyang, Hunan, 421001, China
| |
Collapse
|
13
|
Miao Z, Kim J. Uniform quantification of single-nucleus ATAC-seq data with Paired-Insertion Counting (PIC) and a model-based insertion rate estimator. Nat Methods 2024; 21:32-36. [PMID: 38049698 PMCID: PMC10776405 DOI: 10.1038/s41592-023-02103-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 10/25/2023] [Indexed: 12/06/2023]
Abstract
Existing approaches to scoring single-nucleus assay for transposase-accessible chromatin with sequencing (snATAC-seq) feature matrices from sequencing reads are inconsistent, affecting downstream analyses and displaying artifacts. We show that, even with sparse single-cell data, quantitative counts are informative for estimating the regulatory state of a cell, which calls for a consistent treatment. We propose Paired-Insertion Counting as a uniform method for snATAC-seq feature characterization and provide a probability model for inferring latent insertion dynamics from snATAC-seq count matrices.
Collapse
Affiliation(s)
- Zhen Miao
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | - Junhyong Kim
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Biology, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
14
|
Patruno L, Milite S, Bergamin R, Calonaci N, D’Onofrio A, Anselmi F, Antoniotti M, Graudenzi A, Caravagna G. A Bayesian method to infer copy number clones from single-cell RNA and ATAC sequencing. PLoS Comput Biol 2023; 19:e1011557. [PMID: 37917660 PMCID: PMC10645363 DOI: 10.1371/journal.pcbi.1011557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 11/14/2023] [Accepted: 09/30/2023] [Indexed: 11/04/2023] Open
Abstract
Single-cell RNA and ATAC sequencing technologies enable the examination of gene expression and chromatin accessibility in individual cells, providing insights into cellular phenotypes. In cancer research, it is important to consistently analyze these states within an evolutionary context on genetic clones. Here we present CONGAS+, a Bayesian model to map single-cell RNA and ATAC profiles onto the latent space of copy number clones. CONGAS+ clusters cells into tumour subclones with similar ploidy, rendering straightforward to compare their expression and chromatin profiles. The framework, implemented on GPU and tested on real and simulated data, scales to analyse seamlessly thousands of cells, demonstrating better performance than single-molecule models, and supporting new multi-omics assays. In prostate cancer, lymphoma and basal cell carcinoma, CONGAS+ successfully identifies complex subclonal architectures while providing a coherent mapping between ATAC and RNA, facilitating the study of genotype-phenotype maps and their connection to genomic instability.
Collapse
Affiliation(s)
- Lucrezia Patruno
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| | - Salvatore Milite
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
- Centre for Computational Biology, Human Technopole, Milan, Italy
| | - Riccardo Bergamin
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| | - Nicola Calonaci
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| | - Alberto D’Onofrio
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| | - Fabio Anselmi
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| | - Marco Antoniotti
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
- B4—Bicocca Bioinformatics Biostatistics and Bioimaging Centre, Università degli Studi di Milano-Bicocca, Milan, Italy
| | - Alex Graudenzi
- Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy
- B4—Bicocca Bioinformatics Biostatistics and Bioimaging Centre, Università degli Studi di Milano-Bicocca, Milan, Italy
| | - Giulio Caravagna
- Department of Mathematics and Geosciences, Università degli Studi di Trieste, Trieste, Italy
| |
Collapse
|
15
|
Carbonetto P, Luo K, Sarkar A, Hung A, Tayeb K, Pott S, Stephens M. GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership. Genome Biol 2023; 24:236. [PMID: 37858253 PMCID: PMC10588049 DOI: 10.1186/s13059-023-03067-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 09/20/2023] [Indexed: 10/21/2023] Open
Abstract
Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets.
Collapse
Affiliation(s)
- Peter Carbonetto
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Research Computing Center, University of Chicago, Chicago, IL, USA
| | - Kaixuan Luo
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Abhishek Sarkar
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Vesalius Therapeutics, Cambridge, MA, USA
| | - Anthony Hung
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Section of Genetic Medicine, University of Chicago, Chicago, IL, USA
| | - Karl Tayeb
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Committee on Genetics, Genomics and Systems Biology, University of Chicago, Chicago, IL, USA
| | - Sebastian Pott
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Section of Genetic Medicine, University of Chicago, Chicago, IL, USA
| | - Matthew Stephens
- Department of Human Genetics, University of Chicago, Chicago, IL, USA.
- Department of Statistics, University of Chicago, Chicago, IL, USA.
| |
Collapse
|
16
|
Ren P, Shi X, Yu Z, Dong X, Ding X, Wang J, Sun L, Yan Y, Hu J, Zhang P, Chen Q, Zhang J, Li T, Wang C. Single-cell assignment using multiple-adversarial domain adaptation network with large-scale references. CELL REPORTS METHODS 2023; 3:100577. [PMID: 37751689 PMCID: PMC10545911 DOI: 10.1016/j.crmeth.2023.100577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 06/11/2023] [Accepted: 08/09/2023] [Indexed: 09/28/2023]
Abstract
The rapid accumulation of single-cell RNA-seq data has provided rich resources to characterize various human cell populations. However, achieving accurate cell-type annotation using public references presents challenges due to inconsistent annotations, batch effects, and rare cell types. Here, we introduce SELINA (single-cell identity navigator), an integrative and automatic cell-type annotation framework based on a pre-curated reference atlas spanning various tissues. SELINA employs a multiple-adversarial domain adaptation network to remove batch effects within the reference dataset. Additionally, it enhances the annotation of less frequent cell types by synthetic minority oversampling and fits query data with the reference data using an autoencoder. SELINA culminates in the creation of a comprehensive and uniform reference atlas, encompassing 1.7 million cells covering 230 distinct human cell types. We substantiate its robustness and superiority across a multitude of human tissues. Notably, SELINA could accurately annotate cells within diverse disease contexts. SELINA provides a complete solution for human single-cell RNA-seq data annotation with both python and R packages.
Collapse
Affiliation(s)
- Pengfei Ren
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China; Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100084, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100084, China
| | - Xiaoying Shi
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zhiguang Yu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Guangxi 530004, China
| | - Xin Dong
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Xuanxin Ding
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Jin Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Liangdong Sun
- Department of Thoracic Surgery, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200433, China
| | - Yilv Yan
- Department of Thoracic Surgery, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200433, China
| | - Junjie Hu
- Department of Thoracic Surgery, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200433, China
| | - Peng Zhang
- Department of Thoracic Surgery, Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai 200433, China
| | - Qianming Chen
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China; Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China
| | - Jing Zhang
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Science and Technology, Tongji University, Shanghai, China.
| | - Taiwen Li
- State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Research Unit of Oral Carcinogenesis and Management, Chinese Academy of Medical Sciences, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China; Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Cancer Medicine, Nanjing Medical University, Nanjing 211166, China.
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai 200092, China; Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China.
| |
Collapse
|
17
|
Tian L, Xie Y, Xie Z, Tian J, Tian W. AtacAnnoR: a reference-based annotation tool for single cell ATAC-seq data. Brief Bioinform 2023; 24:bbad268. [PMID: 37497729 DOI: 10.1093/bib/bbad268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 06/14/2023] [Accepted: 07/04/2023] [Indexed: 07/28/2023] Open
Abstract
Here, we present AtacAnnoR, a two-round annotation method for scATAC-seq data using well-annotated scRNA-seq data as reference. We evaluate AtacAnnoR's performance against six competing methods on 11 benchmark datasets. Our results show that AtacAnnoR achieves the highest mean accuracy and the highest mean balanced accuracy and performs particularly well when unpaired scRNA-seq data are used as the reference. Furthermore, AtacAnnoR implements a 'Combine and Discard' strategy to further improve annotation accuracy when annotations of multiple references are available. AtacAnnoR has been implemented in an R package and can be directly integrated into currently popular scATAC-seq analysis pipelines.
Collapse
Affiliation(s)
- Lejin Tian
- State Key Laboratory of Genetic Engineering, Department of Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
| | - Yunxiao Xie
- State Key Laboratory of Genetic Engineering, Department of Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
| | - Zhaobin Xie
- State Key Laboratory of Genetic Engineering, Department of Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
| | | | - Weidong Tian
- State Key Laboratory of Genetic Engineering, Department of Computational Biology, School of Life Sciences, Fudan University, Shanghai, China
- Children's Hospital of Fudan University, Shanghai, China
- Children's Hospital of Shandong University, Jinan, China
| |
Collapse
|
18
|
Carbonetto P, Luo K, Sarkar A, Hung A, Tayeb K, Pott S, Stephens M. GoM DE: interpreting structure in sequence count data with differential expression analysis allowing for grades of membership. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.03.531029. [PMID: 36945441 PMCID: PMC10028846 DOI: 10.1101/2023.03.03.531029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
Abstract
Parts-based representations, such as non-negative matrix factorization and topic modeling, have been used to identify structure from single-cell sequencing data sets, in particular structure that is not as well captured by clustering or other dimensionality reduction methods. However, interpreting the individual parts remains a challenge. To address this challenge, we extend methods for differential expression analysis by allowing cells to have partial membership to multiple groups. We call this grade of membership differential expression (GoM DE). We illustrate the benefits of GoM DE for annotating topics identified in several single-cell RNA-seq and ATAC-seq data sets.
Collapse
Affiliation(s)
- Peter Carbonetto
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Research Computing Center, University of Chicago, Chicago, IL, USA
| | - Kaixuan Luo
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Abhishek Sarkar
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Vesalius Therapeutics, Cambridge, MA, USA
| | - Anthony Hung
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Section of Genetic Medicine, University of Chicago, Chicago, IL, USA
| | - Karl Tayeb
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Committee on Genetics, Genomics and Systems Biology, University of Chicago, Chicago, IL, USA
| | - Sebastian Pott
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Section of Genetic Medicine, University of Chicago, Chicago, IL, USA
| | - Matthew Stephens
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
- Department of Statistics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
19
|
Chen S, Zhu B, Huang S, Hickey JW, Lin KZ, Snyder M, Greenleaf WJ, Nolan GP, Zhang NR, Ma Z. Integration of spatial and single-cell data across modalities with weakly linked features. Nat Biotechnol 2023:10.1038/s41587-023-01935-0. [PMID: 37679544 DOI: 10.1038/s41587-023-01935-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 08/02/2023] [Indexed: 09/09/2023]
Abstract
Although single-cell and spatial sequencing methods enable simultaneous measurement of more than one biological modality, no technology can capture all modalities within the same cell. For current data integration methods, the feasibility of cross-modal integration relies on the existence of highly correlated, a priori 'linked' features. We describe matching X-modality via fuzzy smoothed embedding (MaxFuse), a cross-modal data integration method that, through iterative coembedding, data smoothing and cell matching, uses all information in each modality to obtain high-quality integration even when features are weakly linked. MaxFuse is modality-agnostic and demonstrates high robustness and accuracy in the weak linkage scenario, achieving 20~70% relative improvement over existing methods under key evaluation metrics on benchmarking datasets. A prototypical example of weak linkage is the integration of spatial proteomic data with single-cell sequencing data. On two example analyses of this type, MaxFuse enabled the spatial consolidation of proteomic, transcriptomic and epigenomic information at single-cell resolution on the same tissue section.
Collapse
Affiliation(s)
- Shuxiao Chen
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - Bokai Zhu
- Department of Microbiology and Immunology, Stanford University, Stanford, CA, USA
- Department of Pathology, Stanford University, Stanford, CA, USA
| | - Sijia Huang
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA
| | - John W Hickey
- Department of Pathology, Stanford University, Stanford, CA, USA
| | - Kevin Z Lin
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Michael Snyder
- Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Garry P Nolan
- Department of Microbiology and Immunology, Stanford University, Stanford, CA, USA.
- Department of Pathology, Stanford University, Stanford, CA, USA.
| | - Nancy R Zhang
- Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, PA, USA.
| | - Zongming Ma
- Department of Statistics and Data Science, Yale University, New Haven, CT, USA.
| |
Collapse
|
20
|
Xu C, Song LY, Zhou Y, Ma DN, Ding QS, Guo ZJ, Li J, Song SW, Zhang LD, Zheng HL. Integration of eQTL and GWAS analysis uncovers a genetic regulation of natural ionomic variation in Arabidopsis. PLANT CELL REPORTS 2023; 42:1473-1485. [PMID: 37516984 DOI: 10.1007/s00299-023-03042-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 06/12/2023] [Indexed: 08/01/2023]
Abstract
KEY MESSAGE This study provided important insights into the genetic architecture of variations in A. thaliana leaf ionome in a cell-type-specific manner. The functional interpretation of traits associated variants by expression quantitative trait loci (eQTL) analysis is usually performed in bulk tissue samples. While the regulation of gene expression is context-dependent, such as cell-type-specific manner. In this study, we estimated cell-type abundances from 728 bulk tissue samples using single-cell RNA-sequencing dataset, and performed cis-eQTL mapping to identify cell-type-interaction eQTL (cis-eQTLs(ci)) in A. thaliana. Also, we performed Genome-wide association studies (GWAS) analyses for 999 accessions to identify the genetic basis of variations in A. thaliana leaf ionome. As a result, a total of 5,664 unique eQTL genes and 15,038 unique cis-eQTLs(ci) were significant. The majority (62.83%) of cis-eQTLs(ci) were cell-type-specific eQTLs. Using colocalization, we uncovered one interested gene AT2G25590 in Phloem cell, encoding a kind of plant Tudor-like protein with possible chromatin-associated functions, which colocalized with the most significant cis-eQTL(ci) of a Mo-related locus (Chr2:10,908,806:A:C; P = 3.27 × 10-27). Furthermore, we prioritized eight target genes associated with AT2G25590, which were previously reported in regulating the concentration of Mo element in A. thaliana. This study revealed the genetic regulation of ionomic variations and provided a foundation for further studies on molecular mechanisms of genetic variants controlling the A. thaliana ionome.
Collapse
Affiliation(s)
- Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ling-Yu Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ying Zhou
- School of Medicine, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361102, China
| | - Dong-Na Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
- National Engineering Research Center of Cereal Fermentation and Food Biomanufacturing, School of Food Science and Technology, Jiangnan University, Wuxi, 214122, Jiangsu, China
| | - Qian-Su Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Ze-Jun Guo
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Jing Li
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Shi-Wei Song
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Lu-Dan Zhang
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China
| | - Hai-Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, Xiamen, 361104, China.
| |
Collapse
|
21
|
Wang X, Duan M, Li J, Ma A, Xu D, Li Z, Liu B, Ma Q. MarsGT: Multi-omics analysis for rare population inference using single-cell graph transformer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.15.553454. [PMID: 37645917 PMCID: PMC10462017 DOI: 10.1101/2023.08.15.553454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Rare cell populations are key in neoplastic progression and therapeutic response, offering potential intervention targets. However, their computational identification and analysis often lag behind major cell types. To fill this gap, we introduced MarsGT: Multi-omics Analysis for Rare population inference using Single-cell Graph Transformer. It identifies rare cell populations using a probability-based heterogeneous graph transformer on single-cell multi-omics data. MarsGT outperformed existing tools in identifying rare cells across 400 simulated and four real human datasets. In mouse retina data, it revealed unique subpopulations of rare bipolar cells and a Müller glia cell subpopulation. In human lymph node data, MarsGT detected an intermediate B cell population potentially acting as lymphoma precursors. In human melanoma data, it identified a rare MAIT-like population impacted by a high IFN-I response and revealed the mechanism of immunotherapy. Hence, MarsGT offers biological insights and suggests potential strategies for early detection and therapeutic intervention of disease.
Collapse
Affiliation(s)
- Xiaoying Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Maoteng Duan
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
22
|
Cao G, Yue J, Ruan Y, Han Y, Zhi Y, Lu J, Liu M, Xu X, Wang J, Gu Q, Wen X, Gao J, Zhang Q, Kang J, Wang C, Li F. Single-cell dissection of cervical cancer reveals key subsets of the tumor immune microenvironment. EMBO J 2023; 42:e110757. [PMID: 37427448 PMCID: PMC10425846 DOI: 10.15252/embj.2022110757] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 04/05/2023] [Accepted: 05/19/2023] [Indexed: 07/11/2023] Open
Abstract
The tumor microenvironment (TME) directly determines patients' outcomes and therapeutic efficiencies. An in-depth understanding of the TME is required to improve the prognosis of patients with cervical cancer (CC). This study conducted single-cell RNA and TCR sequencing of six-paired tumors and adjacent normal tissues to map the CC immune landscape. T and NK cells were highly enriched in the tumor area and transitioned from cytotoxic to exhaustion phenotypes. Our analyses suggest that cytotoxic large-clone T cells are critical effectors in the antitumor response. This study also revealed tumor-specific germinal center B cells associated with tertiary lymphoid structures. A high-germinal center B cell proportion in patients with CC is predictive of improved clinical outcomes and is associated with elevated hormonal immune responses. We depicted an immune-excluded stromal landscape and established a joint model of tumor and stromal cells to predict CC patients' prognosis. The study revealed tumor ecosystem subsets linked to antitumor response or prognosis in the TME and provides information for future combinational immunotherapy.
Collapse
Affiliation(s)
- Guangxu Cao
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Jiali Yue
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, Frontier Science Center for Stem Cells, School of Life Science and TechnologyTongji UniversityShanghaiChina
| | - Yetian Ruan
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Ya Han
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, Frontier Science Center for Stem Cells, School of Life Science and TechnologyTongji UniversityShanghaiChina
| | - Yong Zhi
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Jianqiao Lu
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Min Liu
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Xinxin Xu
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Jin Wang
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Quan Gu
- CVR BioinformaticsUniversity of Glasgow Centre for Virus ResearchGlasgowUK
| | - Xuejun Wen
- Department of Chemical and Life Science Engineering, School of EngineeringVirginia Commonwealth UniversityRichmondVirginiaUSA
| | - Jinli Gao
- Department of Pathology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| | - Qingfeng Zhang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, Frontier Science Center for Stem Cells, School of Life Science and TechnologyTongji UniversityShanghaiChina
| | - Jiuhong Kang
- Clinical and Translational Research Center of Shanghai First Maternity and Infant Hospital, Shanghai Key Laboratory of Signaling and Disease Research, Collaborative Innovation Center for Brain Science, School of Life Sciences and TechnologyTongji UniversityShanghaiChina
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, Frontier Science Center for Stem Cells, School of Life Science and TechnologyTongji UniversityShanghaiChina
| | - Fang Li
- Department of Obstetrics and Gynecology, Shanghai East Hospital, School of MedicineTongji UniversityShanghaiChina
| |
Collapse
|
23
|
Shen L, Jiang S, Yang Y, Yang H, Fang Y, Tang M, Zhu R, Xu J, Jiang H. Pan-cancer and single-cell analysis reveal the prognostic value and immune response of NQO1. Front Cell Dev Biol 2023; 11:1174535. [PMID: 37583897 PMCID: PMC10424457 DOI: 10.3389/fcell.2023.1174535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 07/21/2023] [Indexed: 08/17/2023] Open
Abstract
Background: Overexpression of the NAD(P)H: Quinone Oxidoreductase 1 (NQOI) gene has been linked with tumor progression, aggressiveness, drug resistance, and poor patient prognosis. Most research has described the biological function of the NQO1 in certain types and limited samples, but a comprehensive understanding of the NQO1's function and clinical importance at the pan-cancer level is scarce. More research is needed to understand the role of NQO1 in tumor infiltration, and immune checkpoint inhibitors in various cancers are needed. Methods: The NQO1 expression data for 33 types of pan-cancer and their association with the prognosis, pathologic stage, gender, immune cell infiltration, the tumor mutation burden, microsatellite instability, immune checkpoints, enrichment pathways, and the half-maximal inhibitory concentration (IC50) were downloaded from public databases. Results: Our findings indicate that the NQO1 gene was significantly upregulated in most cancer types. The Cox regression analysis showed that overexpression of the NQO1 gene was related to poor OS in Glioma, uveal melanoma, head and neck squamous cell carcinoma, kidney renal papillary cell carcinoma, and adrenocortical carcinoma. NQO1 mRNA expression positively correlated with infiltrating immune cells and checkpoint molecule levels. The single-cell analysis revealed a potential relationship between the NQO1 mRNA expression levels and the infiltration of immune cells and stromal cells in bladder urothelial carcinoma, invasive breast carcinoma, and colorectal cancer. Conversely, a negative association was noted between various drugs (17-AAG, Lapatinib, Trametinib, PD-0325901) and the NQO1 mRNA expression levels. Conclusion: NQO1 expression was significantly associated with prognosis, immune infiltrates, and drug resistance in multiple cancer types. The inhibition of the NQO1-dependent signaling pathways may provide a promising strategy for developing new cancer-targeted therapies.
Collapse
Affiliation(s)
- Liping Shen
- Department of Clinical Laboratory, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| | - Shan Jiang
- Department of Radiology, Jining No. 1 People’s Hospital, Jining, Shandong, China
| | - Yu Yang
- Department of Orthopedics, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| | - Hongli Yang
- Department of Pathology, The First Affiliated Hospital of Soochow University, Suzhou, Jiangsu, China
| | - Yanchun Fang
- Department of Ultrasonography, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| | - Meng Tang
- Department of Ultrasonography, Jining No. 1 People’s Hospital, Jining, Shandong, China
| | - Rangteng Zhu
- Department of Orthopedics, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| | - Jiaqin Xu
- Department of Clinical Laboratory, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| | - Hantao Jiang
- Department of Orthopedics, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical, Taizhou, Zhejiang, China
| |
Collapse
|
24
|
Littman R, Cheng M, Wang N, Peng C, Yang X. SCING: Inference of robust, interpretable gene regulatory networks from single cell and spatial transcriptomics. iScience 2023; 26:107124. [PMID: 37434694 PMCID: PMC10331489 DOI: 10.1016/j.isci.2023.107124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 03/31/2023] [Accepted: 06/09/2023] [Indexed: 07/13/2023] Open
Abstract
Gene regulatory network (GRN) inference is an integral part of understanding physiology and disease. Single cell/nuclei RNA-seq (scRNA-seq/snRNA-seq) data has been used to elucidate cell-type GRNs; however, the accuracy and speed of current scRNAseq-based GRN approaches are suboptimal. Here, we present Single Cell INtegrative Gene regulatory network inference (SCING), a gradient boosting and mutual information-based approach for identifying robust GRNs from scRNA-seq, snRNA-seq, and spatial transcriptomics data. Performance evaluation using Perturb-seq datasets, held-out data, and the mouse cell atlas combined with the DisGeNET database demonstrates the improved accuracy and biological interpretability of SCING compared to existing methods. We applied SCING to the entire mouse single cell atlas, human Alzheimer's disease (AD), and mouse AD spatial transcriptomics. SCING GRNs reveal unique disease subnetwork modeling capabilities, have intrinsic capacity to correct for batch effects, retrieve disease relevant genes and pathways, and are informative on spatial specificity of disease pathogenesis.
Collapse
Affiliation(s)
- Russell Littman
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Michael Cheng
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
| | - Ning Wang
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
| | - Chao Peng
- Department of Neurology, David Geffen School of Medicine, UCLA, Los Angeles, CA, USA
| | - Xia Yang
- Department of Integrative Biology & Physiology, UCLA, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, UCLA, Los Angeles, CA, USA
- Institute for Quantitative and Computational Biosciences (QCBio), Los Angeles, CA, USA
- Molecular Biology Institute (MBI), Los Angeles, CA, USA
- Brain Research Institute (BRI), Los Angeles, CA, USA
| |
Collapse
|
25
|
Fangma Y, Liu M, Liao J, Chen Z, Zheng Y. Dissecting the brain with spatially resolved multi-omics. J Pharm Anal 2023; 13:694-710. [PMID: 37577383 PMCID: PMC10422112 DOI: 10.1016/j.jpha.2023.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 04/04/2023] [Accepted: 04/06/2023] [Indexed: 08/15/2023] Open
Abstract
Recent studies have highlighted spatially resolved multi-omics technologies, including spatial genomics, transcriptomics, proteomics, and metabolomics, as powerful tools to decipher the spatial heterogeneity of the brain. Here, we focus on two major approaches in spatial transcriptomics (next-generation sequencing-based technologies and image-based technologies), and mass spectrometry imaging technologies used in spatial proteomics and spatial metabolomics. Furthermore, we discuss their applications in neuroscience, including building the brain atlas, uncovering gene expression patterns of neurons for special behaviors, deciphering the molecular basis of neuronal communication, and providing a more comprehensive explanation of the molecular mechanisms underlying central nervous system disorders. However, further efforts are still needed toward the integrative application of multi-omics technologies, including the real-time spatial multi-omics analysis in living cells, the detailed gene profile in a whole-brain view, and the combination of functional verification.
Collapse
Affiliation(s)
- Yijia Fangma
- Key Laboratory of Neuropharmacology and Translational Medicine of Zhejiang Province, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, 310053, China
| | - Mengting Liu
- Key Laboratory of Neuropharmacology and Translational Medicine of Zhejiang Province, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, 310053, China
| | - Jie Liao
- Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhong Chen
- Key Laboratory of Neuropharmacology and Translational Medicine of Zhejiang Province, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, 310053, China
| | - Yanrong Zheng
- Key Laboratory of Neuropharmacology and Translational Medicine of Zhejiang Province, School of Pharmaceutical Sciences, Zhejiang Chinese Medical University, Hangzhou, 310053, China
| |
Collapse
|
26
|
Fernández-Moya SM, Ganesh AJ, Plass M. Neural cell diversity in the light of single-cell transcriptomics. Transcription 2023; 14:158-176. [PMID: 38229529 PMCID: PMC10807474 DOI: 10.1080/21541264.2023.2295044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Accepted: 11/10/2023] [Indexed: 01/18/2024] Open
Abstract
The development of highly parallel and affordable high-throughput single-cell transcriptomics technologies has revolutionized our understanding of brain complexity. These methods have been used to build cellular maps of the brain, its different regions, and catalog the diversity of cells in each of them during development, aging and even in disease. Now we know that cellular diversity is way beyond what was previously thought. Single-cell transcriptomics analyses have revealed that cell types previously considered homogeneous based on imaging techniques differ depending on several factors including sex, age and location within the brain. The expression profiles of these cells have also been exploited to understand which are the regulatory programs behind cellular diversity and decipher the transcriptional pathways driving them. In this review, we summarize how single-cell transcriptomics have changed our view on the cellular diversity in the human brain, and how it could impact the way we study neurodegenerative diseases. Moreover, we describe the new computational approaches that can be used to study cellular differentiation and gain insight into the functions of individual cell populations under different conditions and their alterations in disease.
Collapse
Affiliation(s)
- Sandra María Fernández-Moya
- Gene Regulation of Cell Identity, Regenerative Medicine Program, Bellvitge Institute for Biomedical Research (IDIBELL), Barcelona, L’Hospitalet del Llobregat, Spain
- Program for Advancing Clinical Translation of Regenerative Medicine of Catalonia, P- CMR[C], Barcelona, L’Hospitalet del Llobregat, Spain
| | - Akshay Jaya Ganesh
- Gene Regulation of Cell Identity, Regenerative Medicine Program, Bellvitge Institute for Biomedical Research (IDIBELL), Barcelona, L’Hospitalet del Llobregat, Spain
- Program for Advancing Clinical Translation of Regenerative Medicine of Catalonia, P- CMR[C], Barcelona, L’Hospitalet del Llobregat, Spain
| | - Mireya Plass
- Gene Regulation of Cell Identity, Regenerative Medicine Program, Bellvitge Institute for Biomedical Research (IDIBELL), Barcelona, L’Hospitalet del Llobregat, Spain
- Program for Advancing Clinical Translation of Regenerative Medicine of Catalonia, P- CMR[C], Barcelona, L’Hospitalet del Llobregat, Spain
- Center for Networked Biomedical Research on Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain
| |
Collapse
|
27
|
Duque AF, Morin S, Wolf G, Moon KR. Geometry Regularized Autoencoders. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:7381-7394. [PMID: 36374884 PMCID: PMC10339657 DOI: 10.1109/tpami.2022.3222104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
A fundamental task in data exploration is to extract low dimensional representations that capture intrinsic geometry in data, especially for faithfully visualizing data in two or three dimensions. Common approaches use kernel methods for manifold learning. However, these methods typically only provide an embedding of the input data and cannot extend naturally to new data points. Autoencoders have also become popular for representation learning. While they naturally compute feature extractors that are extendable to new data and invertible (i.e., reconstructing original features from latent representation), they often fail at representing the intrinsic data geometry compared to kernel-based manifold learning. We present a new method for integrating both approaches by incorporating a geometric regularization term in the bottleneck of the autoencoder. This regularization encourages the learned latent representation to follow the intrinsic data geometry, similar to manifold learning algorithms, while still enabling faithful extension to new data and preserving invertibility. We compare our approach to autoencoder models for manifold learning to provide qualitative and quantitative evidence of our advantages in preserving intrinsic structure, out of sample extension, and reconstruction. Our method is easily implemented for big-data applications, whereas other methods are limited in this regard.
Collapse
Affiliation(s)
- Andres F. Duque
- Department of Mathematics & Statistics, Utah State University, Logan, UT 84322 US
| | - Sacha Morin
- Departments of Computer Science & Operations Research and Mathematics & Statistics (correspondingly), Université de Montréal, Montréal, Quebec, H3T 1J4, Canada; Mila - Quebec AI Institute, Montreal, Quebec, H2S 3H1, Canada
| | - Guy Wolf
- Departments of Computer Science & Operations Research and Mathematics & Statistics (correspondingly), Université de Montréal, Montréal, Quebec, H3T 1J4, Canada; Mila - Quebec AI Institute, Montreal, Quebec, H2S 3H1, Canada
| | - Kevin R. Moon
- Department of Mathematics & Statistics, Utah State University, Logan, UT 84322 US
| |
Collapse
|
28
|
Ziyani C, Delaneau O, Ribeiro DM. Multimodal single cell analysis infers widespread enhancer co-activity in a lymphoblastoid cell line. Commun Biol 2023; 6:563. [PMID: 37237005 DOI: 10.1038/s42003-023-04954-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 05/18/2023] [Indexed: 05/28/2023] Open
Abstract
Non-coding regulatory elements such as enhancers are key in controlling the cell-type specificity and spatio-temporal expression of genes. To drive stable and precise gene transcription robust to genetic variation and environmental stress, genes are often targeted by multiple enhancers with redundant action. However, it is unknown whether enhancers targeting the same gene display simultaneous activity or whether some enhancer combinations are more often co-active than others. Here, we take advantage of recent developments in single cell technology that permit assessing chromatin status (scATAC-seq) and gene expression (scRNA-seq) in the same single cells to correlate gene expression to the activity of multiple enhancers. Measuring activity patterns across 24,844 human lymphoblastoid single cells, we find that the majority of enhancers associated with the same gene display significant correlation in their chromatin profiles. For 6944 expressed genes associated with enhancers, we predict 89,885 significant enhancer-enhancer associations between nearby enhancers. We find that associated enhancers share similar transcription factor binding profiles and that gene essentiality is linked with higher enhancer co-activity. We provide a set of predicted enhancer-enhancer associations based on correlation derived from a single cell line, which can be further investigated for functional relevance.
Collapse
Affiliation(s)
- Chaymae Ziyani
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Olivier Delaneau
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Diogo M Ribeiro
- Department of Computational Biology, University of Lausanne, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
| |
Collapse
|
29
|
Zhang Y, Xiang G, Jiang AY, Lynch A, Zeng Z, Wang C, Zhang W, Fan J, Kang J, Gu SS, Wan C, Zhang B, Liu XS, Brown M, Meyer CA. MetaTiME integrates single-cell gene expression to characterize the meta-components of the tumor immune microenvironment. Nat Commun 2023; 14:2634. [PMID: 37149682 PMCID: PMC10164163 DOI: 10.1038/s41467-023-38333-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 04/26/2023] [Indexed: 05/08/2023] Open
Abstract
Recent advances in single-cell RNA sequencing have shown heterogeneous cell types and gene expression states in the non-cancerous cells in tumors. The integration of multiple scRNA-seq datasets across tumors can indicate common cell types and states in the tumor microenvironment (TME). We develop a data driven framework, MetaTiME, to overcome the limitations in resolution and consistency that result from manual labelling using known gene markers. Using millions of TME single cells, MetaTiME learns meta-components that encode independent components of gene expression observed across cancer types. The meta-components are biologically interpretable as cell types, cell states, and signaling activities. By projecting onto the MetaTiME space, we provide a tool to annotate cell states and signature continuums for TME scRNA-seq data. Leveraging epigenetics data, MetaTiME reveals critical transcriptional regulators for the cell states. Overall, MetaTiME learns data-driven meta-components that depict cellular states and gene regulators for tumor immunity and cancer immunotherapy.
Collapse
Affiliation(s)
- Yi Zhang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Guanjue Xiang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Alva Yijia Jiang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Allen Lynch
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Zexian Zeng
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Chenfei Wang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Wubing Zhang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Jingyu Fan
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Jiajinlong Kang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Shengqing Stan Gu
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Changxin Wan
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - Boning Zhang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA
| | - X Shirley Liu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA.
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Myles Brown
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Clifford A Meyer
- Department of Data Science, Dana-Farber Cancer Institute, Boston, MA, 02215, USA.
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, 02215, USA.
- Center for Functional Cancer Epigenetics, Dana-Farber Cancer Institute, Boston, MA, USA.
| |
Collapse
|
30
|
Lv J, Jiang Z, Yuan J, Zhuang M, Guan X, Liu H, Yin Y, Ma Y, Liu Z, Wang H, Wang X. Pan-cancer analysis identifies PD-L2 as a tumor promotor in the tumor microenvironment. Front Immunol 2023; 14:1093716. [PMID: 37006239 PMCID: PMC10060638 DOI: 10.3389/fimmu.2023.1093716] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 01/19/2023] [Indexed: 03/18/2023] Open
Abstract
BACKGROUND Programmed cell death protein 1 (PD-1) receptor has two ligands,programmed death-ligand 1 (PD-L1) and PD-L2. When compared with PD-L1, PD-L2 has not received much attention, and its role remains unclear. METHODS The expression profiles of pdcd1lg2 (PD-L2-encoding gene) mRNA and PD-L2 protein were analyzed using TCGA, ICGC, and HPA databases. Kaplan-Meier and Cox regression analyses were used to assess the prognostic significance of PD-L2. We used GSEA, Spearman's correlation analysis and PPI network to explore the biological functions of PD-L2. PD-L2-associated immune cell infiltration was evaluated using the ESTIMATE algorithm and TIMER 2.0. The expressions of PD-L2 in tumor-associated macrophages (TAMs) in human colon cancer samples, and in mice in an immunocompetent syngeneic setting were verified using scRNA-seq datasets, multiplex immunofluorescence staining, and flow cytometry. After fluorescence-activated cell sorting, flow cytometry and qRT-PCR and transwell and colony formation assays were used to evaluate the phenotype and functions of PD-L2+TAMs. Immune checkpoint inhibitors (ICIs) therapy prediction analysis was performed using TIDE and TISMO. Last, a series of targeted small-molecule drugs with promising therapeutic effects were predicted using the GSCA platform. RESULTS PD-L2 was expressed in all the common human cancer types and deteriorated outcomes in multiple cancers. PPI network and Spearman's correlation analysis revealed that PD-L2 was closely associated with many immune molecules. Moreover, both GSEA results of KEGG pathways and GSEA results for Reactome analysis indicated that PD-L2 expression played an important role in cancer immune response. Further analysis showed that PD-L2 expression was strongly associated with the infiltration of immune cells in tumor tissue in almost all cancer types, among which macrophages were the most positively associated with PD-L2 in colon cancer. According to the results mentioned above, we verified the expression of PD-L2 in TAMs in colon cancer and found that PD-L2+TAMs population was not static. Additionally, PD-L2+TAMs exhibited protumor M2 phenotype and increased the migration, invasion, and proliferative capacity of colon cancer cells. Furthermore, PD-L2 had a substantial predictive value for ICIs therapy cohorts. CONCLUSION PD-L2 in the TME, especially expressed on TAMs, could be applied as a potential therapeutic target.
Collapse
Affiliation(s)
- Jingfang Lv
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Zheng Jiang
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Junhu Yuan
- State Key Laboratory of Molecular Oncology, National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Meng Zhuang
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xu Guan
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hengchang Liu
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yefeng Yin
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Yiming Ma
- State Key Laboratory of Molecular Oncology, National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Zheng Liu
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hongying Wang
- State Key Laboratory of Molecular Oncology, National Cancer Center/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Xishan Wang
- Department of Colorectal Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| |
Collapse
|
31
|
Sahu A, Wang X, Munson P, Klomp JP, Wang X, Gu SS, Han Y, Qian G, Nicol P, Zeng Z, Wang C, Tokheim C, Zhang W, Fu J, Wang J, Nair NU, Rens JA, Bourajjaj M, Jansen B, Leenders I, Lemmers J, Musters M, van Zanten S, van Zelst L, Worthington J, Liu JS, Juric D, Meyer CA, Oubrie A, Liu XS, Fisher DE, Flaherty KT. Discovery of Targets for Immune-Metabolic Antitumor Drugs Identifies Estrogen-Related Receptor Alpha. Cancer Discov 2023; 13:672-701. [PMID: 36745048 PMCID: PMC9975674 DOI: 10.1158/2159-8290.cd-22-0244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 09/13/2022] [Accepted: 11/23/2022] [Indexed: 02/07/2023]
Abstract
Drugs that kill tumors through multiple mechanisms have the potential for broad clinical benefits. Here, we first developed an in silico multiomics approach (BipotentR) to find cancer cell-specific regulators that simultaneously modulate tumor immunity and another oncogenic pathway and then used it to identify 38 candidate immune-metabolic regulators. We show the tumor activities of these regulators stratify patients with melanoma by their response to anti-PD-1 using machine learning and deep neural approaches, which improve the predictive power of current biomarkers. The topmost identified regulator, ESRRA, is activated in immunotherapy-resistant tumors. Its inhibition killed tumors by suppressing energy metabolism and activating two immune mechanisms: (i) cytokine induction, causing proinflammatory macrophage polarization, and (ii) antigen-presentation stimulation, recruiting CD8+ T cells into tumors. We also demonstrate a wide utility of BipotentR by applying it to angiogenesis and growth suppressor evasion pathways. BipotentR (http://bipotentr.dfci.harvard.edu/) provides a resource for evaluating patient response and discovering drug targets that act simultaneously through multiple mechanisms. SIGNIFICANCE BipotentR presents resources for evaluating patient response and identifying targets for drugs that can kill tumors through multiple mechanisms concurrently. Inhibition of the topmost candidate target killed tumors by suppressing energy metabolism and effects on two immune mechanisms. This article is highlighted in the In This Issue feature, p. 517.
Collapse
Affiliation(s)
- Avinash Sahu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
- Department of Obstetrics and Gynecology, University of Colorado School of Medicine, Aurora, Colorado
- Corresponding Authors: Keith T. Flaherty, Developmental Therapeutics, Massachusetts General Hospital Cancer Center, 55 Fruit Street, Boston, MA 02114. Phone: 617-724-4000; E-mail: ; David E. Fisher, Charlestown Navy Yard Building 149, 149 13th Street, Charlestown, MA 02129. Phone: 617-643-5428; E-mail: ; and Avinash Sahu, Department of Data Sciences, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115. Phone: 240-391-8125; E-mail:
| | - Xiaoman Wang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
- State Key Laboratory of Medical Molecular Biology, Department of Biochemistry and Molecular Biology, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Phillip Munson
- Department of Medicine and Harvard Medical School, Massachusetts General Hospital Cancer Center, Boston, Massachusetts
| | | | - Xiaoqing Wang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
- Department of Cardiology, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai, China
| | - Shengqing Stan Gu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Ya Han
- School of Life Sciences and Technology, Tongji University, Shanghai, China
| | - Gege Qian
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Phillip Nicol
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Zexian Zeng
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Chenfei Wang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Collin Tokheim
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Wubing Zhang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Jingxin Fu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Jin Wang
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - Nishanth Ulhas Nair
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| | | | | | - Bas Jansen
- Lead Pharma, Kloosterstraat, Oss, the Netherlands
| | | | - Jaap Lemmers
- Lead Pharma, Kloosterstraat, Oss, the Netherlands
| | - Mark Musters
- Lead Pharma, Kloosterstraat, Oss, the Netherlands
| | | | | | | | - Jun S. Liu
- Department of Statistics, Harvard University, Cambridge, Massachusetts
| | - Dejan Juric
- Department of Medicine and Harvard Medical School, Massachusetts General Hospital Cancer Center, Boston, Massachusetts
| | - Clifford A. Meyer
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | | | - X. Shirley Liu
- Department of Data Science, Dana-Farber Cancer Institute, Boston, Massachusetts
| | - David E. Fisher
- Department of Medicine and Harvard Medical School, Massachusetts General Hospital Cancer Center, Boston, Massachusetts
- Department of Dermatology, Massachusetts General Hospital, Boston, Massachusetts
- Corresponding Authors: Keith T. Flaherty, Developmental Therapeutics, Massachusetts General Hospital Cancer Center, 55 Fruit Street, Boston, MA 02114. Phone: 617-724-4000; E-mail: ; David E. Fisher, Charlestown Navy Yard Building 149, 149 13th Street, Charlestown, MA 02129. Phone: 617-643-5428; E-mail: ; and Avinash Sahu, Department of Data Sciences, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115. Phone: 240-391-8125; E-mail:
| | - Keith T. Flaherty
- Department of Medicine and Harvard Medical School, Massachusetts General Hospital Cancer Center, Boston, Massachusetts
- Corresponding Authors: Keith T. Flaherty, Developmental Therapeutics, Massachusetts General Hospital Cancer Center, 55 Fruit Street, Boston, MA 02114. Phone: 617-724-4000; E-mail: ; David E. Fisher, Charlestown Navy Yard Building 149, 149 13th Street, Charlestown, MA 02129. Phone: 617-643-5428; E-mail: ; and Avinash Sahu, Department of Data Sciences, Dana-Farber Cancer Institute, 44 Binney Street, Boston, MA 02115. Phone: 240-391-8125; E-mail:
| |
Collapse
|
32
|
Ma A, Wang X, Li J, Wang C, Xiao T, Liu Y, Cheng H, Wang J, Li Y, Chang Y, Li J, Wang D, Jiang Y, Su L, Xin G, Gu S, Li Z, Liu B, Xu D, Ma Q. Single-cell biological network inference using a heterogeneous graph transformer. Nat Commun 2023; 14:964. [PMID: 36810839 PMCID: PMC9944243 DOI: 10.1038/s41467-023-36559-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 02/06/2023] [Indexed: 02/23/2023] Open
Abstract
Single-cell multi-omics (scMulti-omics) allows the quantification of multiple modalities simultaneously to capture the intricacy of complex molecular mechanisms and cellular heterogeneity. Existing tools cannot effectively infer the active biological networks in diverse cell types and the response of these networks to external stimuli. Here we present DeepMAPS for biological network inference from scMulti-omics. It models scMulti-omics in a heterogeneous graph and learns relations among cells and genes within both local and global contexts in a robust manner using a multi-head graph transformer. Benchmarking results indicate DeepMAPS performs better than existing tools in cell clustering and biological network construction. It also showcases competitive capability in deriving cell-type-specific biological networks in lung tumor leukocyte CITE-seq data and matched diffuse small lymphocytic lymphoma scRNA-seq and scATAC-seq data. In addition, we deploy a DeepMAPS webserver equipped with multiple functionalities and visualizations to improve the usability and reproducibility of scMulti-omics data analysis.
Collapse
Affiliation(s)
- Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Xiaoying Wang
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Cankun Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Tong Xiao
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Yuntao Liu
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Hao Cheng
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Juexin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Yang Li
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Yuzhou Chang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Jinpu Li
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Duolin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Yuexu Jiang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Li Su
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Gang Xin
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Shaopeng Gu
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, China.
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA.
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA.
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA.
| |
Collapse
|
33
|
Mishra S, Pandey N, Chawla S, Sharma M, Chandra O, Jha IP, SenGupta D, Natarajan KN, Kumar V. Matching queried single-cell open-chromatin profiles to large pools of single-cell transcriptomes and epigenomes for reference supported analysis. Genome Res 2023; 33:218-231. [PMID: 36653120 PMCID: PMC10069468 DOI: 10.1101/gr.277015.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 01/09/2023] [Indexed: 01/19/2023]
Abstract
The true benefits of large single-cell transcriptome and epigenome data sets can be realized only with the development of new approaches and search tools for annotating individual cells. Matching a single-cell epigenome profile to a large pool of reference cells remains a major challenge. Here, we present scEpiSearch, which enables searching, comparison, and independent classification of single-cell open-chromatin profiles against a large reference of single-cell expression and open-chromatin data sets. Across performance benchmarks, scEpiSearch outperformed multiple methods in accuracy of search and low-dimensional coembedding of single-cell profiles, irrespective of platforms and species. Here we also demonstrate the unconventional utilities of scEpiSearch by applying it on single-cell epigenome profiles of K562 cells and samples from patients with acute leukaemia to reveal different aspects of their heterogeneity, multipotent behavior, and dedifferentiated states. Applying scEpiSearch on our single-cell open-chromatin profiles from embryonic stem cells (ESCs), we identified ESC subpopulations with more activity and poising for endoplasmic reticulum stress and unfolded protein response. Thus, scEpiSearch solves the nontrivial problem of amalgamating information from a large pool of single cells to identify and study the regulatory states of cells using their single-cell epigenomes.
Collapse
Affiliation(s)
- Shreya Mishra
- Department for Computational Biology, IIIT Delhi 110020, India
| | - Neetesh Pandey
- Department for Computational Biology, IIIT Delhi 110020, India
| | - Smriti Chawla
- Department for Computational Biology, IIIT Delhi 110020, India
| | - Madhu Sharma
- Department for Computational Biology, IIIT Delhi 110020, India
| | - Omkar Chandra
- Department for Computational Biology, IIIT Delhi 110020, India
| | | | - Debarka SenGupta
- Department for Computational Biology, IIIT Delhi 110020, India.,Institute of Health and Biomedical Innovation, Queensland University of Technology, Brisbane 4001, Australia
| | - Kedar Nath Natarajan
- DTU Bioengineering, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark
| | - Vibhor Kumar
- Department for Computational Biology, IIIT Delhi 110020, India;
| |
Collapse
|
34
|
Guan PY, Lee JS, Wang L, Lin KZ, Mei W, Chen L, Jiang Y. Destin2: Integrative and cross-modality analysis of single-cell chromatin accessibility data. Front Genet 2023; 14:1089936. [PMID: 36873935 PMCID: PMC9981783 DOI: 10.3389/fgene.2023.1089936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 02/06/2023] [Indexed: 02/19/2023] Open
Abstract
We propose Destin2, a novel statistical and computational method for cross-modality dimension reduction, clustering, and trajectory reconstruction for single-cell ATAC-seq data. The framework integrates cellular-level epigenomic profiles from peak accessibility, motif deviation score, and pseudo-gene activity and learns a shared manifold using the multimodal input, followed by clustering and/or trajectory inference. We apply Destin2 to real scATAC-seq datasets with both discretized cell types and transient cell states and carry out benchmarking studies against existing methods based on unimodal analyses. Using cell-type labels transferred with high confidence from unmatched single-cell RNA sequencing data, we adopt four performance assessment metrics and demonstrate how Destin2 corroborates and improves upon existing methods. Using single-cell RNA and ATAC multiomic data, we further exemplify how Destin2's cross-modality integrative analyses preserve true cell-cell similarities using the matched cell pairs as ground truths. Destin2 is compiled as a freely available R package available at https://github.com/yuchaojiang/Destin2.
Collapse
Affiliation(s)
- Peter Y Guan
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States
| | - Jin Seok Lee
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States
| | - Lihao Wang
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States
| | - Kevin Z Lin
- Department of Statistics, University of Pennsylvania, Philadelphia, PA, Unites States
| | - Wenwen Mei
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States
| | - Li Chen
- Department of Biostatistics, University of Florida, Gainesville, FL, Unites States
| | - Yuchao Jiang
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States.,Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, Unites States
| |
Collapse
|
35
|
Abstract
Single-cell studies are enabling our understanding of the molecular processes of normal cell development and the onset of several pathologies. For instance, single-cell RNA sequencing (scRNA-Seq) measures the transcriptome-wide gene expression at a single-cell resolution, allowing for studying the heterogeneity among the cells of the same population and revealing complex and rare cell populations. On the other hand, single-cell Assay for Transposase-Accessible Chromatin using sequencing (scATAC-Seq) can be used to define transcriptional and epigenetic changes by analyzing the chromatin accessibility at the single-cell level. However, the integration of multi-omics data still remains one of the most difficult tasks in bioinformatics. In this chapter, we focus on the combination of scRNA-Seq and scATACSeq data to perform an integrative analysis of the single-cell transcriptome and chromatin accessibility of human fetal progenitors.
Collapse
|
36
|
Cao K, Gong Q, Hong Y, Wan L. A unified computational framework for single-cell data integration with optimal transport. Nat Commun 2022; 13:7419. [PMID: 36456571 PMCID: PMC9715710 DOI: 10.1038/s41467-022-35094-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 11/18/2022] [Indexed: 12/05/2022] Open
Abstract
Single-cell data integration can provide a comprehensive molecular view of cells. However, how to integrate heterogeneous single-cell multi-omics as well as spatially resolved transcriptomic data remains a major challenge. Here we introduce uniPort, a unified single-cell data integration framework that combines a coupled variational autoencoder (coupled-VAE) and minibatch unbalanced optimal transport (Minibatch-UOT). It leverages both highly variable common and dataset-specific genes for integration to handle the heterogeneity across datasets, and it is scalable to large-scale datasets. uniPort jointly embeds heterogeneous single-cell multi-omics datasets into a shared latent space. It can further construct a reference atlas for gene imputation across datasets. Meanwhile, uniPort provides a flexible label transfer framework to deconvolute heterogeneous spatial transcriptomic data using an optimal transport plan, instead of embedding latent space. We demonstrate the capability of uniPort by applying it to integrate a variety of datasets, including single-cell transcriptomics, chromatin accessibility, and spatially resolved transcriptomic data.
Collapse
Affiliation(s)
- Kai Cao
- grid.484479.2LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China ,grid.410726.60000 0004 1797 8419School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Qiyu Gong
- grid.16821.3c0000 0004 0368 8293Shanghai Institute of Immunology, Faculty of Basic Medicine, Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yiguang Hong
- grid.24516.340000000123704535Department of Control Science and Engineering, Tongji University, Shanghai, China
| | - Lin Wan
- grid.484479.2LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China ,grid.410726.60000 0004 1797 8419School of Mathematical Sciences, University of Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
37
|
Jafari E, Johnson T, Wang Y, Liu Y, Huang K, Wang Y. AIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency. Bioinformatics 2022; 38:5236-5244. [PMID: 36250795 PMCID: PMC9710555 DOI: 10.1093/bioinformatics/btac683] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 10/07/2022] [Accepted: 10/14/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION The integrative analysis of single-cell gene expression and chromatin accessibility measurements is essential for revealing gene regulation, but it is one of the key challenges in computational biology. Gene expression and chromatin accessibility are measurements from different modalities, and no common features can be directly used to guide integration. Current state-of-the-art methods lack practical solutions for finding heterogeneous clusters. However, previous methods might not generate reliable results when cluster heterogeneity exists. More importantly, current methods lack an effective way to select hyper-parameters under an unsupervised setting. Therefore, applying computational methods to integrate single-cell gene expression and chromatin accessibility measurements remains difficult. RESULTS We introduce AIscEA-Alignment-based Integration of single-cell gene Expression and chromatin Accessibility-a computational method that integrates single-cell gene expression and chromatin accessibility measurements using their biological consistency. AIscEA first defines a ranked similarity score to quantify the biological consistency between cell clusters across measurements. AIscEA then uses the ranked similarity score and a novel permutation test to identify cluster alignment across measurements. AIscEA further utilizes graph alignment for the aligned cell clusters to align the cells across measurements. We compared AIscEA with the competing methods on several benchmark datasets and demonstrated that AIscEA is highly robust to the choice of hyper-parameters and can better handle the cluster heterogeneity problem. Furthermore, AIscEA significantly outperforms the state-of-the-art methods when integrating real-world SNARE-seq and scMultiome-seq datasets in terms of integration accuracy. AVAILABILITY AND IMPLEMENTATION AIscEA is available at https://figshare.com/articles/software/AIscEA_zip/21291135 on FigShare as well as {https://github.com/elhaam/AIscEA} onGitHub. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Elham Jafari
- Computer Science Department, Indiana University, Bloomington, IN 47408, USA
| | - Travis Johnson
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Yue Wang
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Yunlong Liu
- Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Kun Huang
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | - Yijie Wang
- Computer Science Department, Indiana University, Bloomington, IN 47408, USA
| |
Collapse
|
38
|
Han Y, Wang Y, Dong X, Sun D, Liu Z, Yue J, Wang H, Li T, Wang C. TISCH2: expanded datasets and new tools for single-cell transcriptome analyses of the tumor microenvironment. Nucleic Acids Res 2022; 51:D1425-D1431. [PMID: 36321662 PMCID: PMC9825603 DOI: 10.1093/nar/gkac959] [Citation(s) in RCA: 67] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 10/04/2022] [Accepted: 10/11/2022] [Indexed: 11/07/2022] Open
Abstract
The Tumor Immune Single Cell Hub 2 (TISCH2) is a resource of single-cell RNA-seq (scRNA-seq) data from human and mouse tumors, which enables comprehensive characterization of gene expression in the tumor microenvironment (TME) across multiple cancer types. As an increasing number of datasets are generated in the public domain, in this update, TISCH2 has included 190 tumor scRNA-seq datasets covering 6 million cells in 50 cancer types, with 110 newly collected datasets and almost tripling the number of cells compared with the previous release. Furthermore, TISCH2 includes several new functions that allow users to better utilize the large-scale scRNA-seq datasets. First, in the Dataset module, TISCH2 provides the cell-cell communication results in each dataset, facilitating the analyses of interacted cell types and the discovery of significant ligand-receptor pairs between cell types. TISCH2 also includes the transcription factor analyses for each dataset and visualization of the top enriched transcription factors of each cell type. Second, in the Gene module, TISCH2 adds functions for identifying correlated genes and providing survival information for the input genes. In summary, TISCH2 is a user-friendly, up-to-date and well-maintained data resource for gene expression analyses in the TME. TISCH2 is freely available at http://tisch.comp-genomics.org/.
Collapse
Affiliation(s)
| | | | | | - Dongqing Sun
- Shanghai Putuo District People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Zhaoyang Liu
- Shanghai Putuo District People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Jiali Yue
- Shanghai Putuo District People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Haiyun Wang
- Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Taiwen Li
- Correspondence may also be addressed to Taiwen Li. Tel: +86 28 85501484; Fax: +86 28 85501484;
| | - Chenfei Wang
- To whom correspondence should be addressed. Tel: +86 21 65981195; Fax: +86 21 65981195;
| |
Collapse
|
39
|
Shi X, Yu Z, Ren P, Dong X, Ding X, Song J, Zhang J, Li T, Wang C. HUSCH: an integrated single-cell transcriptome atlas for human tissue gene expression visualization and analyses. Nucleic Acids Res 2022; 51:D1029-D1037. [PMID: 36318258 PMCID: PMC9825509 DOI: 10.1093/nar/gkac1001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 10/17/2022] [Accepted: 10/19/2022] [Indexed: 11/07/2022] Open
Abstract
Understanding gene expression patterns across different human cell types is crucial for investigating mechanisms of cell type differentiation, disease occurrence and progression. The recent development of single-cell RNA-seq (scRNA-seq) technologies significantly boosted the characterization of cell type heterogeneities in different human tissues. However, the huge number of datasets in the public domain also posed challenges in data integration and reuse. We present Human Universal Single Cell Hub (HUSCH, http://husch.comp-genomics.org), an atlas-scale curated database that integrates single-cell transcriptomic profiles of nearly 3 million cells from 185 high-quality human scRNA-seq datasets from 45 different tissues. All the data in HUSCH were uniformly processed and annotated with a standard workflow. In the single dataset module, HUSCH provides interactive gene expression visualization, differentially expressed genes, functional analyses, transcription regulators and cell-cell interaction analyses for each cell type cluster. Besides, HUSCH integrated different datasets in the single tissue module and performs data integration, batch correction, and cell type harmonization. This allows a comprehensive visualization and analysis of gene expression within each tissue based on single-cell datasets from multiple sources and platforms. HUSCH is a flexible and comprehensive data portal that enables searching, visualizing, analyzing, and downloading single-cell gene expression for the human tissue atlas.
Collapse
Affiliation(s)
| | | | - Pengfei Ren
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Tongji, 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Xin Dong
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Tongji, 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Xuanxin Ding
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Tongji, 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Jiaming Song
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, College of Life Science and Technology, Guangxi University, Guangxi 530004, China
| | - Jing Zhang
- Research Center for Translational Medicine, Shanghai East Hospital, School of Life Science and Technology, Tongji University, Shanghai, China
| | - Taiwen Li
- Correspondence may also be addressed to Taiwen Li. Tel: +86 28 85501484; Fax: +86 28 85501484;
| | - Chenfei Wang
- To whom correspondence should be addressed. Tel: +86 21 65981197; Fax: +86 21 65981197;
| |
Collapse
|
40
|
Xu C, Ma D, Ding Q, Zhou Y, Zheng H. PlantPhoneDB: A manually curated pan-plant database of ligand-receptor pairs infers cell-cell communication. PLANT BIOTECHNOLOGY JOURNAL 2022; 20:2123-2134. [PMID: 35842742 PMCID: PMC9616517 DOI: 10.1111/pbi.13893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Revised: 07/10/2022] [Accepted: 07/13/2022] [Indexed: 06/15/2023]
Abstract
Ligand-receptor pairs play important roles in cell-cell communication for multicellular organisms in response to environmental cues. Recently, the emergence of single-cell RNA-sequencing (scRNA-seq) provides unprecedented opportunities to investigate cellular communication based on ligand-receptor expression. However, so far, no reliable ligand-receptor interaction database is available for plant species. In this study, we developed PlantPhoneDB (https://jasonxu.shinyapps.io/PlantPhoneDB/), a pan-plant database comprising a large number of high-confidence ligand-receptor pairs manually curated from seven resources. Also, we developed a PlantPhoneDB R package, which not only provided optional four scoring approaches that calculate interaction scores of ligand-receptor pairs between cell types but also provided visualization functions to present analysis results. At the PlantPhoneDB web interface, the processed datasets and results can be searched, browsed, and downloaded. To uncover novel cell-cell communication events in plants, we applied the PlantPhoneDB R package on GSE121619 dataset to infer significant cell-cell interactions of heat-shocked root cells in Arabidopsis thaliana. As a result, the PlantPhoneDB predicted the actively communicating AT1G28290-AT2G14890 ligand-receptor pair in atrichoblast-cortex cell pair in Arabidopsis thaliana. Importantly, the downstream target genes of this ligand-receptor pair were significantly enriched in the ribosome pathway, which facilitated plants adapting to environmental changes. In conclusion, PlantPhoneDB provided researchers with integrated resources to infer cell-cell communication from scRNA-seq datasets.
Collapse
Affiliation(s)
- Chaoqun Xu
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and EcologyXiamen UniversityXiamenChina
| | - Dongna Ma
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and EcologyXiamen UniversityXiamenChina
| | - Qiansu Ding
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and EcologyXiamen UniversityXiamenChina
| | - Ying Zhou
- National Institute for Data Science in Health and Medicine, School of MedicineXiamen UniversityXiamenChina
| | - Hai‐Lei Zheng
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and EcologyXiamen UniversityXiamenChina
| |
Collapse
|
41
|
Liu Z, Sun D, Wang C. Evaluation of cell-cell interaction methods by integrating single-cell RNA sequencing data with spatial information. Genome Biol 2022; 23:218. [PMID: 36253792 PMCID: PMC9575221 DOI: 10.1186/s13059-022-02783-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 10/04/2022] [Indexed: 11/17/2022] Open
Abstract
Background Cell-cell interactions are important for information exchange between different cells, which are the fundamental basis of many biological processes. Recent advances in single-cell RNA sequencing (scRNA-seq) enable the characterization of cell-cell interactions using computational methods. However, it is hard to evaluate these methods since no ground truth is provided. Spatial transcriptomics (ST) data profiles the relative position of different cells. We propose that the spatial distance suggests the interaction tendency of different cell types, thus could be used for evaluating cell-cell interaction tools. Results We benchmark 16 cell-cell interaction methods by integrating scRNA-seq with ST data. We characterize cell-cell interactions into short-range and long-range interactions using spatial distance distributions between ligands and receptors. Based on this classification, we define the distance enrichment score and apply an evaluation workflow to 16 cell-cell interaction tools using 15 simulated and 5 real scRNA-seq and ST datasets. We also compare the consistency of the results from single tools with the commonly identified interactions. Our results suggest that the interactions predicted by different tools are highly dynamic, and the statistical-based methods show overall better performance than network-based methods and ST-based methods. Conclusions Our study presents a comprehensive evaluation of cell-cell interaction tools for scRNA-seq. CellChat, CellPhoneDB, NicheNet, and ICELLNET show overall better performance than other tools in terms of consistency with spatial tendency and software scalability. We recommend using results from at least two methods to ensure the accuracy of identified interactions. We have packaged the benchmark workflow with detailed documentation at GitHub (https://github.com/wanglabtongji/CCI). Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02783-y.
Collapse
Affiliation(s)
- Zhaoyang Liu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai, 200092, China.,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Dongqing Sun
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai, 200092, China.,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
| | - Chenfei Wang
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Science and Technology, Tongji University, Shanghai, 200092, China. .,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
| |
Collapse
|
42
|
Stanojevic S, Li Y, Ristivojevic A, Garmire LX. Computational Methods for Single-cell Multi-omics Integration and Alignment. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:836-849. [PMID: 36581065 PMCID: PMC10025765 DOI: 10.1016/j.gpb.2022.11.013] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Revised: 08/09/2022] [Accepted: 11/04/2022] [Indexed: 12/27/2022]
Abstract
Recently developed technologies to generate single-cell genomic data have made a revolutionary impact in the field of biology. Multi-omics assays offer even greater opportunities to understand cellular states and biological processes. The problem of integrating different omics data with very different dimensionality and statistical properties remains, however, quite challenging. A growing body of computational tools is being developed for this task, leveraging ideas ranging from machine translation to the theory of networks, and represents another frontier on the interface of biology and data science. Our goal in this review is to provide a comprehensive, up-to-date survey of computational techniques for the integration of single-cell multi-omics data, while making the concepts behind each algorithm approachable to a non-expert audience.
Collapse
Affiliation(s)
- Stefan Stanojevic
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yijun Li
- Department of Biostatistics, University of Michigan, Ann Arbor, MI 48109, USA
| | | | - Lana X Garmire
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
43
|
Hu K, Liu H, Lawson ND, Zhu LJ. scATACpipe: A nextflow pipeline for comprehensive and reproducible analyses of single cell ATAC-seq data. Front Cell Dev Biol 2022; 10:981859. [PMID: 36238687 PMCID: PMC9551270 DOI: 10.3389/fcell.2022.981859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 09/12/2022] [Indexed: 11/13/2022] Open
Abstract
Single cell ATAC-seq (scATAC-seq) has become the most widely used method for profiling open chromatin landscape of heterogeneous cell populations at a single-cell resolution. Although numerous software tools and pipelines have been developed, an easy-to-use, scalable, reproducible, and comprehensive pipeline for scATAC-seq data analyses is still lacking. To fill this gap, we developed scATACpipe, a Nextflow pipeline, for performing comprehensive analyses of scATAC-seq data including extensive quality assessment, preprocessing, dimension reduction, clustering, peak calling, differential accessibility inference, integration with scRNA-seq data, transcription factor activity and footprinting analysis, co-accessibility inference, and cell trajectory prediction. scATACpipe enables users to perform the end-to-end analysis of scATAC-seq data with three sub-workflow options for preprocessing that leverage 10x Genomics Cell Ranger ATAC software, the ultra-fast Chromap procedures, and a set of custom scripts implementing current best practices for scATAC-seq data preprocessing. The pipeline extends the R package ArchR for downstream analysis with added support to any eukaryotic species with an annotated reference genome. Importantly, scATACpipe generates an all-in-one HTML report for the entire analysis and outputs cluster-specific BAM, BED, and BigWig files for visualization in a genome browser. scATACpipe eliminates the need for users to chain different tools together and facilitates reproducible and comprehensive analyses of scATAC-seq data from raw reads to various biological insights with minimal changes of configuration settings for different computing environments or species. By applying it to public datasets, we illustrated the utility, flexibility, versatility, and reliability of our pipeline, and demonstrated that our scATACpipe outperforms other workflows.
Collapse
Affiliation(s)
- Kai Hu
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, United States
| | - Haibo Liu
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, United States
| | - Nathan D. Lawson
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, United States
| | - Lihua Julie Zhu
- Department of Molecular, Cell and Cancer Biology, University of Massachusetts Chan Medical School, Worcester, MA, United States
- Program in Molecular Medicine, Program in Bioinformatics and Integrative Biology, University of Massachusetts Chan Medical School, Worcester, MA, United States
| |
Collapse
|
44
|
Dong X, Tang K, Xu Y, Wei H, Han T, Wang C. Single-cell gene regulation network inference by large-scale data integration. Nucleic Acids Res 2022; 50:e126. [PMID: 36155797 PMCID: PMC9756951 DOI: 10.1093/nar/gkac819] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Revised: 08/11/2022] [Accepted: 09/14/2022] [Indexed: 12/24/2022] Open
Abstract
Single-cell ATAC-seq (scATAC-seq) has proven to be a state-of-art approach to investigating gene regulation at the single-cell level. However, existing methods cannot precisely uncover cell-type-specific binding of transcription regulators (TRs) and construct gene regulation networks (GRNs) in single-cell. ChIP-seq has been widely used to profile TR binding sites in the past decades. Here, we developed SCRIP, an integrative method to infer single-cell TR activity and targets based on the integration of scATAC-seq and a large-scale TR ChIP-seq reference. Our method showed improved performance in evaluating TR binding activity compared to the existing motif-based methods and reached a higher consistency with matched TR expressions. Besides, our method enables identifying TR target genes as well as building GRNs at the single-cell resolution based on a regulatory potential model. We demonstrate SCRIP's utility in accurate cell-type clustering, lineage tracing, and inferring cell-type-specific GRNs in multiple biological systems. SCRIP is freely available at https://github.com/wanglabtongji/SCRIP.
Collapse
Affiliation(s)
| | | | - Yunfan Xu
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Hailin Wei
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Tong Han
- Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration of Ministry of Education, Department of Orthopedics, Tongji Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China,Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Chenfei Wang
- To whom correspondence should be addressed. Tel: +86 21 65981195; Fax: +86 21 65981195;
| |
Collapse
|
45
|
Shen Y, Nussbaum YI, Manjunath Y, Hummel JJ, Ciorba MA, Warren WC, Kaifi JT, Papageorgiou C, Cortese R, Shyu CR, Mitchem JB. TBX21 Methylation as a Potential Regulator of Immune Suppression in CMS1 Subtype Colorectal Cancer. Cancers (Basel) 2022; 14:cancers14194594. [PMID: 36230517 PMCID: PMC9558549 DOI: 10.3390/cancers14194594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 09/08/2022] [Accepted: 09/12/2022] [Indexed: 11/16/2022] Open
Abstract
Cytotoxic T lymphocyte (CTL) infiltration is associated with survival, recurrence, and therapeutic response in colorectal cancer (CRC). Immune checkpoint inhibitor (ICI) therapy, which requires CTLs for response, does not work for most CRC patients. Therefore, it is critical to improve our understanding of immune resistance in this disease. We utilized 2391 CRC patients and 7 omics datasets, integrating clinical and genomic data to determine how DNA methylation may impact survival and CTL function in CRC. Using comprehensive molecular subtype (CMS) 1 patients as reference, we found TBX21 to be the only gene with altered expression and methylation that was associated with CTL infiltration. We found that CMS1 patients with high TBX21 expression and low methylation had a significant survival advantage. To confirm the role of Tbx21 in CTL function, we utilized scRNAseq data, demonstrating the association of TBX21 with markers of enhanced CTL function. Further analysis using pathway enrichment found that the genes TBX21, MX1, and SP140 had altered expression and methylation, suggesting that the TP53/P53 pathway may modify TBX21 methylation to upregulate TBX21 expression. Together, this suggests that targeting epigenetic modification more specifically for therapy and patient stratification may provide improved outcomes in CRC.
Collapse
Affiliation(s)
- Yuanyuan Shen
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Yulia I. Nussbaum
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Yariswamy Manjunath
- Harry S. Truman Memorial Veterans’ Hospital, University of Missouri, Columbia, MO 65211, USA
| | - Justin J. Hummel
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
| | - Matthew A. Ciorba
- School of Medicine, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Wesley C. Warren
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
- Department of Animal Sciences, University of Missouri, Columbia, MO 65211, USA
| | - Jussuf T. Kaifi
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
- Harry S. Truman Memorial Veterans’ Hospital, University of Missouri, Columbia, MO 65211, USA
- Department of Surgery, University of Missouri, Columbia, MO 65211, USA
- School of Medicine, University of Missouri, Columbia, MO 65211, USA
| | - Christos Papageorgiou
- School of Medicine, University of Missouri, Columbia, MO 65211, USA
- Ellis Fischel Cancer Center, University of Missouri, Columbia, MO 65211, USA
| | - Rene Cortese
- School of Medicine, University of Missouri, Columbia, MO 65211, USA
| | - Chi-Ren Shyu
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
- College of Engineering, University of Missouri, Columbia, MO 65211, USA
| | - Jonathan B. Mitchem
- Institute for Data Science & Informatics, University of Missouri, Columbia, MO 65211, USA
- Harry S. Truman Memorial Veterans’ Hospital, University of Missouri, Columbia, MO 65211, USA
- Department of Surgery, University of Missouri, Columbia, MO 65211, USA
- School of Medicine, University of Missouri, Columbia, MO 65211, USA
- Correspondence:
| |
Collapse
|
46
|
Jiang Y, Harigaya Y, Zhang Z, Zhang H, Zang C, Zhang NR. Nonparametric single-cell multiomic characterization of trio relationships between transcription factors, target genes, and cis-regulatory regions. Cell Syst 2022; 13:737-751.e4. [PMID: 36055233 PMCID: PMC9509445 DOI: 10.1016/j.cels.2022.08.004] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 06/23/2022] [Accepted: 08/11/2022] [Indexed: 01/26/2023]
Abstract
The epigenetic control of gene expression is highly cell-type and context specific. Yet, despite its complexity, gene regulatory logic can be broken down into modular components consisting of a transcription factor (TF) activating or repressing the target gene expression through its binding to a cis-regulatory region. We propose a nonparametric approach, TRIPOD, to detect and characterize the three-way relationships between a TF, its target gene, and the accessibility of the TF's binding site using single-cell RNA and ATAC multiomic data. We apply TRIPOD to interrogate the cell-type-specific regulatory logic in peripheral blood mononuclear cells and contrast our results to detections from enhancer databases, cis-eQTL studies, ChIP-seq experiments, and TF knockdown/knockout studies. We then apply TRIPOD to mouse embryonic brain data and identify regulatory relationships, validated by ChIP-seq and PLAC-seq. Finally, we demonstrate TRIPOD on the SHARE-seq data of differentiating mouse hair follicle cells and identify lineage-specific regulation supported by histone marks and super-enhancer annotations. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Yuchao Jiang
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA; Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC 27599, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC 27599, USA.
| | - Yuriko Harigaya
- Curriculum in Bioinformatics and Computational Biology, School of Medicine, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Zhaojun Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Hongpan Zhang
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA; Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA
| | - Chongzhi Zang
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA 22908, USA; Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908, USA; Department of Public Health Sciences, University of Virginia, Charlottesville, VA 22908, USA
| | - Nancy R Zhang
- Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, USA.
| |
Collapse
|
47
|
Kartha VK, Duarte FM, Hu Y, Ma S, Chew JG, Lareau CA, Earl A, Burkett ZD, Kohlway AS, Lebofsky R, Buenrostro JD. Functional inference of gene regulation using single-cell multi-omics. CELL GENOMICS 2022; 2. [PMID: 36204155 PMCID: PMC9534481 DOI: 10.1016/j.xgen.2022.100166] [Citation(s) in RCA: 53] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Cells require coordinated control over gene expression when responding to environmental stimuli. Here we apply scATAC-seq and single-cell RNA sequencing (scRNA-seq) in resting and stimulated human blood cells. Collectively, we generate ~91,000 single-cell profiles, allowing us to probe the cis-regulatory landscape of the immunological response across cell types, stimuli, and time. Advancing tools to integrate multi-omics data, we develop functional inference of gene regulation (FigR), a framework to computationally pair scA-TAC-seq with scRNA-seq cells, connect distal cis-regulatory elements to genes, and infer gene-regulatory networks (GRNs) to identify candidate transcription factor (TF) regulators. Utilizing these paired multi-omics data, we define domains of regulatory chromatin (DORCs) of immune stimulation and find that cells alter chromatin accessibility and gene expression at timescales of minutes. Construction of the stimulation GRN elucidates TF activity at disease-associated DORCs. Overall, FigR enables elucidation of regulatory interactions across single-cell data, providing new opportunities to understand the function of cells within tissues. Single-cell methods for measuring chromatin accessibility (ATAC-seq) and gene expression (RNA-seq) are rapidly evolving, but tools to integrate data and infer gene-regulatory relationships remain limited. Here we generate multi-omics data of resting and stimulated human blood cells and present a new computational framework for constructing gene-regulatory networks (GRNs). Specifically, we describe functional inference of gene regulation (FigR), a workflow to (1) pair scATAC-seq with scRNA-seq, (2) connect cis-regulatory elements to target genes, and (3) identify TF-gene relationships.
Collapse
Affiliation(s)
- Vinay K. Kartha
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Fabiana M. Duarte
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Yan Hu
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Sai Ma
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Caleb A. Lareau
- Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Andrew Earl
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | | | | | - Jason D. Buenrostro
- Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
- Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Corresponding author
| |
Collapse
|
48
|
Feasibility and Application of Cluster Nursing to the Care of Patients with Acute Oncology. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2022; 2022:8973449. [PMID: 35958913 PMCID: PMC9357692 DOI: 10.1155/2022/8973449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Revised: 07/04/2022] [Accepted: 07/05/2022] [Indexed: 11/17/2022]
Abstract
Objective To probe the utility of cluster nursing for the care of acute oncology clients. Methods One hundred fourteen cases of acute oncology pioneers undergoing therapy in our clinic from April 2019 to February 2021 were randomly assigned into two consecutive arms, conventional care and cluster care, in accordance with the nursing modality. Complications, satisfaction, quality of survival, and negative emotions were compared across the two parties. Results The comorbidity incidence rate of the subject matter in the research cohort was 7.02%, which was below the comorbidity rate of 17.54% in the reaction cohort (P < 0.05); the percentage of satisfaction in the research cohort was 96.49%, which was higher than the satisfaction rate of 78.95% in the reaction cohort (P < 0.05); aftercare, the quality of survival was significantly higher in both groups, and the SAS and SDS scores were significantly lower, with a more pronounced trend of change in the research cohort than in the reaction cohort (P < 0.05). Conclusion Bundled care for casualty oncology is of major value, with a marked reduction in the incidence of postoperative complications, high quality of survival, an excellent prognosis and negative mood, high patient morale and satisfaction and compliance with curative treatment, and good support for subsequent care.
Collapse
|
49
|
Yuan Q, Duren Z. Integration of single-cell multi-omics data by regression analysis on unpaired observations. Genome Biol 2022; 23:160. [PMID: 35854350 PMCID: PMC9295346 DOI: 10.1186/s13059-022-02726-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 07/06/2022] [Indexed: 12/04/2022] Open
Abstract
Despite recent developments, it is hard to profile all multi-omics single-cell data modalities on the same cell. Thus, huge amounts of single-cell genomics data of unpaired observations on different cells are generated. We propose a method named UnpairReg for the regression analysis on unpaired observations to integrate single-cell multi-omics data. On real and simulated data, UnpairReg provides an accurate estimation of cell gene expression where only chromatin accessibility data is available. The cis-regulatory network inferred from UnpairReg is highly consistent with eQTL mapping. UnpairReg improves cell type identification accuracy by joint analysis of single-cell gene expression and chromatin accessibility data.
Collapse
Affiliation(s)
- Qiuyue Yuan
- Center for Human Genetics and Department of Genetics and Biochemistry, Clemson University, Greenwood, SC, 29646, USA
| | - Zhana Duren
- Center for Human Genetics and Department of Genetics and Biochemistry, Clemson University, Greenwood, SC, 29646, USA.
| |
Collapse
|
50
|
Casado-Pelaez M, Bueno-Costa A, Esteller M. Single cell cancer epigenetics. Trends Cancer 2022; 8:820-838. [PMID: 35821003 DOI: 10.1016/j.trecan.2022.06.005] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 05/02/2022] [Accepted: 06/08/2022] [Indexed: 10/17/2022]
Abstract
Bulk sequencing methodologies have allowed us to make great progress in cancer research. Unfortunately, these techniques lack the resolution to fully unravel the epigenetic mechanisms that govern tumor heterogeneity. Consequently, many novel single cell-sequencing methodologies have been developed over the past decade, allowing us to explore the epigenetic components that regulate different aspects of cancer heterogeneity, namely: clonal heterogeneity, tumor microenvironment (TME), spatial organization, intratumoral differentiation programs, metastasis, and resistance mechanisms. In this review, we explore the different sequencing techniques that enable researchers to study different aspects of epigenetics (DNA methylation, chromatin accessibility, histone modifications, DNA-protein interactions, and chromatin 3D architecture) at the single cell level, their potential applications in cancer, and their current technical limitations.
Collapse
Affiliation(s)
- Marta Casado-Pelaez
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Barcelona, Catalonia, Spain
| | - Alberto Bueno-Costa
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Barcelona, Catalonia, Spain
| | - Manel Esteller
- Josep Carreras Leukaemia Research Institute (IJC), Badalona, Barcelona, Catalonia, Spain; Centro de Investigacion Biomedica en Red Cancer (CIBERONC), 28029 Madrid, Spain; Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia, Spain; Physiological Sciences Department, School of Medicine and Health Sciences, University of Barcelona (UB), Barcelona, Catalonia, Spain.
| |
Collapse
|