1
|
Perez JM, Duda JM, Ryu J, Shetty M, Mehta S, Jagtap PD, Nelson AC, Winterhoff B, Griffin TJ, Starr TK, Thomas SN. Investigating proteogenomic divergence in patient-derived xenograft models of ovarian cancer. Sci Rep 2025; 15:813. [PMID: 39755759 DOI: 10.1038/s41598-024-84874-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Accepted: 12/27/2024] [Indexed: 01/06/2025] Open
Abstract
Within ovarian cancer research, patient-derived xenograft (PDX) models recapitulate histologic features and genomic aberrations found in original tumors. However, conflicting data from published studies have demonstrated significant transcriptional differences between PDXs and original tumors, challenging the fidelity of these models. We employed a quantitative mass spectrometry-based proteomic approach coupled with generation of patient-specific databases using RNA-seq data to investigate the proteogenomic landscape of serially-passaged PDX models established from two patients with distinct subtypes of ovarian cancer. We demonstrate that the utilization of patient-specific databases guided by transcriptional profiles increases the depth of human protein identification in PDX models. Our data show that human proteomes of serially passaged PDXs differ significantly from their patient-derived tumor of origin. Analysis of differentially abundant proteins revealed enrichment of distinct biological pathways with major downregulated processes including extracellular matrix organization and the immune system. Finally, we investigated the relative abundances of ovarian cancer-related proteins identified from the Cancer Gene Census across serially passaged PDXs, and found their protein levels to be unstable across PDX models. Our findings highlight features of distinct and dynamic proteomes of serially-passaged PDX models of ovarian cancer.
Collapse
Affiliation(s)
- Jesenia M Perez
- Microbiology, Immunology, and Cancer Biology Graduate Program, University of Minnesota School of Medicine, Minneapolis, MN, 55455, USA
| | - Jolene M Duda
- Biochemistry, Molecular Biology and Biophysics, University of Minnesota School of Medicine, Minneapolis, MN, 55455, USA
| | - Joohyun Ryu
- Department of Laboratory Medicine and Pathology, University of Minnesota School of Medicine, 420 Delaware St SE, MMC 609, Minneapolis, MN, 55455, USA
| | - Mihir Shetty
- Masonic Cancer Center and Department of Obstetrics, Gynecology and Women's Health, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Subina Mehta
- Biochemistry, Molecular Biology and Biophysics, University of Minnesota School of Medicine, Minneapolis, MN, 55455, USA
| | - Pratik D Jagtap
- Biochemistry, Molecular Biology and Biophysics, University of Minnesota School of Medicine, Minneapolis, MN, 55455, USA
| | - Andrew C Nelson
- Department of Laboratory Medicine and Pathology, University of Minnesota School of Medicine, 420 Delaware St SE, MMC 609, Minneapolis, MN, 55455, USA
| | - Boris Winterhoff
- Masonic Cancer Center and Department of Obstetrics, Gynecology and Women's Health, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Timothy J Griffin
- Biochemistry, Molecular Biology and Biophysics, University of Minnesota School of Medicine, Minneapolis, MN, 55455, USA
| | - Timothy K Starr
- Masonic Cancer Center and Department of Obstetrics, Gynecology and Women's Health, University of Minnesota, Minneapolis, MN, 55455, USA
| | - Stefani N Thomas
- Department of Laboratory Medicine and Pathology, University of Minnesota School of Medicine, 420 Delaware St SE, MMC 609, Minneapolis, MN, 55455, USA.
| |
Collapse
|
2
|
Paas-Oliveros E, Hernández-Lemus E, de Anda-Jáuregui G. Computational single cell oncology: state of the art. Front Genet 2023; 14:1256991. [PMID: 38028624 PMCID: PMC10663273 DOI: 10.3389/fgene.2023.1256991] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/24/2023] [Indexed: 12/01/2023] Open
Abstract
Single cell computational analysis has emerged as a powerful tool in the field of oncology, enabling researchers to decipher the complex cellular heterogeneity that characterizes cancer. By leveraging computational algorithms and bioinformatics approaches, this methodology provides insights into the underlying genetic, epigenetic and transcriptomic variations among individual cancer cells. In this paper, we present a comprehensive overview of single cell computational analysis in oncology, discussing the key computational techniques employed for data processing, analysis, and interpretation. We explore the challenges associated with single cell data, including data quality control, normalization, dimensionality reduction, clustering, and trajectory inference. Furthermore, we highlight the applications of single cell computational analysis, including the identification of novel cell states, the characterization of tumor subtypes, the discovery of biomarkers, and the prediction of therapy response. Finally, we address the future directions and potential advancements in the field, including the development of machine learning and deep learning approaches for single cell analysis. Overall, this paper aims to provide a roadmap for researchers interested in leveraging computational methods to unlock the full potential of single cell analysis in understanding cancer biology with the goal of advancing precision oncology. For this purpose, we also include a notebook that instructs on how to apply the recommended tools in the Preprocessing and Quality Control section.
Collapse
Affiliation(s)
- Ernesto Paas-Oliveros
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
- Center for Complexity Sciences, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Guillermo de Anda-Jáuregui
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
- Center for Complexity Sciences, Universidad Nacional Autónoma de México, Mexico City, Mexico
- Investigadores por Mexico, Conahcyt, Mexico City, Mexico
| |
Collapse
|
3
|
Giner-Calabuig M, De Leon S, Wang J, Fehlmann TD, Ukaegbu C, Gibson J, Alustiza-Fernandez M, Pico MD, Alenda C, Herraiz M, Carrillo-Palau M, Salces I, Reyes J, Ortega SP, Obrador-Hevia A, Cecchini M, Syngal S, Stoffel E, Ellis NA, Sweasy J, Jover R, Llor X, Xicola RM. Mutational signature profiling classifies subtypes of clinically different mismatch-repair-deficient tumours with a differential immunogenic response potential. Br J Cancer 2022; 126:1595-1603. [PMID: 35197584 PMCID: PMC9130322 DOI: 10.1038/s41416-022-01754-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 01/24/2022] [Accepted: 02/10/2022] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Mismatch repair (MMR) deficiency is the hallmark of tumours from Lynch syndrome (LS), sporadic MLH1 hypermethylated and Lynch-like syndrome (LLS), but there is a lack of understanding of the variability in their mutational profiles based on clinical phenotypes. The aim of this study was to perform a molecular characterisation to identify novel features that can impact tumour behaviour and clinical management. METHODS We tested 105 MMR-deficient colorectal cancer tumours (25 LS, 35 LLS and 45 sporadic) for global exome microsatellite instability, cancer mutational signatures, mutational spectrum and neoepitope load. RESULTS Fifty-three percent of tumours showed high contribution of MMR-deficient mutational signatures, high level of global exome microsatellite instability, loss of MLH1/PMS2 protein expression and included sporadic tumours. Thirty-one percent of tumours showed weaker features of MMR deficiency, 62% lost MSH2/MSH6 expression and included 60% of LS and 44% of LLS tumours. Remarkably, 9% of all tumours lacked global exome microsatellite instability. Lastly, HLA-B07:02 could be triggering the neoantigen presentation in tumours that show the strongest contribution of MMR-deficient tumours. CONCLUSIONS Next-generation sequencing approaches allow for a granular molecular characterisation of MMR-deficient tumours, which can be essential to properly diagnose and treat patients with these tumours in the setting of personalised medicine.
Collapse
Affiliation(s)
- Mar Giner-Calabuig
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
- Servicio de Medicina Digestiva, Hospital General Universitario de Alicante, Instituto de Investigación Sanitaria ISABIAL, Alicante, Spain
| | - Seila De Leon
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| | - Julian Wang
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| | - Tara D Fehlmann
- Divisions of Cancer Genetics and Population Sciences, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Chinedu Ukaegbu
- Divisions of Cancer Genetics and Population Sciences, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Joanna Gibson
- Department of Pathology and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| | - Miren Alustiza-Fernandez
- Servicio de Medicina Digestiva, Hospital General Universitario de Alicante, Instituto de Investigación Sanitaria ISABIAL, Alicante, Spain
| | - Maria-Dolores Pico
- Servicio de Medicina Digestiva, Hospital General Universitario de Alicante, Instituto de Investigación Sanitaria ISABIAL, Alicante, Spain
| | - Cristina Alenda
- Servicio de Medicina Digestiva, Hospital General Universitario de Alicante, Instituto de Investigación Sanitaria ISABIAL, Alicante, Spain
| | - Maite Herraiz
- Departamento de Digestivo, Clínica Universidad de Navarra, Navarra, Spain
| | - Marta Carrillo-Palau
- Servicio de Medicina Digestiva, Hospital Universitario de Canarias, Tenerife, Spain
| | - Inmaculada Salces
- Servicio de Medicina Digestiva, Hospital 12 de Octubre, Madrid, Spain
| | - Josep Reyes
- Servei de Digestiu, Hospital Comarcal d'Inca, Mallorca, Spain
| | - Silvia P Ortega
- Servei de Digestiu, Hospital Comarcal d'Inca, Mallorca, Spain
| | | | - Michael Cecchini
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| | - Sapna Syngal
- Divisions of Cancer Genetics and Population Sciences, Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Elena Stoffel
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, and Rogel Cancer Center, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Nathan A Ellis
- Department of Cellular and Molecular Medicine, University of Arizona Cancer Center, University of Arizona, Tucson, AZ, USA
| | - Joann Sweasy
- Department of Therapeutic Radiobiology and Cancer Center, Yale University, New Haven, CT, USA
| | - Rodrigo Jover
- Servicio de Medicina Digestiva, Hospital General Universitario de Alicante, Instituto de Investigación Sanitaria ISABIAL, Alicante, Spain
| | - Xavier Llor
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA
| | - Rosa M Xicola
- Department of Medicine and Cancer Center, Yale University School of Medicine, New Haven, CT, USA.
| |
Collapse
|
4
|
Manders F, Brandsma AM, de Kanter J, Verheul M, Oka R, van Roosmalen MJ, van der Roest B, van Hoeck A, Cuppen E, van Boxtel R. MutationalPatterns: the one stop shop for the analysis of mutational processes. BMC Genomics 2022; 23:134. [PMID: 35168570 PMCID: PMC8845394 DOI: 10.1186/s12864-022-08357-3] [Citation(s) in RCA: 77] [Impact Index Per Article: 25.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 02/01/2022] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data. RESULTS Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights. CONCLUSIONS This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns .
Collapse
Affiliation(s)
- Freek Manders
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Arianne M Brandsma
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Jurrian de Kanter
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Mark Verheul
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Rurika Oka
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Markus J van Roosmalen
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Bastiaan van der Roest
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
- Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Arne van Hoeck
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
- Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Edwin Cuppen
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
- Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands
| | - Ruben van Boxtel
- Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
- Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.
| |
Collapse
|
5
|
Lei Y, Tang R, Xu J, Wang W, Zhang B, Liu J, Yu X, Shi S. Applications of single-cell sequencing in cancer research: progress and perspectives. J Hematol Oncol 2021; 14:91. [PMID: 34108022 PMCID: PMC8190846 DOI: 10.1186/s13045-021-01105-2] [Citation(s) in RCA: 262] [Impact Index Per Article: 65.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Accepted: 06/03/2021] [Indexed: 02/06/2023] Open
Abstract
Single-cell sequencing, including genomics, transcriptomics, epigenomics, proteomics and metabolomics sequencing, is a powerful tool to decipher the cellular and molecular landscape at a single-cell resolution, unlike bulk sequencing, which provides averaged data. The use of single-cell sequencing in cancer research has revolutionized our understanding of the biological characteristics and dynamics within cancer lesions. In this review, we summarize emerging single-cell sequencing technologies and recent cancer research progress obtained by single-cell sequencing, including information related to the landscapes of malignant cells and immune cells, tumor heterogeneity, circulating tumor cells and the underlying mechanisms of tumor biological behaviors. Overall, the prospects of single-cell sequencing in facilitating diagnosis, targeted therapy and prognostic prediction among a spectrum of tumors are bright. In the near future, advances in single-cell sequencing will undoubtedly improve our understanding of the biological characteristics of tumors and highlight potential precise therapeutic targets for patients.
Collapse
Affiliation(s)
- Yalan Lei
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Rong Tang
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Jin Xu
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Wei Wang
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Bo Zhang
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Jiang Liu
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China.,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China.,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China.,Pancreatic Cancer Institute, Fudan University, Shanghai, China
| | - Xianjun Yu
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China. .,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China. .,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China. .,Pancreatic Cancer Institute, Fudan University, Shanghai, China.
| | - Si Shi
- Department of Pancreatic Surgery, Fudan University Shanghai Cancer Center, No. 270 Dong'An Road, Shanghai, 200032, China. .,Department of Oncology, Shanghai Medical College, Fudan University, Shanghai, China. .,Shanghai Pancreatic Cancer Institute, No. 270 Dong'An Road, Shanghai, 200032, China. .,Pancreatic Cancer Institute, Fudan University, Shanghai, China.
| |
Collapse
|
6
|
Liang L, Zhu K, Lu S. BEM: Mining Coregulation Patterns in Transcriptomics via Boolean Matrix Factorization. Bioinformatics 2020; 36:4030-4037. [PMID: 31913438 DOI: 10.1093/bioinformatics/btz977] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2019] [Revised: 11/21/2019] [Accepted: 01/02/2020] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The matrix factorization is an important way to analyze coregulation patterns in transcriptomic data, which can reveal the tumor signal perturbation status and subtype classification. However, current matrix factorization methods do not provide clear bicluster structure. Furthermore, these algorithms are based on the assumption of linear combination, which may not be sufficient to capture the coregulation patterns. RESULTS We presented a new algorithm for Boolean matrix factorization (BMF) via expectation maximization (BEM). BEM is more aligned with the molecular mechanism of transcriptomic coregulation and can scale to matrix with over 100 million data points. Synthetic experiments showed that BEM outperformed other BMF methods in terms of reconstruction error. Real-world application demonstrated that BEM is applicable to all kinds of transcriptomic data, including bulk RNA-seq, single-cell RNA-seq and spatial transcriptomic datasets. Given appropriate binarization, BEM was able to extract coregulation patterns consistent with disease subtypes, cell types or spatial anatomy. AVAILABILITY AND IMPLEMENTATION Python source code of BEM is available on https://github.com/LifanLiang/EM_BMF. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lifan Liang
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206-3701, USA
| | - Kunju Zhu
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206-3701, USA.,Department of Central Lab., Clinical Medicine Research Institute, Jinan University, Guangzhou, Guangdong 51063, China
| | - Songjian Lu
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206-3701, USA
| |
Collapse
|
7
|
Seweryn MT, Pietrzak M, Ma Q. Application of information theoretical approaches to assess diversity and similarity in single-cell transcriptomics. Comput Struct Biotechnol J 2020; 18:1830-1837. [PMID: 32728406 PMCID: PMC7371753 DOI: 10.1016/j.csbj.2020.05.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 04/24/2020] [Accepted: 05/06/2020] [Indexed: 02/09/2023] Open
Abstract
Single-cell transcriptomics offers a powerful way to reveal the heterogeneity of individual cells. To date, many information theoretical approaches have been proposed to assess diversity and similarity, and characterize the latent heterogeneity in transcriptome data. Diversity implies gene expression variations and can facilitate the identification of signature genes; while, similarity unravels co-expression patterns for cell type clustering. In this review, we summarized 16 measures of information theory used for evaluating diversity and similarity in single-cell transcriptomic data, provide references and shed light on selected theoretical properties when there is a need to select proper measurements in general cases. We further provide an R package assembling discussed approaches to improve the researchers own single-cell transcriptome study. At last, we prospected further applications of diversity and similarity measures in support of depicting heterogeneity in single-cell multi-omics data.
Collapse
Affiliation(s)
- Michal T. Seweryn
- Center for Medical Genomics, Jagiellonian University, Cracow, Poland
| | - Maciej Pietrzak
- Department of Biomedical Informatics, The Ohio State University, Columbus OH, United States
| | - Qin Ma
- Department of Biomedical Informatics, The Ohio State University, Columbus OH, United States
| |
Collapse
|