1
|
Ban GI, Puviindran V, Xiang Y, Nadesan P, Tang J, Ou J, Guardino N, Nakagawa M, Browne M, Wallace A, Ishikawa K, Shimada E, Martin JT, Diao Y, Kirsch DG, Alman BA. The COMPASS complex maintains the metastatic capacity imparted by a subpopulation of cells in UPS. iScience 2024; 27:110187. [PMID: 38989451 PMCID: PMC11233968 DOI: 10.1016/j.isci.2024.110187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2024] [Revised: 04/20/2024] [Accepted: 06/03/2024] [Indexed: 07/12/2024] Open
Abstract
Intratumoral heterogeneity is common in cancer, particularly in sarcomas like undifferentiated pleomorphic sarcoma (UPS), where individual cells demonstrate a high degree of cytogenic diversity. Previous studies showed that a small subset of cells within UPS, known as the metastatic clone (MC), as responsible for metastasis. Using a CRISPR-based genomic screen in-vivo, we identified the COMPASS complex member Setd1a as a key regulator maintaining the metastatic phenotype of the MC in murine UPS. Depletion of Setd1a inhibited metastasis development in the MC. Transcriptome and chromatin sequencing revealed COMPASS complex target genes in UPS, such as Cxcl10, downregulated in the MC. Deleting Cxcl10 in non-MC cells increased their metastatic potential. Treating mice with human UPS xenografts with a COMPASS complex inhibitor suppressed metastasis without affecting tumor growth in the primary tumor. Our data identified an epigenetic program in a subpopulation of sarcoma cells that maintains metastatic potential.
Collapse
Affiliation(s)
- Ga I Ban
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Vijitha Puviindran
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Yu Xiang
- Department of Cell Biology and Duke Regeneration Center, Duke University School of Medicine, Durham, NC, USA
| | - Puvi Nadesan
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Jackie Tang
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Jianhong Ou
- Department of Cell Biology and Duke Regeneration Center, Duke University School of Medicine, Durham, NC, USA
| | - Nicholas Guardino
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Makoto Nakagawa
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - MaKenna Browne
- Department of Cell Biology and Duke Regeneration Center, Duke University School of Medicine, Durham, NC, USA
| | - Asjah Wallace
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Koji Ishikawa
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Eijiro Shimada
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - John T Martin
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| | - Yarui Diao
- Department of Cell Biology and Duke Regeneration Center, Duke University School of Medicine, Durham, NC, USA
| | - David G Kirsch
- Department of Radiation Oncology, Duke University School of Medicine, Durham, NC, USA
- The Princes Margaret Cancer Centre, Department of Radiation Oncology, University Health Network and the University of Toronto, Toronto, ON, Canada
| | - Benjamin A Alman
- Department of Orthopedic Surgery, Duke University School of Medicine, Durham, NC, USA
| |
Collapse
|
2
|
Wang M, Xie Y, Liu J, Li A, Chen L, Stromberg A, Arnold SM, Liu C, Wang C. A Probabilistic Approach to Estimate the Temporal Order of Pathway Mutations Accounting for Intra-Tumor Heterogeneity. Cancers (Basel) 2024; 16:2488. [PMID: 39001551 PMCID: PMC11240401 DOI: 10.3390/cancers16132488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2024] [Accepted: 06/26/2024] [Indexed: 07/16/2024] Open
Abstract
The development of cancer involves the accumulation of somatic mutations in several essential biological pathways. Delineating the temporal order of pathway mutations during tumorigenesis is crucial for comprehending the biological mechanisms underlying cancer development and identifying potential targets for therapeutic intervention. Several computational and statistical methods have been introduced for estimating the order of somatic mutations based on mutation profile data from a cohort of patients. However, one major issue of current methods is that they do not take into account intra-tumor heterogeneity (ITH), which limits their ability to accurately discern the order of pathway mutations. To address this problem, we propose PATOPAI, a probabilistic approach to estimate the temporal order of mutations at the pathway level by incorporating ITH information as well as pathway and functional annotation information of mutations. PATOPAI uses a maximum likelihood approach to estimate the probability of pathway mutational events occurring in a specific sequence, wherein it focuses on the orders that are consistent with the phylogenetic structure of the tumors. Applications to whole exome sequencing data from The Cancer Genome Atlas (TCGA) illustrate our method's ability to recover the temporal order of pathway mutations in several cancer types.
Collapse
Affiliation(s)
- Menghan Wang
- Department of Statistics, University of Kentucky, Lexington, KY 40536, USA
| | - Yanqi Xie
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40508, USA
| | - Jinpeng Liu
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Division of Cancer Biostatistics, Department of Internal Medicine, University of Kentucky, Lexington, KY 40536, USA
| | - Austin Li
- Department of Computer Science, Princeton University, Princeton, NJ 08540, USA
| | - Li Chen
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Division of Cancer Biostatistics, Department of Internal Medicine, University of Kentucky, Lexington, KY 40536, USA
| | - Arnold Stromberg
- Department of Statistics, University of Kentucky, Lexington, KY 40536, USA
| | - Susanne M Arnold
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Division of Medical Oncology, Department of Internal Medicine, University of Kentucky, Lexington, KY 40536, USA
| | - Chunming Liu
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40508, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
| | - Chi Wang
- Department of Statistics, University of Kentucky, Lexington, KY 40536, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA
- Division of Cancer Biostatistics, Department of Internal Medicine, University of Kentucky, Lexington, KY 40536, USA
| |
Collapse
|
3
|
Gao Y, Feder AF. Detecting branching rate heterogeneity in multifurcating trees with applications in lineage tracing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.27.601073. [PMID: 39005367 PMCID: PMC11244928 DOI: 10.1101/2024.06.27.601073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
Understanding cellular birth rate differences is crucial for predicting cancer progression and interpreting tumor-derived genetic data. Lineage tracing experiments enable detailed reconstruction of cellular genealogies, offering new opportunities to measure branching rate heterogeneity. However, the lineage tracing process can introduce complex tree features that complicate this effort. Here, we examine tree characteristics in lineage tracing-derived genealogies and find that editing window placement leads to multifurcations at a tree's root or tips. We propose several ways in which existing tree topology-based metrics can be extended to test for rate heterogeneity on trees even in the presence of lineage-tracing associated distortions. Although these methods vary in power and robustness, a test based on the J 1 statistic effectively detects branching rate heterogeneity in simulated lineage tracing data. Tests based on other common statistics ( ŝ and the Sackin index) show interior performance to J 1 . We apply our validated methods to xenograft experimental data and find widespread rate heterogeneity across multiple study systems. Our results demonstrate the potential of tree topology statistics in analyzing lineage tracing data, and highlight the challenges associated with adapting phylogenetic methods to these systems.
Collapse
|
4
|
Li R, Shi F, Song L, Yu Z. scGAL: unmask tumor clonal substructure by jointly analyzing independent single-cell copy number and scRNA-seq data. BMC Genomics 2024; 25:393. [PMID: 38649804 PMCID: PMC11034052 DOI: 10.1186/s12864-024-10319-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 04/17/2024] [Indexed: 04/25/2024] Open
Abstract
BACKGROUND Accurately deciphering clonal copy number substructure can provide insights into the evolutionary mechanism of cancer, and clustering single-cell copy number profiles has become an effective means to unmask intra-tumor heterogeneity (ITH). However, copy numbers inferred from single-cell DNA sequencing (scDNA-seq) data are error-prone due to technically confounding factors such as amplification bias and allele-dropout, and this makes it difficult to precisely identify the ITH. RESULTS We introduce a hybrid model called scGAL to infer clonal copy number substructure. It combines an autoencoder with a generative adversarial network to jointly analyze independent single-cell copy number profiles and gene expression data from same cell line. Under an adversarial learning framework, scGAL exploits complementary information from gene expression data to relieve the effects of noise in copy number data, and learns latent representations of scDNA-seq cells for accurate inference of the ITH. Evaluation results on three real cancer datasets suggest scGAL is able to accurately infer clonal architecture and surpasses other similar methods. In addition, assessment of scGAL on various simulated datasets demonstrates its high robustness against the changes of data size and distribution. scGAL can be accessed at: https://github.com/zhyu-lab/scgal . CONCLUSIONS Joint analysis of independent single-cell copy number and gene expression data from a same cell line can effectively exploit complementary information from individual omics, and thus gives more refined indication of clonal copy number substructure.
Collapse
Affiliation(s)
- Ruixiang Li
- School of Information Engineering, Ningxia University, Yinchuan, 750021, China
| | - Fangyuan Shi
- School of Information Engineering, Ningxia University, Yinchuan, 750021, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, Yinchuan, 750021, China
| | - Lijuan Song
- School of Information Engineering, Ningxia University, Yinchuan, 750021, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, Yinchuan, 750021, China
| | - Zhenhua Yu
- School of Information Engineering, Ningxia University, Yinchuan, 750021, China.
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, Yinchuan, 750021, China.
| |
Collapse
|
5
|
Khayatian E, Valiente G, Zhang L. The k-Robinson-Foulds Dissimilarity Measures for Comparison of Labeled Trees. J Comput Biol 2024; 31:328-344. [PMID: 38271573 PMCID: PMC11057537 DOI: 10.1089/cmb.2023.0312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2024] Open
Abstract
Understanding the mutational history of tumor cells is a critical endeavor in unraveling the mechanisms that drive the onset and progression of cancer. Modeling tumor cell evolution with labeled trees motivates researchers to develop different measures to compare labeled trees. Although the Robinson-Foulds (RF) distance is widely used for comparing species trees, its applicability to labeled trees reveals certain limitations. This study introduces the k-RF dissimilarity measures, tailored to address the challenges of labeled tree comparison. The RF distance is succinctly expressed as n-RF in the space of labeled trees with n nodes. Like the RF distance, the k-RF is a pseudometric for multiset-labeled trees and becomes a metric in the space of 1-labeled trees. By setting k to a small value, the k-RF dissimilarity can capture analogous local regions in two labeled trees with different size or different labels.
Collapse
Affiliation(s)
- Elahe Khayatian
- Department of Mathematics, National University of Singapore, Singapore, Singapore
| | - Gabriel Valiente
- Department of Computer Science, Technical University of Catalonia, Barcelona, Spain
| | - Louxin Zhang
- Department of Mathematics, National University of Singapore, Singapore, Singapore
| |
Collapse
|
6
|
Liu F, Shi F, Du F, Cao X, Yu Z. CoT: a transformer-based method for inferring tumor clonal copy number substructure from scDNA-seq data. Brief Bioinform 2024; 25:bbae187. [PMID: 38670159 PMCID: PMC11052634 DOI: 10.1093/bib/bbae187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 03/08/2024] [Accepted: 04/16/2024] [Indexed: 04/28/2024] Open
Abstract
Single-cell DNA sequencing (scDNA-seq) has been an effective means to unscramble intra-tumor heterogeneity, while joint inference of tumor clones and their respective copy number profiles remains a challenging task due to the noisy nature of scDNA-seq data. We introduce a new bioinformatics method called CoT for deciphering clonal copy number substructure. The backbone of CoT is a Copy number Transformer autoencoder that leverages multi-head attention mechanism to explore correlations between different genomic regions, and thus capture global features to create latent embeddings for the cells. CoT makes it convenient to first infer cell subpopulations based on the learned embeddings, and then estimate single-cell copy numbers through joint analysis of read counts data for the cells belonging to the same cluster. This exploitation of clonal substructure information in copy number analysis helps to alleviate the effect of read counts non-uniformity, and yield robust estimations of the tumor copy numbers. Performance evaluation on synthetic and real datasets showcases that CoT outperforms the state of the arts, and is highly useful for deciphering clonal copy number substructure.
Collapse
Affiliation(s)
- Furui Liu
- School of Information Engineering, Ningxia University, 750021, Ningxia, China
| | - Fangyuan Shi
- School of Information Engineering, Ningxia University, 750021, Ningxia, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, 750021, Ningxia, China
| | - Fang Du
- School of Information Engineering, Ningxia University, 750021, Ningxia, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, 750021, Ningxia, China
| | - Xiangmei Cao
- Basic Medical School, Ningxia Medical University, 750001, Ningxia, China
| | - Zhenhua Yu
- School of Information Engineering, Ningxia University, 750021, Ningxia, China
- Collaborative Innovation Center for Ningxia Big Data and Artificial Intelligence Co-founded by Ningxia Municipality and Ministry of Education, Ningxia University, 750021, Ningxia, China
| |
Collapse
|
7
|
Fu X, Luo Z, Deng Y, LaFramboise W, Bartlett D, Schwartz R. Marker selection strategies for circulating tumor DNA guided by phylogenetic inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.21.585352. [PMID: 38586041 PMCID: PMC10996527 DOI: 10.1101/2024.03.21.585352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Motivation Blood-based profiling of tumor DNA ("liquid biopsy") has offered great prospects for non-invasive early cancer diagnosis, treatment monitoring, and clinical guidance, but require further advances in computational methods to become a robust quantitative assay of tumor clonal evolution. We propose new methods to better characterize tumor clonal dynamics from circulating tumor DNA (ctDNA), through application to two specific questions: 1) How to apply longitudinal ctDNA data to refine phylogeny models of clonal evolution, and 2) how to quantify changes in clonal frequencies that may be indicative of treatment response or tumor progression. We pose these questions through a probabilistic framework for optimally identifying maximum likelihood markers and applying them to characterizing clonal evolution. Results We first estimate a distribution over plausible clonal lineage models, using bootstrap samples over pre-treatment tissue-based sequence data. We then refine these lineage models and the clonal frequencies they imply over successive longitudinal samples. We use the resulting framework for modeling and refining tree distributions to pose a set of optimization problems to select ctDNA markers to maximize measures of utility capturing ability to solve the two questions of reducing uncertain in phylogeny models or quantifying clonal frequencies given the models. We tested our methods on synthetic data and showed them to be effective at refining distributions of tree models and clonal frequencies so as to minimize measures of tree distance relative to the ground truth. Application of the tree refinement methods to real tumor data further demonstrated their effectiveness in refining a clonal lineage model and assessing its clonal frequencies. The work shows the power of computational methods to improve marker selection, clonal lineage reconstruction, and clonal dynamics profiling for more precise and quantitative assays of tumor progression. Availability https://github.com/CMUSchwartzLab/Mase-phi.git. Contact russells@andrew.cmu.edu.
Collapse
Affiliation(s)
- Xuecong Fu
- Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Avenue, 15217, Pittsburgh, PA, USA
| | - Zhicheng Luo
- Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Avenue, 15217, Pittsburgh, PA, USA
| | - Yueqian Deng
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, 5000 Forbes Avenue, 15217, Pittsburgh, PA, USA
| | - William LaFramboise
- Allegheny Health Network Cancer Institute, Allegheny Health Network, 320 East North Avenue, 15212, Pittsburgh, PA, USA
| | - David Bartlett
- Allegheny Health Network Cancer Institute, Allegheny Health Network, 320 East North Avenue, 15212, Pittsburgh, PA, USA
| | - Russell Schwartz
- Department of Biological Sciences, Carnegie Mellon University, 5000 Forbes Avenue, 15217, Pittsburgh, PA, USA
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, 5000 Forbes Avenue, 15217, Pittsburgh, PA, USA
| |
Collapse
|
8
|
Srivatsa A, Schwartz R. Optimizing Design of Genomics Studies for Clonal Evolution Analysis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.14.585055. [PMID: 38559253 PMCID: PMC10980045 DOI: 10.1101/2024.03.14.585055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Genomic biotechnologies have seen rapid development over the past two decades, allowing for both the inference and modification of genetic and epigenetic information at the single cell level. While these tools present enormous potential for basic research, diagnostics, and treatment, they also raise difficult issues of how to design research studies to deploy these tools most effectively. In designing a study at the population or individual level, a researcher might combine several different sequencing modalities and sampling protocols, each with different utility, costs, and other tradeoffs. The central problem this paper attempts to address is then how one might create an optimal study design for a genomic analysis, with particular focus on studies involving somatic variation, typically for applications in cancer genomics. We pose the study design problem as a stochastic constrained nonlinear optimization problem and introduce a simulation-centered optimization procedure that iteratively optimizes the objective function using surrogate modeling combined with pattern and gradient search. Finally, we demonstrate the use of our procedure on diverse test cases to derive resource and study design allocations optimized for various objectives for the study of somatic cell populations.
Collapse
Affiliation(s)
- Arjun Srivatsa
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh PA 15213, USA
| | - Russell Schwartz
- Ray and Stephanie Lane Computational Biology Department, Carnegie Mellon University, Pittsburgh PA 15213, USA
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh PA 15213, USA
| |
Collapse
|
9
|
George J, Maas L, Abedpour N, Cartolano M, Kaiser L, Fischer RN, Scheel AH, Weber JP, Hellmich M, Bosco G, Volz C, Mueller C, Dahmen I, John F, Alves CP, Werr L, Panse JP, Kirschner M, Engel-Riedel W, Jürgens J, Stoelben E, Brockmann M, Grau S, Sebastian M, Stratmann JA, Kern J, Hummel HD, Hegedüs B, Schuler M, Plönes T, Aigner C, Elter T, Toepelt K, Ko YD, Kurz S, Grohé C, Serke M, Höpker K, Hagmeyer L, Doerr F, Hekmath K, Strapatsas J, Kambartel KO, Chakupurakal G, Busch A, Bauernfeind FG, Griesinger F, Luers A, Dirks W, Wiewrodt R, Luecke A, Rodermann E, Diel A, Hagen V, Severin K, Ullrich RT, Reinhardt HC, Quaas A, Bogus M, Courts C, Nürnberg P, Becker K, Achter V, Büttner R, Wolf J, Peifer M, Thomas RK. Evolutionary trajectories of small cell lung cancer under therapy. Nature 2024; 627:880-889. [PMID: 38480884 PMCID: PMC10972747 DOI: 10.1038/s41586-024-07177-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Accepted: 02/07/2024] [Indexed: 03/18/2024]
Abstract
The evolutionary processes that underlie the marked sensitivity of small cell lung cancer (SCLC) to chemotherapy and rapid relapse are unknown1-3. Here we determined tumour phylogenies at diagnosis and throughout chemotherapy and immunotherapy by multiregion sequencing of 160 tumours from 65 patients. Treatment-naive SCLC exhibited clonal homogeneity at distinct tumour sites, whereas first-line platinum-based chemotherapy led to a burst in genomic intratumour heterogeneity and spatial clonal diversity. We observed branched evolution and a shift to ancestral clones underlying tumour relapse. Effective radio- or immunotherapy induced a re-expansion of founder clones with acquired genomic damage from first-line chemotherapy. Whereas TP53 and RB1 alterations were exclusively part of the common ancestor, MYC family amplifications were frequently not constituents of the founder clone. At relapse, emerging subclonal mutations affected key genes associated with SCLC biology, and tumours harbouring clonal CREBBP/EP300 alterations underwent genome duplications. Gene-damaging TP53 alterations and co-alterations of TP53 missense mutations with TP73, CREBBP/EP300 or FMN2 were significantly associated with shorter disease relapse following chemotherapy. In summary, we uncover key processes of the genomic evolution of SCLC under therapy, identify the common ancestor as the source of clonal diversity at relapse and show central genomic patterns associated with sensitivity and resistance to chemotherapy.
Collapse
Affiliation(s)
- Julie George
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
- Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine and University Hospital Cologne, University Hospital of Cologne, Cologne, Germany.
| | - Lukas Maas
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Nima Abedpour
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
- Department I of Internal Medicine, Centre for Integrated Oncology Aachen Bonn Cologne Duesseldorf, University Hospital Cologne, Cologne, Germany
- Cancer Research Centre Cologne Essen, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Maria Cartolano
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
- Centre for Molecular Medicine, University of Cologne, Cologne, Germany
| | - Laura Kaiser
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Rieke N Fischer
- Department I of Internal Medicine, Lung Cancer Group Cologne, University Hospital Cologne, Cologne, Germany
| | - Andreas H Scheel
- Institute of Pathology, Medical Faculty, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Jan-Philipp Weber
- Department I of Internal Medicine, Lung Cancer Group Cologne, University Hospital Cologne, Cologne, Germany
| | - Martin Hellmich
- Institute of Medical Statistics, and Computational Biology, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Graziella Bosco
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Caroline Volz
- Department I of Internal Medicine, Centre for Integrated Oncology Aachen Bonn Cologne Duesseldorf, University Hospital Cologne, Cologne, Germany
- Centre for Molecular Medicine, University of Cologne, Cologne, Germany
| | - Christian Mueller
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
- Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine and University Hospital Cologne, University Hospital of Cologne, Cologne, Germany
| | - Ilona Dahmen
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Felix John
- Department I of Internal Medicine, Lung Cancer Group Cologne, University Hospital Cologne, Cologne, Germany
| | - Cleidson Padua Alves
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Lisa Werr
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Jens Peter Panse
- Department of Haematology, Oncology, Haemostaseology and Stem Cell Transplantation, University Hospital RWTH Aachen, Aachen, Germany
- Centre for Integrated Oncology, Aachen Bonn Cologne Düsseldorf, Aachen, Germany
| | - Martin Kirschner
- Department of Haematology, Oncology, Haemostaseology and Stem Cell Transplantation, University Hospital RWTH Aachen, Aachen, Germany
- Centre for Integrated Oncology, Aachen Bonn Cologne Düsseldorf, Aachen, Germany
| | - Walburga Engel-Riedel
- Department of Pneumology, City of Cologne Municipal Hospitals, Lung Hospital Cologne Merheim, Cologne, Germany
| | - Jessica Jürgens
- Department of Pneumology, City of Cologne Municipal Hospitals, Lung Hospital Cologne Merheim, Cologne, Germany
| | - Erich Stoelben
- Thoraxclinic Cologne, Thoracic Surgery, St. Hildegardis-Krankenhaus, Cologne, Germany
| | - Michael Brockmann
- Department of Pathology, City of Cologne Municipal Hospitals, Witten/Herdecke University, Cologne, Germany
| | - Stefan Grau
- Department of General Neurosurgery, Centre of Neurosurgery, University Hospital Cologne, Cologne, Germany
- University Medicine Marburg - Campus Fulda, Department of Neurosurgery, Fulda, Germany
| | - Martin Sebastian
- Department of Medicine II, Haematology/Oncology, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
- Frankfurt Cancer Institute, Goethe University Frankfurt, Frankfurt, Germany
- DKFZ, German Cancer Research Centre, German Cancer Consortium, Heidelberg, Germany
| | - Jan A Stratmann
- Department of Medicine II, Haematology/Oncology, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
- Frankfurt Cancer Institute, Goethe University Frankfurt, Frankfurt, Germany
| | - Jens Kern
- Klinikum Würzburg Mitte - Missioklinik site, Pneumology and Respiratory Medicine, Würzburg, Germany
| | - Horst-Dieter Hummel
- Translational Oncology/Early Clinical Trial Unit, Comprehensive Cancer Centre Mainfranken, University Hospital Wuerzburg, Wuerzburg, Germany
| | - Balazs Hegedüs
- Department of Thoracic Surgery, University Medicine Essen - Ruhrlandklinik, University Duisburg-Essen, Essen, Germany
| | - Martin Schuler
- DKFZ, German Cancer Research Centre, German Cancer Consortium, Heidelberg, Germany
- Department of Medical Oncology, West German Cancer Centre Essen, University Duisburg-Essen, Essen, Germany
| | - Till Plönes
- Department of Medical Oncology, West German Cancer Centre Essen, University Duisburg-Essen, Essen, Germany
- Division of Thoracic Surgery, Department of General, Thoracic and Vascular Surgery, University Hospital Carl Gustav Carus, Dresden, Germany
| | - Clemens Aigner
- Department of Thoracic Surgery, University Medicine Essen - Ruhrlandklinik, University Duisburg-Essen, Essen, Germany
- Department of Thoracic Surgery, Medical University of Vienna, Vienna General Hospital, Vienna, Austria
| | - Thomas Elter
- Department I of Internal Medicine, Centre for Integrated Oncology Aachen Bonn Cologne Duesseldorf, University Hospital Cologne, Cologne, Germany
| | - Karin Toepelt
- Department I of Internal Medicine, Centre for Integrated Oncology Aachen Bonn Cologne Duesseldorf, University Hospital Cologne, Cologne, Germany
| | | | - Sylke Kurz
- Department of Respiratory Diseases, Evangelische Lungenklinik, Berlin, Germany
| | - Christian Grohé
- Department of Respiratory Diseases, Evangelische Lungenklinik, Berlin, Germany
| | - Monika Serke
- DGD Lungenklinik Hemer, Internal Medicine, Pneumology and Oncology, Hemer, Germany
| | - Katja Höpker
- Clinic III for Internal Medicine, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Lars Hagmeyer
- Clinic of Pneumology and Allergology, Centre for Sleep Medicine and Respiratory Care, Bethanien Hospital Solingen, Solingen, Germany
| | - Fabian Doerr
- Department of Thoracic Surgery, University Medicine Essen - Ruhrlandklinik, University Duisburg-Essen, Essen, Germany
- Department of Cardiothoracic Surgery, University Hospital of Cologne, Cologne, Germany
| | - Khosro Hekmath
- Department of Cardiothoracic Surgery, University Hospital of Cologne, Cologne, Germany
| | - Judith Strapatsas
- Department of Haematology, Oncology and Clinical Immunology, University Hospital of Duesseldorf, Düsseldorf, Germany
| | | | | | - Annette Busch
- Medical Clinic III for Oncology, Haematology, Immune-Oncology and Rheumatology, Centre for Integrative Medicine, University Hospital Bonn, Bonn, Germany
| | - Franz-Georg Bauernfeind
- Medical Clinic III for Oncology, Haematology, Immune-Oncology and Rheumatology, Centre for Integrative Medicine, University Hospital Bonn, Bonn, Germany
| | - Frank Griesinger
- Pius-Hospital Oldenburg, Department of Haematology and Oncology, University Department Internal Medicine-Oncology, University Medicine Oldenburg, Oldenburg, Germany
| | - Anne Luers
- Pius-Hospital Oldenburg, Department of Haematology and Oncology, University Department Internal Medicine-Oncology, University Medicine Oldenburg, Oldenburg, Germany
| | - Wiebke Dirks
- Pius-Hospital Oldenburg, Department of Haematology and Oncology, University Department Internal Medicine-Oncology, University Medicine Oldenburg, Oldenburg, Germany
| | - Rainer Wiewrodt
- Pulmonary Division, Department of Medicine A, Münster University Hospital, Münster, Germany
| | - Andrea Luecke
- Pulmonary Division, Department of Medicine A, Münster University Hospital, Münster, Germany
| | - Ernst Rodermann
- Onkologie Rheinsieg, Praxisnetzwerk Hämatologie und Internistische Onkologie, Troisdorf, Germany
| | - Andreas Diel
- Onkologie Rheinsieg, Praxisnetzwerk Hämatologie und Internistische Onkologie, Troisdorf, Germany
| | - Volker Hagen
- Clinic II for Internal Medicine, St.-Johannes-Hospital Dortmund, Dortmund, Germany
| | - Kai Severin
- Haematologie und Onkologie Köln MV-Zentrum, Cologne, Germany
| | - Roland T Ullrich
- Department I of Internal Medicine, Centre for Integrated Oncology Aachen Bonn Cologne Duesseldorf, University Hospital Cologne, Cologne, Germany
- Centre for Molecular Medicine, University of Cologne, Cologne, Germany
| | - Hans Christian Reinhardt
- Department of Haematology and Stem Cell Transplantation, University Hospital Essen, Essen, Germany
- West German Cancer Centre, University Hospital Essen, Essen, Germany
| | - Alexander Quaas
- Institute of Pathology, Medical Faculty, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Magdalena Bogus
- Institute of Legal Medicine, University of Cologne, Cologne, Germany
| | - Cornelius Courts
- Institute of Legal Medicine, University of Cologne, Cologne, Germany
| | - Peter Nürnberg
- Cologne Centre for Genomics, West German Genome Centre, University of Cologne, Cologne, Germany
| | - Kerstin Becker
- Cologne Centre for Genomics, West German Genome Centre, University of Cologne, Cologne, Germany
| | - Viktor Achter
- Computing Centre, University of Cologne, Cologne, Germany
| | - Reinhard Büttner
- Institute of Pathology, Medical Faculty, University Hospital Cologne, University of Cologne, Cologne, Germany
| | - Jürgen Wolf
- Department I of Internal Medicine, Lung Cancer Group Cologne, University Hospital Cologne, Cologne, Germany
| | - Martin Peifer
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
- Centre for Molecular Medicine, University of Cologne, Cologne, Germany.
| | - Roman K Thomas
- Department of Translational Genomics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany.
- Institute of Pathology, Medical Faculty, University Hospital Cologne, University of Cologne, Cologne, Germany.
- DKFZ, German Cancer Research Centre, German Cancer Consortium, Heidelberg, Germany.
| |
Collapse
|
10
|
Qi Y, El-Kebir M. Consensus Tree Under the Ancestor-Descendant Distance is NP-Hard. J Comput Biol 2024; 31:58-70. [PMID: 38010616 DOI: 10.1089/cmb.2023.0262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2023] Open
Abstract
Due to uncertainty in tumor phylogeny inference from sequencing data, many methods infer multiple, equally plausible phylogenies for the same cancer. To summarize the solution space T of tumor phylogenies, consensus tree methods seek a single best representative tree S under a specified pairwise tree distance function. One such distance function is the ancestor-descendant (AD) distance [Formula: see text] , which equals the size of the symmetric difference of the transitive closures of the edge sets [Formula: see text] and [Formula: see text] . Here, we show that finding a consensus tree S for tumor phylogenies T that minimizes the total AD distance [Formula: see text] is NP-hard.
Collapse
Affiliation(s)
- Yuanyuan Qi
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| | - Mohammed El-Kebir
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
- Cancer Center at Illinois, University of Illinois Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
11
|
Tijhuis AE, Foijer F. Characterizing chromosomal instability-driven cancer evolution and cell fitness at a glance. J Cell Sci 2024; 137:jcs260199. [PMID: 38224461 DOI: 10.1242/jcs.260199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2024] Open
Abstract
Chromosomal instability (CIN), an increased rate of chromosome segregation errors during mitosis, is a hallmark of cancer cells. CIN leads to karyotype differences between cells and thus large-scale heterogeneity among individual cancer cells; therefore, it plays an important role in cancer evolution. Studying CIN and its consequences is technically challenging, but various technologies have been developed to track karyotype dynamics during tumorigenesis, trace clonal lineages and link genomic changes to cancer phenotypes at single-cell resolution. These methods provide valuable insight not only into the role of CIN in cancer progression, but also into cancer cell fitness. In this Cell Science at a Glance article and the accompanying poster, we discuss the relationship between CIN, cancer cell fitness and evolution, and highlight techniques that can be used to study the relationship between these factors. To that end, we explore methods of assessing cancer cell fitness, particularly for chromosomally unstable cancer.
Collapse
Affiliation(s)
- Andréa E Tijhuis
- European Research Institute for the Biology of Ageing , University Medical Center Groningen, University of Groningen,9713 AV Groningen, The Netherlands
| | - Floris Foijer
- European Research Institute for the Biology of Ageing , University Medical Center Groningen, University of Groningen,9713 AV Groningen, The Netherlands
| |
Collapse
|
12
|
Pedrosa VF, Santos PADPD, Giesta LB, Zemor JC, Okamoto MH, Chagas AC, Bezerra ADC, Poersch LHDS, Romano LA. Occurrence of hepatic and splenic lipomas in Nile tilapia (Oreochromis niloticus). JOURNAL OF FISH DISEASES 2024; 47:e13869. [PMID: 37792353 DOI: 10.1111/jfd.13869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/19/2023] [Accepted: 09/22/2023] [Indexed: 10/05/2023]
Affiliation(s)
- Virgínia F Pedrosa
- Laboratory of Immunology and Pathology of Aquatic Organisms, Institute of Oceanography, Federal University of Rio Grande, Brazil
| | - Pedro A De P Dos Santos
- Laboratory of Immunology and Pathology of Aquatic Organisms, Institute of Oceanography, Federal University of Rio Grande, Brazil
| | - Luana B Giesta
- Laboratory of Immunology and Pathology of Aquatic Organisms, Institute of Oceanography, Federal University of Rio Grande, Brazil
| | - Julio C Zemor
- Institute of Oceanography, Aquaculture Marine Station, Federal University of Rio Grande, Brazil
| | - Marcelo H Okamoto
- Institute of Oceanography, Aquaculture Marine Station, Federal University of Rio Grande, Brazil
| | - Andrezza C Chagas
- Institute of Oceanography, Aquaculture Marine Station, Federal University of Rio Grande, Brazil
| | - Aline da C Bezerra
- Institute of Oceanography, Aquaculture Marine Station, Federal University of Rio Grande, Brazil
| | - Luis H da S Poersch
- Institute of Oceanography, Aquaculture Marine Station, Federal University of Rio Grande, Brazil
| | - Luis A Romano
- Laboratory of Immunology and Pathology of Aquatic Organisms, Institute of Oceanography, Federal University of Rio Grande, Brazil
| |
Collapse
|
13
|
Rossi N, Gigante N, Vitacolonna N, Piazza C. Inferring Markov Chains to Describe Convergent Tumor Evolution With CIMICE. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:106-119. [PMID: 38015671 DOI: 10.1109/tcbb.2023.3337258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]
Abstract
The field of tumor phylogenetics focuses on studying the differences within cancer cell populations. Many efforts are done within the scientific community to build cancer progression models trying to understand the heterogeneity of such diseases. These models are highly dependent on the kind of data used for their construction, therefore, as the experimental technologies evolve, it is of major importance to exploit their peculiarities. In this work we describe a cancer progression model based on Single Cell DNA Sequencing data. When constructing the model, we focus on tailoring the formalism on the specificity of the data. We operate by defining a minimal set of assumptions needed to reconstruct a flexible DAG structured model, capable of identifying progression beyond the limitation of the infinite site assumption. Our proposal is conservative in the sense that we aim to neither discard nor infer knowledge which is not represented in the data. We provide simulations and analytical results to show the features of our model, test it on real data, show how it can be integrated with other approaches to cope with input noise. Moreover, our framework can be exploited to produce simulated data that follows our theoretical assumptions. Finally, we provide an open source R implementation of our approach, called CIMICE, that is publicly available on BioConductor.
Collapse
|
14
|
Bristy NA, Fu X, Schwartz R. Sc-TUSV-ext: Single-cell clonal lineage inference from single nucleotide variants (SNV), copy number alterations (CNA) and structural variants (SV). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570724. [PMID: 38106049 PMCID: PMC10723466 DOI: 10.1101/2023.12.07.570724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Clonal lineage inference ("tumor phylogenetics") has become a crucial tool for making sense of somatic evolution processes that underlie cancer development and are increasingly recognized as part of normal tissue growth and aging. The inference of clonal lineage trees from single cell sequence data offers particular promise for revealing processes of somatic evolution in unprecedented detail. However, most such tools are based on fairly restrictive models of the types of mutation events observed in somatic evolution and of the processes by which they develop. The present work seeks to enhance the power and versatility of tools for single-cell lineage reconstruction by making more comprehensive use of the range of molecular variant types by which tumors evolve. We introduce Sc-TUSV-ext, an integer linear programming (ILP) based tumor phylogeny reconstruction method that, for the first time, integrates single nucleotide variants (SNV), copy number alterations (CNA) and structural variations (SV) into clonal lineage reconstruction from single-cell DNA sequencing data. We show on synthetic data that accounting for these variant types collectively leads to improved accuracy in clonal lineage reconstruction relative to prior methods that consider only subsets of the variant types. We further demonstrate the effectiveness on real data in resolving clonal evolution in the presence of multiple variant types, providing a path towards more comprehensive insight into how various forms of somatic mutability collectively shape tissue development.
Collapse
|
15
|
Han Y, Molloy EK. Quartets enable statistically consistent estimation of cell lineage trees under an unbiased error and missingness model. Algorithms Mol Biol 2023; 18:19. [PMID: 38041123 PMCID: PMC10691101 DOI: 10.1186/s13015-023-00248-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 11/19/2023] [Indexed: 12/03/2023] Open
Abstract
Cancer progression and treatment can be informed by reconstructing its evolutionary history from tumor cells. Although many methods exist to estimate evolutionary trees (called phylogenies) from molecular sequences, traditional approaches assume the input data are error-free and the output tree is fully resolved. These assumptions are challenged in tumor phylogenetics because single-cell sequencing produces sparse, error-ridden data and because tumors evolve clonally. Here, we study the theoretical utility of methods based on quartets (four-leaf, unrooted phylogenetic trees) in light of these barriers. We consider a popular tumor phylogenetics model, in which mutations arise on a (highly unresolved) tree and then (unbiased) errors and missing values are introduced. Quartets are then implied by mutations present in two cells and absent from two cells. Our main result is that the most probable quartet identifies the unrooted model tree on four cells. This motivates seeking a tree such that the number of quartets shared between it and the input mutations is maximized. We prove an optimal solution to this problem is a consistent estimator of the unrooted cell lineage tree; this guarantee includes the case where the model tree is highly unresolved, with error defined as the number of false negative branches. Lastly, we outline how quartet-based methods might be employed when there are copy number aberrations and other challenges specific to tumor phylogenetics.
Collapse
Affiliation(s)
- Yunheng Han
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Erin K Molloy
- Department of Computer Science, University of Maryland, College Park, MD, USA.
- University of Maryland Institute for Advanced Computer Studies, College Park, MD, USA.
| |
Collapse
|
16
|
Sashittal P, Zhang H, Iacobuzio-Donahue CA, Raphael BJ. ConDoR: tumor phylogeny inference with a copy-number constrained mutation loss model. Genome Biol 2023; 24:272. [PMID: 38037115 PMCID: PMC10688497 DOI: 10.1186/s13059-023-03106-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Accepted: 11/07/2023] [Indexed: 12/02/2023] Open
Abstract
A tumor contains a diverse collection of somatic mutations that reflect its past evolutionary history and that range in scale from single nucleotide variants (SNVs) to large-scale copy-number aberrations (CNAs). However, no current single-cell DNA sequencing (scDNA-seq) technology produces accurate measurements of both SNVs and CNAs, complicating the inference of tumor phylogenies. We introduce a new evolutionary model, the constrained k-Dollo model, that uses SNVs as phylogenetic markers but constrains losses of SNVs according to clusters of cells. We derive an algorithm, ConDoR, that infers phylogenies from targeted scDNA-seq data using this model. We demonstrate the advantages of ConDoR on simulated and real scDNA-seq data.
Collapse
Affiliation(s)
| | - Haochen Zhang
- Gerstner Sloan Kettering Graduate School of Biomedical Sciences, Memorial Sloan Kettering Cancer Center, NY, USA
| | - Christine A Iacobuzio-Donahue
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, NY, USA
- David M. Rubenstein Center for Pancreatic Cancer Research, Memorial Sloan Kettering Cancer Center, NY, USA
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, NY, USA
| | | |
Collapse
|
17
|
Wang Z, Sun J, Gao Y, Xue Y, Zhang Y, Li K, Zhang W, Zhang C, Zu J, Zhang L. Fusang: a framework for phylogenetic tree inference via deep learning. Nucleic Acids Res 2023; 51:10909-10923. [PMID: 37819036 PMCID: PMC10639059 DOI: 10.1093/nar/gkad805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 08/17/2023] [Accepted: 09/20/2023] [Indexed: 10/13/2023] Open
Abstract
Phylogenetic tree inference is a classic fundamental task in evolutionary biology that entails inferring the evolutionary relationship of targets based on multiple sequence alignment (MSA). Maximum likelihood (ML) and Bayesian inference (BI) methods have dominated phylogenetic tree inference for many years, but BI is too slow to handle a large number of sequences. Recently, deep learning (DL) has been successfully applied to quartet phylogenetic tree inference and tentatively extended into more sequences with the quartet puzzling algorithm. However, no DL-based tools are immediately available for practical real-world applications. In this paper, we propose Fusang (http://fusang.cibr.ac.cn), a DL-based framework that achieves comparable performance to that of ML-based tools with both simulated and real datasets. More importantly, with continuous optimization, e.g. through the use of customized training datasets for real-world scenarios, Fusang has great potential to outperform ML-based tools.
Collapse
Affiliation(s)
- Zhicheng Wang
- Chinese Institute for Brain Research, Beijing 102206, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Jinnan Sun
- School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China
| | - Yuan Gao
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Yongwei Xue
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Yubo Zhang
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Kuan Li
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Wei Zhang
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing 100871, China
| | - Chi Zhang
- Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Center for Excellence in Life and Paleoenvironment, Chinese Academy of Sciences, Beijing 100044, China
| | - Jian Zu
- School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an 710049, China
| | - Li Zhang
- Chinese Institute for Brain Research, Beijing 102206, China
| |
Collapse
|
18
|
Liu Y, Li XC, Rashidi Mehrabadi F, Schäffer AA, Pratt D, Crawford DR, Malikić S, Molloy EK, Gopalan V, Mount SM, Ruppin E, Aldape KD, Sahinalp SC. Single-cell methylation sequencing data reveal succinct metastatic migration histories and tumor progression models. Genome Res 2023; 33:1089-1100. [PMID: 37316351 PMCID: PMC10538489 DOI: 10.1101/gr.277608.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/06/2023] [Indexed: 06/16/2023]
Abstract
Recent studies exploring the impact of methylation in tumor evolution suggest that although the methylation status of many of the CpG sites are preserved across distinct lineages, others are altered as the cancer progresses. Because changes in methylation status of a CpG site may be retained in mitosis, they could be used to infer the progression history of a tumor via single-cell lineage tree reconstruction. In this work, we introduce the first principled distance-based computational method, Sgootr, for inferring a tumor's single-cell methylation lineage tree and for jointly identifying lineage-informative CpG sites that harbor changes in methylation status that are retained along the lineage. We apply Sgootr on single-cell bisulfite-treated whole-genome sequencing data of multiregionally sampled tumor cells from nine metastatic colorectal cancer patients, as well as multiregionally sampled single-cell reduced-representation bisulfite sequencing data from a glioblastoma patient. We show that the tumor lineages constructed reveal a simple model underlying tumor progression and metastatic seeding. A comparison of Sgootr against alternative approaches shows that Sgootr can construct lineage trees with fewer migration events and with more in concordance with the sequential-progression model of tumor evolution, with a running time a fraction of that used in prior studies. Lineage-informative CpG sites identified by Sgootr are in inter-CpG island (CGI) regions, as opposed to intra-CGIs, which have been the main regions of interest in genomic methylation-related analyses.
Collapse
Affiliation(s)
- Yuelin Liu
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Department of Computer Science, University of Maryland, College Park, Maryland 20742, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Xuan Cindy Li
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Program in Computational Biology, Bioinformatics, and Genomics, University of Maryland, College Park, Maryland 20742, USA
| | - Farid Rashidi Mehrabadi
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Department of Computer Science, Indiana University, Bloomington, Indiana 47408, USA
- Laboratory of Human Carcinogenesis, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Alejandro A Schäffer
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Drew Pratt
- Laboratory of Pathology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - David R Crawford
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
- Program in Computational Biology, Bioinformatics, and Genomics, University of Maryland, College Park, Maryland 20742, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland 20742, USA
| | - Salem Malikić
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Erin K Molloy
- Department of Computer Science, University of Maryland, College Park, Maryland 20742, USA
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Vishaka Gopalan
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Stephen M Mount
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland 20742, USA
| | - Eytan Ruppin
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Kenneth D Aldape
- Laboratory of Pathology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - S Cenk Sahinalp
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892, USA;
| |
Collapse
|
19
|
Guang Z, Smith-Erb M, Oesper L. A weighted distance-based approach for deriving consensus tumor evolutionary trees. Bioinformatics 2023; 39:i204-i212. [PMID: 37387177 DOI: 10.1093/bioinformatics/btad230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION The acquisition of somatic mutations by a tumor can be modeled by a type of evolutionary tree. However, it is impossible to observe this tree directly. Instead, numerous algorithms have been developed to infer such a tree from different types of sequencing data. But such methods can produce conflicting trees for the same patient, making it desirable to have approaches that can combine several such tumor trees into a consensus or summary tree. We introduce The Weighted m-Tumor Tree Consensus Problem (W-m-TTCP) to find a consensus tree among multiple plausible tumor evolutionary histories, each assigned a confidence weight, given a specific distance measure between tumor trees. We present an algorithm called TuELiP that is based on integer linear programming which solves the W-m-TTCP, and unlike other existing consensus methods, allows the input trees to be weighted differently. RESULTS On simulated data we show that TuELiP outperforms two existing methods at correctly identifying the true underlying tree used to create the simulations. We also show that the incorporation of weights can lead to more accurate tree inference. On a Triple-Negative Breast Cancer dataset, we show that including confidence weights can have important impacts on the consensus tree identified. AVAILABILITY An implementation of TuELiP and simulated datasets are available at https://bitbucket.org/oesperlab/consensus-ilp/src/main/.
Collapse
Affiliation(s)
- Ziyun Guang
- Department of Computer Science, Carleton College, Northfield, MN 55057, USA
| | - Matthew Smith-Erb
- Department of Computer Science, Carleton College, Northfield, MN 55057, USA
| | - Layla Oesper
- Department of Computer Science, Carleton College, Northfield, MN 55057, USA
| |
Collapse
|
20
|
Luo XG, Kuipers J, Beerenwinkel N. Joint inference of exclusivity patterns and recurrent trajectories from tumor mutation trees. Nat Commun 2023; 14:3676. [PMID: 37344522 DOI: 10.1038/s41467-023-39400-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Accepted: 06/12/2023] [Indexed: 06/23/2023] Open
Abstract
Cancer progression is an evolutionary process shaped by both deterministic and stochastic forces. Multi-region and single-cell sequencing of tumors enable high-resolution reconstruction of the mutational history of each tumor and highlight the extensive diversity across tumors and patients. Resolving the interactions among mutations and recovering recurrent evolutionary processes may offer greater opportunities for successful therapeutic strategies. To this end, we present a novel probabilistic framework, called TreeMHN, for the joint inference of exclusivity patterns and recurrent trajectories from a cohort of intra-tumor phylogenetic trees. Through simulations, we show that TreeMHN outperforms existing alternatives that can only focus on one aspect of the task. By analyzing datasets of blood, lung, and breast cancers, we find the most likely evolutionary trajectories and mutational patterns, consistent with and enriching our current understanding of tumorigenesis. Moreover, TreeMHN facilitates the prediction of tumor evolution and provides probabilistic measures on the next mutational events given a tumor tree, a prerequisite for evolution-guided treatment strategies.
Collapse
Affiliation(s)
- Xiang Ge Luo
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland
| | - Jack Kuipers
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, 4058, Basel, Switzerland.
- SIB Swiss Institute of Bioinformatics, Mattenstrasse 26, 4058, Basel, Switzerland.
| |
Collapse
|
21
|
Lu B, Curtius K, Graham TA, Yang Z, Barnes CP. CNETML: maximum likelihood inference of phylogeny from copy number profiles of multiple samples. Genome Biol 2023; 24:144. [PMID: 37340508 DOI: 10.1186/s13059-023-02983-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 06/08/2023] [Indexed: 06/22/2023] Open
Abstract
Phylogenetic trees based on copy number profiles from multiple samples of a patient are helpful to understand cancer evolution. Here, we develop a new maximum likelihood method, CNETML, to infer phylogenies from such data. CNETML is the first program to jointly infer the tree topology, node ages, and mutation rates from total copy numbers of longitudinal samples. Our extensive simulations suggest CNETML performs well on copy numbers relative to ploidy and under slight violation of model assumptions. The application of CNETML to real data generates results consistent with previous discoveries and provides novel early copy number events for further investigation.
Collapse
Affiliation(s)
- Bingxin Lu
- Department of Cell and Developmental Biology, University College London, London, UK.
- UCL Genetics Institute, University College London, London, UK.
| | - Kit Curtius
- Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Trevor A Graham
- Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
- Centre for Evolution and Cancer, Institute of Cancer Research, London, UK
| | - Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Chris P Barnes
- Department of Cell and Developmental Biology, University College London, London, UK.
- UCL Genetics Institute, University College London, London, UK.
| |
Collapse
|
22
|
Hessey S, Fessas P, Zaccaria S, Jamal-Hanjani M, Swanton C. Insights into the metastatic cascade through research autopsies. Trends Cancer 2023; 9:490-502. [PMID: 37059687 DOI: 10.1016/j.trecan.2023.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 03/05/2023] [Accepted: 03/07/2023] [Indexed: 04/16/2023]
Abstract
Metastasis is a complex process and the leading cause of cancer-related death globally. Recent studies have demonstrated that genomic sequencing data from paired primary and metastatic tumours can be used to trace the evolutionary origins of cells responsible for metastasis. This approach has yielded new insights into the genomic alterations that engender metastatic potential, and the mechanisms by which cancer spreads. Given that the reliability of these approaches is contingent upon how representative the samples are of primary and metastatic tumour heterogeneity, we review insights from studies that have reconstructed the evolution of metastasis within the context of their cohorts and designs. We discuss the role of research autopsies in achieving the comprehensive sampling necessary to advance the current understanding of metastasis.
Collapse
Affiliation(s)
- Sonya Hessey
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK; Cancer Metastasis Laboratory, University College London Cancer Institute, London, UK; Computational Cancer Genomics Research Group, University College London Cancer Institute, London, UK
| | - Petros Fessas
- Cancer Metastasis Laboratory, University College London Cancer Institute, London, UK
| | - Simone Zaccaria
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK; Computational Cancer Genomics Research Group, University College London Cancer Institute, London, UK
| | - Mariam Jamal-Hanjani
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK; Cancer Metastasis Laboratory, University College London Cancer Institute, London, UK; Department of Oncology, University College London Hospitals, London, UK.
| | - Charles Swanton
- Cancer Research UK Lung Cancer Centre of Excellence, University College London Cancer Institute, London, UK; Department of Oncology, University College London Hospitals, London, UK; Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, UK.
| |
Collapse
|
23
|
Zhu X, Zhao W, Zhou Z, Gu X. Unraveling the Drivers of Tumorigenesis in the Context of Evolution: Theoretical Models and Bioinformatics Tools. J Mol Evol 2023:10.1007/s00239-023-10117-0. [PMID: 37246992 DOI: 10.1007/s00239-023-10117-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2022] [Accepted: 05/09/2023] [Indexed: 05/30/2023]
Abstract
Cancer originates from somatic cells that have accumulated mutations. These mutations alter the phenotype of the cells, allowing them to escape homeostatic regulation that maintains normal cell numbers. The emergence of malignancies is an evolutionary process in which the random accumulation of somatic mutations and sequential selection of dominant clones cause cancer cells to proliferate. The development of technologies such as high-throughput sequencing has provided a powerful means to measure subclonal evolutionary dynamics across space and time. Here, we review the patterns that may be observed in cancer evolution and the methods available for quantifying the evolutionary dynamics of cancer. An improved understanding of the evolutionary trajectories of cancer will enable us to explore the molecular mechanism of tumorigenesis and to design tailored treatment strategies.
Collapse
Affiliation(s)
- Xunuo Zhu
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Wenyi Zhao
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Zhan Zhou
- Innovation Institute for Artificial Intelligence in Medicine, Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, 322000, China.
- Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 310058, China.
| | - Xun Gu
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA, 50011, USA.
| |
Collapse
|
24
|
Hamis S, Somervuo P, Ågren JA, Tadele DS, Kesseli J, Scott JG, Nykter M, Gerlee P, Finkelshtein D, Ovaskainen O. Spatial cumulant models enable spatially informed treatment strategies and analysis of local interactions in cancer systems. J Math Biol 2023; 86:68. [PMID: 37017776 PMCID: PMC10076412 DOI: 10.1007/s00285-023-01903-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 01/13/2023] [Accepted: 03/09/2023] [Indexed: 04/06/2023]
Abstract
Theoretical and applied cancer studies that use individual-based models (IBMs) have been limited by the lack of a mathematical formulation that enables rigorous analysis of these models. However, spatial cumulant models (SCMs), which have arisen from theoretical ecology, describe population dynamics generated by a specific family of IBMs, namely spatio-temporal point processes (STPPs). SCMs are spatially resolved population models formulated by a system of differential equations that approximate the dynamics of two STPP-generated summary statistics: first-order spatial cumulants (densities), and second-order spatial cumulants (spatial covariances). We exemplify how SCMs can be used in mathematical oncology by modelling theoretical cancer cell populations comprising interacting growth factor-producing and non-producing cells. To formulate model equations, we use computational tools that enable the generation of STPPs, SCMs and mean-field population models (MFPMs) from user-defined model descriptions (Cornell et al. Nat Commun 10:4716, 2019). To calculate and compare STPP, SCM and MFPM-generated summary statistics, we develop an application-agnostic computational pipeline. Our results demonstrate that SCMs can capture STPP-generated population density dynamics, even when MFPMs fail to do so. From both MFPM and SCM equations, we derive treatment-induced death rates required to achieve non-growing cell populations. When testing these treatment strategies in STPP-generated cell populations, our results demonstrate that SCM-informed strategies outperform MFPM-informed strategies in terms of inhibiting population growths. We thus demonstrate that SCMs provide a new framework in which to study cell-cell interactions, and can be used to describe and perturb STPP-generated cell population dynamics. We, therefore, argue that SCMs can be used to increase IBMs' applicability in cancer research.
Collapse
Affiliation(s)
- Sara Hamis
- Tampere Institute for Advanced Study, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland.
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland.
| | - Panu Somervuo
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland
| | - J Arvid Ågren
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
| | - Dagim Shiferaw Tadele
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
- Department for Medical Genetics, Oslo University Hospital, Ullevål, Oslo, Norway
| | - Juha Kesseli
- Prostate Cancer Research Center, Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Centre, Tampere, Finland
| | - Jacob G Scott
- Department of Translational Hematology and Oncology Research, Cleveland Clinic, Cleveland, OH, USA
- Case Western Reserve School of Medicine, Cleveland, OH, USA
- Department of Radiation Oncology, Cleveland Clinic, Cleveland, OH, USA
| | - Matti Nykter
- Prostate Cancer Research Center, Faculty of Medicine and Health Technology, Tampere University and Tays Cancer Centre, Tampere, Finland
- Foundation for the Finnish Cancer Institute, Helsinki, Finland
| | - Philip Gerlee
- Mathematical Sciences, Chalmers University of Technology, Gothenburg, Sweden
- Mathematical Sciences, University of Gothenburg, Gothenburg, Sweden
| | - Dmitri Finkelshtein
- Department of Mathematics, Faculty of Science and Engineering, Swansea University, Swansea, UK
| | - Otso Ovaskainen
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland
- Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, University of Helsinki, Helsinki, Finland
- Department of Biology, Centre for Biodiversity Dynamics, Norwegian University of Science and Technology, Trondheim, Norway
| |
Collapse
|
25
|
Moravec JC, Lanfear R, Spector DL, Diermeier SD, Gavryushkin A. Testing for Phylogenetic Signal in Single-Cell RNA-Seq Data. J Comput Biol 2023; 30:518-537. [PMID: 36475926 PMCID: PMC10125402 DOI: 10.1089/cmb.2022.0357] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Phylogenetic methods are emerging as a useful tool to understand cancer evolutionary dynamics, including tumor structure, heterogeneity, and progression. Most currently used approaches utilize either bulk whole genome sequencing or single-cell DNA sequencing and are based on calling copy number alterations and single nucleotide variants (SNVs). Single-cell RNA sequencing (scRNA-seq) is commonly applied to explore differential gene expression of cancer cells throughout tumor progression. The method exacerbates the single-cell sequencing problem of low yield per cell with uneven expression levels. This accounts for low and uneven sequencing coverage and makes SNV detection and phylogenetic analysis challenging. In this article, we demonstrate for the first time that scRNA-seq data contain sufficient evolutionary signal and can also be utilized in phylogenetic analyses. We explore and compare results of such analyses based on both expression levels and SNVs called from scRNA-seq data. Both techniques are shown to be useful for reconstructing phylogenetic relationships between cells, reflecting the clonal composition of a tumor. Both standardized expression values and SNVs appear to be equally capable of reconstructing a similar pattern of phylogenetic relationship. This pattern is stable even when phylogenetic uncertainty is taken in account. Our results open up a new direction of somatic phylogenetics based on scRNA-seq data. Further research is required to refine and improve these approaches to capture the full picture of somatic evolutionary dynamics in cancer.
Collapse
Affiliation(s)
- Jiří C. Moravec
- Department of Computer Science, University of Otago, Dunedin, New Zealand
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Robert Lanfear
- Division of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australia
| | | | | | - Alex Gavryushkin
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
26
|
Jun SH, Toosi H, Mold J, Engblom C, Chen X, O'Flanagan C, Hagemann-Jensen M, Sandberg R, Aparicio S, Hartman J, Roth A, Lagergren J. Reconstructing clonal tree for phylo-phenotypic characterization of cancer using single-cell transcriptomics. Nat Commun 2023; 14:982. [PMID: 36813776 PMCID: PMC9946941 DOI: 10.1038/s41467-023-36202-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 01/20/2023] [Indexed: 02/24/2023] Open
Abstract
Functional characterization of the cancer clones can shed light on the evolutionary mechanisms driving cancer's proliferation and relapse mechanisms. Single-cell RNA sequencing data provide grounds for understanding the functional state of cancer as a whole; however, much research remains to identify and reconstruct clonal relationships toward characterizing the changes in functions of individual clones. We present PhylEx that integrates bulk genomics data with co-occurrences of mutations from single-cell RNA sequencing data to reconstruct high-fidelity clonal trees. We evaluate PhylEx on synthetic and well-characterized high-grade serous ovarian cancer cell line datasets. PhylEx outperforms the state-of-the-art methods both when comparing capacity for clonal tree reconstruction and for identifying clones. We analyze high-grade serous ovarian cancer and breast cancer data to show that PhylEx exploits clonal expression profiles beyond what is possible with expression-based clustering methods and clear the way for accurate inference of clonal trees and robust phylo-phenotypic analysis of cancer.
Collapse
Affiliation(s)
- Seong-Hwan Jun
- SciLifeLab, School of EECS, KTH Royal Institute of Technology, Stockholm, Sweden.,Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, USA
| | - Hosein Toosi
- SciLifeLab, School of EECS, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Jeff Mold
- Department of Cell and Molecular Biology, Karolinska Institutet, Solna, Sweden
| | - Camilla Engblom
- Department of Cell and Molecular Biology, Karolinska Institutet, Solna, Sweden
| | - Xinsong Chen
- Department of Oncology and Pathology, Karolinska Institutet, Solna, Sweden
| | - Ciara O'Flanagan
- Department of Molecular Oncology, BC Cancer, Vancouver, BC, Canada
| | | | - Rickard Sandberg
- Department of Cell and Molecular Biology, Karolinska Institutet, Solna, Sweden
| | - Samuel Aparicio
- Department of Molecular Oncology, BC Cancer, Vancouver, BC, Canada.,Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, Canada
| | - Johan Hartman
- Department of Oncology and Pathology, Karolinska Institutet, Solna, Sweden.,Department of Clinical Pathology and Cytology, Karolinska University Laboratory, Stockholm, Sweden
| | - Andrew Roth
- Department of Molecular Oncology, BC Cancer, Vancouver, BC, Canada. .,Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, Canada. .,Department of Computer Science, University of British Columbia, Vancouver, Canada.
| | - Jens Lagergren
- SciLifeLab, School of EECS, KTH Royal Institute of Technology, Stockholm, Sweden.
| |
Collapse
|
27
|
Ring A, Nguyen-Sträuli BD, Wicki A, Aceto N. Biology, vulnerabilities and clinical applications of circulating tumour cells. Nat Rev Cancer 2023; 23:95-111. [PMID: 36494603 PMCID: PMC9734934 DOI: 10.1038/s41568-022-00536-4] [Citation(s) in RCA: 69] [Impact Index Per Article: 69.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/07/2022] [Indexed: 12/13/2022]
Abstract
In recent years, exceptional technological advances have enabled the identification and interrogation of rare circulating tumour cells (CTCs) from blood samples of patients, leading to new fields of research and fostering the promise for paradigm-changing, liquid biopsy-based clinical applications. Analysis of CTCs has revealed distinct biological phenotypes, including the presence of CTC clusters and the interaction between CTCs and immune or stromal cells, impacting metastasis formation and providing new insights into cancer vulnerabilities. Here we review the progress made in understanding biological features of CTCs and provide insight into exploiting these developments to design future clinical tools for improving the diagnosis and treatment of cancer.
Collapse
Affiliation(s)
- Alexander Ring
- Department of Biology, Institute for Molecular Health Sciences, ETH Zurich, Zurich, Switzerland
- Department of Medical Oncology and Hematology, University Hospital Zurich and University of Zurich, Zurich, Switzerland
| | - Bich Doan Nguyen-Sträuli
- Department of Biology, Institute for Molecular Health Sciences, ETH Zurich, Zurich, Switzerland
- Department of Gynecology, University Hospital Zurich and University of Zurich, Zurich, Switzerland
| | - Andreas Wicki
- Department of Medical Oncology and Hematology, University Hospital Zurich and University of Zurich, Zurich, Switzerland
| | - Nicola Aceto
- Department of Biology, Institute for Molecular Health Sciences, ETH Zurich, Zurich, Switzerland.
| |
Collapse
|
28
|
Sashittal P, Zhang H, Iacobuzio-Donahue CA, Raphael BJ. ConDoR: Tumor phylogeny inference with a copy-number constrained mutation loss model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.05.522408. [PMID: 36711528 PMCID: PMC9882003 DOI: 10.1101/2023.01.05.522408] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Tumors consist of subpopulations of cells that harbor distinct collections of somatic mutations. These mutations range in scale from single nucleotide variants (SNVs) to large-scale copy-number aberrations (CNAs). While many approaches infer tumor phylogenies using SNVs as phylogenetic markers, CNAs that overlap SNVs may lead to erroneous phylogenetic inference. Specifically, an SNV may be lost in a cell due to a deletion of the genomic segment containing the SNV. Unfortunately, no current single-cell DNA sequencing (scDNA-seq) technology produces accurate measurements of both SNVs and CNAs. For instance, recent targeted scDNA-seq technologies, such as Mission Bio Tapestri, measure SNVs with high fidelity in individual cells, but yield much less reliable measurements of CNAs. We introduce a new evolutionary model, the constrained k-Dollo model, that uses SNVs as phylogenetic markers and partial information about CNAs in the form of clustering of cells with similar copy-number profiles. This copy-number clustering constrains where loss of SNVs can occur in the phylogeny. We develop ConDoR (Constrained Dollo Reconstruction), an algorithm to infer tumor phylogenies from targeted scDNA-seq data using the constrained k-Dollo model. We show that ConDoR outperforms existing methods on simulated data. We use ConDoR to analyze a new multi-region targeted scDNA-seq dataset of 2153 cells from a pancreatic ductal adenocarcinoma (PDAC) tumor and produce a more plausible phylogeny compared to existing methods that conforms to histological results for the tumor from a previous study. We also analyze a metastatic colorectal cancer dataset, deriving a more parsimonious phylogeny than previously published analyses and with a simpler monoclonal origin of metastasis compared to the original study. Code availability Software is available at https://github.com/raphael-group/constrained-Dollo.
Collapse
Affiliation(s)
| | - Haochen Zhang
- Gerstner Sloan Kettering Graduate School of Biomedical Sciences, Memorial Sloan Kettering Cancer Center, NY, USA
| | - Christine A. Iacobuzio-Donahue
- Human Oncology and Pathogenesis Program, Memorial Sloan Kettering Cancer Center, NY, USA
- David M. Rubenstein Center for Pancreatic Cancer Research, Memorial Sloan Kettering Cancer Center, NY, USA
- Department of Pathology and Laboratory Medicine, Memorial Sloan Kettering Cancer Center, NY, USA
| | | |
Collapse
|
29
|
Chen J. Timed hazard networks: Incorporating temporal difference for oncogenetic analysis. PLoS One 2023; 18:e0283004. [PMID: 36928529 PMCID: PMC10019724 DOI: 10.1371/journal.pone.0283004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 03/01/2023] [Indexed: 03/18/2023] Open
Abstract
Oncogenetic graphical models are crucial for understanding cancer progression by analyzing the accumulation of genetic events. These models are used to identify statistical dependencies and temporal order of genetic events, which helps design targeted therapies. However, existing algorithms do not account for temporal differences between samples in oncogenetic analysis. This paper introduces Timed Hazard Networks (TimedHN), a new statistical model that uses temporal differences to improve accuracy and reliability. TimedHN models the accumulation process as a continuous-time Markov chain and includes an efficient gradient computation algorithm for optimization. Our simulation experiments demonstrate that TimedHN outperforms current state-of-the-art graph reconstruction methods. We also compare TimedHN with existing methods on a luminal breast cancer dataset, highlighting its potential utility. The Matlab implementation and data are available at https://github.com/puar-playground/TimedHN.
Collapse
Affiliation(s)
- Jian Chen
- Department of Computer Science and Engineering, University at Buffalo, Buffalo, NY, United States of America
- * E-mail:
| |
Collapse
|
30
|
Seillier L, Peifer M. Reconstructing Phylogenetic Relationship in Bladder Cancer: A Methodological Overview. Methods Mol Biol 2023; 2684:113-132. [PMID: 37410230 DOI: 10.1007/978-1-0716-3291-8_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]
Abstract
Bladder cancer (BC) expresses itself as a highly heterogeneous disease both at the histological and molecular level, often occurring as synchronous or metachronous multifocal disease with high risk of recurrence and potential to metastasize. Multiple sequencing studies focusing on both non-muscle-invasive bladder cancer (NMIBC) and muscle-invasive bladder cancer (MIBC) gave insights into the extent of both inter- and intrapatient heterogeneity, but many questions on clonal evolution in BC remain unanswered. In this review article, we provide an overview over the technical and theoretical concepts linked to reconstructing evolutionary trajectories in BC and propose a set of tools and established software for phylogenetic analysis.
Collapse
Affiliation(s)
| | - Martin Peifer
- Department of Translational Genomics, University of Cologne, Cologne, Germany
| |
Collapse
|
31
|
Moen MT, Johnston IG. HyperHMM: efficient inference of evolutionary and progressive dynamics on hypercubic transition graphs. Bioinformatics 2022; 39:6895098. [PMID: 36511587 PMCID: PMC9848056 DOI: 10.1093/bioinformatics/btac803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 11/11/2022] [Accepted: 12/12/2022] [Indexed: 12/15/2022] Open
Abstract
MOTIVATION The evolution of bacterial drug resistance and other features in biology, the progression of cancer and other diseases and a wide range of broader questions can often be viewed as the sequential stochastic acquisition of binary traits (e.g. genetic changes, symptoms or characters). Using potentially noisy or incomplete data to learn the sequences by which such traits are acquired is a problem of general interest. The problem is complicated for large numbers of traits, which may, individually or synergistically, influence the probability of further acquisitions both positively and negatively. Hypercubic inference approaches, based on hidden Markov models on a hypercubic transition network, address these complications, but previous Bayesian instances can consume substantial time for converged results, limiting their practical use. RESULTS Here, we introduce HyperHMM, an adapted Baum-Welch (expectation-maximization) algorithm for hypercubic inference with resampling to quantify uncertainty, and show that it allows orders-of-magnitude faster inference while making few practical sacrifices compared to previous hypercubic inference approaches. We show that HyperHMM allows any combination of traits to exert arbitrary positive or negative influence on the acquisition of other traits, relaxing a common limitation of only independent trait influences. We apply this approach to synthetic and biological datasets and discuss its more general application in learning evolutionary and progressive pathways. AVAILABILITY AND IMPLEMENTATION Code for inference and visualization, and data for example cases, is freely available at https://github.com/StochasticBiology/hypercube-hmm. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Marcus T Moen
- Department of Mathematics, University of Bergen, Bergen, Vestland, Norway
| | | |
Collapse
|
32
|
Sankaran VG, Weissman JS, Zon LI. Cellular barcoding to decipher clonal dynamics in disease. Science 2022; 378:eabm5874. [PMID: 36227997 PMCID: PMC10111813 DOI: 10.1126/science.abm5874] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Cellular barcodes are distinct DNA sequences that enable one to track specific cells across time or space. Recent advances in our ability to detect natural or synthetic cellular barcodes, paired with single-cell readouts of cell state, have markedly increased our knowledge of clonal dynamics and genealogies of the cells that compose a variety of tissues and organs. These advances hold promise to redefine our view of human disease. Here, we provide an overview of cellular barcoding approaches, discuss applications to gain new insights into disease mechanisms, and provide an outlook on future applications. We discuss unanticipated insights gained through barcoding in studies of cancer and blood cell production and describe how barcoding can be applied to a growing array of medical fields, particularly with the increasing recognition of clonal contributions in human diseases.
Collapse
Affiliation(s)
- Vijay G Sankaran
- Division of Hematology and Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Harvard Stem Cell Institute, Cambridge, MA 02138, USA
| | - Jonathan S Weissman
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.,David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.,Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Leonard I Zon
- Division of Hematology and Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02115, USA.,Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.,Harvard Stem Cell Institute, Cambridge, MA 02138, USA.,Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.,Stem Cell Program, Boston Children's Hospital, Harvard Medical School, Boston, MA 02115, USA.,Harvard Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
33
|
Somatic variation in normal tissues: friend or foe of cancer early detection? Ann Oncol 2022; 33:1239-1249. [PMID: 36162751 DOI: 10.1016/j.annonc.2022.09.156] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 09/03/2022] [Accepted: 09/10/2022] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND Seemingly normal tissues progressively become populated by mutant clones over time. Most of these clones bear mutations in well-known cancer genes but only rarely do they transform into cancer. This poses questions on what triggers cancer initiation and what implications somatic variation has for cancer early detection. DESIGN We analysed recent mutational screens of healthy and cancer-free diseased tissues to compare somatic drivers and the causes of somatic variation across tissues. We then reviewed the mechanisms of clonal expansion and their relationships with age and diseases other than cancer. We finally discussed the relevance of somatic variation for cancer initiation and how it can help or hinder cancer detection and prevention. RESULTS The extent of somatic variation is highly variable across tissues and depends on intrinsic features, such as tissue architecture and turnover, as well as the exposure to endogenous and exogenous insults. Most somatic mutations driving clonal expansion are tissue-specific and inactivate tumor suppressor genes involved in chromatin modification and cell growth signaling. Some of these genes are more frequently mutated in normal tissues than cancer, indicating a context-dependent cancer promoting or protective role. Mutant clones can persist over a long time or disappear rapidly, suggesting that their fitness depends on the dynamic equilibrium with the environment. The disruption of this equilibrium is likely responsible for their transformation into malignant clones and knowing what triggers this process is key for cancer prevention and early detection. Somatic variation should be considered in liquid biopsy, where it may contribute cancer-independent mutations, and in the identification of cancer drivers, since not all mutated genes favoring clonal expansion also drive tumorigenesis. CONCLUSIONS Somatic variation and the factors governing homeostasis of normal tissues should be taken into account when devising strategies for cancer prevention and early detection.
Collapse
|
34
|
Pellegrina L, Vandin F. Discovering significant evolutionary trajectories in cancer phylogenies. Bioinformatics 2022; 38:ii49-ii55. [PMID: 36124798 DOI: 10.1093/bioinformatics/btac467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION Tumors are the result of a somatic evolutionary process leading to substantial intra-tumor heterogeneity. Single-cell and multi-region sequencing enable the detailed characterization of the clonal architecture of tumors and have highlighted its extensive diversity across tumors. While several computational methods have been developed to characterize the clonal composition and the evolutionary history of tumors, the identification of significantly conserved evolutionary trajectories across tumors is still a major challenge. RESULTS We present a new algorithm, MAximal tumor treeS TRajectOries (MASTRO), to discover significantly conserved evolutionary trajectories in cancer. MASTRO discovers all conserved trajectories in a collection of phylogenetic trees describing the evolution of a cohort of tumors, allowing the discovery of conserved complex relations between alterations. MASTRO assesses the significance of the trajectories using a conditional statistical test that captures the coherence in the order in which alterations are observed in different tumors. We apply MASTRO to data from nonsmall-cell lung cancer bulk sequencing and to acute myeloid leukemia data from single-cell panel sequencing, and find significant evolutionary trajectories recapitulating and extending the results reported in the original studies. AVAILABILITY AND IMPLEMENTATION MASTRO is available at https://github.com/VandinLab/MASTRO. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Leonardo Pellegrina
- Department of Information Engineering, University of Padova, Padova, 35129, Italy
| | - Fabio Vandin
- Department of Information Engineering, University of Padova, Padova, 35129, Italy
| |
Collapse
|
35
|
Kızılkale C, Rashidi Mehrabadi F, Sadeqi Azer E, Pérez-Guijarro E, Marie KL, Lee MP, Day CP, Merlino G, Ergün F, Buluç A, Sahinalp SC, Malikić S. Fast intratumor heterogeneity inference from single-cell sequencing data. NATURE COMPUTATIONAL SCIENCE 2022; 2:577-583. [PMID: 38177468 PMCID: PMC10765963 DOI: 10.1038/s43588-022-00298-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 07/14/2022] [Indexed: 01/06/2024]
Abstract
We introduce HUNTRESS, a computational method for mutational intratumor heterogeneity inference from noisy genotype matrices derived from single-cell sequencing data, the running time of which is linear with the number of cells and quadratic with the number of mutations. We prove that, under reasonable conditions, HUNTRESS computes the true progression history of a tumor with high probability. On simulated and real tumor sequencing data, HUNTRESS is demonstrated to be faster than available alternatives with comparable or better accuracy. Additionally, the progression histories of tumors inferred by HUNTRESS on real single-cell sequencing datasets agree with the best known evolution scenarios for the associated tumors.
Collapse
Affiliation(s)
- Can Kızılkale
- Department of Electrical Engineering and Computer Sciences UC Berkeley, Berkeley, CA, USA
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Farid Rashidi Mehrabadi
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Department of Computer Science, Indiana University, Bloomington, IN, USA
| | - Erfan Sadeqi Azer
- Department of Computer Science, Indiana University, Bloomington, IN, USA
- Google LLC, Sunnyvale, CA, USA
| | - Eva Pérez-Guijarro
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Kerrie L Marie
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Maxwell P Lee
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Chi-Ping Day
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Glenn Merlino
- Laboratory of Cancer Biology and Genetics, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Funda Ergün
- Department of Computer Science, Indiana University, Bloomington, IN, USA
| | - Aydın Buluç
- Department of Electrical Engineering and Computer Sciences UC Berkeley, Berkeley, CA, USA
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - S Cenk Sahinalp
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Salem Malikić
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| |
Collapse
|
36
|
Hui S, Nielsen R. SCONCE2: jointly inferring single cell copy number profiles and tumor evolutionary distances. BMC Bioinformatics 2022; 23:348. [PMID: 35986254 PMCID: PMC9392257 DOI: 10.1186/s12859-022-04890-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Accepted: 08/11/2022] [Indexed: 11/24/2022] Open
Abstract
Background Single cell whole genome tumor sequencing can yield novel insights into the evolutionary history of somatic copy number alterations. Existing single cell copy number calling methods do not explicitly model the shared evolutionary process of multiple cells, and generally analyze cells independently. Additionally, existing methods for estimating tumor cell phylogenies using copy number profiles are sensitive to profile estimation errors. Results We present SCONCE2, a method for jointly calling copy number alterations and estimating pairwise distances for single cell sequencing data. Using simulations, we show that SCONCE2 has higher accuracy in copy number calling and phylogeny estimation than competing methods. We apply SCONCE2 to previously published single cell sequencing data to illustrate the utility of the method. Conclusions SCONCE2 jointly estimates copy number profiles and a distance metric for inferring tumor phylogenies in single cell whole genome tumor sequencing across multiple cells, enabling deeper understandings of tumor evolution. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04890-w.
Collapse
|
37
|
Ahmadinejad N, Troftgruben S, Wang J, Chandrashekar PB, Dinu V, Maley C, Liu L. Accurate Identification of Subclones in Tumor Genomes. Mol Biol Evol 2022; 39:6617617. [PMID: 35749590 PMCID: PMC9260306 DOI: 10.1093/molbev/msac136] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Understanding intra-tumor heterogeneity is critical for studying tumorigenesis and designing personalized treatments. To decompose the mixed cell population in a tumor, subclones are inferred computationally based on variant allele frequency (VAF) from bulk sequencing data. In this study, we showed that sequencing depth, mean VAF, and variance of VAF of a subclone are confounded. Without considering this effect, current methods require deep-sequencing data (>300x depth) to reliably infer subclones. Here we present a novel algorithm that incorporates depth-variance and mean-variance dependencies in a clustering error model and successfully identifies subclones in tumors sequenced at depths of as low as 30x. We implemented the algorithm as a model-based adaptive grouping of subclones (MAGOS) method. Analyses of computer simulated data and empirical sequencing data showed that MAGOS outperformed existing methods on minimum sequencing depth, decomposition accuracy, and computation efficiency. The most prominent improvements were observed in analyzing tumors sequenced at depths between 30x and 200x, while the performance was comparable between MAGOS and existing methods on deeply sequenced tumors. MAGOS supports analysis of single nucleotide variants and copy number variants from a single sample or multiple samples of a tumor. We applied MAGOS to whole-exome data of late-stage liver cancers and discovered that high subclone count in a tumor was a significant risk factor of poor prognosis. Lastly, our analysis suggested that sequencing multiple samples of the same tumor at standard depth is more cost-effective and robust for subclone characterization than deep sequencing a single sample. MAGOS is available at github (https://github.com/liliulab/magos).
Collapse
Affiliation(s)
- Navid Ahmadinejad
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA.,Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| | - Shayna Troftgruben
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA
| | - Junwen Wang
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA.,Department of Health Sciences Research & Center for Individualized Medicine, Mayo Clinic Arizona, Scottsdale, AZ, 85259, USA
| | - Pramod B Chandrashekar
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA.,Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| | - Valentin Dinu
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA.,Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| | - Carlo Maley
- Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| | - Li Liu
- College of Health Solutions, Arizona State University, Phoenix, AZ, 85054, USA.,Biodesign Institute, Arizona State University, Tempe, AZ, 85281, USA
| |
Collapse
|
38
|
African mitochondrial haplogroup L7: a 100,000-year-old maternal human lineage discovered through reassessment and new sequencing. Sci Rep 2022; 12:10747. [PMID: 35750688 PMCID: PMC9232647 DOI: 10.1038/s41598-022-13856-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 05/30/2022] [Indexed: 11/17/2022] Open
Abstract
Archaeological and genomic evidence suggest that modern Homo sapiens have roamed the planet for some 300–500 thousand years. In contrast, global human mitochondrial (mtDNA) diversity coalesces to one African female ancestor (“Mitochondrial Eve”) some 145 thousand years ago, owing to the ¼ gene pool size of our matrilineally inherited haploid genome. Therefore, most of human prehistory was spent in Africa where early ancestors of Southern African Khoisan and Central African rainforest hunter-gatherers (RFHGs) segregated into smaller groups. Their subdivisions followed climatic oscillations, new modes of subsistence, local adaptations, and cultural-linguistic differences, all prior to their exodus out of Africa. Seven African mtDNA haplogroups (L0–L6) traditionally captured this ancient structure—these L haplogroups have formed the backbone of the mtDNA tree for nearly two decades. Here we describe L7, an eighth haplogroup that we estimate to be ~ 100 thousand years old and which has been previously misclassified in the literature. In addition, L7 has a phylogenetic sublineage L7a*, the oldest singleton branch in the human mtDNA tree (~ 80 thousand years). We found that L7 and its sister group L5 are both low-frequency relics centered around East Africa, but in different populations (L7: Sandawe; L5: Mbuti). Although three small subclades of African foragers hint at the population origins of L5'7, the majority of subclades are divided into Afro-Asiatic and eastern Bantu groups, indicative of more recent admixture. A regular re-estimation of the entire mtDNA haplotype tree is needed to ensure correct cladistic placement of new samples in the future.
Collapse
|
39
|
Chen K, Moravec JÍC, Gavryushkin A, Welch D, Drummond AJ. Accounting for errors in data improves divergence time estimates in single-cell cancer evolution. Mol Biol Evol 2022; 39:6613463. [PMID: 35733333 PMCID: PMC9356729 DOI: 10.1093/molbev/msac143] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Single-cell sequencing provides a new way to explore the evolutionary history of cells. Compared to traditional bulk sequencing, where a population of heterogeneous cells is pooled to form a single observation, single-cell sequencing isolates and amplifies genetic material from individual cells, thereby preserving the information about the origin of the sequences. However, single-cell data is more error-prone than bulk sequencing data due to the limited genomic material available per cell. Here, we present error and mutation models for evolutionary inference of single-cell data within a mature and extensible Bayesian framework, BEAST2. Our framework enables integration with biologically informative models such as relaxed molecular clocks and population dynamic models. Our simulations show that modeling errors increase the accuracy of relative divergence times and substitution parameters. We reconstruct the phylogenetic history of a colorectal cancer patient and a healthy patient from single-cell DNA sequencing data. We find that the estimated times of terminal splitting events are shifted forward in time compared to models which ignore errors. We observed that not accounting for errors can overestimate the phylogenetic diversity in single-cell DNA sequencing data. We estimate that 30-50% of the apparent diversity can be attributed to error. Our work enables a full Bayesian approach capable of accounting for errors in the data within the integrative Bayesian software framework BEAST2.
Collapse
Affiliation(s)
- Kylie Chen
- School of Computer Science, University of Auckland, Auckland, New Zealand
| | - Jiř Í C Moravec
- Department of Computer Science, University of Otago, Dunedin, New Zealand.,School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Alex Gavryushkin
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - David Welch
- School of Computer Science, University of Auckland, Auckland, New Zealand
| | - Alexei J Drummond
- School of Computer Science, University of Auckland, Auckland, New Zealand.,School of Biological Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
40
|
Khandai K, Navarro-Martinez C, Smith B, Buonopane R, Byun SA, Patterson M. Determining Significant Correlation Between Pairs of Extant Characters in a Small Parsimony Framework. J Comput Biol 2022; 29:1132-1154. [PMID: 35723627 DOI: 10.1089/cmb.2022.0141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
When studying the evolutionary relationships among a set of species, the principle of parsimony states that a relationship involving the fewest number of evolutionary events is likely the correct one. Due to its simplicity, this principle was formalized in the context of computational evolutionary biology decades ago by, for example, Fitch and Sankoff. Because the parsimony framework does not require a model of evolution, unlike maximum likelihood or Bayesian approaches, it is often a good starting point when no reasonable estimate of such a model is available. In this work, we devise a method for determining if pairs of discrete characters are significantly correlated across all most parsimonious reconstructions, given a set of species on these characters, and an evolutionary tree. The first step of this method is to use Sankoff's algorithm to compute all most parsimonious assignments of ancestral states (of each character) to the internal nodes of the phylogeny. Correlation between a pair of evolutionary events (e.g., absent to present) for a pair of characters is then determined by the (co-) occurrence patterns between the sets of their respective ancestral assignments. The probability of obtaining a correlation this extreme (or more) under a null hypothesis where the events happen randomly on the evolutionary tree is then used to assess the significance of this correlation. We implement this method: parcours (PARsimonious CO-occURrenceS) and use it to identify significantly correlated evolution among vocalizations and morphological characters in the Felidae family.
Collapse
Affiliation(s)
- Kaustubh Khandai
- Department of Computer Science, Georgia State University, Atlanta, Georgia, USA
| | | | - Brendan Smith
- Department of Biology, Fairfield University, Fairfield, Connecticut, USA
| | - Rebecca Buonopane
- Department of Biology, Fairfield University, Fairfield, Connecticut, USA
| | - Soyong Ashley Byun
- Department of Biology, Fairfield University, Fairfield, Connecticut, USA
| | - Murray Patterson
- Department of Computer Science, Georgia State University, Atlanta, Georgia, USA
| |
Collapse
|
41
|
Wang W, Chen Y, Wu L, Zhang Y, Yoo S, Chen Q, Liu S, Hou Y, Chen XP, Chen Q, Zhu J. HBV genome-enriched single cell sequencing revealed heterogeneity in HBV-driven hepatocellular carcinoma (HCC). BMC Med Genomics 2022; 15:134. [PMID: 35710421 PMCID: PMC9205089 DOI: 10.1186/s12920-022-01264-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 05/05/2022] [Indexed: 11/11/2022] Open
Abstract
BACKGROUND Hepatitis B virus (HBV) related hepatocellular carcinoma (HCC) is heterogeneous and frequently contains multifocal tumors, but how the multifocal tumors relate to each other in terms of HBV integration and other genomic patterns is not clear. METHODS To interrogate heterogeneity of HBV-HCC, we developed a HBV genome enriched single cell sequencing (HGE-scSeq) procedure and a computational method to identify HBV integration sites and infer DNA copy number variations (CNVs). RESULTS We performed HGE-scSeq on 269 cells from four tumor sites and two tumor thrombi of a HBV-HCC patient. HBV integrations were identified in 142 out of 269 (53%) cells sequenced, and were enriched in two HBV integration hotspots chr1:34,397,059 (CSMD2) and chr8:118,557,327 (MED30/EXT1). There were also 162 rare integration sites. HBV integration sites were enriched in DNA fragile sites and sequences around HBV integration sites were enriched for microhomologous sequences between human and HBV genomes. CNVs were inferred for each individual cell and cells were grouped into four clonal groups based on their CNVs. Cells in different clonal groups had different degrees of HBV integration heterogeneity. All of 269 cells carried chromosome 1q amplification, a recurrent feature of HCC tumors, suggesting that 1q amplification occurred before HBV integration events in this case study. Further, we performed simulation studies to demonstrate that the sequential events (HBV infecting transformed cells) could result in the observed phenotype with biologically reasonable parameters. CONCLUSION Our HGE-scSeq data reveals high heterogeneity of HCC tumor cells in terms of both HBV integrations and CNVs. There were two HBV integration hotspots across cells, and cells from multiple tumor sites shared some HBV integration and CNV patterns.
Collapse
Affiliation(s)
- Wenhui Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Ave., New York, NY, 10029, USA
- Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Sema4, Stamford, CT, USA
| | - Yan Chen
- The Hepatic Surgery Centre at Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology (HUST), Wuhan, China
| | | | - Yi Zhang
- Department of Mathematics, Hebei University of Science and Technology, Shijiazhuang, Hebei, China
| | - Seungyeul Yoo
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Ave., New York, NY, 10029, USA
- Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Sema4, Stamford, CT, USA
| | - Quan Chen
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Ave., New York, NY, 10029, USA
- Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Sema4, Stamford, CT, USA
| | | | | | - Xiao-Ping Chen
- The Hepatic Surgery Centre at Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology (HUST), Wuhan, China
| | - Qian Chen
- The Division of Gastroenterology, Department of Internal Medicine at Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology (HUST), Wuhan, China.
| | - Jun Zhu
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, 1425 Madison Ave., New York, NY, 10029, USA.
- Icahn Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
- Sema4, Stamford, CT, USA.
- The Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| |
Collapse
|
42
|
Markowska M, Cąkała T, Miasojedow B, Aybey B, Juraeva D, Mazur J, Ross E, Staub E, Szczurek E. CONET: copy number event tree model of evolutionary tumor history for single-cell data. Genome Biol 2022; 23:128. [PMID: 35681161 PMCID: PMC9185904 DOI: 10.1186/s13059-022-02693-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 05/23/2022] [Indexed: 11/10/2022] Open
Abstract
Copy number alterations constitute important phenomena in tumor evolution. Whole genome single-cell sequencing gives insight into copy number profiles of individual cells, but is highly noisy. Here, we propose CONET, a probabilistic model for joint inference of the evolutionary tree on copy number events and copy number calling. CONET employs an efficient, regularized MCMC procedure to search the space of possible model structures and parameters. We introduce a range of model priors and penalties for efficient regularization. CONET reveals copy number evolution in two breast cancer samples, and outperforms other methods in tree reconstruction, breakpoint identification and copy number calling.
Collapse
Affiliation(s)
- Magda Markowska
- University of Warsaw, Faculty of Mathematics, Informatics and Mechanics, Banacha 2, Warsaw, Poland.,Medical University of Warsaw, Postgraduate School of Molecular Medicine, Ks. Trojdena 2a Street, Warsaw, Poland
| | - Tomasz Cąkała
- University of Warsaw, Faculty of Mathematics, Informatics and Mechanics, Banacha 2, Warsaw, Poland
| | - BłaŻej Miasojedow
- University of Warsaw, Faculty of Mathematics, Informatics and Mechanics, Banacha 2, Warsaw, Poland
| | - Bogac Aybey
- Merck Healthcare KGaA, Translational Medicine, Oncology Bioinformatics, Frankfurter Str. 250, Darmstadt, 64293, Germany
| | - Dilafruz Juraeva
- Merck Healthcare KGaA, Translational Medicine, Oncology Bioinformatics, Frankfurter Str. 250, Darmstadt, 64293, Germany
| | - Johanna Mazur
- Merck Healthcare KGaA, Translational Medicine, Oncology Bioinformatics, Frankfurter Str. 250, Darmstadt, 64293, Germany
| | - Edith Ross
- Merck Healthcare KGaA, Translational Medicine, Oncology Bioinformatics, Frankfurter Str. 250, Darmstadt, 64293, Germany
| | - Eike Staub
- Merck Healthcare KGaA, Translational Medicine, Oncology Bioinformatics, Frankfurter Str. 250, Darmstadt, 64293, Germany
| | - Ewa Szczurek
- University of Warsaw, Faculty of Mathematics, Informatics and Mechanics, Banacha 2, Warsaw, Poland.
| |
Collapse
|
43
|
Nicol PB, Barabási DL, Coombes KR, Asiaee A. SITH
: An R package for visualizing and analyzing a spatial model of intratumor heterogeneity. COMPUTATIONAL AND SYSTEMS ONCOLOGY 2022; 2. [DOI: 10.1002/cso2.1033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Affiliation(s)
- Phillip B. Nicol
- Department of Biostatistics Harvard University Boston Massachusetts USA
| | | | - Kevin R. Coombes
- Department of Biomedical Informatics Ohio State University Columbus Ohio USA
| | - Amir Asiaee
- Department of Biostatistics Vanderbilt University Nashville TN USA
| |
Collapse
|
44
|
Yang D, Jones MG, Naranjo S, Rideout WM, Min KHJ, Ho R, Wu W, Replogle JM, Page JL, Quinn JJ, Horns F, Qiu X, Chen MZ, Freed-Pastor WA, McGinnis CS, Patterson DM, Gartner ZJ, Chow ED, Bivona TG, Chan MM, Yosef N, Jacks T, Weissman JS. Lineage tracing reveals the phylodynamics, plasticity, and paths of tumor evolution. Cell 2022; 185:1905-1923.e25. [PMID: 35523183 DOI: 10.1016/j.cell.2022.04.015] [Citation(s) in RCA: 102] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 02/09/2022] [Accepted: 04/08/2022] [Indexed: 12/19/2022]
Abstract
Tumor evolution is driven by the progressive acquisition of genetic and epigenetic alterations that enable uncontrolled growth and expansion to neighboring and distal tissues. The study of phylogenetic relationships between cancer cells provides key insights into these processes. Here, we introduced an evolving lineage-tracing system with a single-cell RNA-seq readout into a mouse model of Kras;Trp53(KP)-driven lung adenocarcinoma and tracked tumor evolution from single-transformed cells to metastatic tumors at unprecedented resolution. We found that the loss of the initial, stable alveolar-type2-like state was accompanied by a transient increase in plasticity. This was followed by the adoption of distinct transcriptional programs that enable rapid expansion and, ultimately, clonal sweep of stable subclones capable of metastasizing. Finally, tumors develop through stereotypical evolutionary trajectories, and perturbing additional tumor suppressors accelerates progression by creating novel trajectories. Our study elucidates the hierarchical nature of tumor evolution and, more broadly, enables in-depth studies of tumor progression.
Collapse
Affiliation(s)
- Dian Yang
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Matthew G Jones
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Biological and Medical Informatics Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Integrative Program in Quantitative Biology, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Santiago Naranjo
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - William M Rideout
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Kyung Hoi Joseph Min
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Raymond Ho
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Wei Wu
- Department of Medicine, University of California, San Francisco, San Francisco, CA 94158, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Joseph M Replogle
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Medical Scientist Training Program, University of California, San Francisco, San Francisco, CA 94158, USA; Tetrad Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Jennifer L Page
- Cell and Genome Engineering Core, University of California San Francisco, San Francisco, CA 94158, USA
| | - Jeffrey J Quinn
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Felix Horns
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Xiaojie Qiu
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | - Michael Z Chen
- Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Medical Scientist Training Program, Harvard Medical School, Boston, MA 02115, USA
| | - William A Freed-Pastor
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215, USA
| | - Christopher S McGinnis
- Tetrad Graduate Program, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - David M Patterson
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Zev J Gartner
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA 94158, USA; Chan Zuckerberg BioHub Investigator, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Cellular Construction, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Eric D Chow
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, CA 94158, USA; Center for Advanced Technology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Trever G Bivona
- Department of Medicine, University of California, San Francisco, San Francisco, CA 94158, USA; Helen Diller Family Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Michelle M Chan
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA; Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
| | - Nir Yosef
- Center for Computational Biology, University of California, Berkeley, Berkeley, CA 94720, USA; Chan Zuckerberg BioHub Investigator, University of California, San Francisco, San Francisco, CA 94158, USA; Department of Electrical Engineering and Computer Science, University of California Berkeley, Berkeley, CA 94720, USA; Ragon Institute of Massachusetts General Hospital, MIT and Harvard University, Cambridge, MA, USA.
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.
| | - Jonathan S Weissman
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA; Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Whitehead Institute for Biomedical Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; David H. Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.
| |
Collapse
|
45
|
Zeng Z, Li W, Zhang D, Zhang C, Jiang X, Guo R, Wang Z, Yang C, Yan H, Zhang Z, Wang Q, Huang R, Zhao Q, Li B, Hu X, Gao L. Development of a Chemoresistant Risk Scoring Model for Prechemotherapy Osteosarcoma Using Single-Cell Sequencing. Front Oncol 2022; 12:893282. [PMID: 35664733 PMCID: PMC9159767 DOI: 10.3389/fonc.2022.893282] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Accepted: 04/07/2022] [Indexed: 12/13/2022] Open
Abstract
Background Chemoresistance is one of the leading causes that severely limits the success of osteosarcoma treatment. Evaluating chemoresistance before chemotherapy poses a new challenge for researchers. We established an effective chemoresistance risk scoring model for prechemotherapy osteosarcoma using single-cell sequencing. Methods We comprehensively analyzed osteosarcoma data from the bulk mRNA sequencing dataset TARGET-OS and the single-cell RNA sequencing (scRNA-seq) dataset GSE162454. Chemoresistant tumor clusters were identified using enrichment analysis and AUCell scoring. Its differentiated trajectory was achieved with inferCNV and pseudotime analysis. Ligand-receptor interactions were annotated with iTALK. Furthermore, we established a chemoresistance risk scoring model using LASSO regression based on scRNA-seq-based markers of chemoresistant tumor clusters. The TARGET-OS dataset was used as the training group, and the bulk mRNA array dataset GSE33382 was used as the validation group. Finally, the performance was verified for its discriminatory ability and calibration. Results Using bulk RNA data, we found that osteogenic expression was upregulated in chemoresistant osteosarcoma as compared to chemosensitive osteosarcoma. Then, we transferred the bulk RNA findings to scRNA-seq and noticed osteosarcoma tumor clusters C14 and C25 showing osteogenic cancer stem cell expression patterns, which fit chemoresistant characteristics. C14 and C25 possessed bridge roles in interactions with other clusters. On the one hand, they received various growth factor stimulators and could potentially transform into a proliferative state. On the other hand, they promote local tumor angiogenesis, bone remodeling and immunosuppression. Next, we identified a ten-gene signature from the C14 and C25 markers and constructed a chemoresistant risk scoring model using LASSO regression model. Finally, we found that chemoresistant osteosarcoma had higher chemoresistance risk score and that the model showed good discriminatory ability and calibration in both the training and validation groups (AUCtrain = 0.82; AUCvalid = 0.84). Compared with that of the classic bulk RNA-based model, it showed more robust performance in validation environment (AUCvalid-scRNA = 0.84; AUCvalid-bulk DEGs = 0.54). Conclusions Our work provides insights into understanding chemoresistant osteosarcoma tumor cells and using single-cell sequencing to establish a chemoresistance risk scoring model. The model showed good discriminatory ability and calibration and provided us with a feasible way to evaluate chemoresistance in prechemotherapy osteosarcoma.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | - Bo Li
- Department of Orthopedics, Sun Yat-sen Memorial Hospital, Guangzhou, China
| | - Xumin Hu
- Department of Orthopedics, Sun Yat-sen Memorial Hospital, Guangzhou, China
| | - Liangbin Gao
- Department of Orthopedics, Sun Yat-sen Memorial Hospital, Guangzhou, China
| |
Collapse
|
46
|
Abstract
Distilling biologically meaningful information from cancer genome sequencing data requires comprehensive identification of somatic alterations using rigorous computational methods. As the amount and complexity of sequencing data have increased, so has the number of tools for analysing them. Here, we describe the main steps involved in the bioinformatic analysis of cancer genomes, review key algorithmic developments and highlight popular tools and emerging technologies. These tools include those that identify point mutations, copy number alterations, structural variations and mutational signatures in cancer genomes. We also discuss issues in experimental design, the strengths and limitations of sequencing modalities and methodological challenges for the future.
Collapse
|
47
|
Sepich-Poore GD, Guccione C, Laplane L, Pradeu T, Curtius K, Knight R. Cancer's second genome: Microbial cancer diagnostics and redefining clonal evolution as a multispecies process: Humans and their tumors are not aseptic, and the multispecies nature of cancer modulates clinical care and clonal evolution: Humans and their tumors are not aseptic, and the multispecies nature of cancer modulates clinical care and clonal evolution. Bioessays 2022; 44:e2100252. [PMID: 35253252 PMCID: PMC10506734 DOI: 10.1002/bies.202100252] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2021] [Revised: 01/31/2022] [Accepted: 02/16/2022] [Indexed: 12/13/2022]
Abstract
The presence and role of microbes in human cancers has come full circle in the last century. Tumors are no longer considered aseptic, but implications for cancer biology and oncology remain underappreciated. Opportunities to identify and build translational diagnostics, prognostics, and therapeutics that exploit cancer's second genome-the metagenome-are manifold, but require careful consideration of microbial experimental idiosyncrasies that are distinct from host-centric methods. Furthermore, the discoveries of intracellular and intra-metastatic cancer bacteria necessitate fundamental changes in describing clonal evolution and selection, reflecting bidirectional interactions with non-human residents. Reconsidering cancer clonality as a multispecies process similarly holds key implications for understanding metastasis and prognosing therapeutic resistance while providing rational guidance for the next generation of bacterial cancer therapies. Guided by these new findings and challenges, this Review describes opportunities to exploit cancer's metagenome in oncology and proposes an evolutionary framework as a first step towards modeling multispecies cancer clonality. Also see the video abstract here: https://youtu.be/-WDtIRJYZSs.
Collapse
Affiliation(s)
| | - Caitlin Guccione
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Lucie Laplane
- Institut d’histoire et de philosophie des sciences et des techniques (UMR8590), CNRS & Panthéon-Sorbonne University, 75006 Paris, France
- Hematopoietic stem cells and the development of myeloid malignancies (UMR1287), Gustave Roussy Cancer Campus, 94800 Villejuif, France
| | - Thomas Pradeu
- ImmunoConcept (UMR5164), CNRS & University of Bordeaux, 33076 Bordeaux Cedex, France
| | - Kit Curtius
- Division of Biomedical Informatics, Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA 92093, USA
| | - Rob Knight
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
48
|
Chen Z, Gong F, Wan L, Ma L. BiTSC
2: Bayesian inference of tumor clonal tree by joint analysis of single-cell SNV and CNA data. Brief Bioinform 2022; 23:6562684. [PMID: 35368055 PMCID: PMC9116244 DOI: 10.1093/bib/bbac092] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 01/29/2022] [Accepted: 02/23/2022] [Indexed: 12/14/2022] Open
Abstract
Abstract
The rapid development of single-cell DNA sequencing (scDNA-seq) technology has greatly enhanced the resolution of tumor cell profiling, providing an unprecedented perspective in characterizing intra-tumoral heterogeneity and understanding tumor progression and metastasis. However, prominent algorithms for constructing tumor phylogeny based on scDNA-seq data usually only take single nucleotide variations (SNVs) as markers, failing to consider the effect caused by copy number alterations (CNAs). Here, we propose BiTSC$^2$, Bayesian inference of Tumor clonal Tree by joint analysis of Single-Cell SNV and CNA data. BiTSC$^2$ takes raw reads from scDNA-seq as input, accounts for the overlapping of CNA and SNV, models allelic dropout rate, sequencing errors and missing rate, as well as assigns single cells into subclones. By applying Markov Chain Monte Carlo sampling, BiTSC$^2$ can simultaneously estimate the subclonal scCNA and scSNV genotype matrices, subclonal assignments and tumor subclonal evolutionary tree. In comparison with existing methods on synthetic and real tumor data, BiTSC$^2$ shows high accuracy in genotype recovery, subclonal assignment and tree reconstruction. BiTSC$^2$ also performs robustly in dealing with scDNA-seq data with low sequencing depth and variant missing rate. BiTSC$^2$ software is available at https://github.com/ucasdp/BiTSC2.
Collapse
Affiliation(s)
- Ziwei Chen
- Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, 100101, Beijing, Country
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, 100190, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Yuquan Road, 100049, Beijing, China
| | - Fuzhou Gong
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, 100190, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Yuquan Road, 100049, Beijing, China
| | - Lin Wan
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Zhongguancun East Road, 100190, Beijing, China
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Yuquan Road, 100049, Beijing, China
| | - Liang Ma
- Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, 100101, Beijing, Country
- School of Mathematical Sciences, University of Chinese Academy of Sciences, Yuquan Road, 100049, Beijing, China
| |
Collapse
|
49
|
Molecular Characteristics and Prognostic Role of MFAP2 in Stomach Adenocarcinoma. JOURNAL OF HEALTHCARE ENGINEERING 2022; 2022:1417238. [PMID: 35356627 PMCID: PMC8959993 DOI: 10.1155/2022/1417238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Accepted: 11/19/2021] [Indexed: 11/26/2022]
Abstract
Molecular characteristics and prognostic role of MFAP2 were by no means stated. The MFAP2 expression and prognostic prices in this study, with Cox analysis, was employed to develop a predictive fee for MFAP2. To know about coexpression and practical networks associated with MFAP2, LinkedOmics and GEPIA2 have been used. MFAP2 expression has been increased and verified in many unbiased coalitions in TCGA-STAD tumor tissues. In addition, in each TCGA and various cohorts, increased MFAP2 was linked with lower survival. Evaluation by Cox revealed the unbiased danger to average survival, disease-specific survival, and progression-free survival of STAD used to be due to the elevated expression of MFAP2. Active community assessed the MFAP2, through which more than a few cancer-associated kinases and E2F household pathways are regulated, which shows that MFAP2 affects RNA transportation, oocyte meiosis, spliceosome, and ribosome biogenesis. MFAP2 can predict and is linked to the prediction of STAD independently. The closure of the MFAP2 link to the macrophage marker genes is, in particular, the achievable core of immune response.
Collapse
|
50
|
Ouellette TW, Awadalla P. Inferring ongoing cancer evolution from single tumour biopsies using synthetic supervised learning. PLoS Comput Biol 2022; 18:e1010007. [PMID: 35482653 PMCID: PMC9049314 DOI: 10.1371/journal.pcbi.1010007] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2021] [Accepted: 03/09/2022] [Indexed: 11/18/2022] Open
Abstract
Variant allele frequencies (VAF) encode ongoing evolution and subclonal selection in growing tumours. However, existing methods that utilize VAF information for cancer evolutionary inference are compressive, slow, or incorrectly specify the underlying cancer evolutionary dynamics. Here, we provide a proof-of-principle synthetic supervised learning method, TumE, that integrates simulated models of cancer evolution with Bayesian neural networks, to infer ongoing selection in bulk-sequenced single tumour biopsies. Analyses in synthetic and patient tumours show that TumE significantly improves both accuracy and inference time per sample when detecting positive selection, deconvoluting selected subclonal populations, and estimating subclone frequency. Importantly, we show how transfer learning can leverage stored knowledge within TumE models for related evolutionary inference tasks-substantially reducing data and computational time for further model development and providing a library of recyclable deep learning models for the cancer evolution community. This extensible framework provides a foundation and future directions for harnessing progressive computational methods for the benefit of cancer genomics and, in turn, the cancer patient.
Collapse
Affiliation(s)
- Tom W. Ouellette
- Ontario Institute for Cancer Research, Department of Computational Biology, Toronto, Ontario, Canada
- Department of Molecular Genetics, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Philip Awadalla
- Ontario Institute for Cancer Research, Department of Computational Biology, Toronto, Ontario, Canada
- Department of Molecular Genetics, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|