1
|
Lei H, Gertz EM, Schäffer AA, Fu X, Tao Y, Heselmeyer-Haddad K, Torres I, Li G, Xu L, Hou Y, Wu K, Shi X, Dean M, Ried T, Schwartz R. Tumor heterogeneity assessed by sequencing and fluorescence in situ hybridization (FISH) data. Bioinformatics 2021; 37:4704-4711. [PMID: 34289030 PMCID: PMC8665747 DOI: 10.1093/bioinformatics/btab504] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 05/19/2021] [Accepted: 07/05/2021] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION Computational reconstruction of clonal evolution in cancers has become a crucial tool for understanding how tumors initiate and progress and how this process varies across patients. The field still struggles, however, with special challenges of applying phylogenetic methods to cancers, such as the prevalence and importance of copy number alteration (CNA) and structural variation (SV) events in tumor evolution, which are difficult to profile accurately by prevailing sequencing methods in such a way that subsequent reconstruction by phylogenetic inference algorithms is accurate. RESULTS In the present work, we develop computational methods to combine sequencing with multiplex interphase fluorescence in situ hybridization (miFISH) to exploit the complementary advantages of each technology in inferring accurate models of clonal CNA evolution accounting for both focal changes and aneuploidy at whole-genome scales. By integrating such information in an integer linear programming (ILP) framework, we demonstrate on simulated data that incorporation of FISH data substantially improves accurate inference of focal CNA and ploidy changes in clonal evolution from deconvolving bulk sequence data. Analysis of real glioblastoma data for which FISH, bulk sequence, and single cell sequence are all available confirms the power of FISH to enhance accurate reconstruction of clonal copy number evolution in conjunction with bulk and optionally single-cell sequence data. AVAILABILITY Source code is available on Github at https://github.com/CMUSchwartzLab/FISH_deconvolution.
Collapse
Affiliation(s)
- Haoyun Lei
- Computational Biology Dept, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - E Michael Gertz
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Alejandro A Schäffer
- Cancer Data Science Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Xuecong Fu
- Shenzhen Luohu People's Hospital, Shenzhen, 518000, China
| | - Yifeng Tao
- Computational Biology Dept, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| | - Kerstin Heselmeyer-Haddad
- Genetics Branch, Cancer Genomics Section, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Irianna Torres
- Genetics Branch, Cancer Genomics Section, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Guibo Li
- Department of Biology, University of Copenhagen, Copenhagen, 1599, Denmark
| | - Liqin Xu
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Soltofts Plads, 2800 Kongens Lyngby, Denmark
| | - Yong Hou
- Department of Biology, University of Copenhagen, Copenhagen, 1599, Denmark
| | - Kui Wu
- Department of Biology, University of Copenhagen, Copenhagen, 1599, Denmark
| | - Xulian Shi
- Shenzhen Luohu People's Hospital, Shenzhen, 518000, China
| | - Michael Dean
- Laboratory of Translational Genomics, Division of Cancer Epidemiology & Genetics, National Cancer Institute, U.S. National Institutes of Health, Gaithersburg, MD, 20814, USA
| | - Thomas Ried
- Genetics Branch, Cancer Genomics Section, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Russell Schwartz
- Computational Biology Dept, Carnegie Mellon University, Pittsburgh, PA, 15213, USA.,Dept. of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
| |
Collapse
|
2
|
Xia R, Lin Y, Zhou J, Geng T, Feng B, Tang J. Phylogenetic Reconstruction for Copy-Number Evolution Problems. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:694-699. [PMID: 29993694 DOI: 10.1109/tcbb.2018.2829698] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Cancer is known for its heterogeneity and is regarded as an evolutionary process driven by somatic mutations and clonal expansions. This evolutionary process can be modeled by a phylogenetic tree and phylogenetic analysis of multiple subclones of cancer cells can facilitate the study of the tumor variants progression. Copy-number aberration occurs frequently in many types of tumors in terms of segmental amplifications and deletions. In this paper, we developed a distance-based method for reconstructing phylogenies from copy-number profiles of cancer cells. We demonstrate the importance of distance correction from the edit (minimum) distance to the estimated actual number of events. Experimental results show that our approaches provide accurate and scalable results in estimating the actual number of evolutionary events between copy number profiles and in reconstructing phylogenies.
Collapse
|
3
|
Reconstructing Yeasts Phylogenies and Ancestors from Whole Genome Data. Sci Rep 2017; 7:15209. [PMID: 29123238 PMCID: PMC5680185 DOI: 10.1038/s41598-017-15484-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Accepted: 10/27/2017] [Indexed: 12/31/2022] Open
Abstract
Phylogenetic studies aim to discover evolutionary relationships and histories. These studies are based on similarities of morphological characters and molecular sequences. Currently, widely accepted phylogenetic approaches are based on multiple sequence alignments, which analyze shared gene datasets and concatenate/coalesce these results to a final phylogeny with maximum support. However, these approaches still have limitations, and often have conflicting results with each other. Reconstructing ancestral genomes helps us understand mechanisms and corresponding consequences of evolution. Most existing genome level phylogeny and ancestor reconstruction methods can only process simplified real genome datasets or simulated datasets with identical genome content, unique genome markers, and limited types of evolutionary events. Here, we provide an alternative way to resolve phylogenetic problems based on analyses of real genome data. We use phylogenetic signals from all types of genome level evolutionary events, and overcome the conflicting issues existing in traditional phylogenetic approaches. Further, we build an automated computational pipeline to reconstruct phylogenies and ancestral genomes for two high-resolution real yeast genome datasets. Comparison results with recent studies and publications show that we reconstruct very accurate and robust phylogenies and ancestors. Finally, we identify and analyze the conserved syntenic blocks among reconstructed ancestral genomes and present yeast species.
Collapse
|