1
|
Dai J, Rubel T, Han Y, Molloy EK. Dollo-CDP: a polynomial-time algorithm for the clade-constrained large Dollo parsimony problem. Algorithms Mol Biol 2024; 19:2. [PMID: 38191515 PMCID: PMC10775561 DOI: 10.1186/s13015-023-00249-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 12/10/2023] [Indexed: 01/10/2024] Open
Abstract
The last decade of phylogenetics has seen the development of many methods that leverage constraints plus dynamic programming. The goal of this algorithmic technique is to produce a phylogeny that is optimal with respect to some objective function and that lies within a constrained version of tree space. The popular species tree estimation method ASTRAL, for example, returns a tree that (1) maximizes the quartet score computed with respect to the input gene trees and that (2) draws its branches (bipartitions) from the input constraint set. This technique has yet to be used for parsimony problems where the input are binary characters, sometimes with missing values. Here, we introduce the clade-constrained character parsimony problem and present an algorithm that solves this problem for the Dollo criterion score in [Formula: see text] time, where n is the number of leaves, k is the number of characters, and [Formula: see text] is the set of clades used as constraints. Dollo parsimony, which requires traits/mutations to be gained at most once but allows them to be lost any number of times, is widely used for tumor phylogenetics as well as species phylogenetics, for example analyses of low-homoplasy retroelement insertions across the vertebrate tree of life. This motivated us to implement our algorithm in a software package, called Dollo-CDP, and evaluate its utility for analyzing retroelement insertion presence / absence patterns for bats, birds, toothed whales as well as simulated data. Our results show that Dollo-CDP can improve upon heuristic search from a single starting tree, often recovering a better scoring tree. Moreover, Dollo-CDP scales to data sets with much larger numbers of taxa than branch-and-bound while still having an optimality guarantee, albeit a more restricted one. Lastly, we show that our algorithm for Dollo parsimony can easily be adapted to Camin-Sokal parsimony but not Fitch parsimony.
Collapse
Affiliation(s)
- Junyan Dai
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Tobias Rubel
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Yunheng Han
- Department of Computer Science, University of Maryland, College Park, MD, USA
| | - Erin K Molloy
- Department of Computer Science, University of Maryland, College Park, MD, USA.
- University of Maryland Institute for Advanced Computer Studies, College Park, MD, USA.
| |
Collapse
|
2
|
Spain L, Coulton A, Lobon I, Rowan A, Schnidrig D, Shepherd ST, Shum B, Byrne F, Goicoechea M, Piperni E, Au L, Edmonds K, Carlyle E, Hunter N, Renn A, Messiou C, Hughes P, Nobbs J, Foijer F, van den Bos H, Wardenaar R, Spierings DC, Spencer C, Schmitt AM, Tippu Z, Lingard K, Grostate L, Peat K, Kelly K, Sarker S, Vaughan S, Mangwende M, Terry L, Kelly D, Biano J, Murra A, Korteweg J, Lewis C, O'Flaherty M, Cattin AL, Emmerich M, Gerard CL, Pallikonda HA, Lynch J, Mason R, Rogiers A, Xu H, Huebner A, McGranahan N, Al Bakir M, Murai J, Naceur-Lombardelli C, Borg E, Mitchison M, Moore DA, Falzon M, Proctor I, Stamp GW, Nye EL, Young K, Furness AJ, Pickering L, Stewart R, Mahadeva U, Green A, Larkin J, Litchfield K, Swanton C, Jamal-Hanjani M, Turajlic S. Late-Stage Metastatic Melanoma Emerges through a Diversity of Evolutionary Pathways. Cancer Discov 2023; 13:1364-1385. [PMID: 36977461 PMCID: PMC10236155 DOI: 10.1158/2159-8290.cd-22-1427] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 03/06/2023] [Accepted: 03/17/2023] [Indexed: 03/30/2023]
Abstract
Understanding the evolutionary pathways to metastasis and resistance to immune-checkpoint inhibitors (ICI) in melanoma is critical for improving outcomes. Here, we present the most comprehensive intrapatient metastatic melanoma dataset assembled to date as part of the Posthumous Evaluation of Advanced Cancer Environment (PEACE) research autopsy program, including 222 exome sequencing, 493 panel-sequenced, 161 RNA sequencing, and 22 single-cell whole-genome sequencing samples from 14 ICI-treated patients. We observed frequent whole-genome doubling and widespread loss of heterozygosity, often involving antigen-presentation machinery. We found KIT extrachromosomal DNA may have contributed to the lack of response to KIT inhibitors of a KIT-driven melanoma. At the lesion-level, MYC amplifications were enriched in ICI nonresponders. Single-cell sequencing revealed polyclonal seeding of metastases originating from clones with different ploidy in one patient. Finally, we observed that brain metastases that diverged early in molecular evolution emerge late in disease. Overall, our study illustrates the diverse evolutionary landscape of advanced melanoma. SIGNIFICANCE Despite treatment advances, melanoma remains a deadly disease at stage IV. Through research autopsy and dense sampling of metastases combined with extensive multiomic profiling, our study elucidates the many mechanisms that melanomas use to evade treatment and the immune system, whether through mutations, widespread copy-number alterations, or extrachromosomal DNA. See related commentary by Shain, p. 1294. This article is highlighted in the In This Issue feature, p. 1275.
Collapse
Affiliation(s)
- Lavinia Spain
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
- Department of Medical Oncology, Peter MacCallum Cancer Centre, Melbourne, Australia
| | - Alexander Coulton
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Tumour Immunogenomics and Immunosurveillance (TIGI) Lab, UCL Cancer Institute, London, United Kingdom
| | - Irene Lobon
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Andrew Rowan
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Desiree Schnidrig
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Scott T.C. Shepherd
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Benjamin Shum
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Fiona Byrne
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Maria Goicoechea
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Elisa Piperni
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Lewis Au
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
- Department of Medical Oncology, Peter MacCallum Cancer Centre, Melbourne, Australia
- Sir Peter MacCallum Department of Oncology, The University of Melbourne, Victoria, Australia
| | - Kim Edmonds
- The Royal Marsden Hospital, London, United Kingdom
| | | | - Nikki Hunter
- The Royal Marsden Hospital, London, United Kingdom
| | | | - Christina Messiou
- The Royal Marsden Hospital, London, United Kingdom
- The Institute of Cancer Research, Kensington and Chelsea, United Kingdom
| | - Peta Hughes
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Jaime Nobbs
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Floris Foijer
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, the Netherlands
| | - Hilda van den Bos
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, the Netherlands
| | - Rene Wardenaar
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, the Netherlands
| | - Diana C.J. Spierings
- European Research Institute for the Biology of Ageing, University of Groningen, University Medical Centre Groningen, Groningen, the Netherlands
| | - Charlotte Spencer
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | | | - Zayd Tippu
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | | | | | - Kema Peat
- The Royal Marsden Hospital, London, United Kingdom
| | | | - Sarah Sarker
- The Royal Marsden Hospital, London, United Kingdom
| | | | | | - Lauren Terry
- The Royal Marsden Hospital, London, United Kingdom
| | - Denise Kelly
- The Royal Marsden Hospital, London, United Kingdom
| | | | - Aida Murra
- The Royal Marsden Hospital, London, United Kingdom
| | | | | | | | - Anne-Laure Cattin
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Max Emmerich
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- St. John's Institute of Dermatology, Guy's and St Thomas’ Hospital NHS Foundation Trust, London, United Kingdom
| | - Camille L. Gerard
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Precision Oncology Center, Oncology Department, Lausanne University Hospital, Lausanne, Switzerland
| | | | - Joanna Lynch
- The Royal Marsden Hospital, London, United Kingdom
| | - Robert Mason
- Gold Coast University Hospital, Queensland, Australia
| | - Aljosja Rogiers
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- The Royal Marsden Hospital, London, United Kingdom
| | - Hang Xu
- The Francis Crick Institute, London, United Kingdom
| | - Ariana Huebner
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, United Kingdom
- Cancer Genome Evolution Research Group, Cancer Research UK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, United Kingdom
- Cancer Research UK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, United Kingdom
| | - Nicholas McGranahan
- Cancer Genome Evolution Research Group, Cancer Research UK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, United Kingdom
| | - Maise Al Bakir
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, United Kingdom
- Cancer Research UK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, United Kingdom
| | - Jun Murai
- Tumour Immunogenomics and Immunosurveillance (TIGI) Lab, UCL Cancer Institute, London, United Kingdom
- Drug Discovery Technology Laboratories, Ono Pharmaceutical Co., Ltd. Osaka, Japan
| | | | - Elaine Borg
- University College London Hospital, London, United Kingdom
| | | | - David A. Moore
- Guy's and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - Mary Falzon
- University College London Hospital, London, United Kingdom
| | - Ian Proctor
- University College London Hospital, London, United Kingdom
| | | | - Emma L. Nye
- The Francis Crick Institute, London, United Kingdom
| | - Kate Young
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| | - Andrew J.S. Furness
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
- The Institute of Cancer Research, Kensington and Chelsea, United Kingdom
| | | | - Ruby Stewart
- Guy's and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - Ula Mahadeva
- Guy's and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - Anna Green
- Guy's and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - James Larkin
- Guy's and St Thomas’ NHS Foundation Trust, London, United Kingdom
| | - Kevin Litchfield
- Tumour Immunogenomics and Immunosurveillance (TIGI) Lab, UCL Cancer Institute, London, United Kingdom
| | - Charles Swanton
- Cancer Evolution and Genome Instability Laboratory, The Francis Crick Institute, London, United Kingdom
| | - Mariam Jamal-Hanjani
- Cancer Research UK Lung Cancer Centre of Excellence, UCL Cancer Institute, London, United Kingdom
- Cancer Metastasis Laboratory, University College London Cancer Institute, London, United Kingdom
- Department of Medical Oncology, University College London Hospitals, London, United Kingdom
| | | | - Samra Turajlic
- Cancer Dynamics Laboratory, The Francis Crick Institute, London, United Kingdom
- Skin and Renal Unit, Royal Marsden NHS Foundation Trust, London, United Kingdom
| |
Collapse
|
4
|
Ciccolella S, Bernardini G, Denti L, Bonizzoni P, Previtali M, Della Vedova G. Triplet-based similarity score for fully multilabeled trees with poly-occurring labels. Bioinformatics 2021; 37:178-184. [PMID: 32730595 PMCID: PMC8055217 DOI: 10.1093/bioinformatics/btaa676] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 06/29/2020] [Accepted: 07/22/2020] [Indexed: 01/06/2023] Open
Abstract
MOTIVATION The latest advances in cancer sequencing, and the availability of a wide range of methods to infer the evolutionary history of tumors, have made it important to evaluate, reconcile and cluster different tumor phylogenies. Recently, several notions of distance or similarities have been proposed in the literature, but none of them has emerged as the golden standard. Moreover, none of the known similarity measures is able to manage mutations occurring multiple times in the tree, a circumstance often occurring in real cases. RESULTS To overcome these limitations, in this article, we propose MP3, the first similarity measure for tumor phylogenies able to effectively manage cases where multiple mutations can occur at the same time and mutations can occur multiple times. Moreover, a comparison of MP3 with other measures shows that it is able to classify correctly similar and dissimilar trees, both on simulated and on real data. AVAILABILITY AND IMPLEMENTATION An open source implementation of MP3 is publicly available at https://github.com/AlgoLab/mp3treesim. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Simone Ciccolella
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| | - Giulia Bernardini
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| | - Luca Denti
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| | - Paola Bonizzoni
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| | - Marco Previtali
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| | - Gianluca Della Vedova
- Department of Informatics, Systems and Communication, University of Milano-Biccoca, Milan 20126, Italy
| |
Collapse
|
5
|
Ciccolella S, Ricketts C, Soto Gomez M, Patterson M, Silverbush D, Bonizzoni P, Hajirasouliha I, Della Vedova G. Inferring cancer progression from Single-Cell Sequencing while allowing mutation losses. Bioinformatics 2021; 37:326-333. [PMID: 32805010 PMCID: PMC8058767 DOI: 10.1093/bioinformatics/btaa722] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 08/06/2020] [Accepted: 08/11/2020] [Indexed: 01/21/2023] Open
Abstract
Motivation In recent years, the well-known Infinite Sites Assumption has been a fundamental feature of computational methods devised for reconstructing tumor phylogenies and inferring cancer progressions. However, recent studies leveraging single-cell sequencing (SCS) techniques have shown evidence of the widespread recurrence and, especially, loss of mutations in several tumor samples. While there exist established computational methods that infer phylogenies with mutation losses, there remain some advancements to be made. Results We present Simulated Annealing Single-Cell inference (SASC): a new and robust approach based on simulated annealing for the inference of cancer progression from SCS datasets. In particular, we introduce an extension of the model of evolution where mutations are only accumulated, by allowing also a limited amount of mutation loss in the evolutionary history of the tumor: the Dollo-k model. We demonstrate that SASC achieves high levels of accuracy when tested on both simulated and real datasets and in comparison with some other available methods. Availability and implementation The SASC tool is open source and available at https://github.com/sciccolella/sasc. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Simone Ciccolella
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| | - Camir Ricketts
- Department of Physiology and Biophysics, Tri-I Computational Biology & Medicine Graduate Program, Weill Cornell Medicine of Cornell University, New York, NY 10021, USA.,Institute for Computational Biomedicine, Englander Institute for Precision Medicine, The Meyer Cancer Center, Department of Physiology and Biophysics, Weill Cornell Medicine of Cornell University, New York City, NY 10021, USA
| | - Mauricio Soto Gomez
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| | - Murray Patterson
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy.,Department of Computer Science, College of Arts and Sciences, Georgia State University, Atlanta, GA 30303, USA
| | - Dana Silverbush
- Department of Pathology and Center for Cancer Research, Massachusetts General Hospital and Harvard Medical School, Boston, MA 02114, USA
| | - Paola Bonizzoni
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| | - Iman Hajirasouliha
- Institute for Computational Biomedicine, Englander Institute for Precision Medicine, The Meyer Cancer Center, Department of Physiology and Biophysics, Weill Cornell Medicine of Cornell University, New York City, NY 10021, USA
| | - Gianluca Della Vedova
- Department of Informatics, Systems and Communication, University of Milano-Bicocca, Milan, Italy
| |
Collapse
|
6
|
Ciccolella S, Soto Gomez M, Patterson MD, Della Vedova G, Hajirasouliha I, Bonizzoni P. gpps: an ILP-based approach for inferring cancer progression with mutation losses from single cell data. BMC Bioinformatics 2020; 21:413. [PMID: 33297943 PMCID: PMC7725124 DOI: 10.1186/s12859-020-03736-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Accepted: 09/03/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Cancer progression reconstruction is an important development stemming from the phylogenetics field. In this context, the reconstruction of the phylogeny representing the evolutionary history presents some peculiar aspects that depend on the technology used to obtain the data to analyze: Single Cell DNA Sequencing data have great specificity, but are affected by moderate false negative and missing value rates. Moreover, there has been some recent evidence of back mutations in cancer: this phenomenon is currently widely ignored. RESULTS We present a new tool, gpps, that reconstructs a tumor phylogeny from Single Cell Sequencing data, allowing each mutation to be lost at most a fixed number of times. The General Parsimony Phylogeny from Single cell (gpps) tool is open source and available at https://github.com/AlgoLab/gpps . CONCLUSIONS gpps provides new insights to the analysis of intra-tumor heterogeneity by proposing a new progression model to the field of cancer phylogeny reconstruction on Single Cell data.
Collapse
Affiliation(s)
- Simone Ciccolella
- Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy.
| | - Mauricio Soto Gomez
- Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy
| | - Murray D Patterson
- Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy.,Georgia State University, Atlanta, GA, USA
| | - Gianluca Della Vedova
- Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy
| | - Iman Hajirasouliha
- Institute for Computational Biomedicine, Weill Cornell Medicine, New York City, NY, USA.,Department of Physiology and Biophysics, Weill Cornell Medicine of Cornell University, NewYork City, 10021, NY, USA
| | - Paola Bonizzoni
- Department of Informatics, Systems, and Communication, University of Milano - Bicocca, Milan, Italy
| |
Collapse
|
7
|
Sadeqi Azer E, Haghir Ebrahimabadi M, Malikić S, Khardon R, Sahinalp SC. Tumor Phylogeny Topology Inference via Deep Learning. iScience 2020; 23:101655. [PMID: 33117968 PMCID: PMC7582044 DOI: 10.1016/j.isci.2020.101655] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 08/10/2020] [Accepted: 10/02/2020] [Indexed: 01/24/2023] Open
Abstract
Principled computational approaches for tumor phylogeny reconstruction via single-cell sequencing typically aim to build the most likely perfect phylogeny tree from the noisy genotype matrix - which represents genotype calls of single cells. This problem is NP-hard, and as a result, existing approaches aim to solve relatively small instances of it through combinatorial optimization techniques or Bayesian inference. As expected, even when the goal is to infer basic topological features of the tumor phylogeny, rather than reconstructing the topology entirely, these approaches could be prohibitively slow. In this paper, we introduce fast deep learning solutions to the problems of inferring whether the most likely tree has a linear (chain) or branching topology and whether a perfect phylogeny is feasible from a given genotype matrix. We also present a reinforcement learning approach for reconstructing the most likely tumor phylogeny. This preliminary work demonstrates that data-driven approaches can reconstruct key features of tumor evolution.
Collapse
Affiliation(s)
- Erfan Sadeqi Azer
- Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
| | - Mohammad Haghir Ebrahimabadi
- Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Salem Malikić
- Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | - Roni Khardon
- Department of Computer Science, Indiana University, Bloomington, IN 47408, USA
| | - S. Cenk Sahinalp
- Cancer Data Science Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|