Markin A, Eulenstein O. Quartet-Based Inference is Statistically Consistent Under the Unified Duplication-Loss-Coalescence Model.
Bioinformatics 2021;
37:4064-4074. [PMID:
34048529 PMCID:
PMC9113308 DOI:
10.1093/bioinformatics/btab414]
[Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Revised: 05/19/2021] [Accepted: 05/27/2021] [Indexed: 12/19/2022] Open
Abstract
Motivation
The classic multispecies coalescent (MSC) model provides the means for theoretical justification of incomplete lineage sorting-aware species tree inference methods. This has motivated an extensive body of work on phylogenetic methods that are statistically consistent under MSC. One such particularly popular method is ASTRAL, a quartet-based species tree inference method. Novel studies suggest that ASTRAL also performs well when given multi-locus gene trees in simulation studies. Further, Legried et al. recently demonstrated that ASTRAL is statistically consistent under the gene duplication and loss model (GDL). GDL is prevalent in evolutionary histories and is the first core process in the powerful duplication-loss-coalescence evolutionary model (DLCoal) by Rasmussen and Kellis.
Results
In this work, we prove that ASTRAL is statistically consistent under the general DLCoal model. Therefore, our result supports the empirical evidence from the simulation-based studies. More broadly, we prove that the quartet-based inference approach is statistically consistent under DLCoal.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse