Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Waddell PJ, Penny D, Moore T. Hadamard conjugations and modeling sequence evolution with unequal rates across sites. Mol Phylogenet Evol 1997;8:33-50. [PMID: 9242594 DOI: 10.1006/mpev.1997.0405] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

For:	Waddell PJ, Penny D, Moore T. Hadamard conjugations and modeling sequence evolution with unequal rates across sites. Mol Phylogenet Evol 1997;8:33-50. [PMID: 9242594 DOI: 10.1006/mpev.1997.0405] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Number

Cited by Other Article(s)

Cunningham CW, Zhu H, Hillis DM. BEST‐FIT MAXIMUM‐LIKELIHOOD MODELS FOR PHYLOGENETIC INFERENCE: EMPIRICAL TESTS WITH KNOWN PHYLOGENIES. Evolution 2017;52:978-987. [DOI: 10.1111/j.1558-5646.1998.tb01827.x] [Citation(s) in RCA: 69] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/1997] [Accepted: 04/16/1998] [Indexed: 12/01/2022]

Harrison LB, Larsson HCE. Among-Character Rate Variation Distributions in Phylogenetic Analysis of Discrete Morphological Characters. Syst Biol 2014;64:307-24. [DOI: 10.1093/sysbio/syu098] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Sinsheimer JS, Little RJA, Lake JA. Rooting gene trees without outgroups: EP rooting. Genome Biol Evol 2012;4:709-19. [PMID: 22593551 PMCID: PMC3509888 DOI: 10.1093/gbe/evs047] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Addario-Berry L, Chor B, Hallett M, Lagergren J, Panconesi A, Wareham T. ANCESTRAL MAXIMUM LIKELIHOOD OF EVOLUTIONARY TREES IS HARD. J Bioinform Comput Biol 2011;2:257-71. [PMID: 15297981 DOI: 10.1142/s0219720004000557] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2003] [Revised: 01/19/2003] [Accepted: 01/26/2003] [Indexed: 11/18/2022]

Waddell PJ, Ota R, Penny D. Measuring fit of sequence data to phylogenetic model: gain of power using marginal tests. J Mol Evol 2009;69:289-99. [PMID: 19851702 DOI: 10.1007/s00239-009-9268-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2009] [Accepted: 07/28/2009] [Indexed: 11/29/2022]

Abstract

Testing fit of data to model is fundamentally important to any science, but publications in the field of phylogenetics rarely do this. Such analyses discard fundamental aspects of science as prescribed by Karl Popper. Indeed, not without cause, Popper (Unended quest: an intellectual autobiography. Fontana, London, 1976) once argued that evolutionary biology was unscientific as its hypotheses were untestable. Here we trace developments in assessing fit from Penny et al. (Nature 297:197-200, 1982) to the present. We compare the general log-likelihood ratio (the G or G (2) statistic) statistic between the evolutionary tree model and the multinomial model with that of marginalized tests applied to an alignment (using placental mammal coding sequence data). It is seen that the most general test does not reject the fit of data to model (P approximately 0.5), but the marginalized tests do. Tests on pairwise frequency (F) matrices, strongly (P < 0.001) reject the most general phylogenetic (GTR) models commonly in use. It is also clear (P < 0.01) that the sequences are not stationary in their nucleotide composition. Deviations from stationarity and homogeneity seem to be unevenly distributed amongst taxa; not necessarily those expected from examining other regions of the genome. By marginalizing the 4( t ) patterns of the i.i.d. model to observed and expected parsimony counts, that is, from constant sites, to singletons, to parsimony informative characters of a minimum possible length, then the likelihood ratio test regains power, and it too rejects the evolutionary model with P << 0.001. Given such behavior over relatively recent evolutionary time, readers in general should maintain a healthy skepticism of results, as the scale of the systematic errors in published trees may really be far larger than the analytical methods (e.g., bootstrap) report.

Collapse

von Reumont BM, Meusemann K, Szucsich NU, Dell'Ampio E, Gowri-Shankar V, Bartel D, Simon S, Letsch HO, Stocsits RR, Luan YX, Wägele JW, Pass G, Hadrys H, Misof B. Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships. BMC Evol Biol 2009;9:119. [PMID: 19473484 PMCID: PMC2695459 DOI: 10.1186/1471-2148-9-119] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2008] [Accepted: 05/27/2009] [Indexed: 01/27/2023] Open

Abstract

BACKGROUND

Whenever different data sets arrive at conflicting phylogenetic hypotheses, only testable causal explanations of sources of errors in at least one of the data sets allow us to critically choose among the conflicting hypotheses of relationships. The large (28S) and small (18S) subunit rRNAs are among the most popular markers for studies of deep phylogenies. However, some nodes supported by this data are suspected of being artifacts caused by peculiarities of the evolution of these molecules. Arthropod phylogeny is an especially controversial subject dotted with conflicting hypotheses which are dependent on data set and method of reconstruction. We assume that phylogenetic analyses based on these genes can be improved further i) by enlarging the taxon sample and ii) employing more realistic models of sequence evolution incorporating non-stationary substitution processes and iii) considering covariation and pairing of sites in rRNA-genes.

RESULTS

We analyzed a large set of arthropod sequences, applied new tools for quality control of data prior to tree reconstruction, and increased the biological realism of substitution models. Although the split-decomposition network indicated a high noise content in the data set, our measures were able to both improve the analyses and give causal explanations for some incongruities mentioned from analyses of rRNA sequences. However, misleading effects did not completely disappear.

CONCLUSION

Analyses of data sets that result in ambiguous phylogenetic hypotheses demand for methods, which do not only filter stochastic noise, but likewise allow to differentiate phylogenetic signal from systematic biases. Such methods can only rely on our findings regarding the evolution of the analyzed data. Analyses on independent data sets then are crucial to test the plausibility of the results. Our approach can easily be extended to genomic data, as well, whereby layers of quality assessment are set up applicable to phylogenetic reconstructions in general.

Collapse

Gruenheit N, Lockhart PJ, Steel M, Martin W. Difficulties in testing for covarion-like properties of sequences under the confounding influence of changing proportions of variable sites. Mol Biol Evol 2008;25:1512-20. [PMID: 18424773 DOI: 10.1093/molbev/msn098] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Kelchner SA, Thomas MA. Model use in phylogenetics: nine key questions. Trends Ecol Evol 2006;22:87-94. [PMID: 17049674 DOI: 10.1016/j.tree.2006.10.004] [Citation(s) in RCA: 120] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2006] [Revised: 09/19/2006] [Accepted: 10/05/2006] [Indexed: 11/16/2022]

Waddell PJ. Measuring the fit of sequence data to phylogenetic model: allowing for missing data. Mol Biol Evol 2004;22:395-401. [PMID: 15470228 DOI: 10.1093/molbev/msi002] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Cejchan PA. LUCA, or just a conserved Archaeon?: Comments on Xue et al. (2003). Gene 2004;333:47-50. [PMID: 15177679 DOI: 10.1016/j.gene.2004.02.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2003] [Revised: 09/24/2003] [Accepted: 02/05/2004] [Indexed: 11/24/2022]

Wägele JW, Holland B, Dreyer H, Hackethal B. Searching factors causing implausible non-monophyly: ssu rDNA phylogeny of Isopoda Asellota (Crustacea: Peracarida) and faster evolution in marine than in freshwater habitats. Mol Phylogenet Evol 2003;28:536-51. [PMID: 12927137 DOI: 10.1016/s1055-7903(03)00053-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Analytic Solutions for Three-Taxon MLMC Trees with Variable Rates Across Sites. ACTA ACUST UNITED AC 2001. [DOI: 10.1007/3-540-44696-6_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]

Shpak M, Churchill GA. The information content of a character under a Markov model of evolution. Mol Phylogenet Evol 2000;17:231-43. [PMID: 11083937 DOI: 10.1006/mpev.2000.0846] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Martin P, Kaygorodova I, Sherbakov DY, Verheyen E. Rapidly evolving lineages impede the resolution of phylogenetic relationships among Clitellata (Annelida). Mol Phylogenet Evol 2000;15:355-68. [PMID: 10860645 DOI: 10.1006/mpev.1999.0764] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Ota R, Waddell PJ, Hasegawa M, Shimodaira H, Kishino H. Appropriate likelihood ratio tests and marginal distributions for evolutionary tree models with constraints on parameters. Mol Biol Evol 2000;17:798-803. [PMID: 10779540 DOI: 10.1093/oxfordjournals.molbev.a026358] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Philippe H, Laurent J. How good are deep phylogenetic trees? Curr Opin Genet Dev 1998;8:616-23. [PMID: 9914208 DOI: 10.1016/s0959-437x(98)80028-2] [Citation(s) in RCA: 242] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Gu X, Li WH. Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution. Proc Natl Acad Sci U S A 1998;95:5899-905. [PMID: 9600890 PMCID: PMC34493 DOI: 10.1073/pnas.95.11.5899] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open

Waddell PJ, Steel MA. General time-reversible distances with unequal rates across sites: mixing gamma and inverse Gaussian distributions with invariant sites. Mol Phylogenet Evol 1997;8:398-414. [PMID: 9417897 DOI: 10.1006/mpev.1997.0452] [Citation(s) in RCA: 138] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]