1
|
Stiller J, Feng S, Chowdhury AA, Rivas-González I, Duchêne DA, Fang Q, Deng Y, Kozlov A, Stamatakis A, Claramunt S, Nguyen JMT, Ho SYW, Faircloth BC, Haag J, Houde P, Cracraft J, Balaban M, Mai U, Chen G, Gao R, Zhou C, Xie Y, Huang Z, Cao Z, Yan Z, Ogilvie HA, Nakhleh L, Lindow B, Morel B, Fjeldså J, Hosner PA, da Fonseca RR, Petersen B, Tobias JA, Székely T, Kennedy JD, Reeve AH, Liker A, Stervander M, Antunes A, Tietze DT, Bertelsen MF, Lei F, Rahbek C, Graves GR, Schierup MH, Warnow T, Braun EL, Gilbert MTP, Jarvis ED, Mirarab S, Zhang G. Complexity of avian evolution revealed by family-level genomes. Nature 2024; 629:851-860. [PMID: 38560995 PMCID: PMC11111414 DOI: 10.1038/s41586-024-07323-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 03/15/2024] [Indexed: 04/04/2024]
Abstract
Despite tremendous efforts in the past decades, relationships among main avian lineages remain heavily debated without a clear resolution. Discrepancies have been attributed to diversity of species sampled, phylogenetic method and the choice of genomic regions1-3. Here we address these issues by analysing the genomes of 363 bird species4 (218 taxonomic families, 92% of total). Using intergenic regions and coalescent methods, we present a well-supported tree but also a marked degree of discordance. The tree confirms that Neoaves experienced rapid radiation at or near the Cretaceous-Palaeogene boundary. Sufficient loci rather than extensive taxon sampling were more effective in resolving difficult nodes. Remaining recalcitrant nodes involve species that are a challenge to model due to either extreme DNA composition, variable substitution rates, incomplete lineage sorting or complex evolutionary events such as ancient hybridization. Assessment of the effects of different genomic partitions showed high heterogeneity across the genome. We discovered sharp increases in effective population size, substitution rates and relative brain size following the Cretaceous-Palaeogene extinction event, supporting the hypothesis that emerging ecological opportunities catalysed the diversification of modern birds. The resulting phylogenetic estimate offers fresh insights into the rapid radiation of modern birds and provides a taxon-rich backbone tree for future comparative studies.
Collapse
Affiliation(s)
- Josefin Stiller
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Shaohong Feng
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Department of General Surgery, Sir Run-Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China
| | - Al-Aabid Chowdhury
- School of Life and Environmental Sciences, University of Sydney, Sydney, New South Wales, Australia
| | | | - David A Duchêne
- Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Qi Fang
- BGI Research, Shenzhen, China
| | - Yuan Deng
- BGI Research, Shenzhen, China
- BGI Research, Wuhan, China
| | - Alexey Kozlov
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Santiago Claramunt
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Department of Natural History, Royal Ontario Museum, Toronto, Ontario, Canada
| | - Jacqueline M T Nguyen
- College of Science and Engineering, Flinders University, Adelaide, South Australia, Australia
- Australian Museum Research Institute, Sydney, New South Wales, Australia
| | - Simon Y W Ho
- School of Life and Environmental Sciences, University of Sydney, Sydney, New South Wales, Australia
| | - Brant C Faircloth
- Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
| | - Julia Haag
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Peter Houde
- Department of Biology, New Mexico State University, Las Cruces, NM, USA
| | - Joel Cracraft
- Department of Ornithology, American Museum of Natural History, New York, NY, USA
| | - Metin Balaban
- Bioinformatics and Systems Biology Graduate Program, University of California San Diego, La Jolla, CA, USA
| | - Uyen Mai
- Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
| | - Guangji Chen
- BGI Research, Wuhan, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Rongsheng Gao
- BGI Research, Wuhan, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | | | - Yulong Xie
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Zijian Huang
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Zhen Cao
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Zhi Yan
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Huw A Ogilvie
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, TX, USA
| | - Bent Lindow
- Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Benoit Morel
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - Jon Fjeldså
- Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Peter A Hosner
- Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
- Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Rute R da Fonseca
- Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Bent Petersen
- Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Centre of Excellence for Omics-Driven Computational Biodiscovery, Faculty of Applied Sciences, AIMST University, Bedong, Malaysia
| | - Joseph A Tobias
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, UK
| | - Tamás Székely
- Milner Centre for Evolution, University of Bath, Bath, UK
- ELKH-DE Reproductive Strategies Research Group, University of Debrecen, Debrecen, Hungary
| | - Jonathan David Kennedy
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
| | - Andrew Hart Reeve
- Natural History Museum Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Andras Liker
- HUN-REN-PE Evolutionary Ecology Research Group, University of Pannonia, Veszprém, Hungary
- Behavioural Ecology Research Group, Center for Natural Sciences, University of Pannonia, Veszprém, Hungary
| | | | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Porto, Portugal
| | | | - Mads F Bertelsen
- Centre for Zoo and Wild Animal Health, Copenhagen Zoo, Frederiksberg, Denmark
| | - Fumin Lei
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Science, University of Chinese Academy of Sciences, Beijing, China
| | - Carsten Rahbek
- Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Institute of Ecology, Peking University, Beijing, China
- Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
| | - Gary R Graves
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | | | - Tandy Warnow
- University of Illinois Urbana-Champaign, Champaign, IL, USA
| | - Edward L Braun
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- University Museum, NTNU, Trondheim, Norway
| | - Erich D Jarvis
- Vertebrate Genome Lab, The Rockefeller University, New York, NY, USA
- Howard Hughes Medical Institute, Durham, NC, USA
| | | | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Liangzhu Laboratory & Women's Hospital, Zhejiang University School of Medicine, Hangzhou, China.
- Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, China.
- BGI Research, Wuhan, China.
- Villum Center for Biodiversity Genomics, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
2
|
Dietz L, Mayer C, Stolle E, Eberle J, Misof B, Podsiadlowski L, Niehuis O, Ahrens D. Metazoa-level USCOs as markers in species delimitation and classification. Mol Ecol Resour 2024; 24:e13921. [PMID: 38146909 DOI: 10.1111/1755-0998.13921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Revised: 12/06/2023] [Accepted: 12/13/2023] [Indexed: 12/27/2023]
Abstract
Metazoa-level universal single-copy orthologs (mzl-USCOs) are universally applicable markers for DNA taxonomy in animals that can replace or supplement single-gene barcodes. Previously, mzl-USCOs from target enrichment data were shown to reliably distinguish species. Here, we tested whether USCOs are an evenly distributed, representative sample of a given metazoan genome and therefore able to cope with past hybridization events and incomplete lineage sorting. This is relevant for coalescent-based species delimitation approaches, which critically depend on the assumption that the investigated loci do not exhibit autocorrelation due to physical linkage. Based on 239 chromosome-level assembled genomes, we confirmed that mzl-USCOs are genetically unlinked for practical purposes and a representative sample of a genome in terms of reciprocal distances between USCOs on a chromosome and of distribution across chromosomes. We tested the suitability of mzl-USCOs extracted from genomes for species delimitation and phylogeny in four case studies: Anopheles mosquitos, Drosophila fruit flies, Heliconius butterflies and Darwin's finches. In almost all instances, USCOs allowed delineating species and yielded phylogenies that corresponded to those generated from whole genome data. Our phylogenetic analyses demonstrate that USCOs may complement single-gene DNA barcodes and provide more accurate taxonomic inferences. Combining USCOs from sources that used different versions of ortholog reference libraries to infer marker orthology may be challenging and, at times, impact taxonomic conclusions. However, we expect this problem to become less severe as the rapidly growing number of reference genomes provides a better representation of the number and diversity of organismal lineages.
Collapse
Affiliation(s)
- Lars Dietz
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Christoph Mayer
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Eckart Stolle
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Jonas Eberle
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
- Paris-Lodron-University, Salzburg, Austria
| | - Bernhard Misof
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
- Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Lars Podsiadlowski
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| | - Oliver Niehuis
- Abt. Evolutionsbiologie und Ökologie, Institut für Biologie I, Albert-Ludwigs-Universität Freiburg, Freiburg, Germany
| | - Dirk Ahrens
- Museum A. Koenig, Leibniz Institute for the Analysis of Biodiversity Change, Bonn, Germany
| |
Collapse
|
3
|
DeSalle R, Narechania A, Tessler M. Multiple Outgroups Can Cause Random Rooting in Phylogenomics. Mol Phylogenet Evol 2023; 184:107806. [PMID: 37172862 DOI: 10.1016/j.ympev.2023.107806] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 02/06/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]
Abstract
Outgroup selection has been a major challenge since the rise of phylogenetics, and it has remained so in the phylogenomic era. Our goal here is to use large phylogenomic animal datasets to examine the impact of outgroup selection on the final topology. The results of our analyses further solidify the fact that distant outgroups can cause random rooting, and that this holds for concatenated and coalescent-based methods. The results also indicate that the standard practice of using multiple outgroups often causes random rooting. Most researchers go out of their way to get multiple outgroups, as this has been standard practice for decades. Based on our findings, this practice should stop. Instead, our results suggest that a single (most closely) related relative should be selected as the outgroup, unless all outgroups are roughly equally closely related to the ingroup.
Collapse
Affiliation(s)
- Rob DeSalle
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA; Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Apurva Narechania
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Michael Tessler
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA; Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA; St. Francis College, Department of Biology, Brooklyn, NY 11201, USA
| |
Collapse
|
4
|
Spaulding F, McLaughlin JF, Cheek RG, McCracken KG, Glenn TC, Winker K. Population genomics indicate three different modes of divergence and speciation with gene flow in the green-winged teal duck complex. Mol Phylogenet Evol 2023; 182:107733. [PMID: 36801373 PMCID: PMC10092703 DOI: 10.1016/j.ympev.2023.107733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 01/31/2023] [Accepted: 02/09/2023] [Indexed: 02/18/2023]
Abstract
The processes leading to divergence and speciation can differ broadly among taxa with different life histories. We examine these processes in a small clade of ducks with historically uncertain relationships and species limits. The green-winged teal (Anas crecca) complex is a Holarctic species of dabbling duck currently categorized as three subspecies (Anas crecca crecca, A. c. nimia, and A. c. carolinensis) with a close relative, the yellow-billed teal (Anas flavirostris) from South America. A. c. crecca and A. c. carolinensis are seasonal migrants, while the other taxa are sedentary. We examined divergence and speciation patterns in this group, determining their phylogenetic relationships and the presence and levels of gene flow among lineages using both mitochondrial and genome-wide nuclear DNA obtained from 1,393 ultraconserved element (UCE) loci. Phylogenetic relationships using nuclear DNA among these taxa showed A. c. crecca, A. c. nimia, and A. c. carolinensis clustering together to form one polytomous clade, with A. flavirostris sister to this clade. This relationship can be summarized as (crecca, nimia, carolinensis)(flavirostris). However, whole mitogenomes revealed a different phylogeny: (crecca, nimia)(carolinensis, flavirostris). The best demographic model for key pairwise comparisons supported divergence with gene flow as the probable speciation mechanism in all three contrasts (crecca-nimia, crecca-carolinensis, and carolinensis-flavirostris). Given prior work, gene flow was expected among the Holarctic taxa, but gene flow between North American carolinensis and South American flavirostris (M ∼0.1-0.4 individuals/generation), albeit low, was not expected. Three geographically oriented modes of divergence are likely involved in the diversification of this complex: heteropatric (crecca-nimia), parapatric (crecca-carolinensis), and (mostly) allopatric (carolinensis-flavirostris). Our study shows that ultraconserved elements are a powerful tool for simultaneously studying systematics and population genomics in systems with historically uncertain relationships and species limits.
Collapse
Affiliation(s)
- Fern Spaulding
- University of Alaska Museum, University of Alaska Fairbanks, Fairbanks, AK, USA; Department of Biology and Wildlife, University of Alaska Fairbanks, Fairbanks, AK, USA.
| | - Jessica F McLaughlin
- Department of Environmental Science, Policy, and Management, University of California Berkeley, Berkeley, CA, USA
| | - Rebecca G Cheek
- Graduate Degree Program in Ecology, Department of Biology, Colorado State University, Fort Collins, CO, USA
| | - Kevin G McCracken
- University of Alaska Museum, University of Alaska Fairbanks, Fairbanks, AK, USA; Department of Biology, University of Miami, Coral Gables, FL, USA
| | - Travis C Glenn
- Department of Environmental Health Science, University of Georgia, Athens, GA, USA
| | - Kevin Winker
- University of Alaska Museum, University of Alaska Fairbanks, Fairbanks, AK, USA; Department of Biology and Wildlife, University of Alaska Fairbanks, Fairbanks, AK, USA
| |
Collapse
|
5
|
Hill M, Roch S. Inconsistency of Triplet-Based and Quartet-Based Species Tree Estimation under Intralocus Recombination. J Comput Biol 2022; 29:1173-1197. [PMID: 36048557 DOI: 10.1089/cmb.2022.0265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We consider species tree estimation from multiple loci subject to intralocus recombination. We focus on R∗, a summary coalescent-based method using rooted triplets, as well as a related quartet-based inference method. We demonstrate analytically that in both cases, intralocus recombination gives rise to an inconsistency zone, in which correct inference is not assured even in the limit of infinite amount of data. In addition, we validate and characterize this inconsistency zone through a simulation study, which suggests that differential rates of recombination between closely related taxa can amplify the effect of incomplete lineage sorting and contribute to inconsistency.
Collapse
Affiliation(s)
- Max Hill
- Department of Mathematics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Sebastien Roch
- Department of Mathematics, University of Wisconsin-Madison, Madison, Wisconsin, USA
| |
Collapse
|
6
|
Černý D, Natale R. Comprehensive taxon sampling and vetted fossils help clarify the time tree of shorebirds (Aves, Charadriiformes). Mol Phylogenet Evol 2022; 177:107620. [PMID: 36038056 DOI: 10.1016/j.ympev.2022.107620] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 06/03/2022] [Accepted: 08/17/2022] [Indexed: 01/20/2023]
Abstract
Shorebirds (Charadriiformes) are a globally distributed clade of modern birds and, due to their ecological and morphological disparity, a frequent subject of comparative studies. While molecular phylogenies have been key to establishing the suprafamilial backbone of the charadriiform tree, a number of relationships at both deep and shallow taxonomic levels remain poorly resolved. The timescale of shorebird evolution also remains uncertain as a result of extensive disagreements among the published divergence dating studies, stemming largely from different choices of fossil calibrations. Here, we present the most comprehensive non-supertree phylogeny of shorebirds to date, based on a total-evidence dataset comprising 353 ingroup taxa (90% of all extant or recently extinct species), 27 loci (15 mitochondrial and 12 nuclear), and 69 morphological characters. We further clarify the timeline of charadriiform evolution by time-scaling this phylogeny using a set of 14 up-to-date and thoroughly vetted fossil calibrations. In addition, we assemble a taxonomically restricted 100-locus dataset specifically designed to resolve outstanding problems in higher-level charadriiform phylogeny. In terms of tree topology, our results are largely congruent with previous studies but indicate that some of the conflicts among earlier analyses reflect a genuine signal of pervasive gene tree discordance. Monophyly of the plovers (Charadriidae), the position of the ibisbill (Ibidorhyncha), and the relationships among the five subfamilies of the gulls (Laridae) could not be resolved even with greatly increased locus and taxon sampling. Moreover, several localized regions of uncertainty persist in shallower parts of the tree, including the interrelationships of the true auks (Alcinae) and anarhynchine plovers. Our node-dating and macroevolutionary rate analyses find support for a Paleocene origin of crown-group shorebirds, as well as exceptionally rapid recent radiations of Old World oystercatchers (Haematopodidae) and select genera of gulls. Our study underscores the challenges involved in estimating a comprehensively sampled and carefully calibrated time tree for a diverse avian clade, and highlights areas in need of further research.
Collapse
Affiliation(s)
- David Černý
- Department of the Geophysical Sciences, University of Chicago, Chicago 60637, USA.
| | - Rossy Natale
- Department of Organismal Biology & Anatomy, University of Chicago, Chicago 60637, USA
| |
Collapse
|
7
|
Gatesy J, Springer MS. Phylogenomic Coalescent Analyses of Avian Retroelements Infer Zero-Length Branches at the Base of Neoaves, Emergent Support for Controversial Clades, and Ancient Introgressive Hybridization in Afroaves. Genes (Basel) 2022; 13:1167. [PMID: 35885951 PMCID: PMC9324441 DOI: 10.3390/genes13071167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 01/25/2023] Open
Abstract
Retroelement insertions (RIs) are low-homoplasy characters that are ideal data for addressing deep evolutionary radiations, where gene tree reconstruction errors can severely hinder phylogenetic inference with DNA and protein sequence data. Phylogenomic studies of Neoaves, a large clade of birds (>9000 species) that first diversified near the Cretaceous−Paleogene boundary, have yielded an array of robustly supported, contradictory relationships among deep lineages. Here, we reanalyzed a large RI matrix for birds using recently proposed quartet-based coalescent methods that enable inference of large species trees including branch lengths in coalescent units, clade-support, statistical tests for gene flow, and combined analysis with DNA-sequence-based gene trees. Genome-scale coalescent analyses revealed extremely short branches at the base of Neoaves, meager branch support, and limited congruence with previous work at the most challenging nodes. Despite widespread topological conflicts with DNA-sequence-based trees, combined analyses of RIs with thousands of gene trees show emergent support for multiple higher-level clades (Columbea, Passerea, Columbimorphae, Otidimorphae, Phaethoquornithes). RIs express asymmetrical support for deep relationships within the subclade Afroaves that hints at ancient gene flow involving the owl lineage (Strigiformes). Because DNA-sequence data are challenged by gene tree-reconstruction error, analysis of RIs represents one approach for improving gene tree-based methods when divergences are deep, internodes are short, terminal branches are long, and introgressive hybridization further confounds species−tree inference.
Collapse
Affiliation(s)
- John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Mark S. Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA 92521, USA;
| |
Collapse
|
8
|
Annotation-free delineation of prokaryotic homology groups. PLoS Comput Biol 2022; 18:e1010216. [PMID: 35675326 PMCID: PMC9212150 DOI: 10.1371/journal.pcbi.1010216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Revised: 06/21/2022] [Accepted: 05/16/2022] [Indexed: 11/19/2022] Open
Abstract
Phylogenomic studies of prokaryotic taxa often assume conserved marker genes are homologous across their length. However, processes such as horizontal gene transfer or gene duplication and loss may disrupt this homology by recombining only parts of genes, causing gene fission or fusion. We show using simulation that it is necessary to delineate homology groups in a set of bacterial genomes without relying on gene annotations to define the boundaries of homologous regions. To solve this problem, we have developed a graph-based algorithm to partition a set of bacterial genomes into Maximal Homologous Groups of sequences (MHGs) where each MHG is a maximal set of maximum-length sequences which are homologous across the entire sequence alignment. We applied our algorithm to a dataset of 19 Enterobacteriaceae species and found that MHGs cover much greater proportions of genomes than markers and, relatedly, are less biased in terms of the functions of the genes they cover. We zoomed in on the correlation between each individual marker and their overlapping MHGs, and show that few phylogenetic splits supported by the markers are supported by the MHGs while many marker-supported splits are contradicted by the MHGs. A comparison of the species tree inferred from marker genes with the species tree inferred from MHGs suggests that the increased bias and lack of genome coverage by markers causes incorrect inferences as to the overall relationship between bacterial taxa. Assuming genes to be the basic evolutionary unit has been commonplace in bacterial genomics. For example, when quantifying the extent of horizontal gene transfer it is common to infer gene trees and reconcile them against a species tree to account for recombination-based processes. We have developed a new method which challenges this assumption by identifying contiguous regions of true homology without regards to gene boundaries and applied it to Enterobacteriaceae, a family of bacteria containing several important human pathogens. Our results show that genes are composed of distinct homologous regions with conflicting phylogenetic histories. We further demonstrate that failing to take account of this conflict, together with the functional biases we show exist among single-copy marker genes, significantly changes the consensus evolutionary tree of Enterobacteriaceae.
Collapse
|
9
|
Doronina L, Hughes GM, Moreno-Santillan D, Lawless C, Lonergan T, Ryan L, Jebb D, Kirilenko BM, Korstian JM, Dávalos LM, Vernes SC, Myers EW, Teeling EC, Hiller M, Jermiin LS, Schmitz J, Springer MS, Ray DA. Contradictory Phylogenetic Signals in the Laurasiatheria Anomaly Zone. Genes (Basel) 2022; 13:766. [PMID: 35627151 PMCID: PMC9141728 DOI: 10.3390/genes13050766] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 04/12/2022] [Accepted: 04/21/2022] [Indexed: 02/04/2023] Open
Abstract
Relationships among laurasiatherian clades represent one of the most highly disputed topics in mammalian phylogeny. In this study, we attempt to disentangle laurasiatherian interordinal relationships using two independent genome-level approaches: (1) quantifying retrotransposon presence/absence patterns, and (2) comparisons of exon datasets at the levels of nucleotides and amino acids. The two approaches revealed contradictory phylogenetic signals, possibly due to a high level of ancestral incomplete lineage sorting. The positions of Eulipotyphla and Chiroptera as the first and second earliest divergences were consistent across the approaches. However, the phylogenetic relationships of Perissodactyla, Cetartiodactyla, and Ferae, were contradictory. While retrotransposon insertion analyses suggest a clade with Cetartiodactyla and Ferae, the exon dataset favoured Cetartiodactyla and Perissodactyla. Future analyses of hitherto unsampled laurasiatherian lineages and synergistic analyses of retrotransposon insertions, exon and conserved intron/intergenic sequences might unravel the conflicting patterns of relationships in this major mammalian clade.
Collapse
Affiliation(s)
- Liliya Doronina
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| | - Graham M. Hughes
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
| | - Diana Moreno-Santillan
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA; (D.M.-S.); (J.M.K.)
- Department of Integrative Biology, University of California, Berkeley, CA 92697, USA
| | - Colleen Lawless
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
| | - Tadhg Lonergan
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
| | - Louise Ryan
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
| | - David Jebb
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany; (D.J.); (E.W.M.)
- Max Planck Institute for the Physics of Complex Systems, 01187 Dresden, Germany
- Center for Systems Biology Dresden, 01307 Dresden, Germany
| | - Bogdan M. Kirilenko
- LOEWE Centre for Translational Biodiversity Genomics, 60325 Frankfurt, Germany; (B.M.K.); (M.H.)
- Senckenberg Research Institute, 60325 Frankfurt, Germany
- Faculty of Biosciences, Goethe-University, 60438 Frankfurt, Germany
| | - Jennifer M. Korstian
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA; (D.M.-S.); (J.M.K.)
| | - Liliana M. Dávalos
- Department of Ecology and Evolution and Consortium for Inter—Disciplinary Environmental Research, Stony Brook University, Stony Brook, NY 11794, USA;
| | - Sonja C. Vernes
- School of Biology, The University of St Andrews, St Andrews KY16 9ST, UK;
- Neurogenetics of Vocal Communication Group, Max Planck Institute for Psycholinguistics, 6525 Nijmegen, The Netherlands
| | - Eugene W. Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany; (D.J.); (E.W.M.)
- Faculty of Computer Science, Technical University Dresden, 01307 Dresden, Germany
- The Okinawa Institute of Science and Technology, Okinawa 904-0495, Japan
| | - Emma C. Teeling
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
| | - Michael Hiller
- LOEWE Centre for Translational Biodiversity Genomics, 60325 Frankfurt, Germany; (B.M.K.); (M.H.)
- Senckenberg Research Institute, 60325 Frankfurt, Germany
- Faculty of Biosciences, Goethe-University, 60438 Frankfurt, Germany
| | - Lars S. Jermiin
- School of Biology and Environmental Science, University College Dublin, Belfield, D04 V1W8 Dublin, Ireland; (C.L.); (T.L.); (L.R.); (E.C.T.); (L.S.J.)
- Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
- Earth Institute, University College Dublin, D04 V1W8 Dublin, Ireland
| | - Jürgen Schmitz
- Institute of Experimental Pathology, ZMBE, University of Münster, 48149 Münster, Germany;
| | - Mark S. Springer
- Department of Evolution, Ecology and Organismal Biology, University of California, Riverside, CA 92521, USA;
| | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, TX 79409, USA; (D.M.-S.); (J.M.K.)
| |
Collapse
|
10
|
Gagnon E, Hilgenhof R, Orejuela A, McDonnell A, Sablok G, Aubriot X, Giacomin L, Gouvêa Y, Bragionis T, Stehmann JR, Bohs L, Dodsworth S, Martine C, Poczai P, Knapp S, Särkinen T. Phylogenomic discordance suggests polytomies along the backbone of the large genus Solanum. AMERICAN JOURNAL OF BOTANY 2022; 109:580-601. [PMID: 35170754 PMCID: PMC9321964 DOI: 10.1002/ajb2.1827] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 12/14/2021] [Indexed: 05/13/2023]
Abstract
PREMISE Evolutionary studies require solid phylogenetic frameworks, but increased volumes of phylogenomic data have revealed incongruent topologies among gene trees in many organisms both between and within genomes. Some of these incongruences indicate polytomies that may remain impossible to resolve. Here we investigate the degree of gene-tree discordance in Solanum, one of the largest flowering plant genera that includes the cultivated potato, tomato, and eggplant, as well as 24 minor crop plants. METHODS A densely sampled species-level phylogeny of Solanum is built using unpublished and publicly available Sanger sequences comprising 60% of all accepted species (742 spp.) and nine regions (ITS, waxy, and seven plastid markers). The robustness of this topology is tested by examining a full plastome dataset with 140 species and a nuclear target-capture dataset with 39 species of Solanum (Angiosperms353 probe set). RESULTS While the taxonomic framework of Solanum remained stable, gene tree conflicts and discordance between phylogenetic trees generated from the target-capture and plastome datasets were observed. The latter correspond to regions with short internodal branches, and network analysis and polytomy tests suggest the backbone is composed of three polytomies found at different evolutionary depths. The strongest area of discordance, near the crown node of Solanum, could potentially represent a hard polytomy. CONCLUSIONS We argue that incomplete lineage sorting due to rapid diversification is the most likely cause for these polytomies, and that embracing the uncertainty that underlies them is crucial to understand the evolution of large and rapidly radiating lineages.
Collapse
Affiliation(s)
- Edeline Gagnon
- Royal Botanic Garden Edinburgh20A Inverleith RowEdinburghEH3 5LRUK
- School of Biological SciencesUniversity of EdinburghKing's Buildings, Mayfield RoadEdinburghEH9 3JHUK
| | - Rebecca Hilgenhof
- Royal Botanic Garden Edinburgh20A Inverleith RowEdinburghEH3 5LRUK
- School of Biological SciencesUniversity of EdinburghKing's Buildings, Mayfield RoadEdinburghEH9 3JHUK
| | - Andrés Orejuela
- Royal Botanic Garden Edinburgh20A Inverleith RowEdinburghEH3 5LRUK
- School of Biological SciencesUniversity of EdinburghKing's Buildings, Mayfield RoadEdinburghEH9 3JHUK
| | - Angela McDonnell
- Negaunee Institute for Plant Conservation Science and ActionChicago Botanic Garden, 1000 Lake Cook RdGlencoeIllinois60022USA
| | - Gaurav Sablok
- Finnish Museum of Natural History (Botany Unit)University of HelsinkiPO Box 7 FI‐00014HelsinkiFinland
- Organismal and Evolutionary Biology Research Programme (OEB)Viikki Plant Science Centre (ViPS)PO Box 65, FI‐00014 University of HelsinkiFinland
| | - Xavier Aubriot
- Université Paris‐Saclay, CNRS, AgroParisTech, ÉcologieSystématique et ÉvolutionOrsay91405France
| | - Leandro Giacomin
- Instituto de Ciências e Tecnologia das Águas & Herbário HSTMUniversidade Federal do Oeste do Pará, Rua Vera Paz, sn, Santarém, CEP 68040‐255PABrazil
| | - Yuri Gouvêa
- Departamento de Botânica, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais—UFMGAv. Antônio Carlos, 6627, Pampulha, Belo Horizonte, CEP 31270‐901MGBrazil
| | - Thamyris Bragionis
- Departamento de Botânica, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais—UFMGAv. Antônio Carlos, 6627, Pampulha, Belo Horizonte, CEP 31270‐901MGBrazil
| | - João Renato Stehmann
- Departamento de Botânica, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais—UFMGAv. Antônio Carlos, 6627, Pampulha, Belo Horizonte, CEP 31270‐901MGBrazil
| | - Lynn Bohs
- Department of BiologyUniversity of UtahSalt Lake CityUtah84112USA
| | - Steven Dodsworth
- School of Life SciencesUniversity of Bedfordshire, University SquareLutonLU1 3JUUK
- Royal Botanic Gardens, Kew, RichmondSurreyTW9 3AEUK
| | | | - Péter Poczai
- Finnish Museum of Natural History (Botany Unit)University of HelsinkiPO Box 7 FI‐00014HelsinkiFinland
- Faculity of Environmental and Biological SciencesUniversity of HelsinkiFI‐00014Finland
| | - Sandra Knapp
- Department of Life SciencesNatural History MuseumCromwell RoadLondonSW7 5BDUK
| | - Tiina Särkinen
- Royal Botanic Garden Edinburgh20A Inverleith RowEdinburghEH3 5LRUK
| |
Collapse
|
11
|
When good mitochondria go bad: Cyto-nuclear discordance in landfowl (Aves: Galliformes). Gene 2021; 801:145841. [PMID: 34274481 DOI: 10.1016/j.gene.2021.145841] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 06/10/2021] [Accepted: 07/13/2021] [Indexed: 11/22/2022]
Abstract
Mitochondrial sequences were among the first molecular data collected for phylogenetic studies and they are plentiful in DNA sequence archives. However, the future value of mitogenomic data in phylogenetics is uncertain, because its phylogenetic signal sometimes conflicts with that of the nuclear genome. A thorough understanding of the causes and prevalence of cyto-nuclear discordance would aid in reconciling different results owing to sequence data type, and provide a framework for interpreting megaphylogenies when taxa which lack substantial nuclear data are placed using mitochondrial data. Here, we examine the prevalence and possible causes of cyto-nuclear discordance in the landfowl (Aves: Galliformes), leveraging 47 new mitogenomes assembled from off-target reads recovered as part of a target-capture study. We evaluated two hypotheses, that cyto-nuclear discordance is "genuine" and a result of biological processes such as incomplete lineage sorting or introgression, and that cyto-nuclear discordance is an artifact of inaccurate mitochondrial tree estimation (the "inaccurate estimation" hypothesis). We identified seven well-supported topological differences between the mitogenomic tree and trees based on nuclear data. These well-supported topological differences were robust to model selection. An examination of sites suggests these differences were driven by small number of sites, particularly from third-codon positions, suggesting that they were not confounded by convergent directional selection. Hence, the hypothesis of genuine discordance was supported.
Collapse
|
12
|
Doyle JJ. Defining coalescent genes: Theory meets practice in organelle phylogenomics. Syst Biol 2021; 71:476-489. [PMID: 34191012 DOI: 10.1093/sysbio/syab053] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 06/24/2021] [Accepted: 06/28/2021] [Indexed: 11/13/2022] Open
Abstract
The species tree paradigm that dominates current molecular systematic practice infers species trees from collections of sequences under assumptions of the multispecies coalescent (MSC), i.e., that there is free recombination between the sequences and no (or very low) recombination within them. These coalescent genes (c-genes) are thus defined in an historical rather than molecular sense, and can in theory be as large as an entire genome or as small as a single nucleotide. A debate about how to define c-genes centers on the contention that nuclear gene sequences used in many coalescent analyses undergo too much recombination, such that their introns comprise multiple c-genes, violating a key assumption of the MSC. Recently a similar argument has been made for the genes of plastid (e.g., chloroplast) and mitochondrial genomes, which for the last 30 or more years have been considered to represent a single c-gene for the purposes of phylogeny reconstruction because they are non-recombining in a historical sense. Consequently, it has been suggested that these genomes should be analyzed using coalescent methods that treat their genes-over 70 protein-coding genes in the case of most plastid genomes (plastomes)-as independent estimates of species phylogeny, in contrast to the usual practice of concatenation, which is appropriate for generating gene trees. However, although recombination certainly occurs in the plastome, as has been recognized since the 1970's, it is unlikely to be phylogenetically relevant. This is because such historically effective recombination can only occur when plastomes with incongruent histories are brought together in the same plastid. However, plastids sort rapidly into different cell lineages and rarely fuse. Thus, because of plastid biology, the plastome is a more canonical c-gene than is the average multi-intron mammalian nuclear gene. The plastome should thus continue to be treated as a single estimate of the underlying species phylogeny, as should the mitochondrial genome. The implications of this long-held insight of molecular systematics for studies in the phylogenomic era are explored.
Collapse
Affiliation(s)
- Jeff J Doyle
- Plant Biology Section, Plant Breeding & Genetics Section, and L. H. Bailey Hortorium, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853 USA
| |
Collapse
|
13
|
Collapsing dubiously resolved gene-tree branches in phylogenomic coalescent analyses. Mol Phylogenet Evol 2021; 158:107092. [PMID: 33545272 DOI: 10.1016/j.ympev.2021.107092] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 12/30/2020] [Accepted: 01/28/2021] [Indexed: 01/15/2023]
Abstract
In two-step coalescent analyses of phylogenomic data, gene-tree topologies are treated as fixed prior to species-tree inference. Although all gene-tree conflict is assumed to be caused by lineage sorting when applying these methods, in empirical datasets much of the conflict can be caused by estimation error. Weakly supported and even arbitrarily resolved clades are important sources of this estimation error for gene trees inferred from few informative characters relative to the number of sampled terminals, and the resulting extraneous conflict among gene trees can negatively impact species-tree inference. In this study, we quantified the relative severity of alternative methods for collapsing gene-tree branches for seven empirical datasets and quantified their effects on species-tree inference. The branch-collapsing methods that we employed were based on the strict consensus of optimal topologies, various bootstrap thresholds, and 0% approximate likelihood ratio test (SH-like aLRT) support. Up to 86% of internal gene-tree branches are dubiously or arbitrarily resolved in reanalyses of these published phylogenomic datasets, and collapsing these branches increased inferred species-tree coalescent branch lengths by up to 455%. For two datasets, the longer inferred branch lengths sometimes impacted inference of anomaly-zone conditions. Although branch-collapsing methods did not consistently affect the species-tree topology, they often increased branch support. The more severe and clearly justified gene-tree branch-collapsing methods, which we recommend be broadly applied for two-step coalescent analyses, are use of the strict consensus in parsimony analyses and the collapse clades with 0% SH-like aLRT support in likelihood analyses. Collapsing dubiously or arbitrarily resolved branches in gene trees sometimes improved congruence between coalescent-based results and concatenation trees. In such cases, we contend that the resolution provided by concatenation should be preferred and that incomplete lineage sorting is a poor explanation for the initial conflict between phylogenetic approaches.
Collapse
|
14
|
Murphy WJ, Foley NM, Bredemeyer KR, Gatesy J, Springer MS. Phylogenomics and the Genetic Architecture of the Placental Mammal Radiation. Annu Rev Anim Biosci 2020; 9:29-53. [PMID: 33228377 DOI: 10.1146/annurev-animal-061220-023149] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The genomes of placental mammals are being sequenced at an unprecedented rate. Alignments of hundreds, and one day thousands, of genomes spanning the rich living and extinct diversity of species offer unparalleled power to resolve phylogenetic controversies, identify genomic innovations of adaptation, and dissect the genetic architecture of reproductive isolation. We highlight outstanding questions about the earliest phases of placental mammal diversification and the promise of newer methods, as well as remaining challenges, toward using whole genome data to resolve placental mammal phylogeny. The next phase of mammalian comparative genomics will see the completion and application of finished-quality, gapless genome assemblies from many ordinal lineages and closely related species. Interspecific comparisons between the most hypervariable genomic loci will likely reveal large, but heretofore mostly underappreciated, effects on population divergence, morphological innovation, and the origin of new species.
Collapse
Affiliation(s)
- William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Nicole M Foley
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Kevin R Bredemeyer
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Mark S Springer
- Department of Evolution, Ecology and Organismal Biology, University of California, Riverside, California 92521, USA
| |
Collapse
|
15
|
Li G, Figueiró HV, Eizirik E, Murphy WJ. Recombination-Aware Phylogenomics Reveals the Structured Genomic Landscape of Hybridizing Cat Species. Mol Biol Evol 2020; 36:2111-2126. [PMID: 31198971 PMCID: PMC6759079 DOI: 10.1093/molbev/msz139] [Citation(s) in RCA: 71] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Current phylogenomic approaches implicitly assume that the predominant phylogenetic signal within a genome reflects the true evolutionary history of organisms, without assessing the confounding effects of postspeciation gene flow that can produce a mosaic of phylogenetic signals that interact with recombinational variation. Here, we tested the validity of this assumption with a phylogenomic analysis of 27 species of the cat family, assessing local effects of recombination rate on species tree inference and divergence time estimation across their genomes. We found that the prevailing phylogenetic signal within the autosomes is not always representative of the most probable speciation history, due to ancient hybridization throughout felid evolution. Instead, phylogenetic signal was concentrated within regions of low recombination, and notably enriched within large X chromosome recombination cold spots that exhibited recurrent patterns of strong genetic differentiation and selective sweeps across mammalian orders. By contrast, regions of high recombination were enriched for signatures of ancient gene flow, and these sequences inflated crown-lineage divergence times by ∼40%. We conclude that existing phylogenomic approaches to infer the Tree of Life may be highly misleading without considering the genomic architecture of phylogenetic signal relative to recombination rate and its interplay with historical hybridization.
Collapse
Affiliation(s)
- Gang Li
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
| | - Henrique V Figueiró
- PUCRS, Escola de Ciências, Laboratory of Genomics and Molecular Biology, Porto Alegre, Brazil.,INCT-EECBio, Brazil
| | - Eduardo Eizirik
- PUCRS, Escola de Ciências, Laboratory of Genomics and Molecular Biology, Porto Alegre, Brazil.,INCT-EECBio, Brazil
| | - William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, TX
| |
Collapse
|
16
|
Springer MS, Molloy EK, Sloan DB, Simmons MP, Gatesy J. ILS-Aware Analysis of Low-Homoplasy Retroelement Insertions: Inference of Species Trees and Introgression Using Quartets. J Hered 2019; 111:147-168. [DOI: 10.1093/jhered/esz076] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 12/12/2019] [Indexed: 12/20/2022] Open
Abstract
Abstract
DNA sequence alignments have provided the majority of data for inferring phylogenetic relationships with both concatenation and coalescent methods. However, DNA sequences are susceptible to extensive homoplasy, especially for deep divergences in the Tree of Life. Retroelement insertions have emerged as a powerful alternative to sequences for deciphering evolutionary relationships because these data are nearly homoplasy-free. In addition, retroelement insertions satisfy the “no intralocus-recombination” assumption of summary coalescent methods because they are singular events and better approximate neutrality relative to DNA loci commonly sampled in phylogenomic studies. Retroelements have traditionally been analyzed with parsimony, distance, and network methods. Here, we analyze retroelement data sets for vertebrate clades (Placentalia, Laurasiatheria, Balaenopteroidea, Palaeognathae) with 2 ILS-aware methods that operate by extracting, weighting, and then assembling unrooted quartets into a species tree. The first approach constructs a species tree from retroelement bipartitions with ASTRAL, and the second method is based on split-decomposition with parsimony. We also develop a Quartet-Asymmetry test to detect hybridization using retroelements. Both ILS-aware methods recovered the same species-tree topology for each data set. The ASTRAL species trees for Laurasiatheria have consecutive short branch lengths in the anomaly zone whereas Palaeognathae is outside of this zone. For the Balaenopteroidea data set, which includes rorquals (Balaenopteridae) and gray whale (Eschrichtiidae), both ILS-aware methods resolved balaeonopterids as paraphyletic. Application of the Quartet-Asymmetry test to this data set detected 19 different quartets of species for which historical introgression may be inferred. Evidence for introgression was not detected in the other data sets.
Collapse
Affiliation(s)
- Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
| | - Erin K Molloy
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO
| | - Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO
| | - John Gatesy
- Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY
| |
Collapse
|
17
|
Springer MS, Foley NM, Brady PL, Gatesy J, Murphy WJ. Evolutionary Models for the Diversification of Placental Mammals Across the KPg Boundary. Front Genet 2019; 10:1241. [PMID: 31850081 PMCID: PMC6896846 DOI: 10.3389/fgene.2019.01241] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Accepted: 11/08/2019] [Indexed: 01/29/2023] Open
Abstract
Deciphering the timing of the placental mammal radiation is a longstanding problem in evolutionary biology, but consensus on the tempo and mode of placental diversification remains elusive. Nevertheless, an accurate timetree is essential for understanding the role of important events in Earth history (e.g., Cretaceous Terrestrial Revolution, KPg mass extinction) in promoting the taxonomic and ecomorphological diversification of Placentalia. Archibald and Deutschman described three competing models for the diversification of placental mammals, which are the Explosive, Long Fuse, and Short Fuse Models. More recently, the Soft Explosive Model and Trans-KPg Model have emerged as additional hypotheses for the placental radiation. Here, we review molecular and paleontological evidence for each of these five models including the identification of general problems that can negatively impact divergence time estimates. The Long Fuse Model has received more support from relaxed clock studies than any of the other models, but this model is not supported by morphological cladistic studies that position Cretaceous eutherians outside of crown Placentalia. At the same time, morphological cladistics has a poor track record of reconstructing higher-level relationships among the orders of placental mammals including the results of new pseudoextinction analyses that we performed on the largest available morphological data set for mammals (4,541 characters). We also examine the strengths and weaknesses of different timetree methods (node dating, tip dating, and fossilized birth-death dating) that may now be applied to estimate the timing of the placental radiation. While new methods such as tip dating are promising, they also have problems that must be addressed if these methods are to effectively discriminate among competing hypotheses for placental diversification. Finally, we discuss the complexities of timetree estimation when the signal of speciation times is impacted by incomplete lineage sorting (ILS) and hybridization. Not accounting for ILS results in dates that are older than speciation events. Hybridization, in turn, can result in dates than are younger or older than speciation dates. Disregarding this potential variation in "gene" history across the genome can distort phylogenetic branch lengths and divergence estimates when multiple unlinked genomic loci are combined together in a timetree analysis.
Collapse
Affiliation(s)
- Mark S. Springer
- Department of Evolution, Ecology, and Evolutionary Biology, University of California, Riverside, Riverside, CA, United States
| | - Nicole M. Foley
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, United States
| | - Peggy L. Brady
- Department of Evolution, Ecology, and Evolutionary Biology, University of California, Riverside, Riverside, CA, United States
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, United States
| | - William J. Murphy
- Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX, United States
| |
Collapse
|
18
|
Gatesy J, Sloan DB, Warren JM, Baker RH, Simmons MP, Springer MS. Partitioned coalescence support reveals biases in species-tree methods and detects gene trees that determine phylogenomic conflicts. Mol Phylogenet Evol 2019; 139:106539. [DOI: 10.1016/j.ympev.2019.106539] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Revised: 06/10/2019] [Accepted: 06/17/2019] [Indexed: 12/26/2022]
|
19
|
Halliday TJD, dos Reis M, Tamuri AU, Ferguson-Gow H, Yang Z, Goswami A. Rapid morphological evolution in placental mammals post-dates the origin of the crown group. Proc Biol Sci 2019; 286:20182418. [PMID: 30836875 PMCID: PMC6458320 DOI: 10.1098/rspb.2018.2418] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 02/12/2019] [Indexed: 12/28/2022] Open
Abstract
Resolving the timing and pattern of early placental mammal evolution has been confounded by conflict among divergence date estimates from interpretation of the fossil record and from molecular-clock dating studies. Despite both fossil occurrences and molecular sequences favouring a Cretaceous origin for Placentalia, no unambiguous Cretaceous placental mammal has been discovered. Investigating the differing patterns of evolution in morphological and molecular data reveals a possible explanation for this conflict. Here, we quantified the relationship between morphological and molecular rates of evolution. We show that, independent of divergence dates, morphological rates of evolution were slow relative to molecular evolution during the initial divergence of Placentalia, but substantially increased during the origination of the extant orders. The rapid radiation of placentals into a highly morphologically disparate Cenozoic fauna is thus not associated with the origin of Placentalia, but post-dates superordinal origins. These findings predict that early members of major placental groups may not be easily distinguishable from one another or from stem eutherians on the basis of skeleto-dental morphology. This result supports a Late Cretaceous origin of crown placentals with an ordinal-level adaptive radiation in the early Paleocene, with the high relative rate permitting rapid anatomical change without requiring unreasonably fast molecular evolutionary rates. The lack of definitive Cretaceous placental mammals may be a result of morphological similarity among stem and early crown eutherians, providing an avenue for reconciling the fossil record with molecular divergence estimates for Placentalia.
Collapse
Affiliation(s)
- Thomas J. D. Halliday
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
- School of Geography, Earth, and Environmental Science, University of Birmingham, Edgbaston B15 2TT, UK
| | - Mario dos Reis
- School of Biological and Chemical Sciences, Queen Mary University London, Mile End Road, London E1 4NS, UK
| | - Asif U. Tamuri
- Research IT Services, University College London, Gower Street, London WC1E 6BT, UK
- European Molecular Biology Laboratory, European Bioinformatics, Wellcome Genome Campus, Hinxton, Cambridgeshire CB10 1SD, UK
| | - Henry Ferguson-Gow
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| | - Ziheng Yang
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| | - Anjali Goswami
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
- Department of Earth Sciences, University College London, Gower Street, London WC1E 6BT, UK
- Faculty of Life Sciences, Natural History Museum, Cromwell Road, London SW9 5DJ, UK
| |
Collapse
|
20
|
Simmons MP, Sloan DB, Springer MS, Gatesy J. Gene-wise resampling outperforms site-wise resampling in phylogenetic coalescence analyses. Mol Phylogenet Evol 2019; 131:80-92. [DOI: 10.1016/j.ympev.2018.10.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 10/01/2018] [Indexed: 01/15/2023]
|
21
|
What are the roles of taxon sampling and model fit in tests of cyto-nuclear discordance using avian mitogenomic data? Mol Phylogenet Evol 2019; 130:132-142. [DOI: 10.1016/j.ympev.2018.10.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2018] [Revised: 09/11/2018] [Accepted: 10/09/2018] [Indexed: 11/23/2022]
|