51
|
Chen D, Hosner PA, Dittmann DL, O'Neill JP, Birks SM, Braun EL, Kimball RT. Divergence time estimation of Galliformes based on the best gene shopping scheme of ultraconserved elements. BMC Ecol Evol 2021; 21:209. [PMID: 34809586 PMCID: PMC8609756 DOI: 10.1186/s12862-021-01935-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 11/08/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Divergence time estimation is fundamental to understanding many aspects of the evolution of organisms, such as character evolution, diversification, and biogeography. With the development of sequence technology, improved analytical methods, and knowledge of fossils for calibration, it is possible to obtain robust molecular dating results. However, while phylogenomic datasets show great promise in phylogenetic estimation, the best ways to leverage the large amounts of data for divergence time estimation has not been well explored. A potential solution is to focus on a subset of data for divergence time estimation, which can significantly reduce the computational burdens and avoid problems with data heterogeneity that may bias results. RESULTS In this study, we obtained thousands of ultraconserved elements (UCEs) from 130 extant galliform taxa, including representatives of all genera, to determine the divergence times throughout galliform history. We tested the effects of different "gene shopping" schemes on divergence time estimation using a carefully, and previously validated, set of fossils. Our results found commonly used clock-like schemes may not be suitable for UCE dating (or other data types) where some loci have little information. We suggest use of partitioning (e.g., PartitionFinder) and selection of tree-like partitions may be good strategies to select a subset of data for divergence time estimation from UCEs. Our galliform time tree is largely consistent with other molecular clock studies of mitochondrial and nuclear loci. With our increased taxon sampling, a well-resolved topology, carefully vetted fossil calibrations, and suitable molecular dating methods, we obtained a high quality galliform time tree. CONCLUSIONS We provide a robust galliform backbone time tree that can be combined with more fossil records to further facilitate our understanding of the evolution of Galliformes and can be used as a resource for comparative and biogeographic studies in this group.
Collapse
Affiliation(s)
- De Chen
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Peter A Hosner
- Department of Biology, University of Florida, Gainesville, FL, USA
- Natural History Museum of Denmark and Center for Global Mountain Biodiversity, University of Copenhagen, Copenhagen, Denmark
| | - Donna L Dittmann
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
| | - John P O'Neill
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA, USA
| | - Sharon M Birks
- Burke Museum of Natural History and Culture, University of Washington, Seattle, WA, USA
| | - Edward L Braun
- Department of Biology, University of Florida, Gainesville, FL, USA
| | | |
Collapse
|
52
|
Simmons MP, Springer MS, Gatesy J. Gene-tree misrooting drives conflicts in phylogenomic coalescent analyses of palaeognath birds. Mol Phylogenet Evol 2021; 167:107344. [PMID: 34748873 DOI: 10.1016/j.ympev.2021.107344] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Revised: 10/08/2021] [Accepted: 11/02/2021] [Indexed: 10/19/2022]
Abstract
Phylogenomic analyses of ancient rapid radiations can produce conflicting results that are driven by differential sampling of taxa and characters as well as the limitations of alternative analytical methods. We re-examine basal relationships of palaeognath birds (ratites and tinamous) using recently published datasets of nucleotide characters from 20,850 loci as well as 4301 retroelement insertions. The original studies attributed conflicting resolutions of rheas in their inferred coalescent and concatenation trees to concatenation failing in the anomaly zone. By contrast, we find that the coalescent-based resolution of rheas is premised upon extensive gene-tree estimation errors. Furthermore, retroelement insertions contain much more conflict than originally reported and multiple insertion loci support the basal position of rheas found in concatenation trees, while none were reported in the original publication. We demonstrate how even remarkable congruence in phylogenomic studies may be driven by long-branch misplacement of a divergent outgroup, highly incongruent gene trees, differential taxon sampling that can result in gene-tree misrooting errors that bias species-tree inference, and gross homology errors. What was previously interpreted as broad, robustly supported corroboration for a single resolution in coalescent analyses may instead indicate a common bias that taints phylogenomic results across multiple genome-scale datasets. The updated retroelement dataset now supports a species tree with branch lengths that suggest an ancient anomaly zone, and both concatenation and coalescent analyses of the huge nucleotide datasets fail to yield coherent, reliable results in this challenging phylogenetic context.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA.
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA 92521, USA
| | - John Gatesy
- Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| |
Collapse
|
53
|
Bravo GA, Schmitt CJ, Edwards SV. What Have We Learned from the First 500 Avian Genomes? ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS 2021. [DOI: 10.1146/annurev-ecolsys-012121-085928] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The increased capacity of DNA sequencing has significantly advanced our understanding of the phylogeny of birds and the proximate and ultimate mechanisms molding their genomic diversity. In less than a decade, the number of available avian reference genomes has increased to over 500—approximately 5% of bird diversity—placing birds in a privileged position to advance the fields of phylogenomics and comparative, functional, and population genomics. Whole-genome sequence data, as well as indels and rare genomic changes, are further resolving the avian tree of life. The accumulation of bird genomes, increasingly with long-read sequence data, greatly improves the resolution of genomic features such as germline-restricted chromosomes and the W chromosome, and is facilitating the comparative integration of genotypes and phenotypes. Community-based initiatives such as the Bird 10,000 Genomes Project and Vertebrate Genome Project are playing a fundamental role in amplifying and coalescing a vibrant international program in avian comparative genomics.
Collapse
Affiliation(s)
- Gustavo A. Bravo
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138, USA;, ,
| | - C. Jonathan Schmitt
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138, USA;, ,
| | - Scott V. Edwards
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138, USA;, ,
| |
Collapse
|
54
|
Systematics of the rare Amazonian genus Eutrachelophis (Serpentes: Dipsadidae), with an emended diagnosis for Eutrachelophis papilio. ZOOL ANZ 2021. [DOI: 10.1016/j.jcz.2021.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
55
|
Protein Structure, Models of Sequence Evolution, and Data Type Effects in Phylogenetic Analyses of Mitochondrial Data: A Case Study in Birds. DIVERSITY 2021. [DOI: 10.3390/d13110555] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Phylogenomic analyses have revolutionized the study of biodiversity, but they have revealed that estimated tree topologies can depend, at least in part, on the subset of the genome that is analyzed. For example, estimates of trees for avian orders differ if protein-coding or non-coding data are analyzed. The bird tree is a good study system because the historical signal for relationships among orders is very weak, which should permit subtle non-historical signals to be identified, while monophyly of orders is strongly corroborated, allowing identification of strong non-historical signals. Hydrophobic amino acids in mitochondrially-encoded proteins, which are expected to be found in transmembrane helices, have been hypothesized to be associated with non-historical signals. We tested this hypothesis by comparing the evolution of transmembrane helices and extramembrane segments of mitochondrial proteins from 420 bird species, sampled from most avian orders. We estimated amino acid exchangeabilities for both structural environments and assessed the performance of phylogenetic analysis using each data type. We compared those relative exchangeabilities with values calculated using a substitution matrix for transmembrane helices estimated using a variety of nuclear- and mitochondrially-encoded proteins, allowing us to compare the bird-specific mitochondrial models with a general model of transmembrane protein evolution. To complement our amino acid analyses, we examined the impact of protein structure on patterns of nucleotide evolution. Models of transmembrane and extramembrane sequence evolution for amino acids and nucleotides exhibited striking differences, but there was no evidence for strong topological data type effects. However, incorporating protein structure into analyses of mitochondrially-encoded proteins improved model fit. Thus, we believe that considering protein structure will improve analyses of mitogenomic data, both in birds and in other taxa.
Collapse
|
56
|
Tilic E, Stiller J, Campos E, Pleijel F, Rouse GW. Phylogenomics resolves ambiguous relationships within Aciculata (Errantia, Annelida). Mol Phylogenet Evol 2021; 166:107339. [PMID: 34751138 DOI: 10.1016/j.ympev.2021.107339] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 10/23/2021] [Accepted: 10/27/2021] [Indexed: 10/20/2022]
Abstract
Aciculata (Eunicida + Phyllodocida) is among the largest clades of annelids, comprising almost half of the known diversity of all marine annelids. Despite the group's large size and biological importance, most phylogenomic studies on Annelida to date have had a limited sampling of this clade. The phylogenetic placement of many clades within Phyllodocida in particular has remained poorly understood. To resolve the relationships within Aciculata we conducted a large-scale phylogenomic analysis based on 24 transcriptomes (13 new), chosen to represent many family-ranked taxa that have never been included in a broad phylogenomic study. Our sampling also includes several enigmatic taxa with challenging phylogenetic placement, such as Histriobdella, Struwela, Lacydonia, Pilargis and the holopelagic worms Lopadorrhynchus, Travisiopsis and Tomopteris. Our robust phylogeny allows us to name and place some of these problematic clades and has significant implications on the systematics of the group. Within Eunicida we reinstate the names Eunicoidea and Oenonoidea. Within Phyllodocida we delineate Phyllodociformia, Glyceriformia, Nereidiformia, Nephtyiformia and Aphroditiformia. Phyllodociformia now includes: Lacydonia, Typhloscolecidae, Lopadorrhynchidae and Phyllodocidae. Nephtyiformia includes Nephtyidae and Pilargidae. We also broaden the delineation of Glyceriformia to include Sphaerodoridae, Tomopteridae and Glyceroidea (Glyceridae + Goniadidae). Furthermore, our study demonstrates and explores how conflicting, yet highly supported topologies can result from confounding signals in gene trees.
Collapse
Affiliation(s)
- Ekin Tilic
- Scripps Institution of Oceanography, UC San Diego, La Jolla, CA, USA; Institute of Evolutionary Biology and Animal Ecology, University of Bonn, Germany; Marine Biological Section, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark.
| | - Josefin Stiller
- Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, DK-2100 Copenhagen, Denmark
| | - Ernesto Campos
- Facultad de Ciencias, Universidad Autónoma de Baja California. Ensenada, Baja California, México
| | - Fredrik Pleijel
- Department of Marine Sciences, University of Gothenburg, Tjärnö, Sweden
| | - Greg W Rouse
- Scripps Institution of Oceanography, UC San Diego, La Jolla, CA, USA.
| |
Collapse
|
57
|
Abstract
The study of vocal production learning in birds is heavily biased towards oscine songbirds, making the songbird model the reference for comparative studies. However, as vocal learning was probably ancestral in songbirds, interspecific variations might all be variations on a single theme and need not be representative of the nature and characteristics of vocal learning in other bird groups. To assess the possible mechanisms of vocal learning and its evolution therefore requires knowledge about independently evolved incidences of vocal learning. This review examines the presence and nature of vocal production learning in non-songbirds. Using a broad definition of vocal learning and a comparative phylogenetic framework, I evaluate the evidence for vocal learning and its characteristics in non-oscine birds, including well-known vocal learners such as parrots and hummingbirds but also (putative) cases from other taxa. Despite the sometimes limited evidence, it is clear that vocal learning occurs in a range of different, non-related, taxa and can be caused by a variety of mechanisms. It is more widespread than often realized, calling for more systematic studies. Examining this variation may provide a window onto the evolution of vocal learning and increase the value of comparative research for understanding vocal learning in humans. This article is part of the theme issue ‘Vocal learning in animals and humans’.
Collapse
Affiliation(s)
- Carel Ten Cate
- Institute of Biology, Leiden University, PO Box 9505, 2300 RA Leiden, The Netherlands
| |
Collapse
|
58
|
Positive selection in noncoding genomic regions of vocal learning birds is associated with genes implicated in vocal learning and speech functions in humans. Genome Res 2021; 31:2035-2049. [PMID: 34667117 PMCID: PMC8559704 DOI: 10.1101/gr.275989.121] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 08/17/2021] [Indexed: 11/25/2022]
Abstract
Vocal learning, the ability to imitate sounds from conspecifics and the environment, is a key component of human spoken language and learned song in three independently evolved avian groups—oscine songbirds, parrots, and hummingbirds. Humans and each of these three bird clades exhibit specialized behavioral, neuroanatomical, and brain gene expression convergence related to vocal learning, speech, and song. To understand the evolutionary basis of vocal learning gene specializations and convergence, we searched for and identified accelerated genomic regions (ARs), a marker of positive selection, specific to vocal learning birds. We found avian vocal learner-specific ARs, and they were enriched in noncoding regions near genes with known speech functions or brain gene expression specializations in humans and vocal learning birds, including FOXP2, NEUROD6, ZEB2, and MEF2C, and near genes with major neurodevelopmental functions, including NR2F1, NRP2, and BCL11B. We also found enrichment near the SFARI class S genes associated with syndromic vocal communication forms of autism spectrum disorders. These findings reveal strong candidate noncoding regions near genes for the evolutionary adaptations that distinguish vocal learning species from their close vocal nonlearning relatives and provide further evidence of molecular convergence between birdsong and human spoken language.
Collapse
|
59
|
White ND, Batz ZA, Braun EL, Braun MJ, Carleton KL, Kimball RT, Swaroop A. A novel exome probe set captures phototransduction genes across birds (Aves) enabling efficient analysis of vision evolution. Mol Ecol Resour 2021; 22:587-601. [PMID: 34652059 DOI: 10.1111/1755-0998.13496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 08/17/2021] [Accepted: 08/20/2021] [Indexed: 11/27/2022]
Abstract
The diversity of avian visual phenotypes provides a framework for studying mechanisms of trait diversification generally, and the evolution of vertebrate vision, specifically. Previous research has focused on opsins, but to fully understand visual adaptation, we must study the complete phototransduction cascade (PTC). Here, we developed a probe set that captures exonic regions of 46 genes representing the PTC and other light responses. For a subset of species, we directly compared gene capture between our probe set and low-coverage whole genome sequencing (WGS), and we discuss considerations for choosing between these methods. Finally, we developed a unique strategy to avoid chimeric assembly by using "decoy" reference sequences. We successfully captured an average of 64% of our targeted exome in 46 species across 14 orders using the probe set and had similar recovery using the WGS data. Compared to WGS or transcriptomes, our probe set: (1) reduces sequencing requirements by efficiently capturing vision genes, (2) employs a simpler bioinformatic pipeline by limiting required assembly and negating annotation, and (3) eliminates the need for fresh tissues, enabling researchers to leverage existing museum collections. We then utilized our vision exome data to identify positively selected genes in two evolutionary scenarios-evolution of night vision in nocturnal birds and evolution of high-speed vision specific to manakins (Pipridae). We found parallel positive selection of SLC24A1 in both scenarios, implicating the alteration of rod response kinetics, which could improve color discrimination in dim light conditions and/or facilitate higher temporal resolution.
Collapse
Affiliation(s)
- Noor D White
- Neurobiology Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland, USA.,Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Behavior, Ecology, Evolution and Systematics Program, University of Maryland, College Park, Maryland, USA
| | - Zachary A Batz
- Neurobiology Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Edward L Braun
- Department of Biology, University of Florida, Gainesville, Florida, USA
| | - Michael J Braun
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Behavior, Ecology, Evolution and Systematics Program, University of Maryland, College Park, Maryland, USA.,Department of Biology, University of Maryland, College Park, Maryland, USA
| | - Karen L Carleton
- Behavior, Ecology, Evolution and Systematics Program, University of Maryland, College Park, Maryland, USA.,Department of Biology, University of Maryland, College Park, Maryland, USA
| | - Rebecca T Kimball
- Department of Biology, University of Florida, Gainesville, Florida, USA
| | - Anand Swaroop
- Neurobiology Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
60
|
Almeida FC, Porzecanski AL, Cracraft JL, Bertelli S. The evolution of tinamous (Palaeognathae: Tinamidae) in light of molecular and combined analyses. Zool J Linn Soc 2021. [DOI: 10.1093/zoolinnean/zlab080] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Abstract
The Neotropical tinamous are of particular interest in our efforts to understand the evolution of modern birds. They inhabit both forested and open environments and, although volant, have limited flight capabilities. Numerous studies have recognized the monophyly of tinamous and their relationships either as sister to the flightless ratites (ostriches, emus and their relatives) or within the ratites themselves. Despite the numerous bird phylogenies published recently, modern investigations of relationships within the tinamous themselves have been limited. Here, we present the first detailed phylogenetic analysis and divergence-date estimation including a significant number of tinamou species, both extant and fossil. The monophyly of most currently recognized polytypic genera is recovered with high support, with the exception of the paraphyletic Nothura and Nothoprocta. The traditional subdivision between those tinamous inhabiting open areas (Nothurinae) and forest environments (Tinaminae) is also confirmed. A temporal calibration of the resultant phylogeny estimates that the basal divergence of crown Tinamidae took place between 31 and 40 Mya.
Collapse
Affiliation(s)
- Francisca C Almeida
- Instituto de Ecología, Genética y Evolución (IEGEBA), Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET)/ Universidad de Buenos Aires (UBA), Ciudad Autónoma de Buenos Aires, Argentina
| | - Ana L Porzecanski
- American Museum of Natural History, 200 Central Park West, New York, NY, 10024-5102, USA
| | - Joel L Cracraft
- American Museum of Natural History, 200 Central Park West, New York, NY, 10024-5102, USA
| | - Sara Bertelli
- American Museum of Natural History, 200 Central Park West, New York, NY, 10024-5102, USA
- Fundación Miguel Lillo (FML), Miguel Lillo 251, 4000 San Miguel de Tucumán, Argentina
- Unidad Ejecutora Lillo (UEL) - Consejo Nacional de Investigaciones Científicas y Tecnológicas (CONICET), San Miguel de Tucumán, Tucumán, Argentina
| |
Collapse
|
61
|
Abstract
The reconstruction of evolutionary relationships among species is fundamental for our understanding of biodiversity. Today, evolutionary relationships are closely related with the depiction of the tree of life, and research on the topic is underpinned by methods in molecular phylogenetics that have grown in popularity since the 1960s. These methods depend on our understanding of how nucleotide or amino acid sequences evolve through time and in different lineages. Armed with this knowledge, researchers can make inferences about the relationships and amount of genomic divergence among species.
Collapse
Affiliation(s)
- David A Duchêne
- Centre for Evolutionary Hologenomics, University of Copenhagen, Øster Farimagsgade 5A, 1352 Copenhagen, Denmark.
| |
Collapse
|
62
|
Chafin TK, Douglas MR, Bangs MR, Martin BT, Mussmann SM, Douglas ME. Taxonomic Uncertainty and the Anomaly Zone: Phylogenomics Disentangle a Rapid Radiation to Resolve Contentious Species (Gila robusta Complex) in the Colorado River. Genome Biol Evol 2021; 13:evab200. [PMID: 34432005 PMCID: PMC8449829 DOI: 10.1093/gbe/evab200] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/19/2021] [Indexed: 12/18/2022] Open
Abstract
Species are indisputable units for biodiversity conservation, yet their delimitation is fraught with both conceptual and methodological difficulties. A classic example is the taxonomic controversy surrounding the Gila robusta complex in the lower Colorado River of southwestern North America. Nominal species designations were originally defined according to weakly diagnostic morphological differences, but these conflicted with subsequent genetic analyses. Given this ambiguity, the complex was re-defined as a single polytypic unit, with the proposed "threatened" status under the U.S. Endangered Species Act of two elements being withdrawn. Here we re-evaluated the status of the complex by utilizing dense spatial and genomic sampling (n = 387 and >22 k loci), coupled with SNP-based coalescent and polymorphism-aware phylogenetic models. In doing so, we found that all three species were indeed supported as evolutionarily independent lineages, despite widespread phylogenetic discordance. To juxtapose this discrepancy with previous studies, we first categorized those evolutionary mechanisms driving discordance, then tested (and subsequently rejected) prior hypotheses which argued phylogenetic discord in the complex was driven by the hybrid origin of Gila nigra. The inconsistent patterns of diversity we found within G. robusta were instead associated with rapid Plio-Pleistocene drainage evolution, with subsequent divergence within the "anomaly zone" of tree space producing ambiguities that served to confound prior studies. Our results not only support the resurrection of the three species as distinct entities but also offer an empirical example of how phylogenetic discordance can be categorized within other recalcitrant taxa, particularly when variation is primarily partitioned at the species level.
Collapse
Affiliation(s)
- Tyler K Chafin
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Colorado, USA
| | - Marlis R Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| | - Max R Bangs
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Department of Biological Science, Florida State University, Tallahassee, Florida, USA
| | - Bradley T Martin
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Global Campus, University of Arkansas, Fayetteville, Arkansas, USA
| | - Steven M Mussmann
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
- Southwestern Native Aquatic Resources and Recovery Center, U.S. Fish & Wildlife Service, Dexter, New Mexico, USA
| | - Michael E Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| |
Collapse
|
63
|
Asher RJ, Smith MR. Phylogenetic Signal and Bias in Paleontology. Syst Biol 2021; 71:986-1008. [PMID: 34469583 PMCID: PMC9248965 DOI: 10.1093/sysbio/syab072] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Revised: 08/24/2021] [Accepted: 08/27/2021] [Indexed: 11/12/2022] Open
Abstract
An unprecedented amount of evidence now illuminates the phylogeny of living mammals and birds on the Tree of Life. We use this tree to measure phylogenetic value of data typically used in paleontology (bones and teeth) from six datasets derived from five published studies. We ask three interrelated questions: 1) Can these data adequately reconstruct known parts of the Tree of Life? 2) Is accuracy generally similar for studies using morphology, or do some morphological datasets perform better than others? 3) Does the loss of non-fossilizable data cause taxa to occur in misleadingly basal positions? Adding morphology to DNA datasets usually increases congruence of resulting topologies to the well corroborated tree, but this varies among morphological datasets. Extant taxa with a high proportion of missing morphological characters can greatly reduce phylogenetic resolution when analyzed together with fossils. Attempts to ameliorate this by deleting extant taxa missing morphology are prone to decreased accuracy due to long-branch artefacts. We find no evidence that fossilization causes extinct taxa to incorrectly appear at or near topologically basal branches. Morphology comprises the evidence held in common by living taxa and fossils, and phylogenetic analysis of fossils greatly benefits from inclusion of molecular and morphological data sampled for living taxa, whatever methods are used for phylogeny estimation.
Collapse
Affiliation(s)
- Robert J Asher
- Department of Zoology, Downing St., University of Cambridge CB2 3EJ, UK
| | - Martin R Smith
- Department of Earth Sciences, Lower Mount Joy, Durham University, Durham DH1 3LE, UK
| |
Collapse
|
64
|
Literman R, Schwartz R. Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data. Mol Biol Evol 2021; 38:2306-2318. [PMID: 33528497 PMCID: PMC8136493 DOI: 10.1093/molbev/msab026] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible wholegenome sequence data, these results:1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data.
Collapse
Affiliation(s)
- Robert Literman
- Department of Biological Sciences, University of Rhode Island, South Kingstown, RI, USA.,Center for Food Safety and Applied Nutrition, Office of Regulatory Science, U.S. Food and Drug Administration, College Park, MD, USA
| | - Rachel Schwartz
- Department of Biological Sciences, University of Rhode Island, South Kingstown, RI, USA
| |
Collapse
|
65
|
Ferrer Obiol J, James HF, Chesser RT, Bretagnolle V, González-Solís J, Rozas J, Riutort M, Welch AJ. Integrating Sequence Capture and Restriction Site-Associated DNA Sequencing to Resolve Recent Radiations of Pelagic Seabirds. Syst Biol 2021; 70:976-996. [PMID: 33512506 PMCID: PMC8357341 DOI: 10.1093/sysbio/syaa101] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Revised: 11/13/2020] [Accepted: 12/15/2020] [Indexed: 01/01/2023] Open
Abstract
The diversification of modern birds has been shaped by a number of radiations. Rapid diversification events make reconstructing the evolutionary relationships among taxa challenging due to the convoluted effects of incomplete lineage sorting (ILS) and introgression. Phylogenomic data sets have the potential to detect patterns of phylogenetic incongruence, and to address their causes. However, the footprints of ILS and introgression on sequence data can vary between different phylogenomic markers at different phylogenetic scales depending on factors such as their evolutionary rates or their selection pressures. We show that combining phylogenomic markers that evolve at different rates, such as paired-end double-digest restriction site-associated DNA (PE-ddRAD) and ultraconserved elements (UCEs), allows a comprehensive exploration of the causes of phylogenetic discordance associated with short internodes at different timescales. We used thousands of UCE and PE-ddRAD markers to produce the first well-resolved phylogeny of shearwaters, a group of medium-sized pelagic seabirds that are among the most phylogenetically controversial and endangered bird groups. We found that phylogenomic conflict was mainly derived from high levels of ILS due to rapid speciation events. We also documented a case of introgression, despite the high philopatry of shearwaters to their breeding sites, which typically limits gene flow. We integrated state-of-the-art concatenated and coalescent-based approaches to expand on previous comparisons of UCE and RAD-Seq data sets for phylogenetics, divergence time estimation, and inference of introgression, and we propose a strategy to optimize RAD-Seq data for phylogenetic analyses. Our results highlight the usefulness of combining phylogenomic markers evolving at different rates to understand the causes of phylogenetic discordance at different timescales. [Aves; incomplete lineage sorting; introgression; PE-ddRAD-Seq; phylogenomics; radiations; shearwaters; UCEs.].
Collapse
Affiliation(s)
- Joan Ferrer Obiol
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | - Helen F James
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - R Terry Chesser
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- U.S. Geological Survey, Patuxent Wildlife Research Center, Laurel, MD, USA
| | - Vincent Bretagnolle
- Centre d’Études Biologiques de Chizé, CNRS & La Rochelle Université, 79360, Villiers en Bois, France
| | - Jacob González-Solís
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Julio Rozas
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | - Marta Riutort
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | | |
Collapse
|
66
|
Adams RH, Castoe TA, DeGiorgio M. PhyloWGA: chromosome-aware phylogenetic interrogation of whole genome alignments. Bioinformatics 2021; 37:1923-1925. [PMID: 33051672 DOI: 10.1093/bioinformatics/btaa884] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2019] [Revised: 09/16/2020] [Accepted: 09/29/2020] [Indexed: 11/13/2022] Open
Abstract
SUMMARY Here, we present PhyloWGA, an open source R package for conducting phylogenetic analysis and investigation of whole genome data. AVAILABILITYAND IMPLEMENTATION Available at Github (https://github.com/radamsRHA/PhyloWGA). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Richard H Adams
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| | - Todd A Castoe
- Department of Biology, University of Texas at Arlington, Arlington, TX 76019, USA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| |
Collapse
|
67
|
Alda F, Ludt WB, Elías DJ, McMahan CD, Chakrabarty P. Comparing Ultraconserved Elements and Exons for Phylogenomic Analyses of Middle American Cichlids: When Data Agree to Disagree. Genome Biol Evol 2021; 13:evab161. [PMID: 34272856 PMCID: PMC8369075 DOI: 10.1093/gbe/evab161] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/05/2021] [Indexed: 12/20/2022] Open
Abstract
Choosing among types of genomic markers to be used in a phylogenomic study can have a major influence on the cost, design, and results of a study. Yet few attempts have been made to compare categories of next-generation sequence markers limiting our ability to compare the suitability of these different genomic fragment types. Here, we explore properties of different genomic markers to find if they vary in the accuracy of component phylogenetic trees and to clarify the causes of conflict obtained from different data sets or inference methods. As a test case, we explore the causes of discordance between phylogenetic hypotheses obtained using a novel data set of ultraconserved elements (UCEs) and a recently published exon data set of the cichlid tribe Heroini. Resolving relationships among heroine cichlids has historically been difficult, and the processes of colonization and diversification in Middle America and the Greater Antilles are not yet well understood. Despite differences in informativeness and levels of gene tree discordance between UCEs and exons, the resulting phylogenomic hypotheses generally agree on most relationships. The independent data sets disagreed in areas with low phylogenetic signal that were overwhelmed by incomplete lineage sorting and nonphylogenetic signals. For UCEs, high levels of incomplete lineage sorting were found to be the major cause of gene tree discordance, whereas, for exons, nonphylogenetic signal is most likely caused by a reduced number of highly informative loci. This paucity of informative loci in exons might be due to heterogeneous substitution rates that are problematic to model (i.e., computationally restrictive) resulting in systematic errors that UCEs (being less informative individually but more uniform) are less prone to. These results generally demonstrate the robustness of phylogenomic methods to accommodate genomic markers with different biological and phylogenetic properties. However, we identify common and unique pitfalls of different categories of genomic fragments when inferring enigmatic phylogenetic relationships.
Collapse
Affiliation(s)
- Fernando Alda
- Department of Biology, Geology and Environmental Science, University of Tennessee at Chattanooga, Tennessee, USA
| | - William B Ludt
- Department of Ichthyology, Natural History Museum of Los Angeles County, Los Angeles, California, USA
| | - Diego J Elías
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | | | - Prosanta Chakrabarty
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| |
Collapse
|
68
|
Guinand B, Oral M, Tougard C. Brown trout phylogenetics: A persistent mirage towards (too) many species. JOURNAL OF FISH BIOLOGY 2021; 99:298-307. [PMID: 33483952 DOI: 10.1111/jfb.14686] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 12/28/2020] [Accepted: 01/19/2021] [Indexed: 06/12/2023]
Affiliation(s)
- Bruno Guinand
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Münevver Oral
- Faculty of Fisheries and Aquatic Science, Recep Tayyip Erdogan University, Rize, Turkey
| | | |
Collapse
|
69
|
Goloboff PA, Catalano SA, Torres A. Parsimony analysis of phylogenomic datasets (II): evaluation of PAUP*, MEGA and MPBoot. Cladistics 2021; 38:126-146. [DOI: 10.1111/cla.12476] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/01/2021] [Indexed: 12/11/2022] Open
Affiliation(s)
- Pablo A. Goloboff
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 San Miguel de Tucumán Tucumán4000Argentina
- American Museum of Natural History 200 Central Park West New York NY10024USA
| | - Santiago A. Catalano
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 San Miguel de Tucumán Tucumán4000Argentina
- Facultad de Ciencias Naturales e Instituto Miguel Lillo Universidad Nacional de Tucumán Miguel Lillo 205 San Miguel de Tucumán Tucumán4000Argentina
| | - Ambrosio Torres
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 San Miguel de Tucumán Tucumán4000Argentina
| |
Collapse
|
70
|
Callender-Crowe LM, Sansom RS. Osteological characters of birds and reptiles are more congruent with molecular phylogenies than soft characters are. Zool J Linn Soc 2021. [DOI: 10.1093/zoolinnean/zlaa136] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Abstract
Despite increased use of genomic data in phylogenetics, morphological information remains vital for resolving evolutionary relationships, particularly for fossil taxa. The properties and models of evolution of molecular sequence data are well characterized and mature, relative to those of morphological data. Furthermore, heterogeneity, integration and relative homoplasy of empirical morphological data could prove problematic for phylogenetic reconstruction. Here we compare osteological and non-osteological characters of 28 morphological datasets of extant saurians in terms of their homoplasy relative to molecular trees. Analysis of individual avian datasets finds osteological characters to be significantly more consistent with molecular data than soft characters are. Significant differences between morphological partitions were also observed in the age at which characters resolved on molecular trees. Osteological character changes occur relatively earlier in deep branches, whilst soft-tissue character transitions are more recent in shallow branches. The combined results demonstrate differences in evolutionary dynamics between morphological partitions. This may reflect evolutionary constraints acting on osteological characters, compared with the relative lability of soft characters. Furthermore, it provides some support to phylogenetic interpretations of fossil data, including dinosaurs, which are predominately osteological. Recent advances in amphibian and mammal phylogenetics may make these patterns possible to test for all tetrapods.
Collapse
Affiliation(s)
- Leah M Callender-Crowe
- The University of Manchester, Department of Earth and Environmental Sciences, Manchester, UK
- School of Biological Sciences, University of Reading, Reading, UK
| | - Robert S Sansom
- The University of Manchester, Department of Earth and Environmental Sciences, Manchester, UK
| |
Collapse
|
71
|
Torres A, Goloboff PA, Catalano SA. Parsimony analysis of phylogenomic datasets (I): scripts and guidelines for using TNT (Tree Analysis using New Technology). Cladistics 2021; 38:103-125. [DOI: 10.1111/cla.12477] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/01/2021] [Indexed: 12/15/2022] Open
Affiliation(s)
- Ambrosio Torres
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 S. M. de Tucumán Tucumán 4000 Argentina
| | - Pablo A. Goloboff
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 S. M. de Tucumán Tucumán 4000 Argentina
- American Museum of Natural History 200 Central Park West New York NY 10024 USA
| | - Santiago A. Catalano
- Unidad Ejecutora Lillo Consejo Nacional de Investigaciones Científicas y Técnicas ‐ Fundación Miguel Lillo Miguel Lillo 251 S. M. de Tucumán Tucumán 4000 Argentina
- Facultad de Ciencias Naturales e Instituto Miguel Lillo Universidad Nacional de Tucumán Miguel Lillo 205 S. M. de Tucumán Tucumán 4000 Argentina
| |
Collapse
|
72
|
Kozak KM, Joron M, McMillan WO, Jiggins CD. Rampant Genome-Wide Admixture across the Heliconius Radiation. Genome Biol Evol 2021; 13:evab099. [PMID: 33944917 PMCID: PMC8283734 DOI: 10.1093/gbe/evab099] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/30/2021] [Indexed: 12/12/2022] Open
Abstract
How frequent is gene flow between species? The pattern of evolution is typically portrayed as a phylogenetic tree, yet gene flow between good species may be an important mechanism in diversification, spreading adaptive traits and leading to a complex pattern of phylogenetic incongruence. This process has thus far been studied mainly among a few closely related species, or in geographically restricted areas such as islands, but not on the scale of a continental radiation. Using a genomic representation of 40 out of 47 species in the genus, we demonstrate that admixture has played a role throughout the evolution of the charismatic Neotropical butterflies Heliconius. Modeling of phylogenetic networks based on the exome uncovers up to 13 instances of interspecific gene flow. Admixture is detected among the relatives of Heliconius erato, as well as between the ancient lineages leading to modern clades. Interspecific gene flow played a role throughout the evolution of the genus, although the process has been most frequent in the clade of Heliconius melpomene and relatives. We identify Heliconius hecalesia and relatives as putative hybrids, including new evidence for introgression at the loci controlling the mimetic wing patterns. Models accounting for interspecific gene flow yield a more complete picture of the radiation as a network, which will improve our ability to study trait evolution in a realistic comparative framework.
Collapse
Affiliation(s)
- Krzysztof M Kozak
- Smithsonian Tropical Research Institute, Panamá, Panamá
- Department of Zoology, University of Cambridge, United Kingdom
| | - Mathieu Joron
- Centre d’Ecologie Fonctionnelle et Evolutive (CEFE), CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, EPHE, IRD, France
| | | | - Chris D Jiggins
- Smithsonian Tropical Research Institute, Panamá, Panamá
- Department of Zoology, University of Cambridge, United Kingdom
| |
Collapse
|
73
|
Vázquez-Miranda H, Barker FK. Autosomal, sex-linked and mitochondrial loci resolve evolutionary relationships among wrens in the genus Campylorhynchus. Mol Phylogenet Evol 2021; 163:107242. [PMID: 34224849 DOI: 10.1016/j.ympev.2021.107242] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 06/14/2021] [Accepted: 06/29/2021] [Indexed: 01/18/2023]
Abstract
Although there is general consensus that sampling of multiple genetic loci is critical in accurate reconstruction of species trees, the exact numbers and the best types of molecular markers remain an open question. In particular, the phylogenetic utility of sex-linked loci is underexplored. Here, we sample all species and 70% of the named diversity of the New World wren genus Campylorhynchus using sequences from 23 loci, to evaluate the effects of linkage on efficiency in recovering a well-supported tree for the group. At a tree-wide level, we found that most loci supported fewer than half the possible clades and that sex-linked loci produced similar resolution to slower-coalescing autosomal markers, controlling for locus length. By contrast, we did find evidence that linkage affected the efficiency of recovery of individual relationships; as few as two sex-linked loci were necessary to resolve a selection of clades with long to medium subtending branches, whereas 4-6 autosomal loci were necessary to achieve comparable results. These results support an expanded role for sampling of the avian Z chromosome in phylogenetic studies, including target enrichment approaches. Our concatenated and species tree analyses represent significant improvements in our understanding of diversification in Campylorhynchus, and suggest a relatively complex scenario for its radiation across the Miocene/Pliocene boundary, with multiple invasions of South America.
Collapse
Affiliation(s)
- Hernán Vázquez-Miranda
- Departamento de Zoología, Instituto de Biología, Universidad Nacional Autónoma de México, Ciudad de México C.P. 04510, Mexico
| | - F Keith Barker
- Department of Ecology, Evolution and Behavior, Bell Museum of Natural History, University of Minnesota, 40 Gortner Laboratory, 1479 Gortner Avenue, Saint Paul, MN 55108, USA
| |
Collapse
|
74
|
Kuhl H, Frankl-Vilches C, Bakker A, Mayr G, Nikolaus G, Boerno ST, Klages S, Timmermann B, Gahr M. An Unbiased Molecular Approach Using 3'-UTRs Resolves the Avian Family-Level Tree of Life. Mol Biol Evol 2021; 38:108-127. [PMID: 32781465 PMCID: PMC7783168 DOI: 10.1093/molbev/msaa191] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Presumably, due to a rapid early diversification, major parts of the higher-level phylogeny of birds are still resolved controversially in different analyses or are considered unresolvable. To address this problem, we produced an avian tree of life, which includes molecular sequences of one or several species of ∼90% of the currently recognized family-level taxa (429 species, 379 genera) including all 106 family-level taxa of the nonpasserines and 115 of the passerines (Passeriformes). The unconstrained analyses of noncoding 3-prime untranslated region (3′-UTR) sequences and those of coding sequences yielded different trees. In contrast to the coding sequences, the 3′-UTR sequences resulted in a well-resolved and stable tree topology. The 3′-UTR contained, unexpectedly, transcription factor binding motifs that were specific for different higher-level taxa. In this tree, grebes and flamingos are the sister clade of all other Neoaves, which are subdivided into five major clades. All nonpasserine taxa were placed with robust statistical support including the long-time enigmatic hoatzin (Opisthocomiformes), which was found being the sister taxon of the Caprimulgiformes. The comparatively late radiation of family-level clades of the songbirds (oscine Passeriformes) contrasts with the attenuated diversification of nonpasseriform taxa since the early Miocene. This correlates with the evolution of vocal production learning, an important speciation factor, which is ancestral for songbirds and evolved convergent only in hummingbirds and parrots. As 3′-UTR-based phylotranscriptomics resolved the avian family-level tree of life, we suggest that this procedure will also resolve the all-species avian tree of life
Collapse
Affiliation(s)
- Heiner Kuhl
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, Seewiesen, Germany.,Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Berlin, Germany.,Department of Ecophysiology and Aquaculture, Leibniz-Institute of Freshwater Ecology and Inland Fisheries, Berlin, Germany
| | - Carolina Frankl-Vilches
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, Seewiesen, Germany
| | - Antje Bakker
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, Seewiesen, Germany
| | - Gerald Mayr
- Ornithological Section, Senckenberg Research Institute, Frankfurt am Main, Germany
| | - Gerhard Nikolaus
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, Seewiesen, Germany
| | - Stefan T Boerno
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Berlin, Germany
| | - Sven Klages
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Berlin, Germany
| | - Bernd Timmermann
- Max Planck Institute for Molecular Genetics, Sequencing Core Facility, Berlin, Germany
| | - Manfred Gahr
- Department of Behavioural Neurobiology, Max Planck Institute for Ornithology, Seewiesen, Germany
| |
Collapse
|
75
|
Adams RH, Blackmon H, DeGiorgio M. Of Traits and Trees: Probabilistic Distances under Continuous Trait Models for Dissecting the Interplay among Phylogeny, Model, and Data. Syst Biol 2021; 70:660-680. [PMID: 33587145 PMCID: PMC8208806 DOI: 10.1093/sysbio/syab009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Accepted: 02/01/2021] [Indexed: 12/03/2022] Open
Abstract
Stochastic models of character trait evolution have become a cornerstone of evolutionary biology in an array of contexts. While probabilistic models have been used extensively for statistical inference, they have largely been ignored for the purpose of measuring distances between phylogeny-aware models. Recent contributions to the problem of phylogenetic distance computation have highlighted the importance of explicitly considering evolutionary model parameters and their impacts on molecular sequence data when quantifying dissimilarity between trees. By comparing two phylogenies in terms of their induced probability distributions that are functions of many model parameters, these distances can be more informative than traditional approaches that rely strictly on differences in topology or branch lengths alone. Currently, however, these approaches are designed for comparing models of nucleotide substitution and gene tree distributions, and thus, are unable to address other classes of traits and associated models that may be of interest to evolutionary biologists. Here, we expand the principles of probabilistic phylogenetic distances to compute tree distances under models of continuous trait evolution along a phylogeny. By explicitly considering both the degree of relatedness among species and the evolutionary processes that collectively give rise to character traits, these distances provide a foundation for comparing models and their predictions, and for quantifying the impacts of assuming one phylogenetic background over another while studying the evolution of a particular trait. We demonstrate the properties of these approaches using theory, simulations, and several empirical data sets that highlight potential uses of probabilistic distances in many scenarios. We also introduce an open-source R package named PRDATR for easy application by the scientific community for computing phylogenetic distances under models of character trait evolution.[Brownian motion; comparative methods; phylogeny; quantitative traits.].
Collapse
Affiliation(s)
- Richard H Adams
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| | - Heath Blackmon
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL 33431, USA
| |
Collapse
|
76
|
Mahbub M, Wahab Z, Reaz R, Rahman MS, Bayzid MS. wQFM: Highly Accurate Genome-scale Species Tree Estimation from Weighted Quartets. Bioinformatics 2021; 37:3734-3743. [PMID: 34086858 DOI: 10.1093/bioinformatics/btab428] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/24/2021] [Accepted: 06/03/2021] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION Species tree estimation from genes sampled from throughout the whole genome is complicated due to the gene tree-species tree discordance. Incomplete lineage sorting (ILS) is one of the most frequent causes for this discordance, where alleles can coexist in populations for periods that may span several speciation events. Quartet-based summary methods for estimating species trees from a collection of gene trees are becoming popular due to their high accuracy and statistical guarantee under ILS. Generating quartets with appropriate weights, where weights correspond to the relative importance of quartets, and subsequently amalgamating the weighted quartets to infer a single coherent species tree can allow for a statistically consistent way of estimating species trees. However, handling weighted quartets is challenging. RESULTS We propose wQFM, a highly accurate method for species tree estimation from multi-locus data, by extending the quartet FM (QFM) algorithm to a weighted setting. wQFM was assessed on a collection of simulated and real biological datasets, including the avian phylogenomic dataset which is one of the largest phylogenomic datasets to date. We compared wQFM with wQMC, which is the best alternate method for weighted quartet amalgamation, and with ASTRAL, which is one of the most accurate and widely used coalescent-based species tree estimation methods. Our results suggest that wQFM matches or improves upon the accuracy of wQMC and ASTRAL. AVAILABILITY wQFM is available in open source form at https://github.com/Mahim1997/wQFM-2020. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mahim Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Zahin Wahab
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Rezwana Reaz
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - M Saifur Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| |
Collapse
|
77
|
Xu N, Ding J, Que Z, Xu W, Ye W, Liu H. The mitochondrial genome and phylogenetic characteristics of the Thick-billed Green-Pigeon, Treron curvirostra: the first sequence for the genus. Zookeys 2021; 1041:167-182. [PMID: 34149293 PMCID: PMC8190031 DOI: 10.3897/zookeys.1041.60150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 02/17/2021] [Indexed: 11/12/2022] Open
Abstract
Members of the genus Treron (Columbidae) are widely distributed in southern Asia and the Indo-Malayan Region but their relationships are poorly understood. Better knowledge of the systematic status of this genus may help studies of historical biogeography and taxonomy. The complete mitochondrial genome of T.curvirostra was characterized, a first for the genus. It is 17,414 base pairs in length, containing two rRNAs, 22 tRNAs, 13 protein coding genes (PCGs), and one D-loop with a primary structure that is similar to that found in most members of Columbidae. Most PCGs start with the common ATG codon but are terminated by different codons. The highest value of the Ka/Ks ratio within 13 PCGs was found in ATP8 with 0.1937, suggesting that PCGs of the mitochondrial genome tend to be conservative in Columbidae. Moreover, the phylogenetic relationships within Columbidae, which was based on sequences of 13 PCGs, showed that (T.curvirostra + Hemiphaganovaeseelandiae) were clustered in one clade, suggesting a potentially close relationship between Treron and Hemiphaga. However, the monophyly of the subfamilies of Columbidae recognized by the Interagency Taxonomic Information System could not be corroborated. Hence, the position of the genus Treron in the classification of Columbidae may have to be revised.
Collapse
Affiliation(s)
- Nan Xu
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| | - Jiayu Ding
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| | - Ziting Que
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| | - Wei Xu
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| | - Wentao Ye
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| | - Hongyi Liu
- College of Biology and the Environment, Nanjing Forestry University, Nanjing 210037, China Nanjing Forestry University Nanjing China
| |
Collapse
|
78
|
Hanson M, Hoffman EA, Norell MA, Bhullar BAS. The early origin of a birdlike inner ear and the evolution of dinosaurian movement and vocalization. Science 2021; 372:601-609. [PMID: 33958471 DOI: 10.1126/science.abb4305] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 03/09/2021] [Indexed: 12/27/2022]
Abstract
Reptiles, including birds, exhibit a range of behaviorally relevant adaptations that are reflected in changes to the structure of the inner ear. These adaptations include the capacity for flight and sensitivity to high-frequency sound. We used three-dimensional morphometric analyses of a large sample of extant and extinct reptiles to investigate inner ear correlates of locomotor ability and hearing acuity. Statistical analyses revealed three vestibular morphotypes, best explained by three locomotor categories-quadrupeds, bipeds and simple fliers (including bipedal nonavialan dinosaurs), and high-maneuverability fliers. Troodontids fall with Archaeopteryx among the extant low-maneuverability fliers. Analyses of cochlear shape revealed a single instance of elongation, on the stem of Archosauria. We suggest that this transformation coincided with the origin of both high-pitched juvenile location, alarm, and hatching-synchronization calls and adult responses to them.
Collapse
Affiliation(s)
- Michael Hanson
- Department of Earth and Planetary Sciences, Yale University, New Haven, CT, USA.,Peabody Museum of Natural History, Yale University, New Haven, CT, USA
| | - Eva A Hoffman
- Division of Paleontology, American Museum of Natural History, New York, NY, USA
| | - Mark A Norell
- Division of Paleontology, American Museum of Natural History, New York, NY, USA
| | - Bhart-Anjan S Bhullar
- Department of Earth and Planetary Sciences, Yale University, New Haven, CT, USA. .,Peabody Museum of Natural History, Yale University, New Haven, CT, USA
| |
Collapse
|
79
|
The Roles of Protein Structure, Taxon Sampling, and Model Complexity in Phylogenomics: A Case Study Focused on Early Animal Divergences. BIOPHYSICA 2021. [DOI: 10.3390/biophysica1020008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Despite the long history of using protein sequences to infer the tree of life, the potential for different parts of protein structures to retain historical signal remains unclear. We propose that it might be possible to improve analyses of phylogenomic datasets by incorporating information about protein structure. We test this idea using the position of the root of Metazoa (animals) as a model system. We examined the distribution of “strongly decisive” sites (alignment positions that support a specific tree topology) in a dataset comprising >1500 proteins and almost 100 taxa. The proportion of each class of strongly decisive sites in different structural environments was very sensitive to the model used to analyze the data when a limited number of taxa were used but they were stable when taxa were added. As long as enough taxa were analyzed, sites in all structural environments supported the same topology regardless of whether standard tree searches or decisive sites were used to select the optimal tree. However, the use of decisive sites revealed a difference between the support for minority topologies for sites in different structural environments: buried sites and sites in sheet and coil environments exhibited equal support for the minority topologies, whereas solvent-exposed and helix sites had unequal numbers of sites, supporting the minority topologies. This suggests that the relatively slowly evolving buried, sheet, and coil sites are giving an accurate picture of the true species tree and the amount of conflict among gene trees. Taken as a whole, this study indicates that phylogenetic analyses using sites in different structural environments can yield different topologies for the deepest branches in the animal tree of life and that analyzing larger numbers of taxa eliminates this conflict. More broadly, our results highlight the desirability of incorporating information about protein structure into phylogenomic analyses.
Collapse
|
80
|
Sangster G, Mayr G. Feraequornithes: a name for the clade formed by Procellariiformes, Sphenisciformes, Ciconiiformes, Suliformes and Pelecaniformes (Aves). VERTEBRATE ZOOLOGY 2021. [DOI: 10.3897/vz.71.e61728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Recent genomic data sets have resolved many aspects of higher-level phylogenetic relationships of birds. Eleven phylogenomic studies provide congruent support for a clade formed by Procellariiformes, Sphenisciformes, Ciconiiformes, Suliformes and Pelecaniformes. This clade is here named ‘Feraequornithes’ following the rules and requirements of the PhyloCode.
Collapse
|
81
|
Assessing topological congruence among concatenation-based phylogenomic approaches in empirical datasets. Mol Phylogenet Evol 2021; 161:107086. [PMID: 33609710 DOI: 10.1016/j.ympev.2021.107086] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Revised: 09/25/2020] [Accepted: 01/22/2021] [Indexed: 10/22/2022]
Abstract
Assessing the effect of methodological decisions on the resulting hypotheses is critical in phylogenetics. Recent studies have focused on evaluating how model selection, orthology definition and confounding factors affect phylogenomic results. Here, we compare the results of three concatenated phylogenetic methods (Maximum Likelihood, ML; Bayesian Inference, BI; Maximum Parsimony, MP) in 157 empirical phylogenomic datasets. The resulting trees were very similar, with 96.7% of all nodes shared between BI and ML (90.6% for ML-MP and 89.1% for BI-MP). Differing nodes were predominantly those of lower support. The main conclusions of most of the studies agreed for the three phylogenetic methods and the discordance involved nodes considered as recalcitrant problems in systematics. The differences between methods were proportionally larger in datasets that analyze the relationships at higher taxonomic levels (particularly phyla and kingdoms), and independent of the number of characters included in the datasets. Note: a spanish version of this article is available in the Supplementary material (Supplementary material online).
Collapse
|
82
|
Neumann JS, Desalle R, Narechania A, Schierwater B, Tessler M. Morphological Characters Can Strongly Influence Early Animal Relationships Inferred from Phylogenomic Data Sets. Syst Biol 2021; 70:360-375. [PMID: 32462193 PMCID: PMC7875439 DOI: 10.1093/sysbio/syaa038] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 01/27/2020] [Accepted: 01/29/2020] [Indexed: 12/19/2022] Open
Abstract
There are considerable phylogenetic incongruencies between morphological and phylogenomic data for the deep evolution of animals. This has contributed to a heated debate over the earliest-branching lineage of the animal kingdom: the sister to all other Metazoa (SOM). Here, we use published phylogenomic data sets ($\sim $45,000-400,000 characters in size with $\sim $15-100 taxa) that focus on early metazoan phylogeny to evaluate the impact of incorporating morphological data sets ($\sim $15-275 characters). We additionally use small exemplar data sets to quantify how increased taxon sampling can help stabilize phylogenetic inferences. We apply a plethora of common methods, that is, likelihood models and their "equivalent" under parsimony: character weighting schemes. Our results are at odds with the typical view of phylogenomics, that is, that genomic-scale data sets will swamp out inferences from morphological data. Instead, weighting morphological data 2-10$\times $ in both likelihood and parsimony can in some cases "flip" which phylum is inferred to be the SOM. This typically results in the molecular hypothesis of Ctenophora as the SOM flipping to Porifera (or occasionally Placozoa). However, greater taxon sampling improves phylogenetic stability, with some of the larger molecular data sets ($>$200,000 characters and up to $\sim $100 taxa) showing node stability even with $\geqq100\times $ upweighting of morphological data. Accordingly, our analyses have three strong messages. 1) The assumption that genomic data will automatically "swamp out" morphological data is not always true for the SOM question. Morphological data have a strong influence in our analyses of combined data sets, even when outnumbered thousands of times by molecular data. Morphology therefore should not be counted out a priori. 2) We here quantify for the first time how the stability of the SOM node improves for several genomic data sets when the taxon sampling is increased. 3) The patterns of "flipping points" (i.e., the weighting of morphological data it takes to change the inferred SOM) carry information about the phylogenetic stability of matrices. The weighting space is an innovative way to assess comparability of data sets that could be developed into a new sensitivity analysis tool. [Metazoa; Morphology; Phylogenomics; Weighting.].
Collapse
Affiliation(s)
- Johannes S Neumann
- Richard Gilder Graduate School, American Museum of Natural History, New York, NY 10024, USA
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Rob Desalle
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Apurva Narechania
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Bernd Schierwater
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
- ITZ, Division of Ecology and Evolution, Tierärztliche Hochschule Hannover, Bünteweg 9, 30559 Hannover, Germany
| | - Michael Tessler
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| |
Collapse
|
83
|
Dibaeinia P, Tabe-Bordbar S, Warnow T. FASTRAL: Improving scalability of phylogenomic analysis. Bioinformatics 2021; 37:2317-2324. [PMID: 33576396 PMCID: PMC8388037 DOI: 10.1093/bioinformatics/btab093] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 02/02/2021] [Accepted: 02/04/2021] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION ASTRAL is the current leading method for species tree estimation from phylogenomic datasets (i.e., hundreds to thousands of genes) that addresses gene tree discord resulting from incomplete lineage sorting (ILS). ASTRAL is statistically consistent under the multi-locus coalescent model (MSC), runs in polynomial time, and is able to run on large datasets. Key to ASTRAL's algorithm is the use of dynamic programming to find an optimal solution to the MQSST (maximum quartet support supertree) within a constraint space that it computes from the input. Yet, ASTRAL can fail to complete within reasonable timeframes on large datasets with many genes and species, because in these cases the constraint space it computes is too large. RESULTS Here we introduce FASTRAL, a phylogenomic estimation method. FASTRAL is based on ASTRAL, but uses a different technique for constructing the constraint space. The technique we use to define the constraint space maintains statistical consistency and is polynomial time; thus we prove that FASTRAL is a polynomial time algorithm that is statistically consistent under the MSC. Our performance study on both biological and simulated data sets demonstrates that FASTRAL matches or improves on ASTRAL with respect to species tree topology accuracy (and under high ILS conditions it is statistically significantly more accurate), while being dramatically faster-especially on datasets with large numbers of genes and high ILS-due to using a significantly smaller constraint space. AVAILABILITY FASTRAL is available in open-source form at https://github.com/PayamDiba/FASTRAL. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Payam Dibaeinia
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA
| | - Shayan Tabe-Bordbar
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA,To whom correspondence should be addressed.
| |
Collapse
|
84
|
Jaiswal SK, Gupta A, Shafer ABA, P. K. VP, Vijay N, Sharma VK. Genomic Insights Into the Molecular Basis of Sexual Selection in Birds. Front Ecol Evol 2021. [DOI: 10.3389/fevo.2021.538498] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Sexual selection is a well-known biological process, yet the genomic basis and patterns of sexual selection are not fully understood. The extravagant ornamental plumage of peacock (Pavo cristatus) was instrumental in shaping Charles Darwin's theory of sexual selection and is considered to be an honest signal of its immunocompetence. Here, we used the recently generated draft genome sequence of peafowl (Pavo cristatus) and carried out a comparative analysis across 11 bird genomes that encompass a range of sexual selection and also had high-quality genomic and phenotypic data publically available to study the genomic basis of sexual selection. We found that varying degree of purifying selection was the predominant mechanism of action for sexual selection at the genome-wide scale and observed that sexual selection mostly influences genes regulating gene expression and protein processing. Specifically, the genome-wide phylogenetically corrected regression analysis supported the continuous or ongoing model of sexual selection. Genes involved in nucleic acid binding and gene expression regulation, including a specific regulator of sex-determination known as TRA2A to be under positive selection in the species with high post-copulatory sexual selection manifested as high sperm competition. We also detected specific feather-related and immune-related gene-pairs evolving under similar selection pressures across the 11 species, including peacock (Pavo cristatus), which is consistent with the Hamilton-Zuk hypothesis. The comparative genomics analysis of 11 avian taxa has provided new insights on the molecular underpinnings of sexual selection and identifies specific genomic regions for future in-depth analysis.
Collapse
|
85
|
Abstract
The phylogeny of Neoaves, the largest clade of extant birds, has remained unclear despite intense study. The difficulty associated with resolving the early branches in Neoaves is likely driven by the rapid radiation of this group. However, conflicts among studies may be exacerbated by the data type analyzed. For example, analyses of coding exons typically yield trees that place Strisores (nightjars and allies) sister to the remaining Neoaves, while analyses of non-coding data typically yield trees where Mirandornites (flamingos and grebes) is the sister of the remaining Neoaves. Our understanding of data type effects is hampered by the fact that previous analyses have used different taxa, loci, and types of non-coding data. Herein, we provide strong corroboration of the data type effects hypothesis for Neoaves by comparing trees based on coding and non-coding data derived from the same taxa and gene regions. A simple analytical method known to minimize biases due to base composition (coding nucleotides as purines and pyrimidines) resulted in coding exon data with increased congruence to the non-coding topology using concatenated analyses. These results improve our understanding of the resolution of neoavian phylogeny and point to a challenge—data type effects—that is likely to be an important factor in phylogenetic analyses of birds (and many other taxonomic groups). Using our results, we provide a summary phylogeny that identifies well-corroborated relationships and highlights specific nodes where future efforts should focus.
Collapse
|
86
|
Kandziora M, Sklenář P, Kolář F, Schmickl R. How to Tackle Phylogenetic Discordance in Recent and Rapidly Radiating Groups? Developing a Workflow Using Loricaria (Asteraceae) as an Example. FRONTIERS IN PLANT SCIENCE 2021; 12:765719. [PMID: 35069621 PMCID: PMC8777076 DOI: 10.3389/fpls.2021.765719] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 11/22/2021] [Indexed: 05/17/2023]
Abstract
A major challenge in phylogenetics and -genomics is to resolve young rapidly radiating groups. The fast succession of species increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistics to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted creating opportunities for secondary contact and hybridization.
Collapse
Affiliation(s)
- Martha Kandziora
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- *Correspondence: Martha Kandziora,
| | - Petr Sklenář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Filip Kolář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| | - Roswitha Schmickl
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| |
Collapse
|
87
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Collapse
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
| |
Collapse
|
88
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Brown RM. Target-capture phylogenomics provide insights on gene and species tree discordances in Old World treefrogs (Anura: Rhacophoridae). Proc Biol Sci 2020; 287:20202102. [PMID: 33290680 PMCID: PMC7739936 DOI: 10.1098/rspb.2020.2102] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Accepted: 11/13/2020] [Indexed: 11/12/2022] Open
Abstract
Genome-scale data have greatly facilitated the resolution of recalcitrant nodes that Sanger-based datasets have been unable to resolve. However, phylogenomic studies continue to use traditional methods such as bootstrapping to estimate branch support; and high bootstrap values are still interpreted as providing strong support for the correct topology. Furthermore, relatively little attention has been given to assessing discordances between gene and species trees, and the underlying processes that produce phylogenetic conflict. We generated novel genomic datasets to characterize and determine the causes of discordance in Old World treefrogs (Family: Rhacophoridae)-a group that is fraught with conflicting and poorly supported topologies among major clades. Additionally, a suite of data filtering strategies and analytical methods were applied to assess their impact on phylogenetic inference. We showed that incomplete lineage sorting was detected at all nodes that exhibited high levels of discordance. Those nodes were also associated with extremely short internal branches. We also clearly demonstrate that bootstrap values do not reflect uncertainty or confidence for the correct topology and, hence, should not be used as a measure of branch support in phylogenomic datasets. Overall, we showed that phylogenetic discordances in Old World treefrogs resulted from incomplete lineage sorting and that species tree inference can be improved using a multi-faceted, total-evidence approach, which uses the most amount of data and considers results from different analytical methods and datasets.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian Natural History Museum, National University of Singapore, 2 Conservatory Drive, Singapore 117377, Republic of Singapore
| | - Carl R. Hutter
- Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Perry L. Wood
- Department of Biological Sciences and Museum of Natural History, Auburn University, Auburn, AL 36849, USA
| | - L. Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, Riverside, CA 92505, USA
| | - Rafe M. Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
89
|
Phylogenomics of manakins (Aves: Pipridae) using alternative locus filtering strategies based on informativeness. Mol Phylogenet Evol 2020; 155:107013. [PMID: 33217578 DOI: 10.1016/j.ympev.2020.107013] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 11/07/2020] [Accepted: 11/11/2020] [Indexed: 01/11/2023]
Abstract
Target capture sequencing effectively generates molecular marker arrays useful for molecular systematics. These extensive data sets are advantageous where previous studies using a few loci have failed to resolve relationships confidently. Moreover, target capture is well-suited to fragmented source DNA, allowing data collection from species that lack fresh tissues. Herein we use target capture to generate data for a phylogeny of the avian family Pipridae (manakins), a group that has been the subject of many behavioral and ecological studies. Most manakin species feature lek mating systems, where males exhibit complex behavioral displays including mechanical and vocal sounds, coordinated movements of multiple males, and high speed movements. We analyzed thousands of ultraconserved element (UCE) loci along with a smaller number of coding exons and their flanking regions from all but one species of Pipridae. We examined three different methods of phylogenetic estimation (concatenation and two multispecies coalescent methods). Phylogenetic inferences using UCE data yielded strongly supported estimates of phylogeny regardless of analytical method. Exon probes had limited capability to capture sequence data and resulted in phylogeny estimates with reduced support and modest topological differences relative to the UCE trees, although these conflicts had limited support. Two genera were paraphyletic among all analyses and data sets, with Antilophia nested within Chiroxiphia and Tyranneutes nested within Neopelma. The Chiroxiphia-Antilophia clade was an exception to the generally high support we observed; the topology of this clade differed among analyses, even those based on UCE data. To further explore relationships within this group, we employed two filtering strategies to remove low-information loci. Those analyses resulted in distinct topologies, suggesting that the relationships we identified within Chiroxiphia-Antilophia should be interpreted with caution. Despite the existence of a few continuing uncertainties, our analyses resulted in a robust phylogenetic hypothesis of the family Pipridae that provides a comparative framework for future ecomorphological and behavioral studies.
Collapse
|
90
|
Cai L, Xi Z, Lemmon EM, Lemmon AR, Mast A, Buddenhagen CE, Liu L, Davis CC. The Perfect Storm: Gene Tree Estimation Error, Incomplete Lineage Sorting, and Ancient Gene Flow Explain the Most Recalcitrant Ancient Angiosperm Clade, Malpighiales. Syst Biol 2020; 70:491-507. [PMID: 33169797 DOI: 10.1093/sysbio/syaa083] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 10/20/2020] [Accepted: 10/28/2020] [Indexed: 12/20/2022] Open
Abstract
The genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order includes 9 of the top 10 most unstable nodes in angiosperms, which have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 10.0$\%$, 34.8$\%$, and 21.4$\%$ of the variation, respectively. Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution. [Coalescent; concatenation; flanking region; hybrid enrichment, introgression; phylogenomics; rapid radiation, triplet frequency.].
Collapse
Affiliation(s)
- Liming Cai
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Zhenxiang Xi
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Emily Moriarty Lemmon
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306, USA
| | - Austin Mast
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
| | - Christopher E Buddenhagen
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
- AgResearch, 10 Bisley Road, Hamilton 3214, New Zealand
| | - Liang Liu
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
| |
Collapse
|
91
|
Musser G, Clarke JA. An Exceptionally Preserved Specimen From the Green River Formation Elucidates Complex Phenotypic Evolution in Gruiformes and Charadriiformes. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.559929] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The stem lineage relationships and early phenotypic evolution of Charadriiformes (shorebirds) and Gruiformes (rails, cranes, and allies) remain unresolved. It is still debated whether these clades are sister-taxa. New phylogenetic analyses incorporating Paleogene fossils have the potential to reveal the evolutionary connections of these two speciose and evolutionarily critical neoavian subclades. Although Gruiformes have a rich Paleogene fossil record, most of these fossils have not been robustly placed. The Paleogene fossil record of Charadriiformes is scarce and largely consists of fragmentary single elements. Only one proposed Eocene charadriiform-like taxon,Scandiavis mikkelseniof Denmark, is represented by a partial skeleton. Here, we describe a new species from the early Eocene Green River Formation of North America comprising a partial skeleton and feather remains. Because the skeleton lacks the pectoral girdle and forelimbs as inS. mikkelseni, only features of the skull, axial skeleton, and hind limb are available to resolve the phylogenetic placement of this taxon. These anatomical subregions initially showed features seen in Charadriiformes and Gruiformes. To assess placement of this taxon, we use a matrix consisting of 693 morphological characters and 60 taxa, includingS. mikkelseniand the oldest known charadriiform taxa represented by single elements. These more fragmentary records comprise two distal humeri from the earliest Eocene Naranbulag Formation of Mongolia and the early Eocene Nanjemoy Formation of Virginia. Our phylogenetic analyses recover the new taxon andS. mikkelsenialternatively as a charadriiform or as a stem-gruiform; placement is contingent upon enforced relationships for major neoavian subclades recovered by recent molecular-based phylogenies. Specifically, when constraint trees based on results that do not recover Charadriiformes and Gruiformes as sister-taxa are used, the new taxon andS. mikkelseniare recovered within stem Gruiformes. Both Paleogene fossil humeri are consistently recovered within crown Charadriiformes. If placement of these humeri or the new taxon as charadriiforms are correct, this may indicate that recent divergence time analyses have underestimated the crown age of another major crown avian subclade; however, more complete sampling of these taxa is necessary, especially of more complete specimens with pectoral elements.
Collapse
|
92
|
Jing M, Yang H, Li K, Huang L. Characterization of three new mitochondrial genomes of Coraciiformes (Megaceryle lugubris, Alcedo atthis, Halcyon smyrnensis) and insights into their phylogenetics. Genet Mol Biol 2020; 43:e20190392. [PMID: 33026411 PMCID: PMC7539371 DOI: 10.1590/1678-4685-gmb-2019-0392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 07/13/2020] [Indexed: 12/04/2022] Open
Abstract
Coraciiformes contains more than 200 species with great differences on external
morphology and life-style. The evolutionary relationships within Coraciiformes
and the phylogenetic placement of Coraciiformes in Aves are still questioned.
Mitochondrial genome (mitogenome) sequences are popular markers in molecular
phylogenetic studies of birds. This study presented the genome characteristics
of three new mitogenomes in Coraciiformes and explored the phylogenetic
relationships among Coraciiformes and other five related orders with mitogenome
data of 30 species. The sizes of three mitogenomes were 17,383 bp
(Alcedo atthis), 17,892 bp (Halcyon
smyrnensis) and 17,223 bp (Megaceryle lugubris).
Each mitogenome contained one control region and 37 genes that were common in
vertebrate mitogenomes. The organization of three mitogenomes was identical to
the putative ancestral gene order in Aves. Among 13 available Coraciiform
mitogenomes, 12 protein coding genes showed indications of negative selection,
while the MT-ND6 presented sign of positive selection or relaxed purifying
selection. The phylogenetic results supported that Upupidae and Bucerotidae
should be separated from Coraciiformes, and that Coraciiformes is more closely
related to Piciformes than to Strigiformes, Trogoniformes and Cuculiformes. Our
study provide valuable data for further phylogenetic investigation of
Coraciiformes.
Collapse
Affiliation(s)
- Meidong Jing
- Nantong University, School of Life Sciences, Nantong, Jiangsu, P. R. China
| | - Huanhuan Yang
- Ludong University, School of Life Sciences, Yantai, Shandong, P. R. China
| | - Kai Li
- Nantong Xingdong International Airport, Nantong, Jiangsu, P. R. China
| | - Ling Huang
- Nantong University, School of Life Sciences, Nantong, Jiangsu, P. R. China
| |
Collapse
|
93
|
Vasilikopoulos A, Gustafson GT, Balke M, Niehuis O, Beutel RG, Misof B. Resolving the phylogenetic position of Hygrobiidae (Coleoptera: Adephaga) requires objective statistical tests and exhaustive phylogenetic methodology: a response to Cai et al. (2020). Mol Phylogenet Evol 2020; 162:106923. [PMID: 32771549 DOI: 10.1016/j.ympev.2020.106923] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 08/03/2020] [Indexed: 12/20/2022]
Affiliation(s)
- Alexandros Vasilikopoulos
- Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53121 Bonn, Germany.
| | - Grey T Gustafson
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, 66045 KS, USA
| | - Michael Balke
- Department of Entomology, SNSB-Bavarian State Collections of Zoology, 81247 Munich, Germany
| | - Oliver Niehuis
- Department of Evolutionary Biology and Ecology, Institute of Biology I (Zoology), Albert Ludwig University of Freiburg, 79104 Freiburg, Germany
| | - Rolf G Beutel
- Institut für Zoologie und Evolutionsforschung, Friedrich-Schiller-Universität Jena, 07743 Jena, Germany
| | - Bernhard Misof
- Zoological Research Museum Alexander Koenig, 53121 Bonn, Germany
| |
Collapse
|
94
|
Abstract
A groundbreaking study of brain evolution across birds and dinosaurs reveals potential drivers of increased brain size including biogeography and ecology. The most dramatic change occurred in the Neoaves after the Cretaceous-Paleogene extinction rather than earlier in bird evolution.
Collapse
Affiliation(s)
- Peter J Makovicky
- Department of Earth and Environmental Sciences, University of Minnesota, Twin Cities, 116 Church St SE, 150 John T. Tate Hall, Minneapolis, MN 55455, USA.
| | - Sushma Reddy
- Bell Museum of Natural History and Department of Fisheries, Wildlife, Conservation Biology, University of Minnesota, Twin Cities, 35A Skok Hall, 2003 Upper Buford Circle, Saint Paul, MN 55108, USA.
| |
Collapse
|
95
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Brown RM. Larger, unfiltered datasets are more effective at resolving phylogenetic conflict: Introns, exons, and UCEs resolve ambiguities in Golden-backed frogs (Anura: Ranidae; genus Hylarana). Mol Phylogenet Evol 2020; 151:106899. [PMID: 32590046 DOI: 10.1016/j.ympev.2020.106899] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 05/18/2020] [Accepted: 06/17/2020] [Indexed: 01/01/2023]
Abstract
Using FrogCap, a recently-developed sequence-capture protocol, we obtained >12,000 highly informative exons, introns, and ultraconserved elements (UCEs), which we used to illustrate variation in evolutionary histories of these classes of markers, and to resolve long-standing systematic problems in Southeast Asian Golden-backed frogs of the genus-complex Hylarana. We also performed a comprehensive suite of analyses to assess the relative performance of different genetic markers, data filtering strategies, tree inference methods, and different measures of branch support. To reduce gene tree estimation error, we filtered the data using different thresholds of taxon completeness (missing data) and parsimony informative sites (PIS). We then estimated species trees using concatenated datasets and Maximum Likelihood (IQ-TREE) in addition to summary (ASTRAL-III), distance-based (ASTRID), and site-based (SVDQuartets) multispecies coalescent methods. Topological congruence and branch support were examined using traditional bootstrap, local posterior probabilities, gene concordance factors, quartet frequencies, and quartet scores. Our results did not yield a single concordant topology. Instead, introns, exons, and UCEs clearly possessed different phylogenetic signals, resulting in conflicting, yet strongly-supported phylogenetic estimates. However, a combined analysis comprising the most informative introns, exons, and UCEs converged on a similar topology across all analyses, with the exception of SVDQuartets. Bootstrap values were consistently high despite high levels of incongruence and high proportions of gene trees supporting conflicting topologies. Although low bootstrap values did indicate low heuristic support, high bootstrap support did not necessarily reflect congruence or support for the correct topology. This study reiterates findings of some previous studies, which demonstrated that traditional bootstrap values can produce positively misleading measures of support in large phylogenomic datasets. We also showed a remarkably strong positive relationship between branch length and topological congruence across all datasets, implying that very short internodes remain a challenge to resolve, even with orders of magnitude more data than ever before. Overall, our results demonstrate that more data from unfiltered or combined datasets produced superior results. Although data filtering reduced gene tree incongruence, decreased amounts of data also biased phylogenetic estimation. A point of diminishing returns was evident, at which higher congruence (from more stringent filtering) at the expense of amount of data led to topological error as assessed by comparison to more complete datasets across different genomic markers. Additionally, we showed that applying a parameter-rich model to a partitioned analysis of concatenated data produces better results compared to unpartitioned, or even partitioned analysis using model selection. Despite some lingering uncertainties, a combined analysis of our genomic data and sequences supplemented from GenBank (on the basis of a few gene regions) revealed highly supported novel systematic arrangements. Based on these new findings, we transfer Amnirana nicobariensis into the genus Indosylvirana; and I. milleti and Hylarana celebensis to the genus Papurana. We also provisionally place H. attigua in the genus Papurana pending verification from positively identified (voucher substantiated) samples.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian National History Museum, Faculty of Science, National University of Singapore, 2 Conservatory Drive, 117377, Singapore.
| | - Carl R Hutter
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA; Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Perry L Wood
- Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; Department of Biological Sciences & Museum of Natural History, Auburn University, Auburn, AL 36849, USA
| | - L Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, 4500 Riverwalk Parkway, Riverside, CA 92505, USA
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
96
|
Vasilikopoulos A, Misof B, Meusemann K, Lieberz D, Flouri T, Beutel RG, Niehuis O, Wappler T, Rust J, Peters RS, Donath A, Podsiadlowski L, Mayer C, Bartel D, Böhm A, Liu S, Kapli P, Greve C, Jepson JE, Liu X, Zhou X, Aspöck H, Aspöck U. An integrative phylogenomic approach to elucidate the evolutionary history and divergence times of Neuropterida (Insecta: Holometabola). BMC Evol Biol 2020; 20:64. [PMID: 32493355 PMCID: PMC7268685 DOI: 10.1186/s12862-020-01631-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 05/19/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The latest advancements in DNA sequencing technologies have facilitated the resolution of the phylogeny of insects, yet parts of the tree of Holometabola remain unresolved. The phylogeny of Neuropterida has been extensively studied, but no strong consensus exists concerning the phylogenetic relationships within the order Neuroptera. Here, we assembled a novel transcriptomic dataset to address previously unresolved issues in the phylogeny of Neuropterida and to infer divergence times within the group. We tested the robustness of our phylogenetic estimates by comparing summary coalescent and concatenation-based phylogenetic approaches and by employing different quartet-based measures of phylogenomic incongruence, combined with data permutations. RESULTS Our results suggest that the order Raphidioptera is sister to Neuroptera + Megaloptera. Coniopterygidae is inferred as sister to all remaining neuropteran families suggesting that larval cryptonephry could be a ground plan feature of Neuroptera. A clade that includes Nevrorthidae, Osmylidae, and Sisyridae (i.e. Osmyloidea) is inferred as sister to all other Neuroptera except Coniopterygidae, and Dilaridae is placed as sister to all remaining neuropteran families. Ithonidae is inferred as the sister group of monophyletic Myrmeleontiformia. The phylogenetic affinities of Chrysopidae and Hemerobiidae were dependent on the data type analyzed, and quartet-based analyses showed only weak support for the placement of Hemerobiidae as sister to Ithonidae + Myrmeleontiformia. Our molecular dating analyses suggest that most families of Neuropterida started to diversify in the Jurassic and our ancestral character state reconstructions suggest a primarily terrestrial environment of the larvae of Neuropterida and Neuroptera. CONCLUSION Our extensive phylogenomic analyses consolidate several key aspects in the backbone phylogeny of Neuropterida, such as the basal placement of Coniopterygidae within Neuroptera and the monophyly of Osmyloidea. Furthermore, they provide new insights into the timing of diversification of Neuropterida. Despite the vast amount of analyzed molecular data, we found that certain nodes in the tree of Neuroptera are not robustly resolved. Therefore, we emphasize the importance of integrating the results of morphological analyses with those of sequence-based phylogenomics. We also suggest that comparative analyses of genomic meta-characters should be incorporated into future phylogenomic studies of Neuropterida.
Collapse
Affiliation(s)
- Alexandros Vasilikopoulos
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany.
| | - Bernhard Misof
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany.
| | - Karen Meusemann
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
- Department of Evolutionary Biology and Ecology, Institute of Biology I (Zoology), Albert-Ludwigs-Universität Freiburg, 79104, Freiburg, Germany
- Australian National Insect Collection, National Research Collections Australia, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Canberra, ACT 2601, Australia
| | - Doria Lieberz
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
| | - Tomáš Flouri
- Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT, UK
| | - Rolf G Beutel
- Institut für Zoologie und Evolutionsforschung, Friedrich-Schiller-Universität Jena, 07743, Jena, Germany
| | - Oliver Niehuis
- Department of Evolutionary Biology and Ecology, Institute of Biology I (Zoology), Albert-Ludwigs-Universität Freiburg, 79104, Freiburg, Germany
| | - Torsten Wappler
- Natural History Department, Hessisches Landesmuseum Darmstadt, 64283, Darmstadt, Germany
| | - Jes Rust
- Steinmann-Institut für Geologie, Mineralogie und Paläontologie, Rheinische Friedrich-Wilhelms-Universität Bonn, 53115, Bonn, Germany
| | - Ralph S Peters
- Centre for Taxonomy and Evolutionary Research, Arthropoda Department, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
| | - Alexander Donath
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
| | - Lars Podsiadlowski
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
| | - Christoph Mayer
- Centre for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113, Bonn, Germany
| | - Daniela Bartel
- Department of Evolutionary Biology, University of Vienna, 1090, Vienna, Austria
| | - Alexander Böhm
- Department of Evolutionary Biology, University of Vienna, 1090, Vienna, Austria
| | - Shanlin Liu
- Department of Entomology, China Agricultural University, 100193, Beijing, People's Republic of China
| | - Paschalia Kapli
- Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT, UK
| | - Carola Greve
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), 60325, Frankfurt, Germany
| | - James E Jepson
- School of Biological, Earth and Environmental Sciences, University College Cork, Distillery Fields, North Mall, T23 N73K, Cork, Ireland
| | - Xingyue Liu
- Department of Entomology, China Agricultural University, 100193, Beijing, People's Republic of China
| | - Xin Zhou
- Department of Entomology, China Agricultural University, 100193, Beijing, People's Republic of China
| | - Horst Aspöck
- Institute of Specific Prophylaxis and Tropical Medicine, Medical Parasitology, Medical University of Vienna (MUW), 1090, Vienna, Austria
| | - Ulrike Aspöck
- Department of Evolutionary Biology, University of Vienna, 1090, Vienna, Austria
- Zoological Department II, Natural History Museum of Vienna, 1010, Vienna, Austria
| |
Collapse
|
97
|
Biologia Futura: rapid diversification and behavioural adaptation of birds in response to Oligocene-Miocene climatic conditions. Biol Futur 2020; 71:109-121. [PMID: 34554530 DOI: 10.1007/s42977-020-00013-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Accepted: 05/28/2020] [Indexed: 12/14/2022]
Abstract
Our knowledge about the origin of landbirds (Telluraves) is increasing rapidly but new questions are arising because of the contradictory findings from previous studies. All of the major lineages in the highly diverse clade of Neoaves have a Gondwanan origin, although studies often disagree about the origin of different sub-lineages. Nevertheless, understanding the biogeographical histories of these groups (e.g. Accipitriformes, Passeriformes) is important when studying the evolution of variation in life history and behavioural traits. Therefore, we would like to find answers to questions such as which biogeographic changes affected the radiation of birds? When did the most influential climatic events affect the diversification of birds? What behavioural adaptations occurred in response to those large-scale changes? The major orogenetic events in Asia and South America formed specific corridors that enabled the radiation of birds. The climatic changes and habitat differentiation they caused during the Oligocene-Miocene era made the divergence of birds possible through their adaptation to newly available niches. Consequently, variation in life history and behavioural traits emerged as adaptive outcomes of changes in foraging, nestling and migratory behaviours.
Collapse
|
98
|
Larson DA, Walker JF, Vargas OM, Smith SA. A consensus phylogenomic approach highlights paleopolyploid and rapid radiation in the history of Ericales. AMERICAN JOURNAL OF BOTANY 2020; 107:773-789. [PMID: 32350864 DOI: 10.1002/ajb2.1469] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Accepted: 02/12/2020] [Indexed: 05/27/2023]
Abstract
PREMISE Large genomic data sets offer the promise of resolving historically recalcitrant species relationships. However, different methodologies can yield conflicting results, especially when clades have experienced ancient, rapid diversification. Here, we analyzed the ancient radiation of Ericales and explored sources of uncertainty related to species tree inference, conflicting gene tree signal, and the inferred placement of gene and genome duplications. METHODS We used a hierarchical clustering approach, with tree-based homology and orthology detection, to generate six filtered phylogenomic matrices consisting of data from 97 transcriptomes and genomes. Support for species relationships was inferred from multiple lines of evidence including shared gene duplications, gene tree conflict, gene-wise edge-based analyses, concatenation, and coalescent-based methods, and is summarized in a consensus framework. RESULTS Our consensus approach supported a topology largely concordant with previous studies, but suggests that the data are not capable of resolving several ancient relationships because of lack of informative characters, sensitivity to methodology, and extensive gene tree conflict correlated with paleopolyploidy. We found evidence of a whole-genome duplication before the radiation of all or most ericalean families, and demonstrate that tree topology and heterogeneous evolutionary rates affect the inferred placement of genome duplications. CONCLUSIONS We provide several hypotheses regarding the history of Ericales, and confidently resolve most nodes, but demonstrate that a series of ancient divergences are unresolvable with these data. Whether paleopolyploidy is a major source of the observed phylogenetic conflict warrants further investigation.
Collapse
Affiliation(s)
- Drew A Larson
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Joseph F Walker
- Sainsbury Laboratory (SLCU), University of Cambridge, Cambridge, CB2 1LR, UK
| | - Oscar M Vargas
- Department of Ecology & Evolutionary Biology, University of California, Santa Cruz, CA, 95060, USA
| | - Stephen A Smith
- Department of Ecology & Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109, USA
| |
Collapse
|
99
|
Swanson MT, Oliveros CH, Esselstyn JA. A phylogenomic rodent tree reveals the repeated evolution of masseter architectures. Proc Biol Sci 2020; 286:20190672. [PMID: 31064307 DOI: 10.1098/rspb.2019.0672] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Understanding the number of times a trait has evolved is a necessary foundation for comprehending its potential relationships with selective regimes, developmental constraints and evolutionary diversification. Rodents make up over 40% of extant mammalian species, and their ecological and evolutionary success has been partially attributed to the increase in biting efficiency that resulted from a forward shift of one or two portions of the masseter muscle from the zygomatic arch onto the rostrum. This forward shift has occurred in three discrete ways, but the number of times it has occurred has never been explicitly quantified. We estimated an ultrametric phylogeny, the first to include all rodent families, using thousands of ultraconserved elements. We examined support for evolutionary relationships among the five rodent suborders and then incorporated relevant fossils, fitted models of character evolution, and used stochastic character mapping to determine that a portion of the masseter muscle has moved forward onto the rostrum at least seven times (with one reversal) during the approximately 70 Myr history of rodents. Combined, the repeated evolution of this key innovation, its increasing prevalence through time, and the species diversity of clades with this character underscores the adaptive value of improved biting efficiency and the relative ease with which some advantageous traits arise.
Collapse
Affiliation(s)
- Mark T Swanson
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University , Baton Rouge, LA , USA
| | - Carl H Oliveros
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University , Baton Rouge, LA , USA
| | - Jacob A Esselstyn
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University , Baton Rouge, LA , USA
| |
Collapse
|
100
|
Deep-Time Demographic Inference Suggests Ecological Release as Driver of Neoavian Adaptive Radiation. DIVERSITY-BASEL 2020. [DOI: 10.3390/d12040164] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Assessing the applicability of theory to major adaptive radiations in deep time represents an extremely difficult problem in evolutionary biology. Neoaves, which includes 95% of living birds, is believed to have undergone a period of rapid diversification roughly coincident with the Cretaceous–Paleogene (K-Pg) boundary. We investigate whether basal neoavian lineages experienced an ecological release in response to ecological opportunity, as evidenced by density compensation. We estimated effective population sizes (Ne) of basal neoavian lineages by combining coalescent branch lengths (CBLs) and the numbers of generations between successive divergences. We used a modified version of Accurate Species TRee Algorithm (ASTRAL) to estimate CBLs directly from insertion–deletion (indel) data, as well as from gene trees using DNA sequence and/or indel data. We found that some divergences near the K-Pg boundary involved unexpectedly high gene tree discordance relative to the estimated number of generations between speciation events. The simplest explanation for this result is an increase in Ne, despite the caveats discussed herein. It appears that at least some early neoavian lineages, similar to the ancestor of the clade comprising doves, mesites, and sandgrouse, experienced ecological release near the time of the K-Pg mass extinction.
Collapse
|