1
|
Filtration of Gene Trees from 9000 Exons, Introns and UCEs Disentangle Conflicting Phylogenomic Relationships in Tree Frogs (Hylidae). Genome Biol Evol 2023; 15:7147705. [PMID: 37129064 PMCID: PMC10171231 DOI: 10.1093/gbe/evad070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 02/20/2023] [Accepted: 03/09/2023] [Indexed: 05/03/2023] Open
Abstract
An emerging challenge in interpreting phylogenomic datasets is that concatenation and multi-species coalescent summary species tree approaches may produce conflicting results. Concatenation is problematic because it can strongly support an incorrect topology when Incomplete Lineage Sorting (ILS) results in elevated gene-tree discordance. Conversely, summary species tree methods account for ILS to recover the correct topology, but these methods do not account for erroneous gene trees ("EGTs") resulting from gene tree estimation error (GTEE). Third, site-based and full-likelihood methods promise to alleviate GTEE as these methods use the sequence data from alignments. To understand the impact of GTEE on species tree estimation in Hylidae tree frogs, we use an expansive dataset of ∼9000 exons, introns, and UCEs and initially found conflict between all three types of analytical methods. We filtered EGTs using alignment metrics that could lead to GTEE (length, parsimony-informative sites, missing data) and found that removing shorter, less informative alignments reconciled the conflict between concatenation and summary species tree methods with increased gene concordance, with the filtered topologies matching expected results from past studies. Contrarily, site-based and full-likelihood methods were mixed where one method was consistent with past studies and the other varied markedly. Critical to other studies, these results suggest a widespread conflation of ILS and GTEE, where EGTs rather than ILS are driving discordance. Finally, we apply these recommendations to an R package named PhyloConfigR, which facilitates phylogenetic software setup, summarizes alignments, and provides tools for filtering alignments and gene trees.
Collapse
|
2
|
Phylogeography and population genetics of a widespread cold-adapted ant, Prenolepis imparis. Mol Ecol 2022; 31:4884-4899. [PMID: 35866574 DOI: 10.1111/mec.16624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 07/01/2022] [Accepted: 07/04/2022] [Indexed: 11/30/2022]
Abstract
As species arise, evolve, and diverge, they are shaped by forces that unfold across short and long time scales and at both local and vast geographic scales. It is rare, however, to be able document this history across broad sweeps of time and space in a single species. Here, we report the results of a continental-scale phylogenomic analysis across the entire range of a widespread species. We analyzed sequences of 1,402 orthologous Ultraconserved Element (UCE) loci from 75 individuals to identify population genetic structure and historical demographic patterns across the continent-wide range of a cold-adapted ant, the winter ant, Prenolepis imparis. We recovered five well-supported, genetically isolated clades representing lineages that diverged from 8.2-2.2 million years ago. These include: 1) an early diverging lineage located in Florida, 2) a lineage that spans the southern United States, 3) populations that extend across the midwestern and northeastern United States, 4) populations from the western United States, and 5) populations in southwestern Arizona and Mexico. Population genetic analyses revealed little or no gene flow among these lineages, but patterns consistent with more recent gene flow among populations within lineages, and localized structure with migration in the western United States. High support for five major geographic lineages and lack of evidence of contemporary gene flow indicate in situ diversification across the species' range, producing relatively ancient lineages that persisted through subsequent climate change and glaciation during the Quaternary.
Collapse
|
3
|
Historical specimens and the limits of subspecies phylogenomics in the New World quails (Odontophoridae). Mol Phylogenet Evol 2022; 175:107559. [PMID: 35803448 DOI: 10.1016/j.ympev.2022.107559] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 05/19/2022] [Accepted: 05/26/2022] [Indexed: 01/22/2023]
Abstract
As phylogenomics focuses on comprehensive taxon sampling at the species and population/subspecies levels, incorporating genomic data from historical specimens has become increasingly common. While historical samples can fill critical gaps in our understanding of the evolutionary history of diverse groups, they also introduce additional sources of phylogenomic uncertainty, making it difficult to discern novel evolutionary relationships from artifacts caused by sample quality issues. These problems highlight the need for improved strategies to disentangle artifactual patterns from true biological signal as historical specimens become more prevalent in phylogenomic datasets. Here, we tested the limits of historical specimen-driven phylogenomics to resolve subspecies-level relationships within a highly polytypic family, the New World quails (Odontophoridae), using thousands of ultraconserved elements (UCEs). We found that relationships at and above the species-level were well-resolved and highly supported across all analyses, with the exception of discordant relationships within the two most polytypic genera which included many historical specimens. We examined the causes of discordance and found that inferring phylogenies from subsets of taxa resolved the disagreements, suggesting that analyzing subclades can help remove artifactual causes of discordance in datasets that include historical samples. At the subspecies-level, we found well-resolved geographic structure within the two most polytypic genera, including the most polytypic species in this family, Northern Bobwhites (Colinus virginianus), demonstrating that variable sites within UCEs are capable of resolving phylogenetic structure below the species level. Our results highlight the importance of complete taxonomic sampling for resolving relationships among polytypic species, often through the inclusion of historical specimens, and we propose an integrative strategy for understanding and addressing the uncertainty that historical samples sometimes introduce to phylogenetic analyses.
Collapse
|
4
|
Delimiting the cryptic diversity and host preferences of Sycophila parasitoid wasps associated with oak galls using phylogenomic data. Mol Ecol 2022; 31:4417-4433. [PMID: 35762844 DOI: 10.1111/mec.16582] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 06/06/2022] [Accepted: 06/23/2022] [Indexed: 11/28/2022]
Abstract
Cryptic species diversity is a major challenge for the species-rich community of parasitoids attacking oak gall wasps due to a high degree of sexual dimorphism, morphological plasticity, small size, and poorly known biology. As such, we know very little about the number of species present, nor the evolutionary forces responsible for generating this diversity. One hypothesis is that trait diversity in the gall wasps, including the morphology of the galls they induce, has evolved in response to selection imposed by the parasitoid community, with reciprocal selection driving diversification of the parasitoids. Using a rare, continental-scale data set of Sycophila parasitoid wasps reared from 44 species of cynipid galls from 18 species of oak across the US, we combined mitochondrial DNA barcodes, Ultraconserved Elements (UCEs), morphological, and natural history data to delimit putative species. Using these results, we generate the first large-scale assessment of ecological specialization and host association in this species-rich group, with implications for evolutionary ecology and biocontrol. We find most Sycophila target specific subsets of available cynipid host galls with similar morphologies, and generally attack larger galls. Our results suggest that parasitoid wasps such as Sycophila have adaptations allowing them to exploit particular host trait combinations, while hosts with contrasting traits are resistant to attack. These findings support the tritrophic niche concept for the structuring of plant-herbivore-parasitoid communities.
Collapse
|
5
|
Phylogenomic analysis of Syngnathidae reveals novel relationships, origins of endemic diversity and variable diversification rates. BMC Biol 2022; 20:75. [PMID: 35346180 PMCID: PMC8962102 DOI: 10.1186/s12915-022-01271-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 03/04/2022] [Indexed: 12/03/2022] Open
Abstract
Background Seahorses, seadragons, pygmy pipehorses, and pipefishes (Syngnathidae, Syngnathiformes) are among the most recognizable groups of fishes because of their derived morphology, unusual life history, and worldwide distribution. Despite previous phylogenetic studies and recent new species descriptions of syngnathids, the evolutionary relationships among several major groups within this family remain unresolved. Results Here, we provide a reconstruction of syngnathid phylogeny based on genome-wide sampling of 1314 ultraconserved elements (UCEs) and expanded taxon sampling to assess the current taxonomy and as a basis for macroevolutionary insights. We sequenced a total of 244 new specimens across 117 species and combined with published UCE data for a total of 183 species of Syngnathidae, about 62% of the described species diversity, to compile the most data-rich phylogeny to date. We estimated divergence times using 14 syngnathiform fossils, including nine fossils with newly proposed phylogenetic affinities, to better characterize current and historical biogeographical patterns, and to reconstruct diversification through time. We present a phylogenetic hypothesis that is well-supported and provides several notable insights into syngnathid evolution. We found nine non-monophyletic genera, evidence for seven cryptic species, five potentially invalid synonyms, and identified a novel sister group to the seahorses, the Indo-Pacific pipefishes Halicampus macrorhynchus and H. punctatus. In addition, the morphologically distinct southwest Pacific seahorse Hippocampus jugumus was recovered as the sister to all other non-pygmy seahorses. As found in many other groups, a high proportion of syngnathid lineages appear to have originated in the Central Indo-Pacific and subsequently dispersed to adjoining regions. Conversely, we also found an unusually high subsequent return of lineages from southern Australasia to the Central Indo-Pacific. Diversification rates rose abruptly during the Middle Miocene Climate Transition and peaked after the closure of the Tethys Sea. Conclusions Our results reveal a previously underappreciated diversity of syngnathid lineages. The observed biogeographic patterns suggest a significant role of the southern Australasian region as a source and sink of lineages. Shifts in diversification rates imply possible links to declining global temperatures, the separation of the Atlantic and Pacific faunas, and the environmental changes associated with these events. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-022-01271-w.
Collapse
|
6
|
Phylogenomic analysis of evolutionary relationships in Ranitomeya poison frogs (Family Dendrobatidae) using ultraconserved elements. Mol Phylogenet Evol 2022; 168:107389. [PMID: 35026428 DOI: 10.1016/j.ympev.2022.107389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 11/14/2021] [Accepted: 11/17/2021] [Indexed: 10/19/2022]
Abstract
The use of genome-scale data in phylogenetics has enabled recent strides in determining the relationships between taxa that are taxonomically problematic because of extensive morphological variation. Here, we employ a phylogenomic approach to infer evolutionary relationships within Ranitomeya (Anura: Dendrobatidae), an Amazonian lineage of poison frogs consisting of 16 species with remarkable diversity in color pattern, range size, and parental care behavior. We infer phylogenies with all described species of Ranitomeya from ultraconserved nuclear genomic elements (UCEs) and also estimate divergence times. Our results differ from previous analyses regarding interspecific relationships. Notably, we find that R. toraro and R. defleri are not sister species but rather distantly related, contrary to previous analyses based on smaller genetic datasets. We recover R. uakarii as paraphyletic, designate certain populations formerly assigned to R. fantastica from Peru as R. summersi, and transfer the French Guianan and eastern Brazilian R. amazonica populations to R. variabilis. By clarifying both inter- and intraspecific relationships within Ranitomeya, our study paves the way for future tests of hypotheses on color pattern evolution and historical biogeography.
Collapse
|
7
|
A phylogenomic perspective on the evolutionary history of the stonefly genus Suwallia (Plecoptera: Chloroperlidae) revealed by ultraconserved genomic elements. Mol Phylogenet Evol 2021; 166:107320. [PMID: 34626810 DOI: 10.1016/j.ympev.2021.107320] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 09/20/2021] [Accepted: 10/04/2021] [Indexed: 11/18/2022]
Abstract
Evolutionary biologists have long sought to disentangle phylogenetic relationships among taxa spanning the tree of life, an increasingly important task as anthropogenic influences accelerate population declines and species extinctions, particularly in insects. Phylogenetic analyses are commonly used to identify unique evolutionary lineages, to clarify taxonomic designations of the focal taxa, and to inform conservation decisions. Advances in DNA sequencing techniques have increasingly facilitated the ability of researchers to apply genomic methods to phylogenetic analyses, even for non-model organisms. Stoneflies are non-model insects that are important bioindicators of the quality of freshwater habitats and landscape disturbance as they spend the immature stages of their life cycles in fresh water, and the adult stages in terrestrial environments. Phylogenetic relationships within the stonefly genus Suwallia (Insecta: Plecoptera: Chloroperlidae) are poorly understood, and have never been assessed using molecular data. We used DNA sequence data from genome-wide ultraconserved element loci to generate the first molecular phylogeny for the group and assess its monophyly. We found that Palearctic and Nearctic Suwallia do not form reciprocally monophyletic clades, and that a biogeographic history including dispersal, vicariance, and founder event speciation via jump dispersal best explains the geographic distribution of this group. Our results also strongly suggest that Neaviperla forcipata (Neave, 1929) is nested within Suwallia, and the concept of the genus Suwallia should be revised to include it. Thus, we formally propose a new taxonomic combination wherein Neaviperla forcipata (Neave, 1929) is reclassified as Suwallia forcipata (Neave, 1929). Moreover, some Suwallia species (e.g., S. amoenacolens, S. kerzhneri, S. marginata, S. pallidula, and S. starki) exhibit pronounced cryptic diversity that is worthy of further investigation. These findings provide a first glimpse into the evolutionary history of Suwallia, improve our understanding of stonefly diversity in the tribe Suwallini, and highlight areas where additional research is needed.
Collapse
|
8
|
FrogCap: A modular sequence capture probe-set for phylogenomics and population genetics for all frogs, assessed across multiple phylogenetic scales. Mol Ecol Resour 2021; 22:1100-1119. [PMID: 34569723 DOI: 10.1111/1755-0998.13517] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/08/2021] [Accepted: 09/14/2021] [Indexed: 12/01/2022]
Abstract
Despite the prevalence of high-throughput sequencing in phylogenetics, many relationships remain difficult to resolve because of conflicting signal among genomic regions. Selection of different types of molecular markers from different genomic regions is required to overcome these challenges. For evolutionary studies in frogs, we introduce the publicly available FrogCap suite of genomic resources, which is a large collection of ~15,000 markers that unifies previous genetic sequencing efforts. FrogCap is designed to be modular, such that subsets of markers and SNPs can be selected based on the desired phylogenetic scale. FrogCap uses a variety of marker types that include exons and introns, ultraconserved elements, and previously sequenced Sanger markers, which span up to 10,000 bp in alignment lengths; in addition, we demonstrate potential for SNP-based analyses. We tested FrogCap using 121 samples distributed across five phylogenetic scales, comparing probes designed using a consensus- or exemplar genome-based approach. Using the consensus design is more resilient to issues with sensitivity, specificity, and missing data than picking an exemplar genome sequence. We also tested the impact of different bait kit sizes (20,020 vs. 40,040) on depth of coverage and found triple the depth for the 20,020 bait kit. We observed sequence capture success (i.e., missing data, sequenced markers/bases, marker length, and informative sites) across phylogenetic scales. The incorporation of different marker types is effective for deep phylogenetic relationships and shallow population genetics studies. Having demonstrated FrogCap's utility and modularity, we conclude that these new resources are efficacious for high-throughput sequencing projects across variable timescales.
Collapse
|
9
|
A phylogeny of white-eyes based on ultraconserved elements. Mol Phylogenet Evol 2021; 164:107273. [PMID: 34333115 DOI: 10.1016/j.ympev.2021.107273] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 07/22/2021] [Accepted: 07/27/2021] [Indexed: 10/20/2022]
Abstract
White-eyes are an iconic radiation of passerine birds that have been the subject of studies in evolutionary biology, biogeography, and speciation theory. Zosterops white-eyes in particular are thought to have radiated rapidly across continental and insular regions of the Afro- and Indo-Pacific tropics, yet their phylogenetic history remains equivocal. Here, we sampled 77% of the genera and 47% of known white-eye species and sequenced thousands of ultraconserved elements to infer the phylogeny of the avian family Zosteropidae. We used concatenated maximum likelihood and species tree methods and found strong support for seven clades of white-eyes and three clades within the species-rich Zosterops radiation.
Collapse
|
10
|
Ultraconserved Elements Improve the Resolution of Difficult Nodes within the Rapid Radiation of Neotropical Sigmodontine Rodents (Cricetidae: Sigmodontinae). Syst Biol 2021; 70:1090-1100. [PMID: 33787920 DOI: 10.1093/sysbio/syab023] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 03/23/2021] [Accepted: 03/29/2021] [Indexed: 11/14/2022] Open
Abstract
Sigmodontine rodents (Cricetidae, Sigmodontinae) represent the second largest muroid subfamily and the most species-rich group of New World mammals, encompassing above 410 living species and ca. 87 genera. Even with advances on the clarification of sigmodontine phylogenetic relationships that have been made recently, the phylogenetic relationships among the 12 main group of genera (i.e., tribes) remain poorly resolved, in particular among those forming the large clade Oryzomyalia. This pattern has been interpreted as consequence of a rapid radiation upon the group entrance into South America. Here, we attempted to resolve phylogenetic relationships within Sigmodontinae using target capture and high-throughput sequencing of ultraconserved elements (UCEs). We enriched and sequenced UCEs for 56 individuals and collected data from four already available genomes. Analyses of distinct data sets, based on the capture of 4,634 loci, resulted in a highly resolved phylogeny consistent across different methods. Coalescent species-tree based approaches, concatenated matrices, and Bayesian analyses recovered similar topologies that were congruent at the resolution of difficult nodes. We recovered good support for the intertribal relationships within Oryzomyalia; for instance, the tribe Oryzomyini appears as the sister taxa of the remaining oryzomyalid tribes. The estimates of divergence times agree with results of previous studies. We inferred the crown age of the sigmodontine rodents at the end of Middle Miocene, while the main lineages of Oryzomyalia appear to have radiated in a short interval during the Late Miocene. Thus, the collection of a genomic scale data set with a wide taxonomic sampling, provided resolution for the first time of the relationships among the main lineages of Sigmodontinae. We expect the phylogeny presented here will become the backbone for future systematic and evolutionary studies of the group.
Collapse
|
11
|
Partitioned Gene-Tree Analyses and Gene-Based Topology Testing Help Resolve Incongruence in a Phylogenomic Study of Host-Specialist Bees (Apidae: Eucerinae). Mol Biol Evol 2021; 38:1090-1100. [PMID: 33179746 PMCID: PMC7947843 DOI: 10.1093/molbev/msaa277] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Incongruence among phylogenetic results has become a common occurrence in analyses of genome-scale data sets. Incongruence originates from uncertainty in underlying evolutionary processes (e.g., incomplete lineage sorting) and from difficulties in determining the best analytical approaches for each situation. To overcome these difficulties, more studies are needed that identify incongruences and demonstrate practical ways to confidently resolve them. Here, we present results of a phylogenomic study based on the analysis 197 taxa and 2,526 ultraconserved element (UCE) loci. We investigate evolutionary relationships of Eucerinae, a diverse subfamily of apid bees (relatives of honey bees and bumble bees) with >1,200 species. We sampled representatives of all tribes within the group and >80% of genera, including two mysterious South American genera, Chilimalopsis and Teratognatha. Initial analysis of the UCE data revealed two conflicting hypotheses for relationships among tribes. To resolve the incongruence, we tested concatenation and species tree approaches and used a variety of additional strategies including locus filtering, partitioned gene-trees searches, and gene-based topological tests. We show that within-locus partitioning improves gene tree and subsequent species-tree estimation, and that this approach, confidently resolves the incongruence observed in our data set. After exploring our proposed analytical strategy on eucerine bees, we validated its efficacy to resolve hard phylogenetic problems by implementing it on a published UCE data set of Adephaga (Insecta: Coleoptera). Our results provide a robust phylogenetic hypothesis for Eucerinae and demonstrate a practical strategy for resolving incongruence in other phylogenomic data sets.
Collapse
|
12
|
Phylogenomic Analysis of Concatenated Ultraconserved Elements Reveals the Recent Evolutionary Radiation of the Fairy Wrasses (Teleostei: Labridae: Cirrhilabrus). Syst Biol 2021; 71:1-12. [PMID: 33620490 DOI: 10.1093/sysbio/syab012] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 02/11/2021] [Accepted: 02/17/2021] [Indexed: 01/22/2023] Open
Abstract
The fairy wrasses (genus Cirrhilabrus) are among the most successful of the extant wrasse lineages (Teleostei: Labridae), with their 61 species accounting for nearly 10% of the family. Although species complexes within the genus have been diagnosed on the basis of coloration patterns and synapomorphies, attempts to resolve evolutionary relationships among these groups using molecular and morphological data have largely been unsuccessful. Here we use a phylogenomic approach with a data set comprising 991 ultraconserved elements (UCEs) and mitochondrial COI to uncover the evolutionary history and patterns of temporal and spatial diversification of the fairy wrasses. Our analyses of phylogenetic signal suggest that most gene-tree incongruence is caused by estimation error, leading to poor resolution in a summary-coalescent analysis of the data. In contrast, analyses of concatenated sequences are able to resolve the major relationships of Cirrhilabrus. We determine the placements of species that were previously regarded as incertae sedis and find evidence for the nesting of Conniella, an unusual, monotypic genus, within Cirrhilabrus. Our relaxed-clock dating analysis indicates that the major divergences within the genus occurred around the Miocene-Pliocene boundary, followed by extensive cladogenesis of species complexes in the Pliocene-Pleistocene. Biogeographic reconstruction suggests that the fairy wrasses emerged within the Coral Triangle, with episodic fluctuations of sea levels during glacial cycles coinciding with shallow divergence events but providing few opportunities for more widespread dispersal. Our study demonstrates both the resolving power and limitations of UCEs across shallow timescales where there is substantial estimation error in individual gene trees.
Collapse
|
13
|
Phylogenomic Analysis of Ultraconserved Elements Resolves the Evolutionary and Biogeographic History of Segmented Trapdoor Spiders. Syst Biol 2020; 70:1110-1122. [PMID: 33367903 DOI: 10.1093/sysbio/syaa098] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 12/03/2020] [Accepted: 12/08/2020] [Indexed: 01/24/2023] Open
Abstract
The segmented trapdoor spiders (Liphistiidae) are the sole surviving family of the suborder Mesothelae, which forms the sister lineage to all other living spiders. Liphistiids have retained a number of plesiomorphic traits and their present-day distribution is limited to East and Southeast Asia. Studying this group has the potential to shed light on the deep evolutionary history of spiders, but the phylogeny and divergence times of the family have not been resolved with confidence. We performed phylogenomic and molecular dating analyses of 2,765 ultraconserved element loci from 185 liphistiid taxa. Our analyses show that the crown group of Liphistiidae appeared in the mid-Cretaceous at 102 Ma (95% credibility interval 92-113 Ma), but it was not until the Neogene that much of the diversification within the family occurred in mainland Southeast and East Asia. This diversification was coincident with tectonic events such as the extension of the East Asian continental margin, as well as geological upheavals in Indochina induced by the collision between India and Asia. Our study highlights the important role of major tectonic events in shaping the evolutionary history, present-day diversity, and geographical distribution of mesothele and liphistiid spiders.
Collapse
|
14
|
Divergence, gene flow, and speciation in eight lineages of trans-Beringian birds. Mol Ecol 2020; 29:3526-3542. [PMID: 32745340 DOI: 10.1111/mec.15574] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2019] [Revised: 07/22/2020] [Accepted: 07/27/2020] [Indexed: 12/13/2022]
Abstract
Determining how genetic diversity is structured between populations that span the divergence continuum from populations to biological species is key to understanding the generation and maintenance of biodiversity. We investigated genetic divergence and gene flow in eight lineages of birds with a trans-Beringian distribution, where Asian and North American populations have likely been split and reunited through multiple Pleistocene glacial cycles. Our study transects the speciation process, including eight pairwise comparisons in three orders (ducks, shorebirds and passerines) at population, subspecies and species levels. Using ultraconserved elements (UCEs), we found that these lineages represent conditions from slightly differentiated populations to full biological species. Although allopatric speciation is considered the predominant mode of divergence in birds, all of our best divergence models included gene flow, supporting speciation with gene flow as the predominant mode in Beringia. In our eight lineages, three were best described by a split-migration model (divergence with gene flow), three best fit a secondary contact scenario (isolation followed by gene flow), and two showed support for both models. The lineages were not evenly distributed across a divergence space defined by gene flow (M) and differentiation (FST ), instead forming two discontinuous groups: one with relatively shallow divergence, no fixed single nucleotide polymorphisms (SNPs), and high rates of gene flow between populations; and the second with relatively deeply divergent lineages, multiple fixed SNPs, and low gene flow. Our results highlight the important role that gene flow plays in avian divergence in Beringia.
Collapse
|
15
|
Genetic Variations of Ultraconserved Elements in the Human Genome. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2020; 23:549-559. [PMID: 31689173 DOI: 10.1089/omi.2019.0156] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Ultraconserved elements (UCEs) are among the most popular DNA markers for phylogenomic analysis. In at least three of five placental mammalian genomes (human, dog, cow, mouse, and rat), 2189 UCEs of at least 200 bp in length that are identical have been identified. Most of these regions have not yet been functionally annotated, and their associations with diseases remain largely unknown. This is an important knowledge gap in human genomics with regard to UCE roles in physiologically critical functions, and by extension, their relevance for shared susceptibilities to common complex diseases across several mammalian organisms in the event of their polymorphic variations. In the present study, we remapped the genomic locations of these UCEs to the latest human genome assembly, and examined them for documented polymorphisms in sequenced human genomes. We identified 29,983 polymorphisms within analyzed UCEs, but revealed that a vast majority exhibits very low minor allele frequencies. Notably, only 112 of the identified polymorphisms are associated with a phenotype in the Ensembl genome browser. Through literature analyses, we confirmed associations of 37 (i.e., out of the 112) polymorphisms within 23 UCEs with 25 diseases and phenotypic traits, including, muscular dystrophies, eye diseases, and cancers (e.g., familial adenomatous polyposis). Most reports of UCE polymorphism-disease associations appeared to be not cognizant that their candidate polymorphisms were actually within UCEs. The present study offers strategic directions and knowledge gaps for future computational and experimental work so as to better understand the thus far intriguing and puzzling role(s) of UCEs in mammalian genomes.
Collapse
|
16
|
Ultraconserved Elements Occupy Specific Arenas of Three-Dimensional Mammalian Genome Organization. Cell Rep 2019; 24:479-488. [PMID: 29996107 DOI: 10.1016/j.celrep.2018.06.031] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 05/09/2018] [Accepted: 06/07/2018] [Indexed: 12/23/2022] Open
Abstract
This study explores the relationship between three-dimensional genome organization and ultraconserved elements (UCEs), an enigmatic set of DNA elements that are perfectly conserved between the reference genomes of distantly related species. Examining both human and mouse genomes, we interrogate the relationship of UCEs to three features of chromosome organization derived from Hi-C studies. We find that UCEs are enriched within contact domains and, further, that the subset of UCEs within domains shared across diverse cell types are linked to kidney-related and neuronal processes. In boundaries, UCEs are generally depleted, with those that do overlap boundaries being overrepresented in exonic UCEs. Regarding loop anchors, UCEs are neither overrepresented nor underrepresented, but those present in loop anchors are enriched for splice sites. Finally, as the relationships between UCEs and human Hi-C features are conserved in mouse, our findings suggest that UCEs contribute to interspecies conservation of genome organization and, thus, genome stability.
Collapse
|
17
|
Phylogenetic relationships and systematics of the Amazonian poison frog genus Ameerega using ultraconserved genomic elements. Mol Phylogenet Evol 2019; 142:106638. [PMID: 31586688 DOI: 10.1016/j.ympev.2019.106638] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 10/02/2019] [Accepted: 10/02/2019] [Indexed: 12/13/2022]
Abstract
The Amazonian poison frog genus Ameerega is one of the largest yet most understudied of the brightly colored genera in the anuran family Dendrobatidae, with 30 described species ranging throughout tropical South America. Phylogenetic analyses of Ameerega are highly discordant, lacking consistency due to variation in data types and methods, and often with limited coverage of species diversity in the genus. Here, we present a comprehensive phylogenomic reconstruction of Ameerega, utilizing state-of-the-art sequence capture techniques and phylogenetic methods. We sequenced thousands of ultraconserved elements from over 100 tissue samples, representing almost every described Ameerega species, as well as undescribed cryptic diversity. We generated topologies using maximum likelihood and coalescent methods and compared the use of maximum likelihood and Bayesian methods for estimating divergence times. Our phylogenetic inference diverged strongly from those of previous studies, and we recommend steps to bring Ameerega taxonomy in line with the new phylogeny. We place several species in a phylogeny for the first time, as well as provide evidence for six potential candidate species. We estimate that Ameerega experienced a rapid radiation approximately 7-11 million years ago and that the ancestor of all Ameerega was likely an aposematic, montane species. This study underscores the utility of phylogenomic data in improving our understanding of the phylogeny of understudied clades and making novel inferences about their evolution.
Collapse
|
18
|
Phylogenomic species delimitation in microendemic frogs of the Brazilian Atlantic Forest. Mol Phylogenet Evol 2019; 141:106627. [PMID: 31539606 DOI: 10.1016/j.ympev.2019.106627] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 08/17/2019] [Accepted: 09/17/2019] [Indexed: 10/26/2022]
Abstract
The advent of next-generation sequencing allows researchers to use large-scale datasets for species delimitation analyses, yet one can envision an inflection point where the added accuracy of including more loci does not offset the increased computational burden. One alternative to including all loci could be to prioritize the analysis of loci for which there is an expectation of high informativeness. Here, we explore the issue of species delimitation and locus selection with montane species from two anuran genera that have been isolated in sky islands across the southern Brazilian Atlantic Forest: Melanophryniscus (Bufonidae) and Brachycephalus (Brachycephalidae). To delimit species, we obtained genetic data using target enrichment of ultraconserved elements from 32 populations (13 for Melanophryniscus and 19 for Brachycephalus), and we were able to create datasets that included over 800 loci with no missing data. We ranked loci according to their number of parsimony-informative sites, and we performed species delimitation analyses using BPP with the most informative 10, 20, 40, 80, 160, 320, and 640 loci. We identified three types of phylogenetic node: nodes with either consistently high or low support regardless of the number of loci or their informativeness and nodes that were initially poorly supported where support became stronger as we included more data. When viewed across all sensitivity analyses, our results suggest that the current species richness in both genera is likely underestimated. In addition, our results show the effects of different sampling strategies on species delimitation using phylogenomic datasets.
Collapse
|
19
|
Optimized DNA extraction and library preparation for minute arthropods: Application to target enrichment in chalcid wasps used for biocontrol. Mol Ecol Resour 2019; 19:702-710. [PMID: 30758892 DOI: 10.1111/1755-0998.13006] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 01/18/2019] [Accepted: 02/01/2019] [Indexed: 01/04/2023]
Abstract
Target enrichment is increasingly used for genotyping of plant and animal species or to better understand the evolutionary history of important lineages through the inference of statistically robust phylogenies. Limitations to routine target enrichment are both the complexity of current protocols and low input DNA quantity. Thus, working with tiny organisms such as microarthropods can be challenging. Here, we propose easy to set up optimizations for DNA extraction and library preparation prior to target enrichment. Prepared libraries were used to capture 1,432 ultraconserved elements (UCEs) from microhymenoptera (Chalcidoidea), which are among the tiniest insects on Earth and the most commercialized worldwide for biological control purposes. Results show no correlation between input DNA quantities (1.8-250 ng, 0.4 ng with an extra whole genome amplification step) and the number of sequenced UCEs on an Illumina MiSeq. Phylogenetic inferences highlight the potential of UCEs to solve relationships within the families of chalcid wasps, which has not been achieved so far. The protocol (library preparation + target enrichment) allows processing 96 specimens in five working days, by a single person, without requiring the use of expensive robotic molecular biology platforms, which could help to generalize the use of target enrichment for minute specimens.
Collapse
|
20
|
Phylogenomic incongruence, hypothesis testing, and taxonomic sampling: The monophyly of characiform fishes. Evolution 2018; 73:329-345. [PMID: 30426469 DOI: 10.1111/evo.13649] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Revised: 10/23/2018] [Accepted: 11/05/2018] [Indexed: 11/30/2022]
Abstract
Phylogenomic studies using genome-wide datasets are quickly becoming the state of the art for systematics and comparative studies, but in many cases, they result in strongly supported incongruent results. The extent to which this conflict is real depends on different sources of error potentially affecting big datasets (assembly, stochastic, and systematic error). Here, we apply a recently developed methodology (GGI or gene genealogy interrogation) and data curation to new and published datasets with more than 1000 exons, 500 ultraconserved element (UCE) loci, and transcriptomic sequences that support incongruent hypotheses. The contentious non-monophyly of the order Characiformes proposed by two studies is shown to be a spurious outcome induced by sample contamination in the transcriptomic dataset and an ambiguous result due to poor taxonomic sampling in the UCE dataset. By exploring the effects of number of taxa and loci used for analysis, we show that the power of GGI to discriminate among competing hypotheses is diminished by limited taxonomic sampling, but not equally sensitive to gene sampling. Taken together, our results reinforce the notion that merely increasing the number of genetic loci for a few representative taxa is not a robust strategy to advance phylogenetic knowledge of recalcitrant groups. We leverage the expanded exon capture dataset generated here for Characiformes (206 species in 23 out of 24 families) to produce a comprehensive phylogeny and a revised classification of the order.
Collapse
|
21
|
Combining transcriptomes and ultraconserved elements to illuminate the phylogeny of Apidae. Mol Phylogenet Evol 2018; 130:121-131. [PMID: 30326287 DOI: 10.1016/j.ympev.2018.10.012] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 10/08/2018] [Accepted: 10/09/2018] [Indexed: 02/07/2023]
Abstract
Two increasingly popular approaches to reconstruct the Tree of Life involve whole transcriptome sequencing and the target capture of ultraconserved elements (UCEs). Both methods can be used to generate large, multigene datasets for analysis of phylogenetic relationships in non-model organisms. While targeted exon sequencing across divergent lineages is now a standard method, it is still not clear if UCE data can be readily combined with published transcriptomes. In this study, we evaluate the combination of UCEs and transcriptomes in a single analysis using genome-, transcriptome-, and UCE data for 79 bees in the largest and most biologically diverse bee family, Apidae. Using existing tools, we first developed a workflow to assemble phylogenomic data from different sources and produced two large nucleotide matrices of combined data. We then reconstructed the phylogeny of the Apidae using concatenation- and coalescent-based methods, and critically evaluated the resulting phylogenies in the context of previously published genetic, genomic, and morphological data sets. Our estimated phylogenetic trees are robustly supported and largely congruent with previous molecular hypotheses, from deep nodes to shallow species-level phylogenies. Moreover, the combined approach allows us to resolve controversial nodes of the apid Tree of Life, by clarifying the relationships among the genera of orchid bees (Euglossini) and the monophyly of the Centridini. Additionally, we present novel phylogenetic evidence supporting the monophyly of the diverse clade of cleptoparasitic Apidae and the placement of two enigmatic, oil-collecting genera (Ctenoplectra and Tetrapedia). Lastly, we propose a revised classification of the family Apidae that reflects our improved understanding of apid higher-level relationships.
Collapse
|
22
|
A simple method for data partitioning based on relative evolutionary rates. PeerJ 2018; 6:e5498. [PMID: 30186687 PMCID: PMC6118207 DOI: 10.7717/peerj.5498] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Accepted: 08/01/2018] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Multiple studies have demonstrated that partitioning of molecular datasets is important in model-based phylogenetic analyses. Commonly, partitioning is done a priori based on some known properties of sequence evolution, e.g. differences in rate of evolution among codon positions of a protein-coding gene. Here we propose a new method for data partitioning based on relative evolutionary rates of the sites in the alignment of the dataset being analysed. The rates are inferred using the previously published Tree Independent Generation of Evolutionary Rates (TIGER), and the partitioning is conducted using our novel python script RatePartitions. We conducted simulations to assess the performance of our new method, and we applied it to eight published multi-locus phylogenetic datasets, representing different taxonomic ranks within the insect order Lepidoptera (butterflies and moths) and one phylogenomic dataset, which included ultra-conserved elements as well as introns. METHODS We used TIGER-rates to generate relative evolutionary rates for all sites in the alignments. Then, using RatePartitions, we partitioned the data into partitions based on their relative evolutionary rate. RatePartitions applies a simple formula that ensures a distribution of sites into partitions following the distribution of rates of the characters from the full dataset. This ensures that the invariable sites are placed in a partition with slowly evolving sites, avoiding the pitfalls of previously used methods, such as k-means. Different partitioning strategies were evaluated using BIC scores as calculated by PartitionFinder. RESULTS Simulations did not highlight any misbehaviour of our partitioning approach, even under difficult parameter conditions or missing data. In all eight phylogenetic datasets, partitioning using TIGER-rates and RatePartitions was significantly better as measured by the BIC scores than other partitioning strategies, such as the commonly used partitioning by gene and codon position. We compared the resulting topologies and node support for these eight datasets as well as for the phylogenomic dataset. DISCUSSION We developed a new method of partitioning phylogenetic datasets without using any prior knowledge (e.g. DNA sequence evolution). This method is entirely based on the properties of the data being analysed and can be applied to DNA sequences (protein-coding, introns, ultra-conserved elements), protein sequences, as well as morphological characters. A likely explanation for why our method performs better than other tested partitioning strategies is that it accounts for the heterogeneity in the data to a much greater extent than when data are simply subdivided based on prior knowledge.
Collapse
|
23
|
Upregulation of uc.189 in patients with esophageal squamous cell carcinoma and its clinicopathologic value. Pathol Res Pract 2017; 213:1400-1403. [PMID: 28941722 DOI: 10.1016/j.prp.2017.09.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 08/14/2017] [Accepted: 09/05/2017] [Indexed: 11/18/2022]
Abstract
Ultraconserved elements (UCEs) encoding noncoding RNAs serve as important regulators in cancer biology. Until now, the role of the UCE uc.189 in human cancers remains undefined and the clinical significance of uc.189 in esophageal cancers remains unknown. This study was to identify the prognostic value of uc.189 expression in esophageal squamous cell carcinomas (ESCC). Quantitative real-time polymerase chain reaction (qRT-PCR) was used to detect the expression level of uc.189 in matched cancerous tissues and adjacent noncancerous tissues from 152 patients with ESCC. The correlation of uc.189 with clinicopathological features and prognosis were also analyzed. The expression of uc.189 was significantly higher in human ESCC compared with the adjacent noncancerous tissues (122/152, 80.3%, p<0.01), and the high level of uc.189 expression was significantly correlated with invasion of the tumor (p=0.009), advanced clinical stage (p=0.000), lymph node metastasis (p=0.000), and poor prognosis. High expression of uc.189 might reflect poor prognosis of ESCC and indicate a potential diagnostic target in ESCC patients. Uc.189 might be considered as a novel molecule involved in ESCC progression, which provides a potential prognostic biomarker and therapeutic target.
Collapse
MESH Headings
- Aged
- Aged, 80 and over
- Biomarkers, Tumor/analysis
- Biomarkers, Tumor/genetics
- Carcinoma, Squamous Cell/diagnosis
- Carcinoma, Squamous Cell/genetics
- Carcinoma, Squamous Cell/metabolism
- Carcinoma, Squamous Cell/pathology
- Cell Line, Tumor
- Esophageal Neoplasms/diagnosis
- Esophageal Neoplasms/genetics
- Esophageal Neoplasms/metabolism
- Esophageal Neoplasms/pathology
- Esophageal Squamous Cell Carcinoma
- Female
- Gene Expression Regulation, Neoplastic
- Humans
- Lymphatic Metastasis/diagnosis
- Lymphatic Metastasis/genetics
- Lymphatic Metastasis/pathology
- Male
- Middle Aged
- Prognosis
- RNA, Long Noncoding/metabolism
- Up-Regulation
Collapse
|
24
|
Abstract
Ultraconserved elements (UCEs) are discrete genomic elements conserved across large evolutionary distances. Although UCEs have been linked to multiple facets of mammalian gene regulation their extreme evolutionary conservation remains largely unexplained. Here, we apply a computational approach to investigate this question in Drosophila, exploring the molecular functions of more than 1,500 UCEs shared across the genomes of 12 Drosophila species. Our data indicate that Drosophila UCEs are hubs for gene regulatory functions and suggest that UCE sequence invariance originates from their combinatorial roles in gene control. We also note that the gene regulatory roles of intronic and intergenic UCEs (iUCEs) are distinct from those found in exonic UCEs (eUCEs). In iUCEs, transcription factor (TF) and epigenetic factor binding data strongly support iUCE roles in transcriptional and epigenetic regulation. In contrast, analyses of eUCEs indicate that they are two orders of magnitude more likely than the expected to simultaneously include protein-coding sequence, TF-binding sites, splice sites, and RNA editing sites but have reduced roles in transcriptional or epigenetic regulation. Furthermore, we use a Drosophila cell culture system and transgenic Drosophila embryos to validate the notion of UCE combinatorial regulatory roles using an eUCE within the Hox gene Ultrabithorax and show that its protein-coding region also contains alternative splicing regulatory information. Taken together our experiments indicate that UCEs emerge as a result of combinatorial gene regulatory roles and highlight common features in mammalian and insect UCEs implying that similar processes might underlie ultraconservation in diverse animal taxa.
Collapse
|
25
|
How Should Genes and Taxa be Sampled for Phylogenomic Analyses with Missing Data? An Empirical Study in Iguanian Lizards. Syst Biol 2015; 65:128-45. [PMID: 26330450 DOI: 10.1093/sysbio/syv058] [Citation(s) in RCA: 110] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 08/04/2015] [Indexed: 11/12/2022] Open
Abstract
Targeted sequence capture is becoming a widespread tool for generating large phylogenomic data sets to address difficult phylogenetic problems. However, this methodology often generates data sets in which increasing the number of taxa and loci increases amounts of missing data. Thus, a fundamental (but still unresolved) question is whether sampling should be designed to maximize sampling of taxa or genes, or to minimize the inclusion of missing data cells. Here, we explore this question for an ancient, rapid radiation of lizards, the pleurodont iguanians. Pleurodonts include many well-known clades (e.g., anoles, basilisks, iguanas, and spiny lizards) but relationships among families have proven difficult to resolve strongly and consistently using traditional sequencing approaches. We generated up to 4921 ultraconserved elements with sampling strategies including 16, 29, and 44 taxa, from 1179 to approximately 2.4 million characters per matrix and approximately 30% to 60% total missing data. We then compared mean branch support for interfamilial relationships under these 15 different sampling strategies for both concatenated (maximum likelihood) and species tree (NJst) approaches (after showing that mean branch support appears to be related to accuracy). We found that both approaches had the highest support when including loci with up to 50% missing taxa (matrices with ~40-55% missing data overall). Thus, our results show that simply excluding all missing data may be highly problematic as the primary guiding principle for the inclusion or exclusion of taxa and genes. The optimal strategy was somewhat different for each approach, a pattern that has not been shown previously. For concatenated analyses, branch support was maximized when including many taxa (44) but fewer characters (1.1 million). For species-tree analyses, branch support was maximized with minimal taxon sampling (16) but many loci (4789 of 4921). We also show that the choice of these sampling strategies can be critically important for phylogenomic analyses, since some strategies lead to demonstrably incorrect inferences (using the same method) that have strong statistical support. Our preferred estimate provides strong support for most interfamilial relationships in this important but phylogenetically challenging group.
Collapse
|