1
|
Silva-Ramos CR, Ballesteros-Ballesteros JA, Chala-Quintero SM, Matiz-González JM, Herrera-Sepúlveda MT, Faccini-Martínez ÁA, Pulido-Villamarín ADP, Hidalgo M, Pinto CM, Pérez-Torres J, Cuervo C. Genetic diversity of Bartonella spp. among cave-dwelling bats from Colombia. Acta Trop 2024; 259:107370. [PMID: 39216810 DOI: 10.1016/j.actatropica.2024.107370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 07/06/2024] [Accepted: 08/26/2024] [Indexed: 09/04/2024]
Abstract
Bartonella is a bacterial genus that comprises arthropod-borne microorganisms. Several Bartonella isolates have been detected from bats worldwide, which are thought to be undescribed species. We aimed to test the presence of Bartonella spp. among bats from Colombia, and evaluate the genetic diversity of bat-associated Bartonella spp. through phylogenetic analyses. A total of 108 bat blood samples were collected from three bat species (Carollia perspicillata, Mormoops megalophylla, and Natalus tumidirostris) that inhabit the Macaregua cave. The Bartonella ssrA gene was targeted through real-time and end-point PCR; additionally, the gltA and rpoB genes were detected by end-point PCR. All obtained amplicons were purified and bidirectionally sequenced for phylogenetic analysis using a concatenated supermatrix and a supertree approaches. A detection frequency of 49.1 % (53/108) for Bartonella spp. was evidenced among bat blood samples, of which 59.1 % (26/44), 54.3 % (19/35) and 27.6 % (8/29) were identified in Carollia perspicillata, Natalus tumidirostris and Mormoops megalophylla respectively. A total of 35 ssrA, 5 gltA and 4 rpoB good-quality sequences were obtained which were used for phylogenetic analysis. All obtained bat sequences clustered together with sequences obtained from Neotropical bat species into two bat-restricted clades namely clade A and clade N. We detected the presence of Bartonella spp. that clustered within two different bat-associated Bartonella clades, giving the first data of the genetic diversity of these bacteria among bats from Colombia.
Collapse
Affiliation(s)
- Carlos Ramiro Silva-Ramos
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Jesús A Ballesteros-Ballesteros
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Sandra M Chala-Quintero
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - J Manuel Matiz-González
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia; Molecular Genetics and Antimicrobial Resistance Unit, Universidad El Bosque, Bogotá, Colombia
| | - María T Herrera-Sepúlveda
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Álvaro A Faccini-Martínez
- Servicio de Infectología, Hospital Militar Central, Bogotá, Colombia; Servicios y Asesorías en Infectología - SAI, Bogotá, Colombia; Facultad de Medicina, Universidad Militar Nueva Granada, Bogotá, Colombia
| | - Adriana Del Pilar Pulido-Villamarín
- Unidad de Investigaciones Agropecuarias (UNIDIA), Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Marylin Hidalgo
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - C Miguel Pinto
- Charles Darwin Research Station, Charles Darwin Foundation, Puerto Ayora, Galápagos, Ecuador
| | - Jairo Pérez-Torres
- Unidad de Ecología y Sistemática (UNESIS), Laboratorio de Ecología Funcional, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Claudia Cuervo
- Grupo de Enfermedades Infecciosas, Departamento de Microbiología, Facultad de Ciencias, Pontificia Universidad Javeriana, Bogotá, Colombia.
| |
Collapse
|
2
|
Leal JL, Milesi P, Hodková E, Zhou Q, James J, Eklund DM, Pyhäjärvi T, Salojärvi J, Lascoux M. Complex Polyploids: Origins, Genomic Composition, and Role of Introgressed Alleles. Syst Biol 2024; 73:392-418. [PMID: 38613229 PMCID: PMC11282369 DOI: 10.1093/sysbio/syae012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 12/18/2023] [Accepted: 03/28/2024] [Indexed: 04/14/2024] Open
Abstract
Introgression allows polyploid species to acquire new genomic content from diploid progenitors or from other unrelated diploid or polyploid lineages, contributing to genetic diversity and facilitating adaptive allele discovery. In some cases, high levels of introgression elicit the replacement of large numbers of alleles inherited from the polyploid's ancestral species, profoundly reshaping the polyploid's genomic composition. In such complex polyploids, it is often difficult to determine which taxa were the progenitor species and which taxa provided additional introgressive blocks through subsequent hybridization. Here, we use population-level genomic data to reconstruct the phylogenetic history of Betula pubescens (downy birch), a tetraploid species often assumed to be of allopolyploid origin and which is known to hybridize with at least four other birch species. This was achieved by modeling polyploidization and introgression events under the multispecies coalescent and then using an approximate Bayesian computation rejection algorithm to evaluate and compare competing polyploidization models. We provide evidence that B. pubescens is the outcome of an autoploid genome doubling event in the common ancestor of B. pendula and its extant sister species, B. platyphylla, that took place approximately 178,000-188,000 generations ago. Extensive hybridization with B. pendula, B. nana, and B. humilis followed in the aftermath of autopolyploidization, with the relative contribution of each of these species to the B. pubescens genome varying markedly across the species' range. Functional analysis of B. pubescens loci containing alleles introgressed from B. nana identified multiple genes involved in climate adaptation, while loci containing alleles derived from B. humilis revealed several genes involved in the regulation of meiotic stability and pollen viability in plant species.
Collapse
Affiliation(s)
- J Luis Leal
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Pascal Milesi
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| | - Eva Hodková
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Faculty of Environmental Sciences, Czech University of Life Sciences Prague, Kamýcká 129, 16521 Prague, Czech Republic
| | - Qiujie Zhou
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - Jennifer James
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
| | - D Magnus Eklund
- Physiology and Environmental Toxicology, Department of Organismal Biology, Uppsala University, Norbyvägen 18A, 75236 Uppsala, Sweden
| | - Tanja Pyhäjärvi
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65 (Viikinkaari 1), 00014 Helsinki, Finland
- Department of Forest Sciences, University of Helsinki, 00014 Helsinki, Finland
| | - Jarkko Salojärvi
- School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore
- Organismal and Evolutionary Biology Research Program, Faculty of Biological and Environmental Sciences, and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65 (Viikinkaari 1), 00014 Helsinki, Finland
| | - Martin Lascoux
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, 75236 Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, 75237 Uppsala, Sweden
| |
Collapse
|
3
|
Frost LA, Bedoya AM, Lagomarsino LP. Artifactual Orthologs and the Need for Diligent Data Exploration in Complex Phylogenomic Datasets: A Museomic Case Study from the Andean Flora. Syst Biol 2024; 73:308-322. [PMID: 38170162 DOI: 10.1093/sysbio/syad076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 11/20/2023] [Accepted: 01/02/2024] [Indexed: 01/05/2024] Open
Abstract
The Andes mountains of western South America are a globally important biodiversity hotspot, yet there is a paucity of resolved phylogenies for plant clades from this region. Filling an important gap in our understanding of the World's richest flora, we present the first phylogeny of Freziera (Pentaphylacaceae), an Andean-centered, cloud forest radiation. Our dataset was obtained via hybrid-enriched target sequence capture of Angiosperms353 universal loci for 50 of the ca. 75 spp., obtained almost entirely from herbarium specimens. We identify high phylogenomic complexity in Freziera, including the presence of data artifacts. Via by-eye observation of gene trees, detailed examination of warnings from recently improved assembly pipelines, and gene tree filtering, we identified that artifactual orthologs (i.e., the presence of only one copy of a multicopy gene due to differential assembly) were an important source of gene tree heterogeneity that had a negative impact on phylogenetic inference and support. These artifactual orthologs may be common in plant phylogenomic datasets, where multiple instances of genome duplication are common. After accounting for artifactual orthologs as source of gene tree error, we identified a significant, but nonspecific signal of introgression using Patterson's D and f4 statistics. Despite phylogenomic complexity, we were able to resolve Freziera into 9 well-supported subclades whose evolution has been shaped by multiple evolutionary processes, including incomplete lineage sorting, historical gene flow, and gene duplication. Our results highlight the complexities of plant phylogenomics, which are heightened in Andean radiations, and show the impact of filtering data processing artifacts and standard filtering approaches on phylogenetic inference.
Collapse
Affiliation(s)
- Laura A Frost
- Shirley C. Tucker Herbarium, Department of Biological Sciences, Louisiana State University, Life Science Annex Building A257, Baton Rouge, LA 70803, USA
- Biology Department, University of South Alabama, 5871 USA N Dr, Mobile, AL 36688, USA
| | - Ana M Bedoya
- Shirley C. Tucker Herbarium, Department of Biological Sciences, Louisiana State University, Life Science Annex Building A257, Baton Rouge, LA 70803, USA
| | - Laura P Lagomarsino
- Shirley C. Tucker Herbarium, Department of Biological Sciences, Louisiana State University, Life Science Annex Building A257, Baton Rouge, LA 70803, USA
| |
Collapse
|
4
|
Fleming J, Eriksen PM, Struck TH. Scoutknife: A naïve, whole genome informed phylogenetic robusticity metric. F1000Res 2024; 12:945. [PMID: 38799242 PMCID: PMC11128044 DOI: 10.12688/f1000research.139356.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/05/2024] [Indexed: 05/29/2024] Open
Abstract
Background: The phylogenetic bootstrap, first proposed by Felsenstein in 1985, is a critically important statistical method in assessing the robusticity of phylogenetic datasets. Core to its concept was the use of pseudo sampling - assessing the data by generating new replicates derived from the initial dataset that was used to generate the phylogeny. In this way, phylogenetic support metrics could overcome the lack of perfect, infinite data. With infinite data, however, it is possible to sample smaller replicates directly from the data to obtain both the phylogeny and its statistical robusticity in the same analysis. Due to the growth of whole genome sequencing, the depth and breadth of our datasets have greatly expanded and are set to only expand further. With genome-scale datasets comprising thousands of genes, we can now obtain a proxy for infinite data. Accordingly, we can potentially abandon the notion of pseudo sampling and instead randomly sample small subsets of genes from the thousands of genes in our analyses. Methods: We introduce Scoutknife, a jackknife-style subsampling implementation that generates 100 datasets by randomly sampling a small number of genes from an initial large-gene dataset to jointly establish both a phylogenetic hypothesis and assess its robusticity. We assess its effectiveness by using 18 previously published datasets and 100 simulation studies. Results: We show that Scoutknife is conservative and informative as to conflicts and incongruence across the whole genome, without the need for subsampling based on traditional model selection criteria. Conclusions: Scoutknife reliably achieves comparable results to selecting the best genes on both real and simulation datasets, while being resistant to the potential biases caused by selecting for model fit. As the amount of genome data grows, it becomes an even more exciting option to assess the robusticity of phylogenetic hypotheses.
Collapse
Affiliation(s)
- James Fleming
- Natural History Museum, Universitetet i Oslo, Oslo, Oslo, 0562, Norway
| | | | | |
Collapse
|
5
|
Yan Y, da Fonseca RR, Rahbek C, Borregaard MK, Davis CC. A new nuclear phylogeny of the tea family (Theaceae) unravels rapid radiations in genus Camellia. Mol Phylogenet Evol 2024; 196:108089. [PMID: 38679302 DOI: 10.1016/j.ympev.2024.108089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 03/08/2024] [Accepted: 04/25/2024] [Indexed: 05/01/2024]
Abstract
Molecular analyses of rapidly radiating groups often reveal incongruence between gene trees. This mainly results from incomplete lineage sorting, introgression, and gene tree estimation error, which complicate the estimation of phylogenetic relationships. In this study, we reconstruct the phylogeny of Theaceae using 348 nuclear loci from 68 individuals and two outgroup taxa. Sequence data were obtained by target enrichment using the recently released Angiosperm 353 universal probe set applied to herbarium specimens. The robustness of the topologies to variation in data quality was established under a range of different filtering schemes, using both coalescent and concatenation approaches. Our results confirmed most of the previously hypothesized relationships among tribes and genera, while clarifying additional interspecific relationships within the rapidly radiating genus Camellia. We recovered a remarkably high degree of gene tree heterogeneity indicative of rapid radiation in the group and observed cytonuclear conflicts, especially within Camellia. This was especially pronounced around short branches, which we primarily associate with gene tree estimation error. Our analysis also indicates that incomplete lineage sorting (ILS) contributed to gene-tree conflicts and accounted for approximately 14 % of the explained variation, whereas inferred introgression levels were low. Our study advances the understanding of the evolution of this important plant family and provides guidance on the application of target capture methods and the evaluation of key processes that influence phylogenetic discordances.
Collapse
Affiliation(s)
- Yujing Yan
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA.
| | - Rute R da Fonseca
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Carsten Rahbek
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Center for Global Mountain Biodiversity, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark; Department of Life Sciences, Imperial College London, Silkwood Park campus, Ascot SL5 7PY, UK; Danish Institute for Advanced Study, University of Southern Denmark, 5230 Odense M, Denmark
| | - Michael K Borregaard
- Center for Macroecology, Evolution and Climate, Globe Institute, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen, Denmark
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, 22 Divinity Ave, Cambridge, MA 02138, USA
| |
Collapse
|
6
|
Zhang Z, Liu G, Li M. Incomplete lineage sorting and gene flow within Allium (Amayllidaceae). Mol Phylogenet Evol 2024; 195:108054. [PMID: 38471599 DOI: 10.1016/j.ympev.2024.108054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/01/2024] [Accepted: 03/07/2024] [Indexed: 03/14/2024]
Abstract
The phylogeny and systematics of the genus Allium have been studied with a variety of diverse data types, including an increasing amount of molecular data. However, strong phylogenetic discordance and high levels of uncertainty have prevented the identification of a consistent phylogeny. The difficulty in establishing phylogenetic consensus and evidence for genealogical discordance make Allium a compelling test case to assess the relative contribution of incomplete lineage sorting (ILS), gene flow and gene tree estimation error on phylogenetic reconstruction. In this study, we obtained 75 transcriptomes of 38 Allium species across 10 subgenera. Whole plastid genome, single copy genes and consensus CDS were generated to estimate phylogenetic trees both using coalescence and concatenation methods. Multiple approaches including coalescence simulation, quartet sampling, reticulate network inference, sequence simulation, theta of ILS and reticulation index were carried out across the CDS gene trees to investigate the degrees of ILS, gene flow and gene tree estimation error. Afterward, a regression analysis was used to test the relative contributions of each of these forms of uncertainty to the final phylogeny. Despite extensive topological discordance among gene trees, we found a fully supported species tree that agrees with the most of well-accepted relationships and establishes monophyly of the genus Allium. We presented clear evidence for substantial ILS across the phylogeny of Allium. Further, we identified two ancient hybridization events for the formation of the second evolutionary line and subg. Butomissa as well as several introgression events between recently diverged species. Our regression analysis revealed that gene tree inference error and gene flow were the two most dominant factors explaining for the overall gene tree variation, with the difficulty in disentangling the effects of ILS and gene tree estimation error due to a positive correlation between them. Based on our efforts to mitigate the methodological errors in reconstructing trees, we believed ILS and gene flow are two principal reasons for the oft-reported phylogenetic heterogeneity of Allium. This study presents a strongly-supported and well-resolved phylogenetic backbone for the sampled Allium species, and exemplifies how to untangle heterogeneity in phylogenetic signal and reconstruct the true evolutionary history of the target taxa.
Collapse
Affiliation(s)
- ZengZhu Zhang
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, People's Republic of China
| | - Gang Liu
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, People's Republic of China
| | - Minjie Li
- State Key Laboratory of Herbage Improvement and Grassland Agro-ecosystems, College of Ecology, Lanzhou University, Lanzhou 730000, People's Republic of China.
| |
Collapse
|
7
|
Jiang Z, Zang W, Ericson PGP, Song G, Wu S, Feng S, Drovetski SV, Liu G, Zhang D, Saitoh T, Alström P, Edwards SV, Lei F, Qu Y. Gene flow and an anomaly zone complicate phylogenomic inference in a rapidly radiated avian family (Prunellidae). BMC Biol 2024; 22:49. [PMID: 38413944 PMCID: PMC10900574 DOI: 10.1186/s12915-024-01848-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2023] [Accepted: 02/15/2024] [Indexed: 02/29/2024] Open
Abstract
BACKGROUND Resolving the phylogeny of rapidly radiating lineages presents a challenge when building the Tree of Life. An Old World avian family Prunellidae (Accentors) comprises twelve species that rapidly diversified at the Pliocene-Pleistocene boundary. RESULTS Here we investigate the phylogenetic relationships of all species of Prunellidae using a chromosome-level de novo assembly of Prunella strophiata and 36 high-coverage resequenced genomes. We use homologous alignments of thousands of exonic and intronic loci to build the coalescent and concatenated phylogenies and recover four different species trees. Topology tests show a large degree of gene tree-species tree discordance but only 40-54% of intronic gene trees and 36-75% of exonic genic trees can be explained by incomplete lineage sorting and gene tree estimation errors. Estimated branch lengths for three successive internal branches in the inferred species trees suggest the existence of an empirical anomaly zone. The most common topology recovered for species in this anomaly zone was not similar to any coalescent or concatenated inference phylogenies, suggesting presence of anomalous gene trees. However, this interpretation is complicated by the presence of gene flow because extensive introgression was detected among these species. When exploring tree topology distributions, introgression, and regional variation in recombination rate, we find that many autosomal regions contain signatures of introgression and thus may mislead phylogenetic inference. Conversely, the phylogenetic signal is concentrated to regions with low-recombination rate, such as the Z chromosome, which are also more resistant to interspecific introgression. CONCLUSIONS Collectively, our results suggest that phylogenomic inference should consider the underlying genomic architecture to maximize the consistency of phylogenomic signal.
Collapse
Affiliation(s)
- Zhiyong Jiang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Wenqing Zang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Per G P Ericson
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, PO Box 50007, Stockholm, SE-104 05, Sweden
| | - Gang Song
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Shaoyuan Wu
- Jiangsu International Joint Center of Genomics, Jiangsu Key Laboratory of Phylogenomics & Comparative Genomics, School of Life Sciences, Jiangsu Normal University, Xuzhou, 221116, Jiangsu, China
| | - Shaohong Feng
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Liangzhu Laboratory, Zhejiang University, 1369 West Wenyi Road, Hangzhou, 311121, China
- Innovation Center of Yangtze River Delta, Zhejiang University, Jiashan, 314102, China
| | - Sergei V Drovetski
- National Museum of Natural History, Smithsonian Institution, Washington, DC, 20004, USA
- Present address: U.S. Geological Survey, Eastern Ecological Science Center at Patuxent Research Refuge, Laurel, MD, 20708, USA
| | - Gang Liu
- Chinese Academy of Forestry, Institute of Ecological Conservation and Restoration, Beijing, 100091, China
| | - Dezhi Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Takema Saitoh
- Yamashina Institute for Ornithology, Abiko, Chiba, Japan
| | - Per Alström
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Norbyvägen 18 D, 752 36, Uppsala, Sweden
| | - Scott V Edwards
- Museum of Comparative Zoology and Department of Organismic & Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA, 02138, USA
| | - Fumin Lei
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yanhua Qu
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, PO Box 50007, Stockholm, SE-104 05, Sweden.
| |
Collapse
|
8
|
Piwczyński M, Granjon L, Trzeciak P, Carlos Brito J, Oana Popa M, Daba Dinka M, Johnston NP, Boratyński Z. Unraveling phylogenetic relationships and species boundaries in the arid adapted Gerbillus rodents (Muridae: Gerbillinae) by RAD-seq data. Mol Phylogenet Evol 2023; 189:107913. [PMID: 37659480 DOI: 10.1016/j.ympev.2023.107913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 09/04/2023]
Abstract
Gerbillus is one of the most speciose genera among rodents, with ca. 51 recognized species. Previous attempts to reconstruct the evolutionary history of Gerbillus mainly relied on the mitochondrial cyt-b marker as a source of phylogenetic information. In this study, we utilize RAD-seq genomic data from 37 specimens representing 11 species to reconstruct the phylogenetic tree for Gerbillus, applying concatenation and coalescence methods. We identified four highly supported clades corresponding to the traditionally recognized subgenera: Dipodillus, Gerbillus, Hendecapleura and Monodia. Only two uncertain branches were detected in the resulting trees, with one leading to diversification of the main lineages in the genus, recognized by quartet sampling analysis as uncertain due to possible introgression. We also examined species boundaries for four pairs of sister taxa, including potentially new species from Morocco, using SNAPP. The results strongly supported a speciation model in which all taxa are treated as separate species. The dating analyses confirmed the Plio-Pleistocene diversification of the genus, with the uncertain branch coinciding with the beginning of aridification of the Sahara at the the Plio-Pleistocene boundary. This study aligns well with the earlier analyses based on the cyt-b marker, reaffirming its suitability as an adequate marker for estimating genetic diversity in Gerbillus.
Collapse
Affiliation(s)
- Marcin Piwczyński
- Department of Ecology and Biogeography, Nicolaus Copernicus University in Toruń, Lwowska 1, PL-87-100 Toruń, Poland.
| | - Laurent Granjon
- CBGP, IRD, CIRAD, INRAE, Institut Agro, Université de Montpellier, Montpellier, France
| | - Paulina Trzeciak
- Department of Ecology and Biogeography, Nicolaus Copernicus University in Toruń, Lwowska 1, PL-87-100 Toruń, Poland
| | - José Carlos Brito
- CIBIO-InBio, Research Center in Biodiversity and Genetic Resources, University of Porto, Campus de Vairão, Rua Padre Armando Quintas 7, 4485-661 Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal; Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Porto, Portugal
| | - Madalina Oana Popa
- Department of Ecology and Biogeography, Nicolaus Copernicus University in Toruń, Lwowska 1, PL-87-100 Toruń, Poland; "Stejarul" Research Centre for Biological Sciences, National Institute of Research and Development for Biological Sciences, Alexandru cel Bun 6, RO-610004, Piatra Neamţ, Romania
| | - Mergi Daba Dinka
- Department of Ecology and Biogeography, Nicolaus Copernicus University in Toruń, Lwowska 1, PL-87-100 Toruń, Poland
| | - Nikolas P Johnston
- School of Life Sciences, University of Technology Sydney, 15 Broadway, Ultimo, NSW 2007, Australia; Centre for Sustainable Ecosystem Solutions, School of Earth, Atmospheric and Life Sciences, University of Wollongong, Northfields Ave, Wollongong, NSW 2500, Australia
| | - Zbyszek Boratyński
- CIBIO-InBio, Research Center in Biodiversity and Genetic Resources, University of Porto, Campus de Vairão, Rua Padre Armando Quintas 7, 4485-661 Vairão, Portugal; BIOPOLIS Program in Genomics, Biodiversity and Land Planning, CIBIO, Campus de Vairão, Vairão, Portugal
| |
Collapse
|
9
|
Simmons MP, Goloboff PA, Stöver BC, Springer MS, Gatesy J. Quantification of congruence among gene trees with polytomies using overall success of resolution for phylogenomic coalescent analyses. Cladistics 2023; 39:418-436. [PMID: 37096985 DOI: 10.1111/cla.12540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/22/2023] [Accepted: 03/24/2023] [Indexed: 04/26/2023] Open
Abstract
Gene-tree-inference error can cause species-tree-inference artefacts in summary phylogenomic coalescent analyses. Here we integrate two ways of accommodating these inference errors: collapsing arbitrarily or dubiously resolved gene-tree branches, and subsampling gene trees based on their pairwise congruence. We tested the effect of collapsing gene-tree branches with 0% approximate-likelihood-ratio-test (SH-like aLRT) support in likelihood analyses and strict consensus trees for parsimony, and then subsampled those partially resolved trees based on congruence measures that do not penalize polytomies. For this purpose we developed a new TNT script for congruence sorting (congsort), and used it to calculate topological incongruence for eight phylogenomic datasets using three distance measures: standard Robinson-Foulds (RF) distances; overall success of resolution (OSR), which is based on counting both matching and contradicting clades; and RF contradictions, which only counts contradictory clades. As expected, we found that gene-tree incongruence was often concentrated in clades that are arbitrarily or dubiously resolved and that there was greater congruence between the partially collapsed gene trees and the coalescent and concatenation topologies inferred from those genes. Coalescent branch lengths typically increased as the most incongruent gene trees were excluded, although branch supports typically did not. We investigated two successful and complementary approaches to prioritizing genes for investigation of alignment or homology errors. Coalescent-tree clades that contradicted concatenation-tree clades were generally less robust to gene-tree subsampling than congruent clades. Our preferred approach to collapsing likelihood gene-tree clades (0% SH-like aLRT support) and subsampling those trees (OSR) generally outperformed competing approaches for a large fungal dataset with respect to branch lengths, support and congruence. We recommend widespread application of this approach (and strict consensus trees for parsimony-based analyses) for improving quantification of gene-tree congruence/conflict, estimating coalescent branch lengths, testing robustness of coalescent analyses to gene-tree-estimation error, and improving topological robustness of summary coalescent analyses. This approach is quick and easy to implement, even for huge datasets.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Pablo A Goloboff
- CONICET, INSUE, Fundación Miguel Lillo, Miguel Lillo 251, 4000, S.M. de Tucumán, Argentina
| | - Ben C Stöver
- Institute for Evolution and Biodiversity, WMU Münster, 48149, Münster, Germany
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA, 92521, USA
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, 10024, USA
| |
Collapse
|
10
|
DeRaad DA, McCullough JM, DeCicco LH, Hime PM, Joseph L, Andersen MJ, Moyle RG. Mitonuclear discordance results from incomplete lineage sorting, with no detectable evidence for gene flow, in a rapid radiation of Todiramphus kingfishers. Mol Ecol 2023; 32:4844-4862. [PMID: 37515525 DOI: 10.1111/mec.17080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 03/15/2023] [Accepted: 06/19/2023] [Indexed: 07/31/2023]
Abstract
Many organisms possess multiple discrete genomes (i.e. nuclear and organellar), which are inherited separately and may have unique and even conflicting evolutionary histories. Phylogenetic reconstructions from these discrete genomes can yield different patterns of relatedness, a phenomenon known as cytonuclear discordance. In many animals, mitonuclear discordance (i.e. discordant evolutionary histories between the nuclear and mitochondrial genomes) has been widely documented, but its causes are often considered idiosyncratic and inscrutable. We show that a case of mitonuclear discordance in Todiramphus kingfishers can be explained by extensive genome-wide incomplete lineage sorting (ILS), likely a result of the explosive diversification history of this genus. For these kingfishers, quartet frequencies reveal that the nuclear genome is dominated by discordant topologies, with none of the internal branches in our consensus nuclear tree recovered in >50% of genome-wide gene trees. Meanwhile, a lack of inter-species shared ancestry, non-significant pairwise tests for gene flow, and little evidence for meaningful migration edges between species, leads to the conclusion that gene flow cannot explain the mitonuclear discordance we observe. This lack of evidence for gene flow combined with evidence for extensive genome-wide gene tree discordance, a hallmark of ILS, leads us to conclude that the mitonuclear discordance we observe likely results from ILS, specifically deep coalescence of the mitochondrial genome. Based on this case study, we hypothesize that similar demographic histories in other 'great speciator' taxa across the Indo-Pacific likely predispose these groups to high levels of ILS and high likelihoods of mitonuclear discordance.
Collapse
Affiliation(s)
- Devon A DeRaad
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Jenna M McCullough
- Department of Biology and Museum of Southwestern Biology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Lucas H DeCicco
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Paul M Hime
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| | - Leo Joseph
- Australian National Wildlife Collection, CSIRO National Research Collections Australia, Canberra, Australian Capital Territory, Australia
| | - Michael J Andersen
- Department of Biology and Museum of Southwestern Biology, University of New Mexico, Albuquerque, New Mexico, USA
| | - Robert G Moyle
- Biodiversity Institute and Natural History Museum, University of Kansas, Lawrence, Kansas, USA
| |
Collapse
|
11
|
Ortiz D, Pekár S, Bilat J, Shafaie S, Alvarez N, Gauthier J. Dynamic evolution of size and colour in the highly specialized Zodarion ant-eating spiders. Proc Biol Sci 2023; 290:20230797. [PMID: 37554037 PMCID: PMC10410226 DOI: 10.1098/rspb.2023.0797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 07/17/2023] [Indexed: 08/10/2023] Open
Abstract
Ecological specialists constitute relevant case studies for understanding the mechanisms, potential and limitations of evolution. The species-rich and strictly myrmecophagous spiders of the genus Zodarion show diversified defence mechanisms, including myrmecomorphy of different ant species and nocturnality. Through Hybridization Capture Using RAD Probes (hyRAD), a phylogenomic technique designed for sequencing poorly preserved specimens, we reconstructed a phylogeny of Zodarion using 52 (approx. a third of the nominal) species that cover its phylogenetic and distributional diversity. We then estimated the evolution of body size and colour, traits that have diversified noticeably and are linked to defence mechanisms, across the group. Our genomic matrix of 300 loci led to a well-supported phylogenetic hypothesis that uncovered two main clades inside Zodarion. Ancestral state estimation revealed the highly dynamic evolution of body size and colour across the group, with multiple transitions and convergences in both traits, which we propose is likely indicative of multiple transitions in ant specialization across the genus. Our study will allow the informed targeted selection of Zodarion taxa of special interest for research into the group's remarkable adaptations to ant specialization. It also exemplifies the utility of hyRAD for phylogenetic studies using museum material.
Collapse
Affiliation(s)
- David Ortiz
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Stano Pekár
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Julia Bilat
- Geneva Natural History Museum, Geneva, Switzerland
| | - Sepideh Shafaie
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | | | | |
Collapse
|
12
|
Bernot JP, Owen CL, Wolfe JM, Meland K, Olesen J, Crandall KA. Major Revisions in Pancrustacean Phylogeny and Evidence of Sensitivity to Taxon Sampling. Mol Biol Evol 2023; 40:msad175. [PMID: 37552897 PMCID: PMC10414812 DOI: 10.1093/molbev/msad175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 06/14/2023] [Accepted: 06/19/2023] [Indexed: 08/10/2023] Open
Abstract
The clade Pancrustacea, comprising crustaceans and hexapods, is the most diverse group of animals on earth, containing over 80% of animal species and half of animal biomass. It has been the subject of several recent phylogenomic analyses, yet relationships within Pancrustacea show a notable lack of stability. Here, the phylogeny is estimated with expanded taxon sampling, particularly of malacostracans. We show small changes in taxon sampling have large impacts on phylogenetic estimation. By analyzing identical orthologs between two slightly different taxon sets, we show that the differences in the resulting topologies are due primarily to the effects of taxon sampling on the phylogenetic reconstruction method. We compare trees resulting from our phylogenomic analyses with those from the literature to explore the large tree space of pancrustacean phylogenetic hypotheses and find that statistical topology tests reject the previously published trees in favor of the maximum likelihood trees produced here. Our results reject several clades including Caridoida, Eucarida, Multicrustacea, Vericrustacea, and Syncarida. Notably, we find Copepoda nested within Allotriocarida with high support and recover a novel relationship between decapods, euphausiids, and syncarids that we refer to as the Syneucarida. With denser taxon sampling, we find Stomatopoda sister to this latter clade, which we collectively name Stomatocarida, dividing Malacostraca into three clades: Leptostraca, Peracarida, and Stomatocarida. A new Bayesian divergence time estimation is conducted using 13 vetted fossils. We review our results in the context of other pancrustacean phylogenetic hypotheses and highlight 15 key taxa to sample in future studies.
Collapse
Affiliation(s)
- James P Bernot
- Department of Invertebrate Zoology, US National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | - Christopher L Owen
- Systematic Entomology Laboratory, USDA-ARS, ℅ National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Joanna M Wolfe
- Museum of Comparative Zoology and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Kenneth Meland
- Department of Biology, University of Bergen, Bergen, Norway
| | - Jørgen Olesen
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Keith A Crandall
- Department of Invertebrate Zoology, US National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, George Washington University, Washington, DC, USA
| |
Collapse
|
13
|
Liu B, Warnow T. Weighted ASTRID: fast and accurate species trees from weighted internode distances. Algorithms Mol Biol 2023; 18:6. [PMID: 37468904 PMCID: PMC10355063 DOI: 10.1186/s13015-023-00230-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 06/10/2023] [Indexed: 07/21/2023] Open
Abstract
BACKGROUND Species tree estimation is a basic step in many biological research projects, but is complicated by the fact that gene trees can differ from the species tree due to processes such as incomplete lineage sorting (ILS), gene duplication and loss (GDL), and horizontal gene transfer (HGT), which can cause different regions within the genome to have different evolutionary histories (i.e., "gene tree heterogeneity"). One approach to estimating species trees in the presence of gene tree heterogeneity resulting from ILS operates by computing trees on each genomic region (i.e., computing "gene trees") and then using these gene trees to define a matrix of average internode distances, where the internode distance in a tree T between two species x and y is the number of nodes in T between the leaves corresponding to x and y. Given such a matrix, a tree can then be computed using methods such as neighbor joining. Methods such as ASTRID and NJst (which use this basic approach) are provably statistically consistent, very fast (low degree polynomial time) and have had high accuracy under many conditions that makes them competitive with other popular species tree estimation methods. In this study, inspired by the very recent work of weighted ASTRAL, we present weighted ASTRID, a variant of ASTRID that takes the branch uncertainty on the gene trees into account in the internode distance. RESULTS Our experimental study evaluating weighted ASTRID typically shows improvements in accuracy compared to the original (unweighted) ASTRID, and shows competitive accuracy against weighted ASTRAL, the state of the art. Our re-implementation of ASTRID also improves the runtime, with marked improvements on large datasets. CONCLUSIONS Weighted ASTRID is a new and very fast method for species tree estimation that typically improves upon ASTRID and has comparable accuracy to weighted ASTRAL, while remaining much faster. Weighted ASTRID is available at https://github.com/RuneBlaze/internode .
Collapse
Affiliation(s)
- Baqiao Liu
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL USA
| |
Collapse
|
14
|
Mongiardino Koch N, Tilic E, Miller AK, Stiller J, Rouse GW. Confusion will be my epitaph: genome-scale discordance stifles phylogenetic resolution of Holothuroidea. Proc Biol Sci 2023; 290:20230988. [PMID: 37434530 PMCID: PMC10336381 DOI: 10.1098/rspb.2023.0988] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 06/12/2023] [Indexed: 07/13/2023] Open
Abstract
Sea cucumbers (Holothuroidea) are a diverse clade of echinoderms found from intertidal waters to the bottom of the deepest oceanic trenches. Their reduced skeletons and limited number of phylogenetically informative traits have long obfuscated morphological classifications. Sanger-sequenced molecular datasets have also failed to constrain the position of major lineages. Noteworthy, topological uncertainty has hindered a resolution for Neoholothuriida, a highly diverse clade of Permo-Triassic age. We perform the first phylogenomic analysis of Holothuroidea, combining existing datasets with 13 novel transcriptomes. Using a highly curated dataset of 1100 orthologues, our efforts recapitulate previous results, struggling to resolve interrelationships among neoholothuriid clades. Three approaches to phylogenetic reconstruction (concatenation under both site-homogeneous and site-heterogeneous models, and coalescent-aware inference) result in alternative resolutions, all of which are recovered with strong support and across a range of datasets filtered for phylogenetic usefulness. We explore this intriguing result using gene-wise log-likelihood scores and attempt to correlate these with a large set of gene properties. While presenting novel ways of exploring and visualizing support for alternative trees, we are unable to discover significant predictors of topological preference, and our efforts fail to favour one topology. Neoholothuriid genomes seem to retain an amalgam of signals derived from multiple phylogenetic histories.
Collapse
Affiliation(s)
| | - Ekin Tilic
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
- Department of Marine Zoology, Senckenberg Research Institute and Museum, Frankfurt, Germany
| | - Allison K. Miller
- Anatomy Department, University of Otago, Dunedin, Otago, New Zealand
| | - Josefin Stiller
- Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Greg W. Rouse
- Scripps Institution of Oceanography, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
15
|
Fleming JF, Valero‐Gracia A, Struck TH. Identifying and addressing methodological incongruence in phylogenomics: A review. Evol Appl 2023; 16:1087-1104. [PMID: 37360032 PMCID: PMC10286231 DOI: 10.1111/eva.13565] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 04/07/2023] [Accepted: 05/17/2023] [Indexed: 06/28/2023] Open
Abstract
The availability of phylogenetic data has greatly expanded in recent years. As a result, a new era in phylogenetic analysis is dawning-one in which the methods we use to analyse and assess our data are the bottleneck to producing valuable phylogenetic hypotheses, rather than the need to acquire more data. This makes the ability to accurately appraise and evaluate new methods of phylogenetic analysis and phylogenetic artefact identification more important than ever. Incongruence in phylogenetic reconstructions based on different datasets may be due to two major sources: biological and methodological. Biological sources comprise processes like horizontal gene transfer, hybridization and incomplete lineage sorting, while methodological ones contain falsely assigned data or violations of the assumptions of the underlying model. While the former provides interesting insights into the evolutionary history of the investigated groups, the latter should be avoided or minimized as best as possible. However, errors introduced by methodology must first be excluded or minimized to be able to conclude that biological sources are the cause. Fortunately, a variety of useful tools exist to help detect such misassignments and model violations and to apply ameliorating measurements. Still, the number of methods and their theoretical underpinning can be overwhelming and opaque. Here, we present a practical and comprehensive review of recent developments in techniques to detect artefacts arising from model violations and poorly assigned data. The advantages and disadvantages of the different methods to detect such misleading signals in phylogenetic reconstructions are also discussed. As there is no one-size-fits-all solution, this review can serve as a guide in choosing the most appropriate detection methods depending on both the actual dataset and the computational power available to the researcher. Ultimately, this informed selection will have a positive impact on the broader field, allowing us to better understand the evolutionary history of the group of interest.
Collapse
|
16
|
DeSalle R, Narechania A, Tessler M. Multiple Outgroups Can Cause Random Rooting in Phylogenomics. Mol Phylogenet Evol 2023; 184:107806. [PMID: 37172862 DOI: 10.1016/j.ympev.2023.107806] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 02/06/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]
Abstract
Outgroup selection has been a major challenge since the rise of phylogenetics, and it has remained so in the phylogenomic era. Our goal here is to use large phylogenomic animal datasets to examine the impact of outgroup selection on the final topology. The results of our analyses further solidify the fact that distant outgroups can cause random rooting, and that this holds for concatenated and coalescent-based methods. The results also indicate that the standard practice of using multiple outgroups often causes random rooting. Most researchers go out of their way to get multiple outgroups, as this has been standard practice for decades. Based on our findings, this practice should stop. Instead, our results suggest that a single (most closely) related relative should be selected as the outgroup, unless all outgroups are roughly equally closely related to the ingroup.
Collapse
Affiliation(s)
- Rob DeSalle
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA; Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Apurva Narechania
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| | - Michael Tessler
- Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA; Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA; St. Francis College, Department of Biology, Brooklyn, NY 11201, USA
| |
Collapse
|
17
|
Raiyemo DA, Tranel PJ. Comparative analysis of dioecious Amaranthus plastomes and phylogenomic implications within Amaranthaceae s.s. BMC Ecol Evol 2023; 23:15. [PMID: 37149567 PMCID: PMC10164334 DOI: 10.1186/s12862-023-02121-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 04/28/2023] [Indexed: 05/08/2023] Open
Abstract
BACKGROUND The genus Amaranthus L. consists of 70-80 species distributed across temperate and tropical regions of the world. Nine species are dioecious and native to North America; two of which are agronomically important weeds of row crops. The genus has been described as taxonomically challenging and relationships among species including the dioecious ones are poorly understood. In this study, we investigated the phylogenetic relationships among the dioecious amaranths and sought to gain insights into plastid tree incongruence. A total of 19 Amaranthus species' complete plastomes were analyzed. Among these, seven dioecious Amaranthus plastomes were newly sequenced and assembled, an additional two were assembled from previously published short reads sequences and 10 other plastomes were obtained from a public repository (GenBank). RESULTS Comparative analysis of the dioecious Amaranthus species' plastomes revealed sizes ranged from 150,011 to 150,735 bp and consisted of 112 unique genes (78 protein-coding genes, 30 transfer RNAs and 4 ribosomal RNAs). Maximum likelihood trees, Bayesian inference trees and splits graphs support the monophyly of subgenera Acnida (7 dioecious species) and Amaranthus; however, the relationship of A. australis and A. cannabinus to the other dioecious species in Acnida could not be established, as it appears a chloroplast capture occurred from the lineage leading to the Acnida + Amaranthus clades. Our results also revealed intraplastome conflict at some tree branches that were in some cases alleviated with the use of whole chloroplast genome alignment, indicating non-coding regions contribute valuable phylogenetic signals toward shallow relationship resolution. Furthermore, we report a very low evolutionary distance between A. palmeri and A. watsonii, indicating that these two species are more genetically related than previously reported. CONCLUSIONS Our study provides valuable plastome resources as well as a framework for further evolutionary analyses of the entire Amaranthus genus as more species are sequenced.
Collapse
Affiliation(s)
- Damilola A Raiyemo
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Patrick J Tranel
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA.
| |
Collapse
|
18
|
Nie ZL, Hodel R, Ma ZY, Johnson G, Ren C, Meng Y, Ickert-Bond SM, Liu XQ, Zimmer E, Wen J. Climate-influenced boreotropical survival and rampant introgressions explain the thriving of New World grapes in the north temperate zone. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023; 65:1183-1203. [PMID: 36772845 DOI: 10.1111/jipb.13466] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 02/08/2023] [Indexed: 05/13/2023]
Abstract
The north temperate region was characterized by a warm climate and a rich thermophilic flora before the Eocene, but early diversifications of the temperate biome under global climate change and biome shift remain uncertain. Moreover, it is becoming clear that hybridization/introgression is an important driving force of speciation in plant diversity. Here, we applied analyses from biogeography and phylogenetic networks to account for both introgression and incomplete lineage sorting based on genomic data from the New World Vitis, a charismatic component of the temperate North American flora with known and suspected gene flow among species. Biogeographic inference and fossil evidence suggest that the grapes were widely distributed from North America to Europe during the Paleocene to the Eocene, followed by widespread extinction and survival of relicts in the tropical New World. During the climate warming in the early Miocene, a Vitis ancestor migrated northward from the refugia with subsequent diversification in the North American region. We found strong evidence for widespread incongruence and reticulate evolution among nuclear genes within both recent and ancient lineages of the New World Vitis. Furthermore, the organellar genomes showed strong conflicts with the inferred species tree from the nuclear genomes. Our phylogenomic analyses provided an important assessment of the wide occurrence of reticulate introgression in the New World Vitis, which potentially represents one of the most important mechanisms for the diversification of Vitis species in temperate North America and even the entire temperate Northern Hemisphere. The scenario we report here may be a common model of temperate diversification of flowering plants adapted to the global climate cooling and fluctuation in the Neogene.
Collapse
Affiliation(s)
- Ze-Long Nie
- College of Biology and Environmental Sciences, Jishou University, Jishou, 416000, China
| | - Richard Hodel
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, 20013-7012, USA
| | - Zhi-Yao Ma
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, 20013-7012, USA
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518000, China
| | - Gabriel Johnson
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, 20013-7012, USA
| | - Chen Ren
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China
| | - Ying Meng
- College of Biology and Environmental Sciences, Jishou University, Jishou, 416000, China
| | - Stefanie M Ickert-Bond
- Herbarium (ALA), University of Alaska Museum of the North, University of Alaska Fairbanks, Fairbanks, AK, 99775, USA
- Department of Biology and Wildlife, University of Alaska Fairbanks, Fairbanks, AK, 99775, USA
| | - Xiu-Qun Liu
- Key Laboratory of Horticultural Plant Biology, College of Horticulture and Forestry Science, Huazhong Agricultural University, Wuhan, 430070, China
| | - Elizabeth Zimmer
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, 20013-7012, USA
| | - Jun Wen
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, 20013-7012, USA
| |
Collapse
|
19
|
Yi H, Dong S, Yang L, Wang J, Kidner C, Kang M. Genome-wide data reveal cryptic diversity and hybridization in a group of tree ferns. Mol Phylogenet Evol 2023; 184:107801. [PMID: 37088242 DOI: 10.1016/j.ympev.2023.107801] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 04/07/2023] [Accepted: 04/18/2023] [Indexed: 04/25/2023]
Abstract
Discovery of cryptic diversity is essential to understanding both the process of speciation and the conservation of species. Determining species boundaries in fern lineages represents a major challenge due to lack of morphologically diagnostic characters and frequent hybridization. Genomic data has substantially enhanced our understanding of the speciation process, increased the resolution of species delimitation studies, and led to the discovery of cryptic diversity. Here, we employed restriction-site-associated DNA sequencing (RAD-seq) and integrated phylogenomic and population genomic analyses to investigate phylogenetic relationships and evolutionary history of 16 tree ferns with marginate scales (Cyatheaceae) from China and Vietnam. We conducted multiple species delimitation analyses using the multispecies coalescent (MSC) model and novel approaches based on genealogical divergence index (gdi) and isolation by distance (IBD). In addition, we inferred species trees using concatenation and several coalescent-based methods, and assessed hybridization patterns and rate of gene flow across the phylogeny. We obtained highly supported and generally congruent phylogenies inferred from concatenated and summary-coalescent methods, and the monophyly of all currently recognized species were strongly supported. Our results revealed substantial evidence of cryptic diversity in three widely distributed Gymnosphaera species, each of which was composite of two highly structure lineages that may correspond to cryptic species. We found that hybridization was fairly common between not only closely related species, but also distantly related species. Collectively, it appears that scaly tree ferns may contain cryptic diversity and hybridization has played an important role throughout the evolutionary history of this group.
Collapse
Affiliation(s)
- Huiqin Yi
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; South China National Botanical Garden, Guangzhou 510650, China
| | - Shiying Dong
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; South China National Botanical Garden, Guangzhou 510650, China
| | - Lihua Yang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; South China National Botanical Garden, Guangzhou 510650, China
| | - Jing Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; South China National Botanical Garden, Guangzhou 510650, China
| | - Catherine Kidner
- Institute of Molecular Plant Sciences, University of Edinburgh, Daniel Rutherford Building Max Born Crescent, The King's Buildings, Edinburgh EH9 3BF, UK; Royal Botanic Garden Edinburgh, 20a Inverleith Row, Edinburgh EH3 5LR, UK
| | - Ming Kang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; South China National Botanical Garden, Guangzhou 510650, China.
| |
Collapse
|
20
|
Das S, Greenbaum E, Meiri S, Bauer AM, Burbrink FT, Raxworthy CJ, Weinell JL, Brown RM, Brecko J, Pauwels OSG, Rabibisoa N, Raselimanana AP, Merilä J. Ultraconserved elements-based phylogenomic systematics of the snake superfamily Elapoidea, with the description of a new Afro-Asian family. Mol Phylogenet Evol 2023; 180:107700. [PMID: 36603697 DOI: 10.1016/j.ympev.2022.107700] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2022] [Revised: 12/27/2022] [Accepted: 12/29/2022] [Indexed: 01/04/2023]
Abstract
The highly diverse snake superfamily Elapoidea is considered to be a classic example of ancient, rapid radiation. Such radiations are challenging to fully resolve phylogenetically, with the highly diverse Elapoidea a case in point. Previous attempts at inferring a phylogeny of elapoids produced highly incongruent estimates of their evolutionary relationships, often with very low statistical support. We sought to resolve this situation by sequencing over 4,500 ultraconserved element loci from multiple representatives of every elapoid family/subfamily level taxon and inferring their phylogenetic relationships with multiple methods. Concatenation and multispecies coalescent based species trees yielded largely congruent and well-supported topologies. Hypotheses of a hard polytomy were not retained for any deep branches. Our phylogenies recovered Cyclocoridae and Elapidae as diverging early within Elapoidea. The Afro-Malagasy radiation of elapoid snakes, classified as multiple subfamilies of an inclusive Lamprophiidae by some earlier authors, was found to be monophyletic in all analyses. The genus Micrelaps was consistently recovered as sister to Lamprophiidae. We establish a new family, Micrelapidae fam. nov., for Micrelaps and assign Brachyophis to this family based on cranial osteological synapomorphy. We estimate that Elapoidea originated in the early Eocene and rapidly diversified into all the major lineages during this epoch. Ecological opportunities presented by the post-Cretaceous-Paleogene mass extinction event may have promoted the explosive radiation of elapoid snakes.
Collapse
Affiliation(s)
- Sunandan Das
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Finland.
| | - Eli Greenbaum
- Department of Biological Sciences, University of Texas at El Paso, 500 W. University Avenue, El Paso, TX 79968, USA
| | - Shai Meiri
- School of Zoology, Tel Aviv University, Tel Aviv, Israel; The Steinhardt Museum of Natural History, Tel Aviv University, Tel Aviv, Israel
| | - Aaron M Bauer
- Department of Biology and Center for Biodiversity and Ecosystem Stewardship, Villanova University, 800 Lancaster Avenue, Villanova, PA 19085, USA
| | - Frank T Burbrink
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA
| | - Christopher J Raxworthy
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA
| | - Jeffrey L Weinell
- Department of Herpetology, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5192, USA; Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| | - Rafe M Brown
- Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| | - Jonathan Brecko
- Royal Belgian Institute of Natural Sciences, Rue Vautier 29, B-1000 Brussels, Belgium; Royal Museum for Central Africa, Tervuren, Belgium
| | - Olivier S G Pauwels
- Royal Belgian Institute of Natural Sciences, Rue Vautier 29, B-1000 Brussels, Belgium
| | - Nirhy Rabibisoa
- Sciences de la Vie et de l'Environnement, Faculté des Sciences, de Technologies et de l'Environnement, Université de Mahajanga, Campus Universitaire d'Ambondrona, BP 652, Mahajanga 401, Madagascar
| | - Achille P Raselimanana
- Zoologie et Biodiversité Animale, Faculté des Sciences, Université d'Antananarivo, BP 906, Antananarivo 101, Madagascar
| | - Juha Merilä
- Ecological Genetics Research Unit, Organismal and Evolutionary Biology Research Programme, Faculty of Biological and Environmental Sciences, FI-00014 University of Helsinki, Finland; Area of Ecology and Biodiversity, School of Biological Sciences, Kadoorie Biological Sciences Building, Pokfulam Road, The University of Hong Kong, Hong Kong Special Administrative Region
| |
Collapse
|
21
|
Armstrong EE, Campana MG, Solari KA, Morgan SR, Ryder OA, Naude VN, Samelius G, Sharma K, Hadly EA, Petrov DA. Genome report: chromosome-level draft assemblies of the snow leopard, African leopard, and tiger (Panthera uncia, Panthera pardus pardus, and Panthera tigris). G3 (BETHESDA, MD.) 2022; 12:jkac277. [PMID: 36250809 PMCID: PMC9713438 DOI: 10.1093/g3journal/jkac277] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 09/14/2022] [Indexed: 04/07/2024]
Abstract
The big cats (genus Panthera) represent some of the most popular and charismatic species on the planet. Although some reference genomes are available for this clade, few are at the chromosome level, inhibiting high-resolution genomic studies. We assembled genomes from 3 members of the genus, the tiger (Panthera tigris), the snow leopard (Panthera uncia), and the African leopard (Panthera pardus pardus), at chromosome or near-chromosome level. We used a combination of short- and long-read technologies, as well as proximity ligation data from Hi-C technology, to achieve high continuity and contiguity for each individual. We hope that these genomes will aid in further evolutionary and conservation research of this iconic group of mammals.
Collapse
Affiliation(s)
- Ellie E Armstrong
- Department of Biology, Stanford University, Stanford, CA 94305, USA
- Department of Biology, Washington State University, Pullman, WA 99164, USA
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian’s National Zoological Park and Conservation Biology Institute, Washington, DC 20008, USA
| | | | - Simon R Morgan
- Department of Biology, Stanford University, Stanford, CA 94305, USA
- Wildlife ACT Fund Trust, Cape Town 8001, South Africa
| | - Oliver A Ryder
- San Diego Zoo Wildlife Alliance, Beckman Center for Conservation Research, San Diego, CA 92027, USA
| | - Vincent N Naude
- Department of Conservation Ecology and Entomology, University of Stellenbosch, Stellenbosch, 7602, South Africa
- School of Animal, Plant and Environmental Sciences, University of the Witwatersrand, Johannesburg 2000, South Africa
| | | | - Koustubh Sharma
- Snow Leopard Trust, Seattle, WA 98103, USA
- Nature Conservation Foundation, Mysore 570 017, India
| | | | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
22
|
Glässnerová K, Sklenář F, Jurjević Ž, Houbraken J, Yaguchi T, Visagie C, Gené J, Siqueira J, Kubátová A, Kolařík M, Hubka V. A monograph of Aspergillus section Candidi. Stud Mycol 2022; 102:1-51. [PMID: 36760463 PMCID: PMC9903906 DOI: 10.3114/sim.2022.102.01] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Accepted: 08/03/2022] [Indexed: 01/09/2023] Open
Abstract
Aspergillus section Candidi encompasses white- or yellow-sporulating species mostly isolated from indoor and cave environments, food, feed, clinical material, soil and dung. Their identification is non-trivial due to largely uniform morphology. This study aims to re-evaluate the species boundaries in the section Candidi and present an overview of all existing species along with information on their ecology. For the analyses, we assembled a set of 113 strains with diverse origin. For the molecular analyses, we used DNA sequences of three house-keeping genes (benA, CaM and RPB2) and employed species delimitation methods based on a multispecies coalescent model. Classical phylogenetic methods and genealogical concordance phylogenetic species recognition (GCPSR) approaches were used for comparison. Phenotypic studies involved comparisons of macromorphology on four cultivation media, seven micromorphological characters and growth at temperatures ranging from 10 to 45 °C. Based on the integrative approach comprising four criteria (phylogenetic and phenotypic), all currently accepted species gained support, while two new species are proposed (A. magnus and A. tenebricus). In addition, we proposed the new name A. neotritici to replace an invalidly described A. tritici. The revised section Candidi now encompasses nine species, some of which manifest a high level of intraspecific genetic and/or phenotypic variability (e.g., A. subalbidus and A. campestris) while others are more uniform (e.g., A. candidus or A. pragensis). The growth rates on different media and at different temperatures, colony colours, production of soluble pigments, stipe dimensions and vesicle diameters contributed the most to the phenotypic species differentiation. Taxonomic novelties: New species: Aspergillus magnus Glässnerová & Hubka; Aspergillus neotritici Glässnerová & Hubka; Aspergillus tenebricus Houbraken, Glässnerová & Hubka. Citation: Glässnerová K, Sklenář F, Jurjević Ž, Houbraken J, Yaguchi T, Visagie CM, Gené J, Siqueira JPZ, Kubátová A, Kolařík M, Hubka V (2022). A monograph of Aspergillus section Candidi. Studies in Mycology 102: 1-51. doi: 10.3114/sim.2022.102.01.
Collapse
Affiliation(s)
- K. Glässnerová
- Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
| | - F. Sklenář
- Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
- Laboratory of Fungal Genetics and Metabolism, Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - Ž. Jurjević
- EMSL Analytical, Cinnaminson, New Jersey, USA
| | - J. Houbraken
- Westerdijk Fungal Biodiversity Institute, Utrecht, The Netherlands
| | - T. Yaguchi
- Medical Mycology Research Center, Chiba University, Chuo-ku, Chiba, Japan
| | - C.M. Visagie
- Department of Biochemistry, Genetics and Microbiology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - J. Gené
- Unitat de Micologia, Facultat de Medicina i Ciències de la Salut, IISPV, Universitat Rovira i Virgili, Reus, Spain
| | - J.P.Z. Siqueira
- Unitat de Micologia, Facultat de Medicina i Ciències de la Salut, IISPV, Universitat Rovira i Virgili, Reus, Spain
- Laboratório de Microbiologia, Faculdade de Medicina de São José do Rio Preto, São José do Rio Preto, Brazil
| | - A. Kubátová
- Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
| | - M. Kolařík
- Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
- Laboratory of Fungal Genetics and Metabolism, Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
| | - V. Hubka
- Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
- Laboratory of Fungal Genetics and Metabolism, Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
- Medical Mycology Research Center, Chiba University, Chuo-ku, Chiba, Japan
| |
Collapse
|
23
|
Rachtman E, Sarmashghi S, Bafna V, Mirarab S. Quantifying the uncertainty of assembly-free genome-wide distance estimates and phylogenetic relationships using subsampling. Cell Syst 2022; 13:817-829.e3. [PMID: 36265468 PMCID: PMC9589918 DOI: 10.1016/j.cels.2022.06.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 03/14/2022] [Accepted: 06/28/2022] [Indexed: 01/26/2023]
Abstract
Computing distance between two genomes without alignments or even access to assemblies has many downstream analyses. However, alignment-free methods, including in the fast-growing field of genome skimming, are hampered by a significant methodological gap. While accurate methods (many k-mer-based) for assembly-free distance calculation exist, measuring the uncertainty of estimated distances has not been sufficiently studied. In this paper, we show that bootstrapping, the standard non-parametric method of measuring estimator uncertainty, is not accurate for k-mer-based methods that rely on k-mer frequency profiles. Instead, we propose using subsampling (with no replacement) in combination with a correction step to reduce the variance of the inferred distribution. We show that the distribution of distances using our procedure matches the true uncertainty of the estimator. The resulting phylogenetic support values effectively differentiate between correct and incorrect branches and identify controversial branches that change across alignment-free and alignment-based phylogenies reported in the literature.
Collapse
Affiliation(s)
- Eleonora Rachtman
- Bioinformatics and Systems Biology Graduate Program, UC San Diego, San Diego, CA 92093, USA
| | - Shahab Sarmashghi
- Department of Electrical and Computer Engineering, UC San Diego, San Diego, CA 92093, USA
| | - Vineet Bafna
- Department of Computer Science and Engineering, UC San Diego, San Diego, CA 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, UC San Diego, San Diego, CA 92093, USA.
| |
Collapse
|
24
|
Zhang C, Mirarab S. Weighting by Gene Tree Uncertainty Improves Accuracy of Quartet-based Species Trees. Mol Biol Evol 2022; 39:6750035. [PMID: 36201617 PMCID: PMC9750496 DOI: 10.1093/molbev/msac215] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 09/20/2022] [Accepted: 10/03/2022] [Indexed: 01/07/2023] Open
Abstract
Phylogenomic analyses routinely estimate species trees using methods that account for gene tree discordance. However, the most scalable species tree inference methods, which summarize independently inferred gene trees to obtain a species tree, are sensitive to hard-to-avoid errors introduced in the gene tree estimation step. This dilemma has created much debate on the merits of concatenation versus summary methods and practical obstacles to using summary methods more widely and to the exclusion of concatenation. The most successful attempt at making summary methods resilient to noisy gene trees has been contracting low support branches from the gene trees. Unfortunately, this approach requires arbitrary thresholds and poses new challenges. Here, we introduce threshold-free weighting schemes for the quartet-based species tree inference, the metric used in the popular method ASTRAL. By reducing the impact of quartets with low support or long terminal branches (or both), weighting provides stronger theoretical guarantees and better empirical performance than the unweighted ASTRAL. Our simulations show that weighting improves accuracy across many conditions and reduces the gap with concatenation in conditions with low gene tree discordance and high noise. On empirical data, weighting improves congruence with concatenation and increases support. Together, our results show that weighting, enabled by a new optimization algorithm we introduce, improves the utility of summary methods and can reduce the incongruence often observed across analytical pipelines.
Collapse
Affiliation(s)
- Chao Zhang
- Bioinformatics and Systems Biology, UC San Diego, La Jolla, CA, USA
| | | |
Collapse
|
25
|
Liu D, Niu M, Lu Y, Wei J, Zhang H. Taxon-specific ultraconserved element probe design for phylogenetic analyses of scale insects (Hemiptera: Sternorrhyncha: Coccoidea). Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.984396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Scale insects (Coccoidea) are morphologically specialized members of the order Hemiptera, with 56 families recognized to date. However, the phylogenetic relationships within and among families are poorly resolved. In this study, to further characterize the phylogenetic relationships among scale insects, an ultraconserved element (UCE) probe set was designed specifically for Coccoidea based on three low-coverage whole genome sequences along with three publicly available genomes. An in silico test including eight additional genomes was performed to evaluate the effectiveness of the probe set. Most scale insect lineages were recovered by the phylogenetic analysis. This study recovered the monophyly of neococcoids. The newly developed UCE probe set has the potential to reshape and improve our understanding of the phylogenetic relationships within and among families of scale insects at the genome level.
Collapse
|
26
|
Thureborn O, Razafimandimbison SG, Wikström N, Rydin C. Target capture data resolve recalcitrant relationships in the coffee family (Rubioideae, Rubiaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:967456. [PMID: 36160958 PMCID: PMC9493367 DOI: 10.3389/fpls.2022.967456] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Accepted: 08/03/2022] [Indexed: 06/16/2023]
Abstract
Subfamily Rubioideae is the largest of the main lineages in the coffee family (Rubiaceae), with over 8,000 species and 29 tribes. Phylogenetic relationships among tribes and other major clades within this group of plants are still only partly resolved despite considerable efforts. While previous studies have mainly utilized data from the organellar genomes and nuclear ribosomal DNA, we here use a large number of low-copy nuclear genes obtained via a target capture approach to infer phylogenetic relationships within Rubioideae. We included 101 Rubioideae species representing all but two (the monogeneric tribes Foonchewieae and Aitchinsonieae) of the currently recognized tribes, and all but one non-monogeneric tribe were represented by more than one genus. Using data from the 353 genes targeted with the universal Angiosperms353 probe set we investigated the impact of data type, analytical approach, and potential paralogs on phylogenetic reconstruction. We inferred a robust phylogenetic hypothesis of Rubioideae with the vast majority (or all) nodes being highly supported across all analyses and datasets and few incongruences between the inferred topologies. The results were similar to those of previous studies but novel relationships were also identified. We found that supercontigs [coding sequence (CDS) + non-coding sequence] clearly outperformed CDS data in levels of support and gene tree congruence. The full datasets (353 genes) outperformed the datasets with potentially paralogous genes removed (186 genes) in levels of support but increased gene tree incongruence slightly. The pattern of gene tree conflict at short internal branches were often consistent with high levels of incomplete lineage sorting (ILS) due to rapid speciation in the group. While concatenation- and coalescence-based trees mainly agreed, the observed phylogenetic discordance between the two approaches may be best explained by their differences in accounting for ILS. The use of target capture data greatly improved our confidence and understanding of the Rubioideae phylogeny, highlighted by the increased support for previously uncertain relationships and the increased possibility to explore sources of underlying phylogenetic discordance.
Collapse
Affiliation(s)
- Olle Thureborn
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
| | | | - Niklas Wikström
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
- Bergius Foundation, Royal Swedish Academy of Sciences, Stockholm, Sweden
| | - Catarina Rydin
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
- Bergius Foundation, Royal Swedish Academy of Sciences, Stockholm, Sweden
| |
Collapse
|
27
|
Herrera ND, Bell KC, Callahan CM, Nordquist E, Sarver BAJ, Sullivan J, Demboski JR, Good JM. Genomic resolution of cryptic species diversity in chipmunks. Evolution 2022; 76:2004-2019. [PMID: 35778920 DOI: 10.1111/evo.14546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 06/01/2022] [Accepted: 06/06/2022] [Indexed: 01/22/2023]
Abstract
Discovery of cryptic species is essential to understand the process of speciation and assessing the impacts of anthropogenic stressors. Here, we used genomic data to test for cryptic species diversity within an ecologically well-known radiation of North American rodents, western chipmunks (Tamias). We assembled a de novo reference genome for a single species (Tamias minimus) combined with new and published targeted sequence-capture data for 21,551 autosomal and 493 X-linked loci sampled from 121 individuals spanning 22 species. We identified at least two cryptic lineages corresponding with an isolated subspecies of least chipmunk (T. minimus grisescens) and with a restricted subspecies of the yellow-pine chipmunk (Tamias amoenus cratericus) known only from around the extensive Craters of the Moon lava flow. Additional population-level sequence data revealed that the so-called Crater chipmunk is a distinct species that is abundant throughout the coniferous forests of southern Idaho. This cryptic lineage does not appear to be most closely related to the ecologically and phenotypically similar yellow-pine chipmunk but does show evidence for recurrent hybridization with this and other species.
Collapse
Affiliation(s)
- Nathanael D Herrera
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Kayce C Bell
- Natural History Museum of Los Angeles County, Los Angeles, California, USA
| | - Colin M Callahan
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Erin Nordquist
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Brice A J Sarver
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Jack Sullivan
- Department of Biological Sciences, University of Idaho, Moscow, Idaho, USA.,Institute for Bioinformatics and Evolutionary Studies (IBEST), University of Idaho, Moscow, Idaho, USA
| | - John R Demboski
- Department of Zoology, Denver Museum of Nature & Sciences, Denver, Colorado, USA
| | - Jeffrey M Good
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA.,Wildlife Biology Program, University of Montana, Missoula, Montana, USA
| |
Collapse
|
28
|
Černý D, Natale R. Comprehensive taxon sampling and vetted fossils help clarify the time tree of shorebirds (Aves, Charadriiformes). Mol Phylogenet Evol 2022; 177:107620. [PMID: 36038056 DOI: 10.1016/j.ympev.2022.107620] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 06/03/2022] [Accepted: 08/17/2022] [Indexed: 01/20/2023]
Abstract
Shorebirds (Charadriiformes) are a globally distributed clade of modern birds and, due to their ecological and morphological disparity, a frequent subject of comparative studies. While molecular phylogenies have been key to establishing the suprafamilial backbone of the charadriiform tree, a number of relationships at both deep and shallow taxonomic levels remain poorly resolved. The timescale of shorebird evolution also remains uncertain as a result of extensive disagreements among the published divergence dating studies, stemming largely from different choices of fossil calibrations. Here, we present the most comprehensive non-supertree phylogeny of shorebirds to date, based on a total-evidence dataset comprising 353 ingroup taxa (90% of all extant or recently extinct species), 27 loci (15 mitochondrial and 12 nuclear), and 69 morphological characters. We further clarify the timeline of charadriiform evolution by time-scaling this phylogeny using a set of 14 up-to-date and thoroughly vetted fossil calibrations. In addition, we assemble a taxonomically restricted 100-locus dataset specifically designed to resolve outstanding problems in higher-level charadriiform phylogeny. In terms of tree topology, our results are largely congruent with previous studies but indicate that some of the conflicts among earlier analyses reflect a genuine signal of pervasive gene tree discordance. Monophyly of the plovers (Charadriidae), the position of the ibisbill (Ibidorhyncha), and the relationships among the five subfamilies of the gulls (Laridae) could not be resolved even with greatly increased locus and taxon sampling. Moreover, several localized regions of uncertainty persist in shallower parts of the tree, including the interrelationships of the true auks (Alcinae) and anarhynchine plovers. Our node-dating and macroevolutionary rate analyses find support for a Paleocene origin of crown-group shorebirds, as well as exceptionally rapid recent radiations of Old World oystercatchers (Haematopodidae) and select genera of gulls. Our study underscores the challenges involved in estimating a comprehensively sampled and carefully calibrated time tree for a diverse avian clade, and highlights areas in need of further research.
Collapse
Affiliation(s)
- David Černý
- Department of the Geophysical Sciences, University of Chicago, Chicago 60637, USA.
| | - Rossy Natale
- Department of Organismal Biology & Anatomy, University of Chicago, Chicago 60637, USA
| |
Collapse
|
29
|
Smith BT, Merwin J, Provost KL, Thom G, Brumfield RT, Ferreira M, Mauck Iii WM, Moyle RG, Wright T, Joseph L. Phylogenomic analysis of the parrots of the world distinguishes artifactual from biological sources of gene tree discordance. Syst Biol 2022; 72:228-241. [PMID: 35916751 DOI: 10.1093/sysbio/syac055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 02/22/2022] [Accepted: 07/22/2022] [Indexed: 11/14/2022] Open
Abstract
Gene tree discordance is expected in phylogenomic trees and biological processes are often invoked to explain it. However, heterogeneous levels of phylogenetic signal among individuals within datasets may cause artifactual sources of topological discordance. We examined how the information content in tips and subclades impacts topological discordance in the parrots (Order: Psittaciformes), a diverse and highly threatened clade of nearly 400 species. Using ultraconserved elements from 96% of the clade's species-level diversity, we estimated concatenated and species trees for 382 ingroup taxa. We found that discordance among tree topologies was most common at nodes dating between the late Miocene and Pliocene, and often at the taxonomic level of genus. Accordingly, we used two metrics to characterize information content in tips and assess the degree to which conflict between trees was being driven by lower quality samples. Most instances of topological conflict and non-monophyletic genera in the species tree could be objectively identified using these metrics. For subclades still discordant after tip-based filtering, we used a machine learning approach to determine whether phylogenetic signal or noise was the more important predictor of metrics supporting the alternative topologies. We found that when signal favored one of the topologies, noise was the most important variable in poorly performing models that favored the alternative topology. In sum, we show that artifactual sources of gene tree discordance, which are likely a common phenomenon in many datasets, can be distinguished from biological sources by quantifying the information content in each tip and modeling which factors support each topology.
Collapse
Affiliation(s)
- Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA
| | - Jon Merwin
- Department of Ornithology, Academy of Natural Sciences of Drexel University, 1900 Benjamin Franklin Parkway, Philadelphia, PA 19103, USA.,Department of Biodiversity, Earth, and Environmental Science, Drexel University, Philadelphia, PA 19103, USA
| | - Kaiya L Provost
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, 318 W. 12th Avenue, Columbus, OH 43210, USA
| | - Gregory Thom
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Robb T Brumfield
- Museum of Natural Science and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Mateus Ferreira
- Centro de Estudos da Biodiversidade, Universidade Federal de Roraima, Av. Cap. Ene Garcez, 2413, Boa Vista, RR, Brazil
| | - William M Mauck Iii
- Department of Ornithology, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA
| | - Robert G Moyle
- Department of Ecology and Evolutionary Biology and Biodiversity Institute, University of Kansas, 1345 Jayhawk Blvd., Lawrence, KS 66045, USA
| | - Timothy Wright
- Department of Biology, New Mexico State University, Las Cruces, NM, 88003, USA
| | - Leo Joseph
- Australian National Wildlife Collection, National Research Collections Australia, CSIRO, GPO Box 1700, Canberra, ACT, 2601, Australia
| |
Collapse
|
30
|
Astudillo-Clavijo V, Stiassny MLJ, Ilves KL, Musilova Z, Salzburger W, López-Fernández H. Exon-based phylogenomics and the relationships of African cichlid fishes: tackling the challenges of reconstructing phylogenies with repeated rapid radiations. Syst Biol 2022; 72:134-149. [PMID: 35880863 DOI: 10.1093/sysbio/syac051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 07/06/2022] [Accepted: 07/19/2022] [Indexed: 11/13/2022] Open
Abstract
African cichlids (subfamily: Pseudocrenilabrinae) are among the most diverse vertebrates, and their propensity for repeated rapid radiation has made them a celebrated model system in evolutionary research. Nonetheless, despite numerous studies, phylogenetic uncertainty persists, and riverine lineages remain comparatively underrepresented in higher-level phylogenetic studies. Heterogeneous gene histories resulting from incomplete lineage sorting (ILS) and hybridization are likely sources of uncertainty, especially during episodes of rapid speciation. We investigate relationships of Pseudocrenilabrinae and its close relatives while accounting for multiple sources of genetic discordance using species tree and hybrid network analyses with hundreds of single-copy exons. We improve sequence recovery for distant relatives, thereby extending the taxonomic reach of our probes, with a hybrid reference guided/de novo assembly approach. Our analyses provide robust hypotheses for most higher-level relationships and reveal widespread gene heterogeneity, including in riverine taxa. ILS and past hybridization are identified as sources of genetic discordance in different lineages. Sampling of various Blenniiformes (formerly Ovalentaria) adds strong phylogenomic support for convict blennies (Pholidichthyidae) as sister to Cichlidae, and points to other potentially useful protein-coding markers across the order. A reliable phylogeny with representatives from diverse environments will support ongoing taxonomic and comparative evolutionary research in the cichlid model system.
Collapse
Affiliation(s)
- Viviana Astudillo-Clavijo
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada.,Department of Natural History, Royal Ontario Museum, Toronto, M5S 2C6, Canada.,Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, 48109, USA
| | - Melanie L J Stiassny
- Department of Ichthyology, American Museum of Natural History, New York, 10024-5102, USA
| | - Katriina L Ilves
- Research & Collections, Zoology, Canadian Museum of Nature, Ottawa, K1P 6P4, Canada
| | - Zuzana Musilova
- Department of Zoology, Charles University in Prague, Vinicna 7, Prague, CZ-128 44, Czech Republic
| | - Walter Salzburger
- Zoological Institute, University of Basel, Vesalgasse 1, CH-4051, Basel, Switzerland
| | - Hernán López-Fernández
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada.,Department of Natural History, Royal Ontario Museum, Toronto, M5S 2C6, Canada.,Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, 48109, USA
| |
Collapse
|
31
|
Out of chaos: Phylogenomics of Asian Sonerileae. Mol Phylogenet Evol 2022; 175:107581. [PMID: 35810973 DOI: 10.1016/j.ympev.2022.107581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 05/23/2022] [Accepted: 05/26/2022] [Indexed: 11/22/2022]
Abstract
Sonerileae is a diverse Melastomataceae lineage comprising ca. 1000 species in 44 genera, with >70% of genera and species distributed in Asia. Asian Sonerileae are taxonomically intractable with obscure generic circumscriptions. The backbone phylogeny of this group remains poorly resolved, possibly due to complexity caused by rapid species radiation in early and middle Miocene, which hampers further systematic study. Here, we used genome resequencing data to reconstruct the phylogeny of Asian Sonerileae. Three parallel datasets, viz. single-copy ortholog (SCO), genomic SNPs, and whole plastome, were assembled from genome resequencing data of 205 species for this purpose. Based on these genome-scale data, we provided the first well resolved phylogeny of Asian Sonerileae, with 34 major clades identified and 74% of the interclade relationships consistently resolved by both SCO and genomic data. Meanwhile, widespread phylogenetic discordance was detected among SCO gene trees as well as species trees reconstructed using different tree estimation methods (concatenation/site-based coalescent method/summary method) or different datasets (SCO/genomic/plastome). We explored sources of discordance using multiple approaches and found that the observed discordance in Asian Sonerileae was mainly caused by a combination of biased distribution of missing data, random noise from uninformative genes, incomplete lineage sorting, and hybridization/introgression. Exploration of these sources can enable us to generate hypotheses for future testing, which is the first step towards understanding the evolution of Asian Sonerileae. We also detected high levels of homoplasy for some characters traditionally used in taxonomy, which explains current chaotic generic delimitations. The backbone phylogeny of Asian Sonerileae revealed in this study offers a solid basis for future taxonomic revision at the generic level.
Collapse
|
32
|
Abreu EF, Pavan SE, Tsuchiya MTN, McLean BS, Wilson DE, Percequillo AR, Maldonado JE. Old specimens for old branches: Assessing effects of sample age in resolving a rapid Neotropical radiation of squirrels. Mol Phylogenet Evol 2022; 175:107576. [PMID: 35809853 DOI: 10.1016/j.ympev.2022.107576] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 06/10/2022] [Accepted: 07/01/2022] [Indexed: 11/15/2022]
Abstract
Ultraconserved Elements (UCEs) have been useful to resolve challenging phylogenies of non-model clades, unpuzzling long-conflicted relationships in key branches of the Tree of Life at both deep and shallow levels. UCEs are often reliably recovered from historical samples, unlocking a vast number of preserved natural history specimens for analysis. However, the extent to which sample age and preservation method impact UCE recovery as well as downstream inferences remains unclear. Furthermore, there is an ongoing debate on how to curate, filter, and properly analyze UCE data when locus recovery is uneven across sample age and quality. In the present study we address these questions with an empirical dataset composed of over 3800 UCE loci from 219 historical and modern samples of Sciuridae, a globally distributed and ecologically important family of rodents. We provide a genome-scale phylogeny of two squirrel subfamilies (Sciurillinae and Sciurinae: Sciurini) and investigate their placement within Sciuridae. For historical specimens, recovery of UCE loci and mean length per locus were inversely related to sample age; deeper sequencing improved the number of UCE loci recovered but not locus length. Most of our phylogenetic inferences-performed on six datasets with alternative data-filtering strategies, and using three distinct optimality criteria-resulted in distinct topologies. Datasets containing more loci (40% and 50% taxa representativeness matrices) yielded more concordant topologies and higher support values than strictly filtered datasets (60% matrices) particularly with IQ-Tree and SVDquartets, while filtering based on information content provided better topological resolution for inferences with the coalescent gene-tree based approach in ASTRAL-III. We resolved deep relationships in Sciuridae (including among the five currently recognized subfamilies) and relationships among the deepest branches of Sciurini, but conflicting relationships remain at both genus- and species-levels for the rapid Neotropical tree squirrel radiation. Our results suggest that phylogenomic consensus can be difficult and heavily influenced by the age of available samples and the filtering steps used to optimize dataset properties.
Collapse
Affiliation(s)
- Edson F Abreu
- Laboratório de Mamíferos, Departamento de Ciências Biológicas, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Piracicaba, SP, Brazil; Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA.
| | - Silvia E Pavan
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Mirian T N Tsuchiya
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA; Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC, USA
| | - Bryan S McLean
- Department of Biology, University of North Carolina Greensboro, Greensboro, NC, USA
| | - Don E Wilson
- Division of Mammals, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Alexandre R Percequillo
- Laboratório de Mamíferos, Departamento de Ciências Biológicas, Escola Superior de Agricultura Luiz de Queiroz, Universidade de São Paulo, Piracicaba, SP, Brazil
| | - Jesús E Maldonado
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA
| |
Collapse
|
33
|
Gatesy J, Springer MS. Phylogenomic Coalescent Analyses of Avian Retroelements Infer Zero-Length Branches at the Base of Neoaves, Emergent Support for Controversial Clades, and Ancient Introgressive Hybridization in Afroaves. Genes (Basel) 2022; 13:1167. [PMID: 35885951 PMCID: PMC9324441 DOI: 10.3390/genes13071167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 01/25/2023] Open
Abstract
Retroelement insertions (RIs) are low-homoplasy characters that are ideal data for addressing deep evolutionary radiations, where gene tree reconstruction errors can severely hinder phylogenetic inference with DNA and protein sequence data. Phylogenomic studies of Neoaves, a large clade of birds (>9000 species) that first diversified near the Cretaceous−Paleogene boundary, have yielded an array of robustly supported, contradictory relationships among deep lineages. Here, we reanalyzed a large RI matrix for birds using recently proposed quartet-based coalescent methods that enable inference of large species trees including branch lengths in coalescent units, clade-support, statistical tests for gene flow, and combined analysis with DNA-sequence-based gene trees. Genome-scale coalescent analyses revealed extremely short branches at the base of Neoaves, meager branch support, and limited congruence with previous work at the most challenging nodes. Despite widespread topological conflicts with DNA-sequence-based trees, combined analyses of RIs with thousands of gene trees show emergent support for multiple higher-level clades (Columbea, Passerea, Columbimorphae, Otidimorphae, Phaethoquornithes). RIs express asymmetrical support for deep relationships within the subclade Afroaves that hints at ancient gene flow involving the owl lineage (Strigiformes). Because DNA-sequence data are challenged by gene tree-reconstruction error, analysis of RIs represents one approach for improving gene tree-based methods when divergences are deep, internodes are short, terminal branches are long, and introgressive hybridization further confounds species−tree inference.
Collapse
Affiliation(s)
- John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Mark S. Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA 92521, USA;
| |
Collapse
|
34
|
Xiong H, Wang D, Shao C, Yang X, Yang J, Ma T, Davis CC, Liu L, Xi Z. Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication. Syst Biol 2022; 71:1348-1361. [PMID: 35689633 PMCID: PMC9558847 DOI: 10.1093/sysbio/syac040] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 06/03/2022] [Accepted: 06/07/2022] [Indexed: 12/02/2022] Open
Abstract
Whole-genome duplication (WGD) occurs broadly and repeatedly across the history of eukaryotes and is recognized as a prominent evolutionary force, especially in plants. Immediately following WGD, most genes are present in two copies as paralogs. Due to this redundancy, one copy of a paralog pair commonly undergoes pseudogenization and is eventually lost. When speciation occurs shortly after WGD; however, differential loss of paralogs may lead to spurious phylogenetic inference resulting from the inclusion of pseudoorthologs–paralogous genes mistakenly identified as orthologs because they are present in single copies within each sampled species. The influence and impact of including pseudoorthologs versus true orthologs as a result of gene extinction (or incomplete laboratory sampling) are only recently gaining empirical attention in the phylogenomics community. Moreover, few studies have yet to investigate this phenomenon in an explicit coalescent framework. Here, using mathematical models, numerous simulated data sets, and two newly assembled empirical data sets, we assess the effect of pseudoorthologs on species tree estimation under varying degrees of incomplete lineage sorting (ILS) and differential gene loss scenarios following WGD. When gene loss occurs along the terminal branches of the species tree, alignment-based (BPP) and gene-tree-based (ASTRAL, MP-EST, and STAR) coalescent methods are adversely affected as the degree of ILS increases. This can be greatly improved by sampling a sufficiently large number of genes. Under the same circumstances, however, concatenation methods consistently estimate incorrect species trees as the number of genes increases. Additionally, pseudoorthologs can greatly mislead species tree inference when gene loss occurs along the internal branches of the species tree. Here, both coalescent and concatenation methods yield inconsistent results. These results underscore the importance of understanding the influence of pseudoorthologs in the phylogenomics era. [Coalescent method; concatenation method; incomplete lineage sorting; pseudoorthologs; single-copy gene; whole-genome duplication.]
Collapse
Affiliation(s)
- Haifeng Xiong
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Danying Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Chen Shao
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Xuchen Yang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Jialin Yang
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
| | - Liang Liu
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Zhenxiang Xi
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| |
Collapse
|
35
|
Pozzi L, Penna A. Rocks and clocks revised: New promises and challenges in dating the primate tree of life. Evol Anthropol 2022; 31:138-153. [PMID: 35102633 DOI: 10.1002/evan.21940] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 10/04/2021] [Accepted: 01/12/2022] [Indexed: 01/14/2023]
Abstract
In recent years, multiple technological and methodological advances have increased our ability to estimate phylogenies, leading to more accurate dating of the primate tree of life. Here we provide an overview of the limitations and potentials of some of these advancements and discuss how dated phylogenies provide the crucial temporal scale required to understand primate evolution. First, we review new methods, such as the total-evidence dating approach, that promise a better integration between the fossil record and molecular data. We then explore how the ever-increasing availability of genomic-level data for more primate species can impact our ability to accurately estimate timetrees. Finally, we discuss more recent applications of mutation rates to date divergence times. We highlight example studies that have applied these approaches to estimate divergence dates within primates. Our goal is to provide a critical overview of these new developments and explore the promises and challenges of their application in evolutionary anthropology.
Collapse
Affiliation(s)
- Luca Pozzi
- Department of Anthropology, The University of Texas at San Antonio, San Antonio, Texas, USA
| | - Anna Penna
- Department of Anthropology, The University of Texas at San Antonio, San Antonio, Texas, USA
| |
Collapse
|
36
|
Willson J, Roddur MS, Liu B, Zaharias P, Warnow T. DISCO: Species Tree Inference using Multicopy Gene Family Tree Decomposition. Syst Biol 2022; 71:610-629. [PMID: 34450658 PMCID: PMC9016570 DOI: 10.1093/sysbio/syab070] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 11/21/2022] Open
Abstract
Species tree inference from gene family trees is a significant problem in computational biology. However, gene tree heterogeneity, which can be caused by several factors including gene duplication and loss, makes the estimation of species trees very challenging. While there have been several species tree estimation methods introduced in recent years to specifically address gene tree heterogeneity due to gene duplication and loss (such as DupTree, FastMulRFS, ASTRAL-Pro, and SpeciesRax), many incur high cost in terms of both running time and memory. We introduce a new approach, DISCO, that decomposes the multi-copy gene family trees into many single copy trees, which allows for methods previously designed for species tree inference in a single copy gene tree context to be used. We prove that using DISCO with ASTRAL (i.e., ASTRAL-DISCO) is statistically consistent under the GDL model, provided that ASTRAL-Pro correctly roots and tags each gene family tree. We evaluate DISCO paired with different methods for estimating species trees from single copy genes (e.g., ASTRAL, ASTRID, and IQ-TREE) under a wide range of model conditions, and establish that high accuracy can be obtained even when ASTRAL-Pro is not able to correctly roots and tags the gene family trees. We also compare results using MI, an alternative decomposition strategy from Yang Y. and Smith S.A. (2014), and find that DISCO provides better accuracy, most likely as a result of covering more of the gene family tree leafset in the output decomposition. [Concatenation analysis; gene duplication and loss; species tree inference; summary method.].
Collapse
Affiliation(s)
- James Willson
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Mrinmoy Saha Roddur
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Baqiao Liu
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Paul Zaharias
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
37
|
Dasarathy G, Mossel E, Nowak R, Roch S. A stochastic Farris transform for genetic data under the multispecies coalescent with applications to data requirements. J Math Biol 2022; 84:36. [PMID: 35394192 PMCID: PMC9258723 DOI: 10.1007/s00285-022-01731-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 02/15/2022] [Accepted: 02/17/2022] [Indexed: 10/18/2022]
Abstract
Species tree estimation faces many significant hurdles. Chief among them is that the trees describing the ancestral lineages of each individual gene-the gene trees-often differ from the species tree. The multispecies coalescent is commonly used to model this gene tree discordance, at least when it is believed to arise from incomplete lineage sorting, a population-genetic effect. Another significant challenge in this area is that molecular sequences associated to each gene typically provide limited information about the gene trees themselves. While the modeling of sequence evolution by single-site substitutions is well-studied, few species tree reconstruction methods with theoretical guarantees actually address this latter issue. Instead, a standard-but unsatisfactory-assumption is that gene trees are perfectly reconstructed before being fed into a so-called summary method. Hence much remains to be done in the development of inference methodologies that rigorously account for gene tree estimation error-or completely avoid gene tree estimation in the first place. In previous work, a data requirement trade-off was derived between the number of loci m needed for an accurate reconstruction and the length of the locus sequences k. It was shown that to reconstruct an internal branch of length f, one needs m to be of the order of [Formula: see text]. That previous result was obtained under the restrictive assumption that mutation rates as well as population sizes are constant across the species phylogeny. Here we further generalize this result beyond this assumption. Our main contribution is a novel reduction to the molecular clock case under the multispecies coalescent, which we refer to as a stochastic Farris transform. As a corollary, we also obtain a new identifiability result of independent interest: for any species tree with [Formula: see text] species, the rooted topology of the species tree can be identified from the distribution of its unrooted weighted gene trees even in the absence of a molecular clock.
Collapse
Affiliation(s)
- Gautam Dasarathy
- School of Electrical, Computer, and Energy Engineering, Arizona State University, Tempe, USA
| | - Elchanan Mossel
- Department of Mathematics and IDSS, Massachusetts Institute of Technology, Cambridge, USA
| | - Robert Nowak
- Department of Electrical and Computer Engineering, University of Wisconsin, Madison, USA
| | - Sebastien Roch
- Department of Mathematics, University of Wisconsin, Madison, USA.
| |
Collapse
|
38
|
Hou Z, Ma X, Shi X, Li X, Yang L, Xiao S, De Clerck O, Leliaert F, Zhong B. Phylotranscriptomic insights into a Mesoproterozoic-Neoproterozoic origin and early radiation of green seaweeds (Ulvophyceae). Nat Commun 2022; 13:1610. [PMID: 35318329 PMCID: PMC8941102 DOI: 10.1038/s41467-022-29282-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 03/09/2022] [Indexed: 01/09/2023] Open
Abstract
The Ulvophyceae, a major group of green algae, is of particular evolutionary interest because of its remarkable morphological and ecological diversity. Its phylogenetic relationships and diversification timeline, however, are still not fully resolved. In this study, using an extensive nuclear gene dataset, we apply coalescent- and concatenation-based approaches to reconstruct the phylogeny of the Ulvophyceae and to explore the sources of conflict in previous phylogenomic studies. The Ulvophyceae is recovered as a paraphyletic group, with the Bryopsidales being a sister group to the Chlorophyceae, and the remaining taxa forming a clade (Ulvophyceae sensu stricto). Molecular clock analyses with different calibration strategies emphasize the large impact of fossil calibrations, and indicate a Meso-Neoproterozoic origin of the Ulvophyceae (sensu stricto), earlier than previous estimates. The results imply that ulvophyceans may have had a profound influence on oceanic redox structures and global biogeochemical cycles at the Mesoproterozoic-Neoproterozoic transition. “Ulvophyceae is a remarkably morphologically and ecologically diverse clade of green algae. Here, the authors reconstruct the Ulvophyceae phylogeny, showing that these algae originated earlier than expected and may have influenced biogeochemical cycles at the Mesoproterozoic-Neoproterozoic transition.”
Collapse
Affiliation(s)
- Zheng Hou
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xiaoya Ma
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xuan Shi
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xi Li
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Lingxiao Yang
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Shuhai Xiao
- Department of Geosciences and Global Change Center, Virginia Tech, Blacksburg, VA, USA
| | - Olivier De Clerck
- Phycology Research Group and Center for Molecular Phylogenetics and Evolution, Ghent University, Ghent, Belgium
| | - Frederik Leliaert
- Phycology Research Group and Center for Molecular Phylogenetics and Evolution, Ghent University, Ghent, Belgium.,Meise Botanic Garden, Meise, Belgium
| | - Bojian Zhong
- College of Life Sciences, Nanjing Normal University, Nanjing, China.
| |
Collapse
|
39
|
Schull JK, Turakhia Y, Hemker JA, Dally WJ, Bejerano G. Champagne: Automated Whole-Genome Phylogenomic Character Matrix Method Using Large Genomic Indels for Homoplasy-Free Inference. Genome Biol Evol 2022; 14:evac013. [PMID: 35171243 PMCID: PMC8920512 DOI: 10.1093/gbe/evac013] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/10/2022] [Indexed: 11/14/2022] Open
Abstract
We present Champagne, a whole-genome method for generating character matrices for phylogenomic analysis using large genomic indel events. By rigorously picking orthologous genes and locating large insertion and deletion events, Champagne delivers a character matrix that considerably reduces homoplasy compared with morphological and nucleotide-based matrices, on both established phylogenies and difficult-to-resolve nodes in the mammalian tree. Champagne provides ample evidence in the form of genomic structural variation to support incomplete lineage sorting and possible introgression in Paenungulata and human-chimp-gorilla which were previously inferred primarily through matrices composed of aligned single-nucleotide characters. Champagne also offers further evidence for Myomorpha as sister to Sciuridae and Hystricomorpha in the rodent tree. Champagne harbors distinct theoretical advantages as an automated method that produces nearly homoplasy-free character matrices on the whole-genome scale.
Collapse
Affiliation(s)
- James K Schull
- Department of Computer Science, Stanford University, USA
| | - Yatish Turakhia
- Department of Electrical and Computer Engineering, University of California San Diego, USA
| | - James A Hemker
- Department of Computer Science, Stanford University, USA
| | - William J Dally
- Department of Computer Science, Stanford University, USA
- NVIDIA, Santa Clara, California, USA
- Department of Electrical Engineering, Stanford University, USA
| | - Gill Bejerano
- Department of Computer Science, Stanford University, USA
- Department of Developmental Biology, Stanford University, USA
- Department of Biomedical Data Science, Stanford University, USA
- Department of Pediatrics, Stanford University, USA
| |
Collapse
|
40
|
Herranz M, Stiller J, Worsaae K, Sørensen MV. Phylogenomic analyses of mud dragons (Kinorhyncha). Mol Phylogenet Evol 2021; 168:107375. [PMID: 34952205 DOI: 10.1016/j.ympev.2021.107375] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 11/18/2021] [Accepted: 11/29/2021] [Indexed: 10/19/2022]
Abstract
Mud dragons (Kinorhyncha) are microscopic invertebrates, inhabiting marine sediments across the globe from intertidal to hadal depths. They are segmented, moulting animals like arthropods, but grouping with the unsegmented priapulans and loriciferans within Ecdysozoa. There are more than 300 species of kinorhynchs described within 31 genera and 11 families, however, their evolutionary relationships have so far only been investigated using morphology and a few molecular markers. Here we aim to resolve the relationships and classification of major clades within Kinorhyncha using transcriptomic data. In addition, we wish to revisit the position of three indistinctly segmented, aberrant genera in order to reconstruct the evolution of distinct segmentation within the group. We conducted a phylogenomic analysis of Kinorhyncha including 21 kinorhynch transcriptomes (of which 18 are new) representing 15 genera, and seven outgroups including priapulan, loriciferan, nematode and nematomorph transcriptomes. Results show a congruent and robust tree that supports the division of Kinorhyncha into two major clades: Cyclorhagida and Allomalorhagida. Cyclorhagida is composed of three subclades: Xenosomata, Kentrorhagata comb. nov. (including the aberrant Zelinkaderes) and Echinorhagata. Allomalorhagida is composed of two subclades: Pycnophyidae and Anomoirhaga nom. nov. Anomoirhaga nom. nov. accommodates the aberrant genera Cateria (previously nested within Cyclorhagida) and Franciscideres together with five additional genera. The distant and derived positions of the aberrant Zelinkaderes, Cateria and Franciscideres species suggest that their less distinct trunk segmentation evolved convergently, and that segmentation evolved among kinorhynch stem groups.
Collapse
Affiliation(s)
- Maria Herranz
- Marine Biology section, Department of Biology, University of Copenhagen, 2100 Copenhagen, Denmark; Natural History Museum of Denmark, University of Copenhagen, 2100 Copenhagen, Denmark.
| | - Josefin Stiller
- Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, 2100 Copenhagen, Denmark.
| | - Katrine Worsaae
- Marine Biology section, Department of Biology, University of Copenhagen, 2100 Copenhagen, Denmark.
| | - Martin V Sørensen
- Natural History Museum of Denmark, University of Copenhagen, 2100 Copenhagen, Denmark.
| |
Collapse
|
41
|
How challenging RADseq data turned out to favor coalescent-based species tree inference. A case study in Aichryson (Crassulaceae). Mol Phylogenet Evol 2021; 167:107342. [PMID: 34785384 DOI: 10.1016/j.ympev.2021.107342] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 07/05/2021] [Accepted: 10/29/2021] [Indexed: 12/24/2022]
Abstract
Analysing multiple genomic regions while incorporating detection and qualification of discordance among regions has become standard for understanding phylogenetic relationships. In plants, which usually have comparatively large genomes, this is feasible by the combination of reduced-representation library (RRL) methods and high-throughput sequencing enabling the cost effective acquisition of genomic data for thousands of loci from hundreds of samples. One popular RRL method is RADseq. A major disadvantage of established RADseq approaches is the rather short fragment and sequencing range, leading to loci of little individual phylogenetic information. This issue hampers the application of coalescent-based species tree inference. The modified RADseq protocol presented here targets ca. 5,000 loci of 300-600nt length, sequenced with the latest short-read-sequencing (SRS) technology, has the potential to overcome this drawback. To illustrate the advantages of this approach we use the study group Aichryson Webb & Berthelott (Crassulaceae), a plant genus that diversified on the Canary Islands. The data analysis approach used here aims at a careful quality control of the long loci dataset. It involves an informed selection of thresholds for accurate clustering, a thorough exploration of locus properties, such as locus length, coverage and variability, to identify potential biased data and a comparative phylogenetic inference of filtered datasets, accompanied by an evaluation of resulting BS support, gene and site concordance factor values, to improve overall resolution of the resulting phylogenetic trees. The final dataset contains variable loci with an average length of 373nt and facilitates species tree estimation using a coalescent-based summary approach. Additional improvements brought by the approach are critically discussed.
Collapse
|
42
|
Trad RJ, Cabral FN, Bittrich V, Silva SRD, Amaral MDCED. Calophyllaceae plastomes, their structure and insights in relationships within the clusioids. Sci Rep 2021; 11:20712. [PMID: 34671062 PMCID: PMC8528878 DOI: 10.1038/s41598-021-99178-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Accepted: 09/21/2021] [Indexed: 11/16/2022] Open
Abstract
A complete chloroplast genome is not yet available for numerous species of plants. Among the groups that lack plastome information is the clusioid clade (Malpighiales), which includes five families: Bonnetiaceae, Calophyllaceae, Clusiaceae, Hypericaceae, and Podostemaceae. With around 2200 species, it has few published plastomes and most of them are from Podostemaceae. Here we assembled and compared six plastomes from members of the clusioids: five from Calophyllaceae (newly sequenced) and one from Clusiaceae. Putative regions for evolutionary studies were identified and the newly assembled chloroplasts were analyzed with other available chloroplasts for the group, focusing on Calophyllaceae. Our results mostly agree with recent studies which found a general conserved structure, except for the two Podostemaceae species that have a large inversion (trnK-UUU–rbcL) and lack one intron from ycf3. Within Calophyllaceae we observed a longer LSC and reduced IRs in Mahurea exstipulata, resulting in some genic rearrangement, and a short inversion (psbJ–psbE) in Kielmeyera coriacea. Phylogenetic analyses recovered the clusioids and the five families as monophyletic and revealed that conflicts in relationships reported in the literature for the group agree with nodes concentrating uninformative or conflicting gene trees. Our study brings new insights about clusioid plastome architecture and its evolution.
Collapse
Affiliation(s)
- Rafaela Jorge Trad
- Department of Plant Biology, Biology Institute, University of Campinas (UNICAMP), CP 6109, Campinas, SP, 13083-970, Brazil. .,Macroecology Lab @ J3-166, Institute of Biological Sciences - ICB, Federal University of Minas Gerais (UFMG), Belo Horizonte, Campinas, MG, 31270-901, Brazil.
| | - Fernanda Nunes Cabral
- Departamento de Ciências e Linguagens, Instituto Federal de Minas Gerais - Campus Bambuí, Bambuí, MG, 38900-000, Brazil
| | - Volker Bittrich
- Volker Bittrich is an independent scientist, Campinas, Brazil
| | - Saura Rodrigues da Silva
- Department of Technology, UNESP - São Paulo State University, Campus Jaboticabal, Jaboticabal, SP, 14884-900, Brazil
| | | |
Collapse
|
43
|
Borowiec ML, Cover SP, Rabeling C. The evolution of social parasitism in Formica ants revealed by a global phylogeny. Proc Natl Acad Sci U S A 2021; 118:e2026029118. [PMID: 34535549 PMCID: PMC8463886 DOI: 10.1073/pnas.2026029118] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2021] [Indexed: 02/07/2023] Open
Abstract
Studying the behavioral and life history transitions from a cooperative, eusocial life history to exploitative social parasitism allows for deciphering the conditions under which changes in behavior and social organization lead to diversification. The Holarctic ant genus Formica is ideally suited for studying the evolution of social parasitism because half of its 172 species are confirmed or suspected social parasites, which includes all three major classes of social parasitism known in ants. However, the life history transitions associated with the evolution of social parasitism in this genus are largely unexplored. To test competing hypotheses regarding the origins and evolution of social parasitism, we reconstructed a global phylogeny of Formica ants. The genus originated in the Old World ∼30 Ma ago and dispersed multiple times to the New World and back. Within Formica, obligate dependent colony-founding behavior arose once from a facultatively polygynous common ancestor practicing independent and facultative dependent colony foundation. Temporary social parasitism likely preceded or arose concurrently with obligate dependent colony founding, and dulotic social parasitism evolved once within the obligate dependent colony-founding clade. Permanent social parasitism evolved twice from temporary social parasitic ancestors that rarely practiced colony budding, demonstrating that obligate social parasitism can originate from a facultative parasitic background in socially polymorphic organisms. In contrast to permanently socially parasitic ants in other genera, the high parasite diversity in Formica likely originated via allopatric speciation, highlighting the diversity of convergent evolutionary trajectories resulting in nearly identical parasitic life history syndromes.
Collapse
Affiliation(s)
- Marek L Borowiec
- School of Life Sciences, Arizona State University, Tempe, AZ 85287;
- Department of Entomology, Plant Pathology, and Nematology, University of Idaho, Moscow, ID 83844
- Institute of Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID 83844
| | - Stefan P Cover
- Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138
| | | |
Collapse
|
44
|
Forthman M, Braun EL, Kimball RT. Gene tree quality affects empirical coalescent branch length estimation. ZOOL SCR 2021. [DOI: 10.1111/zsc.12512] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Michael Forthman
- Department of Entomology & Nematology University of Florida Gainesville FL USA
- California State Collection of Arthropods Plant Pest Diagnostics Branch California Department of Food & Agriculture Sacramento CA USA
| | - Edward L. Braun
- Department of Biology University of Florida Gainesville FL USA
| | | |
Collapse
|
45
|
Ferrer Obiol J, James HF, Chesser RT, Bretagnolle V, González-Solís J, Rozas J, Riutort M, Welch AJ. Integrating Sequence Capture and Restriction Site-Associated DNA Sequencing to Resolve Recent Radiations of Pelagic Seabirds. Syst Biol 2021; 70:976-996. [PMID: 33512506 PMCID: PMC8357341 DOI: 10.1093/sysbio/syaa101] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Revised: 11/13/2020] [Accepted: 12/15/2020] [Indexed: 01/01/2023] Open
Abstract
The diversification of modern birds has been shaped by a number of radiations. Rapid diversification events make reconstructing the evolutionary relationships among taxa challenging due to the convoluted effects of incomplete lineage sorting (ILS) and introgression. Phylogenomic data sets have the potential to detect patterns of phylogenetic incongruence, and to address their causes. However, the footprints of ILS and introgression on sequence data can vary between different phylogenomic markers at different phylogenetic scales depending on factors such as their evolutionary rates or their selection pressures. We show that combining phylogenomic markers that evolve at different rates, such as paired-end double-digest restriction site-associated DNA (PE-ddRAD) and ultraconserved elements (UCEs), allows a comprehensive exploration of the causes of phylogenetic discordance associated with short internodes at different timescales. We used thousands of UCE and PE-ddRAD markers to produce the first well-resolved phylogeny of shearwaters, a group of medium-sized pelagic seabirds that are among the most phylogenetically controversial and endangered bird groups. We found that phylogenomic conflict was mainly derived from high levels of ILS due to rapid speciation events. We also documented a case of introgression, despite the high philopatry of shearwaters to their breeding sites, which typically limits gene flow. We integrated state-of-the-art concatenated and coalescent-based approaches to expand on previous comparisons of UCE and RAD-Seq data sets for phylogenetics, divergence time estimation, and inference of introgression, and we propose a strategy to optimize RAD-Seq data for phylogenetic analyses. Our results highlight the usefulness of combining phylogenomic markers evolving at different rates to understand the causes of phylogenetic discordance at different timescales. [Aves; incomplete lineage sorting; introgression; PE-ddRAD-Seq; phylogenomics; radiations; shearwaters; UCEs.].
Collapse
Affiliation(s)
- Joan Ferrer Obiol
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | - Helen F James
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - R Terry Chesser
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
- U.S. Geological Survey, Patuxent Wildlife Research Center, Laurel, MD, USA
| | - Vincent Bretagnolle
- Centre d’Études Biologiques de Chizé, CNRS & La Rochelle Université, 79360, Villiers en Bois, France
| | - Jacob González-Solís
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
| | - Julio Rozas
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | - Marta Riutort
- Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Barcelona, Catalonia, Spain
| | | |
Collapse
|
46
|
Doyle JJ. Defining coalescent genes: Theory meets practice in organelle phylogenomics. Syst Biol 2021; 71:476-489. [PMID: 34191012 DOI: 10.1093/sysbio/syab053] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 06/24/2021] [Accepted: 06/28/2021] [Indexed: 11/13/2022] Open
Abstract
The species tree paradigm that dominates current molecular systematic practice infers species trees from collections of sequences under assumptions of the multispecies coalescent (MSC), i.e., that there is free recombination between the sequences and no (or very low) recombination within them. These coalescent genes (c-genes) are thus defined in an historical rather than molecular sense, and can in theory be as large as an entire genome or as small as a single nucleotide. A debate about how to define c-genes centers on the contention that nuclear gene sequences used in many coalescent analyses undergo too much recombination, such that their introns comprise multiple c-genes, violating a key assumption of the MSC. Recently a similar argument has been made for the genes of plastid (e.g., chloroplast) and mitochondrial genomes, which for the last 30 or more years have been considered to represent a single c-gene for the purposes of phylogeny reconstruction because they are non-recombining in a historical sense. Consequently, it has been suggested that these genomes should be analyzed using coalescent methods that treat their genes-over 70 protein-coding genes in the case of most plastid genomes (plastomes)-as independent estimates of species phylogeny, in contrast to the usual practice of concatenation, which is appropriate for generating gene trees. However, although recombination certainly occurs in the plastome, as has been recognized since the 1970's, it is unlikely to be phylogenetically relevant. This is because such historically effective recombination can only occur when plastomes with incongruent histories are brought together in the same plastid. However, plastids sort rapidly into different cell lineages and rarely fuse. Thus, because of plastid biology, the plastome is a more canonical c-gene than is the average multi-intron mammalian nuclear gene. The plastome should thus continue to be treated as a single estimate of the underlying species phylogeny, as should the mitochondrial genome. The implications of this long-held insight of molecular systematics for studies in the phylogenomic era are explored.
Collapse
Affiliation(s)
- Jeff J Doyle
- Plant Biology Section, Plant Breeding & Genetics Section, and L. H. Bailey Hortorium, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853 USA
| |
Collapse
|
47
|
Zhao T, Zwaenepoel A, Xue JY, Kao SM, Li Z, Schranz ME, Van de Peer Y. Whole-genome microsynteny-based phylogeny of angiosperms. Nat Commun 2021; 12:3498. [PMID: 34108452 PMCID: PMC8190143 DOI: 10.1038/s41467-021-23665-0] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Accepted: 05/10/2021] [Indexed: 02/05/2023] Open
Abstract
Plant genomes vary greatly in size, organization, and architecture. Such structural differences may be highly relevant for inference of genome evolution dynamics and phylogeny. Indeed, microsynteny-the conservation of local gene content and order-is recognized as a valuable source of phylogenetic information, but its use for the inference of large phylogenies has been limited. Here, by combining synteny network analysis, matrix representation, and maximum likelihood phylogenetic inference, we provide a way to reconstruct phylogenies based on microsynteny information. Both simulations and use of empirical data sets show our method to be accurate, consistent, and widely applicable. As an example, we focus on the analysis of a large-scale whole-genome data set for angiosperms, including more than 120 available high-quality genomes, representing more than 50 different plant families and 30 orders. Our 'microsynteny-based' tree is largely congruent with phylogenies proposed based on more traditional sequence alignment-based methods and current phylogenetic classifications but differs for some long-contested and controversial relationships. For instance, our synteny-based tree finds Vitales as early diverging eudicots, Saxifragales within superasterids, and magnoliids as sister to monocots. We discuss how synteny-based phylogenetic inference can complement traditional methods and could provide additional insights into some long-standing controversial phylogenetic relationships.
Collapse
Affiliation(s)
- Tao Zhao
- State Key Laboratory of Crop Stress Biology for Arid Areas/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling, China.
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
- Center for Plant Systems Biology, VIB, Ghent, Belgium.
| | - Arthur Zwaenepoel
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Jia-Yu Xue
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, China
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing, China
| | - Shu-Min Kao
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Zhen Li
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - M Eric Schranz
- Biosystematics Group, Wageningen University and Research, Wageningen, The Netherlands
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
- Center for Plant Systems Biology, VIB, Ghent, Belgium.
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, Nanjing, China.
- Center for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa.
| |
Collapse
|
48
|
Mahbub M, Wahab Z, Reaz R, Rahman MS, Bayzid MS. wQFM: Highly Accurate Genome-scale Species Tree Estimation from Weighted Quartets. Bioinformatics 2021; 37:3734-3743. [PMID: 34086858 DOI: 10.1093/bioinformatics/btab428] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/24/2021] [Accepted: 06/03/2021] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION Species tree estimation from genes sampled from throughout the whole genome is complicated due to the gene tree-species tree discordance. Incomplete lineage sorting (ILS) is one of the most frequent causes for this discordance, where alleles can coexist in populations for periods that may span several speciation events. Quartet-based summary methods for estimating species trees from a collection of gene trees are becoming popular due to their high accuracy and statistical guarantee under ILS. Generating quartets with appropriate weights, where weights correspond to the relative importance of quartets, and subsequently amalgamating the weighted quartets to infer a single coherent species tree can allow for a statistically consistent way of estimating species trees. However, handling weighted quartets is challenging. RESULTS We propose wQFM, a highly accurate method for species tree estimation from multi-locus data, by extending the quartet FM (QFM) algorithm to a weighted setting. wQFM was assessed on a collection of simulated and real biological datasets, including the avian phylogenomic dataset which is one of the largest phylogenomic datasets to date. We compared wQFM with wQMC, which is the best alternate method for weighted quartet amalgamation, and with ASTRAL, which is one of the most accurate and widely used coalescent-based species tree estimation methods. Our results suggest that wQFM matches or improves upon the accuracy of wQMC and ASTRAL. AVAILABILITY wQFM is available in open source form at https://github.com/Mahim1997/wQFM-2020. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mahim Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Zahin Wahab
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Rezwana Reaz
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - M Saifur Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| |
Collapse
|
49
|
Introgression is widespread in the radiation of carnivorous Nepenthes pitcher plants. Mol Phylogenet Evol 2021; 163:107214. [PMID: 34052438 DOI: 10.1016/j.ympev.2021.107214] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2021] [Revised: 05/14/2021] [Accepted: 05/25/2021] [Indexed: 11/23/2022]
Abstract
Introgression and hybridization are important processes in plant evolution, but they are difficult to study from a phylogenetic perspective, because they conflict with the bifurcating evolutionary history typically depicted in phylogenetic models. The role of hybridization in plant evolution is best documented in the form of allo-polyploidizations. In contrast, homoploid hybridization and introgression are less explored, although they may be crucial in adaptive radiations. Here we employ genome-wide data (ddRAD-seq, transcriptomes) to investigate the evolutionary history of Nepenthes, a radiation of c. 160 species of iconic carnivorous plants mainly from tropical Asia. Our data indicates that the main radiation is only c. 5 million years old, and confirms previous bifurcating phylogenies. However, due to a greatly expanded number of loci, we were able test for the first time the long-standing hypotheses of introgression and historical hybridization. The genus presents one very clear case of organellar capture between two distantly related but sympatric groups. Furthermore, all Nepenthes species show introgression signals in their nuclear genomes, as uncovered by a general survey of ABBA-BABA-like statistics. The ancestor of the rapid main radiation shows ancestry from two deeply diverged lineages, as indicated by phylogenetic network analyses. All major clades of the main radiation show further introgression both within and between each other, as suggested by admixture graphs. Our study supports the hypothesis that rapid adaptive radiations are hotspots of introgression in the tree of life, and highlights the need to consider non-treelike processes in evolutionary studies of Nepenthes in particular.
Collapse
|
50
|
Farah IT, Islam MM, Zinat KT, Rahman AH, Bayzid MS. Species tree estimation from gene trees by minimizing deep coalescence and maximizing quartet consistency: a comparative study and the presence of pseudo species tree terraces. Syst Biol 2021; 70:1213-1231. [PMID: 33844023 DOI: 10.1093/sysbio/syab026] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2020] [Revised: 03/25/2021] [Accepted: 03/29/2021] [Indexed: 11/14/2022] Open
Abstract
Species tree estimation from multi-locus datasets is extremely challenging, especially in the presence of gene tree heterogeneity across the genome due to incomplete lineage sorting (ILS). Summary methods have been developed which estimate gene trees and then combine the gene trees to estimate a species tree by optimizing various optimization scores. In this study, we have extended and adapted the concept of phylogenetic terraces to species tree estimation by "summarizing" a set of gene trees, where multiple species trees with distinct topologies may have exactly the same optimality score (i.e., quartet score, extra lineage score, etc.). We particularly investigated the presence and impacts of equally optimal trees in species tree estimation from multi-locus data using summary methods by taking ILS into account. We analyzed two of the most popular ILS-aware optimization criteria: maximize quartet consistency (MQC) and minimize deep coalescence (MDC). Methods based on MQC are provably statistically consistent, whereas MDC is not a consistent criterion for species tree estimation. We present a comprehensive comparative study of these two optimality criteria. Our experiments, on a collection of datasets simulated under ILS, indicate that MDC may result in competitive or identical quartet consistency score as MQC, but could be significantly worse than MQC in terms of tree accuracy - demonstrating the presence and impacts of equally optimal species trees. This is the first known study that provides the conditions for the datasets to have equally optimal trees in the context of phylogenomic inference using summary methods.
Collapse
Affiliation(s)
- Ishrat Tanzila Farah
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology Dhaka-1205, Bangladesh
| | - Md Muktadirul Islam
- Applied Statistics and Data Science (ASDS), Department of Statistics Jahangirnagar University Dhaka-1342, Bangladesh
| | - Kazi Tasnim Zinat
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology Dhaka-1205, Bangladesh.,Department of Computer Science University of Maryland, College Park, Maryland, USA
| | - Atif Hasan Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology Dhaka-1205, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology Dhaka-1205, Bangladesh
| |
Collapse
|