1
|
Jin Y, Du X, Jiang C, Ji W, Yang P. Disentangling sources of gene tree discordance for Hordeum species via target-enriched sequencing assays. Mol Phylogenet Evol 2024; 199:108160. [PMID: 39019201 DOI: 10.1016/j.ympev.2024.108160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 07/04/2024] [Accepted: 07/14/2024] [Indexed: 07/19/2024]
Abstract
Hordeum is an economically and evolutionarily important genus within the Triticeae tribe of the family Poaceae, and contains 33 widely distributed and diverse species which cytologically represent four subgenomes (H, Xa, Xu and I). These wild species (except Hordeum spontaneum, which is the primary gene pool of barley) are secondary or tertiary gene-pool germplasms for barley and wheat improvement, and uncovering their complicated evolutionary relationships would benefit for future breeding programs. Here, we developed a complexity-reduced pipeline via capturing genome-wide distributed fragments via two novel target-enriched assays (HorCap v1.0 and BarPlex v1.0) in conjugation with high-throughput sequencing of the enrichments. Both assays were tested for genotyping 40 species from three genera (Hordeum, Triticum, and Aegilops) containing 82 samples 67 accessions. Either of both assays worked efficiently in genotyping, while integration of both assays can significantly improve the robustness and resolution of the Hordeum phylogenetic trees. Interestingly, the incomplete lineage sorting (ILS) was inferred for the first time as the major factor causing phylogenetic discordance among the four subgenomes, whereas in New World species (carrying I genome) post-speciation introgression events were revealed. Through revising the evolutionary relationships of the Hordeum species based on an ancestral state reconstruction for the diploids and parental donor inference for the polyploids, our results raised new queries about the Hordeum phylogeny. Moreover, both newly-developed assays are applicable in genotyping and phylogenetic analysis of Hordeum and other Triticeae wild species.
Collapse
Affiliation(s)
- Yanlong Jin
- State Key Laboratory of Crop Gene Resources and Breeding, Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest AandF University, Yangling 712100, China
| | - Xin Du
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest AandF University, Yangling 712100, China
| | - Congcong Jiang
- State Key Laboratory of Crop Gene Resources and Breeding, Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Wanquan Ji
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest AandF University, Yangling 712100, China
| | - Ping Yang
- State Key Laboratory of Crop Gene Resources and Breeding, Key Laboratory of Grain Crop Genetic Resources Evaluation and Utilization (MARA), Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China.
| |
Collapse
|
2
|
Raza M, Ortiz EM, Schwung L, Shigita G, Schaefer H. Resolving the phylogeny of Thladiantha (Cucurbitaceae) with three different target capture pipelines. BMC Ecol Evol 2023; 23:75. [PMID: 38087247 PMCID: PMC10714463 DOI: 10.1186/s12862-023-02185-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 12/05/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND Despite recent advances, reliable tools to simultaneously handle different types of sequencing data (e.g., target capture, genome skimming) for phylogenomics are still scarce. Here, we evaluate the performance of the recently developed pipeline Captus in comparison with the well-known target capture pipelines HybPiper and SECAPR. As test data, we analyzed newly generated sequences for the genus Thladiantha (Cucurbitaceae) for which no well-resolved phylogeny estimate has been available so far, as well as simulated reads derived from the genome of Arabidopsis thaliana. RESULTS Our pipeline comparisons are based on (1) the time needed for data assembly and locus extraction, (2) locus recovery per sample, (3) the number of informative sites in nucleotide alignments, and (4) the topology of the nuclear and plastid phylogenies. Additionally, the simulated reads derived from the genome of Arabidopsis thaliana were used to evaluate the accuracy and completeness of the recovered loci. In terms of computation time, locus recovery per sample, and informative sites, Captus outperforms HybPiper and SECAPR. The resulting topologies of Captus and SECAPR are identical for coalescent trees but differ when trees are inferred from concatenated alignments. The HybPiper phylogeny is similar to Captus in both methods. The nuclear genes recover a deep split of Thladiantha in two clades, but this is not supported by the plastid data. CONCLUSIONS Captus is the best choice among the three pipelines in terms of computation time and locus recovery. Even though there is no significant topological difference between the Thladiantha species trees produced by the three pipelines, Captus yields a higher number of gene trees in agreement with the topology of the species tree (i.e., fewer genes in conflict with the species tree topology).
Collapse
Affiliation(s)
- Mustafa Raza
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Edgardo M Ortiz
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Lea Schwung
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Gentaro Shigita
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany
| | - Hanno Schaefer
- Plant Biodiversity Research, Dept. Life Science Systems, Technical University of Munich (TUM), Emil-Ramann-Str. 2, D-85354, Freising, Germany.
| |
Collapse
|
3
|
Renner SS. A time tree for the evolution of insect, vertebrate, wind, and water pollination in the angiosperms. THE NEW PHYTOLOGIST 2023; 240:464-465. [PMID: 37564031 DOI: 10.1111/nph.19201] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/12/2023]
Abstract
This article is a Commentary on Stephens et al. (2023), 240: 880–891.
Collapse
Affiliation(s)
- Susanne S Renner
- Department of Biology, Washington University, St Louis, MO, 63130, USA
| |
Collapse
|
4
|
Simmons MP, Goloboff PA, Stöver BC, Springer MS, Gatesy J. Quantification of congruence among gene trees with polytomies using overall success of resolution for phylogenomic coalescent analyses. Cladistics 2023; 39:418-436. [PMID: 37096985 DOI: 10.1111/cla.12540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/22/2023] [Accepted: 03/24/2023] [Indexed: 04/26/2023] Open
Abstract
Gene-tree-inference error can cause species-tree-inference artefacts in summary phylogenomic coalescent analyses. Here we integrate two ways of accommodating these inference errors: collapsing arbitrarily or dubiously resolved gene-tree branches, and subsampling gene trees based on their pairwise congruence. We tested the effect of collapsing gene-tree branches with 0% approximate-likelihood-ratio-test (SH-like aLRT) support in likelihood analyses and strict consensus trees for parsimony, and then subsampled those partially resolved trees based on congruence measures that do not penalize polytomies. For this purpose we developed a new TNT script for congruence sorting (congsort), and used it to calculate topological incongruence for eight phylogenomic datasets using three distance measures: standard Robinson-Foulds (RF) distances; overall success of resolution (OSR), which is based on counting both matching and contradicting clades; and RF contradictions, which only counts contradictory clades. As expected, we found that gene-tree incongruence was often concentrated in clades that are arbitrarily or dubiously resolved and that there was greater congruence between the partially collapsed gene trees and the coalescent and concatenation topologies inferred from those genes. Coalescent branch lengths typically increased as the most incongruent gene trees were excluded, although branch supports typically did not. We investigated two successful and complementary approaches to prioritizing genes for investigation of alignment or homology errors. Coalescent-tree clades that contradicted concatenation-tree clades were generally less robust to gene-tree subsampling than congruent clades. Our preferred approach to collapsing likelihood gene-tree clades (0% SH-like aLRT support) and subsampling those trees (OSR) generally outperformed competing approaches for a large fungal dataset with respect to branch lengths, support and congruence. We recommend widespread application of this approach (and strict consensus trees for parsimony-based analyses) for improving quantification of gene-tree congruence/conflict, estimating coalescent branch lengths, testing robustness of coalescent analyses to gene-tree-estimation error, and improving topological robustness of summary coalescent analyses. This approach is quick and easy to implement, even for huge datasets.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Pablo A Goloboff
- CONICET, INSUE, Fundación Miguel Lillo, Miguel Lillo 251, 4000, S.M. de Tucumán, Argentina
| | - Ben C Stöver
- Institute for Evolution and Biodiversity, WMU Münster, 48149, Münster, Germany
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA, 92521, USA
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, 10024, USA
| |
Collapse
|
5
|
Moreyra LD, Garcia-Jacas N, Roquet C, Ackerfield JR, Arabacı T, Blanco-Gavaldà C, Brochmann C, Calleja JA, Dirmenci T, Fujikawa K, Galbany-Casals M, Gao T, Gizaw A, López-Alvarado J, Mehregan I, Vilatersana R, Yıldız B, Leliaert F, Seregin AP, Susanna A. African Mountain Thistles: Three New Genera in the Carduus-Cirsium Group. PLANTS (BASEL, SWITZERLAND) 2023; 12:3083. [PMID: 37687332 PMCID: PMC10489743 DOI: 10.3390/plants12173083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 08/21/2023] [Accepted: 08/23/2023] [Indexed: 09/10/2023]
Abstract
The floras on the highest mountains in tropical eastern Africa are among the most unique floras in the world. Despite the exceptionally high concentration of endemic species, these floras remain understudied from an evolutionary point of view. In this study, we focus on the Carduus-Cirsium group (subtribe Carduinae) to unravel the evolutionary relationships of the species endemic to the tropical Afromontane and Afroalpine floras, aiming to improve the systematics of the group. We applied the Hyb-Seq approach using the Compositae1061 probe set on 190 samples (159 species), encompassing representatives of all genera of Carduinae. We used two recently developed pipelines that enabled the processing of raw sequence reads, identification of paralogous sequences and segregation into orthologous alignments. After the implementation of a missing data filter, we retained sequences from 986 nuclear loci and 177 plastid regions. Phylogenomic analyses were conducted using both concatenated and summary-coalescence methods. The resulting phylogenies were highly resolved and revealed three distinct evolutionary lineages consisting of the African species traditionally referred to as Carduus and Cirsium. Consequently, we propose the three new genera Afrocarduus, Afrocirsium and Nuriaea; the latter did notably not belong to the Carduus-Cirsium group. We detected some incongruences between the phylogenies based on concatenation vs. coalescence and on nuclear vs. plastid datasets, likely attributable to incomplete lineage sorting and/or hybridization.
Collapse
Affiliation(s)
- Lucía D. Moreyra
- Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia, s.n., 08038 Barcelona, Spain; (L.D.M.); (N.G.-J.); (R.V.)
| | - Núria Garcia-Jacas
- Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia, s.n., 08038 Barcelona, Spain; (L.D.M.); (N.G.-J.); (R.V.)
| | - Cristina Roquet
- Systematics and Evolution of Vascular Plants (UAB)—Associated Unit to CSIC by IBB, Autonomous University of Barcelona, 08193 Cerdanyola del Vallès, Spain; (C.R.); (C.B.-G.); (M.G.-C.)
| | | | - Turan Arabacı
- Department of Pharmaceutical Botany, Faculty of Pharmacy, Inönü University, 44280 Malatya, Türkiye;
| | - Carme Blanco-Gavaldà
- Systematics and Evolution of Vascular Plants (UAB)—Associated Unit to CSIC by IBB, Autonomous University of Barcelona, 08193 Cerdanyola del Vallès, Spain; (C.R.); (C.B.-G.); (M.G.-C.)
| | - Christian Brochmann
- Natural History Museum, University of Oslo, Blindern, 0318 Oslo, Norway; (C.B.); (A.G.)
| | - Juan Antonio Calleja
- Department of Biology, Autonomous University of Madrid, 28049 Madrid, Spain;
- Centro de Investigación en Biodiversidad y Cambio Global, Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | - Tuncay Dirmenci
- Department of Biology, Faculty of Necatibey Education, Balıkesir University, 10145 Balıkesir, Türkiye;
| | - Kazumi Fujikawa
- Kochi Prefectural Makino Botanical Garden, 4200-6, Godaisan, Kochi 781-8125, Japan;
| | - Mercè Galbany-Casals
- Systematics and Evolution of Vascular Plants (UAB)—Associated Unit to CSIC by IBB, Autonomous University of Barcelona, 08193 Cerdanyola del Vallès, Spain; (C.R.); (C.B.-G.); (M.G.-C.)
| | - Tiangang Gao
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China;
| | - Abel Gizaw
- Natural History Museum, University of Oslo, Blindern, 0318 Oslo, Norway; (C.B.); (A.G.)
- Department of Plant Biology and Biodiversity Management, Addis Ababa University, Addis Ababa P.O. Box 3434, Ethiopia
| | - Javier López-Alvarado
- Systematics and Evolution of Vascular Plants (UAB)—Associated Unit to CSIC by IBB, Autonomous University of Barcelona, 08193 Cerdanyola del Vallès, Spain; (C.R.); (C.B.-G.); (M.G.-C.)
| | - Iraj Mehregan
- Department of Biology, Science and Research Branch, Islamic Azad University, Tehran 1477893855, Iran;
| | - Roser Vilatersana
- Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia, s.n., 08038 Barcelona, Spain; (L.D.M.); (N.G.-J.); (R.V.)
| | - Bayram Yıldız
- Ismail Cem Street, No. 35, Yenikale District, 35320 Narlidere Türkiye;
| | | | - Alexey P. Seregin
- Faculty of Biology, M. V. Lomonosov Moscow State University, 119991 Moscow, Russia;
| | - Alfonso Susanna
- Botanic Institute of Barcelona (IBB), CSIC-Ajuntament de Barcelona, Pg. Migdia, s.n., 08038 Barcelona, Spain; (L.D.M.); (N.G.-J.); (R.V.)
| |
Collapse
|
6
|
Romeiro-Brito M, Khan G, Perez MF, Zappi DC, Taylor NP, Olsthoorn G, Franco FF, Moraes EM. Revisiting phylogeny, systematics, and biogeography of a Pleistocene radiation. AMERICAN JOURNAL OF BOTANY 2023; 110:1-17. [PMID: 36708517 DOI: 10.1002/ajb2.16134] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 01/03/2023] [Accepted: 01/05/2023] [Indexed: 05/11/2023]
Abstract
PREMISE Pilosocereus (Cactaceae) is an important dry forest element in all subregions and transitional zones of the neotropics, with the highest diversity in eastern Brazil. The genus is subdivided into informal taxonomic groups; however, most of these are not supported by recent molecular phylogenetic inferences. This lack of confidence is probably due to the use of an insufficient number of loci and the complexity of cactus diversification. Here, we explored the species relationships in Pilosocereus in more detail, integrating multilocus phylogenetic approaches with the assessment of the ancestral range and the effect of geography on diversification shifts. METHODS We used 28 nuclear, plastid, and mitochondrial loci from 54 plant samples of 31 Pilosocereus species for phylogenetic analyses. We used concatenated and coalescent phylogenetic trees and Bayesian models to estimate the most likely ancestral range and diversification shifts. RESULTS All Pilosocereus species were clustered in the same branch, except P. bohlei. The phylogenetic relationships were more associated with the geographic distribution than taxonomic affinities among taxa. The genus began diversifying during the Plio-Pleistocene transition in the Caatinga domain and experienced an increased diversification rate during the Calabrian age. CONCLUSIONS We recovered a well-supported multispecies coalescent phylogeny. Our results refine the pattern of rapid diversification of Pilosocereus species across neotropical drylands during the Pleistocene and highlight the need for taxonomic rearrangements in the genus. We recovered a pulse of diversification during the Pleistocene that was likely driven by multiple dispersal and vicariance events within and among the Caatinga, Cerrado, and Atlantic Forest domains.
Collapse
Affiliation(s)
- Monique Romeiro-Brito
- Departamento de Biologia, Universidade Federal de São Carlos (UFSCar), Sorocaba, SP, 18052-780, Brazil
| | - Gulzar Khan
- Institute for Biology and Environmental Sciences, Carl von Ossietzky-University Oldenburg, Carl von Ossietzky-Str. 9-11, 26111, Oldenburg, Germany
| | - Manolo F Perez
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), São Carlos, SP, 13565-905, Brazil
| | - Daniela C Zappi
- Programa de Pós-Graduação em Botânica, Instituto de Ciências Biológicas, Universidade de Brasília (UNB), PO Box 04457, Brasília, DF, 70910-970, Brazil
| | - Nigel P Taylor
- University of Gibraltar, Gibraltar Botanic Gardens Campus, The Alameda, PO Box 843, GX11 1AA, Gibraltar
| | | | - Fernando F Franco
- Departamento de Biologia, Universidade Federal de São Carlos (UFSCar), Sorocaba, SP, 18052-780, Brazil
| | - Evandro M Moraes
- Departamento de Biologia, Universidade Federal de São Carlos (UFSCar), Sorocaba, SP, 18052-780, Brazil
| |
Collapse
|
7
|
Hu H, Sun P, Yang Y, Ma J, Liu J. Genome-scale angiosperm phylogenies based on nuclear, plastome, and mitochondrial datasets. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2023. [PMID: 36647606 DOI: 10.1111/jipb.13455] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 01/16/2023] [Indexed: 06/17/2023]
Abstract
Angiosperms dominate the Earth's ecosystems and provide most of the basic necessities for human life. The major angiosperm clades comprise 64 orders, as recognized by the APG IV classification. However, the phylogenetic relationships of angiosperms remain unclear, as phylogenetic trees with different topologies have been reconstructed depending on the sequence datasets utilized, from targeted genes to transcriptomes. Here, we used currently available de novo genome data to reconstruct the phylogenies of 366 angiosperm species from 241 genera belonging to 97 families across 43 of the 64 orders based on orthologous genes from the nuclear, plastid, and mitochondrial genomes of the same species with compatible datasets. The phylogenetic relationships were largely consistent with previously constructed phylogenies based on sequence variations in each genome type. However, there were major inconsistencies in the phylogenetic relationships of the five Mesangiospermae lineages when different genomes were examined. We discuss ways to address these inconsistencies, which could ultimately lead to the reconstruction of a comprehensive angiosperm tree of life. The angiosperm phylogenies presented here provide a basic framework for further updates and comparisons. These phylogenies can also be used as guides to examine the evolutionary trajectories among the three genome types during lineage radiation.
Collapse
Affiliation(s)
- Hongyin Hu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Pengchuan Sun
- Key Laboratory for Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| | - Yongzhi Yang
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Jianxiang Ma
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, 730000, China
| | - Jianquan Liu
- State Key Laboratory of Grassland Agro-Ecosystems, College of Ecology, Lanzhou University, Lanzhou, 730000, China
- Key Laboratory for Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, China
| |
Collapse
|
8
|
Wang L, Li F, Wang N, Gao Y, Liu K, Zhang G, Sun J. Characterization of the Dicranostigma leptopodum chloroplast genome and comparative analysis within subfamily Papaveroideae. BMC Genomics 2022; 23:794. [PMID: 36460956 PMCID: PMC9717546 DOI: 10.1186/s12864-022-09049-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 11/25/2022] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND Dicranostigma leptopodum (Maxim.) Fedde is a perennial herb with bright yellow flowers, well known as "Hongmao Cao" for its medicinal properties, and is an excellent early spring flower used in urban greening. However, its molecular genomic information remains largely unknown. Here, we sequenced and analyzed the chloroplast genome of D. leptopodum to discover its genome structure, organization, and phylogenomic position within the subfamily Papaveroideae. RESULTS The chloroplast genome size of D. leptopodum was 162,942 bp, and D. leptopodum exhibited a characteristic circular quadripartite structure, with a large single-copy (LSC) region (87,565 bp), a small single-copy (SSC) region (18,759 bp) and a pair of inverted repeat (IR) regions (28,309 bp). The D. leptopodum chloroplast genome encoded 113 genes, including 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. The dynamics of the genome structures, genes, IR contraction and expansion, long repeats, and single sequence repeats exhibited similarities, with slight differences observed among the eight Papaveroideae species. In addition, seven interspace regions and three coding genes displayed highly variable divergence, signifying their potential to serve as molecular markers for phylogenetic and species identification studies. Molecular evolution analyses indicated that most of the genes were undergoing purifying selection. Phylogenetic analyses revealed that D. leptopodum formed a clade with the tribe Chelidonieae. CONCLUSIONS Our study provides detailed information on the D. leptopodum chloroplast genome, expanding the available genomic resources that may be used for future evolution and genetic diversity studies.
Collapse
Affiliation(s)
- Lei Wang
- grid.453074.10000 0000 9797 0900College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023 Henan China
| | - Fuxing Li
- grid.453074.10000 0000 9797 0900College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023 Henan China
| | - Ning Wang
- grid.453074.10000 0000 9797 0900College of Horticulture and Plant Protection, Henan University of Science and Technology, Luoyang, 471023 Henan China
| | - Yongwei Gao
- grid.66741.320000 0001 1456 856XLaboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, 100083 China
| | - Kangjia Liu
- grid.66741.320000 0001 1456 856XLaboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, 100083 China
| | - Gangmin Zhang
- grid.66741.320000 0001 1456 856XLaboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, 100083 China
| | - Jiahui Sun
- grid.410318.f0000 0004 0632 3409State Key Laboratory Breeding Base of Dao‑di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing, 100700 China
| |
Collapse
|
9
|
Mahbub S, Sawmya S, Saha A, Reaz R, Rahman MS, Bayzid MS. Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data. JOURNAL OF COMPUTATIONAL BIOLOGY : A JOURNAL OF COMPUTATIONAL MOLECULAR CELL BIOLOGY 2022; 29:1156-1172. [PMID: 36048555 DOI: 10.1089/cmb.2022.0212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Species tree estimation is frequently based on phylogenomic approaches that use multiple genes from throughout the genome. However, for a combination of reasons (ranging from sampling biases to more biological causes, as in gene birth and loss), gene trees are often incomplete, meaning that not all species of interest have a common set of genes. Incomplete gene trees can potentially impact the accuracy of phylogenomic inference. We, for the first time, introduce the problem of imputing the quartet distribution induced by a set of incomplete gene trees, which involves adding the missing quartets back to the quartet distribution. We present Quartet based Gene tree Imputation using Deep Learning (QT-GILD), an automated and specially tailored unsupervised deep learning technique, accompanied by cues from natural language processing, which learns the quartet distribution in a given set of incomplete gene trees and generates a complete set of quartets accordingly. QT-GILD is a general-purpose technique needing no explicit modeling of the subject system or reasons for missing data or gene tree heterogeneity. Experimental studies on a collection of simulated and empirical datasets suggest that QT-GILD can effectively impute the quartet distribution, which results in a dramatic improvement in the species tree accuracy. Remarkably, QT-GILD not only imputes the missing quartets but can also account for gene tree estimation error. Therefore, QT-GILD advances the state-of-the-art in species tree estimation from gene trees in the face of missing data.
Collapse
Affiliation(s)
- Sazan Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh.,Department of Computer Science, University of Maryland, College Park, Maryland, USA
| | - Shashata Sawmya
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Arpita Saha
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Rezwana Reaz
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - M Sohel Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| |
Collapse
|
10
|
Wu J, Zhang L, Shen C, Sin SYW, Lei C, Zhao H. Comparative transcriptome analysis reveals molecular adaptations underlying distinct immunity and inverted resting posture in bats. Integr Zool 2022; 18:493-505. [PMID: 36049759 DOI: 10.1111/1749-4877.12676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Understanding how natural selection shapes unique traits in mammals is a central topic in evolutionary biology. The mammalian order Chiroptera (bats) is attractive for biologists as well as the general public due to their specific traits of extraordinary immunity and inverted resting posture. However, genomic resources for bats that occupy key phylogenetic positions are not sufficient, which hinders comprehensive investigation of the molecular mechanisms underpinning the origin of specific traits in bats. Here, we sequenced the transcriptomes of five bats that are phylogenetically divergent and occupy key positions in the phylogenetic tree of bats. In combination with the available genomes of 19 bats and 21 other mammals, we built a database consisting of 10,918 one-to-one ortholog genes and reconstructed phylogenetic relationships of these mammals. We found that genes related to immunity, bone remodeling and cardiovascular system are targets of natural selection along the ancestral branch of bats. Further analyses revealed that the T cell receptor signaling pathway involved in immune adaptation is specifically enriched in bats. Moreover, molecular adaptations of bone remodeling, cardiovascular system, and balance sensing may help to explain the reverted resting posture in bats. Our study provides valuable transcriptome resources, enabling us to tentatively identify genetic changes associated with bat-specific traits. This work is among the first to advance our understanding of molecular underpinnings of inverted resting posture in bats, which could provide insight into healthcare applications such as hypertension in humans. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Jinwei Wu
- Engineering Research Center of Eco-environment in Three Gorges Reservoir Region of Ministry of Education, China Three Gorges University, Yichang, China
| | - Libiao Zhang
- Guangdong Key Laboratory of Animal Conservation and Resource Utilization, Guangdong Public Laboratory of Wild Animal Conservation and Utilization, Institute of Zoology, Guangdong Academy of Sciences, Guangzhou, China
| | - Chao Shen
- College of Life Sciences, Wuhan University, Wuhan, China
| | - Simon Yung Wa Sin
- School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR, China
| | - Caoqi Lei
- College of Life Sciences, Wuhan University, Wuhan, China
| | - Huabin Zhao
- College of Life Sciences, Wuhan University, Wuhan, China
| |
Collapse
|
11
|
Xiong H, Wang D, Shao C, Yang X, Yang J, Ma T, Davis CC, Liu L, Xi Z. Species Tree Estimation and the Impact of Gene Loss Following Whole-Genome Duplication. Syst Biol 2022; 71:1348-1361. [PMID: 35689633 PMCID: PMC9558847 DOI: 10.1093/sysbio/syac040] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 06/03/2022] [Accepted: 06/07/2022] [Indexed: 12/02/2022] Open
Abstract
Whole-genome duplication (WGD) occurs broadly and repeatedly across the history of eukaryotes and is recognized as a prominent evolutionary force, especially in plants. Immediately following WGD, most genes are present in two copies as paralogs. Due to this redundancy, one copy of a paralog pair commonly undergoes pseudogenization and is eventually lost. When speciation occurs shortly after WGD; however, differential loss of paralogs may lead to spurious phylogenetic inference resulting from the inclusion of pseudoorthologs–paralogous genes mistakenly identified as orthologs because they are present in single copies within each sampled species. The influence and impact of including pseudoorthologs versus true orthologs as a result of gene extinction (or incomplete laboratory sampling) are only recently gaining empirical attention in the phylogenomics community. Moreover, few studies have yet to investigate this phenomenon in an explicit coalescent framework. Here, using mathematical models, numerous simulated data sets, and two newly assembled empirical data sets, we assess the effect of pseudoorthologs on species tree estimation under varying degrees of incomplete lineage sorting (ILS) and differential gene loss scenarios following WGD. When gene loss occurs along the terminal branches of the species tree, alignment-based (BPP) and gene-tree-based (ASTRAL, MP-EST, and STAR) coalescent methods are adversely affected as the degree of ILS increases. This can be greatly improved by sampling a sufficiently large number of genes. Under the same circumstances, however, concatenation methods consistently estimate incorrect species trees as the number of genes increases. Additionally, pseudoorthologs can greatly mislead species tree inference when gene loss occurs along the internal branches of the species tree. Here, both coalescent and concatenation methods yield inconsistent results. These results underscore the importance of understanding the influence of pseudoorthologs in the phylogenomics era. [Coalescent method; concatenation method; incomplete lineage sorting; pseudoorthologs; single-copy gene; whole-genome duplication.]
Collapse
Affiliation(s)
- Haifeng Xiong
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Danying Wang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Chen Shao
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Xuchen Yang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Jialin Yang
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Tao Ma
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
| | - Liang Liu
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Zhenxiang Xi
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| |
Collapse
|
12
|
Kim D, Taylor AT, Near TJ. Phylogenomics and species delimitation of the economically important Black Basses (Micropterus). Sci Rep 2022; 12:9113. [PMID: 35668124 PMCID: PMC9170712 DOI: 10.1038/s41598-022-11743-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Accepted: 04/28/2022] [Indexed: 01/25/2023] Open
Abstract
Informed management and conservation efforts are vital to sustainable recreational fishing and biodiversity conservation. Because the taxonomic rank of species is important in conservation and management strategies, success of these efforts depends on accurate species delimitation. The Black Basses (Micropterus) are an iconic lineage of freshwater fishes that include some of the world’s most popular species for recreational fishing and world's most invasive species. Despite their popularity, previous studies to delimit species and lineages in Micropterus suffer from insufficient geographic coverage and uninformative molecular markers. Our phylogenomic analyses of ddRAD data result in the delimitation of 19 species of Micropterus, which includes 14 described species, the undescribed but well-known Altamaha, Bartram’s, and Choctaw basses, and two additional undescribed species currently classified as Smallmouth Bass (M. dolomieu). We provide a revised delimitation of species in the Largemouth Bass complex that necessitates a change in scientific nomenclature: Micropterus salmoides is retained for the Florida Bass and Micropterus nigricans is elevated from synonymy for the Largemouth Bass. The new understanding of diversity, distribution, and systematics of Black Basses will serve as important basis for the management and conservation of this charismatic and economically important clade of fishes.
Collapse
Affiliation(s)
- Daemin Kim
- Department of Ecology and Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT, 06511, USA.
| | - Andrew T Taylor
- Department of Biology, University of Central Oklahoma, Edmond, OK, 73034, USA.,Department of Biology, University of North Georgia, Dahlonega, GA, 30597, USA
| | - Thomas J Near
- Department of Ecology and Evolutionary Biology, Yale University, P.O. Box 208106, New Haven, CT, 06511, USA.,Peabody Museum, Yale University, New Haven, CT, 06511, USA
| |
Collapse
|
13
|
Cooper BJ, Moore MJ, Douglas NA, Wagner WL, Johnson MG, Overson RP, Kinosian SP, McDonnell AJ, Levin RA, Raguso RA, Flores Olvera H, Ochoterena H, Fant JB, Skogen KA, Wickett NJ. Target enrichment and extensive population sampling help untangle the recent, rapid radiation of Oenothera sect. Calylophus. Syst Biol 2022:6588089. [PMID: 35583314 DOI: 10.1093/sysbio/syac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Revised: 04/14/2022] [Accepted: 04/22/2022] [Indexed: 11/14/2022] Open
Abstract
Oenothera sect. Calylophus is a North American group of 13 recognized taxa in the evening primrose family (Onagraceae) with an evolutionary history that may include independent origins of bee pollination, edaphic endemism, and permanent translocation heterozygosity. Like other groups that radiated relatively recently and rapidly, taxon boundaries within Oenothera sect. Calylophus have remained challenging to circumscribe. In this study, we used target enrichment, flanking non-coding regions, gene tree/species tree methods, tests for gene flow modified for target-enrichment data, and morphometric analysis to reconstruct phylogenetic hypotheses, evaluate current taxon circumscriptions, and examine character evolution in Oenothera sect. Calylophus. Because sect. Calylophus comprises a clade with a relatively restricted geographic range, we were able to extensively sample across the range of geographic, edaphic and morphological diversity in the group. We found that the combination of exons and flanking non-coding regions led to improved support for species relationships. We reconstructed potential hybrid origins of some accessions and note that if processes such as hybridization are not taken into account, the number of inferred evolutionary transitions may be artificially inflated. We recovered strong evidence for multiple evolutionary origins of bee pollination from ancestral hawkmoth pollination, edaphic specialization on gypsum, and permanent translocation heterozygosity. This study applies newly emerging techniques alongside dense infraspecific sampling and morphological analyses to effectively reconstruct the recalcitrant history of a rapid radiation.
Collapse
Affiliation(s)
- Benjamin J Cooper
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Michael J Moore
- Oberlin College, Department of Biology, 119 Woodland St., Oberlin, OH 44074, USA
| | - Norman A Douglas
- Department of Biology, University of Florida, Gainesville, FL 32611, USA
| | - Warren L Wagner
- Department of Botany, MRC-166, Smithsonian Institution, PO Box 37012, Washington, DC 20013-7012, USA
| | - Matthew G Johnson
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Department of Biological Sciences, Texas Tech University, Box 43131 Lubbock, TX 79409, USA
| | - Rick P Overson
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,School of Sustainability, Arizona State University, PO Box 875502, Tempe, AZ 85287-5502, USA
| | - Sylvia P Kinosian
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA
| | - Angela J McDonnell
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA
| | - Rachel A Levin
- Department of Biology, Amherst College, 25 East Drive, Amherst, MA, 01002, USA
| | - Robert A Raguso
- Department of Neurobiology and Behavior, Cornell University, 215 Tower Road, Ithaca, NY 14853, USA
| | - Hilda Flores Olvera
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Helga Ochoterena
- Departamento de Botánica, Instituto de Biología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Jeremie B Fant
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Krissa A Skogen
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| | - Norman J Wickett
- The Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, 1000 Lake Cook Rd., Glencoe, IL 60022, USA.,Northwestern University,Program in Plant Biology and Conservation,O.T. Hogan Hall, Room, 6-140B, 2205 Tech Drive, Evanston, IL 60208, USA
| |
Collapse
|
14
|
Luo J, Chen J, Guo W, Yang Z, Lim KJ, Wang Z. Reassessment of Annamocarya sinesis ( Carya sinensis) Taxonomy through Concatenation and Coalescence Phylogenetic Analysis. PLANTS (BASEL, SWITZERLAND) 2021; 11:plants11010052. [PMID: 35009055 PMCID: PMC8747223 DOI: 10.3390/plants11010052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Revised: 12/21/2021] [Accepted: 12/22/2021] [Indexed: 05/20/2023]
Abstract
Due to its peculiar morphological characteristics, there is dispute as to whether the genus of Annamocarya sinensis, a species of Juglandaceae, is Annamocarya or Carya. Most morphologists believe it should be distinguished from the Carya genus while genomicists suggest that A. sinensis belongs to the Carya genus. To explore the taxonomic status of A. sinensis using chloroplast genes, we collected chloroplast genomes of 16 plant species and assembled chloroplast genomes of 10 unpublished Carya species. We analyzed all 26 species' chloroplast genomes through two analytical approaches (concatenation and coalescence), using the entire and unique chloroplast coding sequence (CDS) and entire and protein sequences. Our results indicate that the analysis of the CDS and protein sequences or unique CDS and unique protein sequence of chloroplast genomes shows that A. sinensis indeed belongs to the Carya genus. In addition, our analysis shows that, compared to single chloroplast genes, the phylogeny trees constructed using numerous genes showed higher consistency. Moreover, the phylogenetic analysis calculated with the coalescence method and unique gene sequences was more robust than that done with the concatenation method, particularly for analyzing phylogenetically controversial species. Through the analysis, our results concluded that A. sinensis should be called C. sinensis.
Collapse
Affiliation(s)
- Jie Luo
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
| | - Junhao Chen
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
- Department of Biology, Saint Louis University, St. Louis, MO 63104, USA
| | - Wenlei Guo
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Zhengfu Yang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
| | - Kean-Jin Lim
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
- Correspondence: (K.-J.L.); (Z.W.)
| | - Zhengjia Wang
- State Key Laboratory of Subtropical Silviculture, College of Forestry and Biotechnology, Zhejiang A&F University, Lin’an, Hangzhou 311300, China; (J.L.); (J.C.); (W.G.); (Z.Y.)
- Correspondence: (K.-J.L.); (Z.W.)
| |
Collapse
|
15
|
Van Damme K, Cornetti L, Fields PD, Ebert D. Whole-Genome Phylogenetic Reconstruction as a Powerful Tool to Reveal Homoplasy and Ancient Rapid Radiation in Waterflea Evolution. Syst Biol 2021; 71:777-787. [PMID: 34850935 PMCID: PMC9203061 DOI: 10.1093/sysbio/syab094] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 10/04/2021] [Accepted: 11/15/2021] [Indexed: 11/28/2022] Open
Abstract
Although phylogeny estimation is notoriously difficult in radiations that occurred several hundred million years ago, phylogenomic approaches offer new ways to examine relationships among ancient lineages and evaluate hypotheses that are key to evolutionary biology. Here, we reconstruct the deep-rooted relationships of one of the oldest living arthropod clades, the branchiopod crustaceans, using a kaleidoscopic approach. We use concatenation and coalescent tree-building methods to analyze a large multigene data set at the nucleotide and amino acid level and examine gene tree versus species tree discordance. We unequivocally resolve long-debated relationships among extant orders of the Cladocera, the waterfleas, an ecologically relevant zooplankton group in global aquatic and marine ecosystems that is famous for its model systems in ecology and evolution. To build the data set, we assembled eight de novo genomes of key taxa including representatives of all extant cladoceran orders and suborders. Our phylogenetic analysis focused on a BUSCO-based set of 823 conserved single-copy orthologs shared among 23 representative taxa spanning all living branchiopod orders, including 11 cladoceran families. Our analysis supports the monophyly of the Cladocera and reveals remarkable homoplasy in their body plans. We found large phylogenetic distances between lineages with similar ecological specializations, indicating independent evolution in major body plans, such as in the pelagic predatory orders Haplopoda and Onychopoda (the “Gymnomera”). In addition, we assessed rapid cladogenesis by estimating relative timings of divergence in major lineages using reliable fossil-calibrated priors on eight nodes in the branchiopod tree, suggesting a Paleozoic origin around 325 Ma for the cladoceran ancestor and an ancient rapid radiation around 252 Ma at the Perm/Triassic boundary. These findings raise new questions about the roles of homoplasy and rapid radiation in the diversification of the cladocerans and help examine trait evolution from a genomic perspective in a functionally well understood, ancient arthropod group. [Cladocera; Daphnia; evolution; homoplasy; molecular clock; phylogenomics; systematics; waterfleas.]
Collapse
Affiliation(s)
- Kay Van Damme
- Centre for Academic Heritage and Archives & Ghent University Botanical Garden, Ghent University, K.L. Ledeganckstraat 35, 9000 Ghent, Belgium.,Tvärminne Zoological Station (TZS), University of Helsinki, J.A. Palménin tie 260, Hanko, Finland
| | - Luca Cornetti
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Peter D Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| | - Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, 4051 Basel, Switzerland
| |
Collapse
|
16
|
Simmons MP, Springer MS, Gatesy J. Gene-tree misrooting drives conflicts in phylogenomic coalescent analyses of palaeognath birds. Mol Phylogenet Evol 2021; 167:107344. [PMID: 34748873 DOI: 10.1016/j.ympev.2021.107344] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Revised: 10/08/2021] [Accepted: 11/02/2021] [Indexed: 10/19/2022]
Abstract
Phylogenomic analyses of ancient rapid radiations can produce conflicting results that are driven by differential sampling of taxa and characters as well as the limitations of alternative analytical methods. We re-examine basal relationships of palaeognath birds (ratites and tinamous) using recently published datasets of nucleotide characters from 20,850 loci as well as 4301 retroelement insertions. The original studies attributed conflicting resolutions of rheas in their inferred coalescent and concatenation trees to concatenation failing in the anomaly zone. By contrast, we find that the coalescent-based resolution of rheas is premised upon extensive gene-tree estimation errors. Furthermore, retroelement insertions contain much more conflict than originally reported and multiple insertion loci support the basal position of rheas found in concatenation trees, while none were reported in the original publication. We demonstrate how even remarkable congruence in phylogenomic studies may be driven by long-branch misplacement of a divergent outgroup, highly incongruent gene trees, differential taxon sampling that can result in gene-tree misrooting errors that bias species-tree inference, and gross homology errors. What was previously interpreted as broad, robustly supported corroboration for a single resolution in coalescent analyses may instead indicate a common bias that taints phylogenomic results across multiple genome-scale datasets. The updated retroelement dataset now supports a species tree with branch lengths that suggest an ancient anomaly zone, and both concatenation and coalescent analyses of the huge nucleotide datasets fail to yield coherent, reliable results in this challenging phylogenetic context.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO 80523, USA.
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA 92521, USA
| | - John Gatesy
- Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA
| |
Collapse
|
17
|
Nesi N, Tsagkogeorga G, Tsang SM, Nicolas V, Lalis A, Scanlon AT, Riesle-Sbarbaro SA, Wiantoro S, Hitch AT, Juste J, Pinzari CA, Bonaccorso FJ, Todd CM, Lim BK, Simmons NB, McGowen MR, Rossiter SJ. Interrogating Phylogenetic Discordance Resolves Deep Splits in the Rapid Radiation of Old World Fruit Bats (Chiroptera: Pteropodidae). Syst Biol 2021; 70:1077-1089. [PMID: 33693838 PMCID: PMC8513763 DOI: 10.1093/sysbio/syab013] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2019] [Revised: 04/27/2021] [Accepted: 03/03/2021] [Indexed: 11/14/2022] Open
Abstract
The family Pteropodidae (Old World fruit bats) comprises $>$200 species distributed across the Old World tropics and subtropics. Most pteropodids feed on fruit, suggesting an early origin of frugivory, although several lineages have shifted to nectar-based diets. Pteropodids are of exceptional conservation concern with $>$50% of species considered threatened, yet the systematics of this group has long been debated, with uncertainty surrounding early splits attributed to an ancient rapid diversification. Resolving the relationships among the main pteropodid lineages is essential if we are to fully understand their evolutionary distinctiveness, and the extent to which these bats have transitioned to nectar-feeding. Here we generated orthologous sequences for $>$1400 nuclear protein-coding genes (2.8 million base pairs) across 114 species from 43 genera of Old World fruit bats (57% and 96% of extant species- and genus-level diversity, respectively), and combined phylogenomic inference with filtering by information content to resolve systematic relationships among the major lineages. Concatenation and coalescent-based methods recovered three distinct backbone topologies that were not able to be reconciled by filtering via phylogenetic information content. Concordance analysis and gene genealogy interrogation show that one topology is consistently the best supported, and that observed phylogenetic conflicts arise from both gene tree error and deep incomplete lineage sorting. In addition to resolving long-standing inconsistencies in the reported relationships among major lineages, we show that Old World fruit bats have likely undergone at least seven independent dietary transitions from frugivory to nectarivory. Finally, we use this phylogeny to identify and describe one new genus. [Chiroptera; coalescence; concordance; incomplete lineage sorting; nectar feeder; species tree; target enrichment.].
Collapse
Affiliation(s)
- Nicolas Nesi
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
| | - Georgia Tsagkogeorga
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
| | - Susan M Tsang
- Department of Mammalogy, Division of Vertebrate Zoology, American Museum of Natural History, New York, USA
- Zoology Section, National Museum of Natural History, Manila, Philippines
| | - Violaine Nicolas
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - Aude Lalis
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - Annette T Scanlon
- School of Natural and Built Environments, University of South Australia, Mawson Lakes, SA, Australia
| | - Silke A Riesle-Sbarbaro
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Institute of Zoology, Zoological Society of London, London, UK
- Centre for Biological Threats and Special Pathogens, Robert Koch Institute, Berlin, Germany
| | - Sigit Wiantoro
- Museum Zoologicum Bogoriense, Research Center for Biology, Indonesian Institute of Sciences, Cibinong, Indonesia
| | - Alan T Hitch
- Department of Wildlife, Fish, and Conservation Biology, University of California Davis, CA, USA
| | - Javier Juste
- Estación Biológica de Doñana (CSIC), Avda. Américo Vespucio, Sevilla, Spain
| | | | | | - Christopher M Todd
- The Hawkesbury institute for the Environment, Western Sydney University, Australia
| | - Burton K Lim
- Royal Ontario Museum, Toronto, ON M5S 2C6, Canada
| | - Nancy B Simmons
- Department of Mammalogy, Division of Vertebrate Zoology, American Museum of Natural History, New York, USA
| | - Michael R McGowen
- Department of Vertebrate Zoology, Smithsonian National Museum of Natural History, Washington, DC, USA
| | - Stephen J Rossiter
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK
| |
Collapse
|
18
|
Shah T, Schneider JV, Zizka G, Maurin O, Baker W, Forest F, Brewer GE, Savolainen V, Darbyshire I, Larridon I. Joining forces in Ochnaceae phylogenomics: a tale of two targeted sequencing probe kits. AMERICAN JOURNAL OF BOTANY 2021; 108:1201-1216. [PMID: 34180046 DOI: 10.1002/ajb2.1682] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 02/23/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Both universal and family-specific targeted sequencing probe kits are becoming widely used for reconstruction of phylogenetic relationships in angiosperms. Within the pantropical Ochnaceae, we show that with careful data filtering, universal kits are equally as capable in resolving intergeneric relationships as custom probe kits. Furthermore, we show the strength in combining data from both kits to mitigate bias and provide a more robust result to resolve evolutionary relationships. METHODS We sampled 23 Ochnaceae genera and used targeted sequencing with two probe kits, the universal Angiosperms353 kit and a family-specific kit. We used maximum likelihood inference with a concatenated matrix of loci and multispecies-coalescence approaches to infer relationships in the family. We explored phylogenetic informativeness and the impact of missing data on resolution and tree support. RESULTS For the Angiosperms353 data set, the concatenation approach provided results more congruent with those of the Ochnaceae-specific data set. Filtering missing data was most impactful on the Angiosperms353 data set, with a relaxed threshold being the optimum scenario. The Ochnaceae-specific data set resolved consistent topologies using both inference methods, and no major improvements were obtained after data filtering. Merging of data obtained with the two kits resulted in a well-supported phylogenetic tree. CONCLUSIONS The Angiosperms353 data set improved upon data filtering, and missing data played an important role in phylogenetic reconstruction. The Angiosperms353 data set resolved the phylogenetic backbone of Ochnaceae as equally well as the family specific data set. All analyses indicated that both Sauvagesia L. and Campylospermum Tiegh. as currently circumscribed are polyphyletic and require revised delimitation.
Collapse
Affiliation(s)
- Toral Shah
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | - Julio V Schneider
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
| | - Georg Zizka
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
- Institute of Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Str. 13, Frankfurt am Main, 60438, Germany
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - William Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Grace E Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Vincent Savolainen
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | | | - Isabel Larridon
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, K.L., Ledeganckstraat 35, Gent, 9000, Belgium
| |
Collapse
|
19
|
Mahbub M, Wahab Z, Reaz R, Rahman MS, Bayzid MS. wQFM: Highly Accurate Genome-scale Species Tree Estimation from Weighted Quartets. Bioinformatics 2021; 37:3734-3743. [PMID: 34086858 DOI: 10.1093/bioinformatics/btab428] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 05/24/2021] [Accepted: 06/03/2021] [Indexed: 02/01/2023] Open
Abstract
MOTIVATION Species tree estimation from genes sampled from throughout the whole genome is complicated due to the gene tree-species tree discordance. Incomplete lineage sorting (ILS) is one of the most frequent causes for this discordance, where alleles can coexist in populations for periods that may span several speciation events. Quartet-based summary methods for estimating species trees from a collection of gene trees are becoming popular due to their high accuracy and statistical guarantee under ILS. Generating quartets with appropriate weights, where weights correspond to the relative importance of quartets, and subsequently amalgamating the weighted quartets to infer a single coherent species tree can allow for a statistically consistent way of estimating species trees. However, handling weighted quartets is challenging. RESULTS We propose wQFM, a highly accurate method for species tree estimation from multi-locus data, by extending the quartet FM (QFM) algorithm to a weighted setting. wQFM was assessed on a collection of simulated and real biological datasets, including the avian phylogenomic dataset which is one of the largest phylogenomic datasets to date. We compared wQFM with wQMC, which is the best alternate method for weighted quartet amalgamation, and with ASTRAL, which is one of the most accurate and widely used coalescent-based species tree estimation methods. Our results suggest that wQFM matches or improves upon the accuracy of wQMC and ASTRAL. AVAILABILITY wQFM is available in open source form at https://github.com/Mahim1997/wQFM-2020. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mahim Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Zahin Wahab
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Rezwana Reaz
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - M Saifur Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka-1205, Bangladesh
| |
Collapse
|
20
|
Li X, Hou Z, Xu C, Shi X, Yang L, Lewis LA, Zhong B. Large Phylogenomic Data sets Reveal Deep Relationships and Trait Evolution in Chlorophyte Green Algae. Genome Biol Evol 2021; 13:6265471. [PMID: 33950183 PMCID: PMC8271138 DOI: 10.1093/gbe/evab101] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/04/2021] [Indexed: 12/01/2022] Open
Abstract
The chlorophyte green algae (Chlorophyta) are species-rich ancient groups ubiquitous in various habitats with high cytological diversity, ranging from microscopic to macroscopic organisms. However, the deep phylogeny within core Chlorophyta remains unresolved, in part due to the relatively sparse taxon and gene sampling in previous studies. Here we contribute new transcriptomic data and reconstruct phylogenetic relationships of core Chlorophyta based on four large data sets up to 2,698 genes of 70 species, representing 80% of extant orders. The impacts of outgroup choice, missing data, bootstrap-support cutoffs, and model misspecification in phylogenetic inference of core Chlorophyta are examined. The species tree topologies of core Chlorophyta from different analyses are highly congruent, with strong supports at many relationships (e.g., the Bryopsidales and the Scotinosphaerales-Dasycladales clade). The monophyly of Chlorophyceae and of Trebouxiophyceae as well as the uncertain placement of Chlorodendrophyceae and Pedinophyceae corroborate results from previous studies. The reconstruction of ancestral scenarios illustrates the evolution of the freshwater-sea and microscopic–macroscopic transition in the Ulvophyceae, and the transformation of unicellular→colonial→multicellular in the chlorophyte green algae. In addition, we provided new evidence that serine is encoded by both canonical codons and noncanonical TAG code in Scotinosphaerales, and stop-to-sense codon reassignment in the Ulvophyceae has originated independently at least three times. Our robust phylogenetic framework of core Chlorophyta unveils the evolutionary history of phycoplast, cyto-morphology, and noncanonical genetic codes in chlorophyte green algae.
Collapse
Affiliation(s)
- Xi Li
- College of Life Sciences, Nanjing Normal University, China
| | - Zheng Hou
- College of Life Sciences, Nanjing Normal University, China
| | - Chenjie Xu
- College of Life Sciences, Nanjing Normal University, China
| | - Xuan Shi
- College of Life Sciences, Nanjing Normal University, China
| | - Lingxiao Yang
- College of Life Sciences, Nanjing Normal University, China
| | - Louise A Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Bojian Zhong
- College of Life Sciences, Nanjing Normal University, China
| |
Collapse
|
21
|
Kulkarni S, Kallal RJ, Wood H, Dimitrov D, Giribet G, Hormiga G. Interrogating Genomic-Scale Data to Resolve Recalcitrant Nodes in the Spider Tree of Life. Mol Biol Evol 2021; 38:891-903. [PMID: 32986823 PMCID: PMC7947752 DOI: 10.1093/molbev/msaa251] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Genome-scale data sets are converging on robust, stable phylogenetic hypotheses for many lineages; however, some nodes have shown disagreement across classes of data. We use spiders (Araneae) as a system to identify the causes of incongruence in phylogenetic signal between three classes of data: exons (as in phylotranscriptomics), noncoding regions (included in ultraconserved elements [UCE] analyses), and a combination of both (as in UCE analyses). Gene orthologs, coded as amino acids and nucleotides (with and without third codon positions), were generated by querying published transcriptomes for UCEs, recovering 1,931 UCE loci (codingUCEs). We expected that congeners represented in the codingUCE and UCEs data would form clades in the presence of phylogenetic signal. Noncoding regions derived from UCE sequences were recovered to test the stability of relationships. Phylogenetic relationships resulting from all analyses were largely congruent. All nucleotide data sets from transcriptomes, UCEs, or a combination of both recovered similar topologies in contrast with results from transcriptomes analyzed as amino acids. Most relationships inferred from low-occupancy data sets, containing several hundreds of loci, were congruent across Araneae, as opposed to high occupancy data matrices with fewer loci, which showed more variation. Furthermore, we found that low-occupancy data sets analyzed as nucleotides (as is typical of UCE data sets) can result in more congruent relationships than high occupancy data sets analyzed as amino acids (as in phylotranscriptomics). Thus, omitting data, through amino acid translation or via retention of only high occupancy loci, may have a deleterious effect in phylogenetic reconstruction.
Collapse
Affiliation(s)
- Siddharth Kulkarni
- Department of Biological Sciences, The George Washington University, Washington, DC
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC
| | - Robert J Kallal
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC
| | - Hannah Wood
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC
| | - Dimitar Dimitrov
- Department of Natural History, University Museum of Bergen, University of Bergen, Bergen, Norway
| | - Gonzalo Giribet
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA
| | - Gustavo Hormiga
- Department of Biological Sciences, The George Washington University, Washington, DC
| |
Collapse
|
22
|
Collapsing dubiously resolved gene-tree branches in phylogenomic coalescent analyses. Mol Phylogenet Evol 2021; 158:107092. [PMID: 33545272 DOI: 10.1016/j.ympev.2021.107092] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 12/30/2020] [Accepted: 01/28/2021] [Indexed: 01/15/2023]
Abstract
In two-step coalescent analyses of phylogenomic data, gene-tree topologies are treated as fixed prior to species-tree inference. Although all gene-tree conflict is assumed to be caused by lineage sorting when applying these methods, in empirical datasets much of the conflict can be caused by estimation error. Weakly supported and even arbitrarily resolved clades are important sources of this estimation error for gene trees inferred from few informative characters relative to the number of sampled terminals, and the resulting extraneous conflict among gene trees can negatively impact species-tree inference. In this study, we quantified the relative severity of alternative methods for collapsing gene-tree branches for seven empirical datasets and quantified their effects on species-tree inference. The branch-collapsing methods that we employed were based on the strict consensus of optimal topologies, various bootstrap thresholds, and 0% approximate likelihood ratio test (SH-like aLRT) support. Up to 86% of internal gene-tree branches are dubiously or arbitrarily resolved in reanalyses of these published phylogenomic datasets, and collapsing these branches increased inferred species-tree coalescent branch lengths by up to 455%. For two datasets, the longer inferred branch lengths sometimes impacted inference of anomaly-zone conditions. Although branch-collapsing methods did not consistently affect the species-tree topology, they often increased branch support. The more severe and clearly justified gene-tree branch-collapsing methods, which we recommend be broadly applied for two-step coalescent analyses, are use of the strict consensus in parsimony analyses and the collapse clades with 0% SH-like aLRT support in likelihood analyses. Collapsing dubiously or arbitrarily resolved branches in gene trees sometimes improved congruence between coalescent-based results and concatenation trees. In such cases, we contend that the resolution provided by concatenation should be preferred and that incomplete lineage sorting is a poor explanation for the initial conflict between phylogenetic approaches.
Collapse
|
23
|
Steenwyk JL, Buida TJ, Li Y, Shen XX, Rokas A. ClipKIT: A multiple sequence alignment trimming software for accurate phylogenomic inference. PLoS Biol 2020; 18:e3001007. [PMID: 33264284 PMCID: PMC7735675 DOI: 10.1371/journal.pbio.3001007] [Citation(s) in RCA: 163] [Impact Index Per Article: 40.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 12/14/2020] [Accepted: 11/10/2020] [Indexed: 12/22/2022] Open
Abstract
Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However, a recent study reported that doing so can worsen inference, underscoring the need for alternative alignment trimming strategies. Here, we introduce ClipKIT, an alignment trimming software that, rather than identifying and removing putatively phylogenetically uninformative sites, instead aims to identify and retain parsimony-informative sites, which are known to be phylogenetically informative. To test the efficacy of ClipKIT, we examined the accuracy and support of phylogenies inferred from 14 different alignment trimming strategies, including those implemented in ClipKIT, across nearly 140,000 alignments from a broad sampling of evolutionary histories. Phylogenies inferred from ClipKIT-trimmed alignments are accurate, robust, and time saving. Furthermore, ClipKIT consistently outperformed other trimming methods across diverse datasets, suggesting that strategies based on identifying and retaining parsimony-informative sites provide a robust framework for alignment trimming.
Collapse
Affiliation(s)
- Jacob L. Steenwyk
- Vanderbilt University, Department of Biological Sciences, Nashville, Tennessee, United States of America
- * E-mail: (JLS); (AR)
| | | | - Yuanning Li
- Vanderbilt University, Department of Biological Sciences, Nashville, Tennessee, United States of America
| | - Xing-Xing Shen
- Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Antonis Rokas
- Vanderbilt University, Department of Biological Sciences, Nashville, Tennessee, United States of America
- * E-mail: (JLS); (AR)
| |
Collapse
|
24
|
Cai L, Xi Z, Lemmon EM, Lemmon AR, Mast A, Buddenhagen CE, Liu L, Davis CC. The Perfect Storm: Gene Tree Estimation Error, Incomplete Lineage Sorting, and Ancient Gene Flow Explain the Most Recalcitrant Ancient Angiosperm Clade, Malpighiales. Syst Biol 2020; 70:491-507. [PMID: 33169797 DOI: 10.1093/sysbio/syaa083] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 10/20/2020] [Accepted: 10/28/2020] [Indexed: 12/20/2022] Open
Abstract
The genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order includes 9 of the top 10 most unstable nodes in angiosperms, which have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 10.0$\%$, 34.8$\%$, and 21.4$\%$ of the variation, respectively. Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution. [Coalescent; concatenation; flanking region; hybrid enrichment, introgression; phylogenomics; rapid radiation, triplet frequency.].
Collapse
Affiliation(s)
- Liming Cai
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Zhenxiang Xi
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Emily Moriarty Lemmon
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
| | - Alan R Lemmon
- Department of Scientific Computing, Florida State University, Tallahassee, FL 32306, USA
| | - Austin Mast
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
| | - Christopher E Buddenhagen
- Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
- AgResearch, 10 Bisley Road, Hamilton 3214, New Zealand
| | - Liang Liu
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Charles C Davis
- Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
| |
Collapse
|
25
|
Feng Y, Comes HP, Qiu YX. Phylogenomic insights into the temporal-spatial divergence history, evolution of leaf habit and hybridization in Stachyurus (Stachyuraceae). Mol Phylogenet Evol 2020; 150:106878. [DOI: 10.1016/j.ympev.2020.106878] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 05/07/2020] [Accepted: 06/01/2020] [Indexed: 12/14/2022]
|
26
|
Perea S, Sousa‐Santos C, Robalo J, Doadrio I. Multilocus phylogeny and systematics of Iberian endemicSqualius(Actinopterygii, Leuciscidae). ZOOL SCR 2020. [DOI: 10.1111/zsc.12420] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Silvia Perea
- Department of Biodiversity and Evolutionary Biology Museo Nacional de Ciencias Naturales - CSIC Madrid Spain
| | - Carla Sousa‐Santos
- MARE – Marine and Environmental Sciences Centre ISPA‐Instituto Universitário Lisbon Portugal
| | - Joana Robalo
- MARE – Marine and Environmental Sciences Centre ISPA‐Instituto Universitário Lisbon Portugal
| | - Ignacio Doadrio
- Department of Biodiversity and Evolutionary Biology Museo Nacional de Ciencias Naturales - CSIC Madrid Spain
| |
Collapse
|
27
|
Water lily ( Nymphaea thermarum) genome reveals variable genomic signatures of ancient vascular cambium losses. Proc Natl Acad Sci U S A 2020; 117:8649-8656. [PMID: 32234787 DOI: 10.1073/pnas.1922873117] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
For more than 225 million y, all seed plants were woody trees, shrubs, or vines. Shortly after the origin of angiosperms ∼140 million y ago (MYA), the Nymphaeales (water lilies) became one of the first lineages to deviate from their ancestral, woody habit by losing the vascular cambium, the meristematic population of cells that produces secondary xylem (wood) and phloem. Many of the genes and gene families that regulate differentiation of secondary tissues also regulate the differentiation of primary xylem and phloem, which are produced by apical meristems and retained in nearly all seed plants. Here, we sequenced and assembled a draft genome of the water lily Nymphaea thermarum, an emerging system for the study of early flowering plant evolution, and compared it to genomes from other cambium-bearing and cambium-less lineages (e.g., monocots and Nelumbo). This revealed lineage-specific patterns of gene loss and divergence. Nymphaea is characterized by a significant contraction of the HD-ZIP III transcription factors, specifically loss of REVOLUTA, which influences cambial activity in other angiosperms. We also found the Nymphaea and monocot copies of cambium-associated CLE signaling peptides display unique substitutions at otherwise highly conserved amino acids. Nelumbo displays no obvious divergence in cambium-associated genes. The divergent genomic signatures of convergent loss of vascular cambium reveals that even pleiotropic genes can exhibit unique divergence patterns in association with independent events of trait loss. Our results shed light on the evolution of herbaceousness-one of the key biological innovations associated with the earliest phases of angiosperm evolution.
Collapse
|
28
|
Yang L, Su D, Chang X, Foster CS, Sun L, Huang CH, Zhou X, Zeng L, Ma H, Zhong B. Phylogenomic Insights into Deep Phylogeny of Angiosperms Based on Broad Nuclear Gene Sampling. PLANT COMMUNICATIONS 2020; 1:100027. [PMID: 33367231 PMCID: PMC7747974 DOI: 10.1016/j.xplc.2020.100027] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2019] [Revised: 01/23/2020] [Accepted: 01/25/2020] [Indexed: 05/02/2023]
Abstract
Angiosperms (flowering plants) are the most diverse and species-rich group of plants. The vast majority (∼99.95%) of angiosperms form a clade called Mesangiospermae, which is subdivided into five major groups: eudicots, monocots, magnoliids, Chloranthales, and Ceratophyllales. The relationships among these Mesangiospermae groups have been the subject of long debate. In this study, we assembled a phylogenomic dataset of 1594 genes from 151 angiosperm taxa, including representatives of all five lineages, to investigate the phylogeny of major angiosperm lineages under both coalescent- and concatenation-based methods. We dissected the phylogenetic signal and found that more than half of the genes lack phylogenetic information for the backbone of angiosperm phylogeny. We further removed the genes with weak phylogenetic signal and showed that eudicots, Ceratophyllales, and Chloranthales form a clade, with magnoliids and monocots being the next successive sister lineages. Similar frequencies of gene tree conflict are suggestive of incomplete lineage sorting along the backbone of the angiosperm phylogeny. Our analyses suggest that a fully bifurcating species tree may not be the best way to represent the early radiation of angiosperms. Meanwhile, we inferred that the crown-group angiosperms originated approximately between 255.1 and 222.2 million years ago, and Mesangiospermae diversified into the five extant groups in a short time span (∼27 million years) at the Early to Late Jurassic.
Collapse
Affiliation(s)
- Lingxiao Yang
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Danyan Su
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Xin Chang
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Charles S.P. Foster
- School of Life and Environmental Sciences, University of Sydney, Sydney, Australia
| | - Linhua Sun
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Chien-Hsun Huang
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center for Genetics and Development, Ministry of Education Key Laboratory of Biodiversity Sciences and Ecological Engineering, School of Life Sciences, Fudan University, Shanghai, China
| | - Xiaofan Zhou
- Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, China
| | - Liping Zeng
- Institute for Integrative Genome Biology and Department of Botany and Plant Sciences, University of California, Riverside, CA, USA
| | - Hong Ma
- Department of Biology, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, USA
| | - Bojian Zhong
- College of Life Sciences, Nanjing Normal University, Nanjing, China
| |
Collapse
|
29
|
Goremykin V. A Novel Test for Absolute Fit of Evolutionary Models Provides a Means to Correctly Identify the Substitution Model and the Model Tree. Genome Biol Evol 2020; 11:2403-2419. [PMID: 31368483 PMCID: PMC6736042 DOI: 10.1093/gbe/evz167] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2019] [Indexed: 02/07/2023] Open
Abstract
A novel test is described that visualizes the absolute model-data fit of the substitution and tree components of an evolutionary model. The test utilizes statistics based on counts of character state matches and mismatches in alignments of observed and simulated sequences. This comparison is used to assess model-data fit. In simulations conducted to evaluate the performance of the test, the test estimator was able to identify both the correct tree topology and substitution model under conditions where the Goldman-Cox test-which tests the fit of a substitution model to sequence data and is also based on comparing simulated replicates with observed data-showed high error rates. The novel test was found to identify the correct tree topology within a wide range of DNA substitution model misspecifications, indicating the high discriminatory power of the test. Use of this test provides a practical approach for assessing absolute model-data fit when testing phylogenetic hypotheses.
Collapse
Affiliation(s)
- Vadim Goremykin
- Research and Innovation Centre, Fondazione Edmund Mach, San Michele all'Adige, Trentino, Italy
| |
Collapse
|
30
|
Murphy B, Forest F, Barraclough T, Rosindell J, Bellot S, Cowan R, Golos M, Jebb M, Cheek M. A phylogenomic analysis of Nepenthes (Nepenthaceae). Mol Phylogenet Evol 2020; 144:106668. [DOI: 10.1016/j.ympev.2019.106668] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 10/28/2019] [Accepted: 10/29/2019] [Indexed: 10/25/2022]
|
31
|
Neupane S, Fučíková K, Lewis LA, Kuo L, Chen MH, Lewis PO. Assessing Combinability of Phylogenomic Data Using Bayes Factors. Syst Biol 2020; 68:744-754. [PMID: 30726954 DOI: 10.1093/sysbio/syz007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2018] [Revised: 01/26/2019] [Accepted: 02/04/2019] [Indexed: 11/14/2022] Open
Abstract
With the rapid reduction in sequencing costs of high-throughput genomic data, it has become commonplace to use hundreds of genes to infer phylogeny of any study system. While sampling a large number of genes has given us a tremendous opportunity to uncover previously unknown relationships and improve phylogenetic resolution, it also presents us with new challenges when the phylogenetic signal is confused by differences in the evolutionary histories of sampled genes. Given the incorporation of accurate marginal likelihood estimation methods into popular Bayesian software programs, it is natural to consider using the Bayes Factor (BF) to compare different partition models in which genes within any given partition subset share both tree topology and edge lengths. We explore using marginal likelihood to assess data subset combinability when data subsets have varying levels of phylogenetic discordance due to deep coalescence events among genes (simulated within a species tree), and compare the results with our recently described phylogenetic informational dissonance index (D) estimated for each data set. BF effectively detects phylogenetic incongruence and provides a way to assess the statistical significance of D values. We use BFs to assess data combinability using an empirical data set comprising 56 plastid genes from the green algal order Volvocales. We also discuss the potential need for calibrating BFs and demonstrate that BFs used in this study are correctly calibrated.
Collapse
Affiliation(s)
- Suman Neupane
- Department of Biological Sciences, Virginia Tech University, 4076 Derring Hall, 926 West Campus Drive, Blacksburg, VA 24061, USA.,Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Karolina Fučíková
- Department of Natural Sciences, Assumption College, 500 Salisbury St., Worcester, MA 01609, USA
| | - Louise A Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Lynn Kuo
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| | - Ming-Hui Chen
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| | - Paul O Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| |
Collapse
|
32
|
Smith SA, Walker-Hale N, Walker JF, Brown JW. Phylogenetic Conflicts, Combinability, and Deep Phylogenomics in Plants. Syst Biol 2019; 69:579-592. [DOI: 10.1093/sysbio/syz078] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 10/16/2019] [Accepted: 11/18/2019] [Indexed: 11/13/2022] Open
Abstract
Abstract
Studies have demonstrated that pervasive gene tree conflict underlies several important phylogenetic relationships where different species tree methods produce conflicting results. Here, we present a means of dissecting the phylogenetic signal for alternative resolutions within a data set in order to resolve recalcitrant relationships and, importantly, identify what the data set is unable to resolve. These procedures extend upon methods for isolating conflict and concordance involving specific candidate relationships and can be used to identify systematic error and disambiguate sources of conflict among species tree inference methods. We demonstrate these on a large phylogenomic plant data set. Our results support the placement of Amborella as sister to the remaining extant angiosperms, Gnetales as sister to pines, and the monophyly of extant gymnosperms. Several other contentious relationships, including the resolution of relationships within the bryophytes and the eudicots, remain uncertain given the low number of supporting gene trees. To address whether concatenation of filtered genes amplified phylogenetic signal for relationships, we implemented a combinatorial heuristic to test combinability of genes. We found that nested conflicts limited the ability of data filtering methods to fully ameliorate conflicting signal amongst gene trees. These analyses confirmed that the underlying conflicting signal does not support broad concatenation of genes. Our approach provides a means of dissecting a specific data set to address deep phylogenetic relationships while also identifying the inferential boundaries of the data set. [Angiosperms; coalescent; gene-tree conflict; genomics; phylogenetics; phylogenomics.]
Collapse
Affiliation(s)
- Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, 1105 North University Ave, Biological Sciences Building, Ann Arbor, MI 48109-1085, USA
| | - Nathanael Walker-Hale
- Department of Ecology and Evolutionary Biology, University of Michigan, 1105 North University Ave, Biological Sciences Building, Ann Arbor, MI 48109-1085, USA
- Department of Plant Sciences, University of Cambridge, Downing Street, Cambridge CB2 3EA, Cambridge, UK
| | - Joseph F Walker
- Department of Ecology and Evolutionary Biology, University of Michigan, 1105 North University Ave, Biological Sciences Building, Ann Arbor, MI 48109-1085, USA
- Sainsbury Laboratory (SLCU), University of Cambrige, Bateman St, Cambridge CB2 1LR, Cambridge, UK
| | - Joseph W Brown
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, Sheffield, UK
| |
Collapse
|
33
|
Du Y, Wu S, Edwards SV, Liu L. The effect of alignment uncertainty, substitution models and priors in building and dating the mammal tree of life. BMC Evol Biol 2019; 19:203. [PMID: 31694538 PMCID: PMC6833305 DOI: 10.1186/s12862-019-1534-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Accepted: 10/21/2019] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND The flood of genomic data to help build and date the tree of life requires automation at several critical junctures, most importantly during sequence assembly and alignment. It is widely appreciated that automated alignment protocols can yield inaccuracies, but the relative impact of various sources error on phylogenomic analysis is not yet known. This study employs an updated mammal data set of 5162 coding loci sampled from 90 species to evaluate the effects of alignment uncertainty, substitution models, and fossil priors on gene tree, species tree, and divergence time estimation. Additionally, a novel coalescent likelihood ratio test is introduced for comparing competing species trees against a given set of gene trees. RESULTS The aligned DNA sequences of 5162 loci from 90 species were trimmed and filtered using trimAL and two filtering protocols. The final dataset contains 4 sets of alignments - before trimming, after trimming, filtered by a recently proposed pipeline, and further filtered by comparing ML gene trees for each locus with the concatenation tree. Our analyses suggest that the average discordance among the coalescent trees is significantly smaller than that among the concatenation trees estimated from the 4 sets of alignments or with different substitution models. There is no significant difference among the divergence times estimated with different substitution models. However, the divergence dates estimated from the alignments after trimming are more recent than those estimated from the alignments before trimming. CONCLUSIONS Our results highlight that alignment uncertainty of the updated mammal data set and the choice of substitution models have little impact on tree topologies yielded by coalescent methods for species tree estimation, whereas they are more influential on the trees made by concatenation. Given the choice of calibration scheme and clock models, divergence time estimates are robust to the choice of substitution models, but removing alignments deemed problematic by trimming algorithms can lead to more recent dates. Although the fossil prior is important in divergence time estimation, Bayesian estimates of divergence times in this data set are driven primarily by the sequence data.
Collapse
Affiliation(s)
- Yan Du
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30606 USA
| | - Shaoyuan Wu
- Jiangsu Key Laboratory of Phylogenomics & Comparative Genomics, School of Life Sciences, Jiangsu Normal University, Xuzhou, Jiangsu 221116 People’s Republic of China
| | - Scott V. Edwards
- Department of Organismic & Evolutionary Biology, Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138 USA
| | - Liang Liu
- Liang Liu, Department of Statistics and Institute of Bioinformatics, University of Georgia, 310 Herty Drive, Athens, GA 30606 USA
| |
Collapse
|
34
|
Gatesy J, Sloan DB, Warren JM, Baker RH, Simmons MP, Springer MS. Partitioned coalescence support reveals biases in species-tree methods and detects gene trees that determine phylogenomic conflicts. Mol Phylogenet Evol 2019; 139:106539. [DOI: 10.1016/j.ympev.2019.106539] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2018] [Revised: 06/10/2019] [Accepted: 06/17/2019] [Indexed: 12/26/2022]
|
35
|
Li YX, Li ZH, Schuiteman A, Chase MW, Li JW, Huang WC, Hidayat A, Wu SS, Jin XH. Phylogenomics of Orchidaceae based on plastid and mitochondrial genomes. Mol Phylogenet Evol 2019; 139:106540. [DOI: 10.1016/j.ympev.2019.106540] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 06/05/2019] [Accepted: 06/18/2019] [Indexed: 10/26/2022]
|
36
|
Garcia-Porta J, Irisarri I, Kirchner M, Rodríguez A, Kirchhof S, Brown JL, MacLeod A, Turner AP, Ahmadzadeh F, Albaladejo G, Crnobrnja-Isailovic J, De la Riva I, Fawzi A, Galán P, Göçmen B, Harris DJ, Jiménez-Robles O, Joger U, Jovanović Glavaš O, Karış M, Koziel G, Künzel S, Lyra M, Miles D, Nogales M, Oğuz MA, Pafilis P, Rancilhac L, Rodríguez N, Rodríguez Concepción B, Sanchez E, Salvi D, Slimani T, S'khifa A, Qashqaei AT, Žagar A, Lemmon A, Moriarty Lemmon E, Carretero MA, Carranza S, Philippe H, Sinervo B, Müller J, Vences M, Wollenberg Valero KC. Environmental temperatures shape thermal physiology as well as diversification and genome-wide substitution rates in lizards. Nat Commun 2019; 10:4077. [PMID: 31501432 PMCID: PMC6733905 DOI: 10.1038/s41467-019-11943-x] [Citation(s) in RCA: 61] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 08/13/2019] [Indexed: 11/09/2022] Open
Abstract
Climatic conditions changing over time and space shape the evolution of organisms at multiple levels, including temperate lizards in the family Lacertidae. Here we reconstruct a dated phylogenetic tree of 262 lacertid species based on a supermatrix relying on novel phylogenomic datasets and fossil calibrations. Diversification of lacertids was accompanied by an increasing disparity among occupied bioclimatic niches, especially in the last 10 Ma, during a period of progressive global cooling. Temperate species also underwent a genome-wide slowdown in molecular substitution rates compared to tropical and desert-adapted lacertids. Evaporative water loss and preferred temperature are correlated with bioclimatic parameters, indicating physiological adaptations to climate. Tropical, but also some populations of cool-adapted species experience maximum temperatures close to their preferred temperatures. We hypothesize these species-specific physiological preferences may constitute a handicap to prevail under rapid global warming, and contribute to explaining local lizard extinctions in cool and humid climates.
Collapse
Affiliation(s)
- Joan Garcia-Porta
- CREAF, 08193, Cerdanyola del Vallès, Spain
- Department of Biology, Washington University in Saint Louis, St. Louis, MO, 63130, USA
| | - Iker Irisarri
- Department of Organismal Biology, Uppsala University, Norbyvägen 18D, 752 36, Uppsala, Sweden
| | - Martin Kirchner
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Invalidenstr. 43, 10115, Berlin, Germany
| | - Ariel Rodríguez
- Institute of Zoology, Tierärztliche Hochschule Hannover, Bünteweg 17, 30559, Hannover, Germany
| | - Sebastian Kirchhof
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Invalidenstr. 43, 10115, Berlin, Germany
| | - Jason L Brown
- Department of Zoology, Southern Illinois University, Carbondale, IL, USA
| | - Amy MacLeod
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Invalidenstr. 43, 10115, Berlin, Germany
| | - Alexander P Turner
- School of Engineering and Computer Science, University of Hull, Cottingham Road, HU6 7RX, Kingston-Upon-Hull, UK
| | - Faraham Ahmadzadeh
- Department of Biodiversity and Ecosystem Management, Environmental Sciences Research Institute, Shahid Beheshti University, G.C, Tehran, Iran
| | - Gonzalo Albaladejo
- Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas (CSIC), c/Astrofísico Francisco Sánchez, 38206, Tenerife, Canary Islands, Spain
| | - Jelka Crnobrnja-Isailovic
- Department of Biology and Ecology, Faculty of Sciences and Mathematics, University of Niš, Višegradska 33, 18000 Niš, Institute for Biological Research "S. Stanković" University of Belgrade, Despota Stefana 142, Belgrade, 11000, Serbia
| | - Ignacio De la Riva
- Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales, CSIC, C/José Gutiérrez Abascal 2, 28006, Madrid, Spain
| | - Adnane Fawzi
- Faculty of Sciences, Biodiversity and Ecosystem Dynamics Laboratory, Cadi Ayyad University, Marrakech, Morocco
| | - Pedro Galán
- Departamento de Bioloxía, Facultade de Ciencias, Universidade da Coruña, Grupo de Investigación en Biología Evolutiva (GIBE), 15071, A Coruña, Spain
| | - Bayram Göçmen
- Zoology Section, Biology Department, Faculty of Science, Ege University, 35100, Bornova, Izmir, Turkey
| | - D James Harris
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, University of Porto, Campus Agrário de Vairão, 4485-661, Vairão, Portugal
| | - Octavio Jiménez-Robles
- Department of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Ulrich Joger
- Staatliches Naturhistorisches Museum, Braunschweig, Germany
| | | | - Mert Karış
- Department of Chemistry and Chemical Process Technologies, Acıgöl Vocational High School of Technical Sciences, Nevşehir Hacı Bektaş Veli University, 50300, Nevşehir, Turkey
| | - Giannina Koziel
- Zoological Institute, Braunschweig University of Technology, Mendelssohnstr. 4, 38106, Braunschweig, Germany
| | - Sven Künzel
- Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Mariana Lyra
- Departamento de Zoologia, Instituto de Biociências, UNESP - Universidade Estadual Paulista, Rio Claro, Brazil
| | - Donald Miles
- Department of Biological Sciences, Ohio University, Athens, OH, 45701, USA
| | - Manuel Nogales
- Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas (CSIC), c/Astrofísico Francisco Sánchez, 38206, Tenerife, Canary Islands, Spain
| | - Mehmet Anıl Oğuz
- Zoology Section, Biology Department, Faculty of Science, Ege University, 35100, Bornova, Izmir, Turkey
| | - Panayiotis Pafilis
- Section of Zoology and Marine Biology, Department of Biology, National and Kapodistrian University of Athens, Panepistimioupolis, Ilissia, Athens, 157-84, Greece
| | - Loïs Rancilhac
- Zoological Institute, Braunschweig University of Technology, Mendelssohnstr. 4, 38106, Braunschweig, Germany
| | - Noemí Rodríguez
- Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas (CSIC), c/Astrofísico Francisco Sánchez, 38206, Tenerife, Canary Islands, Spain
| | - Benza Rodríguez Concepción
- Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas (CSIC), c/Astrofísico Francisco Sánchez, 38206, Tenerife, Canary Islands, Spain
| | - Eugenia Sanchez
- Zoological Institute, Braunschweig University of Technology, Mendelssohnstr. 4, 38106, Braunschweig, Germany
| | - Daniele Salvi
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, University of Porto, Campus Agrário de Vairão, 4485-661, Vairão, Portugal
- Department of Health, Life and Environmental Sciences, University of L'Aquila, 67100, Coppito, L'Aquila, Italy
| | - Tahar Slimani
- Faculty of Sciences, Biodiversity and Ecosystem Dynamics Laboratory, Cadi Ayyad University, Marrakech, Morocco
| | - Abderrahim S'khifa
- Faculty of Sciences, Biodiversity and Ecosystem Dynamics Laboratory, Cadi Ayyad University, Marrakech, Morocco
| | - Ali Turk Qashqaei
- Department of Biodiversity and Ecosystem Management, Environmental Sciences Research Institute, Shahid Beheshti University, G.C, Tehran, Iran
| | - Anamarija Žagar
- National Institute of Biology NIB, Department of Organisms and Ecosystems Research, Vecna pot 111, 1000, Ljubljana, Slovenia
| | - Alan Lemmon
- Department of Scientific Computing, Florida State University, Dirac Science Library, Tallahassee, FL, USA
| | | | - Miguel Angel Carretero
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, University of Porto, Campus Agrário de Vairão, 4485-661, Vairão, Portugal
| | - Salvador Carranza
- Institute of Evolutionary Biology (CSIC-Universitat, Pompeu Fabra), Passeig Marítim de la Barceloneta 37-,49, 08003, Barcelona, Spain
| | - Hervé Philippe
- Centre for Biodiversity Theory and Modelling, UMR CNRS 5321, Station of Theoretical and Experimental Ecology, 09200, Moulis, France
| | - Barry Sinervo
- Department of Ecology and Evolutionary Biology, and Institute for the Study of the Ecological and Evolutionary Climate Impacts, University of California, 130 McAllister Way, Coastal Biology Building, Santa Cruz, CA, 95064, USA
| | - Johannes Müller
- Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Invalidenstr. 43, 10115, Berlin, Germany
| | - Miguel Vences
- Zoological Institute, Braunschweig University of Technology, Mendelssohnstr. 4, 38106, Braunschweig, Germany.
| | | |
Collapse
|
37
|
Steenwyk JL, Rokas A. Treehouse: a user-friendly application to obtain subtrees from large phylogenies. BMC Res Notes 2019; 12:541. [PMID: 31455362 PMCID: PMC6712805 DOI: 10.1186/s13104-019-4577-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 08/21/2019] [Indexed: 01/13/2023] Open
Abstract
Objective Phylogenetic trees that contain hundreds to thousands of taxa are now routinely generated. Retrieving the relationships among a subset of taxa in these large phylogenies can be a challenging or time-consuming task. Addressing this challenge requires the development of tools that facilitate the easy retrieval of subtrees from any user-specified set of taxa in a given phylogeny. Results We developed treehouse, an open source tool that enables the retrieval of any subtree from a given large phylogeny. With a three-step workflow, treehouse successfully allows a user to obtain a subtree from any phylogeny. Treehouse can help researchers to explore the relationships among any set of taxa from across the tree of life. Treehouse is implemented as a shiny application in the R programming language. Treehouse software and usage instructions are publicly available at https://github.com/JLSteenwyk/treehouse.
Collapse
Affiliation(s)
- Jacob L Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, 37235, USA.
| |
Collapse
|
38
|
Coiro M, Doyle JA, Hilton J. How deep is the conflict between molecular and fossil evidence on the age of angiosperms? THE NEW PHYTOLOGIST 2019; 223:83-99. [PMID: 30681148 DOI: 10.1111/nph.15708] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 01/04/2019] [Indexed: 05/06/2023]
Abstract
The timing of the origin of angiosperms is a hotly debated topic in plant evolution. Molecular dating analyses that consistently retrieve pre-Cretaceous ages for crown-group angiosperms have eroded confidence in the fossil record, which indicates a radiation and possibly also origin in the Early Cretaceous. Here, we evaluate paleobotanical evidence on the age of the angiosperms, showing how fossils provide crucial data for clarifying the situation. Pollen floras document a Northern Gondwanan appearance of monosulcate angiosperms in the Valanginian and subsequent poleward spread of monosulcates and tricolpate eudicots, accelerating in the Albian. The sequence of pollen types agrees with molecular phylogenetic inferences on the course of pollen evolution, but it conflicts strongly with Triassic and early Jurassic molecular ages, and the discrepancy is difficult to explain by geographic or taphonomic biases. Critical scrutiny shows that supposed pre-Cretaceous angiosperms either represent other plant groups or lack features that might confidently assign them to the angiosperms. However, the record may allow the Late Jurassic existence of ecologically restricted angiosperms, like those seen in the basal ANITA grade. Finally, we examine recently recognized biases in molecular dating and argue that a thoughtful integration of fossil and molecular evidence could help resolve these conflicts.
Collapse
Affiliation(s)
- Mario Coiro
- Department of Systematic and Evolutionary Botany, University of Zurich, 8008, Zurich, Switzerland
| | - James A Doyle
- Department of Evolution and Ecology, University of California, Davis, CA, 95616, USA
| | - Jason Hilton
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK
| |
Collapse
|
39
|
Widhelm TJ, Grewe F, Huang JP, Mercado-Díaz JA, Goffinet B, Lücking R, Moncada B, Mason-Gamer R, Lumbsch HT. Multiple historical processes obscure phylogenetic relationships in a taxonomically difficult group (Lobariaceae, Ascomycota). Sci Rep 2019; 9:8968. [PMID: 31222061 PMCID: PMC6586878 DOI: 10.1038/s41598-019-45455-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2018] [Accepted: 06/03/2019] [Indexed: 12/19/2022] Open
Abstract
In the age of next-generation sequencing, the number of loci available for phylogenetic analyses has increased by orders of magnitude. But despite this dramatic increase in the amount of data, some phylogenomic studies have revealed rampant gene-tree discordance that can be caused by many historical processes, such as rapid diversification, gene duplication, or reticulate evolution. We used a target enrichment approach to sample 400 single-copy nuclear genes and estimate the phylogenetic relationships of 13 genera in the lichen-forming family Lobariaceae to address the effect of data type (nucleotides and amino acids) and phylogenetic reconstruction method (concatenation and species tree approaches). Furthermore, we examined datasets for evidence of historical processes, such as rapid diversification and reticulate evolution. We found incongruence associated with sequence data types (nucleotide vs. amino acid sequences) and with different methods of phylogenetic reconstruction (species tree vs. concatenation). The resulting phylogenetic trees provided evidence for rapid and reticulate evolution based on extremely short branches in the backbone of the phylogenies. The observed rapid and reticulate diversifications may explain conflicts among gene trees and the challenges to resolving evolutionary relationships. Based on divergence times, the diversification at the backbone occurred near the Cretaceous-Paleogene (K-Pg) boundary (65 Mya) which is consistent with other rapid diversifications in the tree of life. Although some phylogenetic relationships within the Lobariaceae family remain with low support, even with our powerful phylogenomic dataset of up to 376 genes, our use of target-capturing data allowed for the novel exploration of the mechanisms underlying phylogenetic and systematic incongruence.
Collapse
Affiliation(s)
- Todd J Widhelm
- Field Museum, Science and Education, Chicago, 60605, USA.
- University of Illinois at Chicago, Biological Sciences, Chicago, 60607, USA.
| | - Felix Grewe
- Field Museum, Grainger Bioinformatics Center, Chicago, 60605, USA
| | - Jen-Pan Huang
- Field Museum, Science and Education, Chicago, 60605, USA
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | | | - Bernard Goffinet
- University of Connecticut, Ecology and Evolutionary Biology, Storrs, 06268, USA
| | - Robert Lücking
- Botanischer Garten und Botanisches Museum, Herbarium, Berlin, 14195, Germany
| | - Bibiana Moncada
- Universidad Distrital Francisco José de Caldas, Torre de Laboratorios, Herbario, Bogotá, 11021, Colombia
| | | | | |
Collapse
|
40
|
Zhou X, Shen XX, Hittinger CT, Rokas A. Evaluating Fast Maximum Likelihood-Based Phylogenetic Programs Using Empirical Phylogenomic Data Sets. Mol Biol Evol 2019; 35:486-503. [PMID: 29177474 PMCID: PMC5850867 DOI: 10.1093/molbev/msx302] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The sizes of the data matrices assembled to resolve branches of the tree of life have increased dramatically, motivating the development of programs for fast, yet accurate, inference. For example, several different fast programs have been developed in the very popular maximum likelihood framework, including RAxML/ExaML, PhyML, IQ-TREE, and FastTree. Although these programs are widely used, a systematic evaluation and comparison of their performance using empirical genome-scale data matrices has so far been lacking. To address this question, we evaluated these four programs on 19 empirical phylogenomic data sets with hundreds to thousands of genes and up to 200 taxa with respect to likelihood maximization, tree topology, and computational speed. For single-gene tree inference, we found that the more exhaustive and slower strategies (ten searches per alignment) outperformed faster strategies (one tree search per alignment) using RAxML, PhyML, or IQ-TREE. Interestingly, single-gene trees inferred by the three programs yielded comparable coalescent-based species tree estimations. For concatenation-based species tree inference, IQ-TREE consistently achieved the best-observed likelihoods for all data sets, and RAxML/ExaML was a close second. In contrast, PhyML often failed to complete concatenation-based analyses, whereas FastTree was the fastest but generated lower likelihood values and more dissimilar tree topologies in both types of analyses. Finally, data matrix properties, such as the number of taxa and the strength of phylogenetic signal, sometimes substantially influenced the programs’ relative performance. Our results provide real-world gene and species tree phylogenetic inference benchmarks to inform the design and execution of large-scale phylogenomic data analyses.
Collapse
Affiliation(s)
- Xiaofan Zhou
- Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, P.R. China.,Guangdong Province Key Laboratory of Microbial Signals and Disease Control, Department of Plant Pathology, South China Agricultural University, Guangzhou, P.R. China
| | - Xing-Xing Shen
- Department of Biological Sciences, Vanderbilt University, Nashville, TN
| | - Chris Todd Hittinger
- Laboratory of Genetics, Genome Center of Wisconsin, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, J. F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, WI
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN
| |
Collapse
|
41
|
Simmons MP, Sloan DB, Springer MS, Gatesy J. Gene-wise resampling outperforms site-wise resampling in phylogenetic coalescence analyses. Mol Phylogenet Evol 2019; 131:80-92. [DOI: 10.1016/j.ympev.2018.10.001] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 10/01/2018] [Indexed: 01/15/2023]
|
42
|
Couvreur TLP, Helmstetter AJ, Koenen EJM, Bethune K, Brandão RD, Little SA, Sauquet H, Erkens RHJ. Phylogenomics of the Major Tropical Plant Family Annonaceae Using Targeted Enrichment of Nuclear Genes. FRONTIERS IN PLANT SCIENCE 2019; 9:1941. [PMID: 30687347 PMCID: PMC6334231 DOI: 10.3389/fpls.2018.01941] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2018] [Accepted: 12/13/2018] [Indexed: 05/19/2023]
Abstract
Targeted enrichment and sequencing of hundreds of nuclear loci for phylogenetic reconstruction is becoming an important tool for plant systematics and evolution. Annonaceae is a major pantropical plant family with 110 genera and ca. 2,450 species, occurring across all major and minor tropical forests of the world. Baits were designed by sequencing the transcriptomes of five species from two of the largest Annonaceae subfamilies. Orthologous loci were identified. The resulting baiting kit was used to reconstruct phylogenetic relationships at two different levels using concatenated and gene tree approaches: a family wide Annonaceae analysis sampling 65 genera and a species level analysis of tribe Piptostigmateae sampling 29 species with multiple individuals per species. DNA extraction was undertaken mainly on silicagel dried leaves, with two samples from herbarium dried leaves. Our kit targets 469 exons (364,653 bp of sequence data), successfully capturing sequences from across Annonaceae. Silicagel dried and herbarium DNA worked equally well. We present for the first time a nuclear gene-based phylogenetic tree at the generic level based on 317 supercontigs. Results mainly confirm previous chloroplast based studies. However, several new relationships are found and discussed. We show significant differences in branch lengths between the two large subfamilies Annonoideae and Malmeoideae. A new tribe, Annickieae, is erected containing a single African genus Annickia. We also reconstructed a well-resolved species-level phylogenetic tree of the Piptostigmteae tribe. Our baiting kit is useful for reconstructing well-supported phylogenetic relationships within Annonaceae at different taxonomic levels. The nuclear genome is mainly concordant with plastome information with a few exceptions. Moreover, we find that substitution rate heterogeneity between the two subfamilies is also found within the nuclear compartment, and not just plastomes and ribosomal DNA as previously shown. Our results have implications for understanding the biogeography, molecular dating and evolution of Annonaceae.
Collapse
Affiliation(s)
| | | | - Erik J. M. Koenen
- Institute of Systematic Botany, University of Zurich, Zurich, Switzerland
| | - Kevin Bethune
- IRD, UMR DIADE, Univ. Montpellier, Montpellier, France
| | - Rita D. Brandão
- Maastricht Science Programme, Maastricht University, Maastricht, Netherlands
| | - Stefan A. Little
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech, Université-Paris Saclay, Orsay, France
| | - Hervé Sauquet
- Ecologie Systématique Evolution, Univ. Paris-Sud, CNRS, AgroParisTech, Université-Paris Saclay, Orsay, France
- National Herbarium of New South Wales (NSW), Royal Botanic Gardens and Domain Trust, Sydney, NSW, Australia
| | - Roy H. J. Erkens
- Maastricht Science Programme, Maastricht University, Maastricht, Netherlands
| |
Collapse
|
43
|
Walker JF, Brown JW, Smith SA. Analyzing Contentious Relationships and Outlier Genes in Phylogenomics. Syst Biol 2018; 67:916-924. [PMID: 29893968 DOI: 10.1093/sysbio/syy043] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2017] [Accepted: 06/02/2018] [Indexed: 01/19/2023] Open
Abstract
Recent studies have demonstrated that conflict is common among gene trees in phylogenomic studies, and that less than one percent of genes may ultimately drive species tree inference in supermatrix analyses. Herein, we examined two data sets where supermatrix and coalescent-based species trees conflict. We identified two highly influential "outlier" genes in each data set. When removed from each data set, the inferred supermatrix trees matched the topologies obtained from coalescent analyses. We also demonstrate that, while the outlier genes in the vertebrate data set have been shown in a previous study to be the result of errors in orthology detection, the outlier genes from a plant data set did not exhibit any obvious systematic error, and therefore, may be the result of some biological process yet to be determined. While topological comparisons among a small set of alternate topologies can be helpful in discovering outlier genes, they can be limited in several ways, such as assuming all genes share the same topology. Coalescent species tree methods relax this assumption but do not explicitly facilitate the examination of specific edges. Coalescent methods often also assume that conflict is the result of incomplete lineage sorting. Herein, we explored a framework that allows for quickly examining alternative edges and support for large phylogenomic data sets that does not assume a single topology for all genes. For both data sets, these analyses provided detailed results confirming the support for coalescent-based topologies. This framework suggests that we can improve our understanding of the underlying signal in phylogenomic data sets by asking more targeted edge-based questions.
Collapse
Affiliation(s)
- Joseph F Walker
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Joseph W Brown
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, UK
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
44
|
Incomplete lineage sorting rather than hybridization explains the inconsistent phylogeny of the wisent. Commun Biol 2018; 1:169. [PMID: 30374461 PMCID: PMC6195592 DOI: 10.1038/s42003-018-0176-6] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 09/12/2018] [Indexed: 12/30/2022] Open
Abstract
The wisent or European bison is the largest European herbivore and is completely cross-fertile with its American relative. However, mtDNA genome of wisent is similar to that of cattle, which suggests that wisent emerged as a hybrid of bison and an extinct cattle-like species. Here, we analyzed nuclear whole-genome sequences of the bovine species, and found only a minor and recent gene flow between wisent and cattle. Furthermore, we identified an appreciable heterogeneity of the nuclear gene tree topologies of the bovine species. The relative frequencies of various topologies, including the mtDNA topology, were consistent with frequencies of incomplete lineage sorting (ILS) as estimated by tree coalescence analysis. This indicates that ILS has occurred and may well account for the anomalous wisent mtDNA phylogeny as the outcome of a rare event. We propose that ILS is a possible explanation of phylogenomic anomalies among closely related species. Kun Wang et al. present a genomic analysis identifying incomplete lineage sorting and hybridization in the mitochondrial DNA of the European bison (wisent). They find that incomplete lineage sorting is the most feasible explanation for the phylogenetic heterogeneity observed in Bovidae.
Collapse
|
45
|
Wang X, Lim BK, Ting N, Hu J, Liang Y, Roos C, Yu L. Reconstructing the phylogeny of new world monkeys ( platyrrhini): evidence from multiple non-coding loci. Curr Zool 2018; 65:579-588. [PMID: 31616489 PMCID: PMC6784508 DOI: 10.1093/cz/zoy072] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Accepted: 09/12/2018] [Indexed: 11/27/2022] Open
Abstract
Among mammalian phylogenies, those characterized by rapid radiations are particularly problematic. The New World monkeys (NWMs, Platyrrhini) comprise 3 families and 7 subfamilies, which radiated within a relatively short time period. Accordingly, their phylogenetic relationships are still largely disputed. In the present study, 56 nuclear non-coding loci, including 33 introns (INs) and 23 intergenic regions (IGs), from 20 NWM individuals representing 18 species were used to investigate phylogenetic relationships among families and subfamilies. Of the 56 loci, 43 have not been used in previous NWM phylogenetics. We applied concatenation and coalescence tree-inference methods, and a recently proposed question-specific approach to address NWM phylogeny. Our results indicate incongruence between concatenation and coalescence methods for the IN and IG datasets. However, a consensus was reached with a single tree topology from all analyses of combined INs and IGs as well as all analyses of question-specific loci using both concatenation and coalescence methods, albeit with varying degrees of statistical support. In detail, our results indicated the sister-group relationships between the families Atelidae and Pitheciidae, and between the subfamilies Aotinae and Callithrichinae among Cebidae. Our study provides insights into the disputed phylogenetic relationships among NWM families and subfamilies from the perspective of multiple non-coding loci and various tree-inference approaches. However, the present phylogenetic framework needs further evaluation by adding more independent sequence data and a deeper taxonomic sampling. Overall, our work has important implications for phylogenetic studies dealing with rapid radiations.
Collapse
Affiliation(s)
- Xiaoping Wang
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.,School of Life Sciences, Yunnan University, Kunming, China
| | - Burton K Lim
- Department of Natural History, Royal Ontario Museum, Toronto, ON, Canada
| | - Nelson Ting
- Department of Anthropology and Institute of Ecology and Evolution, University of Oregon, Eugene, Oregon, USA
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.,State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.,Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yunpeng Liang
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China
| | - Christian Roos
- Gene Bank of Primates and Primate Genetics Laboratory, German Primate Center, Leibniz Institute for Primate Research, Kellnerweg, Göttingen, Germany
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China
| |
Collapse
|
46
|
Freitas L, Mello B, Schrago CG. Multispecies coalescent analysis confirms standing phylogenetic instability in Hexapoda. J Evol Biol 2018; 31:1623-1631. [PMID: 30058265 DOI: 10.1111/jeb.13355] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Revised: 06/28/2018] [Accepted: 07/23/2018] [Indexed: 11/28/2022]
Abstract
The multispecies coalescent (MSC) has been increasingly used in phylogenomic analyses due to the accommodation of gene tree topological heterogeneity by taking into account population-level processes, such as incomplete lineage sorting. In this sense, the phylogeny of insect species, which are characterized by their large effective population sizes, is suitable for a coalescent-based analysis. Furthermore, studies so far recovered short internal branches at early divergences of the insect tree of life, indicating fast evolutionary radiations that increase the probability of incomplete lineage sorting in deep time. Here, we investigated the performance of the MSC for a phylogenomic data set of hexapods compiled by Misof et al. (2014, Science 346:763). Our analysis recovered the monophyly of most insect orders, and major phylogenetic relationships were in agreement with current insect systematics. We identified, however, some evolutionary associations that were consistently problematic. Most noticeable, Hexapod monophyly was disrupted by the sister group relationship between the remiped crustacean and Insecta. Additionally, the interordinal relationships within Polyneoptera and Neuropteroidea were found to be phylogenetically unstable. We show that these controversial phylogenetic arrangements were also poorly supported by previous analyses, and therefore, we evaluated their robustness to stochastic errors from sampling sites and terminals, confirming standing problems in hexapod phylogeny in the genomics age.
Collapse
Affiliation(s)
- Lucas Freitas
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Beatriz Mello
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Carlos G Schrago
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| |
Collapse
|
47
|
Dong S, Zhao C, Chen F, Liu Y, Zhang S, Wu H, Zhang L, Liu Y. The complete mitochondrial genome of the early flowering plant Nymphaea colorata is highly repetitive with low recombination. BMC Genomics 2018; 19:614. [PMID: 30107780 PMCID: PMC6092842 DOI: 10.1186/s12864-018-4991-4] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Accepted: 08/02/2018] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Mitochondrial genomes of flowering plants (angiosperms) are highly dynamic in genome structure. The mitogenome of the earliest angiosperm Amborella is remarkable in carrying rampant foreign DNAs, in contrast to Liriodendron, the other only known early angiosperm mitogenome that is described as 'fossilized'. The distinctive features observed in the two early flowering plant mitogenomes add to the current confusions of what early flowering plants look like. Expanded sampling would provide more details in understanding the mitogenomic evolution of early angiosperms. Here we report the complete mitochondrial genome of water lily Nymphaea colorata from Nymphaeales, one of the three orders of the earliest angiosperms. RESULTS Assembly of data from Pac-Bio long-read sequencing yielded a circular mitochondria chromosome of 617,195 bp with an average depth of 601×. The genome encoded 41 protein coding genes, 20 tRNA and three rRNA genes with 25 group II introns disrupting 10 protein coding genes. Nearly half of the genome is composed of repeated sequences, which contributed substantially to the intron size expansion, making the gross intron length of the Nymphaea mitochondrial genome one of the longest among angiosperms, including an 11.4-Kb intron in cox2, which is the longest organellar intron reported to date in plants. Nevertheless, repeat mediated homologous recombination is unexpectedly low in Nymphaea evidenced by 74 recombined reads detected from ten recombinationally active repeat pairs among 886,982 repeat pairs examined. Extensive gene order changes were detected in the three early angiosperm mitogenomes, i.e. 38 or 44 events of inversions and translocations are needed to reconcile the mitogenome of Nymphaea with Amborella or Liriodendron, respectively. In contrast to Amborella with six genome equivalents of foreign mitochondrial DNA, not a single horizontal gene transfer event was observed in the Nymphaea mitogenome. CONCLUSIONS The Nymphaea mitogenome resembles the other available early angiosperm mitogenomes by a similarly rich 64-coding gene set, and many conserved gene clusters, whereas stands out by its highly repetitive nature and resultant remarkable intron expansions. The low recombination level in Nymphaea provides evidence for the predominant master conformation in vivo with a highly substoichiometric set of rearranged molecules.
Collapse
Affiliation(s)
- Shanshan Dong
- Fairylake Botanical Garden, Shenzhen & Chinese Academy of Sciences, Shenzhen, China
- College of Life Sciences, South China Agricultural University, Guangzhou, China
| | - Chaoxian Zhao
- Fairylake Botanical Garden, Shenzhen & Chinese Academy of Sciences, Shenzhen, China
- Department of Biology, School of Life Sciences, East China Normal University, Shanghai, China
| | - Fei Chen
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Fujian Agriculture and Forestry University, Fuzhou, China
- Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yanhui Liu
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Fujian Agriculture and Forestry University, Fuzhou, China
- Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Shouzhou Zhang
- Fairylake Botanical Garden, Shenzhen & Chinese Academy of Sciences, Shenzhen, China
| | - Hong Wu
- College of Life Sciences, South China Agricultural University, Guangzhou, China
| | - Liangsheng Zhang
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Fujian Agriculture and Forestry University, Fuzhou, China
- Ministry of Education Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Fujian Agriculture and Forestry University, Fuzhou, China
- Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Yang Liu
- Fairylake Botanical Garden, Shenzhen & Chinese Academy of Sciences, Shenzhen, China
- BGI-Shenzhen, Shenzhen, 518083 China
| |
Collapse
|
48
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|
49
|
Sayyari E, Whitfield JB, Mirarab S. Fragmentary Gene Sequences Negatively Impact Gene Tree and Species Tree Reconstruction. Mol Biol Evol 2018; 34:3279-3291. [PMID: 29029241 DOI: 10.1093/molbev/msx261] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Species tree reconstruction from genome-wide data is increasingly being attempted, in most cases using a two-step approach of first estimating individual gene trees and then summarizing them to obtain a species tree. The accuracy of this approach, which promises to account for gene tree discordance, depends on the quality of the inferred gene trees. At the same time, phylogenomic and phylotranscriptomic analyses typically use involved bioinformatics pipelines for data preparation. Errors and shortcomings resulting from these preprocessing steps may impact the species tree analyses at the other end of the pipeline. In this article, we first show that the presence of fragmentary data for some species in a gene alignment, as often seen on real data, can result in substantial deterioration of gene trees, and as a result, the species tree. We then investigate a simple filtering strategy where individual fragmentary sequences are removed from individual genes but the rest of the gene is retained. Both in simulations and by reanalyzing a large insect phylotranscriptomic data set, we show the effectiveness of this simple filtering strategy.
Collapse
Affiliation(s)
- Erfan Sayyari
- Department of Electrical and Computer Engineering, University of California at San Diego, La Jolla, CA
| | | | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California at San Diego, La Jolla, CA
| |
Collapse
|
50
|
Hughes DF, Tolley KA, Behangana M, Lukwago W, Menegon M, Dehling JM, Stipala J, Tilbury CR, Khan AM, Kusamba C, Greenbaum E. Cryptic diversity in Rhampholeon boulengeri (Sauria: Chamaeleonidae), a pygmy chameleon from the Albertine Rift biodiversity hotspot. Mol Phylogenet Evol 2018; 122:125-141. [PMID: 29199108 PMCID: PMC6010225 DOI: 10.1016/j.ympev.2017.11.015] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2017] [Revised: 10/03/2017] [Accepted: 11/27/2017] [Indexed: 11/30/2022]
Abstract
Several biogeographic barriers in the Central African highlands have reduced gene flow among populations of many terrestrial species in predictable ways. Yet, a comprehensive understanding of mechanisms underlying species divergence in the Afrotropics can be obscured by unrecognized levels of cryptic diversity, particularly in widespread species. We implemented a multilocus phylogeographic approach to examine diversity within the widely distributed Central African pygmy chameleon, Rhampholeon boulengeri. Gene-tree analyses coupled with a comparative coalescent-based species delimitation framework revealed R. boulengeri as a complex of at least six genetically distinct species. The spatiotemporal speciation patterns for these cryptic species conform to general biogeographic hypotheses supporting vicariance as the main factor behind patterns of divergence in the Albertine Rift, a biodiversity hotspot in Central Africa. However, we found that parapatric species and sister species inhabited adjacent habitats, but were found in largely non-overlapping elevational ranges in the Albertine Rift, suggesting that differentiation in elevation was also an important mode of divergence. The phylogeographic patterns recovered for the genus-level phylogeny provide additional evidence for speciation by isolation in forest refugia, and dating estimates indicated that the Miocene was a significant period for this diversification. Our results highlight the importance of investigating cryptic diversity in widespread species to improve understanding of diversification patterns in environmentally diverse regions such as the montane Afrotropics.
Collapse
Affiliation(s)
- Daniel F Hughes
- Department of Biological Sciences, University of Texas at El Paso, El Paso, TX 79968, USA.
| | - Krystal A Tolley
- South African National Biodiversity Institute, Private Bag X7, Claremont, Cape Town, South Africa; Centre for Ecological Genomics and Wildlife Conservation, Department of Zoology, University of Johannesburg, Auckland Park 2000, Johannesburg, South Africa
| | - Mathias Behangana
- Department of Environmental Sciences, Makerere University, P.O. Box 7062, Kampala, Uganda
| | - Wilber Lukwago
- Department of Environmental Sciences, Makerere University, P.O. Box 7062, Kampala, Uganda
| | - Michele Menegon
- Tropical Biodiversity Section, MUSE - The Science Museum of Trento, Corso del Lavoro e della Scienza 3, Trento 38123, Italy
| | - J Maximilian Dehling
- Institut für Integrierte Naturwissenschaften, Abteilung Biologie, AG Zoologie, Universität Koblenz-Landau, Universitätsstraße 1, 56070 Koblenz, Germany
| | - Jan Stipala
- School of Biosciences, University of Exeter, Tremough Campus, Penryn, Cornwall TR10 9EZ, United Kingdom
| | - Colin R Tilbury
- Department of Botany & Zoology, University of Stellenbosch, Private Bag X1, Matieland, 7602 Stellenbosch, South Africa
| | - Arshad M Khan
- Department of Biological Sciences, University of Texas at El Paso, El Paso, TX 79968, USA
| | - Chifundera Kusamba
- Laboratoire d'Herpétologie, Département de Biologie, Centre de Recherche en Sciences Naturelles, Lwiro, Democratic Republic of the Congo
| | - Eli Greenbaum
- Department of Biological Sciences, University of Texas at El Paso, El Paso, TX 79968, USA
| |
Collapse
|