1
|
Sin SYW, Ke F, Chen G, Huang PY, Enbody ED, Karubian J, Webster MS, Edwards SV. Genetic Basis and Evolution of Structural Color Polymorphism in an Australian Songbird. Mol Biol Evol 2024; 41:msae046. [PMID: 38415852 PMCID: PMC10962638 DOI: 10.1093/molbev/msae046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 02/02/2024] [Accepted: 02/22/2024] [Indexed: 02/29/2024] Open
Abstract
Island organisms often evolve phenotypes divergent from their mainland counterparts, providing a useful system for studying adaptation under differential selection. In the white-winged fairywren (Malurus leucopterus), subspecies on two islands have a black nuptial plumage whereas the subspecies on the Australian mainland has a blue nuptial plumage. The black subspecies have a feather nanostructure that could in principle produce a blue structural color, suggesting a blue ancestor. An earlier study proposed independent evolution of melanism on the islands based on the history of subspecies divergence. However, the genetic basis of melanism and the origin of color differentiation in this group are still unknown. Here, we used whole-genome resequencing to investigate the genetic basis of melanism by comparing the blue and black M. leucopterus subspecies to identify highly divergent genomic regions. We identified a well-known pigmentation gene ASIP and four candidate genes that may contribute to feather nanostructure development. Contrary to the prediction of convergent evolution of island melanism, we detected signatures of a selective sweep in genomic regions containing ASIP and SCUBE2 not in the black subspecies but in the blue subspecies, which possesses many derived SNPs in these regions, suggesting that the mainland subspecies has re-evolved a blue plumage from a black ancestor. This proposed re-evolution was likely driven by a preexisting female preference. Our findings provide new insight into the evolution of plumage coloration in island versus continental populations, and, importantly, we identify candidate genes that likely play roles in the development and evolution of feather structural coloration.
Collapse
Affiliation(s)
- Simon Yung Wa Sin
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Fushi Ke
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Guoling Chen
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Pei-Yu Huang
- School of Biological Sciences, The University of Hong Kong, Hong Kong, China
| | - Erik D Enbody
- Department of Ecology and Evolutionary Biology, Tulane University, New Orleans, LA 70118, USA
| | - Jordan Karubian
- Department of Ecology and Evolutionary Biology, Tulane University, New Orleans, LA 70118, USA
| | - Michael S Webster
- Cornell Lab of Ornithology and Department of Neurobiology and Behavior, Cornell University, Ithaca, NY 14853, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| |
Collapse
|
2
|
Tumendemberel O, Hendricks SA, Hohenlohe PA, Sullivan J, Zedrosser A, Saebø M, Proctor MF, Koprowski JL, Waits LP. Range-wide evolutionary relationships and historical demography of brown bears (Ursus arctos) revealed by whole-genome sequencing of isolated central Asian populations. Mol Ecol 2023; 32:5156-5169. [PMID: 37528604 DOI: 10.1111/mec.17091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 07/02/2023] [Accepted: 07/19/2023] [Indexed: 08/03/2023]
Abstract
Phylogeographic studies uncover hidden pathways of divergence and inform conservation. Brown bears (Ursus arctos) have one of the broadest distributions of all land mammals, ranging from Eurasia to North America, and are an important model for evolutionary studies. Although several whole genomes were available for individuals from North America, Europe and Asia, limited whole-genome data were available from Central Asia, including the highly imperilled brown bears in the Gobi Desert. To fill this knowledge gap, we sequenced whole genomes from nine Asian brown bears from the Gobi Desert of Mongolia, Northern Mongolia and the Himalayas of Pakistan. We combined these data with published brown bear sequences from Europe, Asia and North America, as well as other bear species. Our goals were to determine the evolutionary relationships among brown bear populations worldwide, their genetic diversity and their historical demography. Our analyses revealed five major lineages of brown bears based on a filtered set of 684,081 single nucleotide polymorphisms. We found distinct evolutionary lineages of brown bears in the Gobi, Himalayas, northern Mongolia, Europe and North America. The lowest level of genetic diversity and the highest level of inbreeding were found in Pakistan, the Gobi Desert and Central Italy. Furthermore, the effective population size (Ne ) for all brown bears decreased over the last 70,000 years. Our results confirm the genetic distinctiveness and ancient lineage of brown bear subspecies in the Gobi Desert of Mongolia and the Himalayas of Pakistan and highlight their importance for conservation.
Collapse
Affiliation(s)
- Odbayar Tumendemberel
- Haub School of Environment and Natural Resources, University of Wyoming, Laramie, Wyoming, USA
- Department of Natural Science and Environmental Health, University of South-Eastern Norway, Bø i Telemark, Norway
| | - Sarah A Hendricks
- Institute for Interdisciplinary Data Sciences, University of Idaho, Moscow, Idaho, USA
| | - Paul A Hohenlohe
- Institute for Interdisciplinary Data Sciences, University of Idaho, Moscow, Idaho, USA
- Department of Biological Sciences, University of Idaho, Moscow, Idaho, USA
| | - Jack Sullivan
- Institute for Interdisciplinary Data Sciences, University of Idaho, Moscow, Idaho, USA
- Department of Biological Sciences, University of Idaho, Moscow, Idaho, USA
| | - Andreas Zedrosser
- Department of Natural Science and Environmental Health, University of South-Eastern Norway, Bø i Telemark, Norway
| | - Mona Saebø
- Department of Natural Science and Environmental Health, University of South-Eastern Norway, Bø i Telemark, Norway
| | | | - John L Koprowski
- Haub School of Environment and Natural Resources, University of Wyoming, Laramie, Wyoming, USA
| | - Lisette P Waits
- Department of Fish and Wildlife Sciences, University of Idaho, Moscow, Idaho, USA
| |
Collapse
|
3
|
Qin SY, Zuo ZY, Guo C, Du XY, Liu SY, Yu XQ, Xiang XG, Rong J, Liu B, Liu ZF, Ma PF, Li DZ. Phylogenomic insights into the origin and evolutionary history of evergreen broadleaved forests in East Asia under Cenozoic climate change. Mol Ecol 2023; 32:2850-2868. [PMID: 36847615 DOI: 10.1111/mec.16904] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 02/09/2023] [Accepted: 02/23/2023] [Indexed: 03/01/2023]
Abstract
The evergreen versus deciduous leaf habit is an important functional trait for adaptation of forest trees and has been hypothesized to be related to the evolutionary processes of the component species under paleoclimatic change, and potentially reflected in the dynamic history of evergreen broadleaved forests (EBLFs) in East Asia. However, knowledge about the shift of evergreen versus deciduous leaf with the impact of paleoclimatic change using genomic data remains rare. Here, we focus on the Litsea complex (Lauraceae), a key lineage with dominant species of EBLFs, to gain insights into how evergreen versus deciduous trait shifted, providing insights into the origin and historical dynamics of EBLFs in East Asia under Cenozoic climate change. We reconstructed a robust phylogeny of the Litsea complex using genome-wide single-nucleotide variants (SNVs) with eight clades resolved. Fossil-calibrated analyses, diversification rate shifts, ancestral habit, ecological niche modelling and climate niche reconstruction were employed to estimate its origin and diversification pattern. Taking into account studies on other plant lineages dominating EBLFs of East Asia, it was revealed that the prototype of EBLFs in East Asia probably emerged in the Early Eocene (55-50 million years ago [Ma]), facilitated by the greenhouse warming. As a response to the cooling and drying climate in the Middle to Late Eocene (48-38 Ma), deciduous habits were evolved in the dominant lineages of the EBLFs in East Asia. Up to the Early Miocene (23 Ma), the prevailing of East Asian monsoon increased the extreme seasonal precipitation and accelerated the emergence of evergreen habits of the dominant lineages, and ultimately shaped the vegetation resembling that of today.
Collapse
Affiliation(s)
- Sheng-Yuan Qin
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Zheng-Yu Zuo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Cen Guo
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Xin-Yu Du
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Shui-Yin Liu
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Xiang-Qin Yu
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Xiao-Guo Xiang
- Jiangxi Province Key Laboratory of Watershed Ecosystem Change and Biodiversity, Centre for Watershed Ecology, Institute of Life Science and School of Life Sciences, Nanchang University, Nanchang, China
| | - Jun Rong
- Jiangxi Province Key Laboratory of Watershed Ecosystem Change and Biodiversity, Centre for Watershed Ecology, Institute of Life Science and School of Life Sciences, Nanchang University, Nanchang, China
| | - Bing Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, China.,Sino-African Joint Research Center, Chinese Academy of Sciences, Wuhan, China
| | - Zhi-Fang Liu
- Key Laboratory of Chemical Biology of Ministry of Education, School of Pharmaceutical Sciences, Cheeloo College of Medicine, Shandong University, Jinan, China
| | - Peng-Fei Ma
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - De-Zhu Li
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China.,University of Chinese Academy of Sciences, Beijing, China.,CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
4
|
Zaharias P, Warnow T. Recent progress on methods for estimating and updating large phylogenies. Philos Trans R Soc Lond B Biol Sci 2022; 377:20210244. [PMID: 35989607 PMCID: PMC9393559 DOI: 10.1098/rstb.2021.0244] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 01/07/2022] [Indexed: 12/20/2022] Open
Abstract
With the increased availability of sequence data and even of fully sequenced and assembled genomes, phylogeny estimation of very large trees (even of hundreds of thousands of sequences) is now a goal for some biologists. Yet, the construction of these phylogenies is a complex pipeline presenting analytical and computational challenges, especially when the number of sequences is very large. In the past few years, new methods have been developed that aim to enable highly accurate phylogeny estimations on these large datasets, including divide-and-conquer techniques for multiple sequence alignment and/or tree estimation, methods that can estimate species trees from multi-locus datasets while addressing heterogeneity due to biological processes (e.g. incomplete lineage sorting and gene duplication and loss), and methods to add sequences into large gene trees or species trees. Here we present some of these recent advances and discuss opportunities for future improvements. This article is part of a discussion meeting issue 'Genomic population structures of microbial pathogens'.
Collapse
Affiliation(s)
- Paul Zaharias
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
5
|
Durán-Castillo M, Hudson A, Wilson Y, Field DL, Twyford AD. A phylogeny of Antirrhinum reveals parallel evolution of alpine morphology. THE NEW PHYTOLOGIST 2022; 233:1426-1439. [PMID: 34170548 DOI: 10.1111/nph.17581] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 06/20/2021] [Indexed: 06/13/2023]
Abstract
Parallel evolution of similar morphologies in closely related lineages provides insight into the repeatability and predictability of evolution. In the genus Antirrhinum (snapdragons), as in other plants, a suite of morphological characters are associated with adaptation to alpine environments. We tested for parallel trait evolution in Antirrhinum by investigating phylogenetic relationships using restriction-site associated DNA (RAD) sequencing. We then associated phenotypic information to our phylogeny to reconstruct the patterns of morphological evolution and related this to evidence for hybridisation between emergent lineages. Phylogenetic analyses showed that the alpine character syndrome is present in multiple groups, suggesting that Antirrhinum has repeatedly colonised alpine habitats. Dispersal to novel environments happened in the presence of intraspecific and interspecific gene flow. We found support for a model of parallel evolution in Antirrhinum. Hybridisation in natural populations, and a complex genetic architecture underlying the alpine morphology syndrome, support an important role of natural selection in maintaining species divergence in the face of gene flow.
Collapse
Affiliation(s)
- Mario Durán-Castillo
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - Andrew Hudson
- Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Yvette Wilson
- Institute of Molecular Plant Sciences, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - David L Field
- School of Science, Edith Cowan University, 270 Joondalup Drive, Joondalup, 6027, Australia
| | - Alex D Twyford
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, EH9 3FL, UK
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh, EH3 5LR, UK
| |
Collapse
|
6
|
Mirarab S, Nakhleh L, Warnow T. Multispecies Coalescent: Theory and Applications in Phylogenetics. ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS 2021. [DOI: 10.1146/annurev-ecolsys-012121-095340] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Species tree estimation is a basic part of many biological research projects, ranging from answering basic evolutionary questions (e.g., how did a group of species adapt to their environments?) to addressing questions in functional biology. Yet, species tree estimation is very challenging, due to processes such as incomplete lineage sorting, gene duplication and loss, horizontal gene transfer, and hybridization, which can make gene trees differ from each other and from the overall evolutionary history of the species. Over the last 10–20 years, there has been tremendous growth in methods and mathematical theory for estimating species trees and phylogenetic networks, and some of these methods are now in wide use. In this survey, we provide an overview of the current state of the art, identify the limitations of existing methods and theory, and propose additional research problems and directions.
Collapse
Affiliation(s)
- Siavash Mirarab
- Electrical and Computer Engineering Department, University of California, San Diego, La Jolla, California 92093, USA
| | - Luay Nakhleh
- Department of Computer Science, Rice University, Houston, Texas 77005, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, Illinois 61801, USA
| |
Collapse
|
7
|
Dong W, Liu Y, Li E, Xu C, Sun J, Li W, Zhou S, Zhang Z, Suo Z. Phylogenomics and biogeography of Catalpa (Bignoniaceae) reveal incomplete lineage sorting and three dispersal events. Mol Phylogenet Evol 2021; 166:107330. [PMID: 34687844 DOI: 10.1016/j.ympev.2021.107330] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 10/10/2021] [Accepted: 10/13/2021] [Indexed: 01/21/2023]
Abstract
Catalpa Scop. (Bignoniaceae) is a small genus (8 spp.) of trees that is disjunctly distributed among eastern Asia, eastern United States, and the West Indies. Catalpa bears beautiful inflorescences and have been cultivated as important ornamental trees for landscaping, gardening, and timber. However, the phylogenetic relationships and biogeographic history of the genus have remained unresolved. In this study, we used a large genomic dataset that includes data from the chloroplast (plastomes), and nuclear genomes (ITS and 5,759 single-copy nuclear genes) to reconstruct phylogenetic relationship within Catalpa, test interspecific gene flow events within the genus, and infer its biogeographic history. Our phylogenetic results indicate that Catalpa is monophyletic containing two main clades, section Catalpa and section Macrocatalpa. Section Catalpa is further divided into three subclades. While most relationships are congruent between the chloroplast and nuclear datasets, the position of C. ovata differs, likely due to incomplete lineage sorting. Interspecific gene flow events include C. bungei s.s. with vectors of inheritance from C. duclouxii and C. fargesii, supporting a combination of these three species and recognizing a broadly circumscribed C. bungei s.l. Our biogeographic study suggests three main dispersal events, two of which occurred during the Oligocene. The first dispersal event occurred from southwestern North America and Mexico into the Greater Antilles giving rise to the ancestor of the section of Macrocatalpa. The second dispersal event also occurred from southwestern North America and Mexico, but led to central and northern North America, subsequently reaching China through the Bering land bridge, and also reaching Europe through the North Atlantic land bridge. The third dispersal event took place in the Miocene from China to North America and gave rise to a clade composed of C. bignonioides and C. speciosa. This study uses a phylogenomic approach and biogeographical methods to infer the evolutionary history of Catalpa, highlighting issues associated with gene tree discordance, and suggesting that incomplete lineage sorting likely played an important role in the evolutionary history of Catalpa.
Collapse
Affiliation(s)
- Wenpan Dong
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, China.
| | - Yanlei Liu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Enze Li
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Chao Xu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Jiahui Sun
- State Key Laboratory Breeding Base of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China
| | - Wenying Li
- Institute of Forestry New Technologies, Chinese Academy of Forestry, Beijing 100091, China
| | - Shiliang Zhou
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Zhixiang Zhang
- Laboratory of Systematic Evolution and Biogeography of Woody Plants, School of Ecology and Nature Conservation, Beijing Forestry University, Beijing 100083, China
| | - Zhili Suo
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
| |
Collapse
|
8
|
Ke F, Vasseur L, Yi H, Yang L, Wei X, Wang B, Kang M. Gene flow, linked selection, and divergent sorting of ancient polymorphism shape genomic divergence landscape in a group of edaphic specialists. Mol Ecol 2021; 31:104-118. [PMID: 34664755 DOI: 10.1111/mec.16226] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 10/07/2021] [Accepted: 10/11/2021] [Indexed: 12/24/2022]
Abstract
Interpreting the formation of genomic variation landscape, especially genomic regions with elevated differentiation (i.e. islands), is fundamental to a better understanding of the genomic consequences of adaptation and speciation. Edaphic islands provide excellent systems for understanding the interplay of gene flow and selection in driving population divergence and speciation. However, discerning the relative contribution of these factors that modify patterns of genomic variation remains difficult. We analysed 132 genomes from five recently divergent species in Primulina genus, with four species distributed in Karst limestone habitats and the fifth one growing in Danxia habitats. We demonstrated that both gene flow and linked selection have contributed to genome-wide variation landscape, where genomic regions with elevated differentiation (i.e., islands) were largely derived by divergent sorting of ancient polymorphism. Specifically, we identified several lineage-specific genomic islands that might have facilitated adaptation of P. suichuanensis to Danxia habitats. Our study is amongst the first cases disentangling evolutionary processes that shape genomic variation of plant specialists, and demonstrates the important role of ancient polymorphism in the formation of genomic islands that potentially mediate adaptation and speciation of endemic plants in special soil habitats.
Collapse
Affiliation(s)
- Fushi Ke
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
| | - Liette Vasseur
- Department of Biological Sciences, Brock University, St. Catharines, Ontario, Canada
| | - Huiqin Yi
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lihua Yang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China
| | - Xiao Wei
- Guangxi Institute of Botany, Guangxi Zhuang Autonomous Region and the Chinese Academy of Sciences, Guilin, China
| | - Baosheng Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China.,Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, China
| | - Ming Kang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, China.,Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, China
| |
Collapse
|
9
|
Unravelling hybridization in Phytophthora using phylogenomics and genome size estimation. IMA Fungus 2021; 12:16. [PMID: 34193315 PMCID: PMC8246709 DOI: 10.1186/s43008-021-00068-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2020] [Accepted: 05/23/2021] [Indexed: 02/06/2023] Open
Abstract
The genus Phytophthora comprises many economically and ecologically important plant pathogens. Hybrid species have previously been identified in at least six of the 12 phylogenetic clades. These hybrids can potentially infect a wider host range and display enhanced vigour compared to their progenitors. Phytophthora hybrids therefore pose a serious threat to agriculture as well as to natural ecosystems. Early and correct identification of hybrids is therefore essential for adequate plant protection but this is hampered by the limitations of morphological and traditional molecular methods. Identification of hybrids is also important in evolutionary studies as the positioning of hybrids in a phylogenetic tree can lead to suboptimal topologies. To improve the identification of hybrids we have combined genotyping-by-sequencing (GBS) and genome size estimation on a genus-wide collection of 614 Phytophthora isolates. Analyses based on locus- and allele counts and especially on the combination of species-specific loci and genome size estimations allowed us to confirm and characterize 27 previously described hybrid species and discover 16 new hybrid species. Our method was also valuable for species identification at an unprecedented resolution and further allowed correct naming of misidentified isolates. We used both a concatenation- and a coalescent-based phylogenomic method to construct a reliable phylogeny using the GBS data of 140 non-hybrid Phytophthora isolates. Hybrid species were subsequently connected to their progenitors in this phylogenetic tree. In this study we demonstrate the application of two validated techniques (GBS and flow cytometry) for relatively low cost but high resolution identification of hybrids and their phylogenetic relations.
Collapse
|
10
|
Bourgeois YXC, Warren BH. An overview of current population genomics methods for the analysis of whole-genome resequencing data in eukaryotes. Mol Ecol 2021; 30:6036-6071. [PMID: 34009688 DOI: 10.1111/mec.15989] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Revised: 04/26/2021] [Accepted: 05/11/2021] [Indexed: 01/01/2023]
Abstract
Characterizing the population history of a species and identifying loci underlying local adaptation is crucial in functional ecology, evolutionary biology, conservation and agronomy. The constant improvement of high-throughput sequencing techniques has facilitated the production of whole genome data in a wide range of species. Population genomics now provides tools to better integrate selection into a historical framework, and take into account selection when reconstructing demographic history. However, this improvement has come with a profusion of analytical tools that can confuse and discourage users. Such confusion limits the amount of information effectively retrieved from complex genomic data sets, and impairs the diffusion of the most recent analytical tools into fields such as conservation biology. It may also lead to redundancy among methods. To address these isssues, we propose an overview of more than 100 state-of-the-art methods that can deal with whole genome data. We summarize the strategies they use to infer demographic history and selection, and discuss some of their limitations. A website listing these methods is available at www.methodspopgen.com.
Collapse
Affiliation(s)
| | - Ben H Warren
- Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, UA, CP 51, Paris, France
| |
Collapse
|
11
|
Dibaeinia P, Tabe-Bordbar S, Warnow T. FASTRAL: Improving scalability of phylogenomic analysis. Bioinformatics 2021; 37:2317-2324. [PMID: 33576396 PMCID: PMC8388037 DOI: 10.1093/bioinformatics/btab093] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 02/02/2021] [Accepted: 02/04/2021] [Indexed: 01/22/2023] Open
Abstract
MOTIVATION ASTRAL is the current leading method for species tree estimation from phylogenomic datasets (i.e., hundreds to thousands of genes) that addresses gene tree discord resulting from incomplete lineage sorting (ILS). ASTRAL is statistically consistent under the multi-locus coalescent model (MSC), runs in polynomial time, and is able to run on large datasets. Key to ASTRAL's algorithm is the use of dynamic programming to find an optimal solution to the MQSST (maximum quartet support supertree) within a constraint space that it computes from the input. Yet, ASTRAL can fail to complete within reasonable timeframes on large datasets with many genes and species, because in these cases the constraint space it computes is too large. RESULTS Here we introduce FASTRAL, a phylogenomic estimation method. FASTRAL is based on ASTRAL, but uses a different technique for constructing the constraint space. The technique we use to define the constraint space maintains statistical consistency and is polynomial time; thus we prove that FASTRAL is a polynomial time algorithm that is statistically consistent under the MSC. Our performance study on both biological and simulated data sets demonstrates that FASTRAL matches or improves on ASTRAL with respect to species tree topology accuracy (and under high ILS conditions it is statistically significantly more accurate), while being dramatically faster-especially on datasets with large numbers of genes and high ILS-due to using a significantly smaller constraint space. AVAILABILITY FASTRAL is available in open-source form at https://github.com/PayamDiba/FASTRAL. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Payam Dibaeinia
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA
| | - Shayan Tabe-Bordbar
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA
| | - Tandy Warnow
- Department of Computer Science, University of Illinois, Urbana, IL 61801, USA,To whom correspondence should be addressed.
| |
Collapse
|
12
|
Le T, Sy A, Molloy EK, Zhang Q, Rao S, Warnow T. Using Constrained-INC for Large-Scale Gene Tree and Species Tree Estimation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2-15. [PMID: 32750844 DOI: 10.1109/tcbb.2020.2990867] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Incremental tree building (INC) is a new phylogeny estimation method that has been proven to be absolute fast converging under standard sequence evolution models. A variant of INC, called Constrained-INC, is designed for use in divide-and-conquer pipelines for phylogeny estimation where a set of species is divided into disjoint subsets, trees are computed on the subsets using a selected base method, and then the subset trees are combined together. We evaluate the accuracy of INC and Constrained-INC for gene tree and species tree estimation on simulated datasets, and compare it to similar pipelines using NJMerge (another method that merges disjoint trees). For gene tree estimation, we find that INC has very poor accuracy in comparison to standard methods, and even Constrained-INC(using maximum likelihood methods to compute constraint trees) does not match the accuracy of the better maximum likelihood methods. Results for species trees are somewhat different, with Constrained-INC coming close to the accuracy of the best species tree estimation methods, while being much faster; furthermore, using Constrained-INC allows species tree estimation methods to scale to large datasets within limited computational resources. Overall, this study exposes the benefits and limitations of divide-and-conquer strategies for large-scale phylogenetic tree estimation.
Collapse
|
13
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Collapse
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
| |
Collapse
|
14
|
Holding ML, Sovic MG, Colston TJ, Gibbs HL. The scales of coevolution: comparative phylogeography and genetic demography of a locally adapted venomous predator and its prey. Biol J Linn Soc Lond 2020. [DOI: 10.1093/biolinnean/blaa192] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
Abstract
Coevolutionary theory predicts that differences in the genetic demography of interacting species can influence patterns of local adaptation by affecting the potential of local populations to respond to selection. We conducted a comparative phylogeographical study of venomous rattlesnakes and their venom-resistant ground squirrel prey across California, and assessed how effective population size (Ne) estimates correspond with a previously documented pattern of rattlesnake local adaptation. Using RAD sequencing markers, we detected lineage relationships among both the rattlesnakes (Crotalus oreganus ssp.) and ground squirrels (Otospermophilus sp.) that are incongruent with previous phylogenetic hypotheses. Both rattlesnakes and squirrels share a deep divergence at the Sacramento–San Joaquin River Delta. At this broad phylogeographical scale, we found that the locally adapted rattlesnakes had higher Ne than squirrels. At the population scale, snakes also had larger Ne accompanied by larger values of several metrics of population genetic diversity. However, the specific magnitude of local adaptation of venom activity to ground squirrel venom resistance was not significantly correlated with local differences in Ne or other diversity statistics between predator and prey populations, suggesting that other factors in the geographic mosaic of coevolution contribute to the specific local-scale outcomes of this interaction. These results suggest an evolutionary mechanism that may explain some (but clearly not all) of rattlesnake local adaptation in this coevolutionary interaction – larger population sizes raise the adaptive potential of rattlesnakes compared to ground squirrels.
Collapse
Affiliation(s)
- Matthew L Holding
- Department of Evolution, Ecology, and Organismal Biology, Ohio State University, Columbus, OH, USA
- Department of Biological Science, Florida State University, Tallahassee, FL, USA
| | - Michael G Sovic
- Department of Evolution, Ecology, and Organismal Biology, Ohio State University, Columbus, OH, USA
| | - Timothy J Colston
- Department of Biological Science, Florida State University, Tallahassee, FL, USA
| | - H Lisle Gibbs
- Department of Evolution, Ecology, and Organismal Biology, Ohio State University, Columbus, OH, USA
| |
Collapse
|
15
|
Murphy WJ, Foley NM, Bredemeyer KR, Gatesy J, Springer MS. Phylogenomics and the Genetic Architecture of the Placental Mammal Radiation. Annu Rev Anim Biosci 2020; 9:29-53. [PMID: 33228377 DOI: 10.1146/annurev-animal-061220-023149] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The genomes of placental mammals are being sequenced at an unprecedented rate. Alignments of hundreds, and one day thousands, of genomes spanning the rich living and extinct diversity of species offer unparalleled power to resolve phylogenetic controversies, identify genomic innovations of adaptation, and dissect the genetic architecture of reproductive isolation. We highlight outstanding questions about the earliest phases of placental mammal diversification and the promise of newer methods, as well as remaining challenges, toward using whole genome data to resolve placental mammal phylogeny. The next phase of mammalian comparative genomics will see the completion and application of finished-quality, gapless genome assemblies from many ordinal lineages and closely related species. Interspecific comparisons between the most hypervariable genomic loci will likely reveal large, but heretofore mostly underappreciated, effects on population divergence, morphological innovation, and the origin of new species.
Collapse
Affiliation(s)
- William J Murphy
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Nicole M Foley
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - Kevin R Bredemeyer
- Veterinary Integrative Biosciences, Texas A&M University, College Station, Texas 77843, USA;
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
| | - Mark S Springer
- Department of Evolution, Ecology and Organismal Biology, University of California, Riverside, California 92521, USA
| |
Collapse
|
16
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Das I, Brown RM. Gene flow creates a mirage of cryptic species in a Southeast Asian spotted stream frog complex. Mol Ecol 2020; 29:3970-3987. [PMID: 32808335 DOI: 10.1111/mec.15603] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Revised: 07/29/2020] [Accepted: 08/13/2020] [Indexed: 02/06/2023]
Abstract
Most new cryptic species are described using conventional tree- and distance-based species delimitation methods (SDMs), which rely on phylogenetic arrangements and measures of genetic divergence. However, although numerous factors such as population structure and gene flow are known to confound phylogenetic inference and species delimitation, the influence of these processes is not frequently evaluated. Using large numbers of exons, introns, and ultraconserved elements obtained using the FrogCap sequence-capture protocol, we compared conventional SDMs with more robust genomic analyses that assess population structure and gene flow to characterize species boundaries in a Southeast Asian frog complex (Pulchrana picturata). Our results showed that gene flow and introgression can produce phylogenetic patterns and levels of divergence that resemble distinct species (up to 10% divergence in mitochondrial DNA). Hybrid populations were inferred as independent (singleton) clades that were highly divergent from adjacent populations (7%-10%) and unusually similar (<3%) to allopatric populations. Such anomalous patterns are not uncommon in Southeast Asian amphibians, which brings into question whether the high levels of cryptic diversity observed in other amphibian groups reflect distinct cryptic species-or, instead, highly admixed and structured metapopulation lineages. Our results also provide an alternative explanation to the conundrum of divergent (sometimes nonsister) sympatric lineages-a pattern that has been celebrated as indicative of true cryptic speciation. Based on these findings, we recommend that species delimitation of continuously distributed "cryptic" groups should not rely solely on conventional SDMs, but should necessarily examine population structure and gene flow to avoid taxonomic inflation.
Collapse
Affiliation(s)
- Kin O Chan
- Lee Kong Chian National History Museum, Faculty of Science, National University of Singapore, Singapore
| | - Carl R Hutter
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA.,Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA, USA
| | - Perry L Wood
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA.,Department of Biological Sciences & Museum of Natural History, Auburn University, Auburn, AL, USA
| | - L L Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, Riverside, CA, USA
| | - Indraneil Das
- Institute of Biodiversity and Environmental Conservation, Universiti Malaysia Sarawak, Kota Samarahan, Sarawak, Malaysia
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, USA
| |
Collapse
|
17
|
Chan KO, Hutter CR, Wood PL, Grismer LL, Brown RM. Larger, unfiltered datasets are more effective at resolving phylogenetic conflict: Introns, exons, and UCEs resolve ambiguities in Golden-backed frogs (Anura: Ranidae; genus Hylarana). Mol Phylogenet Evol 2020; 151:106899. [PMID: 32590046 DOI: 10.1016/j.ympev.2020.106899] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 05/18/2020] [Accepted: 06/17/2020] [Indexed: 01/01/2023]
Abstract
Using FrogCap, a recently-developed sequence-capture protocol, we obtained >12,000 highly informative exons, introns, and ultraconserved elements (UCEs), which we used to illustrate variation in evolutionary histories of these classes of markers, and to resolve long-standing systematic problems in Southeast Asian Golden-backed frogs of the genus-complex Hylarana. We also performed a comprehensive suite of analyses to assess the relative performance of different genetic markers, data filtering strategies, tree inference methods, and different measures of branch support. To reduce gene tree estimation error, we filtered the data using different thresholds of taxon completeness (missing data) and parsimony informative sites (PIS). We then estimated species trees using concatenated datasets and Maximum Likelihood (IQ-TREE) in addition to summary (ASTRAL-III), distance-based (ASTRID), and site-based (SVDQuartets) multispecies coalescent methods. Topological congruence and branch support were examined using traditional bootstrap, local posterior probabilities, gene concordance factors, quartet frequencies, and quartet scores. Our results did not yield a single concordant topology. Instead, introns, exons, and UCEs clearly possessed different phylogenetic signals, resulting in conflicting, yet strongly-supported phylogenetic estimates. However, a combined analysis comprising the most informative introns, exons, and UCEs converged on a similar topology across all analyses, with the exception of SVDQuartets. Bootstrap values were consistently high despite high levels of incongruence and high proportions of gene trees supporting conflicting topologies. Although low bootstrap values did indicate low heuristic support, high bootstrap support did not necessarily reflect congruence or support for the correct topology. This study reiterates findings of some previous studies, which demonstrated that traditional bootstrap values can produce positively misleading measures of support in large phylogenomic datasets. We also showed a remarkably strong positive relationship between branch length and topological congruence across all datasets, implying that very short internodes remain a challenge to resolve, even with orders of magnitude more data than ever before. Overall, our results demonstrate that more data from unfiltered or combined datasets produced superior results. Although data filtering reduced gene tree incongruence, decreased amounts of data also biased phylogenetic estimation. A point of diminishing returns was evident, at which higher congruence (from more stringent filtering) at the expense of amount of data led to topological error as assessed by comparison to more complete datasets across different genomic markers. Additionally, we showed that applying a parameter-rich model to a partitioned analysis of concatenated data produces better results compared to unpartitioned, or even partitioned analysis using model selection. Despite some lingering uncertainties, a combined analysis of our genomic data and sequences supplemented from GenBank (on the basis of a few gene regions) revealed highly supported novel systematic arrangements. Based on these new findings, we transfer Amnirana nicobariensis into the genus Indosylvirana; and I. milleti and Hylarana celebensis to the genus Papurana. We also provisionally place H. attigua in the genus Papurana pending verification from positively identified (voucher substantiated) samples.
Collapse
Affiliation(s)
- Kin Onn Chan
- Lee Kong Chian National History Museum, Faculty of Science, National University of Singapore, 2 Conservatory Drive, 117377, Singapore.
| | - Carl R Hutter
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA; Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Perry L Wood
- Museum of Natural Sciences and Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA; Department of Biological Sciences & Museum of Natural History, Auburn University, Auburn, AL 36849, USA
| | - L Lee Grismer
- Herpetology Laboratory, Department of Biology, La Sierra University, 4500 Riverwalk Parkway, Riverside, CA 92505, USA
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS 66045, USA
| |
Collapse
|
18
|
Abstract
Background To account for genome-wide discordance among gene trees, several widely-used methods seek to find a species tree with the minimum distance to input gene trees. To efficiently explore the large space of species trees, some of these methods, including ASTRAL, use dynamic programming (DP). The DP paradigm can restrict the search space, and thus, ASTRAL and similar methods use heuristic methods to define a restricted search space. However, arbitrary constraints provided by the user on the output tree cannot be trivially incorporated into such restrictions. The ability to infer trees that honor user-defined constraints is needed for many phylogenetic analyses, but no solution currently exists for constraining the output of ASTRAL. Results We introduce methods that enable the ASTRAL dynamic programming to infer constrained trees in an effective and scalable manner. To do so, we adopt a recently developed tree completion algorithm and extend it to allow multifurcating input and output trees. In simulation studies, we show that the approach for honoring constraints is both effective and fast. On real data, we show that constrained searches can help interrogate branches not recovered in the optimal ASTRAL tree to reveal support for alternative hypotheses. Conclusions The new algorithm is added ASTRAL to all user-provided constraints on the species tree.
Collapse
Affiliation(s)
- Maryam Rabiee
- Department of Computer Science and Engineering, UC San Diego, 9500 Gilman Dr, La Jolla, 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, UC San Diego, 9500 Gilman Dr, La Jolla, 92093, USA.
| |
Collapse
|
19
|
Springer MS, Molloy EK, Sloan DB, Simmons MP, Gatesy J. ILS-Aware Analysis of Low-Homoplasy Retroelement Insertions: Inference of Species Trees and Introgression Using Quartets. J Hered 2019; 111:147-168. [DOI: 10.1093/jhered/esz076] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 12/12/2019] [Indexed: 12/20/2022] Open
Abstract
Abstract
DNA sequence alignments have provided the majority of data for inferring phylogenetic relationships with both concatenation and coalescent methods. However, DNA sequences are susceptible to extensive homoplasy, especially for deep divergences in the Tree of Life. Retroelement insertions have emerged as a powerful alternative to sequences for deciphering evolutionary relationships because these data are nearly homoplasy-free. In addition, retroelement insertions satisfy the “no intralocus-recombination” assumption of summary coalescent methods because they are singular events and better approximate neutrality relative to DNA loci commonly sampled in phylogenomic studies. Retroelements have traditionally been analyzed with parsimony, distance, and network methods. Here, we analyze retroelement data sets for vertebrate clades (Placentalia, Laurasiatheria, Balaenopteroidea, Palaeognathae) with 2 ILS-aware methods that operate by extracting, weighting, and then assembling unrooted quartets into a species tree. The first approach constructs a species tree from retroelement bipartitions with ASTRAL, and the second method is based on split-decomposition with parsimony. We also develop a Quartet-Asymmetry test to detect hybridization using retroelements. Both ILS-aware methods recovered the same species-tree topology for each data set. The ASTRAL species trees for Laurasiatheria have consecutive short branch lengths in the anomaly zone whereas Palaeognathae is outside of this zone. For the Balaenopteroidea data set, which includes rorquals (Balaenopteridae) and gray whale (Eschrichtiidae), both ILS-aware methods resolved balaeonopterids as paraphyletic. Application of the Quartet-Asymmetry test to this data set detected 19 different quartets of species for which historical introgression may be inferred. Evidence for introgression was not detected in the other data sets.
Collapse
Affiliation(s)
- Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
| | - Erin K Molloy
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO
| | - Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO
| | - John Gatesy
- Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY
| |
Collapse
|
20
|
Martín-Hernanz S, Aparicio A, Fernández-Mazuecos M, Rubio E, Reyes-Betancort JA, Santos-Guerra A, Olangua-Corral M, Albaladejo RG. Maximize Resolution or Minimize Error? Using Genotyping-By-Sequencing to Investigate the Recent Diversification of Helianthemum (Cistaceae). FRONTIERS IN PLANT SCIENCE 2019; 10:1416. [PMID: 31781140 PMCID: PMC6859804 DOI: 10.3389/fpls.2019.01416] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 10/11/2019] [Indexed: 05/27/2023]
Abstract
A robust phylogenetic framework, in terms of extensive geographical and taxonomic sampling, well-resolved species relationships and high certainty of tree topologies and branch length estimations, is critical in the study of macroevolutionary patterns. Whereas Sanger sequencing-based methods usually recover insufficient phylogenetic signal, especially in recently diversified lineages, reduced-representation sequencing methods tend to provide well-supported phylogenetic relationships, but usually entail remarkable bioinformatic challenges due to the inherent trade-off between the number of SNPs and the magnitude of associated error rates. The genus Helianthemum (Cistaceae) is a species-rich and taxonomically complex Palearctic group of plants that diversified mainly since the Upper Miocene. It is a challenging case study since previous attempts using Sanger sequencing were unable to resolve the intrageneric phylogenetic relationships. Aiming to obtain a robust phylogenetic reconstruction based on genotyping-by-sequencing (GBS), we established a rigorous methodological workflow in which we i) explored how variable settings during dataset assembly have an impact on error rates and on the degree of resolution under concatenation and coalescent approaches, ii) assessed the effect of two extreme parameter configurations (minimizing error rates vs. maximizing phylogenetic resolution) on tree topology and branch lengths, and iii) evaluated the effects of these two configurations on estimates of divergence times and diversification rates. Our analyses produced highly supported topologically congruent phylogenetic trees for both configurations. However, minimizing error rates did produce more reliable branch lengths, critically affecting the accuracy of downstream analyses (i.e. divergence times and diversification rates). In addition to recommending a revision of intrageneric systematics, our results enabled us to identify three highly diversified lineages in Helianthemum in contrasting geographical areas and ecological conditions, which started radiating in the Upper Miocene.
Collapse
Affiliation(s)
- Sara Martín-Hernanz
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | - Abelardo Aparicio
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | | | - Encarnación Rubio
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| | - J. Alfredo Reyes-Betancort
- Jardín de Aclimatación de la Orotava, Instituto Canario de Investigaciones Agrarias (ICIA), Santa Cruz de Tenerife, Spain
| | - Arnoldo Santos-Guerra
- Jardín de Aclimatación de la Orotava, Instituto Canario de Investigaciones Agrarias (ICIA), Santa Cruz de Tenerife, Spain
| | - María Olangua-Corral
- Departamento de Biología Reproductiva y Micro-morfología, Jardín Botánico Canario ‘Viera y Clavijo’—Unidad Asociada CSIC (Cabildo de Gran Canaria), Las Palmas de Gran Canaria, Spain
| | - Rafael G. Albaladejo
- Departamento de Biología Vegetal y Ecología, Universidad de Sevilla, Sevilla, Spain
| |
Collapse
|