1
|
Tims AR, Unmack PJ, Hammer MP, Brown C, Adams M, McGee MD. Museum Genomics Reveals the Hybrid Origin of an Extinct Crater Lake Endemic. Syst Biol 2024; 73:506-520. [PMID: 38597146 PMCID: PMC11377190 DOI: 10.1093/sysbio/syae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 03/13/2024] [Accepted: 04/09/2024] [Indexed: 04/11/2024] Open
Abstract
Crater lake fishes are common evolutionary model systems, with recent studies suggesting a key role for gene flow in promoting rapid adaptation and speciation. However, the study of these young lakes can be complicated by human-mediated extinctions. Museum genomics approaches integrating genetic data from recently extinct species are, therefore, critical to understanding the complex evolutionary histories of these fragile systems. Here, we examine the evolutionary history of an extinct Southern Hemisphere crater lake endemic, the rainbowfish Melanotaenia eachamensis. We undertook a comprehensive sampling of extant rainbowfish populations of the Atherton Tablelands of Australia alongside historical museum material to understand the evolutionary origins of the extinct crater lake population and the dynamics of gene flow across the ecoregion. The extinct crater lake species is genetically distinct from all other nearby populations due to historic introgression between 2 proximate riverine lineages, similar to other prominent crater lake speciation systems, but this historic gene flow has not been sufficient to induce a species flock. Our results suggest that museum genomics approaches can be successfully combined with extant sampling to unravel complex speciation dynamics involving recently extinct species.
Collapse
Affiliation(s)
- Amy R Tims
- School of Biological Sciences, Monash University, Melbourne, Victoria 3800, Australia
- School of Natural Sciences, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Peter J Unmack
- School of Biological Sciences, Monash University, Melbourne, Victoria 3800, Australia
- Centre for Applied Water Science, Institute for Applied Ecology, University of Canberra, Australian Capital Territory 2601, Australia
| | - Michael P Hammer
- Museum and Art Gallery of the Northern Territory, Darwin, Northern Territory 0801, Australia
| | - Culum Brown
- School of Natural Sciences, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Mark Adams
- Evolutionary Biology Unit, South Australian Museum, North Terrace, Adelaide, South Australia 5000, Australia
- School of Biological Sciences, The University of Adelaide, Adelaide, South Australia 5005, Australia
| | - Matthew D McGee
- School of Biological Sciences, Monash University, Melbourne, Victoria 3800, Australia
| |
Collapse
|
2
|
Hao X, Lu Q, Zhao H. A molecular phylogeny for all 21 families within Chiroptera (bats). Integr Zool 2024; 19:989-998. [PMID: 37853557 DOI: 10.1111/1749-4877.12772] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
Bats, members of the Chiroptera order, rank as the second most diverse group among mammals. Recent molecular systematic studies on bats have successfully classified 21 families within two suborders: Yinpterochiroptera and Yangochiroptera. Nevertheless, the phylogeny within these 21 families has remained a subject of controversy. In this study, we have employed a balanced approach to establish a robust family-level phylogenetic hypothesis for bats, utilizing a more comprehensive molecular dataset. This dataset includes representative species from all 21 bat families, resulting in a reduced level of missing genetic information. The resulting phylogenetic tree comprises 21 lineages that are strongly supported, each corresponding to one of the bat families. Our findings support to place the Emballonuroidea superfamily as the basal lineage of Yangochiroptera, and that Myzopodidae should be situated as a basal lineage of Emballonuroidea, forming a sister relationship with the clade consisting of Nycteridae and Emballonuridae. Finally, we have conducted dating analyses on this newly resolved phylogenetic tree, providing divergence times for each bat family. Collectively, our study has employed a relatively comprehensive molecular dataset to establish a more robust phylogeny encompassing all 21 bat families. This improved phylogenetic framework will significantly contribute to our understanding of evolutionary processes, ecological roles, disease dynamics, and biodiversity conservation in the realm of bats.
Collapse
Affiliation(s)
- Xiangyu Hao
- College of Life Sciences, Wuhan University, Wuhan, Hubei, China
- College of Plant Protection, Northwest A&F University, Yangling, Shaanxi, China
| | - Qin Lu
- College of Life Sciences, Wuhan University, Wuhan, Hubei, China
| | - Huabin Zhao
- College of Life Sciences, Wuhan University, Wuhan, Hubei, China
| |
Collapse
|
3
|
Myburgh AM, Barnes A, Henriques R, Daniels SR. Congruent patterns of cryptic cladogenesis revealed using RADseq and Sanger sequencing in a velvet worm species complex (Onychophora: Peripatopsidae: Peripatopsis sedgwicki). Mol Phylogenet Evol 2024; 198:108132. [PMID: 38909874 DOI: 10.1016/j.ympev.2024.108132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 05/24/2024] [Accepted: 06/15/2024] [Indexed: 06/25/2024]
Abstract
In the present study, first generation DNA sequencing (mitochondrial cytochrome c oxidase subunit one, COI) and reduced-representative genomic RADseq data were used to understand the patterns and processes of diversification of the velvet worm, Peripatopsis sedgwicki species complex across its distribution range in South Africa. For the RADseq data, three datasets (two primary and one supplementary) were generated corresponding to 1,259-11,468 SNPs, in order to assess the diversity and phylogeography of the species complex. Tree topologies for the two primary datasets were inferred using maximum likelihood and Bayesian inferences methods. Phylogenetic analyses using the COI datasets retrieved four distinct, well-supported clades within the species complex. Five species delimitation methods applied to the COI data (ASAP, bPTP, bGMYC, STACEY and iBPP) all showed support for the distinction of the Fort Fordyce Nature Reserve specimens. In the main P. sedgwicki species complex, the species delimitation methods revealed a variable number of operational taxonomic units and overestimated the number of putative taxa. Divergence time estimates coupled with the geographic exclusivity of species and phylogeographic results suggest recent cladogenesis during the Plio/Pleistocene. The RADseq data were subjected to a principal components analysis and a discriminant analysis of principal components, under a maximum-likelihood framework. The latter results corroborate the four main clades observed using the COI data, however, applying additional filtering revealed additional diversity. The high overall congruence observed between the RADseq data and COI data suggest that first generation sequence data remain a cheap and effective method for evolutionary studies, although RADseq does provide a far greater resolution of contemporary temporo-spatial patterns.
Collapse
Affiliation(s)
- Angus Macgregor Myburgh
- Department of Botany and Zoology, Private Bag X1, Stellenbosch University, 7602, South Africa
| | - Aaron Barnes
- Department of Botany and Zoology, Private Bag X1, Stellenbosch University, 7602, South Africa
| | - Romina Henriques
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, South Africa
| | - Savel R Daniels
- Department of Botany and Zoology, Private Bag X1, Stellenbosch University, 7602, South Africa.
| |
Collapse
|
4
|
Kokkonen AL, Searle PC, Shiozawa DK, Evans RP. Using de novo transcriptomes to decipher the relationships in cutthroat trout subspecies ( Oncorhynchus clarkii). Evol Appl 2024; 17:e13735. [PMID: 39006004 PMCID: PMC11239772 DOI: 10.1111/eva.13735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 05/18/2024] [Accepted: 05/27/2024] [Indexed: 07/16/2024] Open
Abstract
For almost 200 years, the taxonomy of cutthroat trout (Oncorhynchus clarkii), a salmonid native to Western North America, has been in flux as ichthyologists and fisheries biologists have tried to describe the diversity within these fishes. Starting in the 1950s, Robert Behnke reexamined the cutthroat trout and identified 14 subspecies based on morphological traits, Pleistocene events, and modern geographic ranges. His designations became instrumental in recognizing and preserving the remaining diversity of cutthroat trout. Over time, molecular techniques (i.e. karyotypes, allozymes, mitochondrial DNA, SNPs, and microsatellite arrays) have largely reinforced Behnke's phylogenies, but have also revealed that some relationships are consistently weakly supported. To further resolve these relationships, we generated de novo transcriptomes for nine cutthroat subspecies, as well as a Bear River Bonneville form and two Colorado River lineages (blue and green). We present phylogenies of these subspecies generated from multiple sets of orthologous genes extracted from our transcriptomes. We confirm many of the relationships identified in previous morphological and molecular studies, as well as discuss the importance of significant differences apparent in our phylogenies from these studies within a geological perspective. Specific findings include three distinct clades: (1) Bear River Bonneville form and Yellowstone cutthroat trout; (2) Bonneville cutthroat trout (n = 2); and (3) Greenback and Rio Grande cutthroat trout. We also identify potential gene transfer between Bonneville cutthroat trout and a population of Colorado River green lineage cutthroat trout. Using these findings, it appears that additional groups warrant species-level consideration if other recent species elevations are retained.
Collapse
Affiliation(s)
- Andrea L. Kokkonen
- Department of Microbiology and Molecular BiologyBrigham Young UniversityProvoUtahUSA
| | - Peter C. Searle
- Department of Ecology and Evolutionary BiologyCornell UniversityIthacaNew YorkUSA
| | | | - R. Paul Evans
- Department of Microbiology and Molecular BiologyBrigham Young UniversityProvoUtahUSA
| |
Collapse
|
5
|
Islam R, Rahman A. An alignment-free method for detection of missing regions for phylogenetic analysis. Heliyon 2024; 10:e32227. [PMID: 38933968 PMCID: PMC11200290 DOI: 10.1016/j.heliyon.2024.e32227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2024] [Revised: 05/17/2024] [Accepted: 05/29/2024] [Indexed: 06/28/2024] Open
Abstract
Phylogenetic tree estimation using conventional approaches usually requires pairwise or multiple sequence alignment. However, sequence alignment has difficulties related to scalability and accuracy in case of long sequences such as whole genomes, low sequence identity, and in presence of genomic rearrangements. To address these issues, alignment-free approaches have been proposed. While these methods have demonstrated promising results, many of these lead to errors when regions are missing from the sequences of one or more species that are trivially detected in alignment-based methods. Here, we present an alignment-free method for detecting missing regions in sequences of species for which phylogeny is to be estimated. It is based on counts of k-mers and can be used to filter out k-mers belonging to regions in one species that are missing in one or more of the other species. We perform experiments with real and simulated datasets containing missing regions and find that it can successfully detect a large fraction of such k-mers and can lead to improvements in the estimated phylogenies. Our method can be used in k-mer based alignment-free phylogeny estimation methods to filter out k-mers corresponding to missing regions.
Collapse
Affiliation(s)
- Rubyeat Islam
- Department of Computer Science and Engineering, Military Institute of Science and Technology, Dhaka, Bangladesh
| | - Atif Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| |
Collapse
|
6
|
Zhang G, Yang J, Zhang C, Jiao B, Panero JL, Cai J, Zhang ZR, Gao LM, Gao T, Ma H. Nuclear phylogenomics of Asteraceae with increased sampling provides new insights into convergent morphological and molecular evolution. PLANT COMMUNICATIONS 2024; 5:100851. [PMID: 38409784 PMCID: PMC11211554 DOI: 10.1016/j.xplc.2024.100851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 01/22/2024] [Accepted: 02/21/2024] [Indexed: 02/28/2024]
Abstract
Convergent morphological evolution is widespread in flowering plants, and understanding this phenomenon relies on well-resolved phylogenies. Nuclear phylogenetic reconstruction using transcriptome datasets has been successful in various angiosperm groups, but it is limited to taxa with available fresh materials. Asteraceae, which are one of the two largest angiosperm families and are important for both ecosystems and human livelihood, show multiple examples of convergent evolution. Nuclear Asteraceae phylogenies have resolved relationships among most subfamilies and many tribes, but many phylogenetic and evolutionary questions regarding subtribes and genera remain, owing to limited sampling. Here, we increased the sampling for Asteraceae phylogenetic reconstruction using transcriptomes and genome-skimming datasets and produced nuclear phylogenetic trees with 706 species representing two-thirds of recognized subtribes. Ancestral character reconstruction supports multiple convergent evolutionary events in Asteraceae, with gains and losses of bilateral floral symmetry correlated with diversification of some subfamilies and smaller groups, respectively. Presence of the calyx-related pappus may have been especially important for the success of some subtribes and genera. Molecular evolutionary analyses support the likely contribution of duplications of MADS-box and TCP floral regulatory genes to innovations in floral morphology, including capitulum inflorescences and bilaterally symmetric flowers, potentially promoting the diversification of Asteraceae. Subsequent divergences and reductions in CYC2 gene expression are related to the gain and loss of zygomorphic flowers. This phylogenomic work with greater taxon sampling through inclusion of genome-skimming datasets reveals the feasibility of expanded evolutionary analyses using DNA samples for understanding convergent evolution.
Collapse
Affiliation(s)
- Guojin Zhang
- College of Life Sciences, Hunan Normal University, Changsha, Hunan 410081, China; Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, State College, PA 16801, USA; State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - Junbo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Caifei Zhang
- Wuhan Botanical Garden and Sino-Africa Joint Research Center, Chinese Academy of Sciences, Wuhan 430074, China
| | - Bohan Jiao
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
| | - José L Panero
- Department of Integrative Biology, University of Texas, Austin, TX 78712, USA
| | - Jie Cai
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Zhi-Rong Zhang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
| | - Lian-Ming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China; Lijiang National Forest Biodiversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, Yunnan 674100, China.
| | - Tiangang Gao
- State Key Laboratory of Plant Diversity and Specialty Crops, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
| | - Hong Ma
- Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, State College, PA 16801, USA.
| |
Collapse
|
7
|
Rincón-Barrado M, Villaverde T, Perez MF, Sanmartín I, Riina R. The sweet tabaiba or there and back again: phylogeographical history of the Macaronesian Euphorbia balsamifera. ANNALS OF BOTANY 2024; 133:883-904. [PMID: 38197716 PMCID: PMC11082519 DOI: 10.1093/aob/mcae001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 03/01/2024] [Indexed: 01/11/2024]
Abstract
BACKGROUND AND AIMS Biogeographical relationships between the Canary Islands and north-west Africa are often explained by oceanic dispersal and geographical proximity. Sister-group relationships between Canarian and eastern African/Arabian taxa, the 'Rand Flora' pattern, are rare among plants and have been attributed to the extinction of north-western African populations. Euphorbia balsamifera is the only representative species of this pattern that is distributed in the Canary Islands and north-west Africa; it is also one of few species present in all seven islands. Previous studies placed African populations of E. balsamifera as sister to the Canarian populations, but this relationship was based on herbarium samples with highly degraded DNA. Here, we test the extinction hypothesis by sampling new continental populations; we also expand the Canarian sampling to examine the dynamics of island colonization and diversification. METHODS Using target enrichment with genome skimming, we reconstructed phylogenetic relationships within E. balsamifera and between this species and its disjunct relatives. A single nucleotide polymorphism dataset obtained from the target sequences was used to infer population genetic diversity patterns. We used convolutional neural networks to discriminate among alternative Canary Islands colonization scenarios. KEY RESULTS The results confirmed the Rand Flora sister-group relationship between western E. balsamifera and Euphorbia adenensis in the Eritreo-Arabian region and recovered an eastern-western geographical structure among E. balsamifera Canarian populations. Convolutional neural networks supported a scenario of east-to-west island colonization, followed by population extinctions in Lanzarote and Fuerteventura and recolonization from Tenerife and Gran Canaria; a signal of admixture between the eastern island and north-west African populations was recovered. CONCLUSIONS Our findings support the Surfing Syngameon Hypothesis for the colonization of the Canary Islands by E. balsamifera, but also a recent back-colonization to the continent. Populations of E. balsamifera from northwest Africa are not the remnants of an ancestral continental stock, but originated from migration events from Lanzarote and Fuerteventura. This is further evidence that oceanic archipelagos are not a sink for biodiversity, but may be a source of new genetic variability.
Collapse
Affiliation(s)
- Mario Rincón-Barrado
- Real Jardín Botánico (RJB), CSIC, Madrid, 28014, Spain
- Centro Nacional de Biotecnología (CNB), CSIC, Madrid, 28049, Spain
| | - Tamara Villaverde
- Universidad Rey Juan Carlos (URJC), Área de Biodiversidad y Conservación, Móstoles, 28933, Spain
| | - Manolo F Perez
- Institut de Systématique, Evolution, Biodiversité (ISYEB – URM 7205 CNRS), Muséum National d’Histoire Naturelle, SU, EPHE & UA, Paris, France
| | | | - Ricarda Riina
- Real Jardín Botánico (RJB), CSIC, Madrid, 28014, Spain
| |
Collapse
|
8
|
Thomé PC, Irisarri I, Wolinska J, Monaghan MT, Strassert JFH. Single-cell genomics reveals new rozellid lineages and supports their sister relationship to Microsporidia. Biol Lett 2023; 19:20230398. [PMID: 38087939 PMCID: PMC10716661 DOI: 10.1098/rsbl.2023.0398] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 11/20/2023] [Indexed: 12/18/2023] Open
Abstract
The phylum Rozellomycota has been proposed for a group of early-branching holomycotan lineages representing obligate parasites and hyperparasites of zoosporic fungi, oomycotes or phytoplankton. Given their predominantly intracellular lifestyle, rozellids are typically known from environmental ribosomal DNA data, except for the well-studied Rozella species. To date, the phylogenetic relationship between rozellids and microsporidians (Microsporidia) is not fully understood and most reliable hypotheses are based on phylogenomic analyses that incorporate the only publicly available rozellid genome of Rozella allomycis. Here, we provide genomic data of three new rozellid lineages obtained by single-cell sequencing from environmental samples and show with a phylogenomic approach that rozellids form a monophyletic group that is sister to microsporidians, corroborating the previously proposed phylum Rozellomycota. Whereas no mitochondrial genes coding for the respiratory Complex I could be found, we discovered a gene coding for a nucleotide phosphate transporter in one of the three draft genomes. The scattered absence of Complex I genes and scattered presence of nucleotide transporter genes across diverse microsporidian and rozellid lineages suggest that these adaptations to a parasitic lifestyle, which reduce the parasite's capability to synthesize ATP but enables it to steal ATP from its host, evolved independently in microsporidians and rozellids.
Collapse
Affiliation(s)
- Pauline C. Thomé
- Department of Evolutionary and Integrative Ecology, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
| | - Iker Irisarri
- Section Phylogenomics, Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Museum of Nature Hamburg, Hamburg, Germany
| | - Justyna Wolinska
- Department of Evolutionary and Integrative Ecology, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
- Institut für Biologie, Freie Universität Berlin, Berlin, Germany
| | - Michael T. Monaghan
- Department of Evolutionary and Integrative Ecology, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
- Institut für Biologie, Freie Universität Berlin, Berlin, Germany
| | - Jürgen F. H. Strassert
- Department of Evolutionary and Integrative Ecology, Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB), Berlin, Germany
| |
Collapse
|
9
|
Tian R, Imanian B. VBCG: 20 validated bacterial core genes for phylogenomic analysis with high fidelity and resolution. MICROBIOME 2023; 11:247. [PMID: 37936197 PMCID: PMC10631056 DOI: 10.1186/s40168-023-01705-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 10/18/2023] [Indexed: 11/09/2023]
Abstract
BACKGROUND Phylogenomic analysis has become an inseparable part of studies of bacterial diversity and evolution, and many different bacterial core genes have been collated and used for phylogenomic tree reconstruction. However, these genes have been selected based on their presence and single-copy ratio in all bacterial genomes, leaving out the gene's 'phylogenetic fidelity' unexamined. RESULTS From 30,522 complete genomes covering 11,262 species, we examined 148 bacterial core genes that have been previously used for phylogenomic analysis. In addition to the gene presence and single-copy rations, we evaluated the gene's phylogenetic fidelity by comparing each gene's phylogeny with its corresponding 16S rRNA gene tree. Out of the 148 bacterial genes, 20 validated bacterial core genes (VBCG) were selected as the core gene set with the highest bacterial phylogenetic fidelity. Compared to the larger gene set, the 20-gene core set resulted in more species having all genes present and fewer species with missing data, thereby enhancing the accuracy of phylogenomic analysis. Using Escherichia coli strains as examples of prominent bacterial foodborne pathogens, we demonstrated that the 20 VBCG produced phylogenies with higher fidelity and resolution at species and strain levels while 16S rRNA gene tree alone could not. CONCLUSION The 20 validated core gene set improves the fidelity and speed of phylogenomic analysis. Among other uses, this tool improves our ability to explore the evolution, typing and tracking of bacterial strains, such as human pathogens. We have developed a Python pipeline and a desktop graphic app (available on GitHub) for users to perform phylogenomic analysis with high fidelity and resolution. Video Abstract.
Collapse
Affiliation(s)
- Renmao Tian
- Institute for Food Safety and Health, Illinois Institute of Technology, Bedford Park, IL, 60501, USA
| | - Behzad Imanian
- Institute for Food Safety and Health, Illinois Institute of Technology, Bedford Park, IL, 60501, USA.
- Food Science and Nutrition Department, Illinois Institute of Technology, Chicago, IL, 60616, USA.
| |
Collapse
|
10
|
Pereira DS, Hilário S, Gonçalves MFM, Phillips AJL. Diaporthe Species on Palms: Molecular Re-Assessment and Species Boundaries Delimitation in the D. arecae Species Complex. Microorganisms 2023; 11:2717. [PMID: 38004729 PMCID: PMC10673533 DOI: 10.3390/microorganisms11112717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 10/25/2023] [Accepted: 11/03/2023] [Indexed: 11/26/2023] Open
Abstract
Due to cryptic diversification, phenotypic plasticity and host associations, multilocus phylogenetic analyses have become the most important tool in accurately identifying and circumscribing species in the Diaporthe genus. However, the application of the genealogical concordance criterion has often been overlooked, ultimately leading to an exponential increase in novel Diaporthe spp. Due to the large number of species, many lineages remain poorly understood under the so-called species complexes. For this reason, a robust delimitation of the species boundaries in Diaporthe is still an ongoing challenge. Therefore, the present study aimed to resolve the species boundaries of the Diaporthe arecae species complex (DASC) by implementing an integrative taxonomic approach. The Genealogical Phylogenetic Species Recognition (GCPSR) principle revealed incongruences between the individual gene genealogies. Moreover, the Poisson Tree Processes' (PTPs) coalescent-based species delimitation models identified three well-delimited subclades represented by the species D. arecae, D. chiangmaiensis and D. smilacicola. These results evidence that all species previously described in the D. arecae subclade are conspecific, which is coherent with the morphological indistinctiveness observed and the absence of reproductive isolation and barriers to gene flow. Thus, 52 Diaporthe spp. are reduced to synonymy under D. arecae. Recent population expansion and the possibility of incomplete lineage sorting suggested that the D. arecae subclade may be considered as ongoing evolving lineages under active divergence and speciation. Hence, the genetic diversity and intraspecific variability of D. arecae in the context of current global climate change and the role of D. arecae as a pathogen on palm trees and other hosts are also discussed. This study illustrates that species in Diaporthe are highly overestimated, and highlights the relevance of applying an integrative taxonomic approach to accurately circumscribe the species boundaries in the genus Diaporthe.
Collapse
Affiliation(s)
- Diana S. Pereira
- Faculdade de Ciências, Biosystems and Integrative Sciences Institute (BioISI), Universidade de Lisboa, Campo Grande, 1749-016 Lisboa, Portugal;
| | - Sandra Hilário
- Interdisciplinary Centre of Marine and Environmental Research (CIIMAR), Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos s/n, 4450-208 Porto, Portugal;
- Faculty of Sciences, Biology Department, University of Porto, Rua do Campo Alegre, Edifício FC4, 4169-007 Porto, Portugal
| | - Micael F. M. Gonçalves
- Faculty of Sciences, Biology Department, University of Porto, Rua do Campo Alegre, Edifício FC4, 4169-007 Porto, Portugal
- Centre for Environmental and Marine Studies, Department of Biology, Campus Universitário de Santiago, University of Aveiro, 3810-193 Aveiro, Portugal
| | - Alan J. L. Phillips
- Faculdade de Ciências, Biosystems and Integrative Sciences Institute (BioISI), Universidade de Lisboa, Campo Grande, 1749-016 Lisboa, Portugal;
| |
Collapse
|
11
|
Portik DM, Streicher JW, Wiens JJ. Frog phylogeny: A time-calibrated, species-level tree based on hundreds of loci and 5,242 species. Mol Phylogenet Evol 2023; 188:107907. [PMID: 37633542 DOI: 10.1016/j.ympev.2023.107907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 08/15/2023] [Accepted: 08/15/2023] [Indexed: 08/28/2023]
Abstract
Large-scale, time-calibrated phylogenies from supermatrix studies have become crucial for evolutionary and ecological studies in many groups of organisms. However, in frogs (anuran amphibians), there is a serious problem with existing supermatrix estimates. Specifically, these trees are based on a limited number of loci (15 or fewer), and the higher-level relationships estimated are discordant with recent phylogenomic estimates based on much larger numbers of loci. Here, we attempted to rectify this problem by generating an expanded supermatrix and combining this with data from phylogenomic studies. To assist in aligning ribosomal sequences for this supermatrix, we developed a new program (TaxonomyAlign) to help perform taxonomy-guided alignments. The new combined matrix contained 5,242 anuran species with data from 307 markers, but with 95% missing data overall. This dataset represented a 71% increase in species sampled relative to the previous largest supermatrix analysis of anurans (adding 2,175 species). Maximum-likelihood analyses generated a tree in which higher-level relationships (and estimated clade ages) were generally concordant with those from phylogenomic analyses but were more discordant with the previous largest supermatrix analysis. We found few obvious problems arising from the extensive missing data in most species. We also generated a set of 100 time-calibrated trees for use in comparative analyses. Overall, we provide an improved estimate of anuran phylogeny based on the largest number of combined taxa and markers to date. More broadly, we demonstrate the potential to combine phylogenomic and supermatrix analyses in other groups of organisms.
Collapse
Affiliation(s)
- Daniel M Portik
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721 USA; California Academy of Sciences, San Francisco, CA 94118, USA
| | | | - John J Wiens
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721 USA.
| |
Collapse
|
12
|
McLay TGB, Fowler RM, Fahey PS, Murphy DJ, Udovicic F, Cantrill DJ, Bayly MJ. Phylogenomics reveals extreme gene tree discordance in a lineage of dominant trees: hybridization, introgression, and incomplete lineage sorting blur deep evolutionary relationships despite clear species groupings in Eucalyptus subgenus Eudesmia. Mol Phylogenet Evol 2023; 187:107869. [PMID: 37423562 DOI: 10.1016/j.ympev.2023.107869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 06/29/2023] [Accepted: 06/30/2023] [Indexed: 07/11/2023]
Abstract
Eucalypts are a large and ecologically important group of plants on the Australian continent, and understanding their evolution is important in understanding evolution of the unique Australian flora. Previous phylogenies using plastome DNA, nuclear-ribosomal DNA, or random genome-wide SNPs, have been confounded by limited genetic sampling or by idiosyncratic biological features of the eucalypts, including widespread plastome introgression. Here we present phylogenetic analyses of Eucalyptus subgenus Eudesmia (22 species from western, northern, central and eastern Australia), in the first study to apply a target-capture sequencing approach using custom, eucalypt-specific baits (of 568 genes) to a lineage of Eucalyptus. Multiple accessions of all species were included, and target-capture data were supplemented by separate analyses of plastome genes (average of 63 genes per sample). Analyses revealed a complex evolutionary history likely shaped by incomplete lineage sorting and hybridization. Gene tree discordance generally increased with phylogenetic depth. Species, or groups of species, toward the tips of the tree are mostly supported, and three major clades are identified, but the branching order of these clades cannot be confirmed with confidence. Multiple approaches to filtering the nuclear dataset, by removing genes or samples, could not reduce gene tree conflict or resolve these relationships. Despite inherent complexities in eucalypt evolution, the custom bait kit devised for this research will be a powerful tool for investigating the evolutionary history of eucalypts more broadly.
Collapse
Affiliation(s)
- Todd G B McLay
- Royal Botanic Gardens Victoria, Melbourne 3004, Vic, Australia; School of BioSciences, The University of Melbourne, Parkville 3010, Vic, Australia.
| | - Rachael M Fowler
- School of BioSciences, The University of Melbourne, Parkville 3010, Vic, Australia
| | - Patrick S Fahey
- Research Centre for Ecosystem Resilience, The Royal Botanic Garden Sydney, Sydney 2000, NSW, Australia; Qld Alliance for Agriculture and Food Innovation, The University of Queensland, St Lucia 4072, Qld, Australia
| | - Daniel J Murphy
- Royal Botanic Gardens Victoria, Melbourne 3004, Vic, Australia; School of BioSciences, The University of Melbourne, Parkville 3010, Vic, Australia
| | - Frank Udovicic
- Royal Botanic Gardens Victoria, Melbourne 3004, Vic, Australia
| | - David J Cantrill
- Royal Botanic Gardens Victoria, Melbourne 3004, Vic, Australia; School of BioSciences, The University of Melbourne, Parkville 3010, Vic, Australia
| | - Michael J Bayly
- School of BioSciences, The University of Melbourne, Parkville 3010, Vic, Australia
| |
Collapse
|
13
|
Simmons MP, Goloboff PA, Stöver BC, Springer MS, Gatesy J. Quantification of congruence among gene trees with polytomies using overall success of resolution for phylogenomic coalescent analyses. Cladistics 2023; 39:418-436. [PMID: 37096985 DOI: 10.1111/cla.12540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 02/22/2023] [Accepted: 03/24/2023] [Indexed: 04/26/2023] Open
Abstract
Gene-tree-inference error can cause species-tree-inference artefacts in summary phylogenomic coalescent analyses. Here we integrate two ways of accommodating these inference errors: collapsing arbitrarily or dubiously resolved gene-tree branches, and subsampling gene trees based on their pairwise congruence. We tested the effect of collapsing gene-tree branches with 0% approximate-likelihood-ratio-test (SH-like aLRT) support in likelihood analyses and strict consensus trees for parsimony, and then subsampled those partially resolved trees based on congruence measures that do not penalize polytomies. For this purpose we developed a new TNT script for congruence sorting (congsort), and used it to calculate topological incongruence for eight phylogenomic datasets using three distance measures: standard Robinson-Foulds (RF) distances; overall success of resolution (OSR), which is based on counting both matching and contradicting clades; and RF contradictions, which only counts contradictory clades. As expected, we found that gene-tree incongruence was often concentrated in clades that are arbitrarily or dubiously resolved and that there was greater congruence between the partially collapsed gene trees and the coalescent and concatenation topologies inferred from those genes. Coalescent branch lengths typically increased as the most incongruent gene trees were excluded, although branch supports typically did not. We investigated two successful and complementary approaches to prioritizing genes for investigation of alignment or homology errors. Coalescent-tree clades that contradicted concatenation-tree clades were generally less robust to gene-tree subsampling than congruent clades. Our preferred approach to collapsing likelihood gene-tree clades (0% SH-like aLRT support) and subsampling those trees (OSR) generally outperformed competing approaches for a large fungal dataset with respect to branch lengths, support and congruence. We recommend widespread application of this approach (and strict consensus trees for parsimony-based analyses) for improving quantification of gene-tree congruence/conflict, estimating coalescent branch lengths, testing robustness of coalescent analyses to gene-tree-estimation error, and improving topological robustness of summary coalescent analyses. This approach is quick and easy to implement, even for huge datasets.
Collapse
Affiliation(s)
- Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Pablo A Goloboff
- CONICET, INSUE, Fundación Miguel Lillo, Miguel Lillo 251, 4000, S.M. de Tucumán, Argentina
| | - Ben C Stöver
- Institute for Evolution and Biodiversity, WMU Münster, 48149, Münster, Germany
| | - Mark S Springer
- Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA, 92521, USA
| | - John Gatesy
- Division of Vertebrate Zoology, American Museum of Natural History, New York, NY, 10024, USA
| |
Collapse
|
14
|
Scheunert A, Lautenschlager U, Ott T, Oberprieler C. Nano-Strainer: A workflow for the identification of single-copy nuclear loci for plant systematic studies, using target capture kits and Oxford Nanopore long reads. Ecol Evol 2023; 13:e10190. [PMID: 37475726 PMCID: PMC10354226 DOI: 10.1002/ece3.10190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 05/18/2023] [Accepted: 06/01/2023] [Indexed: 07/22/2023] Open
Abstract
In modern plant systematics, target enrichment enables simultaneous analysis of hundreds of genes. However, when dealing with reticulate or polyploidization histories, few markers may suffice, but often are required to be single-copy, a condition that is not necessarily met with commercial capture kits. Also, large genome sizes can render target capture ineffective, so that amplicon sequencing would be preferable; however, knowledge about suitable loci is often missing. Here, we present a comprehensive workflow for the identification of putative single-copy nuclear markers in a genus of interest, by mining a small dataset from target capture using a few representative taxa. The proposed pipeline assesses sequence variability contained in the data from targeted loci and assigns reads to their respective genes, via a combined BLAST/clustering procedure. Cluster consensus sequences are then examined based on four pre-defined criteria presumably indicative for absence of paralogy. This is done by calculating four specialized indices; loci are ranked according to their performance in these indices, and top-scoring loci are considered putatively single- or low copy. The approach can be applied to any probe set. As it relies on long reads, the present contribution also provides template workflows for processing Nanopore-based target capture data. Obtained markers are further tested and then entered into amplicon sequencing. For the detection of possibly remaining paralogy in these data, which might occur in groups with rampant paralogy, we also employ the long-read assembly tool canu. In diploid representatives of the young Compositae genus Leucanthemum, characterized by high levels of polyploidy, our approach resulted in successful amplification of 13 loci. Modifications to remove traces of paralogy were made in seven of these. A species tree from the markers correctly reproduced main relationships in the genus, however, at low resolution. The presented workflow has the potential to valuably support phylogenetic research, for example in polyploid plant groups.
Collapse
Affiliation(s)
- Agnes Scheunert
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Ulrich Lautenschlager
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Tankred Ott
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Christoph Oberprieler
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| |
Collapse
|
15
|
Pacheco MA, Escalante AA. Origin and diversity of malaria parasites and other Haemosporida. Trends Parasitol 2023; 39:501-516. [PMID: 37202254 DOI: 10.1016/j.pt.2023.04.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2023] [Revised: 04/17/2023] [Accepted: 04/23/2023] [Indexed: 05/20/2023]
Abstract
Symbionts, including parasites, are ubiquitous in all world ecosystems. Understanding the diversity of symbiont species addresses diverse questions, from the origin of infectious diseases to inferring processes shaping regional biotas. Here, we review the current approaches to studying Haemosporida's species diversity and evolutionary history. Despite the solid knowledge of species linked to diseases, such as the agents of human malaria, studies on haemosporidian phylogeny, diversity, ecology, and evolution are still limited. The available data, however, indicate that Haemosporida is an extraordinarily diverse and cosmopolitan clade of symbionts. Furthermore, this clade seems to have originated with their vertebrate hosts, particularly birds, as part of complex community level processes that we are still characterizing.
Collapse
Affiliation(s)
- M Andreína Pacheco
- Biology Department/Institute of Genomics and Evolutionary Medicine (iGEM), Temple University, Philadelphia, PA 19122-1801, USA.
| | - Ananias A Escalante
- Biology Department/Institute of Genomics and Evolutionary Medicine (iGEM), Temple University, Philadelphia, PA 19122-1801, USA.
| |
Collapse
|
16
|
Liu L, Yu L, Wu S, Arnold J, Whalen C, Davis C, Edwards S. Short branch attraction in phylogenomic inference under the multispecies coalescent. Front Ecol Evol 2023; 11:1134764. [PMID: 39233780 PMCID: PMC11372852 DOI: 10.3389/fevo.2023.1134764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2024] Open
Abstract
Accurate reconstruction of species trees often relies on the quality of input gene trees estimated from molecular sequences. Previous studies suggested that if the sequence length is fixed, the maximum likelihood may produce biased gene trees which subsequently mislead inference of species trees. Two key questions need to be answered in this context: what are the scenarios that may result in consistently biased gene trees? and for those scenarios, are there any remedies that may remove or at least reduce the misleading effects of consistently biased gene trees? In this article, we establish a theoretical framework to address these questions. Considering a scenario where the true gene tree is a 4-taxon star treeT * = S 1 , S 2 , S 3 , S 4 with two short branches leading to the speciesS 1 andS 2 , we demonstrate that maximum likelihood significantly favors the wrong bifurcating treeS 1 , S 2 , S 3 , S 4 grouping the two speciesS 1 andS 2 with short branches. We name this inconsistent behavior short branch attraction, which may occur in real-world data involving a 4-taxon bifurcating gene tree with a short internal branch. If no mutation occurs along the internal branch, which is likely if the internal branch is short, the 4-taxon bifurcating tree is equivalent to the 4-taxon star tree and thus will suffer the same misleading effect of short branch attraction. Theoretical and simulation results further demonstrate that short branch attraction may occur in gene trees and species trees of arbitrary size. Moreover, short branch attraction is primarily caused by a lack of phylogenetic information in sequence data, suggesting that converting short internal branches to polytomies in the estimated gene trees can significantly reduce artifacts induced by short branch attraction.
Collapse
Affiliation(s)
- Liang Liu
- Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA, United States
| | - Lili Yu
- Department of Biostatistics, Georgia Southern University, Statesboro, GA, United States
| | - Shaoyuan Wu
- Jiangsu Key Laboratory of Phylogenomics and Comparative Genomics, Jiangsu International Joint Center of Genomics, School of Life Sciences, Jiangsu Normal University, Xuzhou, Jiangsu, China
| | - Jonathan Arnold
- Department of Genetics, University of Georgia, Athens, GA, United States
| | - Christopher Whalen
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA, United States
| | - Charles Davis
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, United States
| | - Scott Edwards
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, United States
| |
Collapse
|
17
|
Jiao B, Chen C, Wei M, Niu G, Zheng J, Zhang G, Shen J, Vitales D, Vallès J, Verloove F, Erst AS, Soejima A, Mehregan I, Kokubugata G, Chung GY, Ge X, Gao L, Yuan Y, Joly C, Jabbour F, Wang W, Shultz LM, Gao T. Phylogenomics and morphological evolution of the mega-diverse genus Artemisia (Asteraceae: Anthemideae): implications for its circumscription and infrageneric taxonomy. ANNALS OF BOTANY 2023; 131:867-883. [PMID: 36976653 PMCID: PMC10184459 DOI: 10.1093/aob/mcad051] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 03/24/2023] [Indexed: 05/16/2023]
Abstract
BACKGROUND AND AIMS Artemisia is a mega-diverse genus consisting of ~400 species. Despite its medicinal importance and ecological significance, a well-resolved phylogeny for global Artemisia, a natural generic delimitation and infrageneric taxonomy remain missing, owing to the obstructions from limited taxon sampling and insufficient information on DNA markers. Its morphological characters, such as capitulum, life form and leaf, show marked variations and are widely used in its infrageneric taxonomy. However, their evolution within Artemisia is poorly understood. Here, we aimed to reconstruct a well-resolved phylogeny for global Artemisia via a phylogenomic approach, to infer the evolutionary patterns of its key morphological characters and to update its circumscription and infrageneric taxonomy. METHODS We sampled 228 species (258 samples) of Artemisia and its allies from both fresh and herbarium collections, covering all the subgenera and its main geographical areas, and conducted a phylogenomic analysis based on nuclear single nucleotide polymorphisms (SNPs) obtained from genome skimming data. Based on the phylogenetic framework, we inferred the possible evolutionary patterns of six key morphological characters widely used in its previous taxonomy. KEY RESULTS The genus Kaschgaria was revealed to be nested in Artemisia with strong support. A well-resolved phylogeny of Artemisia consisting of eight highly supported clades was recovered, two of which were identified for the first time. Most of the previously recognized subgenera were not supported as monophyletic. Evolutionary inferences based on the six morphological characters showed that different states of these characters originated independently more than once. CONCLUSIONS The circumscription of Artemisia is enlarged to include the genus Kaschgaria. The morphological characters traditionally used for the infrageneric taxonomy of Artemisia do not match the new phylogenetic tree. They experienced a more complex evolutionary history than previously thought. We propose a revised infrageneric taxonomy of the newly circumscribed Artemisia, with eight recognized subgenera to accommodate the new results.
Collapse
Affiliation(s)
- Bohan Jiao
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chen Chen
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Meng Wei
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guohao Niu
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiye Zheng
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guojin Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jiahao Shen
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences (Nanjing Botanical Garden Mem. Sun Yat-Sen), Nanjing 210014, China
| | - Daniel Vitales
- Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Pg. del Migdia, s.n., 08038 Barcelona, Spain
| | - Joan Vallès
- Laboratori de Botànica – Unitat associada al CSIC, Facultat de Farmàcia i Ciències de l'Alimentació -Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Av. Joan XXIII 27-31, 08028 Barcelona, Catalonia, Spain
| | - Filip Verloove
- Meise Botanic Garden, Nieuwelaan 38, B-1860 Meise, Belgium
| | - Andrey S Erst
- Laboratory Herbarium (NS), Central Siberian Botanical Garden, Russian Academy of Sciences Russia, Novosibirsk, 630090, Zolotodolinskaya st. 101, Russia
- Tomsk State University, Laboratoryof Systematics and Phylogeny of Plants (TK), Tomsk 634050, Russia
| | - Akiko Soejima
- Faculty of Advanced Science and Technology, Kumamoto University, 2-39-1 Kurokami, Chuo-ku, Kumamoto, 860-8555, Japan
| | - Iraj Mehregan
- Laboratory for Plant Molecular Phylogeny and Systematics, Department of Biology, Science and Research Branch, Azad University, Tehran, Iran
| | - Goro Kokubugata
- Department of Botany, National Museum of Nature and Science, Amakubo 4-1-1, Tsukuba, Ibaraki 305-0005, Japan
| | - Gyu-Young Chung
- Department of Forest Science, Andong National University, 1375 Gyeongdong-ro Andong, Gyeongsangbuk-do, 36729, Republic of Korea
| | - Xuejun Ge
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou 510650, China
| | - Lianming Gao
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, Yunnan 650201, China
- Lijiang National Forest Biodiversity National Observation and Research Station, Kunming Institute of Botany, Chinese Academy of Sciences, Lijiang, Yunnan 67410, China
| | - Yuan Yuan
- National Resource Center for Chinese Meteria Medica, Chinese Academy of Chinese Medical Sciences, 100700, Beijing, China
| | - Cyprien Joly
- Institut de Systématique Evolution Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 57 rue Cuvier CP39, 75005 Paris, France
| | - Florian Jabbour
- Institut de Systématique Evolution Biodiversité (ISYEB), Muséum national d’Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, 57 rue Cuvier CP39, 75005 Paris, France
| | - Wei Wang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Leila M Shultz
- Department of Wildland Resources, Utah State University, Logan, UT 84322-5230, USA
| | - Tiangang Gao
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China
- China National Botanical Garden, Beijing 100093, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
18
|
Bartoš O, Bohlen J, Šlechtová VB, Kočí J, Röslein J, Janko K. Sequence capture: Obsolete or irreplaceable? A thorough validation across phylogenetic distances and its applicability to hybrids and allopolyploids. Mol Ecol Resour 2023. [PMID: 37122140 DOI: 10.1111/1755-0998.13806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 04/05/2023] [Accepted: 04/12/2023] [Indexed: 05/02/2023]
Abstract
As whole-genome sequencing has become pervasive, some have suggested that reduced genomic representation approaches, for example, sequence capture, are becoming obsolete. In the present study, we argue that these techniques still provide excellent tools in terms of price and quality of data as well as in their ability to provide markers with specific features, as required, for example, in phylogenomics. A potential drawback of the wide-scale application of reduced representation approaches could be their drop in efficiency with increasing phylogenetic distance from the reference species. While some studies have focused on the degree and performance of reduced representation techniques in such situations, to our knowledge, none of them evaluated their applicability to inter-specific hybrids and polyploids. This highlights a significant gap in current knowledge since there is increasing evidence for the frequent occurrence of natural hybrids and polyploids, as well as for the major importance of both phenomena in evolution. The main aim of the present study was to carry out a thorough validation of SEQcap applicability to (1) a set of non-model taxa with a wide range of phylogenetic relatedness and (2) inter-specific hybrids of various ploidies and genomic compositions. Considering the latter point, we especially focused on mechanisms causing allelic bias and consequent allelic dropout, as these could have confounding effects with respect to the evolutionary genomic dynamics of hybrids, especially in asexuals, which virtually reproduce as a frozen F1 generation.
Collapse
Affiliation(s)
- Oldřich Bartoš
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Jörg Bohlen
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Vendula Bohlen Šlechtová
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Jan Kočí
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Jan Röslein
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Karel Janko
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| |
Collapse
|
19
|
Brandt S, Sole C, Lyle R. The phylogenetic structure and coalescent species delimitation of an endemic trapdoor spider genus, Stasimopus (Araneae, Mygalomorphae, Stasimopidae) in the Karoo region of South Africa. Mol Phylogenet Evol 2023; 184:107798. [PMID: 37094612 DOI: 10.1016/j.ympev.2023.107798] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Revised: 03/30/2023] [Accepted: 04/18/2023] [Indexed: 04/26/2023]
Abstract
The Karoo region of South Africa is a unique and sensitive ecosystem which is facing pressure for development due to economic incentives such as mining, farming and shale gas exploration. The species diversity of many taxa in the area is largely unknown. A phylogenetic analysis of the cork-lid trapdoor spider genus, Stasimopus (Stasimopidae) was undertaken in order to gain insight into the relationships between the species that may be present in the area. The species within Stasimopus are challenging to identify and define using traditional morphological methods due to a high degree of morphological conservatism within the genus. For this reason, multiple coalescent based species delimitation methods were used to attempt to determine the species present for Stasimopus in the region which was tested against the morphological identifications and genetic clades (based on CO1, 16S and EF-1ɣ). We tested single-locus methods Automatic Barcode Gap Discovery (ABGD), Bayesian implementation of Poisson Tree Processes (bPTP) and General Mixed Yule- Coalescent (GMYC), as well as multi-locus Brownie. The phylogenetic analysis of Stasimopus in the Karoo showed that there is a high degree of genetic diversity within the genus. The species delimitation results proved unfruitful for the genus, as they appear to delimit population structure rather than species for most methods. Alternative methods should be investigated to aid in the identification of the species in order truly understand the species diversity of the genus.
Collapse
Affiliation(s)
- Shannon Brandt
- Department of Zoology and Entomology, University of Pretoria, Hatfield, Pretoria, South Africa.
| | - Catherine Sole
- Department of Zoology and Entomology, University of Pretoria, Hatfield, Pretoria, South Africa.
| | - Robin Lyle
- Agricultural Research Council - Plant Health and Protection, Biosystematics, Roodeplaat, Pretoria, South Africa
| |
Collapse
|
20
|
Moravec JC, Lanfear R, Spector DL, Diermeier SD, Gavryushkin A. Testing for Phylogenetic Signal in Single-Cell RNA-Seq Data. J Comput Biol 2023; 30:518-537. [PMID: 36475926 PMCID: PMC10125402 DOI: 10.1089/cmb.2022.0357] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Phylogenetic methods are emerging as a useful tool to understand cancer evolutionary dynamics, including tumor structure, heterogeneity, and progression. Most currently used approaches utilize either bulk whole genome sequencing or single-cell DNA sequencing and are based on calling copy number alterations and single nucleotide variants (SNVs). Single-cell RNA sequencing (scRNA-seq) is commonly applied to explore differential gene expression of cancer cells throughout tumor progression. The method exacerbates the single-cell sequencing problem of low yield per cell with uneven expression levels. This accounts for low and uneven sequencing coverage and makes SNV detection and phylogenetic analysis challenging. In this article, we demonstrate for the first time that scRNA-seq data contain sufficient evolutionary signal and can also be utilized in phylogenetic analyses. We explore and compare results of such analyses based on both expression levels and SNVs called from scRNA-seq data. Both techniques are shown to be useful for reconstructing phylogenetic relationships between cells, reflecting the clonal composition of a tumor. Both standardized expression values and SNVs appear to be equally capable of reconstructing a similar pattern of phylogenetic relationship. This pattern is stable even when phylogenetic uncertainty is taken in account. Our results open up a new direction of somatic phylogenetics based on scRNA-seq data. Further research is required to refine and improve these approaches to capture the full picture of somatic evolutionary dynamics in cancer.
Collapse
Affiliation(s)
- Jiří C. Moravec
- Department of Computer Science, University of Otago, Dunedin, New Zealand
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Robert Lanfear
- Division of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australia
| | | | | | - Alex Gavryushkin
- School of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| |
Collapse
|
21
|
Chambers EA, Tarvin RD, Santos JC, Ron SR, Betancourth-Cundar M, Hillis DM, Matz MV, Cannatella DC. 2b or not 2b? 2bRAD is an effective alternative to ddRAD for phylogenomics. Ecol Evol 2023; 13:e9842. [PMID: 36911313 PMCID: PMC9994478 DOI: 10.1002/ece3.9842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 03/10/2023] Open
Abstract
Restriction-site-associated DNA sequencing (RADseq) has become an accessible way to obtain genome-wide data in the form of single-nucleotide polymorphisms (SNPs) for phylogenetic inference. Nonetheless, how differences in RADseq methods influence phylogenetic estimation is poorly understood because most comparisons have largely relied on conceptual predictions rather than empirical tests. We examine how differences in ddRAD and 2bRAD data influence phylogenetic estimation in two non-model frog groups. We compare the impact of method choice on phylogenetic information, missing data, and allelic dropout, considering different sequencing depths. Given that researchers must balance input (funding, time) with output (amount and quality of data), we also provide comparisons of laboratory effort, computational time, monetary costs, and the repeatability of library preparation and sequencing. Both 2bRAD and ddRAD methods estimated well-supported trees, even at low sequencing depths, and had comparable amounts of missing data, patterns of allelic dropout, and phylogenetic signal. Compared to ddRAD, 2bRAD produced more repeatable datasets, had simpler laboratory protocols, and had an overall faster bioinformatics assembly. However, many fewer parsimony-informative sites per SNP were obtained from 2bRAD data when using native pipelines, highlighting a need for further investigation into the effects of each pipeline on resulting datasets. Our study underscores the importance of comparing RADseq methods, such as expected results and theoretical performance using empirical datasets, before undertaking costly experiments.
Collapse
Affiliation(s)
- E Anne Chambers
- Department of Integrative Biology and Biodiversity Center University of Texas at Austin Austin Texas USA.,Department of Environmental Science, Policy, and Management and Museum of Vertebrate Zoology University of California Berkeley Berkeley California USA
| | - Rebecca D Tarvin
- Department of Integrative Biology and Biodiversity Center University of Texas at Austin Austin Texas USA.,Department of Integrative Biology and Museum of Vertebrate Zoology University of California Berkeley Berkeley California USA
| | - Juan C Santos
- Department of Biological Sciences St John's University New York New York USA
| | - Santiago R Ron
- Museo de Zoología, Escuela de Ciencias Biológicas Pontificia Universidad Católica del Ecuador Quito Ecuador
| | | | - David M Hillis
- Department of Integrative Biology and Biodiversity Center University of Texas at Austin Austin Texas USA
| | - Mikhail V Matz
- Department of Integrative Biology and Biodiversity Center University of Texas at Austin Austin Texas USA
| | - David C Cannatella
- Department of Integrative Biology and Biodiversity Center University of Texas at Austin Austin Texas USA
| |
Collapse
|
22
|
Next-generation sequencing data show rapid radiation and several long-distance dispersal events in early Costaceae. Mol Phylogenet Evol 2023; 179:107664. [PMID: 36403710 DOI: 10.1016/j.ympev.2022.107664] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 02/12/2022] [Accepted: 11/02/2022] [Indexed: 11/18/2022]
Abstract
The monocot family Costaceae Nakai consists of seven genera but their mutual relationships have not been satisfactorily resolved in previous studies employing classical molecular markers. Phylogenomic analyses of 365 nuclear genes and nearly-complete plastome data provide almost fully resolved insights into their diversification. Paracostus is identified as sister to all other taxa, followed by several very short branches leading to discrete lineages, suggesting an ancient rapid radiation of these early lineages and leaving the exact relationships among them unresolved. Relationships among Chamaecostus, Dimerocostus and Monocostus confirmed earlier findings that these genera form a monophyletic group. The Afro-American Costus is also monophyletic. By contrast, Tapeinochilos appeared as a well-supported crown lineage of Cheilocostus rendering it paraphyletic. As these two genera differ morphologically from one another owing to a shift from insect- to bird-pollination, we propose to keep both names. The divergence time within Costaceae was estimated using penalized likelihood utilizing two fossils within Zingiberales, †Spirematospermum chandlerae and †Ensete oregonense, indicated a relatively recent diversification of Costaceae, between 18 and 9 Mya. Based on these data, the current pantropical distribution of the family is hypothesized to be the result of several long-distance intercontinental dispersal events, which do not correlate with global geoclimatic changes.
Collapse
|
23
|
Lozano-Fernandez J. A Practical Guide to Design and Assess a Phylogenomic Study. Genome Biol Evol 2022; 14:evac129. [PMID: 35946263 PMCID: PMC9452790 DOI: 10.1093/gbe/evac129] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/03/2022] [Indexed: 11/13/2022] Open
Abstract
Over the last decade, molecular systematics has undergone a change of paradigm as high-throughput sequencing now makes it possible to reconstruct evolutionary relationships using genome-scale datasets. The advent of "big data" molecular phylogenetics provided a battery of new tools for biologists but simultaneously brought new methodological challenges. The increase in analytical complexity comes at the price of highly specific training in computational biology and molecular phylogenetics, resulting very often in a polarized accumulation of knowledge (technical on one side and biological on the other). Interpreting the robustness of genome-scale phylogenetic studies is not straightforward, particularly as new methodological developments have consistently shown that the general belief of "more genes, more robustness" often does not apply, and because there is a range of systematic errors that plague phylogenomic investigations. This is particularly problematic because phylogenomic studies are highly heterogeneous in their methodology, and best practices are often not clearly defined. The main aim of this article is to present what I consider as the ten most important points to take into consideration when planning a well-thought-out phylogenomic study and while evaluating the quality of published papers. The goal is to provide a practical step-by-step guide that can be easily followed by nonexperts and phylogenomic novices in order to assess the technical robustness of phylogenomic studies or improve the experimental design of a project.
Collapse
Affiliation(s)
- Jesus Lozano-Fernandez
- Department of Genetics, Microbiology and Statistics, Biodiversity Research Institute (IRBio), University of Barcelona, Avd. Diagonal 643, 08028 Barcelona, Spain
- Institute of Evolutionary Biology (CSIC – Universitat Pompeu Fabra), Passeig marítim de la Barcelona 37-49, 08003 Barcelona, Spain
| |
Collapse
|
24
|
Mahbub S, Sawmya S, Saha A, Reaz R, Rahman MS, Bayzid MS. Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data. JOURNAL OF COMPUTATIONAL BIOLOGY : A JOURNAL OF COMPUTATIONAL MOLECULAR CELL BIOLOGY 2022; 29:1156-1172. [PMID: 36048555 DOI: 10.1089/cmb.2022.0212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Species tree estimation is frequently based on phylogenomic approaches that use multiple genes from throughout the genome. However, for a combination of reasons (ranging from sampling biases to more biological causes, as in gene birth and loss), gene trees are often incomplete, meaning that not all species of interest have a common set of genes. Incomplete gene trees can potentially impact the accuracy of phylogenomic inference. We, for the first time, introduce the problem of imputing the quartet distribution induced by a set of incomplete gene trees, which involves adding the missing quartets back to the quartet distribution. We present Quartet based Gene tree Imputation using Deep Learning (QT-GILD), an automated and specially tailored unsupervised deep learning technique, accompanied by cues from natural language processing, which learns the quartet distribution in a given set of incomplete gene trees and generates a complete set of quartets accordingly. QT-GILD is a general-purpose technique needing no explicit modeling of the subject system or reasons for missing data or gene tree heterogeneity. Experimental studies on a collection of simulated and empirical datasets suggest that QT-GILD can effectively impute the quartet distribution, which results in a dramatic improvement in the species tree accuracy. Remarkably, QT-GILD not only imputes the missing quartets but can also account for gene tree estimation error. Therefore, QT-GILD advances the state-of-the-art in species tree estimation from gene trees in the face of missing data.
Collapse
Affiliation(s)
- Sazan Mahbub
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh.,Department of Computer Science, University of Maryland, College Park, Maryland, USA
| | - Shashata Sawmya
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Arpita Saha
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Rezwana Reaz
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - M Sohel Rahman
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| | - Md Shamsuzzoha Bayzid
- Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, Bangladesh
| |
Collapse
|
25
|
Doyle JJ. Cell types as species: Exploring a metaphor. FRONTIERS IN PLANT SCIENCE 2022; 13:868565. [PMID: 36072310 PMCID: PMC9444152 DOI: 10.3389/fpls.2022.868565] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 07/29/2022] [Indexed: 06/05/2023]
Abstract
The concept of "cell type," though fundamental to cell biology, is controversial. Cells have historically been classified into types based on morphology, physiology, or location. More recently, single cell transcriptomic studies have revealed fine-scale differences among cells with similar gross phenotypes. Transcriptomic snapshots of cells at various stages of differentiation, and of cells under different physiological conditions, have shown that in many cases variation is more continuous than discrete, raising questions about the relationship between cell type and cell state. Some researchers have rejected the notion of fixed types altogether. Throughout the history of discussions on cell type, cell biologists have compared the problem of defining cell type with the interminable and often contentious debate over the definition of arguably the most important concept in systematics and evolutionary biology, "species." In the last decades, systematics, like cell biology, has been transformed by the increasing availability of molecular data, and the fine-grained resolution of genetic relationships have generated new ideas about how that variation should be classified. There are numerous parallels between the two fields that make exploration of the "cell types as species" metaphor timely. These parallels begin with philosophy, with discussion of both cell types and species as being either individuals, groups, or something in between (e.g., homeostatic property clusters). In each field there are various different types of lineages that form trees or networks that can (and in some cases do) provide criteria for grouping. Developing and refining models for evolutionary divergence of species and for cell type differentiation are parallel goals of the two fields. The goal of this essay is to highlight such parallels with the hope of inspiring biologists in both fields to look for new solutions to similar problems outside of their own field.
Collapse
Affiliation(s)
- Jeff J. Doyle
- Section of Plant Biology and Section of Plant Breeding and Genetics, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| |
Collapse
|
26
|
Astudillo-Clavijo V, Stiassny MLJ, Ilves KL, Musilova Z, Salzburger W, López-Fernández H. Exon-based phylogenomics and the relationships of African cichlid fishes: tackling the challenges of reconstructing phylogenies with repeated rapid radiations. Syst Biol 2022; 72:134-149. [PMID: 35880863 DOI: 10.1093/sysbio/syac051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Revised: 07/06/2022] [Accepted: 07/19/2022] [Indexed: 11/13/2022] Open
Abstract
African cichlids (subfamily: Pseudocrenilabrinae) are among the most diverse vertebrates, and their propensity for repeated rapid radiation has made them a celebrated model system in evolutionary research. Nonetheless, despite numerous studies, phylogenetic uncertainty persists, and riverine lineages remain comparatively underrepresented in higher-level phylogenetic studies. Heterogeneous gene histories resulting from incomplete lineage sorting (ILS) and hybridization are likely sources of uncertainty, especially during episodes of rapid speciation. We investigate relationships of Pseudocrenilabrinae and its close relatives while accounting for multiple sources of genetic discordance using species tree and hybrid network analyses with hundreds of single-copy exons. We improve sequence recovery for distant relatives, thereby extending the taxonomic reach of our probes, with a hybrid reference guided/de novo assembly approach. Our analyses provide robust hypotheses for most higher-level relationships and reveal widespread gene heterogeneity, including in riverine taxa. ILS and past hybridization are identified as sources of genetic discordance in different lineages. Sampling of various Blenniiformes (formerly Ovalentaria) adds strong phylogenomic support for convict blennies (Pholidichthyidae) as sister to Cichlidae, and points to other potentially useful protein-coding markers across the order. A reliable phylogeny with representatives from diverse environments will support ongoing taxonomic and comparative evolutionary research in the cichlid model system.
Collapse
Affiliation(s)
- Viviana Astudillo-Clavijo
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada.,Department of Natural History, Royal Ontario Museum, Toronto, M5S 2C6, Canada.,Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, 48109, USA
| | - Melanie L J Stiassny
- Department of Ichthyology, American Museum of Natural History, New York, 10024-5102, USA
| | - Katriina L Ilves
- Research & Collections, Zoology, Canadian Museum of Nature, Ottawa, K1P 6P4, Canada
| | - Zuzana Musilova
- Department of Zoology, Charles University in Prague, Vinicna 7, Prague, CZ-128 44, Czech Republic
| | - Walter Salzburger
- Zoological Institute, University of Basel, Vesalgasse 1, CH-4051, Basel, Switzerland
| | - Hernán López-Fernández
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, M5S 3B2, Canada.,Department of Natural History, Royal Ontario Museum, Toronto, M5S 2C6, Canada.,Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, 48109, USA
| |
Collapse
|
27
|
Out of chaos: Phylogenomics of Asian Sonerileae. Mol Phylogenet Evol 2022; 175:107581. [PMID: 35810973 DOI: 10.1016/j.ympev.2022.107581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 05/23/2022] [Accepted: 05/26/2022] [Indexed: 11/22/2022]
Abstract
Sonerileae is a diverse Melastomataceae lineage comprising ca. 1000 species in 44 genera, with >70% of genera and species distributed in Asia. Asian Sonerileae are taxonomically intractable with obscure generic circumscriptions. The backbone phylogeny of this group remains poorly resolved, possibly due to complexity caused by rapid species radiation in early and middle Miocene, which hampers further systematic study. Here, we used genome resequencing data to reconstruct the phylogeny of Asian Sonerileae. Three parallel datasets, viz. single-copy ortholog (SCO), genomic SNPs, and whole plastome, were assembled from genome resequencing data of 205 species for this purpose. Based on these genome-scale data, we provided the first well resolved phylogeny of Asian Sonerileae, with 34 major clades identified and 74% of the interclade relationships consistently resolved by both SCO and genomic data. Meanwhile, widespread phylogenetic discordance was detected among SCO gene trees as well as species trees reconstructed using different tree estimation methods (concatenation/site-based coalescent method/summary method) or different datasets (SCO/genomic/plastome). We explored sources of discordance using multiple approaches and found that the observed discordance in Asian Sonerileae was mainly caused by a combination of biased distribution of missing data, random noise from uninformative genes, incomplete lineage sorting, and hybridization/introgression. Exploration of these sources can enable us to generate hypotheses for future testing, which is the first step towards understanding the evolution of Asian Sonerileae. We also detected high levels of homoplasy for some characters traditionally used in taxonomy, which explains current chaotic generic delimitations. The backbone phylogeny of Asian Sonerileae revealed in this study offers a solid basis for future taxonomic revision at the generic level.
Collapse
|
28
|
Hatami E, Jones KE, Kilian N. New Insights Into the Relationships Within Subtribe Scorzonerinae (Cichorieae, Asteraceae) Using Hybrid Capture Phylogenomics (Hyb-Seq). FRONTIERS IN PLANT SCIENCE 2022; 13:851716. [PMID: 35873957 PMCID: PMC9298463 DOI: 10.3389/fpls.2022.851716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]
Abstract
Subtribe Scorzonerinae (Cichorieae, Asteraceae) contains 12 main lineages and approximately 300 species. Relationships within the subtribe, either at inter- or intrageneric levels, were largely unresolved in phylogenetic studies to date, due to the lack of phylogenetic signal provided by traditional Sanger sequencing markers. In this study, we employed a phylogenomics approach (Hyb-Seq) that targets 1,061 nuclear-conserved ortholog loci designed for Asteraceae and obtained chloroplast coding regions as a by-product of off-target reads. Our objectives were to evaluate the potential of the Hyb-Seq approach in resolving the phylogenetic relationships across the subtribe at deep and shallow nodes, investigate the relationships of major lineages at inter- and intrageneric levels, and examine the impact of the different datasets and approaches on the robustness of phylogenetic inferences. We analyzed three nuclear datasets: exon only, excluding all potentially paralogous loci; exon only, including loci that were only potentially paralogous in 1-3 samples; exon plus intron regions (supercontigs); and the plastome CDS region. Phylogenetic relationships were reconstructed using both multispecies coalescent and concatenation (Maximum Likelihood and Bayesian analyses) approaches. Overall, our phylogenetic reconstructions recovered the same monophyletic major lineages found in previous studies and were successful in fully resolving the backbone phylogeny of the subtribe, while the internal resolution of the lineages was comparatively poor. The backbone topologies were largely congruent among all inferences, but some incongruent relationships were recovered between nuclear and plastome datasets, which are discussed and assumed to represent cases of cytonuclear discordance. Considering the newly resolved phylogenies, a new infrageneric classification of Scorzonera in its revised circumscription is proposed.
Collapse
Affiliation(s)
- Elham Hatami
- Department of Biology, Faculty of Science, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Katy E. Jones
- Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Berlin, Germany
| | - Norbert Kilian
- Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
29
|
Lin X, Liu Z, Yan L, Duan X, Bu W, Wang X, Zheng C. Mitogenomes provide new insights of evolutionary history of Boreheptagyiini and Diamesini (Diptera: Chironomidae: Diamesinae). Ecol Evol 2022; 12:e8957. [PMID: 35646319 PMCID: PMC9130564 DOI: 10.1002/ece3.8957] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Revised: 04/16/2022] [Accepted: 05/06/2022] [Indexed: 01/11/2023] Open
Abstract
Mitogenomes have been widely used for phylogenetic reconstruction of various Dipteran groups, but specifically for chironomid, they have not been carried out to resolve the relationships. Diamesinae (Diptera: Chironomidae) are important bioindicators for freshwater ecosystem monitoring, but its evolutionary history remains uncertain for lack of information. Here, coupled with one previously published and 30 new mitogenomes of Diamesinae, we carried out comparative mitogenomic analysis and phylogenetic analysis. Mitogenomes of Diamesinae were conserved in structure, and all genes arranged in the same order as the ancestral insect mitogenome. All protein-coding genes in Diamesinae were under stronger purifying selection than those of other nonbiting midge species, which may exhibit signs of adaptation to life at cold living conditions. Phylogenetic analyses strongly supported the monophyly of Diamesinae, with Boreheptagyiini deeply nested within Diamesini. In addition, phylogenetic relationship of selected six genera was resolved, except Sympotthastia remained unstable. Our study revealed that the mitogenomes of Diamesinae are highly conserved, and they are practically useful for phylogenetic inference.
Collapse
Affiliation(s)
- Xiao‐Long Lin
- College of Life SciencesNankai UniversityTianjinChina
| | - Zheng Liu
- Geological Museum of ChinaBeijingChina
| | - Li‐Ping Yan
- School of Ecology and Nature ConservationBeijing Forestry UniversityBeijingChina
| | - Xin Duan
- Department of Plant ProtectionCollege of Horticulture and LandscapeTianjin Agricultural UniversityTianjinChina
| | - Wen‐Jun Bu
- College of Life SciencesNankai UniversityTianjinChina
| | - Xin‐Hua Wang
- College of Life SciencesNankai UniversityTianjinChina
| | | |
Collapse
|
30
|
Bertola LD, Vermaat M, Lesilau F, Chege M, Tumenta PN, Sogbohossou EA, Schaap OD, Bauer H, Patterson BD, White PA, de Iongh HH, Laros JFJ, Vrieling K. Whole genome sequencing and the application of a SNP panel reveal primary evolutionary lineages and genomic variation in the lion (Panthera leo). BMC Genomics 2022; 23:321. [PMID: 35459090 PMCID: PMC9027350 DOI: 10.1186/s12864-022-08510-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 03/29/2022] [Indexed: 11/23/2022] Open
Abstract
Background Previous phylogeographic studies of the lion (Panthera leo) have improved our insight into the distribution of genetic variation, as well as a revised taxonomy which now recognizes a northern (Panthera leo leo) and a southern (Panthera leo melanochaita) subspecies. However, existing whole range phylogeographic studies on lions either consist of very limited numbers of samples, or are focused on mitochondrial DNA and/or a limited set of microsatellites. The geographic extent of genetic lineages and their phylogenetic relationships remain uncertain, clouded by massive sampling gaps, sex-biased dispersal and incomplete lineage sorting. Results In this study we present results of low depth whole genome sequencing and subsequent variant calling in ten lions sampled throughout the geographic range, resulting in the discovery of >150,000 Single Nucleotide Polymorphisms (SNPs). Phylogenetic analyses revealed the same basal split between northern and southern populations, as well as four population clusters on a more local scale. Further, we designed a SNP panel, including 125 autosomal and 14 mitochondrial SNPs, which was tested on >200 lions from across their range. Results allow us to assign individuals to one of these four major clades (West & Central Africa, India, East Africa, or Southern Africa) and delineate these clades in more detail. Conclusions The results presented here, particularly the validated SNP panel, have important applications, not only for studying populations on a local geographic scale, but also for tracing samples of unknown origin for forensic purposes, and for guiding conservation management of ex situ populations. Thus, these genomic resources not only contribute to our understanding of the evolutionary history of the lion, but may also play a crucial role in conservation efforts aimed at protecting the species in its full diversity. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08510-y.
Collapse
Affiliation(s)
- L D Bertola
- City University of New York, City College of New York, 160 Convent Avenue, New York, NY, 10031, USA. .,Institute of Environmental Sciences (CML), Leiden University, PO Box 9518, 2300 RA, Leiden, The Netherlands. .,Institute of Biology Leiden (IBL), Leiden University, PO Box 9505, 2300 RA, Leiden, The Netherlands.
| | - M Vermaat
- Department of Human Genetics, Leiden University Medical Center, 2300 RC, Leiden, The Netherlands.,Leiden Genome Technology Center, Leiden University Medical Center, 2300 RC, Leiden, The Netherlands
| | - F Lesilau
- Institute of Environmental Sciences (CML), Leiden University, PO Box 9518, 2300 RA, Leiden, The Netherlands.,Kenya Wildlife Service, Nairobi, Kenya
| | - M Chege
- Institute of Environmental Sciences (CML), Leiden University, PO Box 9518, 2300 RA, Leiden, The Netherlands.,Kenya Wildlife Service, Nairobi, Kenya
| | - P N Tumenta
- Centre for Environment and Developmental Studies, Cameroon (CEDC), Yaounde, Cameroon.,Regional Training Centre Specialized in Agriculture, Forest and Wood, University of Dschang, BP 138, Yaounde, Cameroon
| | - E A Sogbohossou
- Laboratoire d'Ecologie Appliquée, Université d'Abomey-Calavi, 03 BP 294, Cotonou, Benin
| | - O D Schaap
- Institute of Biology Leiden (IBL), Leiden University, PO Box 9505, 2300 RA, Leiden, The Netherlands
| | - H Bauer
- Wildlife Conservation Research Unit, Zoology, University of Oxford Recanati-Kaplan Centre, Tubney, OX13 5QL, UK
| | - B D Patterson
- Negaunee Integrative Research Center, Field Museum of Natural History, Chicago, IL, 60605, USA
| | - P A White
- Center for Tropical Research, Institute of the Environment and Sustainability, University of California, Los Angeles, CA, 90095-1496, USA
| | - H H de Iongh
- Institute of Environmental Sciences (CML), Leiden University, PO Box 9518, 2300 RA, Leiden, The Netherlands.,Department of Biology, Evolutionary Ecology Group, University of Antwerp, Groenenborgerlaan 171, 2020, Antwerpen, Belgium
| | - J F J Laros
- Department of Human Genetics, Leiden University Medical Center, 2300 RC, Leiden, The Netherlands.,Leiden Genome Technology Center, Leiden University Medical Center, 2300 RC, Leiden, The Netherlands
| | - K Vrieling
- Institute of Biology Leiden (IBL), Leiden University, PO Box 9505, 2300 RA, Leiden, The Netherlands
| |
Collapse
|
31
|
Mai U, Mirarab S. Completing gene trees without species trees in sub-quadratic time. Bioinformatics 2022; 38:1532-1541. [PMID: 34978565 DOI: 10.1093/bioinformatics/btab875] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 11/27/2021] [Accepted: 12/30/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION As genome-wide reconstruction of phylogenetic trees becomes more widespread, limitations of available data are being appreciated more than ever before. One issue is that phylogenomic datasets are riddled with missing data, and gene trees, in particular, almost always lack representatives from some species otherwise available in the dataset. Since many downstream applications of gene trees require or can benefit from access to complete gene trees, it will be beneficial to algorithmically complete gene trees. Also, gene trees are often unrooted, and rooting them is useful for downstream applications. While completing and rooting a gene tree with respect to a given species tree has been studied, those problems are not studied in depth when we lack such a reference species tree. RESULTS We study completion of gene trees without a need for a reference species tree. We formulate an optimization problem to complete the gene trees while minimizing their quartet distance to the given set of gene trees. We extend a seminal algorithm by Brodal et al. to solve this problem in quasi-linear time. In simulated studies and on a large empirical data, we show that completion of gene trees using other gene trees is relatively accurate and, unlike the case where a species tree is available, is unbiased. AVAILABILITY AND IMPLEMENTATION Our method, tripVote, is available at https://github.com/uym2/tripVote. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Uyen Mai
- Department of Computer Science and Engineering, University of California San Diego, San Diego, CA 92093, USA
| | - Siavash Mirarab
- Department of Electrical and Computer Engineering, University of California San Diego, San Diego, CA 92093, USA
| |
Collapse
|
32
|
Morel B, Schade P, Lutteropp S, Williams TA, Szöllősi GJ, Stamatakis A. SpeciesRax: A Tool for Maximum Likelihood Species Tree Inference from Gene Family Trees under Duplication, Transfer, and Loss. Mol Biol Evol 2022; 39:msab365. [PMID: 35021210 PMCID: PMC8826479 DOI: 10.1093/molbev/msab365] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Species tree inference from gene family trees is becoming increasingly popular because it can account for discordance between the species tree and the corresponding gene family trees. In particular, methods that can account for multiple-copy gene families exhibit potential to leverage paralogy as informative signal. At present, there does not exist any widely adopted inference method for this purpose. Here, we present SpeciesRax, the first maximum likelihood method that can infer a rooted species tree from a set of gene family trees and can account for gene duplication, loss, and transfer events. By explicitly modeling events by which gene trees can depart from the species tree, SpeciesRax leverages the phylogenetic rooting signal in gene trees. SpeciesRax infers species tree branch lengths in units of expected substitutions per site and branch support values via paralogy-aware quartets extracted from the gene family trees. Using both empirical and simulated data sets we show that SpeciesRax is at least as accurate as the best competing methods while being one order of magnitude faster on large data sets at the same time. We used SpeciesRax to infer a biologically plausible rooted phylogeny of the vertebrates comprising 188 species from 31,612 gene families in 1 h using 40 cores. SpeciesRax is available under GNU GPL at https://github.com/BenoitMorel/GeneRax and on BioConda.
Collapse
Affiliation(s)
- Benoit Morel
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Paul Schade
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| | - Sarah Lutteropp
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Tom A Williams
- School of Biological Sciences, University of Bristol, Bristol, United Kingdom
| | - Gergely J Szöllősi
- ELTE-MTA “Lendület” Evolutionary Genomics Research Group, Budapest, Hungary
- Department of Biological Physics, Eötvös University, Budapest, Hungary
- Institute of Evolution, Centre for Ecological Research, Budapest, Hungary
| | - Alexandros Stamatakis
- Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
| |
Collapse
|
33
|
Field JT, Abrams AJ, Cartee JC, McTavish EJ. Rapid alignment updating with Extensiphy. Methods Ecol Evol 2022. [DOI: 10.1111/2041-210x.13790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jasper Toscani Field
- Quantitative and Systems Biology Program School of Natural Sciences University of California Merced CA USA
| | - A. Jeanine Abrams
- Division of STD Prevention National Centers for HIV/AIDS Viral Hepatitis, STD, and TB Prevention Atlanta GA USA
| | - John C. Cartee
- Division of STD Prevention National Centers for HIV/AIDS Viral Hepatitis, STD, and TB Prevention Atlanta GA USA
| | - Emily Jane McTavish
- Life and Environmental Sciences Department School of Natural Sciences University of California Merced CA USA
| |
Collapse
|
34
|
Abstract
Growing popularity of herbal medicine has increased the demand of medicinal orchids in the global markets leading to their overharvesting from natural habitats for illegal trade. To stop such illegal trade, the correct identification of orchid species from their traded products is a foremost requirement. Different species of medicinal orchids are traded as their dried or fresh parts (tubers, pseudobulbs, stems), which look similar to each other making it almost impossible to identify them merely based on morphological observation. To overcome this problem, DNA barcoding could be an important method for accurate identification of medicinal orchids. Therefore, this research evaluated DNA barcoding of medicinal orchids in Asia where illegal trade of medicinal orchids has long existed. Based on genetic distance, similarity-based and tree-based methods with sampling nearly 7,000 sequences from five single barcodes (ITS, ITS2, matK, rbcL, trnH-psbA and their seven combinations), this study revealed that DNA barcoding is effective for identifying medicinal orchids. Among single locus, ITS performed the best barcode, whereas ITS + matK exhibited the most efficient barcode among multi-loci. A barcode library as a resource for identifying medicinal orchids has been established which contains about 7,000 sequences of 380 species (i.e. 90%) of medicinal orchids in Asia.
Collapse
|
35
|
How challenging RADseq data turned out to favor coalescent-based species tree inference. A case study in Aichryson (Crassulaceae). Mol Phylogenet Evol 2021; 167:107342. [PMID: 34785384 DOI: 10.1016/j.ympev.2021.107342] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2020] [Revised: 07/05/2021] [Accepted: 10/29/2021] [Indexed: 12/24/2022]
Abstract
Analysing multiple genomic regions while incorporating detection and qualification of discordance among regions has become standard for understanding phylogenetic relationships. In plants, which usually have comparatively large genomes, this is feasible by the combination of reduced-representation library (RRL) methods and high-throughput sequencing enabling the cost effective acquisition of genomic data for thousands of loci from hundreds of samples. One popular RRL method is RADseq. A major disadvantage of established RADseq approaches is the rather short fragment and sequencing range, leading to loci of little individual phylogenetic information. This issue hampers the application of coalescent-based species tree inference. The modified RADseq protocol presented here targets ca. 5,000 loci of 300-600nt length, sequenced with the latest short-read-sequencing (SRS) technology, has the potential to overcome this drawback. To illustrate the advantages of this approach we use the study group Aichryson Webb & Berthelott (Crassulaceae), a plant genus that diversified on the Canary Islands. The data analysis approach used here aims at a careful quality control of the long loci dataset. It involves an informed selection of thresholds for accurate clustering, a thorough exploration of locus properties, such as locus length, coverage and variability, to identify potential biased data and a comparative phylogenetic inference of filtered datasets, accompanied by an evaluation of resulting BS support, gene and site concordance factor values, to improve overall resolution of the resulting phylogenetic trees. The final dataset contains variable loci with an average length of 373nt and facilitates species tree estimation using a coalescent-based summary approach. Additional improvements brought by the approach are critically discussed.
Collapse
|
36
|
Yardeni G, Viruel J, Paris M, Hess J, Groot Crego C, de La Harpe M, Rivera N, Barfuss MHJ, Till W, Guzmán-Jacob V, Krömer T, Lexer C, Paun O, Leroy T. Taxon-specific or universal? Using target capture to study the evolutionary history of rapid radiations. Mol Ecol Resour 2021; 22:927-945. [PMID: 34606683 PMCID: PMC9292372 DOI: 10.1111/1755-0998.13523] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 09/09/2021] [Accepted: 09/22/2021] [Indexed: 12/20/2022]
Abstract
Target capture has emerged as an important tool for phylogenetics and population genetics in nonmodel taxa. Whereas developing taxon‐specific capture probes requires sustained efforts, available universal kits may have a lower power to reconstruct relationships at shallow phylogenetic scales and within rapidly radiating clades. We present here a newly developed target capture set for Bromeliaceae, a large and ecologically diverse plant family with highly variable diversification rates. The set targets 1776 coding regions, including genes putatively involved in key innovations, with the aim to empower testing of a wide range of evolutionary hypotheses. We compare the relative power of this taxon‐specific set, Bromeliad1776, to the universal Angiosperms353 kit. The taxon‐specific set results in higher enrichment success across the entire family; however, the overall performance of both kits to reconstruct phylogenetic trees is relatively comparable, highlighting the vast potential of universal kits for resolving evolutionary relationships. For more detailed phylogenetic or population genetic analyses, for example the exploration of gene tree concordance, nucleotide diversity or population structure, the taxon‐specific capture set presents clear benefits. We discuss the potential lessons that this comparative study provides for future phylogenetic and population genetic investigations, in particular for the study of evolutionary radiations.
Collapse
Affiliation(s)
- Gil Yardeni
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | | | - Margot Paris
- Unit of Ecology & Evolution, Department of Biology, University of Fribourg, Fribourg, Switzerland
| | - Jaqueline Hess
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Department of Soil Ecology, Helmholtz Centre for Environmental Research, UFZ, Halle (Saale), Germany
| | - Clara Groot Crego
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Marylaure de La Harpe
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Norma Rivera
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Michael H J Barfuss
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Walter Till
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Valeria Guzmán-Jacob
- Biodiversity, Macroecology and Biogeography, University of Goettingen, Göttingen, Germany
| | - Thorsten Krömer
- Centro de Investigaciones Tropicales, Universidad Veracruzana, Xalapa, Mexico
| | - Christian Lexer
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Thibault Leroy
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| |
Collapse
|
37
|
Blotto BL, Lyra ML, Cardoso MCS, Trefaut Rodrigues M, R Dias I, Marciano-Jr E, Dal Vechio F, Orrico VGD, Brandão RA, Lopes de Assis C, Lantyer-Silva ASF, Rutherford MG, Gagliardi-Urrutia G, Solé M, Baldo D, Nunes I, Cajade R, Torres A, Grant T, Jungfer KH, da Silva HR, Haddad CFB, Faivovich J. The phylogeny of the Casque-headed Treefrogs (Hylidae: Hylinae: Lophyohylini). Cladistics 2021; 37:36-72. [PMID: 34478174 DOI: 10.1111/cla.12409] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2019] [Indexed: 12/24/2022] Open
Abstract
The South American and West Indian Casque-headed Treefrogs (Hylidae: Hylinae: Lophyohylini) include 85 species. These are notably diverse in morphology (e.g. disparate levels of cranial hyperossification) and life history (e.g. different reproductive modes, chemical defences), have a wide distribution, and occupy habitats from the tropical rainforests to semiarid scrubland. In this paper, we present a phylogenetic analysis of this hylid tribe based on sequence fragments of up to five mitochondrial (12S, 16S, ND1, COI, Cytb) and six nuclear genes (POMC, RAG-1, RHOD, SIAH, TNS3, TYR). We included most of its species (> 96%), in addition to a number of new species. Our results indicate: (i) the paraphyly of Trachycephalus with respect to Aparasphenodon venezolanus; (ii) the nonmonophyly of Aparasphenodon, with Argenteohyla siemersi, Corythomantis galeata and Nyctimantis rugiceps nested within it, and Ap. venezolanus nested within Trachycephalus; (iii) the polyphyly of Corythomantis; (iv) the nonmonophyly of the recognized species groups of Phyllodytes; and (v) a pervasive low support for the deep relationships among the major clades of Lophyohylini, including C. greeningi and the monotypic genera Itapotihyla and Phytotriades. To remedy the nonmonophyly of Aparasphenodon, Corythomantis, and Trachycephalus, we redefined Nyctimantis to include Aparasphenodon (with the exception of Ap. venezolanus, which we transferred to Trachycephalus), Argenteohyla, and C. galeata. Additionally, our results indicate the need for taxonomic work in the following clades: (i) Trachycephalus dibernardoi and Tr. imitatrix; (ii) Tr. atlas, Tr. mambaiensis and Tr. nigromaculatus; and (iii) Phyllodytes. On the basis of our phylogenetic results, we analyzed the evolution of skull hyperossification and reproductive biology, with emphasis on the multiple independent origins of phytotelm breeding, in the context of Anura. We also analyzed the inter-related aspects of chemical defences, venom delivery, phragmotic behaviour, co-ossification, and prevention of evaporative water loss.
Collapse
Affiliation(s)
- Boris L Blotto
- Departamento de Biodiversidade and Centro de Aquicultura, Instituto de Biociências, Universidade Estadual Paulista, Av. 24A 1515, 13506-900, Rio Claro, São Paulo, Brazil.,Departamento de Zoologia, Instituto de Biociências, Universidade de São Paulo, 05508-090, São Paulo, São Paulo, Brazil
| | - Mariana L Lyra
- Departamento de Biodiversidade and Centro de Aquicultura, Instituto de Biociências, Universidade Estadual Paulista, Av. 24A 1515, 13506-900, Rio Claro, São Paulo, Brazil
| | - Monica C S Cardoso
- Setor de Herpetologia, Departamento de Vertebrados, Museu Nacional, Universidade Federal do Rio de Janeiro, Quinta da Boa Vista, CEP 20940-040, Rio de Janeiro, Rio de Janeiro, Brazil
| | - Miguel Trefaut Rodrigues
- Departamento de Zoologia, Instituto de Biociências, Universidade de São Paulo, 05508-090, São Paulo, São Paulo, Brazil
| | - Iuri R Dias
- Tropical Herpetology Laboratory, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado, km 16, CEP 45662-900, Ilhéus, Bahia, Brazil
| | - Euvaldo Marciano-Jr
- Tropical Herpetology Laboratory, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado, km 16, CEP 45662-900, Ilhéus, Bahia, Brazil
| | - Francisco Dal Vechio
- Departamento de Zoologia, Instituto de Biociências, Universidade de São Paulo, 05508-090, São Paulo, São Paulo, Brazil
| | - Victor G D Orrico
- Tropical Herpetology Laboratory, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado, km 16, CEP 45662-900, Ilhéus, Bahia, Brazil
| | - Reuber A Brandão
- Laboratório de Fauna e Unidades de Conservação, Departamento de Engenharia Florestal, Universidade de Brasília, 70910-900, Brasília, Distrito Federal, Brazil
| | - Clodoaldo Lopes de Assis
- Museu de Zoologia João Moojen, Departamento de Biologia Animal, Universidade Federal de Viçosa, 36570-900, Viçosa, Minas Gerais, Brazil
| | - Amanda S F Lantyer-Silva
- Departamento de Biodiversidade and Centro de Aquicultura, Instituto de Biociências, Universidade Estadual Paulista, Av. 24A 1515, 13506-900, Rio Claro, São Paulo, Brazil
| | - Mike G Rutherford
- Department of Life Sciences, The University of The West Indies Zoology Museum, The University of The West Indies, St. Augustine, Trinidad & Tobago
| | - Giussepe Gagliardi-Urrutia
- Laboratorio de Sistemática de Vertebrados, Pontifícia Universidade Católica do Rio Grande do Sul (PUCRS), Av. Ipiranga, 6681, Prédio 40, sala 110, 90619-900, Porto Alegre, Rio Grande do Sul, Brazil
| | - Mirco Solé
- Tropical Herpetology Laboratory, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz, Rodovia Jorge Amado, km 16, CEP 45662-900, Ilhéus, Bahia, Brazil
| | - Diego Baldo
- Laboratorio de Genetica Evolutiva "Claudio Juan Bidau", Instituto de Biologıa Subtropical (CONICET-UNaM), Félix de Azara, 1552, CPA N3300LQF Posadas, Misiones, Argentina
| | - Ivan Nunes
- Laboratório de Herpetologia, Instituto de Biociências, Universidade Estadual Paulista, Campus do Litoral Paulista, CEP 11330-900, São Vicente, São Paulo, Brazil
| | - Rodrigo Cajade
- Laboratorio de Herpetología, Departamento de Biología, Facultad de Ciencias Exactas y Naturales y Agrimensura, CONICET, Universidad Nacional del Nordeste, Av. Libertad 5470, 3400, Corrientes, Argentina
| | - Ambrosio Torres
- Unidad Ejecutora Lillo, CONICET - Fundación Miguel Lillo, Miguel Lillo 251, 4000, San Miguel de Tucumán, Argentina
| | - Taran Grant
- Departamento de Zoologia, Instituto de Biociências, Universidade de São Paulo, 05508-090, São Paulo, São Paulo, Brazil
| | - Karl-Heinz Jungfer
- Department of Biology, Institute of Integrated Sciences, University of Koblenz-Landau, Universitätsstr. 1, 56070, Koblenz, Germany
| | - Helio R da Silva
- Departamento de Biologia Animal, Instituto de Biologia, Universidade Federal Rural do Rio de Janeiro, Caixa Postal 74524, 23851-970, Seropédica, Rio de Janeiro, Brazil
| | - Célio F B Haddad
- Departamento de Biodiversidade and Centro de Aquicultura, Instituto de Biociências, Universidade Estadual Paulista, Av. 24A 1515, 13506-900, Rio Claro, São Paulo, Brazil
| | - Julián Faivovich
- División Herpetología, Museo Argentino de Ciencias Naturales "Bernardino Rivadavia"-CONICET, Angel Gallardo 470, C1405DJR, Buenos Aires, Argentina.,Departamento de Biodiversidad y Biología Experimental, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
38
|
Smith BT, Mauck WM, Benz BW, Andersen MJ. Uneven Missing Data Skew Phylogenomic Relationships within the Lories and Lorikeets. Genome Biol Evol 2021; 12:1131-1147. [PMID: 32470111 PMCID: PMC7486955 DOI: 10.1093/gbe/evaa113] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/26/2020] [Indexed: 01/21/2023] Open
Abstract
The resolution of the Tree of Life has accelerated with advances in DNA sequencing technology. To achieve dense taxon sampling, it is often necessary to obtain DNA from historical museum specimens to supplement modern genetic samples. However, DNA from historical material is generally degraded, which presents various challenges. In this study, we evaluated how the coverage at variant sites and missing data among historical and modern samples impacts phylogenomic inference. We explored these patterns in the brush-tongued parrots (lories and lorikeets) of Australasia by sampling ultraconserved elements in 105 taxa. Trees estimated with low coverage characters had several clades where relationships appeared to be influenced by whether the sample came from historical or modern specimens, which were not observed when more stringent filtering was applied. To assess if the topologies were affected by missing data, we performed an outlier analysis of sites and loci, and a data reduction approach where we excluded sites based on data completeness. Depending on the outlier test, 0.15% of total sites or 38% of loci were driving the topological differences among trees, and at these sites, historical samples had 10.9× more missing data than modern ones. In contrast, 70% data completeness was necessary to avoid spurious relationships. Predictive modeling found that outlier analysis scores were correlated with parsimony informative sites in the clades whose topologies changed the most by filtering. After accounting for biased loci and understanding the stability of relationships, we inferred a more robust phylogenetic hypothesis for lories and lorikeets.
Collapse
Affiliation(s)
- Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York, New York
| | - William M Mauck
- Department of Ornithology, American Museum of Natural History, New York, New York.,New York Genome Center, New York, New York
| | - Brett W Benz
- Museum of Zoology and Department of Ecology and Evolutionary Biology, University of Michigan
| | - Michael J Andersen
- Department of Biology and Museum of Southwestern Biology, University of New Mexico
| |
Collapse
|
39
|
Thomas SK, Liu X, Du Z, Dong Y, Cummings A, Pokorny L, Xiang Q(J, Leebens‐Mack JH. Comprehending Cornales: phylogenetic reconstruction of the order using the Angiosperms353 probe set. AMERICAN JOURNAL OF BOTANY 2021; 108:1112-1121. [PMID: 34263456 PMCID: PMC8361741 DOI: 10.1002/ajb2.1696] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 05/12/2021] [Indexed: 05/08/2023]
Abstract
PREMISE Cornales is an order of flowering plants containing ecologically and horticulturally important families, including Cornaceae (dogwoods) and Hydrangeaceae (hydrangeas), among others. While many relationships in Cornales are strongly supported by previous studies, some uncertainty remains with regards to the placement of Hydrostachyaceae and to relationships among families in Cornales and within Cornaceae. Here we analyzed hundreds of nuclear loci to test published phylogenetic hypotheses and estimated a robust species tree for Cornales. METHODS Using the Angiosperms353 probe set and existing data sets, we generated phylogenomic data for 158 samples, representing all families in the Cornales, with intensive sampling in the Cornaceae. RESULTS We curated an average of 312 genes per sample, constructed maximum likelihood gene trees, and inferred a species tree using the summary approach implemented in ASTRAL-III, a method statistically consistent with the multispecies coalescent model. CONCLUSIONS The species tree we constructed generally shows high support values and a high degree of concordance among individual nuclear gene trees. Relationships among families are largely congruent with previous molecular studies, except for the placement of the nyssoids and the Grubbiaceae-Curtisiaceae clades. Furthermore, we were able to place Hydrostachyaceae within Cornales, and within Cornaceae, the monophyly of known morphogroups was well supported. However, patterns of gene tree discordance suggest potential ancient reticulation, gene flow, and/or ILS in the Hydrostachyaceae lineage and the early diversification of Cornus. Our findings reveal new insights into the diversification process across Cornales and demonstrate the utility of the Angiosperms353 probe set.
Collapse
Affiliation(s)
- Shawn K. Thomas
- Department of Plant BiologyUniversity of GeorgiaAthensGA30602USA
- Division of Biological SciencesUniversity of MissouriColumbiaMO65203USA
| | - Xiang Liu
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
- SyngentaResearch Triangle ParkNC27709USA
| | - Zhi‐Yuan Du
- Wuhan Botanical GardenThe Chinese Academy of SciencesWuhanHubei430074China
| | - Yibo Dong
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
- Global Health Infectious Disease ResearchCollege of Public HealthUniversity of South FloridaTampaFL33612USA
| | - Amanda Cummings
- Department of Plant BiologyUniversity of GeorgiaAthensGA30602USA
| | - Lisa Pokorny
- Royal Botanic Gardens, KewRichmondLondonTW9 3AEUK
- Computational/Systems Biology and Genomics ProgramCentre for Plant Biotechnology and GenomicsUPM‐INIA‐CSICPozuelo de Alarcón (Madrid)28223Spain
| | - Qui‐Yun (Jenny) Xiang
- Department of Plant and Microbial BiologyNorth Carolina State UniversityRaleighNC27695USA
| | | |
Collapse
|
40
|
Shah T, Schneider JV, Zizka G, Maurin O, Baker W, Forest F, Brewer GE, Savolainen V, Darbyshire I, Larridon I. Joining forces in Ochnaceae phylogenomics: a tale of two targeted sequencing probe kits. AMERICAN JOURNAL OF BOTANY 2021; 108:1201-1216. [PMID: 34180046 DOI: 10.1002/ajb2.1682] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Accepted: 02/23/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Both universal and family-specific targeted sequencing probe kits are becoming widely used for reconstruction of phylogenetic relationships in angiosperms. Within the pantropical Ochnaceae, we show that with careful data filtering, universal kits are equally as capable in resolving intergeneric relationships as custom probe kits. Furthermore, we show the strength in combining data from both kits to mitigate bias and provide a more robust result to resolve evolutionary relationships. METHODS We sampled 23 Ochnaceae genera and used targeted sequencing with two probe kits, the universal Angiosperms353 kit and a family-specific kit. We used maximum likelihood inference with a concatenated matrix of loci and multispecies-coalescence approaches to infer relationships in the family. We explored phylogenetic informativeness and the impact of missing data on resolution and tree support. RESULTS For the Angiosperms353 data set, the concatenation approach provided results more congruent with those of the Ochnaceae-specific data set. Filtering missing data was most impactful on the Angiosperms353 data set, with a relaxed threshold being the optimum scenario. The Ochnaceae-specific data set resolved consistent topologies using both inference methods, and no major improvements were obtained after data filtering. Merging of data obtained with the two kits resulted in a well-supported phylogenetic tree. CONCLUSIONS The Angiosperms353 data set improved upon data filtering, and missing data played an important role in phylogenetic reconstruction. The Angiosperms353 data set resolved the phylogenetic backbone of Ochnaceae as equally well as the family specific data set. All analyses indicated that both Sauvagesia L. and Campylospermum Tiegh. as currently circumscribed are polyphyletic and require revised delimitation.
Collapse
Affiliation(s)
- Toral Shah
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | - Julio V Schneider
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
| | - Georg Zizka
- Department of Botany and Molecular Evolution, Senckenberg Research Institute and Natural History Museum Frankfurt, Senckenberganlage 25, Frankfurt am Main, D-60325, Germany
- Institute of Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Str. 13, Frankfurt am Main, 60438, Germany
| | - Olivier Maurin
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - William Baker
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Grace E Brewer
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
| | - Vincent Savolainen
- Department of Life Sciences, Imperial College, Silwood Park Campus, Ascot, Berks, SL5 7PY, UK
| | | | - Isabel Larridon
- Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AE, UK
- Systematic and Evolutionary Botany Lab, Department of Biology, Ghent University, K.L., Ledeganckstraat 35, Gent, 9000, Belgium
| |
Collapse
|
41
|
Barrientos LS, Streicher JW, Miller EC, Pie MR, Wiens JJ, Crawford AJ. Phylogeny of terraranan frogs based on 2,665 loci and impacts of missing data on phylogenomic analyses. SYST BIODIVERS 2021. [DOI: 10.1080/14772000.2021.1933249] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Lucas S. Barrientos
- Department of Biological Sciences, Universidad de los Andes, Bogotá, 111711, Colombia
| | - Jeffrey W. Streicher
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, 85721-0088, AZ, USA
- Department of Life Sciences, The Natural History Museum, South Kensington, London, SW7 5BD, England, UK
| | - Elizabeth C. Miller
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, 85721-0088, AZ, USA
- School of Aquatic and Fishery Sciences, University of Washington, Seattle, 98195-5020, WA, USA
| | - Marcio R. Pie
- Departamento de Zoologia, Universidade Federal do Paraná, Curitiba, 81531-980, Paraná, Brazil
| | - John J. Wiens
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, 85721-0088, AZ, USA
| | - Andrew J. Crawford
- Department of Biological Sciences, Universidad de los Andes, Bogotá, 111711, Colombia
| |
Collapse
|
42
|
Li X, Hou Z, Xu C, Shi X, Yang L, Lewis LA, Zhong B. Large Phylogenomic Data sets Reveal Deep Relationships and Trait Evolution in Chlorophyte Green Algae. Genome Biol Evol 2021; 13:6265471. [PMID: 33950183 PMCID: PMC8271138 DOI: 10.1093/gbe/evab101] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/04/2021] [Indexed: 12/01/2022] Open
Abstract
The chlorophyte green algae (Chlorophyta) are species-rich ancient groups ubiquitous in various habitats with high cytological diversity, ranging from microscopic to macroscopic organisms. However, the deep phylogeny within core Chlorophyta remains unresolved, in part due to the relatively sparse taxon and gene sampling in previous studies. Here we contribute new transcriptomic data and reconstruct phylogenetic relationships of core Chlorophyta based on four large data sets up to 2,698 genes of 70 species, representing 80% of extant orders. The impacts of outgroup choice, missing data, bootstrap-support cutoffs, and model misspecification in phylogenetic inference of core Chlorophyta are examined. The species tree topologies of core Chlorophyta from different analyses are highly congruent, with strong supports at many relationships (e.g., the Bryopsidales and the Scotinosphaerales-Dasycladales clade). The monophyly of Chlorophyceae and of Trebouxiophyceae as well as the uncertain placement of Chlorodendrophyceae and Pedinophyceae corroborate results from previous studies. The reconstruction of ancestral scenarios illustrates the evolution of the freshwater-sea and microscopic–macroscopic transition in the Ulvophyceae, and the transformation of unicellular→colonial→multicellular in the chlorophyte green algae. In addition, we provided new evidence that serine is encoded by both canonical codons and noncanonical TAG code in Scotinosphaerales, and stop-to-sense codon reassignment in the Ulvophyceae has originated independently at least three times. Our robust phylogenetic framework of core Chlorophyta unveils the evolutionary history of phycoplast, cyto-morphology, and noncanonical genetic codes in chlorophyte green algae.
Collapse
Affiliation(s)
- Xi Li
- College of Life Sciences, Nanjing Normal University, China
| | - Zheng Hou
- College of Life Sciences, Nanjing Normal University, China
| | - Chenjie Xu
- College of Life Sciences, Nanjing Normal University, China
| | - Xuan Shi
- College of Life Sciences, Nanjing Normal University, China
| | - Lingxiao Yang
- College of Life Sciences, Nanjing Normal University, China
| | - Louise A Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Bojian Zhong
- College of Life Sciences, Nanjing Normal University, China
| |
Collapse
|
43
|
Minh BQ, Hahn MW, Lanfear R. New Methods to Calculate Concordance Factors for Phylogenomic Datasets. Mol Biol Evol 2021; 37:2727-2733. [PMID: 32365179 PMCID: PMC7475031 DOI: 10.1093/molbev/msaa106] [Citation(s) in RCA: 263] [Impact Index Per Article: 87.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
We implement two measures for quantifying genealogical concordance in phylogenomic data sets: the gene concordance factor (gCF) and the novel site concordance factor (sCF). For every branch of a reference tree, gCF is defined as the percentage of "decisive" gene trees containing that branch. This measure is already in wide usage, but here we introduce a package that calculates it while accounting for variable taxon coverage among gene trees. sCF is a new measure defined as the percentage of decisive sites supporting a branch in the reference tree. gCF and sCF complement classical measures of branch support in phylogenetics by providing a full description of underlying disagreement among loci and sites. An easy to use implementation and tutorial is freely available in the IQ-TREE software package (http://www.iqtree.org/doc/Concordance-Factor, last accessed May 13, 2020).
Collapse
Affiliation(s)
- Bui Quang Minh
- Research School of Computer Science, Australian National University, Canberra, ACT, Australia.,Department of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, ACT, Australia
| | - Matthew W Hahn
- Department of Biology, Indiana University, Bloomington, IN.,Department of Computer Science, Indiana University, Bloomington, IN
| | - Robert Lanfear
- Department of Ecology and Evolution, Research School of Biology, Australian National University, Canberra, ACT, Australia
| |
Collapse
|
44
|
A novel reference dated phylogeny for the genus Spodoptera Guenée (Lepidoptera: Noctuidae: Noctuinae): new insights into the evolution of a pest-rich genus. Mol Phylogenet Evol 2021; 161:107161. [PMID: 33794395 DOI: 10.1016/j.ympev.2021.107161] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 03/16/2021] [Accepted: 03/23/2021] [Indexed: 02/03/2023]
Abstract
The noctuid genus Spodoptera currently consists of 31 species with varied host plant breadths, ranging from monophagous and oligophagous non-pest species to polyphagous pests of economic importance. Several of these pest species have become major invaders, colonizing multiple continents outside their native range. Such is the case of the infamous fall armyworm, Spodoptera frugiperda (J.E. Smith), which includes two recognized host strains that have not been treated as separate species. Following its accidental introduction to Africa in 2016, it quickly spread through Africa and Asia to Australia. Given that half the described Spodoptera species cause major crop losses, comparative genomics studies of several Spodoptera species have highlighted major adaptive changes in genetic architecture, possibly relating to their pest status. Several recent population genomics studies conducted on two species enable a more refined understanding of their population structures, migration patterns and invasion processes. Despite growing interest in the genus, the taxonomic status of several Spodoptera species remains unstable and evolutionary studies suffer from the absence of a robust and comprehensive dated phylogenetic framework. We generated mitogenomic data for 14 Spodoptera taxa, which are combined with data from 15 noctuoid outgroups to generate a resolved mitogenomic backbone phylogeny using both concatenation and multi-species coalescent approaches. We combine this backbone with additional mitochondrial and nuclear data to improve our understanding of the evolutionary history of the genus. We also carry out comprehensive dating analyses, which implement three distinct calibration strategies based on either primary or secondary fossil calibrations. Our results provide an updated phylogenetic framework for 28 Spodoptera species, identifying two well-supported ecologically diverse clades that are recovered for the first time. Well-studied larvae in each of these clades are characterized by differences in mandibular shape, with one clade's being more specialized on silica-rich C4 grasses. Interestingly, the inferred timeframe for the genus suggests an earlier origin than previously thought for the genus: about 17-18 million years ago.
Collapse
|
45
|
Bianchi FM, Krein V, Rider D, Grazia J. Finding Padaeus bovillus (Hemiptera: Pentatomidae): A Phylogenetic Placement and the Description of Its Sister Species. Zool Stud 2021; 60:e11. [PMID: 34630718 PMCID: PMC8473673 DOI: 10.6620/zs.2021.60-11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Accepted: 01/14/2021] [Indexed: 11/18/2022]
Abstract
The pentatomids (Hemiptera: Heteroptera) are the third most speciose family within the Heteroptera or the true bugs. The family occurs worldwide and comprises around five thousand valid species within 950 genera. Padaeus Stål belongs to a complex of other genera of Carpocorini Mulsant and Rey related to Euschistus Dallas. These genera present similarities in color, size, and shape, and share common features. However, among its four congeneric species, Padaeus bovillus Distant has been highlighted as an outlier by the posterior margins of the bucculae evanescent, while its congeneric species present posterior margins of the bucculae lobed. Thus, herein we redescribe P. bovillus and present a hypothesis regarding its phylogenetic placement within the Carpocorini. Furthermore, a new species similar to P. bovillus is described. Four molecular markers (COI, CytB, 16S, and 28S) plus 86 morphological characters were used to infer the phylogeny under Maximum likelihood and Bayesian inference. For the descriptions, we measured 16 morphometric parameters and dissected the genitalic structures. We also include illustrations of the habitus, internal and external genitalic structures, and provide distribution maps. Mitripus seclusus sp. n. Bianchi, Krein, Rider, and Grazia is recovered as the sister species to Mitripus bovillus comb. n., and both within Mitripus Rolston. Among other shared characters, species of Mitripus have the femora unarmed, they have a macula near the apex of the radial vein, and the mandibular plates tapering apically. Mitripus bovillus comb. n. and Mitripus seclusus sp. n. have the posterior margin of the pygophore projecting as a spine, a unique pattern within the genus. According to our results, Mitripus including M. bovillus comb. n. and Mitripus seclusus sp. n. now includes five species.
Collapse
Affiliation(s)
- Filipe Michels Bianchi
- Laboratório de Entomologia Sistemática, Departamento de Zoologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil. E-mail: (Bianchi); (Krein); (Grazia)
| | - Verônica Krein
- Laboratório de Entomologia Sistemática, Departamento de Zoologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil. E-mail: (Bianchi); (Krein); (Grazia)
| | - David Rider
- Entomology Department, School of Natural Resource Sciences, North Dakota State University, Fargo, ND, USA. E-mail: (Rider)
| | - Jocelia Grazia
- Laboratório de Entomologia Sistemática, Departamento de Zoologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil. E-mail: (Bianchi); (Krein); (Grazia)
| |
Collapse
|
46
|
Harvey MG, Bravo GA, Claramunt S, Cuervo AM, Derryberry GE, Battilana J, Seeholzer GF, McKay JS, O'Meara BC, Faircloth BC, Edwards SV, Pérez-Emán J, Moyle RG, Sheldon FH, Aleixo A, Smith BT, Chesser RT, Silveira LF, Cracraft J, Brumfield RT, Derryberry EP. The evolution of a tropical biodiversity hotspot. Science 2021; 370:1343-1348. [PMID: 33303617 DOI: 10.1126/science.aaz6970] [Citation(s) in RCA: 83] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 05/01/2020] [Accepted: 10/29/2020] [Indexed: 12/17/2022]
Abstract
The tropics are the source of most biodiversity yet inadequate sampling obscures answers to fundamental questions about how this diversity evolves. We leveraged samples assembled over decades of fieldwork to study diversification of the largest tropical bird radiation, the suboscine passerines. Our phylogeny, estimated using data from 2389 genomic regions in 1940 individuals of 1283 species, reveals that peak suboscine species diversity in the Neotropics is not associated with high recent speciation rates but rather with the gradual accumulation of species over time. Paradoxically, the highest speciation rates are in lineages from regions with low species diversity, which are generally cold, dry, unstable environments. Our results reveal a model in which species are forming faster in environmental extremes but have accumulated in moderate environments to form tropical biodiversity hotspots.
Collapse
Affiliation(s)
- Michael G Harvey
- Department of Biological Sciences, The University of Texas at El Paso, El Paso, TX 79968, USA. .,Biodiversity Collections, The University of Texas at El Paso, El Paso, TX 79968, USA.,Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Gustavo A Bravo
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA. .,Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA.,Museu de Zoologia da Universidade de São Paulo, 04263-000 Ipiranga, São Paulo, SP, Brazil
| | - Santiago Claramunt
- Department of Natural History, Royal Ontario Museum, Toronto, Ontario M5S2C6, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario M5S3B2, Canada.,Department of Ornithology, American Museum of Natural History, New York, NY 10024, USA
| | - Andrés M Cuervo
- Instituto de Ciencias Naturales, Universidad Nacional de Colombia, Bogotá 111321, Colombia.,Department of Ecology and Evolutionary Biology, Tulane University, LA 70118, USA
| | - Graham E Derryberry
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN 37996, USA.,Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Jaqueline Battilana
- Museu de Zoologia da Universidade de São Paulo, 04263-000 Ipiranga, São Paulo, SP, Brazil
| | - Glenn F Seeholzer
- Department of Ornithology, American Museum of Natural History, New York, NY 10024, USA.,Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Jessica Shearer McKay
- Department of Ornithology, American Museum of Natural History, New York, NY 10024, USA
| | - Brian C O'Meara
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - Brant C Faircloth
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA.,Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA.,Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, USA
| | - Jorge Pérez-Emán
- Instituto de Zoología y Ecología Tropical, Universidad Central de Venezuela, Caracas, Venezuela.,Colección Ornitológica Phelps, Caracas, Venezuela
| | - Robert G Moyle
- Biodiversity Institute, University of Kansas, Lawrence, KS 66045, USA
| | - Frederick H Sheldon
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA.,Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Alexandre Aleixo
- Finnish Museum of Natural History, University of Helsinki, 00014, Helsinki, Finland.,Department of Zoology, Museu Paraense Emílio Goeldi, CP 399, 66040-170 Belém, PA, Brazil
| | - Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York, NY 10024, USA
| | - R Terry Chesser
- US Geological Survey, Patuxent Wildlife Research Center, Laurel, MD 20708, USA.,National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA
| | - Luís Fábio Silveira
- Museu de Zoologia da Universidade de São Paulo, 04263-000 Ipiranga, São Paulo, SP, Brazil
| | - Joel Cracraft
- Department of Ornithology, American Museum of Natural History, New York, NY 10024, USA
| | - Robb T Brumfield
- Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA.,Department of Biological Sciences, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Elizabeth P Derryberry
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN 37996, USA.,Department of Ecology and Evolutionary Biology, Tulane University, LA 70118, USA
| |
Collapse
|
47
|
New Approaches for Inferring Phylogenies in the Presence of Paralogs. Trends Genet 2021; 37:174-187. [DOI: 10.1016/j.tig.2020.08.012] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 08/13/2020] [Accepted: 08/19/2020] [Indexed: 12/18/2022]
|
48
|
Jiang X, Edwards SV, Liu L. The Multispecies Coalescent Model Outperforms Concatenation Across Diverse Phylogenomic Data Sets. Syst Biol 2021; 69:795-812. [PMID: 32011711 PMCID: PMC7302055 DOI: 10.1093/sysbio/syaa008] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 12/24/2019] [Accepted: 01/02/2020] [Indexed: 11/30/2022] Open
Abstract
A statistical framework of model comparison and model validation is essential to resolving the debates over concatenation and coalescent models in phylogenomic data analysis. A set of statistical tests are here applied and developed to evaluate and compare the adequacy of substitution, concatenation, and multispecies coalescent (MSC) models across 47 phylogenomic data sets collected across tree of life. Tests for substitution models and the concatenation assumption of topologically congruent gene trees suggest that a poor fit of substitution models, rejected by 44% of loci, and concatenation models, rejected by 38% of loci, is widespread. Logistic regression shows that the proportions of GC content and informative sites are both negatively correlated with the fit of substitution models across loci. Moreover, a substantial violation of the concatenation assumption of congruent gene trees is consistently observed across six major groups (birds, mammals, fish, insects, reptiles, and others, including other invertebrates). In contrast, among those loci adequately described by a given substitution model, the proportion of loci rejecting the MSC model is 11%, significantly lower than those rejecting the substitution and concatenation models. Although conducted on reduced data sets due to computational constraints, Bayesian model validation and comparison both strongly favor the MSC over concatenation across all data sets; the concatenation assumption of congruent gene trees rarely holds for phylogenomic data sets with more than 10 loci. Thus, for large phylogenomic data sets, model comparisons are expected to consistently and more strongly favor the coalescent model over the concatenation model. We also found that loci rejecting the MSC have little effect on species tree estimation. Our study reveals the value of model validation and comparison in phylogenomic data analysis, as well as the need for further improvements of multilocus models and computational tools for phylogenetic inference. [Bayes factor; Bayesian model validation; coalescent prior; congruent gene trees; independent prior; Metazoa; posterior predictive simulation.]
Collapse
Affiliation(s)
- Xiaodong Jiang
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA
| | - Scott V Edwards
- Department of Organismic and Evolutionary Biology and Museum of Comparative Zoology, Harvard, 26 Oxford Street, Cambridge, MA 02138, USA
| | - Liang Liu
- Department of Statistics, University of Georgia, 310 Herty Drive, Athens, GA 30602, USA.,Institute of Bioinformatics, University of Georgia, 120 Green Street, Athens, GA 30602, USA
| |
Collapse
|
49
|
Lv X, Hu J, Hu Y, Li Y, Xu D, Ryder OA, Irwin DM, Yu L. Diverse phylogenomic datasets uncover a concordant scenario of laurasiatherian interordinal relationships. Mol Phylogenet Evol 2020; 157:107065. [PMID: 33387649 DOI: 10.1016/j.ympev.2020.107065] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Revised: 12/22/2020] [Accepted: 12/24/2020] [Indexed: 10/22/2022]
Abstract
Resolving the interordinal relationships in the mammalian superorder Laurasiatheria has been among the most intractable problems in higher-level mammalian systematics, with many conflicting hypotheses having been proposed. The present study collected three different sources of genome-scale data with comprehensive taxon sampling of laurasiatherian species, including two protein-coding datasets (4,186 protein-coding genes for an amino acid dataset comprising 2,761,247 amino acid residues and a nucleotide dataset comprising 5,516,340 nucleotides from 1st and 2nd codon positions), an intronic dataset (1,210 introns comprising 1,162,723 nucleotides) and an ultraconserved elements (UCEs) dataset (1,246 UCEs comprising 1,946,472 nucleotides) from 40 species representing all six laurasiatherian orders and 7 non-laurasiatherian outgroups. Remarkably, phylogenetic trees reconstructed with the four datasets using different tree-building methods (RAxML, FastTree, ASTRAL and MP-EST) all supported the relationship (Eulipotyphla, (Chiroptera, ((Carnivora, Pholidota), (Cetartiodactyla, Perissodactyla)))). We find a resolution of interordinal relationships of Laurasiatheria among all types of markers used in the present study, and the likelihood ratio tests for tree comparisons confirmed that the present tree topology is the optimal hypothesis compared to other examined hypotheses. Jackknifing subsampling analyses demonstrate that the results of laurasiatherian tree reconstruction varied with the number of loci and ordinal representatives used, which are likely the two main contributors to phylogenetic disagreements of Laurasiatheria seen in previous studies. Our study provides significant insight into laurasiatherian evolution, and moreover, an important methodological strategy and reference for resolving phylogenies of adaptive radiation, which have been a long-standing challenge in the field of phylogenetics.
Collapse
Affiliation(s)
- Xue Lv
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Jingyang Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Yiwen Hu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Yitian Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China; School of Life Sciences, Yunnan University, Kunming, China
| | - Dongming Xu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Kunming, China
| | - Oliver A Ryder
- Institute for Conservation Research, San Diego Zoo Global, Escondido, CA, USA
| | - David M Irwin
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, Canada
| | - Li Yu
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, China.
| |
Collapse
|
50
|
Bossert S, Murray EA, Pauly A, Chernyshov K, Brady SG, Danforth BN. Gene Tree Estimation Error with Ultraconserved Elements: An Empirical Study on Pseudapis Bees. Syst Biol 2020; 70:803-821. [PMID: 33367855 DOI: 10.1093/sysbio/syaa097] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 11/18/2020] [Accepted: 12/02/2020] [Indexed: 11/12/2022] Open
Abstract
Summarizing individual gene trees to species phylogenies using two-step coalescent methods is now a standard strategy in the field of phylogenomics. However, practical implementations of summary methods suffer from gene tree estimation error, which is caused by various biological and analytical factors. Greatly understudied is the choice of gene tree inference method and downstream effects on species tree estimation for empirical data sets. To better understand the impact of this method choice on gene and species tree accuracy, we compare gene trees estimated through four widely used programs under different model-selection criteria: PhyloBayes, MrBayes, IQ-Tree, and RAxML. We study their performance in the phylogenomic framework of $>$800 ultraconserved elements from the bee subfamily Nomiinae (Halictidae). Our taxon sampling focuses on the genus Pseudapis, a distinct lineage with diverse morphological features, but contentious morphology-based taxonomic classifications and no molecular phylogenetic guidance. We approximate topological accuracy of gene trees by assessing their ability to recover two uncontroversial, monophyletic groups, and compare branch lengths of individual trees using the stemminess metric (the relative length of internal branches). We further examine different strategies of removing uninformative loci and the collapsing of weakly supported nodes into polytomies. We then summarize gene trees with ASTRAL and compare resulting species phylogenies, including comparisons to concatenation-based estimates. Gene trees obtained with the reversible jump model search in MrBayes were most concordant on average and all Bayesian methods yielded gene trees with better stemminess values. The only gene tree estimation approach whose ASTRAL summary trees consistently produced the most likely correct topology, however, was IQ-Tree with automated model designation (ModelFinder program). We discuss these findings and provide practical advice on gene tree estimation for summary methods. Lastly, we establish the first phylogeny-informed classification for Pseudapis s. l. and map the distribution of distinct morphological features of the group. [ASTRAL; Bees; concordance; gene tree estimation error; IQ-Tree; MrBayes, Nomiinae; PhyloBayes; RAxML; phylogenomics; stemminess].
Collapse
Affiliation(s)
- Silas Bossert
- Department of Entomology, Cornell University, Comstock Hall, Ithaca, NY 14853, USA.,Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA.,Department of Entomology, Washington State University, Pullman, Washington 99164, USA
| | - Elizabeth A Murray
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA.,Department of Entomology, Washington State University, Pullman, Washington 99164, USA
| | - Alain Pauly
- O.D. Taxonomy and Phylogeny, Royal Belgian Institute of Natural Sciences, Rue Vautier 29, 1000 Brussels, Belgium
| | - Kyrylo Chernyshov
- College of Arts and Sciences, Cornell University, Ithaca, NY 14853, USA
| | - Seán G Brady
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560, USA
| | - Bryan N Danforth
- Department of Entomology, Cornell University, Comstock Hall, Ithaca, NY 14853, USA
| |
Collapse
|