1
|
Nicol DA, Saldivia P, Summerfield TC, Heads M, Lord JM, Khaing EP, Larcombe MJ. Phylogenomics and morphology of Celmisiinae (Asteraceae: Astereae): Taxonomic and evolutionary implications. Mol Phylogenet Evol 2024; 195:108064. [PMID: 38508479 DOI: 10.1016/j.ympev.2024.108064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/12/2024] [Accepted: 03/17/2024] [Indexed: 03/22/2024]
Abstract
The tribe Astereae (Asteraceae) includes 36 subtribes and 252 genera, and is distributed worldwide in temperate and tropical regions. One of the subtribes, Celmisiinae Saldivia, has been recently circumscribed to include six genera and ca. 160 species, and is restricted to eastern Australia, New Zealand, and New Guinea. The species show an impressive range of growth habit, from small herbs and ericoid subshrubs to medium-sized trees. They live in a wide range of habitats and are often dominant in subalpine and alpine vegetation. Despite the well-supported circumscription of Celmisiinae, uncertainties have remained about their internal relationships and classification at genus and species levels. This study exploited recent advances in high-throughput sequencing to build a robust multi-gene phylogeny for the subtribe Celmisiinae. The target enrichment Angiosperms353 bait set and the hybpiper-nf and paragone-nf pipelines were used to retrieve, infer, and assemble orthologous loci from 75 taxa representing all the main putative clades within the subtribe. Because of the diploidised ploidy level in Celmisiinae, as well as missing data in the assemblies, uncertainty remains surrounding the inference of orthology detection. However, based on a variety of gene-family sets, coalescent and concatenation-based phylogenetic reconstructions recovered similar topologies. Paralogy and missing data in the gene-families caused some problems, but the estimated phylogenies were well-supported and well-resolved. The phylogenomic evidence supported Celmisiinae and three main clades: the Pleurophyllum clade (Pleurophyllum, Macrolearia and Damnamenia), mostly in the New Zealand Subantarctic Islands, Celmisia of mainland New Zealand and Australia, and Shawia (including 'Olearia pro parte' and Pachystegia) of New Zealand, Australia and New Guinea. The results presented here add to the accumulating support for the Angiosperms353 bait set as an efficient method for documenting plant diversity.
Collapse
Affiliation(s)
- Duncan A Nicol
- Department of Botany, University of Otago, PO Box 56, Dunedin, New Zealand.
| | - Patricio Saldivia
- Biota Ltda. Av. Miguel Claro 1224, Providencia, Santiago, Chile; Museo Regional de Aysén, Km 3 Camino a Coyhaique Alto, Coyhaique, Chile
| | - Tina C Summerfield
- Department of Botany, University of Otago, PO Box 56, Dunedin, New Zealand
| | - Michael Heads
- Buffalo Museum of Science, Buffalo, NY 14211-1293, USA
| | - Janice M Lord
- Department of Botany, University of Otago, PO Box 56, Dunedin, New Zealand
| | - Ei P Khaing
- Department of Biochemistry, University of Otago, PO Box 56, Dunedin, New Zealand
| | - Matthew J Larcombe
- Department of Botany, University of Otago, PO Box 56, Dunedin, New Zealand
| |
Collapse
|
2
|
Quatela AS, Cangren P, Jafari F, Michel T, de Boer HJ, Oxelman B. Retrieval of long DNA reads from herbarium specimens. AOB PLANTS 2023; 15:plad074. [PMID: 38130422 PMCID: PMC10735254 DOI: 10.1093/aobpla/plad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 11/06/2023] [Indexed: 12/23/2023]
Abstract
High-throughput sequencing of herbarium specimens' DNA with short-read platforms has helped explore many biological questions. Here, for the first time, we investigate the potential of using herbarium specimens as a resource for long-read DNA sequencing technologies. We use target capture of 48 low-copy nuclear loci in 12 herbarium specimens of Silene as a basis for long-read sequencing using SMRT PacBio Sequel. The samples were collected between 1932 and 2019. A simple optimization of size selection protocol enabled the retrieval of both long DNA fragments (>1 kb) and long on-target reads for nine of them. The limited sampling size does not enable statistical evaluation of the influence of specimen age to the DNA fragmentation, but our results confirm that younger samples, that is, collected after 1990, are less fragmented and have better sequencing success than specimens collected before this date. Specimens collected between 1990 and 2019 yield between 167 and 3403 on-target reads > 1 kb. They enabled recovering between 34 loci and 48 (i.e. all loci recovered). Three samples from specimens collected before 1990 did not yield on-target reads > 1 kb. The four other samples collected before this date yielded up to 144 reads and recovered up to 25 loci. Young herbarium specimens seem promising for long-read sequencing. However, older ones have partly failed. Further exploration would be necessary to statistically test and understand the potential of older material in the quest for long reads. We would encourage greatly expanding the sampling size and comparing different taxonomic groups.
Collapse
Affiliation(s)
- Anne-Sophie Quatela
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 463, 405 30, Gothenburg, Sweden
- Gothenburg Global Biodiversity Center, Gothenburg, Box 463, 405 30, Sweden
| | - Patrik Cangren
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 463, 405 30, Gothenburg, Sweden
| | - Farzaneh Jafari
- Department of Biology, Faculty of Basic Sciences, Lorestan University, P.O. BOX 6815144316, Khorramabad, Iran
- Department of Plant Science, Center of Excellence in Phylogeny of Living Organisms, School of Biology, College of Science, University of Tehran, P.O. Box 14155-6455, Tehran, Iran
| | - Thibauld Michel
- Tropical Diversity Research Department, Royal Botanic Garden of Edinburgh, 20A Inverleith Row, Edinburgh, EH3 5LRUK
| | - Hugo J de Boer
- Natural History Museum, University of Oslo, P.O. Box 1172 Blindern, 0318 Oslo, Norway
| | - Bengt Oxelman
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 463, 405 30, Gothenburg, Sweden
- Gothenburg Global Biodiversity Center, Gothenburg, Box 463, 405 30, Sweden
| |
Collapse
|
3
|
Lautenschlager U, Scheunert A. LoCoLotive: In silico mining for low-copy nuclear loci based on target capture probe sets and arbitrary reference genomes. APPLICATIONS IN PLANT SCIENCES 2023; 11:e11535. [PMID: 38106539 PMCID: PMC10719872 DOI: 10.1002/aps3.11535] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 02/06/2023] [Accepted: 04/11/2023] [Indexed: 12/19/2023]
Abstract
Premise Universal target enrichment probe kits are used to circumvent the individual identification of loci suitable for phylogenetic studies in a given taxon. Under certain circumstances, however, target capture can be inefficient and costly, and lower numbers of marker loci may be sufficient. We therefore propose a computational pipeline that enables the easy identification of a subset of promising candidate loci for a taxon of interest. Methods and Results Target sequences used for probe design are filtered based on an assembled reference genome, resulting in presumably intron-containing single-copy loci as present in the reference taxon. The applicability of the proposed approach is demonstrated based on two probe kits (universal and family-specific) in combination with several publicly available reference genomes. Conclusions Guided by commercial probe kits, LoCoLotive enables fast and cost-efficient marker mining. Its accuracy mainly depends on the quality of the reference genome and its relatedness to the taxa under study.
Collapse
Affiliation(s)
- Ulrich Lautenschlager
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgUniversitätsstraße 31RegensburgD‐93053Germany
| | - Agnes Scheunert
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgUniversitätsstraße 31RegensburgD‐93053Germany
- Genomics Core Facility of the SNSBMenzinger Straße 67D‐80638MunichGermany
| |
Collapse
|
4
|
Scheunert A, Lautenschlager U, Ott T, Oberprieler C. Nano-Strainer: A workflow for the identification of single-copy nuclear loci for plant systematic studies, using target capture kits and Oxford Nanopore long reads. Ecol Evol 2023; 13:e10190. [PMID: 37475726 PMCID: PMC10354226 DOI: 10.1002/ece3.10190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 05/18/2023] [Accepted: 06/01/2023] [Indexed: 07/22/2023] Open
Abstract
In modern plant systematics, target enrichment enables simultaneous analysis of hundreds of genes. However, when dealing with reticulate or polyploidization histories, few markers may suffice, but often are required to be single-copy, a condition that is not necessarily met with commercial capture kits. Also, large genome sizes can render target capture ineffective, so that amplicon sequencing would be preferable; however, knowledge about suitable loci is often missing. Here, we present a comprehensive workflow for the identification of putative single-copy nuclear markers in a genus of interest, by mining a small dataset from target capture using a few representative taxa. The proposed pipeline assesses sequence variability contained in the data from targeted loci and assigns reads to their respective genes, via a combined BLAST/clustering procedure. Cluster consensus sequences are then examined based on four pre-defined criteria presumably indicative for absence of paralogy. This is done by calculating four specialized indices; loci are ranked according to their performance in these indices, and top-scoring loci are considered putatively single- or low copy. The approach can be applied to any probe set. As it relies on long reads, the present contribution also provides template workflows for processing Nanopore-based target capture data. Obtained markers are further tested and then entered into amplicon sequencing. For the detection of possibly remaining paralogy in these data, which might occur in groups with rampant paralogy, we also employ the long-read assembly tool canu. In diploid representatives of the young Compositae genus Leucanthemum, characterized by high levels of polyploidy, our approach resulted in successful amplification of 13 loci. Modifications to remove traces of paralogy were made in seven of these. A species tree from the markers correctly reproduced main relationships in the genus, however, at low resolution. The presented workflow has the potential to valuably support phylogenetic research, for example in polyploid plant groups.
Collapse
Affiliation(s)
- Agnes Scheunert
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Ulrich Lautenschlager
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Tankred Ott
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| | - Christoph Oberprieler
- Evolutionary and Systematic Botany Group, Institute of Plant SciencesUniversity of RegensburgRegensburgGermany
| |
Collapse
|
5
|
Joyce EM, Appelhans MS, Buerki S, Cheek M, de Vos JM, Pirani JR, Zuntini AR, Bachelier JB, Bayly MJ, Callmander MW, Devecchi MF, Pell SK, Groppo M, Lowry PP, Mitchell J, Siniscalchi CM, Munzinger J, Orel HK, Pannell CM, Nauheimer L, Sauquet H, Weeks A, Muellner-Riehl AN, Leitch IJ, Maurin O, Forest F, Nargar K, Thiele KR, Baker WJ, Crayn DM. Phylogenomic analyses of Sapindales support new family relationships, rapid Mid-Cretaceous Hothouse diversification, and heterogeneous histories of gene duplication. FRONTIERS IN PLANT SCIENCE 2023; 14:1063174. [PMID: 36959945 PMCID: PMC10028101 DOI: 10.3389/fpls.2023.1063174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 01/31/2023] [Indexed: 06/18/2023]
Abstract
Sapindales is an angiosperm order of high economic and ecological value comprising nine families, c. 479 genera, and c. 6570 species. However, family and subfamily relationships in Sapindales remain unclear, making reconstruction of the order's spatio-temporal and morphological evolution difficult. In this study, we used Angiosperms353 target capture data to generate the most densely sampled phylogenetic trees of Sapindales to date, with 448 samples and c. 85% of genera represented. The percentage of paralogous loci and allele divergence was characterized across the phylogeny, which was time-calibrated using 29 rigorously assessed fossil calibrations. All families were supported as monophyletic. Two core family clades subdivide the order, the first comprising Kirkiaceae, Burseraceae, and Anacardiaceae, the second comprising Simaroubaceae, Meliaceae, and Rutaceae. Kirkiaceae is sister to Burseraceae and Anacardiaceae, and, contrary to current understanding, Simaroubaceae is sister to Meliaceae and Rutaceae. Sapindaceae is placed with Nitrariaceae and Biebersteiniaceae as sister to the core Sapindales families, but the relationships between these families remain unclear, likely due to their rapid and ancient diversification. Sapindales families emerged in rapid succession, coincident with the climatic change of the Mid-Cretaceous Hothouse event. Subfamily and tribal relationships within the major families need revision, particularly in Sapindaceae, Rutaceae and Meliaceae. Much of the difficulty in reconstructing relationships at this level may be caused by the prevalence of paralogous loci, particularly in Meliaceae and Rutaceae, that are likely indicative of ancient gene duplication events such as hybridization and polyploidization playing a role in the evolutionary history of these families. This study provides key insights into factors that may affect phylogenetic reconstructions in Sapindales across multiple scales, and provides a state-of-the-art phylogenetic framework for further research.
Collapse
Affiliation(s)
- Elizabeth M. Joyce
- Systematics, Biodiversity and Evolution of Plants, Ludwig-Maximilians-Universität München, Munich, Germany
- College of Science and Engineering, James Cook University, Cairns, QLD, Australia
- Australian Tropical Herbarium, James Cook University, Cairns, QLD, Australia
| | - Marc S. Appelhans
- Department of Systematics, Biodiversity and Evolution of Plants, University of Göttingen, Goettingen, Germany
- Department of Botany, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States
| | - Sven Buerki
- Department of Biological Sciences, Boise State University, Boise, ID, United States
| | - Martin Cheek
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
| | - Jurriaan M. de Vos
- Department of Environmental Sciences, University Basel, Basel, Switzerland
| | - José R. Pirani
- Departamento de Botaênica, Universidade de Saão Paulo, Herbário SPF, Saão Paulo, Brazil
| | | | | | - Michael J. Bayly
- School of BioSciences, The University of Melbourne, Parkville, VIC, Australia
| | | | - Marcelo F. Devecchi
- Departamento de Botaênica, Universidade de Saão Paulo, Herbário SPF, Saão Paulo, Brazil
| | - Susan K. Pell
- United States Botanic Garden, Washington, DC, United States
| | - Milton Groppo
- Departamento de Botaênica, Universidade de Saão Paulo, Herbário SPF, Saão Paulo, Brazil
| | - Porter P. Lowry
- Missouri Botanical Garden, St. Louis, MO, United States
- Institut de Systématique, Évolution, et Biodiversité (ISYEB), Muséum National d’Histoire Naturelle, Centre National de la Recherche Scientifique, Sorbonne Université, École Pratique des Hautes Études, Université des Antilles, Paris, France
| | - John Mitchell
- New York Botanical Garden, New York, NY, United States
| | - Carolina M. Siniscalchi
- Department of Biological Sciences, Harned Hall, Mississippi State University, Mississippi State, MS, United States
| | - Jérôme Munzinger
- AMAP, Université Montpellier, Institut de Recherche pour le Développement (IRD), Centre de coopération internationale en recherche agronomique pour le développement (CIRAD), Centre National de la Recherche Scientifique (CNRS), Institut national de la recherche agronomique (INRAE), Montpellier, France
| | - Harvey K. Orel
- School of BioSciences, The University of Melbourne, Parkville, VIC, Australia
| | - Caroline M. Pannell
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
- Department of Biology, Oxford University, Oxford, United Kingdom
- Marine Laboratory, Queen’s University Belfast, Portaferry, United Kingdom
| | - Lars Nauheimer
- College of Science and Engineering, James Cook University, Cairns, QLD, Australia
- Australian Tropical Herbarium, James Cook University, Cairns, QLD, Australia
| | - Hervé Sauquet
- National Herbarium of New South Wales (NSW), Royal Botanic Gardens and Domain Trust, Sydney, NSW, Australia
| | - Andrea Weeks
- Department of Biology, George Mason University, Fairfax, VA, United States
| | - Alexandra N. Muellner-Riehl
- Department of Molecular Evolution and Plant Systematics & Herbarium, Faculty of Life Sciences, University of Leipzig, Leipzig, Germany
- German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany
| | | | | | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, United Kingdom
| | - Katharina Nargar
- Australian Tropical Herbarium, James Cook University, Cairns, QLD, Australia
- National Research Collections Australia, Commonwealth Industrial and Scientific Research Organization (CSIRO), Canberra, ACT, Australia
| | - Kevin R. Thiele
- School of Biological Sciences, University of Western Australia, Perth, WA, Australia
| | | | - Darren M. Crayn
- College of Science and Engineering, James Cook University, Cairns, QLD, Australia
- Australian Tropical Herbarium, James Cook University, Cairns, QLD, Australia
| |
Collapse
|
6
|
Next-generation sequencing data show rapid radiation and several long-distance dispersal events in early Costaceae. Mol Phylogenet Evol 2023; 179:107664. [PMID: 36403710 DOI: 10.1016/j.ympev.2022.107664] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 02/12/2022] [Accepted: 11/02/2022] [Indexed: 11/18/2022]
Abstract
The monocot family Costaceae Nakai consists of seven genera but their mutual relationships have not been satisfactorily resolved in previous studies employing classical molecular markers. Phylogenomic analyses of 365 nuclear genes and nearly-complete plastome data provide almost fully resolved insights into their diversification. Paracostus is identified as sister to all other taxa, followed by several very short branches leading to discrete lineages, suggesting an ancient rapid radiation of these early lineages and leaving the exact relationships among them unresolved. Relationships among Chamaecostus, Dimerocostus and Monocostus confirmed earlier findings that these genera form a monophyletic group. The Afro-American Costus is also monophyletic. By contrast, Tapeinochilos appeared as a well-supported crown lineage of Cheilocostus rendering it paraphyletic. As these two genera differ morphologically from one another owing to a shift from insect- to bird-pollination, we propose to keep both names. The divergence time within Costaceae was estimated using penalized likelihood utilizing two fossils within Zingiberales, †Spirematospermum chandlerae and †Ensete oregonense, indicated a relatively recent diversification of Costaceae, between 18 and 9 Mya. Based on these data, the current pantropical distribution of the family is hypothesized to be the result of several long-distance intercontinental dispersal events, which do not correlate with global geoclimatic changes.
Collapse
|
7
|
de Mestier A, Lücking R, Gutierrez J, Brokamp G, Celis M, Borsch T. Nested singletons in molecular trees: Utility of adding morphological and geographical data from digitized herbarium specimens to test taxon concepts at species level in the case of Casearia (Salicaceae). Ecol Evol 2023; 13:e9736. [PMID: 36694555 PMCID: PMC9843533 DOI: 10.1002/ece3.9736] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 12/21/2022] [Accepted: 12/23/2022] [Indexed: 01/19/2023] Open
Abstract
Using the genus Casearia, we assessed the status of nested singletons: individual specimens corresponding to accepted species but in molecular trees appearing nested within clades of closely related species. Normally, such cases would be left undecided, while on the other hand, timely taxonomic decisions are required. We argue that morphological, chorological, and ecological data can be informative to illuminate patterns of speciation. Their use can provide a first step in testing taxon concepts at species level. We focused on five cases of nested singletons in trees of the genus Casearia. We employed PCA and cluster analysis to assess phenotypic differentiation. Using geocoordinates, we calculated niche space differentiation based on 19 bioclim variables, by means of PCA and niche equivalency and similarity tests and generated dot maps. We found that the singletons were morphologically distinctive in two of the five cases (Casearia selloana and C. manausensis), relatively distinctive in two other cases (C. zizyphoides and C. mariquitensis), and partially overlapping in the last case (C. grandiflora). For two cases (C. mariquitensis and C. selloana), ecological niche space was broadly overlapping, in two cases it was found broadly nested (C. grandiflora and C. zizyphoides), and in one case narrowly nested (C. manausensis), but in no case niche differentiation was observed. Niche overlap, similarity and equivalency showed corresponding patterns. Given these data, one would interpret C. selloana and C. manausensis as presumably well-distinguished taxa, their narrow distribution ranges suggesting recently emerging lineages. The other three cases are not clearcut. Morphological data would suggest particularly C. grandiflora conspecific with C. arborea, but differences in the distribution are intriguing. Our approach would reject the notion of potential synonymy based on nested phylogenetic placement for at least two of the five cases. The other case also shows no complete lack of differentiation which would support synonymy.
Collapse
Affiliation(s)
- Astrid de Mestier
- Botanischer GartenFreie Universität BerlinBerlinGermany,Institut für Biologie – Systematische Botanik und PflanzengeographieFreie Universität BerlinBerlinGermany
| | | | - Jorge Gutierrez
- Jardín Botánico Nacional CalabazarUniversidad de La HabanaBoyerosCuba
| | - Grischa Brokamp
- Botanischer GartenFreie Universität BerlinBerlinGermany,Fachbereich Wald und UmweltHochschule für Nachhaltige Entwicklung EberswaldeEberswaldeGermany
| | - Marcela Celis
- Departamento de Química y BiologíaUniversidad del NorteBarranquillaColombia
| | - Thomas Borsch
- Botanischer GartenFreie Universität BerlinBerlinGermany,Institut für Biologie – Systematische Botanik und PflanzengeographieFreie Universität BerlinBerlinGermany
| |
Collapse
|
8
|
Thureborn O, Razafimandimbison SG, Wikström N, Rydin C. Target capture data resolve recalcitrant relationships in the coffee family (Rubioideae, Rubiaceae). FRONTIERS IN PLANT SCIENCE 2022; 13:967456. [PMID: 36160958 PMCID: PMC9493367 DOI: 10.3389/fpls.2022.967456] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Accepted: 08/03/2022] [Indexed: 06/16/2023]
Abstract
Subfamily Rubioideae is the largest of the main lineages in the coffee family (Rubiaceae), with over 8,000 species and 29 tribes. Phylogenetic relationships among tribes and other major clades within this group of plants are still only partly resolved despite considerable efforts. While previous studies have mainly utilized data from the organellar genomes and nuclear ribosomal DNA, we here use a large number of low-copy nuclear genes obtained via a target capture approach to infer phylogenetic relationships within Rubioideae. We included 101 Rubioideae species representing all but two (the monogeneric tribes Foonchewieae and Aitchinsonieae) of the currently recognized tribes, and all but one non-monogeneric tribe were represented by more than one genus. Using data from the 353 genes targeted with the universal Angiosperms353 probe set we investigated the impact of data type, analytical approach, and potential paralogs on phylogenetic reconstruction. We inferred a robust phylogenetic hypothesis of Rubioideae with the vast majority (or all) nodes being highly supported across all analyses and datasets and few incongruences between the inferred topologies. The results were similar to those of previous studies but novel relationships were also identified. We found that supercontigs [coding sequence (CDS) + non-coding sequence] clearly outperformed CDS data in levels of support and gene tree congruence. The full datasets (353 genes) outperformed the datasets with potentially paralogous genes removed (186 genes) in levels of support but increased gene tree incongruence slightly. The pattern of gene tree conflict at short internal branches were often consistent with high levels of incomplete lineage sorting (ILS) due to rapid speciation in the group. While concatenation- and coalescence-based trees mainly agreed, the observed phylogenetic discordance between the two approaches may be best explained by their differences in accounting for ILS. The use of target capture data greatly improved our confidence and understanding of the Rubioideae phylogeny, highlighted by the increased support for previously uncertain relationships and the increased possibility to explore sources of underlying phylogenetic discordance.
Collapse
Affiliation(s)
- Olle Thureborn
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
| | | | - Niklas Wikström
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
- Bergius Foundation, Royal Swedish Academy of Sciences, Stockholm, Sweden
| | - Catarina Rydin
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
- Bergius Foundation, Royal Swedish Academy of Sciences, Stockholm, Sweden
| |
Collapse
|
9
|
Ufimov R, Gorospe JM, Fér T, Kandziora M, Salomon L, van Loo M, Schmickl R. Utilizing paralogs for phylogenetic reconstruction has the potential to increase species tree support and reduce gene tree discordance in target enrichment data. Mol Ecol Resour 2022; 22:3018-3034. [PMID: 35796729 DOI: 10.1111/1755-0998.13684] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Revised: 05/28/2022] [Accepted: 06/22/2022] [Indexed: 11/30/2022]
Abstract
The analysis of target enrichment data in phylogenetics lacks optimization toward using paralogs for phylogenetic reconstruction. We developed a novel approach of detecting paralogs and utilizing them for phylogenetic tree inference, by retrieving both ortho- and paralogous copies and creating orthologous alignments, from which the gene trees are built. We implemented this approach in ParalogWizard and demonstrate its performance in plant groups that underwent a whole genome duplication relatively recently: the subtribe Malinae (family Rosaceae), using Angiosperms353 as well as Malinae481 probes, the genus Oritrophium (family Asteraceae), using Compositae1061 probes, and the genus Amomum (family Zingiberaceae), using Zingiberaceae1180 probes. Discriminating between orthologs and paralogs reduced gene tree discordance and increased the species tree support in the case of the Malinae, but not for Oritrophium and Amomum. This may relate to the difference in the proportion of paralogous loci between the datasets, which was highest for the Malinae. Overall, retrieving paralogs for phylogenetic reconstruction following ParalogWizard has the potential to increase the species tree support and reduce gene tree discordance in target enrichment data, particularly if the proportion of paralogous loci is high.
Collapse
Affiliation(s)
- Roman Ufimov
- Department of Forest Growth, Silviculture and Genetics, Austrian Research Centre for Forests, Seckendorff-Gudent-Weg 8, 1130, Vienna, Austria.,Komarov Botanical Institute, Russian Academy of Sciences, ul. Prof. Popova 2, 197376, St. Petersburg, Russian Federation
| | - Juan Manuel Gorospe
- Institute of Botany, The Czech Academy of Sciences, Zámek 1, 252 43, Průhonice, Czech Republic.,Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
| | - Tomáš Fér
- Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
| | - Martha Kandziora
- Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
| | - Luciana Salomon
- Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
| | - Marcela van Loo
- Department of Forest Growth, Silviculture and Genetics, Austrian Research Centre for Forests, Seckendorff-Gudent-Weg 8, 1130, Vienna, Austria
| | - Roswitha Schmickl
- Institute of Botany, The Czech Academy of Sciences, Zámek 1, 252 43, Průhonice, Czech Republic.,Department of Botany, Faculty of Science, Charles University, Benátská 2, 128 01, Prague, Czech Republic
| |
Collapse
|
10
|
Yu XQ, Jiang YZ, Folk RA, Zhao JL, Fu CN, Fang L, Peng H, Yang JB, Yang SX. Species discrimination in Schima (Theaceae): Next-generation super-barcodes meet evolutionary complexity. Mol Ecol Resour 2022; 22:3161-3175. [PMID: 35789203 DOI: 10.1111/1755-0998.13683] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Revised: 06/20/2022] [Accepted: 06/27/2022] [Indexed: 11/26/2022]
Abstract
Plastid genome and nrDNA arrays, proposed recently as "super barcodes", might provide additional discriminatory power and overcome the limitations of traditional barcoding loci, yet super barcodes need to be tested for their effectiveness in more plant groups. Morphological homoplasy among Schima species makes the genus a model for testing the efficacy of super barcodes. In this study, we generated multiple datasets comprising standard DNA barcodes (matK, rbcL, trnH-psbA, nrITS) and super-barcodes (plastid genome, nrDNA arrays) across 58 individuals from 12 out of 13 species of Schima from China. No samples were correctly assigned to species using standard DNA barcodes and nrDNA arrays, while only 27.27% of species with multiple accessions were distinguished using the plastid genome and its partitioned datasets-the lowest estimated rate of super barcode success in the literature so far. For Schima and other taxa with similarly recently divergence and low levels of genetic variation, incomplete lineage sorting, hybridization, or taxonomic oversplitting are all possible causes of the failure. Taken together, our study suggests that by no means are super barcodes immune to the challenges imposed by evolutionary complexity. We therefore call for developing multi-locus nuclear markers for species discrimination in plant groups.
Collapse
Affiliation(s)
- Xiang-Qin Yu
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences
| | - Yin-Zi Jiang
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences.,College of Life Sciences, University of Chinese Academy of Sciences, 100049, Beijing, China
| | - Ryan A Folk
- Department of Biological Sciences, Mississippi State University, 39762, MS, United States
| | - Jian-Li Zhao
- Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, Laboratory of Ecology and Evolutionary Biology, School of Ecology and Environmental Sciences, Yunnan University, Kunming, 650500, China, China
| | - Chao-Nan Fu
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences
| | - Liang Fang
- College of Life Sciences, Jiujiang University, 332000, Jiujiang, Jiangxi, China
| | - Hua Peng
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences
| | - Jun-Bo Yang
- Germplasm Bank of Wild Species, Kunming Institute of Botany, Chinese Academy of Sciences, 650201, Kunming, Yunnan, China
| | - Shi-Xiong Yang
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences
| |
Collapse
|
11
|
Hatami E, Jones KE, Kilian N. New Insights Into the Relationships Within Subtribe Scorzonerinae (Cichorieae, Asteraceae) Using Hybrid Capture Phylogenomics (Hyb-Seq). FRONTIERS IN PLANT SCIENCE 2022; 13:851716. [PMID: 35873957 PMCID: PMC9298463 DOI: 10.3389/fpls.2022.851716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 05/18/2022] [Indexed: 06/15/2023]
Abstract
Subtribe Scorzonerinae (Cichorieae, Asteraceae) contains 12 main lineages and approximately 300 species. Relationships within the subtribe, either at inter- or intrageneric levels, were largely unresolved in phylogenetic studies to date, due to the lack of phylogenetic signal provided by traditional Sanger sequencing markers. In this study, we employed a phylogenomics approach (Hyb-Seq) that targets 1,061 nuclear-conserved ortholog loci designed for Asteraceae and obtained chloroplast coding regions as a by-product of off-target reads. Our objectives were to evaluate the potential of the Hyb-Seq approach in resolving the phylogenetic relationships across the subtribe at deep and shallow nodes, investigate the relationships of major lineages at inter- and intrageneric levels, and examine the impact of the different datasets and approaches on the robustness of phylogenetic inferences. We analyzed three nuclear datasets: exon only, excluding all potentially paralogous loci; exon only, including loci that were only potentially paralogous in 1-3 samples; exon plus intron regions (supercontigs); and the plastome CDS region. Phylogenetic relationships were reconstructed using both multispecies coalescent and concatenation (Maximum Likelihood and Bayesian analyses) approaches. Overall, our phylogenetic reconstructions recovered the same monophyletic major lineages found in previous studies and were successful in fully resolving the backbone phylogeny of the subtribe, while the internal resolution of the lineages was comparatively poor. The backbone topologies were largely congruent among all inferences, but some incongruent relationships were recovered between nuclear and plastome datasets, which are discussed and assumed to represent cases of cytonuclear discordance. Considering the newly resolved phylogenies, a new infrageneric classification of Scorzonera in its revised circumscription is proposed.
Collapse
Affiliation(s)
- Elham Hatami
- Department of Biology, Faculty of Science, Shahid Bahonar University of Kerman, Kerman, Iran
| | - Katy E. Jones
- Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Berlin, Germany
| | - Norbert Kilian
- Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Berlin, Germany
| |
Collapse
|
12
|
Gostel MR, Sancho G, Roque N, Donato M, Funk VA. Phylogenomic loci define the generic boundaries of Gochnatieae and improve resolution at the species level in Moquiniastrum (Compositae). Mol Phylogenet Evol 2022; 175:107558. [PMID: 35772621 DOI: 10.1016/j.ympev.2022.107558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 05/26/2022] [Accepted: 06/10/2022] [Indexed: 11/16/2022]
Abstract
Understanding the evolution of the tribe Gochnatieae (Compositae) has been the subject of considerable effort in the past decade. This is due to the key position of this tribe in the phylogeny of the sunflower family and the corresponding implications for biogeographic and morphological evolution of Compositae. Previous studies have confirmed the monophyly of this tribe as well as most of the genera that belong to it. However, phylogenetic resolution of Gochnatieae at both the genus- and species-level has remained poor. A subset of new phylogenomic loci used in this study has proven effective and has improved phylogenetic resolution in this group. The results of this work demonstrate Gochnatieae is a well-supported clade comprised of nine genera (Anastraphia, Cnicothamnus, Cyclolepis, Gochnatia, Moquiniastrum, Nahuatlea, Pentaphorus, Richterago, Tehuasca). One recently described genus, Vickia, was not included in this study; but its placement in Gochnatieae as a tenth genus in the tribe is well-justified. The monospecific Cyclolepis, which had been circumscribed within the tribe since its inception but was subsequently removed and designated as incertae sedis since 2014, is also shown to belong to Gochnatieae. We confirmed the monophyletic Moquiniastrum with two well-supported subclades. Ancestral area reconstruction analyses show that Gochnatieae originated in Eastern South America about 53 my. Apparently, except for Cyclolepis and Richterago, the ancestors of the other genera of Gochnatieae originated about 44 my from an area that now corresponds to the central Andes. The presence of the genera in the Chaco phytogeographic province, central Chile, and Mexico-United States-Caribbean is a result of dispersal from the central Andes. The ancestral distribution of Moquiniastrum corresponds to a large area comprising Eastern South America and the current central Andes, about 32 my. Ancestral character state reconstruction that included four characters indicates several states associated with complex plant reproductive biology such as gynodioecy, gynomonoecy, and polygamodioecy are derived in Gochnatieae as are heterogamous capitula (in Moquiniastrum and Richterago), dimorphic and subdimorphic corollas (in Cnicothamnus, Moquiniastrum, and Richterago), and the presence of marginal female corollas (in Moquiniastrum and Richterago). Within Moquiniastrum, two subclades (Densicephalum and Polymorphum) exhibit divergent patterns of trait evolution associated with these reproductive characters which suggests this genus can serve as a model to understand the sexual system evolution in plants.
Collapse
Affiliation(s)
- Morgan R Gostel
- Botanical Research Institute of Texas, 1700 University Drive, Fort Worth, TX 76107, USA; United States National Herbarium, Department of Botany, National Museum of Natural History, Smithsonian Institution, MRC 166, Washington, DC, 20560, USA.
| | - Gisela Sancho
- División Plantas Vasculares, Museo de La Plata, FCNyM, Universidad Nacional de La Plata, Paseo del Bosque s/n, 1900 La Plata, Buenos Aires, Argentina
| | - Nádia Roque
- Universidade Federal da Bahia, Instituto de Biologia, RuaBarão de Jeremoabo, s/n°, Campus Universitário de Ondina, 40171-970 Salvador, Bahia, Brazil
| | - Mariano Donato
- ILPLA, Instituto de Limnología Dr. Raúl A. Ringuelet, FCNYM, UNLP and CONICET, 122 and 60, La Plata 1900, Buenos Aires, Argentina
| | - Vicki A Funk
- United States National Herbarium, Department of Botany, National Museum of Natural History, Smithsonian Institution, MRC 166, Washington, DC, 20560, USA
| |
Collapse
|
13
|
Morales-Briones DF, Gehrke B, Huang CH, Liston A, Ma H, Marx HE, Tank DC, Yang Y. Analysis of Paralogs in Target Enrichment Data Pinpoints Multiple Ancient Polyploidy Events in Alchemilla s.l. (Rosaceae). Syst Biol 2021; 71:190-207. [PMID: 33978764 PMCID: PMC8677558 DOI: 10.1093/sysbio/syab032] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 04/28/2021] [Accepted: 05/03/2021] [Indexed: 12/16/2022] Open
Abstract
Target enrichment is becoming increasingly popular for phylogenomic studies. Although baits for enrichment are typically designed to target single-copy genes, paralogs are often recovered with increased sequencing depth, sometimes from a significant proportion of loci, especially in groups experiencing whole-genome duplication (WGD) events. Common approaches for processing paralogs in target enrichment data sets include random selection, manual pruning, and mainly, the removal of entire genes that show any evidence of paralogy. These approaches are prone to errors in orthology inference or removing large numbers of genes. By removing entire genes, valuable information that could be used to detect and place WGD events is discarded. Here, we used an automated approach for orthology inference in a target enrichment data set of 68 species of Alchemilla s.l. (Rosaceae), a widely distributed clade of plants primarily from temperate climate regions. Previous molecular phylogenetic studies and chromosome numbers both suggested ancient WGDs in the group. However, both the phylogenetic location and putative parental lineages of these WGD events remain unknown. By taking paralogs into consideration and inferring orthologs from target enrichment data, we identified four nodes in the backbone of Alchemilla s.l. with an elevated proportion of gene duplication. Furthermore, using a gene-tree reconciliation approach, we established the autopolyploid origin of the entire Alchemilla s.l. and the nested allopolyploid origin of four major clades within the group. Here, we showed the utility of automated tree-based orthology inference methods, previously designed for genomic or transcriptomic data sets, to study complex scenarios of polyploidy and reticulate evolution from target enrichment data sets.[Alchemilla; allopolyploidy; autopolyploidy; gene tree discordance; orthology inference; paralogs; Rosaceae; target enrichment; whole genome duplication.].
Collapse
Affiliation(s)
- Diego F Morales-Briones
- Department of Plant and Microbial Biology, University of Minnesota-Twin Cities, 1445 Gortner Avenue, St. Paul, MN 55108, USA
- Department of Biological Sciences and Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive MS 3051, Moscow, ID 83844, USA
| | - Berit Gehrke
- University Gardens, University Museum, University of Bergen, Mildeveien 240, 5259 Hjellestad, Norway
| | - Chien-Hsun Huang
- State Key Laboratory of Genetic Engineering and Collaborative Innovation Center of Genetics and Development, Ministry of Education Key Laboratory of Biodiversity and Ecological Engineering, Institute of Plant Biology, Center of Evolutionary Biology, School of Life Sciences, Fudan University, Shanghai 200433, China
| | - Aaron Liston
- Department of Botany and Plant Pathology, Oregon State University, 2082 Cordley Hall, Corvallis, OR 97331, USA
| | - Hong Ma
- Department of Biology, the Huck Institute of the Life Sciences, the Pennsylvania State University, 510D Mueller Laboratory, University Park, PA 16802 USA
| | - Hannah E Marx
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109-1048, USA
- Museum of Southwestern Biology and Department of Biology, University of New Mexico, Albuquerque, NM 87131, USA
| | - David C Tank
- Department of Biological Sciences and Institute for Bioinformatics and Evolutionary Studies, University of Idaho, 875 Perimeter Drive MS 3051, Moscow, ID 83844, USA
| | - Ya Yang
- Department of Plant and Microbial Biology, University of Minnesota-Twin Cities, 1445 Gortner Avenue, St. Paul, MN 55108, USA
| |
Collapse
|
14
|
Lara-Cabrera SI, Perez-Garcia MDLL, Maya-Lastra CA, Montero-Castro JC, Godden GT, Cibrian-Jaramillo A, Fisher AE, Porter JM. Phylogenomics of Salvia L. subgenus Calosphace (Lamiaceae). FRONTIERS IN PLANT SCIENCE 2021; 12:725900. [PMID: 34721456 PMCID: PMC8554000 DOI: 10.3389/fpls.2021.725900] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Accepted: 09/07/2021] [Indexed: 05/13/2023]
Abstract
The evolutionary relationships of Salvia have been difficult to estimate. In this study, we used the Next Generation Sequencing method Hyb-Seq to evaluate relationships among 90 Lamiaceae samples, including representatives of Mentheae, Ocimeae, Salvia subgenera Audibertia, Leonia, Salvia, and 69 species of subgenus Calosphace, representing 32 of Epling's sections. A bait set was designed in MarkerMiner using available transcriptome data to enrich 119 variable nuclear loci. Nuclear and chloroplast loci were assembled with hybphylomaker (HPM), followed by coalescent approach analyses for nuclear data (ASTRAL, BEAST) and a concatenated Maximum Likelihood analysis of chloroplast loci. The HPM assembly had an average of 1,314,368 mapped reads for the sample and 527 putative exons. Phylogenetic inferences resolved strongly supported relationships for the deep-level nodes, agreeing with previous hypotheses which assumed that subgenus Audibertia is sister to subgenus Calosphace. Within subgenus Calosphace, we recovered eight monophyletic sections sensu Epling, Cardinalis, Hastatae, Incarnatae, and Uricae in all the analyses (nDNA and cpDNA), Biflorae, Lavanduloideae, and Sigmoideae in nuclear analyses (ASTRAL, BEAST) and Curtiflorae in ASTRAL trees. Network analysis supports deep node relationships, some of the main clades, and recovers reticulation within the core Calosphace. The chloroplast phylogeny resolved deep nodes and four monophyletic Calosphace sections. Placement of S. axillaris is distinct in nuclear evidence and chloroplast, as sister to the rest of the S. subg. Calosphace in chloroplast and a clade with "Hastatae clade" sister to the rest of the subgenus in nuclear evidence. We also tested the monophyly of S. hispanica, S. polystachia, S. purpurea, and S. tiliifolia, including two samples of each, and found that S. hispanica and S. purpurea are monophyletic. Our baits can be used in future studies of Lamiaceae phylogeny to estimate relationships between genera and among species. In this study, we presented a Hyb-Seq phylogeny for complex, recently diverged Salvia, which could be implemented in other Lamiaceae.
Collapse
Affiliation(s)
- Sabina Irene Lara-Cabrera
- Laboratorio de Sistemática Molecular de Plantas, Facultad de Biología, Universidad Michoacana de San Nicolás de Hidalgo, Morelia, Mexico
| | - Maria de la Luz Perez-Garcia
- Departamento de Botánica y Zoología, Centro Universitario de Ciencias Biológicas y Agropecuarias, Universidad de Guadalajara, Guadalajara, Mexico
| | - Carlos Alonso Maya-Lastra
- Department of Ecology, Evolution, and Environmental Biology, Columbia University, New York, NY, United States
| | - Juan Carlos Montero-Castro
- Laboratorio de Sistemática Molecular de Plantas, Facultad de Biología, Universidad Michoacana de San Nicolás de Hidalgo, Morelia, Mexico
| | - Grant T. Godden
- Florida Museum of Natural History, University of Florida, Gainesville, FL, United States
| | - Angelica Cibrian-Jaramillo
- Laboratorio Nacional de Genómica para la Biodiversidad, Unidad de Genómica Avanzada del Centro de Investigación y de Estudios Avanzados del instituto Politécnico Nacional, Irapuato, Mexico
| | - Amanda E. Fisher
- Department of Biological Sciences, California State University, Long Beach, CA, United States
| | | |
Collapse
|
15
|
Thomas AE, Igea J, Meudt HM, Albach DC, Lee WG, Tanentzap AJ. Using target sequence capture to improve the phylogenetic resolution of a rapid radiation in New Zealand Veronica. AMERICAN JOURNAL OF BOTANY 2021; 108:1289-1306. [PMID: 34173225 DOI: 10.1002/ajb2.1678] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 03/10/2021] [Indexed: 05/08/2023]
Abstract
PREMISE Recent, rapid radiations present a challenge for phylogenetic reconstruction. Fast successive speciation events typically lead to low sequence divergence and poorly resolved relationships with standard phylogenetic markers. Target sequence capture of many independent nuclear loci has the potential to improve phylogenetic resolution for rapid radiations. METHODS Here we applied target sequence capture with 353 protein-coding genes (Angiosperms353 bait kit) to Veronica sect. Hebe (common name hebe) to determine its utility for improving the phylogenetic resolution of rapid radiations. Veronica section Hebe originated 5-10 million years ago in New Zealand, forming a monophyletic radiation of ca 130 extant species. RESULTS We obtained approximately 150 kbp of 353 protein-coding exons and an additional 200 kbp of flanking noncoding sequences for each of 77 hebe and two outgroup species. When comparing coding, noncoding, and combined data sets, we found that the latter provided the best overall phylogenetic resolution. While some deep nodes in the radiation remained unresolved, our phylogeny provided broad and often improved support for subclades identified by both morphology and standard markers in previous studies. Gene-tree discordance was nonetheless widespread, indicating that additional methods are needed to disentangle fully the history of the radiation. CONCLUSIONS Phylogenomic target capture data sets both increase phylogenetic signal and deliver new insights into the complex evolutionary history of rapid radiations as compared with traditional markers. Improving methods to resolve remaining discordance among loci from target sequence capture is now important to facilitate the further study of rapid radiations.
Collapse
Affiliation(s)
- Anne E Thomas
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| | - Javier Igea
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| | - Heidi M Meudt
- Museum of New Zealand Te Papa Tongarewa, Wellington, New Zealand
| | - Dirk C Albach
- Carl von Ossietzky-University, Oldenburg, D-26111, Germany
| | - William G Lee
- Manaaki Whenua - Landcare Research Otago, Dunedin, New Zealand
| | - Andrew J Tanentzap
- Ecosystems and Global Change Group, Department of Plant Sciences, University of Cambridge, Cambridge, UK
| |
Collapse
|
16
|
Siniscalchi CM, Hidalgo O, Palazzesi L, Pellicer J, Pokorny L, Maurin O, Leitch IJ, Forest F, Baker WJ, Mandel JR. Lineage-specific vs. universal: A comparison of the Compositae1061 and Angiosperms353 enrichment panels in the sunflower family. APPLICATIONS IN PLANT SCIENCES 2021; 9:APS311422. [PMID: 34336403 PMCID: PMC8312747 DOI: 10.1002/aps3.11422] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 03/15/2021] [Indexed: 05/10/2023]
Abstract
PREMISE Phylogenetic studies in the Compositae are challenging due to the sheer size of the family and the challenges they pose for molecular tools, ranging from the genomic impact of polyploid events to their very conserved plastid genomes. The search for better molecular tools for phylogenetic studies led to the development of the family-specific Compositae1061 probe set, as well as the universal Angiosperms353 probe set designed for all flowering plants. In this study, we evaluate the extent to which data generated using the family-specific kit and those obtained with the universal kit can be merged for downstream analyses. METHODS We used comparative methods to verify the presence of shared loci between probe sets. Using two sets of eight samples sequenced with Compositae1061 and Angiosperms353, we ran phylogenetic analyses with and without loci flagged as paralogs, a gene tree discordance analysis, and a complementary phylogenetic analysis mixing samples from both sample sets. RESULTS Our results show that the Compositae1061 kit provides an average of 721 loci, with 9-46% of them presenting paralogs, while the Angiosperms353 set yields an average of 287 loci, which are less affected by paralogy. Analyses mixing samples from both sets showed that the presence of 30 shared loci in the probe sets allows the combination of data generated in different ways. DISCUSSION Combining data generated using different probe sets opens up the possibility of collaborative efforts and shared data within the synantherological community.
Collapse
Affiliation(s)
- Carolina M. Siniscalchi
- Department of Biological SciencesMississippi State UniversityMississippi StateMississippi39762USA
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
| | - Oriane Hidalgo
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
- Institut Botànic de Barcelona (IBB, CSIC‐Ajuntament de Barcelona)Passeig del Migdia s.n.BarcelonaCatalonia08038Spain
| | - Luis Palazzesi
- División PaleobotánicaMuseo Argentino de Ciencias NaturalesCONICETBuenos AiresC1405DJRArgentina
| | - Jaume Pellicer
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
- Institut Botànic de Barcelona (IBB, CSIC‐Ajuntament de Barcelona)Passeig del Migdia s.n.BarcelonaCatalonia08038Spain
| | - Lisa Pokorny
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
- Present address:
Centre for Plant Biotechnology and Genomics (CBGP) UPM‐INIAPozuelo de Alarcón (Madrid)28223Spain
| | - Olivier Maurin
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Ilia J. Leitch
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | - Felix Forest
- Royal Botanic Gardens, KewRichmondSurreyTW9 3AEUnited Kingdom
| | | | - Jennifer R. Mandel
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| |
Collapse
|
17
|
Peakall R, Wong DCJ, Phillips RD, Ruibal M, Eyles R, Rodriguez-Delgado C, Linde CC. A multitiered sequence capture strategy spanning broad evolutionary scales: Application for phylogenetic and phylogeographic studies of orchids. Mol Ecol Resour 2021; 21:1118-1140. [PMID: 33453072 DOI: 10.1111/1755-0998.13327] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Revised: 12/22/2020] [Accepted: 01/05/2021] [Indexed: 11/30/2022]
Abstract
With over 25,000 species, the drivers of diversity in the Orchidaceae remain to be fully understood. Here, we outline a multitiered sequence capture strategy aimed at capturing hundreds of loci to enable phylogenetic resolution from subtribe to subspecific levels in orchids of the tribe Diurideae. For the probe design, we mined subsets of 18 transcriptomes, to give five target sequence sets aimed at the tribe (Sets 1 & 2), subtribe (Set 3), and within subtribe levels (Sets 4 & 5). Analysis included alternative de novo and reference-guided assembly, before target sequence extraction, annotation and alignment, and application of a homology-aware k-mer block phylogenomic approach, prior to maximum likelihood and coalescence-based phylogenetic inference. Our evaluation considered 87 taxa in two test data sets: 67 samples spanning the tribe, and 72 samples involving 24 closely related Caladenia species. The tiered design achieved high target loci recovery (>89%), with the median number of recovered loci in Sets 1-5 as follows: 212, 219, 816, 1024, and 1009, respectively. Interestingly, as a first test of the homologous k-mer approach for targeted sequence capture data, our study revealed its potential for enabling robust phylogenetic species tree inferences. Specifically, we found matching, and in one case improved phylogenetic resolution within species complexes, compared to conventional phylogenetic analysis involving target gene extraction. Our findings indicate that a customized multitiered sequence capture strategy, in combination with promising yet underutilized phylogenomic approaches, will be effective for groups where interspecific divergence is recent, but information on deeper phylogenetic relationships is also required.
Collapse
Affiliation(s)
- Rod Peakall
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Darren C J Wong
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Ryan D Phillips
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Department of Ecology, Environment and Evolution, La Trobe University, Melbourne, Vic., Australia
| | - Monica Ruibal
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Rodney Eyles
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Claudia Rodriguez-Delgado
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Celeste C Linde
- Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| |
Collapse
|
18
|
Kandziora M, Sklenář P, Kolář F, Schmickl R. How to Tackle Phylogenetic Discordance in Recent and Rapidly Radiating Groups? Developing a Workflow Using Loricaria (Asteraceae) as an Example. FRONTIERS IN PLANT SCIENCE 2021; 12:765719. [PMID: 35069621 PMCID: PMC8777076 DOI: 10.3389/fpls.2021.765719] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Accepted: 11/22/2021] [Indexed: 05/17/2023]
Abstract
A major challenge in phylogenetics and -genomics is to resolve young rapidly radiating groups. The fast succession of species increases the probability of incomplete lineage sorting (ILS), and different topologies of the gene trees are expected, leading to gene tree discordance, i.e., not all gene trees represent the species tree. Phylogenetic discordance is common in phylogenomic datasets, and apart from ILS, additional sources include hybridization, whole-genome duplication, and methodological artifacts. Despite a high degree of gene tree discordance, species trees are often well supported and the sources of discordance are not further addressed in phylogenomic studies, which can eventually lead to incorrect phylogenetic hypotheses, especially in rapidly radiating groups. We chose the high-Andean Asteraceae genus Loricaria to shed light on the potential sources of phylogenetic discordance and generated a phylogenetic hypothesis. By accounting for paralogy during gene tree inference, we generated a species tree based on hundreds of nuclear loci, using Hyb-Seq, and a plastome phylogeny obtained from off-target reads during target enrichment. We observed a high degree of gene tree discordance, which we found implausible at first sight, because the genus did not show evidence of hybridization in previous studies. We used various phylogenomic analyses (trees and networks) as well as the D-statistics to test for ILS and hybridization, which we developed into a workflow on how to tackle phylogenetic discordance in recent radiations. We found strong evidence for ILS and hybridization within the genus Loricaria. Low genetic differentiation was evident between species located in different Andean cordilleras, which could be indicative of substantial introgression between populations, promoted during Pleistocene glaciations, when alpine habitats shifted creating opportunities for secondary contact and hybridization.
Collapse
Affiliation(s)
- Martha Kandziora
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- *Correspondence: Martha Kandziora,
| | - Petr Sklenář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Filip Kolář
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| | - Roswitha Schmickl
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| |
Collapse
|
19
|
Straub SCK, Boutte J, Fishbein M, Livshultz T. Enabling evolutionary studies at multiple scales in Apocynaceae through Hyb-Seq. APPLICATIONS IN PLANT SCIENCES 2020; 8:e11400. [PMID: 33304663 PMCID: PMC7705337 DOI: 10.1002/aps3.11400] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Accepted: 09/12/2020] [Indexed: 05/07/2023]
Abstract
PREMISE Apocynaceae is the 10th largest flowering plant family and a focus for study of plant-insect interactions, especially as mediated by secondary metabolites. However, it has few genomic resources relative to its size. Target capture sequencing is a powerful approach for genome reduction that facilitates studies requiring data from the nuclear genome in non-model taxa, such as Apocynaceae. METHODS Transcriptomes were used to design probes for targeted sequencing of putatively single-copy nuclear genes across Apocynaceae. The sequences obtained were used to assess the success of the probe design, the intrageneric and intraspecific variation in the targeted genes, and the utility of the genes for inferring phylogeny. RESULTS From 853 candidate nuclear genes, 835 were consistently recovered in single copy and were variable enough for phylogenomics. The inferred gene trees were useful for coalescent-based species tree analysis, which showed all subfamilies of Apocynaceae as monophyletic, while also resolving relationships among species within the genus Apocynum. Intraspecific comparison of Elytropus chilensis individuals revealed numerous single-nucleotide polymorphisms with potential for use in population-level studies. DISCUSSION Community use of this Hyb-Seq probe set will facilitate and promote progress in the study of Apocynaceae across scales from population genomics to phylogenomics.
Collapse
Affiliation(s)
- Shannon C. K. Straub
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Julien Boutte
- Department of BiologyHobart and William Smith Colleges300 Pulteney StreetGenevaNew York14456USA
| | - Mark Fishbein
- Department of Plant Biology, Ecology, and EvolutionOklahoma State University301 Physical SciencesStillwaterOklahoma74078USA
| | - Tatyana Livshultz
- Department of Biodiversity, Earth, and Environmental Sciences and the Academy of Natural SciencesDrexel University1900 Benjamin Franklin ParkwayPhiladelphiaPennsylvania19103USA
| |
Collapse
|
20
|
Bagley JC, Uribe-Convers S, Carlsen MM, Muchhala N. Utility of targeted sequence capture for phylogenomics in rapid, recent angiosperm radiations: Neotropical Burmeistera bellflowers as a case study. Mol Phylogenet Evol 2020; 152:106769. [PMID: 32081762 DOI: 10.1016/j.ympev.2020.106769] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 02/10/2020] [Accepted: 02/12/2020] [Indexed: 02/06/2023]
Abstract
Targeted sequence capture is a promising approach for large-scale phylogenomics. However, rapid evolutionary radiations pose significant challenges for phylogenetic inference (e.g. incomplete lineages sorting (ILS), phylogenetic noise), and the ability of targeted nuclear loci to resolve species trees despite such issues remains poorly studied. We test the utility of targeted sequence capture for inferring phylogenetic relationships in rapid, recent angiosperm radiations, focusing on Burmeistera bellflowers (Campanulaceae), which diversified into ~130 species over less than 3 million years. We compared phylogenies estimated from supercontig (exons plus flanking sequences), exon-only, and flanking-only datasets with 506-546 loci (~4.7 million bases) for 46 Burmeistera species/lineages and 10 outgroup taxa. Nuclear loci resolved backbone nodes and many congruent internal relationships with high support in concatenation and coalescent-based species tree analyses, and inferences were largely robust to effects of missing taxa and base composition biases. Nevertheless, species trees were incongruent between datasets, and gene trees exhibited remarkably high levels of conflict (~4-60% congruence, ~40-99% conflict) not simply driven by poor gene tree resolution. Higher gene tree heterogeneity at shorter branches suggests an important role of ILS, as expected for rapid radiations. Phylogenetic informativeness analyses also suggest this incongruence has resulted from low resolving power at short internal branches, consistent with ILS, and homoplasy at deeper nodes, with exons exhibiting much greater risk of incorrect topologies due to homoplasy than other datasets. Our findings suggest that targeted sequence capture is feasible for resolving rapid, recent angiosperm radiations, and that results based on supercontig alignments containing nuclear exons and flanking sequences have higher phylogenetic utility and accuracy than either alone. We use our results to make practical recommendations for future target capture-based studies of Burmeistera and other rapid angiosperm radiations, including that such studies should analyze supercontigs to maximize the phylogenetic information content of loci.
Collapse
Affiliation(s)
- Justin C Bagley
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA; Department of Biology, Virginia Commonwealth University, Richmond, VA 23284, USA.
| | - Simon Uribe-Convers
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| | - Mónica M Carlsen
- Research Department, Science and Conservation Division, Missouri Botanical Garden, St. Louis, MO 63110, USA
| | - Nathan Muchhala
- Department of Biology, University of Missouri-St. Louis, St. Louis, MO 63121, USA
| |
Collapse
|
21
|
Jones KE, Fér T, Schmickl RE, Dikow RB, Funk VA, Herrando‐Moraira S, Johnston PR, Kilian N, Siniscalchi CM, Susanna A, Slovák M, Thapa R, Watson LE, Mandel JR. An empirical assessment of a single family-wide hybrid capture locus set at multiple evolutionary timescales in Asteraceae. APPLICATIONS IN PLANT SCIENCES 2019; 7:e11295. [PMID: 31667023 PMCID: PMC6814182 DOI: 10.1002/aps3.11295] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 09/05/2019] [Indexed: 05/23/2023]
Abstract
PREMISE Hybrid capture with high-throughput sequencing (Hyb-Seq) is a powerful tool for evolutionary studies. The applicability of an Asteraceae family-specific Hyb-Seq probe set and the outcomes of different phylogenetic analyses are investigated here. METHODS Hyb-Seq data from 112 Asteraceae samples were organized into groups at different taxonomic levels (tribe, genus, and species). For each group, data sets of non-paralogous loci were built and proportions of parsimony informative characters estimated. The impacts of analyzing alternative data sets, removing long branches, and type of analysis on tree resolution and inferred topologies were investigated in tribe Cichorieae. RESULTS Alignments of the Asteraceae family-wide Hyb-Seq locus set were parsimony informative at all taxonomic levels. Levels of resolution and topologies inferred at shallower nodes differed depending on the locus data set and the type of analysis, and were affected by the presence of long branches. DISCUSSION The approach used to build a Hyb-Seq locus data set influenced resolution and topologies inferred in phylogenetic analyses. Removal of long branches improved the reliability of topological inferences in maximum likelihood analyses. The Astereaceae Hyb-Seq probe set is applicable at multiple taxonomic depths, which demonstrates that probe sets do not necessarily need to be lineage-specific.
Collapse
Affiliation(s)
- Katy E. Jones
- Botanischer Garten und Botanisches Museum BerlinFreie Universität BerlinKönigin‐Luise‐Str. 6–814195BerlinGermany
| | - Tomáš Fér
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
| | - Roswitha E. Schmickl
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
- Institute of BotanyThe Czech Academy of SciencesZámek 1CZ 25243PrůhoniceCzech Republic
| | - Rebecca B. Dikow
- Data Science LabOffice of the Chief Information OfficerSmithsonian InstitutionWashingtonD.C.20013‐7012USA
| | - Vicki A. Funk
- Department of BotanyNational Museum of Natural HistorySmithsonian InstitutionWashingtonD.C.20013‐7012USA
| | | | - Paul R. Johnston
- Freie Universität BerlinEvolutionary BiologyBerlinGermany
- Berlin Center for Genomics in Biodiversity ResearchBerlinGermany
- Leibniz‐Institute of Freshwater Ecology and Inland Fisheries (IGB)BerlinGermany
| | - Norbert Kilian
- Botanischer Garten und Botanisches Museum BerlinFreie Universität BerlinKönigin‐Luise‐Str. 6–814195BerlinGermany
| | - Carolina M. Siniscalchi
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| | - Alfonso Susanna
- Botanic Institute of Barcelona (IBB‐CSIC‐ICUB)Pg. del Migdia s.n.ES 08038BarcelonaSpain
| | - Marek Slovák
- Department of BotanyFaculty of ScienceCharles UniversityBenátská 2CZ 12800PragueCzech Republic
- Plant Science and Biodiversity CentreSlovak Academy of SciencesSK‐84523BratislavaSlovakia
| | - Ramhari Thapa
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| | - Linda E. Watson
- Department of Plant Biology, Ecology, and EvolutionOklahoma State UniversityStillwaterOklahoma74078USA
| | - Jennifer R. Mandel
- Department of Biological SciencesUniversity of MemphisMemphisTennessee38152USA
- Center for BiodiversityUniversity of MemphisMemphisTennessee38152USA
| |
Collapse
|