1
|
Karbstein K, Kösters L, Hodač L, Hofmann M, Hörandl E, Tomasello S, Wagner ND, Emerson BC, Albach DC, Scheu S, Bradler S, de Vries J, Irisarri I, Li H, Soltis P, Mäder P, Wäldchen J. Species delimitation 4.0: integrative taxonomy meets artificial intelligence. Trends Ecol Evol 2024:S0169-5347(23)00296-3. [PMID: 38849221 DOI: 10.1016/j.tree.2023.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 10/20/2023] [Accepted: 11/08/2023] [Indexed: 06/09/2024]
Abstract
Although species are central units for biological research, recent findings in genomics are raising awareness that what we call species can be ill-founded entities due to solely morphology-based, regional species descriptions. This particularly applies to groups characterized by intricate evolutionary processes such as hybridization, polyploidy, or asexuality. Here, challenges of current integrative taxonomy (genetics/genomics + morphology + ecology, etc.) become apparent: different favored species concepts, lack of universal characters/markers, missing appropriate analytical tools for intricate evolutionary processes, and highly subjective ranking and fusion of datasets. Now, integrative taxonomy combined with artificial intelligence under a unified species concept can enable automated feature learning and data integration, and thus reduce subjectivity in species delimitation. This approach will likely accelerate revising and unraveling eukaryotic biodiversity.
Collapse
Affiliation(s)
- Kevin Karbstein
- Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, 07745 Jena, Germany.
| | - Lara Kösters
- Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, 07745 Jena, Germany
| | - Ladislav Hodač
- Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, 07745 Jena, Germany
| | - Martin Hofmann
- Technical University of Ilmenau, Institute for Computer and Systems Engineering, 98693 Ilmenau, Germany
| | - Elvira Hörandl
- University of Göttingen, Albrecht-von-Haller Institute for Plant Sciences, Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), 37073 Göttingen, Germany
| | - Salvatore Tomasello
- University of Göttingen, Albrecht-von-Haller Institute for Plant Sciences, Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), 37073 Göttingen, Germany
| | - Natascha D Wagner
- University of Göttingen, Albrecht-von-Haller Institute for Plant Sciences, Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), 37073 Göttingen, Germany
| | - Brent C Emerson
- Institute of Natural Products and Agrobiology (IPNA-CSIC), Island Ecology and Evolution Research Group, 38206 La Laguna, Tenerife, Canary Islands, Spain
| | - Dirk C Albach
- Carl von Ossietzky-Universität Oldenburg, Institute of Biology and Environmental Science, 26129 Oldenburg, Germany
| | - Stefan Scheu
- University of Göttingen, Johann-Friedrich-Blumenbach Institute of Zoology and Anthropology, 37073 Göttingen, Germany; University of Göttingen, Centre of Biodiversity and Sustainable Land Use (CBL), 37073 Göttingen, Germany
| | - Sven Bradler
- University of Göttingen, Johann-Friedrich-Blumenbach Institute of Zoology and Anthropology, 37073 Göttingen, Germany
| | - Jan de Vries
- University of Göttingen, Institute for Microbiology and Genetics, Department of Applied Bioinformatics, 37077 Göttingen, Germany; University of Göttingen, Campus Institute Data Science (CIDAS), 37077 Göttingen, Germany; University of Göttingen, Göttingen Center for Molecular Biosciences (GZMB), Department of Applied Bioinformatics, 37077 Göttingen, Germany
| | - Iker Irisarri
- Leibniz Institute for the Analysis of Biodiversity Change (LIB), Centre for Molecular Biodiversity Research, Phylogenomics Section, Museum of Nature, 20146 Hamburg, Germany
| | - He Li
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Chenshan Botanical Garden, 201602 Shanghai, China
| | - Pamela Soltis
- University of Florida, Florida Museum of Natural History, 32611 Gainesville, USA
| | - Patrick Mäder
- Technical University of Ilmenau, Institute for Computer and Systems Engineering, 98693 Ilmenau, Germany; German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Puschstrasse 4, 04103 Leipzig, Germany; Friedrich Schiller University Jena, Faculty of Biological Sciences, Institute of Ecology and Evolution, Philosophenweg 16, 07743 Jena, Germany
| | - Jana Wäldchen
- Max Planck Institute for Biogeochemistry, Department of Biogeochemical Integration, 07745 Jena, Germany; German Centre for Integrative Biodiversity Research (iDiv), Halle-Jena-Leipzig, Puschstrasse 4, 04103 Leipzig, Germany
| |
Collapse
|
2
|
Sousa FD, Bertrand YJ, Zizka A, Cangrén P, Oxelman B, Pfeil BE. Chloroplast genome and nuclear loci data for 71 Medicago species. Data Brief 2024; 54:110540. [PMID: 38868387 PMCID: PMC11166683 DOI: 10.1016/j.dib.2024.110540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Revised: 04/15/2024] [Accepted: 05/13/2024] [Indexed: 06/14/2024] Open
Abstract
We present a dataset containing nuclear and chloroplast sequences for 71 species in genus Medicago (Fabaceae), as well as for 8 species in genera Melilotus and Trigonella. Sequence data for a total of 130 samples was obtained with high-throughput sequencing of enriched genomic DNA libraries targeting 61 single-copy nuclear genes from across the Medicago truncatula genome. Chloroplast sequence reads were also generated, allowing for the recovery of chloroplast genome sequences for all 130 samples. A fully-resolved phylogenetic tree was inferred from the chloroplast dataset using maximum-likelihoood methods. More than 80% of accepted Medicago species are represented in this dataset, including three subspecies of Medicago sativa (alfalfa). These data can be further utilised for phylogenetic analyses in Medicago and related genera, but also for probe and primer design and plant breeding studies.
Collapse
Affiliation(s)
- Filipe de Sousa
- cE3c-Centre for Ecology, Evolution and Environmental Changes & CHANGE-Global Change and Sustainability Institute, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisbon, Portugal
| | - Yann J.K. Bertrand
- Institute of Botany, Academy of Sciences of the Czech Republic, CZ-252 43 Průhonice, Czech Republic
| | - Alexander Zizka
- Department of Biology, Philipps-University Marburg, Karl-von-Frisch-Strape 8, 35043 Marburg, Germany
| | - Patrik Cangrén
- Department of Biological and Environmental Sciences, University of Gothenburg, Medicinaregatan 7B, Göteborg 413 90, Sweden
| | - Bengt Oxelman
- Department of Biological and Environmental Sciences, University of Gothenburg, Medicinaregatan 7B, Göteborg 413 90, Sweden
| | - Bernard E. Pfeil
- Department of Biological and Environmental Sciences, University of Gothenburg, Medicinaregatan 7B, Göteborg 413 90, Sweden
| |
Collapse
|
3
|
Lv Z, Addo Nyarko C, Ramtekey V, Behn H, Mason AS. Defining autopolyploidy: Cytology, genetics, and taxonomy. AMERICAN JOURNAL OF BOTANY 2024:e16292. [PMID: 38439575 DOI: 10.1002/ajb2.16292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 12/15/2023] [Accepted: 12/18/2023] [Indexed: 03/06/2024]
Abstract
Autopolyploidy is taxonomically defined as the presence of more than two copies of each genome within an organism or species, where the genomes present must all originate within the same species. Alternatively, "genetic" or "cytological" autopolyploidy is defined by polysomic inheritance: random pairing and segregation of the four (or more) homologous chromosomes present, with no preferential pairing partners. In this review, we provide an overview of methods used to categorize species as taxonomic and cytological autopolyploids, including both modern and obsolete cytological methods, marker-segregation-based and genomics methods. Subsequently, we also investigated how frequently polysomic inheritance has been reliably documented in autopolyploids. Pure or predominantly polysomic inheritance was documented in 39 of 43 putative autopolyploid species where inheritance data was available (91%) and in seven of eight synthetic autopolyploids, with several cases of more mixed inheritance within species. We found no clear cases of autopolyploids with disomic inheritance, which was likely a function of our search methodology. Interestingly, we found seven species with purely polysomic inheritance and another five species with partial or predominant polysomic inheritance that appear to be taxonomic allopolyploids. Our results suggest that observations of polysomic inheritance can lead to relabeling of taxonomically allopolyploid species as autopolyploid and highlight the need for further cytogenetic and genomic investigation into polyploid origins and inheritance types.
Collapse
Affiliation(s)
- Zhenling Lv
- Plant Breeding Department, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
| | - Charles Addo Nyarko
- Plant Breeding Department, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
| | - Vinita Ramtekey
- Plant Breeding Department, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
- ICAR-Indian Institute of Seed Science, 275103, Mau, India
| | - Helen Behn
- Plant Breeding Department, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
| | - Annaliese S Mason
- Plant Breeding Department, University of Bonn, Kirschallee 1, 53115, Bonn, Germany
| |
Collapse
|
4
|
Folk RA, Gaynor ML, Engle-Wrye NJ, O’Meara BC, Soltis PS, Soltis DE, Guralnick RP, Smith SA, Grady CJ, Okuyama Y. Identifying Climatic Drivers of Hybridization with a New Ancestral Niche Reconstruction Method. Syst Biol 2023; 72:856-873. [PMID: 37073863 PMCID: PMC10405357 DOI: 10.1093/sysbio/syad018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 03/23/2023] [Accepted: 04/17/2023] [Indexed: 04/20/2023] Open
Abstract
Applications of molecular phylogenetic approaches have uncovered evidence of hybridization across numerous clades of life, yet the environmental factors responsible for driving opportunities for hybridization remain obscure. Verbal models implicating geographic range shifts that brought species together during the Pleistocene have often been invoked, but quantitative tests using paleoclimatic data are needed to validate these models. Here, we produce a phylogeny for Heuchereae, a clade of 15 genera and 83 species in Saxifragaceae, with complete sampling of recognized species, using 277 nuclear loci and nearly complete chloroplast genomes. We then employ an improved framework with a coalescent simulation approach to test and confirm previous hybridization hypotheses and identify one new intergeneric hybridization event. Focusing on the North American distribution of Heuchereae, we introduce and implement a newly developed approach to reconstruct potential past distributions for ancestral lineages across all species in the clade and across a paleoclimatic record extending from the late Pliocene. Time calibration based on both nuclear and chloroplast trees recovers a mid- to late-Pleistocene date for most inferred hybridization events, a timeframe concomitant with repeated geographic range restriction into overlapping refugia. Our results indicate an important role for past episodes of climate change, and the contrasting responses of species with differing ecological strategies, in generating novel patterns of range contact among plant communities and therefore new opportunities for hybridization. The new ancestral niche method flexibly models the shape of niche while incorporating diverse sources of uncertainty and will be an important addition to the current comparative methods toolkit. [Ancestral niche reconstruction; hybridization; paleoclimate; pleistocene.].
Collapse
Affiliation(s)
- Ryan A Folk
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Michelle L Gaynor
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Nicholas J Engle-Wrye
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Brian C O’Meara
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Department of Biology, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Robert P Guralnick
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Charles J Grady
- Biodiversity Institute, University of Kansas, Lawrence, KS, 66045, USA
| | - Yudai Okuyama
- Tsukuba Botanical Garden, National Museum of Nature and Science, Tsukuba, Japan
| |
Collapse
|
5
|
Geometric Morphometric Versus Genomic Patterns in a Large Polyploid Plant Species Complex. BIOLOGY 2023; 12:biology12030418. [PMID: 36979110 PMCID: PMC10045763 DOI: 10.3390/biology12030418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 03/06/2023] [Accepted: 03/07/2023] [Indexed: 03/12/2023]
Abstract
Plant species complexes represent a particularly interesting example of taxonomically complex groups (TCGs), linking hybridization, apomixis, and polyploidy with complex morphological patterns. In such TCGs, mosaic-like character combinations and conflicts of morphological data with molecular phylogenies present a major problem for species classification. Here, we used the large polyploid apomictic European Ranunculus auricomus complex to study relationships among five diploid sexual progenitor species and 75 polyploid apomictic derivate taxa, based on geometric morphometrics using 11,690 landmarked objects (basal and stem leaves, receptacles), genomic data (97,312 RAD-Seq loci, 48 phased target enrichment genes, 71 plastid regions) from 220 populations. We showed that (1) observed genomic clusters correspond to morphological groupings based on basal leaves and concatenated traits, and morphological groups were best resolved with RAD-Seq data; (2) described apomictic taxa usually overlap within trait morphospace except for those taxa at the space edges; (3) apomictic phenotypes are highly influenced by parental subgenome composition and to a lesser extent by climatic factors; and (4) allopolyploid apomictic taxa, compared to their sexual progenitor, resemble a mosaic of ecological and morphological intermediate to transgressive biotypes. The joint evaluation of phylogenomic, phenotypic, reproductive, and ecological data supports a revision of purely descriptive, subjective traditional morphological classifications.
Collapse
|
6
|
Inference of Polyploid Origin and Inheritance Mode from Population Genomic Data. Methods Mol Biol 2023; 2545:279-295. [PMID: 36720819 DOI: 10.1007/978-1-0716-2561-3_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Whole-genome duplications yield varied chromosomal pairing patterns, ranging from strictly bivalent to multivalent, resulting in disomic and polysomic inheritance modes. In the bivalent case, homeologous chromosomes form pairs, where in a multivalent pattern all copies are homologous and are therefore free to pair and recombine. As sufficient sequencing data is more readily available than high-quality cytological assessments of meiotic behavior or population genetic assessment of allelic segregation, especially for non-model organisms, bioinformatics approaches to infer origins and inheritance modes of polyploids using short-read sequencing data are attractive. Here we describe two such approaches, where the first is based on distributions of allelic read depth at heterozygous sites within an individual, as the expectations of such distributions are different for disomic and polysomic inheritance modes. The second approach is more laborious and based on a phylogenetic assessment of partially phased haplotypes of a polyploid in comparison to the closest diploid relatives. We discuss the sources of deviations from expected inheritance patterns, advantages and pitfalls of both methods, effects of mating types on the performance of the methods, and possible future developments.
Collapse
|
7
|
Karbstein K, Tomasello S, Hodač L, Wagner N, Marinček P, Barke BH, Paetzold C, Hörandl E. Untying Gordian knots: unraveling reticulate polyploid plant evolution by genomic data using the large Ranunculus auricomus species complex. THE NEW PHYTOLOGIST 2022; 235:2081-2098. [PMID: 35633497 DOI: 10.1111/nph.18284] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 05/14/2022] [Indexed: 06/15/2023]
Abstract
Speciation via hybridization and polyploidization is a major evolutionary force in plant evolution but is still poorly understood for neopolyploid groups. Challenges are attributed to high heterozygosity, low genetic divergence, and missing information on progenitors, ploidy, and reproduction. We study the large Eurasian Ranunculus auricomus species complex and use a comprehensive workflow integrating reduced-representation sequencing (RRS) genomic data to unravel reticulate evolution, genome diversity and composition of polyploids. We rely on 97 312 restriction site-associated DNA sequencing (RAD-Seq) loci, 576 targeted nuclear genes (48 phased), and 71 plastid regions derived from 78 polyploid apomictic taxa and four diploid and one tetraploid putative sexual progenitor species. We applied (phylo)genomic structure, network, and single nucleotide polymorphism (SNP)-origin analyses. Results consistently showed only 3-5 supported and geographically structured polyploid genetic groups, each containing extant sexual and one unknown progenitor species. Combined analyses demonstrated predominantly allopolyploid origins, each involving 2-3 different diploid sexual progenitor species. Young allotetraploids were characterized by subgenome dominance and nonhybrid SNPs, suggesting substantial post-origin but little lineage-specific evolution. The biodiversity of neopolyploid complexes can result from multiple hybrid origins involving different progenitors and substantial post-origin evolution (e.g. homoeologous exchanges, hybrid segregation, gene flow). Reduced-representation sequencing genomic data including multi-approach information is efficient to delimit shallow reticulate relationships.
Collapse
Affiliation(s)
- Kevin Karbstein
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
- Georg-August University School of Science (GAUSS), University of Göttingen, 37073, Göttingen, Germany
| | - Salvatore Tomasello
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
| | - Ladislav Hodač
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
- Department of Biogeochemical Integration, Max Planck Institute for Biogeochemistry, 07745, Jena, Germany
| | - Natascha Wagner
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
| | - Pia Marinček
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
| | - Birthe Hilkka Barke
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
| | - Claudia Paetzold
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
- Department of Botany and Molecular Evolution, Senckenberg Research Institute, 60325, Frankfurt (Main), Germany
| | - Elvira Hörandl
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), Albrecht-von-Haller Institute for Plant Sciences, University of Göttingen, 37073, Göttingen, Germany
| |
Collapse
|
8
|
Saada OA, Friedrich A, Schacherer J. Towards accurate, contiguous and complete alignment-based polyploid phasing algorithms. Genomics 2022; 114:110369. [PMID: 35483655 DOI: 10.1016/j.ygeno.2022.110369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2021] [Revised: 03/09/2022] [Accepted: 04/11/2022] [Indexed: 01/14/2023]
Abstract
Phasing, and in particular polyploid phasing, have been challenging problems held back by the limited read length of high-throughput short read sequencing methods which can't overcome the distance between heterozygous sites and labor high cost of alternative methods such as the physical separation of chromosomes for example. Recently developed single molecule long-read sequencing methods provide much longer reads which overcome this previous limitation. Here we review the alignment-based methods of polyploid phasing that rely on four main strategies: population inference methods, which leverage the genetic information of several individuals to phase a sample; objective function minimization methods, which minimize a function such as the Minimum Error Correction (MEC); graph partitioning methods, which represent the read data as a graph and split it into k haplotype subgraphs; cluster building methods, which iteratively grow clusters of similar reads into a final set of clusters that represent the haplotypes. We discuss the advantages and limitations of these methods and the metrics used to assess their performance, proposing that accuracy and contiguity are the most meaningful metrics. Finally, we propose the field of alignment-based polyploid phasing would greatly benefit from the use of a well-designed benchmarking dataset with appropriate evaluation metrics. We consider that there are still significant improvements which can be achieved to obtain more accurate and contiguous polyploid phasing results which reflect the complexity of polyploid genome architectures.
Collapse
Affiliation(s)
- Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France; Institut Universitaire de France (IUF), Paris, France.
| |
Collapse
|
9
|
McLean BS, Bell KC, Cook JA. SNP-based Phylogenomic Inference in Holarctic Ground Squirrels (Urocitellus). Mol Phylogenet Evol 2022; 169:107396. [PMID: 35031463 DOI: 10.1016/j.ympev.2022.107396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Revised: 12/02/2021] [Accepted: 12/08/2021] [Indexed: 11/24/2022]
Abstract
Resolution of rapid evolutionary radiations requires harvesting maximal signal from phylogenomic datasets. However, studies of non-model clades often target conserved loci that are characterized by reduced information content, which can negatively affect gene tree precision and species tree accuracy. Single nucleotide polymorphism (SNP)-based methods are an underutilized but potentially valuable tool for estimating phylogeny and divergence times because they do not rely on resolved gene trees, allowing information from many or all variant loci to be leveraged in species tree reconstruction. We evaluated the utility of SNP-based methods in resolving phylogeny of Holarctic ground squirrels (Urocitellus), a radiation that has been difficult to disentangle, even in prior phylogenomic studies. We inferred phylogeny from a dataset of >3,000 ultraconserved element loci (UCEs) using two methods (SNAPP, SVDquartets) and compared our results with a new mitogenome phylogeny. We also systematically evaluated how phasing of UCEs improves per-locus information content, and inference of topology and other parameters within each of these SNP-based methods. Phasing improved topological resolution and branch length estimation at shallow levels (within species complexes), but less so at deeper levels, likely reflecting true uncertainty due to ancestral polymorphisms segregating in these rapidly diverging lineages. We resolved several key clades in Urocitellus and present targeted opportunities for future phylogenomic inquiry. Our results extend the roadmap for use of SNPs to address vertebrate radiations and support comparative analyses at multiple temporal scales.
Collapse
Affiliation(s)
- Bryan S McLean
- University of North Carolina Greensboro, Department of Biology, Greensboro, NC 27402 USA.
| | - Kayce C Bell
- Natural History Museum of Los Angeles County, Department of Mammalogy, Los Angeles, CA 90007 USA.
| | - Joseph A Cook
- University of New Mexico, Department of Biology and Museum of Southwestern Biology, Albuquerque, NM 87131 USA.
| |
Collapse
|
10
|
Abou Saada O, Tsouris A, Eberlein C, Friedrich A, Schacherer J. nPhase: an accurate and contiguous phasing method for polyploids. Genome Biol 2021; 22:126. [PMID: 33926549 PMCID: PMC8082856 DOI: 10.1186/s13059-021-02342-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 04/08/2021] [Indexed: 01/06/2023] Open
Abstract
While genome sequencing and assembly are now routine, we do not have a full, precise picture of polyploid genomes. No existing polyploid phasing method provides accurate and contiguous haplotype predictions. We developed nPhase, a ploidy agnostic tool that leverages long reads and accurate short reads to solve alignment-based phasing for samples of unspecified ploidy (https://github.com/OmarOakheart/nPhase). nPhase is validated by tests on simulated and real polyploids. nPhase obtains on average over 95% accuracy and a contiguous 1.25 haplotigs per haplotype to cover more than 90% of each chromosome (heterozygosity rate ≥ 0.5%). nPhase allows population genomics and hybrid studies of polyploids.
Collapse
Affiliation(s)
- Omar Abou Saada
- Université de Strasbourg, CNRS, GMGM UMR, 7156, Strasbourg, France
| | - Andreas Tsouris
- Université de Strasbourg, CNRS, GMGM UMR, 7156, Strasbourg, France
| | - Chris Eberlein
- Université de Strasbourg, CNRS, GMGM UMR, 7156, Strasbourg, France
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR, 7156, Strasbourg, France.
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR, 7156, Strasbourg, France. .,Institut Universitaire de France (IUF), Paris, France.
| |
Collapse
|
11
|
Šlenker M, Kantor A, Marhold K, Schmickl R, Mandáková T, Lysak MA, Perný M, Caboňová M, Slovák M, Zozomová-Lihová J. Allele Sorting as a Novel Approach to Resolving the Origin of Allotetraploids Using Hyb-Seq Data: A Case Study of the Balkan Mountain Endemic Cardamine barbaraeoides. FRONTIERS IN PLANT SCIENCE 2021; 12:659275. [PMID: 33995457 PMCID: PMC8115912 DOI: 10.3389/fpls.2021.659275] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Accepted: 03/10/2021] [Indexed: 05/19/2023]
Abstract
Mountains of the Balkan Peninsula are significant biodiversity hotspots with great species richness and a large proportion of narrow endemics. Processes that have driven the evolution of the rich Balkan mountain flora, however, are still insufficiently explored and understood. Here we focus on a group of Cardamine (Brassicaceae) perennials growing in wet, mainly mountainous habitats. It comprises several Mediterranean endemics, including those restricted to the Balkan Peninsula. We used target enrichment with genome skimming (Hyb-Seq) to infer their phylogenetic relationships, and, along with genomic in situ hybridization (GISH), to resolve the origin of tetraploid Cardamine barbaraeoides endemic to the Southern Pindos Mts. (Greece). We also explored the challenges of phylogenomic analyses of polyploid species and developed a new approach of allele sorting into homeologs that allows identifying subgenomes inherited from different progenitors. We obtained a robust phylogenetic reconstruction for diploids based on 1,168 low-copy nuclear genes, which suggested both allopatric and ecological speciation events. In addition, cases of plastid-nuclear discordance, in agreement with divergent nuclear ribosomal DNA (nrDNA) copy variants in some species, indicated traces of interspecific gene flow. Our results also support biogeographic links between the Balkan and Anatolian-Caucasus regions and illustrate the contribution of the latter region to high Balkan biodiversity. An allopolyploid origin was inferred for C. barbaraeoides, which highlights the role of mountains in the Balkan Peninsula both as refugia and melting pots favoring species contacts and polyploid evolution in response to Pleistocene climate-induced range dynamics. Overall, our study demonstrates the importance of a thorough phylogenomic approach when studying the evolution of recently diverged species complexes affected by reticulation events at both diploid and polyploid levels. We emphasize the significance of retrieving allelic and homeologous variation from nuclear genes, as well as multiple nrDNA copy variants from genome skim data.
Collapse
Affiliation(s)
- Marek Šlenker
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Adam Kantor
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Karol Marhold
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Roswitha Schmickl
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
- Institute of Botany, The Czech Academy of Sciences, Průhonice, Czechia
| | - Terezie Mandáková
- Central European Institute of Technology, Masaryk University, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Martin A. Lysak
- Central European Institute of Technology, Masaryk University, Brno, Czechia
- National Centre for Biomolecular Research, Faculty of Science, Masaryk University, Brno, Czechia
| | | | - Michaela Caboňová
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| | - Marek Slovák
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
- Department of Botany, Faculty of Science, Charles University, Prague, Czechia
| | - Judita Zozomová-Lihová
- Institute of Botany, Plant Science and Biodiversity Centre, Slovak Academy of Sciences, Bratislava, Slovakia
| |
Collapse
|
12
|
Wang N, Kelly LJ, McAllister HA, Zohren J, Buggs RJA. Resolving phylogeny and polyploid parentage using genus-wide genome-wide sequence data from birch trees. Mol Phylogenet Evol 2021; 160:107126. [PMID: 33647400 DOI: 10.1016/j.ympev.2021.107126] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 02/15/2021] [Accepted: 02/22/2021] [Indexed: 01/04/2023]
Abstract
Numerous plant genera have a history including frequent hybridisation and polyploidisation (allopolyploidisation), which means that their phylogeny is a network of reticulate evolution that cannot be accurately depicted as a bifurcating tree with a single tip per species. The genus Betula, which contains many ecologically important tree species, is a case in point. We generated genome-wide sequence reads for 27 diploid and 36 polyploid Betula species or subspecies using restriction site associated DNA (RAD) sequences. These reads were assembled into contigs with a mean length of 675 bp. We reconstructed the evolutionary relationships among diploid Betula species using both supermatrix (concatenation) and species tree methods. We identified the closest diploid relatives of the polyploids according to the relative rates at which reads from polyploids mapped to contigs from different diploid species within a concatenated reference sequence. By mapping reads from allopolyploids to their different putative diploid relatives we assembled contigs from the putative sub-genomes of allopolyploid taxa. We used these to build new phylogenies that included allopolyploid sub-genomes as separate tips. This approach yielded a highly evidenced phylogenetic hypothesis for the genus Betula, including the complex reticulate origins of the majority of its polyploid taxa. Our phylogeny divides the genus into two well supported clades, which, interestingly, differ in their seed-wing morphology. We therefore propose to split Betula into two subgenera.
Collapse
Affiliation(s)
- Nian Wang
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK; State Forestry and Grassland Administration Key Laboratory of Silviculture in Downstream Areas of the Yellow River, College of Forestry, Shandong Agricultural University, Tai'an 271018, China
| | - Laura J Kelly
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK; Royal Botanic Gardens Kew, Richmond, Surrey TW9 3AB, UK
| | - Hugh A McAllister
- Institute of Integrative Biology, Biosciences Building, University of Liverpool, Crown Street, Liverpool L69 7ZB, UK
| | - Jasmin Zohren
- Sex Chromosome Biology Lab, the Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
| | - Richard J A Buggs
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End Road, London E1 4NS, UK; Royal Botanic Gardens Kew, Richmond, Surrey TW9 3AB, UK.
| |
Collapse
|
13
|
Fehrer J, Slavíková R, Paštová L, Josefiová J, Mráz P, Chrtek J, Bertrand YJK. Molecular Evolution and Organization of Ribosomal DNA in the Hawkweed Tribe Hieraciinae (Cichorieae, Asteraceae). FRONTIERS IN PLANT SCIENCE 2021; 12:647375. [PMID: 33777082 PMCID: PMC7994888 DOI: 10.3389/fpls.2021.647375] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 02/19/2021] [Indexed: 05/14/2023]
Abstract
Molecular evolution of ribosomal DNA can be highly dynamic. Hundreds to thousands of copies in the genome are subject to concerted evolution, which homogenizes sequence variants to different degrees. If well homogenized, sequences are suitable for phylogeny reconstruction; if not, sequence polymorphism has to be handled appropriately. Here we investigate non-coding rDNA sequences (ITS/ETS, 5S-NTS) along with the chromosomal organization of their respective loci (45S and 5S rDNA) in diploids of the Hieraciinae. The subtribe consists of genera Hieracium, Pilosella, Andryala, and Hispidella and has a complex evolutionary history characterized by ancient intergeneric hybridization, allele sharing among species, and incomplete lineage sorting. Direct or cloned Sanger sequences and phased alleles derived from Illumina genome sequencing were subjected to phylogenetic analyses. Patterns of homogenization and tree topologies based on the three regions were compared. In contrast to most other plant groups, 5S-NTS sequences were generally better homogenized than ITS and ETS sequences. A novel case of ancient intergeneric hybridization between Hispidella and Hieracium was inferred, and some further incongruences between the trees were found, suggesting independent evolution of these regions. In some species, homogenization of ITS/ETS and 5S-NTS sequences proceeded in different directions although the 5S rDNA locus always occurred on the same chromosome with one 45S rDNA locus. The ancestral rDNA organization in the Hieraciinae comprised 4 loci of 45S rDNA in terminal positions and 2 loci of 5S rDNA in interstitial positions per diploid genome. In Hieracium, some deviations from this general pattern were found (3, 6, or 7 loci of 45S rDNA; three loci of 5S rDNA). Some of these deviations concerned intraspecific variation, and most of them occurred at the tips of the tree or independently in different lineages. This indicates that the organization of rDNA loci is more dynamic than the evolution of sequences contained in them and that locus number is therefore largely unsuitable to inform about species relationships in Hieracium. No consistent differences in the degree of sequence homogenization and the number of 45S rDNA loci were found, suggesting interlocus concerted evolution.
Collapse
Affiliation(s)
- Judith Fehrer
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czechia
- *Correspondence: Judith Fehrer,
| | - Renáta Slavíková
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czechia
| | | | - Jiřina Josefiová
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czechia
| | - Patrik Mráz
- Department of Botany, Charles University, Prague, Czechia
| | - Jindřich Chrtek
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czechia
- Department of Botany, Charles University, Prague, Czechia
| | | |
Collapse
|
14
|
Wagner ND, He L, Hörandl E. Phylogenomic Relationships and Evolution of Polyploid Salix Species Revealed by RAD Sequencing Data. FRONTIERS IN PLANT SCIENCE 2020; 11:1077. [PMID: 32765560 PMCID: PMC7379873 DOI: 10.3389/fpls.2020.01077] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 06/30/2020] [Indexed: 05/19/2023]
Abstract
Polyploidy is common in the genus Salix. However, little is known about the origin, parentage and genomic composition of polyploid species because of a lack of suitable molecular markers and analysis tools. We established a phylogenomic framework including species of all described sections of Eurasian shrub willows. We analyzed the genomic composition of seven polyploid willow species in comparison to putative diploid parental species to draw conclusions on their origin and the effects of backcrossing and post-origin evolution. We applied recently developed programs like SNAPP, HyDe, and SNiPloid to establish a bioinformatic pipeline for unravelling the complexity of polyploid genomes. RAD sequencing revealed 23,393 loci and 320,010 high quality SNPs for the analysis of relationships of 35 species of Eurasian shrub willows (Salix subg. Chamaetia/Vetrix). Polyploid willow species appear to be predominantly of allopolyploid origin. More ancient allopolyploidization events were observed for two hexaploid and one octoploid species, while our data suggested a more recent allopolyploid origin for the included tetraploids and identified putative parental taxa. SNiPloid analyses disentangled the different genomic signatures resulting from hybrid origin, backcrossing, and secondary post-origin evolution in the polyploid species. Our RAD sequencing data demonstrate that willow genomes are shaped by ancient and recent reticulate evolution, polyploidization, and post-origin divergence of species.
Collapse
Affiliation(s)
- Natascha D. Wagner
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), University of Goettingen, Göttingen, Germany
| | - Li He
- College of Biological Sciences and Technology, Beijing Forestry University, Beijing, China
| | - Elvira Hörandl
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium), University of Goettingen, Göttingen, Germany
| |
Collapse
|
15
|
Tomasello S, Karbstein K, Hodač L, Paetzold C, Hörandl E. Phylogenomics unravels Quaternary vicariance and allopatric speciation patterns in temperate‐montane plant species: A case study on the
Ranunculus auricomus
species complex. Mol Ecol 2020; 29:2031-2049. [DOI: 10.1111/mec.15458] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Accepted: 04/21/2020] [Indexed: 01/06/2023]
Affiliation(s)
- Salvatore Tomasello
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium) Albrecht‐von‐Haller Institute for Plant Sciences University of Goettingen Göttingen Germany
| | - Kevin Karbstein
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium) Albrecht‐von‐Haller Institute for Plant Sciences University of Goettingen Göttingen Germany
- Georg‐August University School of Science (GAUSS) University of Goettingen Goettingen Germany
| | - Ladislav Hodač
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium) Albrecht‐von‐Haller Institute for Plant Sciences University of Goettingen Göttingen Germany
| | - Claudia Paetzold
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium) Albrecht‐von‐Haller Institute for Plant Sciences University of Goettingen Göttingen Germany
| | - Elvira Hörandl
- Department of Systematics, Biodiversity and Evolution of Plants (with Herbarium) Albrecht‐von‐Haller Institute for Plant Sciences University of Goettingen Göttingen Germany
| |
Collapse
|
16
|
Blanco-Pastor JL, Bertrand YJK, Liberal IM, Wei Y, Brummer EC, Pfeil BE. Evolutionary networks from RADseq loci point to hybrid origins of Medicago carstiensis and Medicago cretacea. AMERICAN JOURNAL OF BOTANY 2019; 106:1219-1228. [PMID: 31535720 DOI: 10.1002/ajb2.1352] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/21/2019] [Accepted: 07/12/2019] [Indexed: 06/10/2023]
Abstract
PREMISE Although hybridization has played an important role in the evolution of many plant species, phylogenetic reconstructions that include hybridizing lineages have been historically constrained by the available models and data. Restriction-site-associated DNA sequencing (RADseq) has been a popular sequencing technique for the reconstruction of hybridization in the next-generation sequencing era. However, the utility of RADseq for the reconstruction of complex evolutionary networks has not been thoroughly investigated. Conflicting phylogenetic relationships in the genus Medicago have been mainly attributed to hybridization, but the specific hybrid origins of taxa have not been yet clarified. METHODS We obtained new molecular data from diploid species of Medicago section Medicago using single-digest RADseq to reconstruct evolutionary networks from gene trees, an approach that is computationally tractable with data sets that include several species and complex hybridization patterns. RESULTS Our analyses revealed that assembly filters to exclusively select a small set of loci with high phylogenetic information led to the most-divergent network topologies. Conversely, alternative clustering thresholds or filters on the number of samples per locus had a lower impact on networks. A strong hybridization signal was detected for M. carstiensis and M. cretacea, while signals were less clear for M. rugosa, M. rhodopea, M. suffruticosa, M. marina, M. scutellata, and M. sativa. CONCLUSIONS Complex network reconstructions from RADseq gene trees were not robust under variations of the assembly parameters and filters. But when the most-divergent networks were discarded, all remaining analyses consistently supported a hybrid origin for M. carstiensis and M. cretacea.
Collapse
Affiliation(s)
- José Luis Blanco-Pastor
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 40530, Göteborg, Sweden
- INRA, Centre Nouvelle-Aquitaine-Poitiers, UR4 (URP3F), 86600, Lusignan, France
| | - Yann J K Bertrand
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 40530, Göteborg, Sweden
- Institute of Botany, Czech Academy of Sciences, Zámek 1, 25243, Průhonice, Czech Republic
| | | | - Yanling Wei
- Plant Breeding Center, Department of Plant Sciences, University of California, Davis, Davis, CA, USA
| | - E Charles Brummer
- Plant Breeding Center, Department of Plant Sciences, University of California, Davis, Davis, CA, USA
| | - Bernard E Pfeil
- Department of Biological and Environmental Sciences, University of Gothenburg, Box 461, 40530, Göteborg, Sweden
| |
Collapse
|
17
|
Herrando-Moraira S. Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae). Mol Phylogenet Evol 2018; 128:69-87. [PMID: 30036700 DOI: 10.1016/j.ympev.2018.07.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2018] [Revised: 07/13/2018] [Accepted: 07/14/2018] [Indexed: 12/17/2022]
Abstract
Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, potential biases and impact of bioinformatic sequence treatments in phylogenetic inference have not been thoroughly explored yet. Here, we investigate this issue with an ultimate goal to shed light into a highly diversified group of Compositae (Asteraceae) constituted by four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. Here we confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, which are in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme of removing fast-evolving sites was estimated as the optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent between them than those inferred using the concatenation approach; (2) refining treatments only improved support values under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.
Collapse
Affiliation(s)
- Sonia Herrando-Moraira
- Botanic Institute of Barcelona (IBB, CSIC-ICUB), Pg. del Migdia, s.n., 08038 Barcelona, Spain.
| | | |
Collapse
|