1
|
Lorenzana GP, Figueiró HV, Coutinho LL, Villela PMS, Eizirik E. Comparative assessment of genotyping-by-sequencing and whole-exome sequencing for estimating genetic diversity and geographic structure in small sample sizes: insights from wild jaguar populations. Genetica 2024; 152:133-144. [PMID: 39322785 DOI: 10.1007/s10709-024-00212-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Accepted: 09/12/2024] [Indexed: 09/27/2024]
Abstract
Biologists currently have an assortment of high-throughput sequencing techniques allowing the study of population dynamics in increasing detail. The utility of genetic estimates depends on their ability to recover meaningful approximations while filtering out noise produced by artifacts. In this study, we empirically compared the congruence of two reduced representation approaches (genotyping-by-sequencing, GBS, and whole-exome sequencing, WES) in estimating genetic diversity and population structure using SNP markers typed in a small number of wild jaguar (Panthera onca) samples from South America. Due to its targeted nature, WES allowed for a more straightforward reconstruction of loci compared to GBS, facilitating the identification of true polymorphisms across individuals. We therefore used WES-derived metrics as a benchmark against which GBS-derived indicators were compared, adjusting parameters for locus assembly and SNP filtering in the latter. We observed significant variation in SNP call rates across samples in GBS datasets, leading to a recurrent miscalling of heterozygous sites. This issue was further amplified by small sample sizes, ultimately impacting the consistency of summary statistics between genotyping methods. Recognizing that the genetic markers obtained from GBS and WES are intrinsically different due to varying evolutionary pressures, particularly selection, we consider that our empirical comparison offers valuable insights and highlights critical considerations for estimating population genetic attributes using reduced representation datasets. Our results emphasize the critical need for careful evaluation of missing data and stringent filtering to achieve reliable estimates of genetic diversity and differentiation in elusive wildlife species.
Collapse
Affiliation(s)
- Gustavo P Lorenzana
- Laboratório de Biologia Genômica e Molecular, Escola de Ciências da Saúde e da Vida, PUCRS, Porto Alegre, Brazil.
- School of Forestry, Northern Arizona University, Flagstaff, AZ, USA.
| | - Henrique V Figueiró
- Laboratório de Biologia Genômica e Molecular, Escola de Ciências da Saúde e da Vida, PUCRS, Porto Alegre, Brazil
- Environmental Genomics Group, Vale Institute of Technology, Belem, Brazil
| | | | - Priscilla M S Villela
- Centro de Genômica Funcional, ESALQ-USP, Piracicaba, Brazil
- EcoMol Consultoria e Projetos, Piracicaba, Brazil
| | - Eduardo Eizirik
- Laboratório de Biologia Genômica e Molecular, Escola de Ciências da Saúde e da Vida, PUCRS, Porto Alegre, Brazil
- Instituto Pró-Carnívoros, Atibaia, Brazil
| |
Collapse
|
2
|
Hemstrom W, Grummer JA, Luikart G, Christie MR. Next-generation data filtering in the genomics era. Nat Rev Genet 2024; 25:750-767. [PMID: 38877133 DOI: 10.1038/s41576-024-00738-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/25/2024] [Indexed: 06/16/2024]
Abstract
Genomic data are ubiquitous across disciplines, from agriculture to biodiversity, ecology, evolution and human health. However, these datasets often contain noise or errors and are missing information that can affect the accuracy and reliability of subsequent computational analyses and conclusions. A key step in genomic data analysis is filtering - removing sequencing bases, reads, genetic variants and/or individuals from a dataset - to improve data quality for downstream analyses. Researchers are confronted with a multitude of choices when filtering genomic data; they must choose which filters to apply and select appropriate thresholds. To help usher in the next generation of genomic data filtering, we review and suggest best practices to improve the implementation, reproducibility and reporting standards for filter types and thresholds commonly applied to genomic datasets. We focus mainly on filters for minor allele frequency, missing data per individual or per locus, linkage disequilibrium and Hardy-Weinberg deviations. Using simulated and empirical datasets, we illustrate the large effects of different filtering thresholds on common population genetics statistics, such as Tajima's D value, population differentiation (FST), nucleotide diversity (π) and effective population size (Ne).
Collapse
Affiliation(s)
- William Hemstrom
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
| | - Jared A Grummer
- Flathead Lake Biological Station, Wildlife Biology Program and Division of Biological Sciences, University of Montana, Missoula, MT, USA
| | - Gordon Luikart
- Flathead Lake Biological Station, Wildlife Biology Program and Division of Biological Sciences, University of Montana, Missoula, MT, USA
| | - Mark R Christie
- Department of Biological Sciences, Purdue University, West Lafayette, IN, USA.
- Department of Forestry and Natural Resources, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
3
|
Jeffries DL, Lawson-Handley L, Lamatsch DK, Olsén KH, Sayer CD, Hänfling B. Towards the conservation of the crucian carp in Europe: Prolific hybridization but no evidence for introgression between native and non-native taxa. Mol Ecol 2024; 33:e17515. [PMID: 39212263 DOI: 10.1111/mec.17515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Revised: 07/02/2024] [Accepted: 08/13/2024] [Indexed: 09/04/2024]
Abstract
Hybridization plays a pivotal role in evolution, influencing local adaptation and speciation. However, it can also reduce biodiversity, which is especially damaging when native and non-native species meet. Hybridization can threaten native species via competition (with vigorous hybrids), reproductive resource wastage and gene introgression. The latter, in particular, could result in increased fitness in invasive species, decreased fitness of natives and compromise reintroduction or recovery conservation practices. In this study, we use a combination of RAD sequencing and microsatellites for a range-wide sample set of 1366 fish to evaluate the potential for hybridization and introgression between native crucian carp (Carassius carassius) and three non-native taxa (Carassius auratus auratus, Carassius auratus gibelio and Cyprinus carpio) in European water bodies. We found hybridization between native and non-native taxa in 82% of populations with non-natives present, highlighting the potential for substantial ecological impacts from hybrids on crucian carp populations. However, despite such high rates of hybridization, we could find no evidence of introgression between these taxa. The presence of triploid backcrosses in at least two populations suggests that the lack of introgression among these taxa is likely due to meiotic dysfunction in hybrids, leading to the production of polyploid offspring which are unable to reproduce sexually. This result is promising for crucian reintroduction programs, as it implies limited risk to the genetic integrity of source populations. Future research should investigate the reproductive potential of triploid hybrids and the ecological pressures hybrids impose on C. carassius.
Collapse
Affiliation(s)
- Daniel L Jeffries
- Evolutionary Biology Group, School of Biological, Biomedical and Environmental Sciences, University of Hull, Hull, UK
- Division of Evolutionary Ecology, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Lori Lawson-Handley
- Evolutionary Biology Group, School of Biological, Biomedical and Environmental Sciences, University of Hull, Hull, UK
| | - Dunja K Lamatsch
- Universität Innsbruck, Research Department for Limnology, Mondsee, Austria
| | - K Håkan Olsén
- School of Natural Sciences, Technology and Environmental Studies, Södetörn University, Huddinge, Stockholm, Sweden
| | - Carl D Sayer
- Pond Restoration Research Group, Department of Geography, University College London, London, UK
| | - Bernd Hänfling
- Division of Evolutionary Ecology, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
- Institute for Biodiversity and Freshwater Conservation, University of the Highlands and Islands, Inverness, UK
| |
Collapse
|
4
|
Peñafiel Loaiza N, Chafe AH, Moraes R M, Oleas NH, Roncal J. Genotyping-by-sequencing informs conservation of Andean palms sources of non-timber forest products. Evol Appl 2024; 17:e13765. [PMID: 39091352 PMCID: PMC11291087 DOI: 10.1111/eva.13765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 06/30/2024] [Accepted: 07/18/2024] [Indexed: 08/04/2024] Open
Abstract
Conservation and sustainable management of lineages providing non-timber forest products are imperative under the current global biodiversity loss. Most non-timber forest species, however, lack genomic studies that characterize their intraspecific variation and evolutionary history, which inform species' conservation practices. Contrary to many lineages in the Andean biodiversity hotspot that exhibit high diversification, the genus Parajubaea (Arecaceae) has only three species despite the genus' origin 22 million years ago. Two of the three palm species, P. torallyi and P. sunkha, are non-timber forest species endemic to the Andes of Bolivia and are listed as IUCN endangered. The third species, P. cocoides, is a vulnerable species with unknown wild populations. We investigated the evolutionary relationships of Parajubaea species and the genetic diversity and structure of wild Bolivian populations. Sequencing of five low-copy nuclear genes (3753 bp) challenged the hypothesis that P. cocoides is a cultigen that originated from the wild Bolivian species. We further obtained up to 15,134 de novo single-nucleotide polymorphism markers by genotyping-by-sequencing of 194 wild Parajubaea individuals. Our total DNA sequencing effort rejected the taxonomic separation of the two Bolivian species. As expected for narrow endemic species, we observed low genetic diversity, but no inbreeding signal. We found three genetic clusters shaped by geographic distance, which we use to propose three management units. Different percentages of missing genotypic data did not impact the genetic structure of populations. We use the management units to recommend in situ conservation by creating new protected areas, and ex situ conservation through seed collection.
Collapse
Affiliation(s)
- Nicolás Peñafiel Loaiza
- Department of BiologyMemorial University of NewfoundlandSt. John'sNewfoundland and LabradorCanada
- Present address:
Chone y BabahoyoLojaEcuador
| | - Abigail H. Chafe
- Department of BiologyMemorial University of NewfoundlandSt. John'sNewfoundland and LabradorCanada
| | - Mónica Moraes R
- Herbario Nacional de Bolivia, Instituto de EcologíaUniversidad Mayor de San AndrésLa PazBolivia
| | - Nora H. Oleas
- Centro de Investigación de la Biodiversidad y Cambio Climático – BioCamb e Ingeniería en Biodiversidad y Recursos Genéticos, Facultad de Ciencias de Medio AmbienteUniversidad IndoaméricaQuitoEcuador
| | - Julissa Roncal
- Department of BiologyMemorial University of NewfoundlandSt. John'sNewfoundland and LabradorCanada
| |
Collapse
|
5
|
Doublet M, Degalez F, Lagarrigue S, Lagoutte L, Gueret E, Allais S, Lecerf F. Variant calling and genotyping accuracy of ddRAD-seq: Comparison with 20X WGS in layers. PLoS One 2024; 19:e0298565. [PMID: 39058708 PMCID: PMC11280156 DOI: 10.1371/journal.pone.0298565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 05/23/2024] [Indexed: 07/28/2024] Open
Abstract
Whole Genome Sequencing (WGS) remains a costly or unsuitable method for routine genotyping of laying hens. Until now, breeding companies have been using or developing SNP chips. Nevertheless, alternatives methods based on sequencing have been developed. Among these, reduced representation sequencing approaches can offer sequencing quality and cost-effectiveness by reducing the genomic regions covered by sequencing. The aim of this study was to evaluate the ability of double digested Restriction site Associated DNA sequencing (ddRAD-seq) to identify and genotype SNPs in laying hens, by comparison with a presumed reliable WGS approach. Firstly, the sensitivity and precision of variant calling and the genotyping reliability of ddRADseq were determined. Next, the SNP Call Rate (CRSNP) and mean depth of sequencing per SNP (DPSNP) were compared between both methods. Finally, the effect of multiple combinations of thresholds for these parameters on genotyping reliability and amount of remaining SNPs in ddRAD-seq was studied. In raw form, the ddRAD-seq identified 349,497 SNPs evenly distributed on the genome with a CRSNP of 0.55, a DPSNP of 11X and a mean genotyping reliability rate per SNP of 80%. Considering genomic regions covered by expected enzymatic fragments (EFs), the sensitivity of the ddRAD-seq was estimated at 32.4% and its precision at 96.4%. The low CRSNP and DPSNP values were explained by the detection of SNPs outside the EFs theoretically generated by the ddRAD-seq protocol. Indeed, SNPs outside the EFs had significantly lower CRSNP (0.25) and DPSNP (1X) values than SNPs within the EFs (0.7 and 17X, resp.). The study demonstrated the relationship between CRSNP, DPSNP, genotyping reliability and the number of SNPs retained, to provide a decision-support tool for defining filtration thresholds. Severe quality control over ddRAD-seq data allowed to retain a minimum of 40% of the SNPs with a CcR of 98%. Then, ddRAD-seq was defined as a suitable method for variant calling and genotyping in layers.
Collapse
Affiliation(s)
| | | | | | | | - Elise Gueret
- MGX-Montpellier GenomiX, Univ. Montpellier, CNRS, INSERM, Montpellier, France
| | | | | |
Collapse
|
6
|
Byerly PA, Kearns AM, Welch A, Ochirbat ME, Marra PP, Wilson A, Campana MG, Fleischer RC. Museum genomics provide insight into the extinction of a specialist North American warbler species. Sci Rep 2024; 14:17047. [PMID: 39048633 PMCID: PMC11269716 DOI: 10.1038/s41598-024-67595-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2023] [Accepted: 07/12/2024] [Indexed: 07/27/2024] Open
Abstract
Museum genomics provide an opportunity to investigate population demographics of extinct species, especially valuable when research prior to extinction was minimal. The Bachman's warbler (Vermivora bachmanii) is hypothesized to have gone extinct due to loss of its specialized habitat. However, little is known about other potential contributing factors such as natural rarity or changes to connectivity following habitat fragmentation. We examined mitochondrial DNA (mtDNA) and genome-wide SNPs using specimens collected from breeding and migration sites across the range of the Bachman's warbler. We found no signals of strong population structuring across the breeding range of Bachman's warblers in both mtDNA and genome-wide SNPs. Thus, long-term population isolation did not appear to be a significant contributor to the extinction of the Bachman's warbler. Instead, our findings support the theory that Bachman's warblers underwent a rapid decline likely driven by habitat destruction, which may have been exacerbated by the natural rarity, habitat specificity and low genetic diversity of the species.
Collapse
Affiliation(s)
- Paige A Byerly
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA.
| | - Anna M Kearns
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Australian National Wildlife Collection, CSIRO National Research Collections Australia, Canberra, Australia
| | - Andreanna Welch
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Department of Biosciences, Durham University, South Road, Durham, UK
| | - Margad-Erdene Ochirbat
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Peter P Marra
- Department of Biology and McCourt School of Public Policy, Georgetown University, 37th and O Streets NW, Washington, DC, 20057, USA
| | - Amy Wilson
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | - Michael G Campana
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, 20008, USA
| |
Collapse
|
7
|
Correa Abondano M, Ospina JA, Wenzl P, Carvajal-Yepes M. Sampling strategies for genotyping common bean ( Phaseolus vulgaris L.) Genebank accessions with DArTseq: a comparison of single plants, multiple plants, and DNA pools. FRONTIERS IN PLANT SCIENCE 2024; 15:1338332. [PMID: 39055360 PMCID: PMC11269218 DOI: 10.3389/fpls.2024.1338332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 06/19/2024] [Indexed: 07/27/2024]
Abstract
Introduction Genotyping large-scale gene bank collections requires an appropriate sampling strategy to represent the diversity within and between accessions. Methods A panel of 44 common bean (Phaseolus vulgaris L.) landraces from the Alliance Bioversity and The Alliance of Bioversity International and the International Center for Tropical Agriculture (CIAT) gene bank was genotyped with DArTseq using three sampling strategies: a single plant per accession, 25 individual plants per accession jointly analyzed after genotyping (in silico-pool), and by pooling tissue from 25 individual plants per accession (seq-pool). Sampling strategies were compared to assess the technical aspects of the samples, the marker information content, and the genetic composition of the panel. Results The seq-pool strategy resulted in more consistent DNA libraries for quality and call rate, although with fewer polymorphic markers (6,142 single-nucleotide polymorphisms) than the in silico-pool (14,074) or the single plant sets (6,555). Estimates of allele frequencies by seq-pool and in silico-pool genotyping were consistent, but the results suggest that the difference between pools depends on population heterogeneity. Principal coordinate analysis, hierarchical clustering, and the estimation of admixture coefficients derived from a single plant, in silico-pool, and seq-pool successfully identified the well-known structure of Andean and Mesoamerican gene pools of P. vulgaris across all datasets. Conclusion In conclusion, seq-pool proved to be a viable approach for characterizing common bean germplasm compared to genotyping individual plants separately by balancing genotyping effort and costs. This study provides insights and serves as a valuable guide for gene bank researchers embarking on genotyping initiatives to characterize their collections. It aids curators in effectively managing the collections and facilitates marker-trait association studies, enabling the identification of candidate markers for key traits.
Collapse
Affiliation(s)
| | | | | | - Monica Carvajal-Yepes
- Genetic Resources Program, International Center for Tropical Agriculture (CIAT), Palmira, Colombia
| |
Collapse
|
8
|
Pierson TW, Kozak KH, Glenn TC, Fitzpatrick BM. River Drainage Reorganization and Reticulate Evolution in the Two-Lined Salamander (Eurycea bislineata) Species Complex. Syst Biol 2024; 73:26-35. [PMID: 37879625 DOI: 10.1093/sysbio/syad064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/14/2023] [Accepted: 10/16/2023] [Indexed: 10/27/2023] Open
Abstract
The origin and eventual loss of biogeographic barriers can create alternating periods of allopatry and secondary contact, facilitating gene flow among distinct metapopulations and generating reticulate evolutionary histories that are not adequately described by a bifurcating evolutionary tree. One such example may exist in the two-lined salamander (Eurycea bislineata) species complex, where discordance among morphological and molecular datasets has created a "vexing taxonomic challenge." Previous phylogeographic analyses of mitochondrial DNA (mtDNA) suggested that the reorganization of Miocene paleodrainages drove vicariance and dispersal, but the inherent limitations of a single-locus dataset precluded the evaluation of subsequent gene flow. Here, we generate triple-enzyme restriction site-associated DNA sequencing (3RAD) data for > 100 individuals representing all major mtDNA lineages and use a suite of complementary methods to demonstrate that discordance among earlier datasets is best explained by a reticulate evolutionary history influenced by river drainage reorganization. Systematics of such groups should acknowledge these complex histories and relationships that are not strictly hierarchical. [Amphibian; hybridization; introgression; Plethodontidae; stream capture.].
Collapse
Affiliation(s)
- Todd W Pierson
- Department of Ecology, Evolution, and Organismal Biology, Kennesaw State University, Kennesaw, GA 30144, USA
| | - Kenneth H Kozak
- Bell Museum and Department of Fisheries, Wildlife and Conservation Biology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Travis C Glenn
- Department of Environmental Health Science and Institute of Bioinformatics, University of Georgia, Athens, GA 30609, USA
| | - Benjamin M Fitzpatrick
- Department of Ecology and Evolutionary Biology, University of Tennessee Knoxville, Knoxville, TN 37996, USA
| |
Collapse
|
9
|
Koontz AC, Schumacher EK, Spence ES, Hoban SM. Ex situ conservation of two rare oak species using microsatellite and SNP markers. Evol Appl 2024; 17:e13650. [PMID: 38524684 PMCID: PMC10960078 DOI: 10.1111/eva.13650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 12/27/2023] [Accepted: 01/14/2024] [Indexed: 03/26/2024] Open
Abstract
Plant collections held by botanic gardens and arboreta are key components of ex situ conservation. Maintaining genetic diversity in such collections allows them to be used as resources for supplementing wild populations. However, most recommended minimum sample sizes for sufficient ex situ genetic diversity are based on microsatellite markers, and it remains unknown whether these sample sizes remain valid in light of more recently developed next-generation sequencing (NGS) approaches. To address this knowledge gap, we examine how ex situ conservation status and sampling recommendations differ when derived from microsatellites and single nucleotide polymorphisms (SNPs) in garden and wild samples of two threatened oak species. For Quercus acerifolia, SNPs show lower ex situ representation of wild allelic diversity and slightly lower minimum sample size estimates than microsatellites, while results for each marker are largely similar for Q. boyntonii. The application of missing data filters tends to lead to higher ex situ representation, while the impact of different SNP calling approaches is dependent on the species being analyzed. Measures of population differentiation within species are broadly similar between markers, but larger numbers of SNP loci allow for greater resolution of population structure and clearer assignment of ex situ individuals to wild source populations. Our results offer guidance for future ex situ conservation assessments utilizing SNP data, such as the application of missing data filters and the usage of a reference genome, and illustrate that both microsatellites and SNPs remain viable options for botanic gardens and arboreta seeking to ensure the genetic diversity of their collections.
Collapse
Affiliation(s)
| | | | - Emma S. Spence
- Morton ArboretumCenter for Tree ScienceLisleIllinoisUSA
- Cornell UniversityDepartment of Public and Ecosystem HealthIthacaNew YorkUSA
| | - Sean M. Hoban
- Morton ArboretumCenter for Tree ScienceLisleIllinoisUSA
| |
Collapse
|
10
|
Souza LHB, Pierson TW, Tenório RO, Ferro JM, Gatto KP, Silva BC, de Andrade GV, Suárez P, Haddad CFB, Lourenço LB. Multiple contact zones and karyotypic evolution in a neotropical frog species complex. Sci Rep 2024; 14:1119. [PMID: 38212602 PMCID: PMC10784582 DOI: 10.1038/s41598-024-51421-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 01/04/2024] [Indexed: 01/13/2024] Open
Abstract
Previous studies of DNA sequence and karyotypic data have revealed high genetic diversity in the Physalaemus cuvieri - Physalaemus ephippifer species complex-a group of small leptodactylid frogs in South America. To date, seven major genetic lineages have been recognized in this group, with species delimitation tests supporting four to seven of them as valid species. Among these, only P. ephippifer shows heteromorphic sex chromosomes, but the implications of cytogenetic divergence for the evolution of this group are unknown. We analyzed karyotypic, mitochondrial DNA, and 3RAD genomic data to characterize a putative contact zone between P. ephippifer and P. cuvieri Lineage 1, finding evidence for admixture and karyotypic evolution. We also describe preliminary evidence for admixture between two other members of this species complex-Lineage 1 and Lineage 3 of P. cuvieri. Our study sheds new light on evolutionary relationships in the P. cuvieri - P. ephippifer species complex, suggesting an important role of karyotypic divergence in its evolutionary history and underscoring the importance of hybridization as a mechanism of sex chromosome evolution in amphibians.
Collapse
Affiliation(s)
- Lucas H B Souza
- Laboratório de Estudos Cromossômicos (LabEsC), Departamento de Biologia Estrutural e Funcional, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, SP, 13083-863, Brazil.
| | - Todd W Pierson
- Department of Ecology, Evolution, and Organismal Biology, Kennesaw State University, Kennesaw, GA, USA
| | - Renata O Tenório
- Laboratório de Estudos Cromossômicos (LabEsC), Departamento de Biologia Estrutural e Funcional, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, SP, 13083-863, Brazil
| | - Juan M Ferro
- Laboratorio de Genética Evolutiva "Dr. Claudio J. Bidau", Instituto de Biología Subtropical (CONICET-UNaM), Facultad de Ciencias Exactas, Químicas y Naturales, Universidad Nacional de Misiones, Posadas, Misiones, Argentina
| | - Kaleb P Gatto
- Laboratório de Estudos Cromossômicos (LabEsC), Departamento de Biologia Estrutural e Funcional, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, SP, 13083-863, Brazil
| | - Bruno C Silva
- Laboratório de Estudos Cromossômicos (LabEsC), Departamento de Biologia Estrutural e Funcional, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, SP, 13083-863, Brazil
| | - Gilda V de Andrade
- Departamento de Biologia, Centro de Ciências Biológicas e da Saúde, Universidade Federal do Maranhão (UFMA), Campus do Bacanga, São Luís, MA, 65080-040, Brazil
| | - Pablo Suárez
- Instituto de Biología Subtropical (CONICET-UNaM), Puerto Iguazú, Argentina
| | - Célio F B Haddad
- Departamento de Biodiversidade and Centro de Aquicultura (CAUNESP), Instituto de Biociências, Universidade Estadual Paulista, Rio Claro, SP, Brazil
| | - Luciana B Lourenço
- Laboratório de Estudos Cromossômicos (LabEsC), Departamento de Biologia Estrutural e Funcional, Instituto de Biologia, Universidade Estadual de Campinas (UNICAMP), Campinas, SP, 13083-863, Brazil
| |
Collapse
|
11
|
Sidlauskas BL, Mathur S, Aydoğan H, Monzyk FR, Black AN. Genetic approaches reveal a healthy population and an unexpectedly recent origin for an isolated desert spring fish. BMC Ecol Evol 2024; 24:2. [PMID: 38177987 PMCID: PMC10765885 DOI: 10.1186/s12862-023-02191-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 12/17/2023] [Indexed: 01/06/2024] Open
Abstract
Foskett Spring in Oregon's desert harbors a historically threatened population of Western Speckled Dace (Rhinichthys klamathensis). Though recently delisted, the dace's recruitment depends upon regular removal of encroaching vegetation. Previous studies assumed that Foskett Dace separated from others in the Warner Valley about 10,000 years ago, thereby framing an enigma about the population's surprising ability to persist for so long in a tiny habitat easily overrun by plants. To investigate that persistence and the effectiveness of interventions to augment population size, we assessed genetic diversity among daces inhabiting Foskett Spring, a refuge at Dace Spring, and three nearby streams. Analysis revealed a robust effective population size (Ne) of nearly 5000 within Foskett Spring, though Ne in the Dace Spring refuge is just 10% of that value. Heterozygosity is slightly lower than expected based on random mating at all five sites, indicating mild inbreeding, but not at a level of concern. These results confirm the genetic health of Foskett Dace. Unexpectedly, genetic differentiation reveals closer similarity between Foskett Dace and a newly discovered population from Nevada's Coleman Creek than between Foskett Dace and dace elsewhere in Oregon. Demographic modeling inferred Coleman Creek as the ancestral source of Foskett Dace fewer than 1000 years ago, much more recently than previously suspected and possibly coincident with the arrival of large herbivores whose grazing may have maintained open water suitable for reproduction. These results solve the enigma of persistence by greatly shortening the duration over which Foskett Dace have inhabited their isolated spring.
Collapse
Affiliation(s)
- Brian L Sidlauskas
- Department of Fisheries, Wildlife and Conservation Sciences, Oregon State University, 104 Nash Hall, Corvallis, OR, 97331, USA.
| | - Samarth Mathur
- Department of Evolution, Ecology and Organismal Biology, The Ohio State University, 318 W 12th Ave, Columbus, OH, 43210, USA
| | - Hakan Aydoğan
- Department of Fisheries, Wildlife and Conservation Sciences, Oregon State University, 104 Nash Hall, Corvallis, OR, 97331, USA
| | - Fred R Monzyk
- Oregon Department of Fish and Wildlife, Corvallis Research Lab, 28655 OR-34, Corvallis, OR, 97333, USA
| | - Andrew N Black
- Center for Quantitative Life Sciences, Oregon State University, 2750 SW Campus Way, Corvallis, OR, 97331, USA
| |
Collapse
|
12
|
Duckett DJ, Calder K, Sullivan J, Tank DC, Carstens BC. Reduced representation approaches produce similar results to whole genome sequencing for some common phylogeographic analyses. PLoS One 2023; 18:e0291941. [PMID: 38032899 PMCID: PMC10688678 DOI: 10.1371/journal.pone.0291941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 09/09/2023] [Indexed: 12/02/2023] Open
Abstract
When designing phylogeographic investigations researchers can choose to collect many different types of molecular markers, including mitochondrial genes or genomes, SNPs from reduced representation protocols, large sequence capture data sets, and even whole genomes. Given that the statistical power and accuracy of various analyses are expected to differ depending on both the type of marker and the amount of data collected, an exploration of the variance across methodological results as a function of marker type should provide valuable information to researchers. Here we collect mitochondrial Cytochrome b sequences, whole mitochondrial genomes, single nucleotide polymorphisms (SNP)s isolated using a genotype by sequencing (GBS) protocol, sequences from ultraconserved elements, and low-coverage nuclear genomes from the North American water vole (Microtus richardsoni). We estimate genetic distances, population genetic structure, and historical demography using data from each of these datasets and compare the results across markers. As anticipated, the results exhibit differences across marker types, particularly in terms of the resolution offered by different analyses. A cost-benefit analysis indicates that SNPs collected using a GBS protocol are the most cost-effective molecular marker, with inferences that mirror those collected from the whole genome data at a fraction of the cost per sample.
Collapse
Affiliation(s)
- Drew J. Duckett
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, OH, United States of America
| | - Kailee Calder
- College of Veterinary Medicine and Biomedical Sciences, Colorado State University, Fort Collins, CO, United States of America
| | - Jack Sullivan
- Department of Biological Sciences, University of Idaho, Moscow, ID, United States of America
| | - David C. Tank
- Department of Botany, University of Wyoming, Laramie, WY, United States of America
| | - Bryan C. Carstens
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, OH, United States of America
| |
Collapse
|
13
|
Martchenko D, Shafer ABA. Contrasting whole-genome and reduced representation sequencing for population demographic and adaptive inference: an alpine mammal case study. Heredity (Edinb) 2023; 131:273-281. [PMID: 37532838 PMCID: PMC10539292 DOI: 10.1038/s41437-023-00643-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 07/22/2023] [Accepted: 07/22/2023] [Indexed: 08/04/2023] Open
Abstract
Genomes capture the adaptive and demographic history of a species, but the choice of sequencing strategy and sample size can impact such inferences. We compared whole genome and reduced representation sequencing approaches to study the population demographic and adaptive signals of the North American mountain goat (Oreamnos americanus). We applied the restriction site-associated DNA sequencing (RADseq) approach to 254 individuals and whole genome resequencing (WGS) approach to 35 individuals across the species range at mid-level coverage (9X) and to 5 individuals at high coverage (30X). We used ANGSD to estimate the genotype likelihoods and estimated the effective population size (Ne), population structure, and explicitly modelled the demographic history with δaδi and MSMC2. The data sets were overall concordant in supporting a glacial induced vicariance and extremely low Ne in mountain goats. We evaluated a set of climatic variables and geographic location as predictors of genetic diversity using redundancy analysis. A moderate proportion of total variance (36% for WGS and 21% for RADseq data sets) was explained by geography and climate variables; both data sets support a large impact of drift and some degree of local adaptation. The empirical similarities of WGS and RADseq presented herein reassuringly suggest that both approaches will recover large demographic and adaptive signals in a population; however, WGS offers several advantages over RADseq, such as inferring adaptive processes and calculating runs-of-homozygosity estimates. Considering the predicted climate-induced changes in alpine environments and the genetically depauperate mountain goat, the long-term adaptive capabilities of this enigmatic species are questionable.
Collapse
Affiliation(s)
- Daria Martchenko
- Environmental and Life Sciences Graduate Program, Trent University, 2140 East Bank Drive, Peterborough, ON, K9J 7B8, Canada.
| | - Aaron B A Shafer
- Environmental and Life Sciences Graduate Program, Trent University, 2140 East Bank Drive, Peterborough, ON, K9J 7B8, Canada
- Department of Forensics & Environmental and Life Sciences Graduate Program, Trent University, 2140 East Bank Drive, Peterborough, ON, K9J 7B8, Canada
| |
Collapse
|
14
|
de Oliveira DA, da Silva PHM, Novaes E, Grattapaglia D. Genome-wide analysis highlights genetic admixture in exotic germplasm resources of Eucalyptus and unexpected ancestral genomic composition of interspecific hybrids. PLoS One 2023; 18:e0289536. [PMID: 37552668 PMCID: PMC10409294 DOI: 10.1371/journal.pone.0289536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 07/20/2023] [Indexed: 08/10/2023] Open
Abstract
Eucalyptus is an economically important genus comprising more than 890 species in different subgenera and sections. Approximately twenty species of subgenus Symphyomyrtus account for 95% of the world's planted eucalypts. Discrimination of closely related eucalypt taxa is challenging, consistent with their recent phylogenetic divergence and occasional hybridization in nature. Admixture, misclassification or mislabeling of Eucalyptus germplasm resources maintained as exotics have been suggested, although no reports are available. Moreover, hybrids with increased productivity and traits complementarity are planted worldwide, but little is known about their actual genomic ancestry. In this study we examined a set of 440 trees of 16 different Eucalyptus species and 44 interspecific hybrids of multi-species origin conserved in germplasm banks in Brazil. We used genome-wide SNP data to evaluate the agreement between the alleged phylogenetic classification of species and provenances as registered in their historical records, and their observed genetic clustering derived from SNP data. Genetic structure analyses correctly assigned each of the 16 species to a different cluster although the PCA positioning of E. longirostrata was inconsistent with its current taxonomy. Admixture was present for closely related species' materials derived from local germplasm banks, indicating unintended hybridization following germplasm introduction. Provenances could be discriminated for some species, indicating that SNP-based discrimination was directly proportional to geographical distance, consistent with an isolation-by-distance model. SNP-based genomic ancestry analysis showed that the majority of the hybrids displayed realized genomic composition deviating from the expected ones based on their pedigree records, consistent with admixture in their parents and pervasive genome-wide directional selection toward the fast-growing E. grandis genome. SNP data in support of tree breeding provide precise germplasm identity verification, and allow breeders to objectively recognize the actual ancestral origin of superior hybrids to more realistically guide the program toward the development of the desired genetic combinations.
Collapse
Affiliation(s)
| | | | - Evandro Novaes
- Departamento de Biologia, Universidade Federal de Lavras, Lavras, MG, Brazil
| | - Dario Grattapaglia
- Plant Genetics Laboratory, EMBRAPA Genetic Resources and Biotechnology, Brasilia, DF, Brazil
| |
Collapse
|
15
|
Zhou H, Zhang W, Sheng Y, Qiu K, Liao L, Shi P, Xie Q, Pan H, Zhang J, Han Y. A large-scale behavior of allelic dropout and imbalance caused by DNA methylation changes in an early-ripening bud sport of peach. THE NEW PHYTOLOGIST 2023; 239:13-18. [PMID: 36960535 DOI: 10.1111/nph.18903] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 03/19/2023] [Indexed: 06/02/2023]
Affiliation(s)
- Hui Zhou
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Weihan Zhang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Hubei Hongshan Laboratory, Wuhan, 430074, China
| | - Yu Sheng
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Keli Qiu
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Liao Liao
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Hubei Hongshan Laboratory, Wuhan, 430074, China
| | - Pei Shi
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Qingmei Xie
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Haifa Pan
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Jinyun Zhang
- Key Laboratory of Horticultural Crop Germplasm Innovation and Utilization (Co-construction by Ministry and Province), Key Laboratory of Horticultural Crop Genetic Improvement and Eco-Physiology of Anhui Province, Institute of Horticulture, Anhui Academy of Agricultural Sciences, Hefei, 230031, China
| | - Yuepeng Han
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Hubei Hongshan Laboratory, Wuhan, 430074, China
| |
Collapse
|
16
|
Lopes F, Oliveira LR, Beux Y, Kessler A, Cárdenas-Alayza S, Majluf P, Páez-Rosas D, Chaves J, Crespo E, Brownell RL, Baylis AMM, Sepúlveda M, Franco-Trecu V, Loch C, Robertson BC, Peart CR, Wolf JBW, Bonatto SL. Genomic evidence for homoploid hybrid speciation in a marine mammal apex predator. SCIENCE ADVANCES 2023; 9:eadf6601. [PMID: 37134171 PMCID: PMC10156116 DOI: 10.1126/sciadv.adf6601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Hybridization is widespread and constitutes an important source of genetic variability and evolution. In animals, its role in generating novel and independent lineages (hybrid speciation) has been strongly debated, with only a few cases supported by genomic data. The South American fur seal (SAfs) Arctocephalus australis is a marine apex predator of Pacific and Atlantic waters, with a disjunct set of populations in Peru and Northern Chile [Peruvian fur seal (Pfs)] with controversial taxonomic status. We demonstrate, using complete genome and reduced representation sequencing, that the Pfs is a genetically distinct species with an admixed genome that originated from hybridization between the SAfs and the Galapagos fur seal (Arctocephalus galapagoensis) ~400,000 years ago. Our results strongly support the origin of Pfs by homoploid hybrid speciation over alternative introgression scenarios. This study highlights the role of hybridization in promoting species-level biodiversity in large vertebrates.
Collapse
Affiliation(s)
- Fernando Lopes
- Escola de Ciências da Saúde e da Vida, Pontifícia Universidade Católica do Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
- Laboratório de Ecologia de Mamíferos, Universidade do Vale do Rio dos Sinos, São Leopoldo, Brazil
- Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
| | - Larissa R Oliveira
- Finnish Museum of Natural History, University of Helsinki, Helsinki, Finland
- Grupo de Estudos de Mamíferos Aquáticos do Rio Grande do Sul (GEMARS), Torres, Brazil
| | - Yago Beux
- Escola de Ciências da Saúde e da Vida, Pontifícia Universidade Católica do Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Amanda Kessler
- Escola de Ciências da Saúde e da Vida, Pontifícia Universidade Católica do Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| | - Susana Cárdenas-Alayza
- Centro para la Sostenibilidad Ambiental, Universidad Peruana Cayetano Heredia, Lima, Peru
- Departamento de Ciencias Biológicas y Fisiológicas, Facultad de Ciencias y Filosofía, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Patricia Majluf
- Centro para la Sostenibilidad Ambiental, Universidad Peruana Cayetano Heredia, Lima, Peru
| | - Diego Páez-Rosas
- Colegio de Ciencias Biológicas y Ambientales, COCIBA, Universidad San Francisco de Quito, Quito, Ecuador
- Dirección del Parque Nacional Galápagos, Oficina Técnica San Cristobal, Islas Galápagos, Ecuador
| | - Jaime Chaves
- Colegio de Ciencias Biológicas y Ambientales, COCIBA, Universidad San Francisco de Quito, Quito, Ecuador
- Galapagos Science Center, Puerto Baquerizo Moreno, Ecuador
- Department of Biology, San Francisco State University, 1800 Holloway Ave, San Francisco, CA, USA
| | - Enrique Crespo
- Laboratório de Mamíferos Marinos, CESIMAR - CCT CENPAT, CONICET, Puerto Madryn, Argentina
| | - Robert L Brownell
- Southwest Fisheries Science Center, NOAA Fisheries, La Jolla, CA, USA
| | | | - Maritza Sepúlveda
- Centro de Investigación y Gestión de Recursos Naturales (CIGREN), Facultad de Ciencias, Universidad de Valparaíso, Valparaíso, Chile
| | - Valentina Franco-Trecu
- Departamento de Ecología y Evolución, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Carolina Loch
- Sir John Walsh Research Institute, Faculty of Dentistry, University of Otago, Dunedin, New Zealand
| | | | - Claire R Peart
- Division of Evolutionary Biology, LMU Munich, München, Germany
| | - Jochen B W Wolf
- Division of Evolutionary Biology, LMU Munich, München, Germany
| | - Sandro L Bonatto
- Escola de Ciências da Saúde e da Vida, Pontifícia Universidade Católica do Rio Grande do Sul, PUCRS, Porto Alegre, Brazil
| |
Collapse
|
17
|
Bartoš O, Bohlen J, Šlechtová VB, Kočí J, Röslein J, Janko K. Sequence capture: Obsolete or irreplaceable? A thorough validation across phylogenetic distances and its applicability to hybrids and allopolyploids. Mol Ecol Resour 2023. [PMID: 37122140 DOI: 10.1111/1755-0998.13806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 04/05/2023] [Accepted: 04/12/2023] [Indexed: 05/02/2023]
Abstract
As whole-genome sequencing has become pervasive, some have suggested that reduced genomic representation approaches, for example, sequence capture, are becoming obsolete. In the present study, we argue that these techniques still provide excellent tools in terms of price and quality of data as well as in their ability to provide markers with specific features, as required, for example, in phylogenomics. A potential drawback of the wide-scale application of reduced representation approaches could be their drop in efficiency with increasing phylogenetic distance from the reference species. While some studies have focused on the degree and performance of reduced representation techniques in such situations, to our knowledge, none of them evaluated their applicability to inter-specific hybrids and polyploids. This highlights a significant gap in current knowledge since there is increasing evidence for the frequent occurrence of natural hybrids and polyploids, as well as for the major importance of both phenomena in evolution. The main aim of the present study was to carry out a thorough validation of SEQcap applicability to (1) a set of non-model taxa with a wide range of phylogenetic relatedness and (2) inter-specific hybrids of various ploidies and genomic compositions. Considering the latter point, we especially focused on mechanisms causing allelic bias and consequent allelic dropout, as these could have confounding effects with respect to the evolutionary genomic dynamics of hybrids, especially in asexuals, which virtually reproduce as a frozen F1 generation.
Collapse
Affiliation(s)
- Oldřich Bartoš
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Jörg Bohlen
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Vendula Bohlen Šlechtová
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Jan Kočí
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Jan Röslein
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Karel Janko
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| |
Collapse
|
18
|
Chambers EA, Tarvin RD, Santos JC, Ron SR, Betancourth‐Cundar M, Hillis DM, Matz MV, Cannatella DC. 2b or not 2b? 2bRAD is an effective alternative to ddRAD for phylogenomics. Ecol Evol 2023; 13:e9842. [PMID: 36911313 PMCID: PMC9994478 DOI: 10.1002/ece3.9842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 02/02/2023] [Accepted: 02/03/2023] [Indexed: 03/10/2023] Open
Abstract
Restriction-site-associated DNA sequencing (RADseq) has become an accessible way to obtain genome-wide data in the form of single-nucleotide polymorphisms (SNPs) for phylogenetic inference. Nonetheless, how differences in RADseq methods influence phylogenetic estimation is poorly understood because most comparisons have largely relied on conceptual predictions rather than empirical tests. We examine how differences in ddRAD and 2bRAD data influence phylogenetic estimation in two non-model frog groups. We compare the impact of method choice on phylogenetic information, missing data, and allelic dropout, considering different sequencing depths. Given that researchers must balance input (funding, time) with output (amount and quality of data), we also provide comparisons of laboratory effort, computational time, monetary costs, and the repeatability of library preparation and sequencing. Both 2bRAD and ddRAD methods estimated well-supported trees, even at low sequencing depths, and had comparable amounts of missing data, patterns of allelic dropout, and phylogenetic signal. Compared to ddRAD, 2bRAD produced more repeatable datasets, had simpler laboratory protocols, and had an overall faster bioinformatics assembly. However, many fewer parsimony-informative sites per SNP were obtained from 2bRAD data when using native pipelines, highlighting a need for further investigation into the effects of each pipeline on resulting datasets. Our study underscores the importance of comparing RADseq methods, such as expected results and theoretical performance using empirical datasets, before undertaking costly experiments.
Collapse
Affiliation(s)
- E. Anne Chambers
- Department of Integrative Biology and Biodiversity CenterUniversity of Texas at AustinAustinTexasUSA
- Department of Environmental Science, Policy, and Management and Museum of Vertebrate ZoologyUniversity of California BerkeleyBerkeleyCaliforniaUSA
| | - Rebecca D. Tarvin
- Department of Integrative Biology and Biodiversity CenterUniversity of Texas at AustinAustinTexasUSA
- Department of Integrative Biology and Museum of Vertebrate ZoologyUniversity of California BerkeleyBerkeleyCaliforniaUSA
| | - Juan C. Santos
- Department of Biological SciencesSt John's UniversityNew YorkNew YorkUSA
| | - Santiago R. Ron
- Museo de Zoología, Escuela de Ciencias BiológicasPontificia Universidad Católica del EcuadorQuitoEcuador
| | | | - David M. Hillis
- Department of Integrative Biology and Biodiversity CenterUniversity of Texas at AustinAustinTexasUSA
| | - Mikhail V. Matz
- Department of Integrative Biology and Biodiversity CenterUniversity of Texas at AustinAustinTexasUSA
| | - David C. Cannatella
- Department of Integrative Biology and Biodiversity CenterUniversity of Texas at AustinAustinTexasUSA
| |
Collapse
|
19
|
Reed EMX, Reiskind MH, Burford Reiskind MO. Life-history stage and the population genetics of the tiger mosquito Aedes albopictus at a fine spatial scale. MEDICAL AND VETERINARY ENTOMOLOGY 2023; 37:132-142. [PMID: 36300547 DOI: 10.1111/mve.12618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 09/26/2022] [Indexed: 06/16/2023]
Abstract
As a widespread vector of disease with an expanding range, the mosquito Aedes albopictus Skuse (Diptera: Culicidae) is a high priority for research and management. A. albopictus has a complex life history with aquatic egg, larval and pupal stages, and a terrestrial adult stage. This requires targeted management strategies for each life stage, coordinated across time and space. Population genetics can aid in A. albopictus control by evaluating patterns of genetic diversity and dispersal. However, how life stage impacts population genetic characteristics is unknown. We examined whether patterns of A. albopictus genetic diversity and differentiation changed with life stage at a spatial scale relevant to management efforts. We first conducted a literature review of field-caught A. albopictus population genetic papers and identified 101 peer-reviewed publications, none of which compared results between life stages. Our study uniquely examines population genomic patterns of egg and adult A. albopictus at five sites in Wake County, North Carolina, USA, using 8425 single nucleotide polymorphisms. We found that the level of genetic diversity and connectivity between sites varied between adults and eggs. This warrants further study and is critical for research aimed at informing local management.
Collapse
Affiliation(s)
- Emily M X Reed
- Department of Biological Sciences, North Carolina State University, Raleigh, North Carolina, USA
| | - Michael H Reiskind
- Department of Entomology and Plant Pathology, North Carolina State University, Raleigh, North Carolina, USA
| | | |
Collapse
|
20
|
Sekino M, Hashimoto K, Nakamichi R, Yamamoto M, Fujinami Y, Sasaki T. Introgressive hybridization in the west Pacific pen shells (genus Atrina): Restricted interspecies gene flow within the genome. Mol Ecol 2023; 32:2945-2963. [PMID: 36855846 DOI: 10.1111/mec.16908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Revised: 02/03/2023] [Accepted: 02/14/2023] [Indexed: 03/02/2023]
Abstract
A compelling interest in marine biology is to elucidate how species boundaries between sympatric free-spawning marine invertebrates such as bivalve molluscs are maintained in the face of potential hybridization. Hybrid zones provide the natural resources for us to study the underlying genetic mechanisms of reproductive isolation between hybridizing species. Against this backdrop, we examined the occurrence of introgressive hybridization (introgression) between two bivalves distributed in the western Pacific margin, Atrina japonica and Atrina lischkeana, based on single-nucleotide polymorphisms (SNPs) derived from restriction site-associated DNA sequencing. Using 1066 ancestry-informative SNP sites, we also investigated the extent of introgression within the genome to search for SNP sites with reduced interspecies gene flow. A series of our individual-level clustering analyses including the principal component analysis, Bayesian model-based clustering, and triangle plotting based on ancestry-heterozygosity relationships for an admixed population sample from the Seto Inland Sea (Japan) consistently suggested the presence of specimens with varying degrees of genomic admixture, thereby implying that the two species are not completely isolated. The Bayesian genomic cline analysis identified 10 SNP sites with reduced introgression, each of which was located within a genic region or an intergenic region physically close to a functional gene. No, or very few, heterozygotes were observed at these sites in the hybrid zone, suggesting that selection acts against heterozygotes. Accordingly, we raised the possibility that the SNP sites are within genomic regions that are incompatible between the two species. Our finding of restricted interspecies gene flow at certain genomic regions gives new insight into the maintenance of species boundaries in hybridizing broadcast-spawning molluscs.
Collapse
Affiliation(s)
- Masashi Sekino
- Fisheries Resources Institute, Japan Fisheries Research and Education Agency, Yokohama, Kanagawa, Japan
| | - Kazumasa Hashimoto
- Fisheries Technology Institute, Japan Fisheries Research and Education Agency, Nagasaki, Japan
| | - Reiichiro Nakamichi
- Fisheries Resources Institute, Japan Fisheries Research and Education Agency, Yokohama, Kanagawa, Japan
| | - Masayuki Yamamoto
- Fisheries Division, Kagawa Prefectural Government, Takamatsu, Kagawa, Japan
| | - Yuichiro Fujinami
- Goto Field Station, Fisheries Technology Institute, Japan Fisheries Research and Education Agency, Nagasaki, Japan
| | - Takenori Sasaki
- The University Museum, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
21
|
Lavanchy E, Goudet J. Effect of reduced genomic representation on using runs of homozygosity for inbreeding characterization. Mol Ecol Resour 2023; 23:787-802. [PMID: 36626297 DOI: 10.1111/1755-0998.13755] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Revised: 12/22/2022] [Accepted: 01/05/2023] [Indexed: 01/11/2023]
Abstract
Genomic measures of inbreeding based on identical-by-descent (IBD) segments are increasingly used to measure inbreeding and mostly estimated on SNP arrays and whole-genome sequencing (WGS) data. However, some softwares recurrently used for their estimation assume that genomic positions which have not been genotyped are nonvariant. This might be true for WGS data, but not for reduced genomic representations and can lead to spurious IBD segments estimation. In this project, we simulated the outputs of WGS, two SNP arrays of different sizes and RAD-sequencing for three populations with different sizes and histories. We compare the results of IBD segments estimation with two softwares: runs of homozygosity (ROHs) estimated with PLINK and homozygous-by-descent (HBD) segments estimated with RZooRoH. We demonstrate that to obtain meaningful estimates of inbreeding, RZooRoH requires a SNPs density 11 times smaller compared to PLINK: ranks of inbreeding coefficients were conserved among individuals above 22 SNPs/Mb for PLINK and 2 SNPs/Mb for RZooRoH. We also show that in populations with simple demographic histories, distribution of ROHs and HBD segments are correctly estimated with both SNP arrays and WGS. PLINK correctly estimated distribution of ROHs with SNP densities above 22 SNPs/Mb, while RZooRoH correctly estimated distribution of HBD segments with SNPs densities above 11 SNPs/Mb. However, in a population with a more complex demographic history, RZooRoH resulted in better distribution of IBD segments estimation compared to PLINK even with WGS data. Consequently, we advise researchers to use either methods relying on excess homozygosity averaged across SNPs or model-based HBD segments calling methods for inbreeding estimations.
Collapse
Affiliation(s)
- Eléonore Lavanchy
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Jérôme Goudet
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
22
|
Lotterhos KE, Fitzpatrick MC, Blackmon H. Simulation Tests of Methods in Evolution, Ecology, and Systematics: Pitfalls, Progress, and Principles. ANNUAL REVIEW OF ECOLOGY, EVOLUTION, AND SYSTEMATICS 2022; 53:113-136. [PMID: 38107485 PMCID: PMC10723108 DOI: 10.1146/annurev-ecolsys-102320-093722] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Complex statistical methods are continuously developed across the fields of ecology, evolution, and systematics (EES). These fields, however, lack standardized principles for evaluating methods, which has led to high variability in the rigor with which methods are tested, a lack of clarity regarding their limitations, and the potential for misapplication. In this review, we illustrate the common pitfalls of method evaluations in EES, the advantages of testing methods with simulated data, and best practices for method evaluations. We highlight the difference between method evaluation and validation and review how simulations, when appropriately designed, can refine the domain in which a method can be reliably applied. We also discuss the strengths and limitations of different evaluation metrics. The potential for misapplication of methods would be greatly reduced if funding agencies, reviewers, and journals required principled method evaluation.
Collapse
Affiliation(s)
- Katie E Lotterhos
- Department of Marine and Environmental Sciences, Northeastern University, Nahant, Massachusetts, USA
| | - Matthew C Fitzpatrick
- Appalachian Lab, University of Maryland Center for Environmental Science, Frostburg, Maryland, USA
| | - Heath Blackmon
- Department of Biology, Texas A&M University, College Station, Texas, USA
| |
Collapse
|
23
|
Hoey JA, Able KW, Pinsky ML. Genetic decline and recovery of a demographically rebuilt fishery species. Mol Ecol 2022; 31:5684-5698. [PMID: 36114805 PMCID: PMC9828022 DOI: 10.1111/mec.16697] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 09/09/2022] [Accepted: 09/15/2022] [Indexed: 01/13/2023]
Abstract
The demographic history of a population is important for conservation and evolution, but this history is unknown for many populations. Methods that use genomic data have been developed to infer demography, but they can be challenging to implement and interpret, particularly for large populations. Thus, understanding if and when genetic estimates of demography correspond to true population history is important for assessing the performance of these genetic methods. Here, we used double-digest restriction-site associated DNA (ddRAD) sequencing data from archived collections of larval summer flounder (Paralichthys dentatus, n = 279) from three cohorts (1994-1995, 1997-1998 and 2008-2009) along the U.S. East coast to examine how contemporary effective population size and genetic diversity responded to changes in abundance in a natural population. Despite little to no detectable change in genetic diversity, coalescent-based demographic modelling from site frequency spectra revealed that summer flounder effective population size declined dramatically in the early 1980s. The timing and direction of change corresponded well with the observed decline in spawning stock census abundance in the late 1980s from independent fish surveys. Census abundance subsequently recovered and achieved the prebottleneck size. Effective population size also grew following the bottleneck. Our results for summer flounder demonstrate that genetic sampling and site frequency spectra can be useful for detecting population dynamics, even in species with large effective sizes.
Collapse
Affiliation(s)
- Jennifer A. Hoey
- Ecology, Evolution, & Natural ResourcesRutgers UniversityNew BrunswickNew JerseyUSA,Institute for Biodiversity Science and SustainabilityCalifornia Academy of SciencesSan FranciscoCaliforniaUSA
| | - Kenneth W. Able
- Marine Field Station, Department of Marine and Coastal Sciences, Rutgers UniversityTuckertonNew JerseyUSA
| | - Malin L. Pinsky
- Ecology, Evolution, & Natural ResourcesRutgers UniversityNew BrunswickNew JerseyUSA
| |
Collapse
|
24
|
Baccichet I, Chiozzotto R, Scaglione D, Bassi D, Rossini L, Cirilli M. Genetic dissection of fruit maturity date in apricot (P. armeniaca L.) through a Single Primer Enrichment Technology (SPET) approach. BMC Genomics 2022; 23:712. [PMID: 36258163 PMCID: PMC9580121 DOI: 10.1186/s12864-022-08901-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Accepted: 09/08/2022] [Indexed: 11/10/2022] Open
Abstract
Background Single primer enrichment technology (SPET) is an emerging and increasingly popular solution for high-throughput targeted genotyping in plants. Although SPET requires a priori identification of polymorphisms for probe design, this technology has potentially higher reproducibility and transferability compared to other reduced representation sequencing (RRS) approaches, also enabling the discovery of closely linked polymorphisms surrounding the target one. Results The potential for SPET application in fruit trees was evaluated by developing a 25K target SNPs assay to genotype a panel of apricot accessions and progenies. A total of 32,492 polymorphic sites were genotyped in 128 accessions (including 8,188 accessory non-target SNPs) with extremely low levels of missing data and a significant correlation of allelic frequencies compared to whole-genome sequencing data used for array design. Assay performance was further validated by estimating genotyping errors in two biparental progenies, resulting in an overall 1.8% rate. SPET genotyping data were used to infer population structure and to dissect the architecture of fruit maturity date (MD), a quantitative reproductive phenological trait of great agronomical interest in apricot species. Depending on the year, GWAS revealed loci associated to MD on several chromosomes. The QTLs on chromosomes 1 and 4 (the latter explaining most of the phenotypic variability in the panel) were the most consistent over years and were further confirmed by linkage mapping in two segregating progenies. Conclusions Besides the utility for marker assisted selection and for paving the way to in-depth studies to clarify the molecular bases of MD trait variation in apricot, the results provide an overview of the performance and reliability of SPET for fruit tree genetics. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08901-1.
Collapse
Affiliation(s)
| | | | | | - Daniele Bassi
- Università degli Studi di Milan - DiSAA, Milano, Italy
| | - Laura Rossini
- Università degli Studi di Milan - DiSAA, Milano, Italy.
| | - Marco Cirilli
- Università degli Studi di Milan - DiSAA, Milano, Italy.
| |
Collapse
|
25
|
Halsey MK, Stuhler JD, Bayona-Vásquez NJ, Platt RN, Goetze JR, Martin RE, Matocha KG, Bradley RD, Stevens RD, Ray DA. Comparison of genetic variation between rare and common congeners of Dipodomys with estimates of contemporary and historical effective population size. PLoS One 2022; 17:e0274554. [PMID: 36099283 PMCID: PMC9469943 DOI: 10.1371/journal.pone.0274554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 08/31/2022] [Indexed: 11/18/2022] Open
Abstract
Species with low effective population sizes are at greater risk of extinction because of reduced genetic diversity. Such species are more vulnerable to chance events that decrease population sizes (e.g. demographic stochasticity). Dipodomys elator, (Texas kangaroo rat) is a kangaroo rat that is classified as threatened in Texas and field surveys from the past 50 years indicate that the distribution of this species has decreased. This suggests geographic range reductions that could have caused population fluctuations, potentially impacting effective population size. Conversely, the more common and widespread D. ordii (Ord’s kangaroo rat) is thought to exhibit relative geographic and demographic stability. We assessed the genetic variation of D. elator and D. ordii samples using 3RAD, a modified restriction site associated sequencing approach. We hypothesized that D. elator would show lower levels of nucleotide diversity, observed heterozygosity, and effective population size when compared to D. ordii. We were also interested in identifying population structure within contemporary samples of D. elator and detecting genetic variation between temporal samples to understand demographic dynamics. We analyzed up to 61,000 single nucleotide polymorphisms. We found that genetic variability and effective population size in contemporary D. elator populations is lower than that of D. ordii. There is slight, if any, population structure within contemporary D. elator samples, and we found low genetic differentiation between spatial or temporal historical samples. This indicates little change in nuclear genetic diversity over 30 years. Results suggest that genetic diversity of D. elator has remained stable despite reduced population size and/or abundance, which may indicate a metapopulation-like system, whose fluctuations might counteract species extinction.
Collapse
Affiliation(s)
- Michaela K. Halsey
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
- Department of Natural Resources Management, Texas Tech University, Lubbock, Texas, United States of America
| | - John D. Stuhler
- Department of Natural Resources Management, Texas Tech University, Lubbock, Texas, United States of America
| | - Natalia J. Bayona-Vásquez
- Department of Environmental Health Science, University of Georgia, Athens, Georgia, United States of America
- Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
| | - Roy N. Platt
- Texas Biomedical Research Institute, San Antonio, Texas, United States of America
| | - Jim R. Goetze
- Natural Sciences Department, Laredo College, Laredo, Texas, United States of America
| | - Robert E. Martin
- Department of Biology, McMurry University, Abilene, Texas, United States of America
| | - Kenneth G. Matocha
- Department of Biology, South Arkansas Community College, El Dorado, Arkansas, United States of America
| | - Robert D. Bradley
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
- Natural Science Research Laboratory, Museum of Texas Tech, Lubbock, Texas, United States of America
| | - Richard D. Stevens
- Department of Natural Resources Management, Texas Tech University, Lubbock, Texas, United States of America
- Natural Science Research Laboratory, Museum of Texas Tech, Lubbock, Texas, United States of America
| | - David A. Ray
- Department of Biological Sciences, Texas Tech University, Lubbock, Texas, United States of America
- * E-mail:
| |
Collapse
|
26
|
Species and population genomic differentiation in Pocillopora corals (Cnidaria, Hexacorallia). Genetica 2022; 150:247-262. [PMID: 36083388 DOI: 10.1007/s10709-022-00165-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 09/01/2022] [Indexed: 11/04/2022]
Abstract
Correctly delimiting species and populations is a prerequisite for studies of connectivity, adaptation and conservation. Genomic data are particularly useful to test species differentiation for organisms with few informative morphological characters or low discrimination of cytoplasmic markers, as in Scleractinians. Here we applied Restriction site Associated DNA sequencing (RAD-sequencing) to the study of species differentiation and genetic structure in populations of Pocillopora spp. from Oman and French Polynesia, with the objectives to test species hypotheses, and to study the genetic structure among sampling sites within species. We focused here on coral colonies morphologically similar to P. acuta (damicornis type β). We tested the impact of different filtering strategies on the stability of the results. The main genetic differentiation was observed between samples from Oman and French Polynesia. These samples corresponded to different previously defined primary species hypotheses (PSH), i.e., PSHs 12 and 13 in Oman, and PSH 5 in French Polynesia. In Oman, we did not observe any clear differentiation between the two putative species PSH 12 and 13, nor between sampling sites. In French Polynesia, where a single species hypothesis was studied, there was no differentiation between sites. Our analyses allowed the identification of clonal lineages in Oman and French Polynesia. The impact of clonality on genetic diversity is discussed in light of individual-based simulations.
Collapse
|
27
|
Pacheco C, Lobo D, Silva P, Álvares F, García EJ, Castro D, Layna JF, López-Bao JV, Godinho R. Assessing the performance of historical skins and bones for museomics using wolf specimens as a case study. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.970249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Advances in the field of museomics have promoted a high sampling demand for natural history collections (NHCs), eventually resulting in damage to invaluable resources to understand historical biodiversity. It is thus essential to achieve a consensus about which historical tissues present the best sources of DNA. In this study, we evaluated the performance of different historical tissues from Iberian wolf NHCs in genome-wide assessments. We targeted three tissues—bone (jaw and femur), maxilloturbinal bone, and skin—that have been favored by traditional taxidermy practices for mammalian carnivores. Specifically, we performed shotgun sequencing and target capture enrichment for 100,000 single nucleotide polymorphisms (SNPs) selected from the commercial Canine HD BeadChip across 103 specimens from 1912 to 2005. The performance of the different tissues was assessed using metrics based on endogenous DNA content, uniquely high-quality mapped reads after capture, and enrichment proportions. All samples succeeded as DNA sources, regardless of their collection year or sample type. Skin samples yielded significantly higher amounts of endogenous DNA compared to both bone types, which yielded equivalent amounts. There was no evidence for a direct effect of tissue type on capture efficiency; however, the number of genotyped SNPs was strictly associated with the starting amount of endogenous DNA. Evaluation of genotyping accuracy for distinct minimum read depths across tissue types showed a consistent overall low genotyping error rate (<7%), even at low (3x) coverage. We recommend the use of skins as reliable and minimally destructive sources of endogenous DNA for whole-genome and target enrichment approaches in mammalian carnivores. In addition, we provide a new 100,000 SNP capture array validated for historical DNA (hDNA) compatible to the Canine HD BeadChip for high-quality DNA. The increasing demand for NHCs as DNA sources should encourage the generation of genomic datasets comparable among studies.
Collapse
|
28
|
Kusuma YWC, Matsuo A, Suyama Y, Wanke S, Isagi Y. Conservation genetics of three Rafflesia species in Java Island, Indonesia using SNP markers obtained from MIG-seq. CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01470-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
29
|
Vitek NS, McDaniel SF, Bloch JI. Microevolutionary variation in molar morphology of Onychomys leucogaster decoupled from genetic structure. Evolution 2022; 76:2032-2048. [PMID: 35872621 DOI: 10.1111/evo.14576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 04/22/2022] [Accepted: 04/29/2022] [Indexed: 01/22/2023]
Abstract
In neutral models of quantitative trait evolution, both genetic and phenotypic divergence scale as random walks, producing a correlation between the two measures. However, complexity in the genotype-phenotype map may alter the correlation between genotypic and phenotypic divergence, even when both are evolving neutrally or nearly so. Understanding this correlation between phenotypic and genetic variation is critical for accurately interpreting the fossil record. This study compares the geographic structure and scaling of morphological variation of the shape of the first lower molar of 77 individuals of the northern grasshopper mouse Onychomys leucogaster to genome-wide SNP variation in the same sample. We found strong genetic structure but weak or absent morphological structure indicating that the scaling of each type of variation is decoupled from one another. Low PST values relative to FST values are consistent with a lack of morphological divergence in contrast to genetic divergence between groups. This lack of phenotypic structure and the presence of notable within-sample phenotypic variance are consistent with uniform selection or constraints on molar shape across a wide geographic and environmental range. Over time, this kind of decoupling may result in patterns of phenotypic stasis masking underlying genetic patterns.
Collapse
Affiliation(s)
- Natasha S Vitek
- Department of Biology, University of Florida, Gainesville, Florida, 32611.,Florida Museum of Natural History, University of Florida, Gainesville, Florida, 32611.,Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, 11794
| | - Stuart F McDaniel
- Department of Biology, University of Florida, Gainesville, Florida, 32611
| | - Jonathan I Bloch
- Florida Museum of Natural History, University of Florida, Gainesville, Florida, 32611
| |
Collapse
|
30
|
Zhou W, Jenny Xiang QY. Phylogenomics and Biogeography of Castanea (Chestnut) and Hamamelis (Witch-hazel) - Choosing between RAD-seq and Hyb-Seq Approaches. Mol Phylogenet Evol 2022; 176:107592. [DOI: 10.1016/j.ympev.2022.107592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2021] [Revised: 06/18/2022] [Accepted: 07/20/2022] [Indexed: 10/31/2022]
|
31
|
Lujan NK, Colm JE, Weir JT, Montgomery FA, Noonan BP, Lovejoy NR, Mandrak NE. Genomic population structure of Grass Pickerel (Esox americanus vermiculatus) in Canada: management guidance for an at-risk fish at its northern range limit. CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01450-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
32
|
Hurt C, Hildreth P, Williams C. A genomic perspective on the conservation status of the endangered Nashville crayfish (Faxonius shoupi). CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01438-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
|
33
|
Dukić M, Bomblies K. Male and female recombination landscapes of diploid Arabidopsis arenosa. Genetics 2022; 220:iyab236. [PMID: 35100396 PMCID: PMC8893250 DOI: 10.1093/genetics/iyab236] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/17/2021] [Indexed: 12/13/2022] Open
Abstract
The number and placement of meiotic crossover events during meiosis have important implications for the fidelity of chromosome segregation as well as patterns of inheritance. Despite the functional importance of recombination, recombination landscapes vary widely among and within species, and this can have a strong impact on evolutionary processes. A good knowledge of recombination landscapes is important for model systems in evolutionary and ecological genetics, since it can improve interpretation of genomic patterns of differentiation and genome evolution, and provides an important starting point for understanding the causes and consequences of recombination rate variation. Arabidopsis arenosa is a powerful evolutionary genetic model for studying the molecular basis of adaptation and recombination rate evolution. Here, we generate genetic maps for 2 diploid A. arenosa individuals from distinct genetic lineages where we have prior knowledge that meiotic genes show evidence of selection. We complement the genetic maps with cytological approaches to map and quantify recombination rates, and test the idea that these populations might have distinct patterns of recombination. We explore how recombination differs at the level of populations, individuals, sexes and genomic regions. We show that the positioning of crossovers along a chromosome correlates with their number, presumably a consequence of crossover interference, and discuss how this effect can cause differences in recombination landscape among sexes or species. We identify several instances of female segregation distortion. We found that averaged genome-wide recombination rate is lower and sex differences subtler in A. arenosa than in Arabidopsis thaliana.
Collapse
Affiliation(s)
- Marinela Dukić
- Department of Biology, Plant Evolutionary Genetics, Institute of Plant Molecular Biology, ETH Zürich, Zürich 8092, Switzerland
| | - Kirsten Bomblies
- Department of Biology, Plant Evolutionary Genetics, Institute of Plant Molecular Biology, ETH Zürich, Zürich 8092, Switzerland
| |
Collapse
|
34
|
Lange JD, Bastide H, Lack JB, Pool JE. A Population Genomic Assessment of Three Decades of Evolution in a Natural Drosophila Population. Mol Biol Evol 2021; 39:6491261. [PMID: 34971382 PMCID: PMC8826484 DOI: 10.1093/molbev/msab368] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Population genetics seeks to illuminate the forces shaping genetic variation, often based on a single snapshot of genomic variation. However, utilizing multiple sampling times to study changes in allele frequencies can help clarify the relative roles of neutral and non-neutral forces on short time scales. This study compares whole-genome sequence variation of recently collected natural population samples of Drosophila melanogaster against a collection made approximately 35 years prior from the same locality—encompassing roughly 500 generations of evolution. The allele frequency changes between these time points would suggest a relatively small local effective population size on the order of 10,000, significantly smaller than the global effective population size of the species. Some loci display stronger allele frequency changes than would be expected anywhere in the genome under neutrality—most notably the tandem paralogs Cyp6a17 and Cyp6a23, which are impacted by structural variation associated with resistance to pyrethroid insecticides. We find a genome-wide excess of outliers for high genetic differentiation between old and new samples, but a larger number of adaptation targets may have affected SNP-level differentiation versus window differentiation. We also find evidence for strengthening latitudinal allele frequency clines: northern-associated alleles have increased in frequency by an average of nearly 2.5% at SNPs previously identified as clinal outliers, but no such pattern is observed at random SNPs. This project underscores the scientific potential of using multiple sampling time points to investigate how evolution operates in natural populations, by quantifying how genetic variation has changed over ecologically relevant timescales.
Collapse
Affiliation(s)
- Jeremy D Lange
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, 53706
| | - Héloïse Bastide
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, 53706
| | - Justin B Lack
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, 53706
| | - John E Pool
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, 53706
| |
Collapse
|
35
|
Wang L, Yang J, Zhang H, Tao Q, Zhang Y, Dang Z, Zhang F, Luo Z. Sequence coverage required for accurate genotyping by sequencing in polyploid species. Mol Ecol Resour 2021; 22:1417-1426. [PMID: 34826191 DOI: 10.1111/1755-0998.13558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2021] [Revised: 11/12/2021] [Accepted: 11/15/2021] [Indexed: 11/29/2022]
Abstract
Polyploidy plays an important role in the evolution of eukaryotes, especially for flowering plants. Many of ecologically or agronomically important plant or crop species are polyploids, including sycamore maple (tetraploid), the world second and third largest food crops wheat (hexaploid) and potato (tetraploid) as well as economically important aquaculture animals such as Atlantic salmon and trout. The next generation sequencing data enables to allocate genotype at a sequence variant site, known as genotyping by sequencing (GBS). GBS has stimulated enormous interests in population based genomics studies in almost all diploid and many polyploid organisms. DNA sequence polymorphisms are codominant and thus fully informative about the underlying genotype at the polymorphic site, making GBS a straightforward task in diploids. However, sequence data may usually be uninformative in polyploid species, making GBS a far more challenging task in polyploids. This paper presents novel and rigorous statistical methods for predicting the number of sequence reads needed to ensure accurate GBS at a polymorphic site bared by the reads in polyploids and shows that a dozen of reads can ensure a probability of 95% to recover all constituent alleles of any tetraploid genotype but several hundreds of reads are needed to accurately uncover the genotype with probability confidence of 90%, subverting the proposition of GBS using low coverage sequence data in the literature. The theoretical prediction was tested by use of RAD-seq data from tetraploid potato cultivars. The paper provides polyploid experimentalists with theoretical guides and methods for designing and conducting their sequence-based studies.
Collapse
Affiliation(s)
- Lin Wang
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Jixuan Yang
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Hong Zhang
- Department of Statistics and Finance, University of Science and Technology of China, Hefei, China
| | - Qin Tao
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Yuxin Zhang
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Zhenyu Dang
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Fengjun Zhang
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China
| | - Zewei Luo
- Laboratory of Population and Quantitative Genetics, Institute of Biostatistics, School of Life Sciences, Fudan University, Shanghai, China.,School of Biosciences, University of Birmingham, Birmingham, UK
| |
Collapse
|
36
|
Nazareno AG, Knowles LL. There Is No 'Rule of Thumb': Genomic Filter Settings for a Small Plant Population to Obtain Unbiased Gene Flow Estimates. FRONTIERS IN PLANT SCIENCE 2021; 12:677009. [PMID: 34721447 PMCID: PMC8551369 DOI: 10.3389/fpls.2021.677009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 06/16/2021] [Indexed: 06/13/2023]
Abstract
The application of high-density polymorphic single-nucleotide polymorphisms (SNP) markers derived from high-throughput sequencing methods has heralded plenty of biological questions about the linkages of processes operating at micro- and macroevolutionary scales. However, the effects of SNP filtering practices on population genetic inference have received much less attention. By performing sensitivity analyses, we empirically investigated how decisions about the percentage of missing data (MD) and the minor allele frequency (MAF) set in bioinformatic processing of genomic data affect direct (i.e., parentage analysis) and indirect (i.e., fine-scale spatial genetic structure - SGS) gene flow estimates. We focus specifically on these manifestations in small plant populations, and particularly, in the rare tropical plant species Dinizia jueirana-facao, where assumptions implicit to analytical procedures for accurate estimates of gene flow may not hold. Avoiding biases in dispersal estimates are essential given this species is facing extinction risks due to habitat loss, and so we also investigate the effects of forest fragmentation on the accuracy of dispersal estimates under different filtering criteria by testing for recent decrease in the scale of gene flow. Our sensitivity analyses demonstrate that gene flow estimates are robust to different setting of MAF (0.05-0.35) and MD (0-20%). Comparing the direct and indirect estimates of dispersal, we find that contemporary estimates of gene dispersal distance (σ r t = 41.8 m) was ∼ fourfold smaller than the historical estimates, supporting the hypothesis of a temporal shift in the scale of gene flow in D. jueirana-facao, which is consistent with predictions based on recent, dramatic forest fragmentation process. While we identified settings for filtering genomic data to avoid biases in gene flow estimates, we stress that there is no 'rule of thumb' for bioinformatic filtering and that relying on default program settings is not advisable. Instead, we suggest that the approach implemented here be applied independently in each separate empirical study to confirm appropriate settings to obtain unbiased population genetics estimates.
Collapse
Affiliation(s)
- Alison G. Nazareno
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States
- Department of Genetics, Ecology and Evolution, Federal University of Minas Gerais, Belo Horizonte, Brazil
| | - L. Lacey Knowles
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
37
|
Rasplus JY, Rodriguez LJ, Sauné L, Peng YQ, Bain A, Kjellberg F, Harrison RD, Pereira RAS, Ubaidillah R, Tollon-Cordet C, Gautier M, Rossi JP, Cruaud A. Exploring systematic biases, rooting methods and morphological evidence to unravel the evolutionary history of the genus Ficus (Moraceae). Cladistics 2021; 37:402-422. [PMID: 34478193 DOI: 10.1111/cla.12443] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/11/2020] [Indexed: 11/28/2022] Open
Abstract
Despite many attempts in the Sanger sequencing era, the phylogeny of fig trees remains unresolved, which limits our ability to analyze the evolution of key traits that may have contributed to their evolutionary and ecological success. We used restriction-site-associated DNA sequencing (c. 420 kb) and 102 morphological characters to elucidate the relationships between 70 species of Ficus. To increase phylogenetic information for higher-level relationships, we targeted conserved regions and assembled paired reads into long loci to enable the retrieval of homologous loci in outgroup genomes. We compared morphological and molecular results to highlight discrepancies and reveal possible inference bias. For the first time, we recovered a monophyletic subgenus Urostigma (stranglers) and a clade with all gynodioecious Ficus. However, we show, with a new approach based on iterative principal component analysis, that it is not (and will probably never be) possible to homogenize evolutionary rates and GC content for all taxa before phylogenetic inference. Four competing positions for the root of the molecular tree are possible. The placement of section Pharmacosycea as sister to other fig trees is not supported by morphological data and considered a result of a long-branch attraction artefact to the outgroups. Regarding morphological features and indirect evidence from the pollinator tree of life, the topology that divides Ficus into monoecious versus gynodioecious species appears most plausible. It seems most likely that the ancestor of fig trees was a freestanding tree and active pollination is inferred as the ancestral state, contrary to previous hypotheses. However, ambiguity remains on the ancestral breeding system. Despite morphological plasticity, we advocate restoring a central role to morphology in our understanding of the evolution of Ficus, as it can help detect systematic errors that appear more pronounced with larger molecular datasets.
Collapse
Affiliation(s)
- Jean-Yves Rasplus
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34988, France
| | - Lillian Jennifer Rodriguez
- Institute of Biology, University of the Philippines Diliman, Quezon City, 1101, Philippines.,Natural Sciences Research Institute, University of the Philippines Diliman, Quezon City, 1101, Philippines
| | - Laure Sauné
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34988, France
| | - Yang-Qiong Peng
- CAS Key Laboratory of Tropical Forest Ecology, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Kunming, 650223, China
| | - Anthony Bain
- Department of Biological Sciences, National Sun Yat-sen University, Kaohsiung, 80424, Taiwan
| | - Finn Kjellberg
- CEFE, CNRS, Université Paul-Valéry Montpellier, EPHE, Université de Montpellier, Montpellier, 34090, France
| | - Rhett D Harrison
- World Agroforestry, Eastern and Southern Africa, Region, 13 Elm Road, Woodlands, Lusaka, 10101, Zambia
| | - Rodrigo A S Pereira
- Departamento de Biologia, FFCLRP, Universidade de São Paulo, Ribeirão Preto, SP, 14040-901, Brazil
| | - Rosichon Ubaidillah
- Museum Zoologicum Bogoriense, LIPI, Gedung Widyasatwaloka, Jln Raya km 46, Cibinong, Bogor, 16911, Indonesia
| | - Christine Tollon-Cordet
- AGAP, INRA, CIRAD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34398, France
| | - Mathieu Gautier
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34988, France
| | - Jean-Pierre Rossi
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34988, France
| | - Astrid Cruaud
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Université de Montpellier, Montpellier, 34988, France
| |
Collapse
|
38
|
Waples RS, Waples RK, Ward EJ. Pseudoreplication in genomics-scale datasets. Mol Ecol Resour 2021; 22:503-518. [PMID: 34351073 DOI: 10.1111/1755-0998.13482] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 06/14/2021] [Accepted: 07/23/2021] [Indexed: 11/30/2022]
Abstract
In genomics-scale datasets, loci are closely packed within chromosomes and hence provide correlated information. Averaging across loci as if they were independent creates pseudoreplication, which reduces the effective degrees of freedom (df') compared to the nominal degrees of freedom, df. This issue has been known for some time, but consequences have not been systematically quantified across the entire genome. Here we measured pseudoreplication (quantified by the ratio df'/df) for a common metric of genetic differentiation (FST ) and a common measure of linkage disequilibrium between pairs of loci (r2 ). Based on data simulated using models (SLiM and msprime) that allow efficient forward-in-time and coalescent simulations while precisely controlling population pedigrees, we estimated df' and df'/df by measuring the rate of decline in the variance of mean FST and mean r2 as more loci were used. For both indices, df' increases with Ne and genome size, as expected. However, even for large Ne and large genomes, df' for mean r2 plateaus after a few thousand loci, and a variance components analysis indicates that the limiting factor is uncertainty associated with sampling individuals rather than genes. Pseudoreplication is less extreme for FST , but df'/df ≤0.01 can occur in datasets using tens of thousands of loci. Commonly-used block-jackknife methods consistently overestimated var(FST ), producing very conservative confidence intervals. Predicting df' based on our modeling results as a function of Ne , L, S, and genome size provides a robust way to quantify precision associated with genomics-scale datasets.
Collapse
Affiliation(s)
- Robin S Waples
- NOAA Fisheries, Northwest Fisheries Science Center, 2725 Montlake Blvd. East, Seattle, WA, 98112, USA
| | - Ryan K Waples
- Department of Biology, Section for Computational and RNA Biology, University of Copenhagen, Copenhagen, Denmark.,Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - Eric J Ward
- NOAA Fisheries, Northwest Fisheries Science Center, 2725 Montlake Blvd. East, Seattle, WA, 98112, USA
| |
Collapse
|
39
|
Leal BSS, Chaves CJN, Graciano VA, Boury C, Huacre LAP, Heuertz M, Palma-Silva C. Evidence of local adaptation despite strong drift in a Neotropical patchily distributed bromeliad. Heredity (Edinb) 2021; 127:203-218. [PMID: 33953353 PMCID: PMC8322333 DOI: 10.1038/s41437-021-00442-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 04/16/2021] [Accepted: 04/17/2021] [Indexed: 02/03/2023] Open
Abstract
Both genetic drift and divergent selection are predicted to be drivers of population differentiation across patchy habitats, but the extent to which these forces act on natural populations to shape traits is strongly affected by species' ecological features. In this study, we infer the genomic structure of Pitcairnia lanuginosa, a widespread herbaceous perennial plant with a patchy distribution. We sampled populations in the Brazilian Cerrado and the Central Andean Yungas and discovered and genotyped SNP markers using double-digest restriction-site associated DNA sequencing. In addition, we analyzed ecophysiological traits obtained from a common garden experiment and compared patterns of phenotypic and genetic divergence (PST-FST comparisons) in a subset of populations from the Cerrado. Our results from molecular analyses pointed to extremely low genetic diversity and a remarkable population differentiation, supporting a major role of genetic drift. Approximately 0.3% of genotyped SNPs were flagged as differentiation outliers by at least two distinct methods, and Bayesian generalized linear mixed models revealed a signature of isolation by environment in addition to isolation by distance for high-differentiation outlier SNPs among the Cerrado populations. PST-FST comparisons suggested divergent selection on two ecophysiological traits linked to drought tolerance. We showed that these traits vary among populations, although without any particular macro-spatial pattern, suggesting local adaptation to differences in micro-habitats. Our study shows that selection might be a relevant force, particularly for traits involved in drought stress, even for populations experiencing strong drift, which improves our knowledge on eco-evolutionary processes acting on non-continuously distributed species.
Collapse
Affiliation(s)
- Bárbara Simões Santos Leal
- grid.410543.70000 0001 2188 478XDepartamento de Ecologia, Instituto de Biociências, Universidade Estadual Paulista, Rio Claro, São Paulo Brazil
| | - Cleber Juliano Neves Chaves
- grid.410543.70000 0001 2188 478XDepartamento de Ecologia, Instituto de Biociências, Universidade Estadual Paulista, Rio Claro, São Paulo Brazil
| | - Vanessa Araujo Graciano
- grid.410543.70000 0001 2188 478XDepartamento de Ecologia, Instituto de Biociências, Universidade Estadual Paulista, Rio Claro, São Paulo Brazil
| | - Christophe Boury
- grid.412041.20000 0001 2106 639XINRAE, Univ. Bordeaux, Biogeco, Cestas France
| | - Luis Alberto Pillaca Huacre
- grid.10800.390000 0001 2107 4576Departamento de Ecología, Museo de Historia Natural de la Universidad Nacional Mayor de San Marcos, Lima, Peru
| | - Myriam Heuertz
- grid.412041.20000 0001 2106 639XINRAE, Univ. Bordeaux, Biogeco, Cestas France
| | - Clarisse Palma-Silva
- grid.410543.70000 0001 2188 478XDepartamento de Ecologia, Instituto de Biociências, Universidade Estadual Paulista, Rio Claro, São Paulo Brazil ,grid.411087.b0000 0001 0723 2494Departamento de Biologia Vegetal, Instituto de Biologia, Universidade Estadual de Campinas, Campinas, São Paulo, Brazil
| |
Collapse
|
40
|
O'Connell KA, Mulder KP, Wynn A, de Queiroz K, Bell RC. Genomic library preparation and hybridization capture of formalin-fixed tissues and allozyme supernatant for population genomics and considerations for combining capture- and RADseq-based single nucleotide polymorphism data sets. Mol Ecol Resour 2021; 22:487-502. [PMID: 34329532 DOI: 10.1111/1755-0998.13481] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Revised: 06/10/2021] [Accepted: 07/14/2021] [Indexed: 12/17/2022]
Abstract
Until recently many historical museum specimens were largely inaccessible to genomic inquiry, but high-throughput sequencing (HTS) approaches have allowed researchers to successfully sequence genomic DNA from dried and fluid-preserved museum specimens. In addition to preserved specimens, many museums contain large series of allozyme supernatant samples, but the amenability of these samples to HTS has not yet been assessed. Here, we compared the performance of a target-capture approach using alternative sources of genomic DNA from 10 specimens of spring salamanders (Plethodontidae: Gyrinophilus porphyriticus) collected between 1985 and 1990: allozyme supernatants, allozyme homogenate pellets and formalin-fixed tissues. We designed capture probes based on double-digest restriction-site associated sequencing (RADseq) derived loci from frozen blood samples available for seven of the specimens and assessed the success and consistency of capture and RADseq approaches. This study design enabled direct comparisons of data quality and potential biases among the different data sets for phylogenomic and population genomic analyses. We found that in phylogenetic analyses, all enrichment types for a given specimen clustered together. In principal component space all capture-based samples clustered together, but RADseq samples did not cluster with corresponding capture-based samples. Single nucleotide polymorphism calls were on average 18.3% different between enrichment types for a given individual, but these discrepancies were primarily due to differences in heterozygous/homozygous single nucleotide polymorphism calls. We demonstrate that both allozyme supernatant and formalin-fixed samples can be successfully used for population genomic analyses and we discuss ways to identify and reduce biases associated with combining capture and RADseq data.
Collapse
Affiliation(s)
- Kyle A O'Connell
- Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Biological Sciences, The George Washington University, Washington, District of Columbia, USA.,Biomedical Data Science Lab, Deloitte Consulting LLP, Arlington, Virginia, USA
| | - Kevin P Mulder
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,CIBIO/InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto, Vairão, Portugal.,Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, District of Columbia, USA
| | - Addison Wynn
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA
| | - Kevin de Queiroz
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA
| | - Rayna C Bell
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, USA.,Department of Herpetology, California Academy of Sciences, San Francisco, California, USA
| |
Collapse
|
41
|
Comparison of sequence-capture and ddRAD approaches in resolving species and populations in hexacorallian anthozoans. Mol Phylogenet Evol 2021; 163:107233. [PMID: 34139346 DOI: 10.1016/j.ympev.2021.107233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 05/25/2021] [Accepted: 06/11/2021] [Indexed: 10/21/2022]
Abstract
Genome-level sequencing is the next step in understanding species-level relationships within Anthozoa (soft corals, anemones, stony corals, and their kin) as morphological and PCR-directed (single-locus) sequencing methods often fall short of differentiating species. The sea anemone genus Metridium is a common northern temperate sea anemone whose species are difficult to differentiate using morphology alone. Here we use Metridium as a case study to confirm the low level of information available in six loci for species differentiation commonly sequenced for Actiniaria and explore and compare the efficacy of ddRAD and sequence-capture methods in species-level systematics and biogeographic studies. We produce phylogenetic trees from concatenated datasets and perform DAPC and STRUCTURE analyses using SNP data. The six conventional loci are not able to consistently differentiate species within Metridium. The sequence-capture dataset resulted in high support and resolution for both current species and relationships between geographic areas. The ddRAD datasets displayed ambiguity among species, and support between major geographic groupings was not as high as the sequence-capture datasets. The level of resolution and support resulting from the sequence-capture data, combined with the ability to add additional individuals and expand beyond the genus Metridium over time, emphasizes the utility of sequence-capture methods for both systematics and future biogeographic studies within anthozoans. We discuss the strengths and weaknesses of the genomic approaches in light of our findings and suggest potential implications for the biogeography of Metridium based on our sampling.
Collapse
|
42
|
Galaska MP, Wethey DS, Arias A, Dubois SF, Halanych KM, Woodin SA. The impact of aquaculture on the genetics and distribution of the onuphid annelid Diopatra biscayensis. Ecol Evol 2021; 11:6184-6194. [PMID: 34141211 PMCID: PMC8207402 DOI: 10.1002/ece3.7447] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 02/15/2021] [Accepted: 02/22/2021] [Indexed: 01/30/2023] Open
Abstract
AIM Evolutionary history of natural populations can be confounded by human intervention such as the case of decorator worm species Diopatra (Onuphidae), which have a history of being transported through anthropogenic activities. Because they build tubes and act as ecosystem engineers, they can have a large impact on the overall ecosystem in which they occur. One conspicuous member, Diopatra biscayensis, which was only described in 2012, has a fragmented distribution that includes the Bay of Biscay and the Normanno-Breton Gulf in the English Channel. This study explores the origin of these worms in the Normanno-Breton region, which has been debated to either be the result of a historic range contraction from a relic continuous population or a more recent introduction. LOCATION Northeastern Atlantic, the Bay of Biscay, and the Normanno-Breton Gulf. METHODS We utilized a RAD-tag-based SNP approach to create a reduced genomic data set to recover fine-scale population structure and infer which hypothesis best describes the D. biscayensis biogeographic distribution. The reduced genomic data set was used to calculate standard genetic diversities and genetic differentiation statistics, and utilized various clustering analyses, including PCAs, DAPC, and admixture. RESULTS Clustering analyses were consistent with D. biscayensis as a single population spanning the Bay of Biscay to the Normanno-Breton Gulf in the English Channel, although unexpected genetic substructure was recovered from Arcachon Bay, in the middle of its geographic range. Consistent with a hypothesized introduction, the isolated Sainte-Anne locality in the Normanno-Breton Gulf was recovered to be a subset of the diversity found in the rest of the Bay of Biscay. MAIN CONCLUSIONS These results are congruent with previous simulations that did not support connectivity from the Bay of Biscay to the Normanno-Breton Gulf by natural dispersal. These genomic findings, with support from previous climatic studies, further support the hypothesis that D. biscayensis phylogeographic connectivity is the result of introductions, likely through the regions' rich shellfish aquaculture, and not of a historically held range contraction.
Collapse
Affiliation(s)
- Matthew P. Galaska
- Cooperative Institute for Climate, Ocean, & Ecosystem StudiesNOAA Pacific Marine Environmental LabUniversity of WashingtonSeattleWashingtonUSA
- Department of Biological SciencesAuburn UniversityAuburnAlabamaUSA
| | - David S. Wethey
- Department of Biological SciencesUniversity of South CarolinaColumbiaSouth CarolinaUSA
| | - Andrés Arias
- Departamento de Biología de Organismos y Sistemas (Zoología)Universidad de OviedoOviedoSpain
| | | | | | - Sarah A. Woodin
- Department of Biological SciencesUniversity of South CarolinaColumbiaSouth CarolinaUSA
| |
Collapse
|
43
|
Silliman K, Indorf JL, Knowlton N, Browne WE, Hurt C. Base-substitution mutation rate across the nuclear genome of Alpheus snapping shrimp and the timing of isolation by the Isthmus of Panama. BMC Ecol Evol 2021; 21:104. [PMID: 34049492 PMCID: PMC8164322 DOI: 10.1186/s12862-021-01836-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 04/06/2021] [Indexed: 11/17/2022] Open
Abstract
Background The formation of the Isthmus of Panama and final closure of the Central American Seaway (CAS) provides an independent calibration point for examining the rate of DNA substitutions. This vicariant event has been widely used to estimate the substitution rate across mitochondrial genomes and to date evolutionary events in other taxonomic groups. Nuclear sequence data is increasingly being used to complement mitochondrial datasets for phylogenetic and evolutionary investigations; these studies would benefit from information regarding the rate and pattern of DNA substitutions derived from the nuclear genome. Results To estimate the genome-wide neutral mutation rate (µ), genotype-by-sequencing (GBS) datasets were generated for three transisthmian species pairs in Alpheus snapping shrimp. A range of bioinformatic filtering parameters were evaluated in order to minimize potential bias in mutation rate estimates that may result from SNP filtering. Using a Bayesian coalescent approach (G-PhoCS) applied to 44,960 GBS loci, we estimated µ to be 2.64E−9 substitutions/site/year, when calibrated with the closure of the CAS at 3 Ma. Post-divergence gene flow was detected in one species pair. Failure to account for this post-split migration inflates our substitution rate estimates, emphasizing the importance of demographic methods that can accommodate gene flow. Conclusions Results from our study, both parameter estimates and bioinformatic explorations, have broad-ranging implications for phylogeographic studies in other non-model taxa using reduced representation datasets. Our best estimate of µ that accounts for coalescent and demographic processes is remarkably similar to experimentally derived mutation rates in model arthropod systems. These results contradicted recent suggestions that the closure of the Isthmus was completed much earlier (around 10 Ma), as mutation rates based on an early calibration resulted in uncharacteristically low genomic mutation rates. Also, stricter filtering parameters resulted in biased datasets that generated lower mutation rate estimates and influenced demographic parameters, serving as a cautionary tale for the adherence to conservative bioinformatic strategies when generating reduced-representation datasets at the species level. To our knowledge this is the first use of transisthmian species pairs to calibrate the rate of molecular evolution from GBS data. Supplementary Information The online version contains supplementary material available at 10.1186/s12862-021-01836-3.
Collapse
Affiliation(s)
- Katherine Silliman
- School of Fisheries, Aquaculture, and Aquatic Sciences, Auburn University, Auburn, AL, 36849, USA. .,Committee on Evolutionary Biology, University of Chicago, Chicago, IL, 60637, USA.
| | - Jane L Indorf
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA
| | - Nancy Knowlton
- National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - William E Browne
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA
| | - Carla Hurt
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA.,Department of Biology, Tennessee Tech University, Cookeville, TN, 38505, USA
| |
Collapse
|
44
|
Cockburn A, Peñalba JV, Jaccoud D, Kilian A, Brouwer L, Double MC, Margraf N, Osmond HL, Kruuk LEB, van de Pol M. hiphop: Improved paternity assignment among close relatives using a simple exclusion method for biallelic markers. Mol Ecol Resour 2021; 21:1850-1865. [PMID: 33750003 DOI: 10.1111/1755-0998.13389] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 03/08/2021] [Accepted: 03/15/2021] [Indexed: 11/30/2022]
Abstract
Assignment of parentage with molecular markers is most difficult when the true parents have close relatives in the adult population. Here, we present an efficient solution to that problem by extending simple exclusion approaches to parentage analysis with single nucleotide polymorphic markers (SNPs). We augmented the previously published homozygote opposite test (hot), which counts mismatches due to the offspring and candidate parent having different homozygous genotypes, with an additional test. In this case, parents homozygous for the same SNP are incompatible with heterozygous offspring (i.e., "Homozygous Identical Parents, Heterozygous Offspring are Precluded": hiphop). We tested this approach in a cooperatively breeding bird, the superb fairy-wren, Malurus cyaneus, where rates of extra-pair paternity are exceptionally high, and where paternity assignment is challenging because breeding males typically have first-order adult relatives in their neighbourhood. Combining the tests and conditioning on the maternal genotype with a set of 1376 autosomal SNPs always allowed us to distinguish a single most likely sire from his relatives, and also to identify cases where the true sire must have been unsampled. In contrast, if just the hot test was used, we failed to identify a single most-likely sire in 2.5% of cases. Resampling enabled us to create guidelines for the number of SNPs required when first-order relatives coexist in the mating pool. Our method, implemented in the R package hiphop, therefore provides unambiguous parentage assignments even in systems with complex social organisation. We also identified a suite of Z- and W-linked SNPs that always identified sex correctly.
Collapse
Affiliation(s)
- Andrew Cockburn
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Joshua V Peñalba
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Division of Evolutionary Biology, Ludwig Maximilians Universitat Munchen, Munchen, Germany
| | - Damian Jaccoud
- Diversity Arrays Technology Pty Ltd, Bruce, ACT, Australia
| | - Andrzej Kilian
- Diversity Arrays Technology Pty Ltd, Bruce, ACT, Australia
| | - Lyanne Brouwer
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Department of Animal Ecology and Physiology, Radboud University, Nijmegen, The Netherlands
| | - Michael C Double
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Australian Antarctic Division, Kingston, TAS, Australia
| | - Nicolas Margraf
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Musée d'histoire naturelle de La Chaux-de-Fonds, Neuchatel, Switzerland
| | - Helen L Osmond
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Loeske E B Kruuk
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia
| | - Martijn van de Pol
- Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, ACT, Australia.,Netherlands Institute of Ecology, Wageningen, The Netherlands
| |
Collapse
|
45
|
Lin KP, Chaw SM, Lo YH, Kinjo T, Tung CY, Cheng HC, Liu Q, Satta Y, Izawa M, Chen SF, Ko WY. Genetic Differentiation and Demographic Trajectory of the Insular Formosan and Orii's Flying Foxes. J Hered 2021; 112:192-203. [PMID: 33675222 PMCID: PMC8006818 DOI: 10.1093/jhered/esab007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 02/24/2021] [Indexed: 12/04/2022] Open
Abstract
Insular flying foxes are keystone species in island ecosystems due to their critical roles in plant pollination and seed dispersal. These species are vulnerable to population decline because of their small populations and low reproductive rates. The Formosan flying fox (Pteropus dasymallus formosus) is one of the 5 subspecies of the Ryukyu flying fox. Pteropus dasymallus formosus has suffered from a severe decline and is currently recognized as a critically endangered population in Taiwan. On the contrary, the Orii's flying fox (Pteropus dasymallus inopinatus) is a relatively stable population inhabiting Okinawa Island. Here, we applied a genomic approach called double digest restriction-site associated DNA sequencing to study these 2 subspecies for a total of 7 individuals. We detected significant genetic structure between the 2 populations. Despite their contrasting contemporary population sizes, both populations harbor very low degrees of genetic diversity. We further inferred their demographic history based on the joint folded site frequency spectrum and revealed that both P. d. formosus and P. d. inopinatus had maintained small population sizes for a long period of time after their divergence. Recently, these populations experienced distinct trajectories of demographic changes. While P. d. formosus suffered from a drastic ~10-fold population decline not long ago, P. d. inopinatus underwent a ~4.5-fold population expansion. Our results suggest separate conservation management for the 2 populations-population recovery is urgently needed for P. d. formosus while long-term monitoring for adverse genetic effects should be considered for P. d. inopinatus.
Collapse
Affiliation(s)
- Kung-Ping Lin
- Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Shu-Miaw Chaw
- Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Yun-Hwa Lo
- Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | | | - Chien-Yi Tung
- Cancer Progression Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | | | - Quintin Liu
- Department of Evolutionary Studies of Biosystems, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan
| | - Yoko Satta
- Department of Evolutionary Studies of Biosystems, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan
| | - Masako Izawa
- Kitakyushu Museum of Natural History and Human History, Fukuoka, Japan
| | - Shiang-Fan Chen
- Center for General Education, National Taipei University, New Taipei City, Taiwan
| | - Wen-Ya Ko
- Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan
| |
Collapse
|
46
|
Ahrens CW, Jordan R, Bragg J, Harrison PA, Hopley T, Bothwell H, Murray K, Steane DA, Whale JW, Byrne M, Andrew R, Rymer PD. Regarding the F-word: The effects of data filtering on inferred genotype-environment associations. Mol Ecol Resour 2021; 21:1460-1474. [PMID: 33565725 DOI: 10.1111/1755-0998.13351] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 02/01/2021] [Accepted: 02/05/2021] [Indexed: 01/05/2023]
Abstract
Genotype-environment association (GEA) methods have become part of the standard landscape genomics toolkit, yet, we know little about how to best filter genotype-by-sequencing data to provide robust inferences for environmental adaptation. In many cases, default filtering thresholds for minor allele frequency and missing data are applied regardless of sample size, having unknown impacts on the results, negatively affecting management strategies. Here, we investigate the effects of filtering on GEA results and the potential implications for assessment of adaptation to environment. We use empirical and simulated data sets derived from two widespread tree species to assess the effects of filtering on GEA outputs. Critically, we find that the level of filtering of missing data and minor allele frequency affect the identification of true positives. Even slight adjustments to these thresholds can change the rate of true positive detection. Using conservative thresholds for missing data and minor allele frequency substantially reduces the size of the data set, lessening the power to detect adaptive variants (i.e., simulated true positives) with strong and weak strengths of selection. Regardless, strength of selection was a good predictor for GEA detection, but even some SNPs under strong selection went undetected. False positive rates varied depending on the species and GEA method, and filtering significantly impacted the predictions of adaptive capacity in downstream analyses. We make several recommendations regarding filtering for GEA methods. Ultimately, there is no filtering panacea, but some choices are better than others, depending on the study system, availability of genomic resources, and desired objectives.
Collapse
Affiliation(s)
- Collin W Ahrens
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| | | | - Jason Bragg
- Research Centre for Ecosystem Resilience, Australian Institute of Botanical Science, The Royal Botanic Garden, Sydney, NSW, Australia
| | - Peter A Harrison
- School of Natural Sciences and Australian Research Council Training Centre for Forest Value, University of Tasmania, Hobart, Tas., Australia
| | - Tara Hopley
- Department of Biodiversity, Conservation and Attractions, Biodiversity and Conservation Science, Perth, WA, Australia
| | | | - Kevin Murray
- Australian National University, Acton, ACT, Australia
| | - Dorothy A Steane
- CSIRO Land & Water, Hobart, Tas., Australia.,School of Natural Sciences and Australian Research Council Training Centre for Forest Value, University of Tasmania, Hobart, Tas., Australia
| | - John W Whale
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| | - Margaret Byrne
- Department of Biodiversity, Conservation and Attractions, Biodiversity and Conservation Science, Perth, WA, Australia
| | - Rose Andrew
- School of Environmental and Rural Science, University of New England, Armidale, NSW, Australia
| | - Paul D Rymer
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW, Australia
| |
Collapse
|
47
|
Cerca J, Maurstad MF, Rochette NC, Rivera‐Colón AG, Rayamajhi N, Catchen JM, Struck TH. Removing the bad apples: A simple bioinformatic method to improve loci‐recovery in de novo RADseq data for non‐model organisms. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13562] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- José Cerca
- Frontiers in Evolutionary Zoology Natural History MuseumUniversity of Oslo Oslo Norway
- Department of Environmental Science, Policy, and Management University of California Berkeley CA USA
- Department of Natural History NTNU University MuseumNorwegian University of Science and Technology Trondheim Norway
| | - Marius F. Maurstad
- Frontiers in Evolutionary Zoology Natural History MuseumUniversity of Oslo Oslo Norway
- Centre for Ecological and Evolutionary Synthesis University of Oslo Oslo Norway
| | - Nicolas C. Rochette
- Department of Evolution, Ecology, and Behavior University of Illinois at Urbana‐ChampaignUrbana‐Champaign IL USA
- Department of Ecology and Evolutionary Biology University of California Los Angeles CA USA
| | - Angel G. Rivera‐Colón
- Department of Evolution, Ecology, and Behavior University of Illinois at Urbana‐ChampaignUrbana‐Champaign IL USA
| | - Niraj Rayamajhi
- Department of Evolution, Ecology, and Behavior University of Illinois at Urbana‐ChampaignUrbana‐Champaign IL USA
| | - Julian M. Catchen
- Department of Evolution, Ecology, and Behavior University of Illinois at Urbana‐ChampaignUrbana‐Champaign IL USA
| | - Torsten H. Struck
- Frontiers in Evolutionary Zoology Natural History MuseumUniversity of Oslo Oslo Norway
| |
Collapse
|
48
|
Martin BT, Chafin TK, Douglas MR, Placyk JS, Birkhead RD, Phillips CA, Douglas ME. The choices we make and the impacts they have: Machine learning and species delimitation in North American box turtles (Terrapene spp.). Mol Ecol Resour 2021; 21:2801-2817. [PMID: 33566450 DOI: 10.1111/1755-0998.13350] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 01/20/2021] [Accepted: 02/05/2021] [Indexed: 12/26/2022]
Abstract
Model-based approaches that attempt to delimit species are hampered by computational limitations as well as the unfortunate tendency by users to disregard algorithmic assumptions. Alternatives are clearly needed, and machine-learning (M-L) is attractive in this regard as it functions without the need to explicitly define a species concept. Unfortunately, its performance will vary according to which (of several) bioinformatic parameters are invoked. Herein, we gauge the effectiveness of M-L-based species-delimitation algorithms by parsing 64 variably-filtered versions of a ddRAD-derived SNP data set collected from North American box turtles (Terrapene spp.). Our filtering strategies included: (i) minor allele frequencies (MAF) of 5%, 3%, 1%, and 0% (= none), and (ii) maximum missing data per-individual/per-population at 25%, 50%, 75%, and 100% (= no filtering). We found that species-delimitation via unsupervised M-L impacted the signal-to-noise ratio in our data, as well as the discordance among resolved clades. The latter may also reflect biogeographic history, gene flow, incomplete lineage sorting, or combinations thereof (as corroborated from previously observed patterns of differential introgression). Our results substantiate M-L as a viable species-delimitation method, but also demonstrate how commonly observed patterns of phylogenetic discordance can seriously impact M-L-classification.
Collapse
Affiliation(s)
- Bradley T Martin
- Department of Biological Sciences, University of Arkansas, Fayetteville, AR, USA
| | - Tyler K Chafin
- Department of Biological Sciences, University of Arkansas, Fayetteville, AR, USA
| | - Marlis R Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, AR, USA
| | - John S Placyk
- Department of Biology, University of Texas, Tyler, TX, USA.,Science Division, Trinity Valley Community College, Athens, Texas, USA
| | | | - Christopher A Phillips
- Illinois Natural History Survey, Prairie Research Institute, University of Illinois, Champaign, IL, USA
| | - Michael E Douglas
- Department of Biological Sciences, University of Arkansas, Fayetteville, AR, USA
| |
Collapse
|
49
|
Cerca J, Rivera-Colón AG, Ferreira MS, Ravinet M, Nowak MD, Catchen JM, Struck TH. Incomplete lineage sorting and ancient admixture, and speciation without morphological change in ghost-worm cryptic species. PeerJ 2021; 9:e10896. [PMID: 33614296 PMCID: PMC7879940 DOI: 10.7717/peerj.10896] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 01/13/2021] [Indexed: 12/14/2022] Open
Abstract
Morphologically similar species, that is cryptic species, may be similar or quasi-similar owing to the deceleration of morphological evolution and stasis. While the factors underlying the deceleration of morphological evolution or stasis in cryptic species remain unknown, decades of research in the field of paleontology on punctuated equilibrium have originated clear hypotheses. Species are expected to remain morphologically identical in scenarios of shared genetic variation, such as hybridization and incomplete lineage sorting, or in scenarios where bottlenecks reduce genetic variation and constrain the evolution of morphology. Here, focusing on three morphologically similar Stygocapitella species, we employ a whole-genome amplification method (WGA) coupled with double-digestion restriction-site associated DNA sequencing (ddRAD) to reconstruct the evolutionary history of the species complex. We explore population structure, use population-level statistics to determine the degree of connectivity between populations and species, and determine the most likely demographic scenarios which generally reject for recent hybridization. We find that the combination of WGA and ddRAD allowed us to obtain genomic-level data from microscopic eukaryotes (∼1 millimetre) opening up opportunities for those working with population genomics and phylogenomics in such taxa. The three species share genetic variance, likely from incomplete lineage sorting and ancient admixture. We speculate that the degree of shared variation might underlie morphological similarity in the Atlantic species complex.
Collapse
Affiliation(s)
- José Cerca
- Department of Environmental Science, Policy, and Management, University of California, University of California, Berkeley, Berkeley, CA, United States of America
- Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology, Trondheim, Norway
- Natural History Museum, University of Oslo, Oslo, Norway
| | - Angel G. Rivera-Colón
- Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Urbana Champaign, IL, United States of America
| | - Mafalda S. Ferreira
- Division of Biological Sciences, University of Montana, Missoula, MT, United States of America
- Departamento de Biologia, Universidade do Porto, Porto, Porto, Portugal
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Universidade do Porto, Porto, Porto, Portugal
| | - Mark Ravinet
- School of Life Sciences, University of Nottingham, Nottingham, United Kingdom
- Centre for Ecological and Evolutionary Synthesis, University of Oslo, Oslo, Norway
| | | | - Julian M. Catchen
- Department of Evolution, Ecology, and Behavior, University of Illinois at Urbana-Champaign, Urbana Champaign, IL, United States of America
| | | |
Collapse
|
50
|
Heller R, Nursyifa C, Garcia-Erill G, Salmona J, Chikhi L, Meisner J, Korneliussen TS, Albrechtsen A. A reference-free approach to analyse RADseq data using standard next generation sequencing toolkits. Mol Ecol Resour 2021; 21:1085-1097. [PMID: 33434329 DOI: 10.1111/1755-0998.13324] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 12/18/2020] [Accepted: 01/05/2021] [Indexed: 12/29/2022]
Abstract
Genotyping-by-sequencing methods such as RADseq are popular for generating genomic and population-scale data sets from a diverse range of organisms. These often lack a usable reference genome, restricting users to RADseq specific software for processing. However, these come with limitations compared to generic next generation sequencing (NGS) toolkits. Here, we describe and test a simple pipeline for reference-free RADseq data processing that blends de novo elements from STACKS with the full suite of state-of-the art NGS tools. Specifically, we use the de novo RADseq assembly employed by STACKS to create a catalogue of RAD loci that serves as a reference for read mapping, variant calling and site filters. Using RADseq data from 28 zebra sequenced to ~8x depth-of-coverage we evaluate our approach by comparing the site frequency spectra (SFS) to those from alternative pipelines. Most pipelines yielded similar SFS at 8x depth, but only a genotype likelihood based pipeline performed similarly at low sequencing depth (2-4x). We compared the RADseq SFS with medium-depth (~13x) shotgun sequencing of eight overlapping samples, revealing that the RADseq SFS was persistently slightly skewed towards rare and invariant alleles. Using simulations and human data we confirm that this is expected when there is allelic dropout (AD) in the RADseq data. AD in the RADseq data caused a heterozygosity deficit of ~16%, which dropped to ~5% after filtering AD. Hence, AD was the most important source of bias in our RADseq data.
Collapse
Affiliation(s)
- Rasmus Heller
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Casia Nursyifa
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Genís Garcia-Erill
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Jordi Salmona
- CNRS, Université Paul Sabatier, ENFA, UMR 5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France
| | - Lounes Chikhi
- CNRS, Université Paul Sabatier, ENFA, UMR 5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France.,Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | - Jonas Meisner
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | | | - Anders Albrechtsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| |
Collapse
|