1
|
Kazilas C, Dufresnes C, France J, Kalaentzis K, Martínez-Solano I, de Visser MC, Arntzen JW, Wielstra B. Spatial genetic structure in European marbled newts revealed with target enrichment by sequence capture. Mol Phylogenet Evol 2024; 194:108043. [PMID: 38382821 DOI: 10.1016/j.ympev.2024.108043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 12/21/2023] [Accepted: 02/18/2024] [Indexed: 02/23/2024]
Abstract
European marbled newts come in two species that have abutting ranges. The northern species, Triturus marmoratus, is found in France and the northern part of the Iberian Peninsula, whereas the southern species, T. pygmaeus, is found in the southwestern corner of the Iberian Peninsula. We study the intraspecific genetic differentiation of the group because morphological data show geographical variation and because the Iberian Peninsula is a recognized center of speciation and intraspecific genetic diversity for all kinds of organisms, amphibians included. We use target enrichment by sequence capture to generate c. 7 k nuclear DNA markers. We observe limited genetic exchange between the species, which confirms their distinctiveness. Both species show substantial genetic structuring that is only in part mirrored by morphological variation. Genetically differentiated groups are found in the south (T. marmoratus) and west (T. pygmaeus) of the species ranges. Our observations highlight the position of the Iberian Peninsula as a hotspot for genetic differentiation.
Collapse
Affiliation(s)
- Christos Kazilas
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands.
| | - Christophe Dufresnes
- LASER, College of Biology and the Environment, Nanjing Forestry University, Nanjing, People's Republic of China; Institut de Systématique, Evolution, Biodiversité (ISYEB), Muséum national d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - James France
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands
| | - Konstantinos Kalaentzis
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands
| | - Iñigo Martínez-Solano
- Museo Nacional de Ciencias Naturales, MNCN-CSIC, c/ José Gutiérrez Abascal 2, 28006 Madrid, Spain
| | - Manon C de Visser
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands
| | - Jan W Arntzen
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands
| | - Ben Wielstra
- Naturalis Biodiversity Center, P.O. Box 9517, 2300 RA Leiden, The Netherlands; Institute of Biology Leiden, Leiden University, P.O. Box 9505, 2300 RA Leiden, The Netherlands
| |
Collapse
|
2
|
Naranjo AA, Edwards CE, Gitzendanner MA, Soltis DE, Soltis PS. Abundant incongruence in a clade endemic to a biodiversity hotspot: Phylogenetics of the scrub mint clade (Lamiaceae). Mol Phylogenet Evol 2024; 192:108014. [PMID: 38199595 DOI: 10.1016/j.ympev.2024.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Revised: 12/26/2023] [Accepted: 01/06/2024] [Indexed: 01/12/2024]
Abstract
The Scrub Mint clade(Lamiaceae) provides a unique system for investigating the evolutionary processes driving diversification in the North American Coastal Plain from both a systematic and biogeographic context. The clade comprisesDicerandra, Conradina, Piloblephis, Stachydeoma, and four species of the broadly defined genus Clinopodium(Mentheae; Lamiaceae), almost all of which are endemic to the North American Eastern Coastal Plain. Most species of this clade are threatened or endangered and restricted to sandhill or a mosaic of scrub habitats. We analyzed relationships in this clade to understand the evolution of the group and identify evolutionary mechanisms acting on the clade, with important implications for conservation. We used a target-capture method to sequence and analyze 238 nuclear loci across all species of scrub mints, reconstructed the phylogeny, and calculated gene tree concordance, gene tree estimation error, and reticulation indices for every node in the tree using ML methods. Phylogenetic networks were used to determine reticulation events. Our nuclear phylogenetic estimates were consistent with previous results, while greatly increasing the robustness of taxon sampling. The phylogeny resolved the full relationship between Dicerandra and Conradina and the less-studied members of the clade (Piloblephis, Stachydeoma, Clinopodium spp.). We found hotspots of gene tree discordance and reticulation throughout the tree, especially in perennial Dicerandra. Several instances of reticulation events were uncovered between annual and perennial Dicerandra, and within the Conradina + allies clade. Incomplete lineage sorting also likely contributed to phylogenetic discordance. These results clarify phylogenetic relationships in the clade and provide insight on important evolutionary drivers in the clade, such as hybridization. General relationships in the group were confirmed, while the large amount of gene tree discordance is likely due to reticulation across the phylogeny.
Collapse
Affiliation(s)
- Andre A Naranjo
- Institute of Environment, Department of Biological Sciences, Florida International University, 11200 SW 8th ST, Miami, FL 33199, USA; Florida Museum of Natural History, University of Florida, 1659 Museum Road, PO Box 117800, Gainesville, FL 32611-7800, USA.
| | | | - Matthew A Gitzendanner
- Department of Biology, University of Florida, PO Box 118526, Gainesville, FL 32611-8526, USA
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, 1659 Museum Road, PO Box 117800, Gainesville, FL 32611-7800, USA; Department of Biology, University of Florida, PO Box 118526, Gainesville, FL 32611-8526, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, 1659 Museum Road, PO Box 117800, Gainesville, FL 32611-7800, USA
| |
Collapse
|
3
|
Featherstone LA, McGaughran A. The effect of missing data on evolutionary analysis of sequence capture bycatch, with application to an agricultural pest. Mol Genet Genomics 2024; 299:11. [PMID: 38381254 PMCID: PMC10881687 DOI: 10.1007/s00438-024-02097-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 12/29/2023] [Indexed: 02/22/2024]
Abstract
Sequence capture is a genomic technique that selectively enriches target sequences before high throughput next-generation sequencing, to generate specific sequences of interest. Off-target or 'bycatch' data are often discarded from capture experiments, but can be leveraged to address evolutionary questions under some circumstances. Here, we investigated the effects of missing data on a variety of evolutionary analyses using bycatch from an exon capture experiment on the global pest moth, Helicoverpa armigera. We added > 200 new samples from across Australia in the form of mitogenomes obtained as bycatch from targeted sequence capture, and combined these into an additional larger dataset to total > 1000 mitochondrial cytochrome c oxidase subunit I (COI) sequences across the species' global distribution. Using discriminant analysis of principal components and Bayesian coalescent analyses, we showed that mitogenomes assembled from bycatch with up to 75% missing data were able to return evolutionary inferences consistent with higher coverage datasets and the broader literature surrounding H. armigera. For example, low-coverage sequences broadly supported the delineation of two H. armigera subspecies and also provided new insights into the potential for geographic turnover among these subspecies. However, we also identified key effects of dataset coverage and composition on our results. Thus, low-coverage bycatch data can offer valuable information for population genetic and phylodynamic analyses, but caution is required to ensure the reduced information does not introduce confounding factors, such as sampling biases, that drive inference. We encourage more researchers to consider maximizing the potential of the targeted sequence approach by examining evolutionary questions with their off-target bycatch where possible-especially in cases where no previous mitochondrial data exists-but recommend stratifying data at different genome coverage thresholds to separate sampling effects from genuine genomic signals, and to understand their implications for evolutionary research.
Collapse
Affiliation(s)
- Leo A Featherstone
- Research School of Biology, Division of Ecology and Evolution, Australian National University, Canberra, ACT, 2601, Australia
- Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC, 3000, Australia
| | - Angela McGaughran
- Research School of Biology, Division of Ecology and Evolution, Australian National University, Canberra, ACT, 2601, Australia.
- Te Aka Mātuatua, School of Science, University of Waikato, Private Bag 3105, Hamilton, 3240, New Zealand.
| |
Collapse
|
4
|
Arpin KE, Schmidt DA, Sjodin BMF, Einfeldt AL, Galbreath K, Russello MA. Evaluating genotyping-in-thousands by sequencing as a genetic monitoring tool for a climate sentinel mammal using non-invasive and archival samples. Ecol Evol 2024; 14:e10934. [PMID: 38333095 PMCID: PMC10850814 DOI: 10.1002/ece3.10934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 12/01/2023] [Accepted: 01/19/2024] [Indexed: 02/10/2024] Open
Abstract
Genetic tools for wildlife monitoring can provide valuable information on spatiotemporal population trends and connectivity, particularly in systems experiencing rapid environmental change. Multiplexed targeted amplicon sequencing techniques, such as genotyping-in-thousands by sequencing (GT-seq), can provide cost-effective approaches for collecting genetic data from low-quality and quantity DNA samples, making them potentially useful for long-term wildlife monitoring using non-invasive and archival samples. Here, we developed a GT-seq panel as a potential monitoring tool for the American pika (Ochotona princeps) and evaluated its performance when applied to traditional, non-invasive, and archival samples, respectively. Specifically, we optimized a GT-seq panel (307 single nucleotide polymorphisms (SNPs)) that included neutral, sex-associated, and putatively adaptive SNPs using contemporary tissue samples (n = 77) from the Northern Rocky Mountains lineage of American pikas. The panel demonstrated high genotyping success (94.7%), low genotyping error (0.001%), and excellent performance identifying individuals, sex, relatedness, and population structure. We subsequently applied the GT-seq panel to archival tissue (n = 17) and contemporary fecal pellet samples (n = 129) collected within the Canadian Rocky Mountains to evaluate its effectiveness. Although the panel demonstrated high efficacy with archival tissue samples (90.5% genotyping success, 0.0% genotyping error), this was not the case for the fecal pellet samples (79.7% genotyping success, 28.4% genotyping error) likely due to the exceptionally low quality/quantity of recovered DNA using the approaches implemented. Overall, our study reinforced GT-seq as an effective tool using contemporary and archival tissue samples, providing future opportunities for temporal applications using historical specimens. Our results further highlight the need for additional optimization of sample and genetic data collection techniques prior to broader-scale implementation of a non-invasive genetic monitoring tool for American pikas.
Collapse
Affiliation(s)
- Kate E. Arpin
- Department of BiologyThe University of British ColumbiaKelownaBritish ColumbiaCanada
| | - Danielle A. Schmidt
- Department of BiologyThe University of British ColumbiaKelownaBritish ColumbiaCanada
| | - Bryson M. F. Sjodin
- Department of BiologyThe University of British ColumbiaKelownaBritish ColumbiaCanada
| | | | - Kurt Galbreath
- Department of BiologyNorthern Michigan UniversityMarquetteMichiganUSA
| | - Michael A. Russello
- Department of BiologyThe University of British ColumbiaKelownaBritish ColumbiaCanada
| |
Collapse
|
5
|
Mulder KP, Savage AE, Gratwicke B, Longcore JE, Bronikowski E, Evans M, Longo AV, Kurata NP, Walsh T, Pasmans F, McInerney N, Murray S, Martel A, Fleischer RC. Sequence capture identifies fastidious chytrid fungi directly from host tissue. Fungal Genet Biol 2024; 170:103858. [PMID: 38101696 DOI: 10.1016/j.fgb.2023.103858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 12/04/2023] [Accepted: 12/12/2023] [Indexed: 12/17/2023]
Abstract
The chytrid fungus Batrachochytrium dendrobatidis (Bd) was discovered in 1998 as the cause of chytridiomycosis, an emerging infectious disease causing mass declines in amphibian populations worldwide. The rapid population declines of the 1970s-1990s were likely caused by the spread of a highly virulent lineage belonging to the Bd-GPL clade that was introduced to naïve susceptible populations. Multiple genetically distinct and regional lineages of Bd have since been isolated and sequenced, greatly expanding the known biological diversity within this fungal pathogen. To date, most Bd research has been restricted to the limited number of samples that could be isolated using culturing techniques, potentially causing a selection bias for strains that can grow on media and missing other unculturable or fastidious strains that are also present on amphibians. We thus attempted to characterize potentially non-culturable genetic lineages of Bd from distinct amphibian taxa using sequence capture technology on DNA extracted from host tissue and swabs. We focused our efforts on host taxa from two different regions that likely harbored distinct Bd clades: (1) wild-caught leopard frogs (Rana) from North America, and (2) a Japanese Giant Salamander (Andrias japonicus) at the Smithsonian Institution's National Zoological Park that exhibited signs of disease and tested positive for Bd using qPCR, but multiple attempts failed to isolate and culture the strain for physiological and genetic characterization. We successfully enriched for and sequenced thousands of fungal genes from both host clades, and Bd load was positively associated with number of recovered Bd sequences. Phylogenetic reconstruction placed all the Rana-derived strains in the Bd-GPL clade. In contrast, the A. japonicus strain fell within the Bd-Asia3 clade, expanding the range of this clade and generating additional genomic data to confirm its placement. The retrieved ITS locus matched public barcoding data from wild A. japonicus and Bd infections found on other amphibians in India and China, suggesting that this uncultured clade is widespread across Asia. Our study underscores the importance of recognizing and characterizing the hidden diversity of fastidious strains in order to reconstruct the spatiotemporal and evolutionary history of Bd. The success of the sequence capture approach highlights the utility of directly sequencing pathogen DNA from host tissue to characterize cryptic diversity that is missed by culture-reliant approaches.
Collapse
Affiliation(s)
- Kevin P Mulder
- Wildlife Health Ghent, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium; Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA.
| | - Anna E Savage
- Department of Biology, University of Central Florida, Orlando, FL, USA
| | - Brian Gratwicke
- Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Joyce E Longcore
- School of Biology and Ecology, University of Maine, Orono, ME, USA
| | - Ed Bronikowski
- Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Matthew Evans
- Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Ana V Longo
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Naoko P Kurata
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA; Department of Natural Resources and the Environment, Cornell University, Ithaca, NY, USA; Department of Ichthyology, American Museum of Natural History, New York, NY, USA
| | - Tim Walsh
- Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Frank Pasmans
- Wildlife Health Ghent, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Nancy McInerney
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - Suzan Murray
- Smithsonian's National Zoo and Conservation Biology Institute, Washington, DC, USA
| | - An Martel
- Wildlife Health Ghent, Faculty of Veterinary Medicine, Ghent University, Merelbeke, Belgium
| | - Robert C Fleischer
- Center for Conservation Genomics, Smithsonian National Zoo and Conservation Biology Institute, Washington, DC, USA
| |
Collapse
|
6
|
Benham PM, Walsh J, Bowie RCK. Spatial variation in population genomic responses to over a century of anthropogenic change within a tidal marsh songbird. GLOBAL CHANGE BIOLOGY 2024; 30:e17126. [PMID: 38273486 DOI: 10.1111/gcb.17126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 11/22/2023] [Accepted: 12/13/2023] [Indexed: 01/27/2024]
Abstract
Combating the current biodiversity crisis requires the accurate documentation of population responses to human-induced ecological change. However, our ability to pinpoint population responses to human activities is often limited to the analysis of populations studied well after the fact. Museum collections preserve a record of population responses to anthropogenic change that can provide critical baseline data on patterns of genetic diversity, connectivity, and population structure prior to the onset of human perturbation. Here, we leverage a spatially replicated time series of specimens to document population genomic responses to the destruction of nearly 90% of coastal habitats occupied by the Savannah sparrow (Passerculus sandwichensis) in California. We sequenced 219 sparrows collected from 1889 to 2017 across the state of California using an exome capture approach. Spatial-temporal analyses of genetic diversity found that the amount of habitat lost was not predictive of genetic diversity loss. Sparrow populations from southern California historically exhibited lower levels of genetic diversity and experienced the most significant temporal declines in genetic diversity. Despite experiencing the greatest levels of habitat loss, we found that genetic diversity in the San Francisco Bay area remained relatively high. This was potentially related to an observed increase in gene flow into the Bay Area from other populations. While gene flow may have minimized genetic diversity declines, we also found that immigration from inland freshwater-adapted populations into tidal marsh populations led to the erosion of divergence at loci associated with tidal marsh adaptation. Shifting patterns of gene flow through time in response to habitat loss may thus contribute to negative fitness consequences and outbreeding depression. Together, our results underscore the importance of tracing the genomic trajectories of multiple populations over time to address issues of fundamental conservation concern.
Collapse
Affiliation(s)
- Phred M Benham
- Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, California, USA
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, USA
| | - Jennifer Walsh
- Fuller Evolutionary Biology Program, Cornell Lab of Ornithology, Cornell University, Ithaca, New York, USA
| | - Rauri C K Bowie
- Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, California, USA
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, USA
| |
Collapse
|
7
|
Li J, Han G, Tian X, Liang D, Zhang P. UPrimer: A Clade-Specific Primer Design Program Based on Nested-PCR Strategy and Its Applications in Amplicon Capture Phylogenomics. Mol Biol Evol 2023; 40:msad230. [PMID: 37832226 PMCID: PMC10630340 DOI: 10.1093/molbev/msad230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 09/12/2023] [Accepted: 10/09/2023] [Indexed: 10/15/2023] Open
Abstract
Amplicon capture is a promising target sequence capture approach for phylogenomic analyses, and the design of clade-specific nuclear protein-coding locus (NPCL) amplification primers is crucial for its successful application. In this study, we developed a primer design program called UPrimer that can quickly design clade-specific NPCL amplification primers based on genome data, without requiring manual intervention. Unlike other available primer design programs, UPrimer uses a nested-PCR strategy that greatly improves the amplification success rate of the designed primers. We examined all available metazoan genome data deposited in NCBI and developed NPCL primer sets for 21 metazoan groups with UPrimer, covering a wide range of taxa, including arthropods, mollusks, cnidarians, echinoderms, and vertebrates. On average, each clade-specific NPCL primer set comprises ∼1,000 NPCLs. PCR amplification tests were performed in 6 metazoan groups, and the developed primers showed a PCR success rate exceeding 95%. Furthermore, we demonstrated a phylogenetic case study in Lepidoptera, showing how NPCL primers can be used for phylogenomic analyses with amplicon capture. Our results indicated that using 100 NPCL probes recovered robust high-level phylogenetic relationships among butterflies, highlighting the utility of the newly designed NPCL primer sets for phylogenetic studies. We anticipate that the automated tool UPrimer and the developed NPCL primer sets for 21 metazoan groups will enable researchers to obtain phylogenomic data more efficiently and cost-effectively and accelerate the resolution of various parts of the Tree of Life.
Collapse
Affiliation(s)
- JiaXuan Li
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - GuangCheng Han
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Xiao Tian
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Dan Liang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| | - Peng Zhang
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou 510275, China
| |
Collapse
|
8
|
Karunarathne P, Zhou Q, Schliep K, Milesi P. A comprehensive framework for detecting copy number variants from single nucleotide polymorphism data: 'rCNV', a versatile r package for paralogue and CNV detection. Mol Ecol Resour 2023; 23:1772-1789. [PMID: 37515483 DOI: 10.1111/1755-0998.13843] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 07/04/2023] [Accepted: 07/07/2023] [Indexed: 07/31/2023]
Abstract
Recent studies have highlighted the significant role of copy number variants (CNVs) in phenotypic diversity, environmental adaptation and species divergence across eukaryotes. The presence of CNVs also has the potential to introduce genotyping biases, which can pose challenges to accurate population and quantitative genetic analyses. However, detecting CNVs in genomes, particularly in non-model organisms, presents a formidable challenge. To address this issue, we have developed a statistical framework and an accompanying r software package that leverage allelic-read depth from single nucleotide polymorphism (SNP) data for accurate CNV detection. Our framework capitalises on two key principles. First, it exploits the distribution of allelic-read depth ratios in heterozygotes for individual SNPs by comparing it against an expected distribution based on binomial sampling. Second, it identifies SNPs exhibiting an apparent excess of heterozygotes under Hardy-Weinberg equilibrium. By employing multiple statistical tests, our method not only enhances sensitivity to sampling effects but also effectively addresses reference biases, resulting in optimised SNP classification. Our framework is compatible with various NGS technologies (e.g. RADseq, Exome-capture). This versatility enables CNV calling from genomes of diverse complexities. To streamline the analysis process, we have implemented our framework in the user-friendly r package 'rCNV', which automates the entire workflow seamlessly. We trained our models using simulated data and validated their performance on four datasets derived from different sequencing technologies, including RADseq (Chinook salmon-Oncorhynchus tshawytscha), Rapture (American lobster-Homarus americanus), Exome-capture (Norway spruce-Picea abies) and WGS (Malaria mosquito-Anopheles gambiae).
Collapse
Affiliation(s)
- Piyal Karunarathne
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala, Sweden
- Institute of Population Genetics, Heinrich Heine University, Düsseldorf, Germany
| | - Qiujie Zhou
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala, Sweden
| | - Klaus Schliep
- Institute of Computational Biotechnology, Graz University of Technology, Graz, Austria
| | - Pascal Milesi
- Plant Ecology and Evolution, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala, Sweden
| |
Collapse
|
9
|
Terbot JW, Cooper BS, Good JM, Jensen JD. A Simulation Framework for Modeling the Within-Patient Evolutionary Dynamics of SARS-CoV-2. Genome Biol Evol 2023; 15:evad204. [PMID: 37950882 PMCID: PMC10664409 DOI: 10.1093/gbe/evad204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Revised: 10/31/2023] [Accepted: 11/07/2023] [Indexed: 11/13/2023] Open
Abstract
The global impact of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has led to considerable interest in detecting novel beneficial mutations and other genomic changes that may signal the development of variants of concern (VOCs). The ability to accurately detect these changes within individual patient samples is important in enabling early detection of VOCs. Such genomic scans for rarely acting positive selection are best performed via comparison of empirical data with simulated data wherein commonly acting evolutionary factors, including mutation and recombination, reproductive and infection dynamics, and purifying and background selection, can be carefully accounted for and parameterized. Although there has been work to quantify these factors in SARS-CoV-2, they have yet to be integrated into a baseline model describing intrahost evolutionary dynamics. To construct such a baseline model, we develop a simulation framework that enables one to establish expectations for underlying levels and patterns of patient-level variation. By varying eight key parameters, we evaluated 12,096 different model-parameter combinations and compared them with existing empirical data. Of these, 592 models (∼5%) were plausible based on the resulting mean expected number of segregating variants. These plausible models shared several commonalities shedding light on intrahost SARS-CoV-2 evolutionary dynamics: severe infection bottlenecks, low levels of reproductive skew, and a distribution of fitness effects skewed toward strongly deleterious mutations. We also describe important areas of model uncertainty and highlight additional sequence data that may help to further refine a baseline model. This study lays the groundwork for the improved analysis of existing and future SARS-CoV-2 within-patient data.
Collapse
Affiliation(s)
- John W Terbot
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, Arizona, USA
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Brandon S Cooper
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Jeffrey M Good
- Division of Biological Sciences, University of Montana, Missoula, Montana, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, Arizona, USA
| |
Collapse
|
10
|
Yu PL, Fulton JC, Hudson OH, Huguet-Tapia JC, Brawner JT. Next-generation fungal identification using target enrichment and Nanopore sequencing. BMC Genomics 2023; 24:581. [PMID: 37784013 PMCID: PMC10544392 DOI: 10.1186/s12864-023-09691-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 09/21/2023] [Indexed: 10/04/2023] Open
Abstract
BACKGROUND Rapid and accurate pathogen identification is required for disease management. Compared to sequencing entire genomes, targeted sequencing may be used to direct sequencing resources to genes of interest for microbe identification and mitigate the low resolution that single-locus molecular identification provides. This work describes a broad-spectrum fungal identification tool developed to focus high-throughput Nanopore sequencing on genes commonly employed for disease diagnostics and phylogenetic inference. RESULTS Orthologs of targeted genes were extracted from 386 reference genomes of fungal species spanning six phyla to identify homologous regions that were used to design the baits used for enrichment. To reduce the cost of producing probes without diminishing the phylogenetic power, DNA sequences were first clustered, and then consensus sequences within each cluster were identified to produce 26,000 probes that targeted 114 genes. To test the efficacy of our probes, we applied the technique to three species representing Ascomycota and Basidiomycota fungi. The efficiency of enrichment, quantified as mean target coverage over the mean genome-wide coverage, ranged from 200 to 300. Furthermore, enrichment of long reads increased the depth of coverage across the targeted genes and into non-coding flanking sequence. The assemblies generated from enriched samples provided well-resolved phylogenetic trees for taxonomic assignment and molecular identification. CONCLUSIONS Our work provides data to support the utility of targeted Nanopore sequencing for fungal identification and provides a platform that may be extended for use with other phytopathogens.
Collapse
Affiliation(s)
- Pei-Ling Yu
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - James C Fulton
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
- Florida Department of Agriculture and Consumer Services, Division of Plant Industry, Gainesville, FL, 32608, USA
| | - Owen H Hudson
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - Jose C Huguet-Tapia
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA
| | - Jeremy T Brawner
- Department of Plant Pathology, University of Florida, Gainesville, FL, 32611, USA.
| |
Collapse
|
11
|
Thorburn DMJ, Sagonas K, Binzer-Panchal M, Chain FJJ, Feulner PGD, Bornberg-Bauer E, Reusch TBH, Samonte-Padilla IE, Milinski M, Lenz TL, Eizaguirre C. Origin matters: Using a local reference genome improves measures in population genomics. Mol Ecol Resour 2023; 23:1706-1723. [PMID: 37489282 DOI: 10.1111/1755-0998.13838] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 05/10/2023] [Accepted: 06/02/2023] [Indexed: 07/26/2023]
Abstract
Genome sequencing enables answering fundamental questions about the genetic basis of adaptation, population structure and epigenetic mechanisms. Yet, we usually need a suitable reference genome for mapping population-level resequencing data. In some model systems, multiple reference genomes are available, giving the challenging task of determining which reference genome best suits the data. Here, we compared the use of two different reference genomes for the three-spined stickleback (Gasterosteus aculeatus), one novel genome derived from a European gynogenetic individual and the published reference genome of a North American individual. Specifically, we investigated the impact of using a local reference versus one generated from a distinct lineage on several common population genomics analyses. Through mapping genome resequencing data of 60 sticklebacks from across Europe and North America, we demonstrate that genetic distance among samples and the reference genomes impacts downstream analyses. Using a local reference genome increased mapping efficiency and genotyping accuracy, effectively retaining more and better data. Despite comparable distributions of the metrics generated across the genome using SNP data (i.e. π, Tajima's D and FST ), window-based statistics using different references resulted in different outlier genes and enriched gene functions. A marker-based analysis of DNA methylation distributions had a comparably high overlap in outlier genes and functions, yet with distinct differences depending on the reference genome. Overall, our results highlight how using a local reference genome decreases reference bias to increase confidence in downstream analyses of the data. Such results have significant implications in all reference-genome-based population genomic analyses.
Collapse
Affiliation(s)
- Doko-Miles J Thorburn
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
- Department of Life Sciences, Imperial College London, London, UK
| | - Kostas Sagonas
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
- Department of Zoology, School of Biology, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Mahesh Binzer-Panchal
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, National Bioinformatics Infrastructure Sweden (NBIS), Uppsala University, Uppsala, Sweden
| | - Frederic J J Chain
- Department of Biological Sciences, University of Massachusetts Lowell, Lowell, Massachusetts, USA
| | - Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland
- Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Erich Bornberg-Bauer
- Evolutionary Bioinformatics, Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Thorsten B H Reusch
- Marine Evolutionary Ecology, GEOMAR Helmholtz Centre for Ocean Research, Kiel, Germany
| | - Irene E Samonte-Padilla
- Department of Evolutionary Ecology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Manfred Milinski
- Department of Evolutionary Ecology, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Tobias L Lenz
- Research Group for Evolutionary Immunogenomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Research Unit for Evolutionary Immunogenomics, Department of Biology, University of Hamburg, Hamburg, Germany
| | - Christophe Eizaguirre
- School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
| |
Collapse
|
12
|
Folk RA, Gaynor ML, Engle-Wrye NJ, O’Meara BC, Soltis PS, Soltis DE, Guralnick RP, Smith SA, Grady CJ, Okuyama Y. Identifying Climatic Drivers of Hybridization with a New Ancestral Niche Reconstruction Method. Syst Biol 2023; 72:856-873. [PMID: 37073863 PMCID: PMC10405357 DOI: 10.1093/sysbio/syad018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 03/23/2023] [Accepted: 04/17/2023] [Indexed: 04/20/2023] Open
Abstract
Applications of molecular phylogenetic approaches have uncovered evidence of hybridization across numerous clades of life, yet the environmental factors responsible for driving opportunities for hybridization remain obscure. Verbal models implicating geographic range shifts that brought species together during the Pleistocene have often been invoked, but quantitative tests using paleoclimatic data are needed to validate these models. Here, we produce a phylogeny for Heuchereae, a clade of 15 genera and 83 species in Saxifragaceae, with complete sampling of recognized species, using 277 nuclear loci and nearly complete chloroplast genomes. We then employ an improved framework with a coalescent simulation approach to test and confirm previous hybridization hypotheses and identify one new intergeneric hybridization event. Focusing on the North American distribution of Heuchereae, we introduce and implement a newly developed approach to reconstruct potential past distributions for ancestral lineages across all species in the clade and across a paleoclimatic record extending from the late Pliocene. Time calibration based on both nuclear and chloroplast trees recovers a mid- to late-Pleistocene date for most inferred hybridization events, a timeframe concomitant with repeated geographic range restriction into overlapping refugia. Our results indicate an important role for past episodes of climate change, and the contrasting responses of species with differing ecological strategies, in generating novel patterns of range contact among plant communities and therefore new opportunities for hybridization. The new ancestral niche method flexibly models the shape of niche while incorporating diverse sources of uncertainty and will be an important addition to the current comparative methods toolkit. [Ancestral niche reconstruction; hybridization; paleoclimate; pleistocene.].
Collapse
Affiliation(s)
- Ryan A Folk
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Michelle L Gaynor
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Department of Biology, University of Florida, Gainesville, FL, USA
| | - Nicholas J Engle-Wrye
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS, USA
| | - Brian C O’Meara
- Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, TN, USA
| | - Pamela S Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Douglas E Soltis
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Department of Biology, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Robert P Guralnick
- Florida Museum of Natural History, University of Florida, Gainesville, FL, USA
- Biodiversity Institute, University of Florida, Gainesville, FL, USA
| | - Stephen A Smith
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, USA
| | - Charles J Grady
- Biodiversity Institute, University of Kansas, Lawrence, KS, 66045, USA
| | - Yudai Okuyama
- Tsukuba Botanical Garden, National Museum of Nature and Science, Tsukuba, Japan
| |
Collapse
|
13
|
Mcguire JA, Huang X, Reilly SB, Iskandar DT, Wang-Claypool CY, Werning S, Chong RA, Lawalata SZS, Stubbs AL, Frederick JH, Brown RM, Evans BJ, Arifin U, Riyanto A, Hamidy A, Arida E, Koo MS, Supriatna J, Andayani N, Hall R. Species Delimitation, Phylogenomics, and Biogeography of Sulawesi Flying Lizards: A Diversification History Complicated by Ancient Hybridization, Cryptic Species, and Arrested Speciation. Syst Biol 2023; 72:885-911. [PMID: 37074804 PMCID: PMC10405571 DOI: 10.1093/sysbio/syad020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 03/14/2023] [Accepted: 04/13/2023] [Indexed: 04/20/2023] Open
Abstract
The biota of Sulawesi is noted for its high degree of endemism and for its substantial levels of in situ biological diversification. While the island's long period of isolation and dynamic tectonic history have been implicated as drivers of the regional diversification, this has rarely been tested in the context of an explicit geological framework. Here, we provide a tectonically informed biogeographical framework that we use to explore the diversification history of Sulawesi flying lizards (the Draco lineatus Group), a radiation that is endemic to Sulawesi and its surrounding islands. We employ a framework for inferring cryptic speciation that involves phylogeographic and genetic clustering analyses as a means of identifying potential species followed by population demographic assessment of divergence-timing and rates of bi-directional migration as means of confirming lineage independence (and thus species status). Using this approach, phylogenetic and population genetic analyses of mitochondrial sequence data obtained for 613 samples, a 50-SNP data set for 370 samples, and a 1249-locus exon-capture data set for 106 samples indicate that the current taxonomy substantially understates the true number of Sulawesi Draco species, that both cryptic and arrested speciations have taken place, and that ancient hybridization confounds phylogenetic analyses that do not explicitly account for reticulation. The Draco lineatus Group appears to comprise 15 species-9 on Sulawesi proper and 6 on peripheral islands. The common ancestor of this group colonized Sulawesi ~11 Ma when proto-Sulawesi was likely composed of two ancestral islands, and began to radiate ~6 Ma as new islands formed and were colonized via overwater dispersal. The enlargement and amalgamation of many of these proto-islands into modern Sulawesi, especially during the past 3 Ma, set in motion dynamic species interactions as once-isolated lineages came into secondary contact, some of which resulted in lineage merger, and others surviving to the present. [Genomics; Indonesia; introgression; mitochondria; phylogenetics; phylogeography; population genetics; reptiles.].
Collapse
Affiliation(s)
- Jimmy A Mcguire
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
| | - Xiaoting Huang
- College of Marine Life Sciences, Ocean University of China, No. 5 Yushan Road, Qindao, Shandong, 266003, PR China
| | - Sean B Reilly
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA 95060, USA
| | - Djoko T Iskandar
- School of Life Sciences and Technology, Institut Teknologi Bandung, Bandung, Indonesia
| | - Cynthia Y Wang-Claypool
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
| | - Sarah Werning
- Department of Anatomy, Des Moines University, 3200 Grand Avenue, Des Moines, IA 50312-4198, USA
| | - Rebecca A Chong
- Department of Biology, University of Hawaii at Manoa, Honolulu, HI 96822, USA
| | - Shobi Z S Lawalata
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
- United in Diversity Foundation, Jalan Hayam Wuruk, Jakarta, Indonesia
| | - Alexander L Stubbs
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
| | - Jeffrey H Frederick
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- Department of Integrative Biology, University of California, Berkeley, CA 94720, USA
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, 1345 Jayhawk Blvd., University of Kansas, Lawrence, KS 66045, USA
| | - Ben J Evans
- Biology Department, McMaster University, Hamilton, Ontario, Canada
| | - Umilaela Arifin
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
- School of Life Sciences and Technology, Institut Teknologi Bandung, Bandung, Indonesia
- Center for Taxonomy and Morphology, Zoologisches Museum Hamburg, Leibniz Institute for the Analysis of Biodiversity Change, Martin-Luther-King-Platz 3, R230 20146 Hamburg, Germany
| | - Awal Riyanto
- Laboratory of Herpetology, Museum Zoologicum Bogoriense, Research Center for Biosystematics and Evolution, National Research and Innovation Agency of Indonesia (BRIN), Cibinong 16911, Indonesia
| | - Amir Hamidy
- Laboratory of Herpetology, Museum Zoologicum Bogoriense, Research Center for Biosystematics and Evolution, National Research and Innovation Agency of Indonesia (BRIN), Cibinong 16911, Indonesia
| | - Evy Arida
- Research Center for Applied Zoology, National Research and Innovation Agency of Indonesia (BRIN), Cibinong 16911, Indonesia
| | - Michelle S Koo
- Museum of Vertebrate Zoology, University of California, Berkeley, CA 94720, USA
| | - Jatna Supriatna
- Department of Biology, Institute for Sustainable Earth and Resources (I-SER), Gedung Laboratorium Multidisiplin, and Research Center for Climate Change (RCCC-UI), Gedung Laboratorium Multidisiplin, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Depok 16424, Indonesia
| | - Noviar Andayani
- Department of Biology, Institute for Sustainable Earth and Resources (I-SER), Gedung Laboratorium Multidisiplin, and Research Center for Climate Change (RCCC-UI), Gedung Laboratorium Multidisiplin, Faculty of Mathematics and Natural Sciences, Universitas Indonesia, Depok 16424, Indonesia
| | - Robert Hall
- SE Asia Research Group (SEARG), Department of Earth Sciences, Royal Holloway University of London, Egham, Surrey TW20 0EX, UK
| |
Collapse
|
14
|
Blanchet S, Fargeot L, Raffard A. Phylogenetically-conserved candidate genes unify biodiversity-ecosystem function relationships and eco-evolutionary dynamics across biological scales. Mol Ecol 2023; 32:4467-4481. [PMID: 37296539 DOI: 10.1111/mec.17043] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/24/2023] [Accepted: 05/30/2023] [Indexed: 06/12/2023]
Abstract
The intra- and interspecific facets of biodiversity have traditionally been analysed separately, limiting our understanding of how evolution has shaped biodiversity, how biodiversity (as a whole) alters ecological dynamics and hence eco-evolutionary feedbacks at the community scale. Here, we propose using candidate genes phylogenetically-conserved across species and sustaining functional traits as an inclusive biodiversity unit transcending the intra- and interspecific boundaries. This framework merges knowledge from functional genomics and functional ecology, and we first provide guidelines and a concrete example for identifying phylogenetically-conserved candidate genes (PCCGs) within communities and for measuring biodiversity from PCCGs. We then explain how biodiversity measured at PCCGs can be linked to ecosystem functions, which unifies recent observations that both intra- and interspecific biodiversity are important for ecosystem functions. We then highlight the eco-evolutionary processes shaping PCCG diversity patterns and argue that their respective role can be inferred from concepts derived from population genetics. Finally, we explain how PCCGs may shift the field of eco-evolutionary dynamics from a focal-species approach to a more realistic focal-community approach. This framework provides a novel perspective to investigate the global ecosystem consequences of diversity loss across biological scales, and how these ecological changes further alter biodiversity evolution.
Collapse
Affiliation(s)
- Simon Blanchet
- Centre National de la Recherche Scientifique (CNRS), Station d'Écologie Théorique et Expérimentale du CNRS à Moulis, UAR2029, Moulis, France
| | - Laura Fargeot
- Centre National de la Recherche Scientifique (CNRS), Station d'Écologie Théorique et Expérimentale du CNRS à Moulis, UAR2029, Moulis, France
| | - Allan Raffard
- Univ. Savoie Mont Blanc, INRAE, CARRTEL, Thonon-les-Bains, France
| |
Collapse
|
15
|
Terbot JW, Cooper BS, Good JM, Jensen JD. A simulation framework for modeling the within-patient evolutionary dynamics of SARS-CoV-2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548462. [PMID: 37503016 PMCID: PMC10370031 DOI: 10.1101/2023.07.13.548462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The global impact of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has led to considerable interest in detecting novel beneficial mutations and other genomic changes that may signal the development of variants of concern (VOCs). The ability to accurately detect these changes within individual patient samples is important in enabling early detection of VOCs. Such genomic scans for positive selection are best performed via comparison of empirical data to simulated data wherein evolutionary factors, including mutation and recombination rates, reproductive and infection dynamics, and purifying and background selection, can be carefully accounted for and parameterized. While there has been work to quantify these factors in SARS-CoV-2, they have yet to be integrated into a baseline model describing intra-host evolutionary dynamics. To construct such a baseline model, we develop a simulation framework that enables one to establish expectations for underlying levels and patterns of patient-level variation. By varying eight key parameters, we evaluated 12,096 different model-parameter combinations and compared them to existing empirical data. Of these, 592 models (~5%) were plausible based on the resulting mean expected number of segregating variants. These plausible models shared several commonalities shedding light on intra-host SARS-CoV-2 evolutionary dynamics: severe infection bottlenecks, low levels of reproductive skew, and a distribution of fitness effects skewed towards strongly deleterious mutations. We also describe important areas of model uncertainty and highlight additional sequence data that may help to further refine a baseline model. This study lays the groundwork for the improved analysis of existing and future SARS-CoV-2 within-patient data.
Collapse
Affiliation(s)
- John W Terbot
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Brandon S. Cooper
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey M. Good
- University of Montana, Division of Biological Sciences, Missoula, Montana, United States of America
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, United States of America
| |
Collapse
|
16
|
Bartoš O, Bohlen J, Šlechtová VB, Kočí J, Röslein J, Janko K. Sequence capture: Obsolete or irreplaceable? A thorough validation across phylogenetic distances and its applicability to hybrids and allopolyploids. Mol Ecol Resour 2023. [PMID: 37122140 DOI: 10.1111/1755-0998.13806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 04/05/2023] [Accepted: 04/12/2023] [Indexed: 05/02/2023]
Abstract
As whole-genome sequencing has become pervasive, some have suggested that reduced genomic representation approaches, for example, sequence capture, are becoming obsolete. In the present study, we argue that these techniques still provide excellent tools in terms of price and quality of data as well as in their ability to provide markers with specific features, as required, for example, in phylogenomics. A potential drawback of the wide-scale application of reduced representation approaches could be their drop in efficiency with increasing phylogenetic distance from the reference species. While some studies have focused on the degree and performance of reduced representation techniques in such situations, to our knowledge, none of them evaluated their applicability to inter-specific hybrids and polyploids. This highlights a significant gap in current knowledge since there is increasing evidence for the frequent occurrence of natural hybrids and polyploids, as well as for the major importance of both phenomena in evolution. The main aim of the present study was to carry out a thorough validation of SEQcap applicability to (1) a set of non-model taxa with a wide range of phylogenetic relatedness and (2) inter-specific hybrids of various ploidies and genomic compositions. Considering the latter point, we especially focused on mechanisms causing allelic bias and consequent allelic dropout, as these could have confounding effects with respect to the evolutionary genomic dynamics of hybrids, especially in asexuals, which virtually reproduce as a frozen F1 generation.
Collapse
Affiliation(s)
- Oldřich Bartoš
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Zoology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Jörg Bohlen
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Vendula Bohlen Šlechtová
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
| | - Jan Kočí
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Jan Röslein
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| | - Karel Janko
- Laboratory of Fish Genetics, Institute of Animal Physiology and Genetics, The Czech Academy of Sciences, Libechov, Czech Republic
- Department of Biology and Ecology, Faculty of Science, University of Ostrava, Ostrava, Czech Republic
| |
Collapse
|
17
|
Kunde MN, Barlow A, Klittich AM, Yakupova A, Patel RP, Fickel J, Förster DW. First mitogenome phylogeny of the sun bear Helarctos malayanus reveals a deep split between Indochinese and Sundaic lineages. Ecol Evol 2023; 13:e9969. [PMID: 37082317 PMCID: PMC10111171 DOI: 10.1002/ece3.9969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 03/04/2023] [Accepted: 03/14/2023] [Indexed: 04/22/2023] Open
Abstract
The sun bear Helarctos malayanus is one of the most endangered ursids, and to date classification of sun bear populations has been based almost exclusively on geographic distribution and morphology. The very few molecular studies focussing on this species were limited in geographic scope. Using archival and non-invasively collected sample material, we have added a substantial number of complete or near-complete mitochondrial genome sequences from sun bears of several range countries of the species' distribution. We here report 32 new mitogenome sequences representing sun bears from Cambodia, Thailand, Peninsular Malaysia, Sumatra, and Borneo. Reconstruction of phylogenetic relationships revealed two matrilines that diverged ~295 thousand years ago: one restricted to portions of mainland Indochina (China, Cambodia, Thailand; "Mainland clade"), and one comprising bears from Borneo, Sumatra, Peninsular Malaysia but also Thailand ("Sunda clade"). Generally recent coalescence times in the mitochondrial phylogeny suggest that recent or historical demographic processes have resulted in a loss of mtDNA variation. Additionally, analysis of our data in conjunction with shorter mtDNA sequences revealed that the Bornean sun bear, classified as a distinct subspecies (H. m. euryspilus), does not harbor a distinctive matriline. Further molecular studies of H. malayanus are needed, which should ideally include data from nuclear loci.
Collapse
Affiliation(s)
- Miriam N. Kunde
- Leibniz Institute for Zoo and Wildlife ResearchAlfred‐Kowalke‐Str. 1710315BerlinGermany
- School of EnvironmentGriffith UniversityNathan Campus, 170 Kessels Road, NathanBrisbaneQueensland4111Australia
| | - Axel Barlow
- School of Natural SciencesBangor UniversityBangorGwyneddLL57 2DGUK
| | - Achim M. Klittich
- Institute for Biochemistry and BiologyUniversity of PotsdamKarl‐Liebknecht‐Str. 24–2514476PotsdamGermany
| | - Aliya Yakupova
- Computer Technologies LaboratoryITMO University197101Saint PetersburgRussia
| | - Riddhi P. Patel
- Leibniz Institute for Zoo and Wildlife ResearchAlfred‐Kowalke‐Str. 1710315BerlinGermany
| | - Jörns Fickel
- Leibniz Institute for Zoo and Wildlife ResearchAlfred‐Kowalke‐Str. 1710315BerlinGermany
- Institute for Biochemistry and BiologyUniversity of PotsdamKarl‐Liebknecht‐Str. 24–2514476PotsdamGermany
| | - Daniel W. Förster
- Leibniz Institute for Zoo and Wildlife ResearchAlfred‐Kowalke‐Str. 1710315BerlinGermany
| |
Collapse
|
18
|
Sasser KT, Weber JN. A Call For More Ecologically And Evolutionarily Relevant Studies of Immune Costs. Evol Ecol 2023; 37:203-214. [PMID: 37608798 PMCID: PMC10443930 DOI: 10.1007/s10682-022-10213-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 10/02/2022] [Indexed: 11/04/2022]
Abstract
What are the relative costs and benefits of mounting immune responses? Practitioners of ecoimmunology have grappled with this central question since the field's inception with the main tension being how to make tractable methodological choices that maintain the ecological relevance of induced and measured immune costs. Here, we point out two methodological approaches that we feel are underrepresented in the field, describe risks associated with neglecting these methods, and suggest modern techniques that maximize both the diversity and ecological relevance of collected data. First, it is commonly assumed that frequently used and experimentally convenient immune stimulants will induce ecologically relevant immune responses in study organisms. This can be a dangerous assumption. Even if a stimulant's general immune response properties are well characterized, it is critical to also measure the type and scale of immune responses induced by live pathogens. Second, patterns of immune defenses evolve like other traits, thus a comparative approach is essential to understand what forces shape immune variation. Finally, we describe modern genetic and immunological approaches that will soon become essential tools for ecoimmunologists, and present case studies that exemplify the utility of our recommendations.
Collapse
Affiliation(s)
- Kristofer Trey Sasser
- University of Alaska Anchorage, Anchorage, AK, USA. Current address: University of Wisconsin, Madison, WI, USA
- 430 Lincoln Dr, Birge Hall, Madison WI, 53706
| | - Jesse N Weber
- University of Wisconsin, Madison, WI, USA
- 430 Lincoln Dr, Birge Hall, Madison WI, 53706
| |
Collapse
|
19
|
Li Y, Shi X, Zuo Y, Li T, Liu L, Shen Z, Shen J, Zhang R, Wang S. Multiplexed Target Enrichment Enables Efficient and In-Depth Analysis of Antimicrobial Resistome in Metagenomes. Microbiol Spectr 2022; 10:e0229722. [PMID: 36287061 PMCID: PMC9769626 DOI: 10.1128/spectrum.02297-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 10/04/2022] [Indexed: 01/06/2023] Open
Abstract
Antibiotic resistance genes (ARGs) pose a serious threat to public health and ecological security in the 21st century. However, the resistome only accounts for a tiny fraction of metagenomic content, which makes it difficult to investigate low-abundance ARGs in various environmental settings. Thus, a highly sensitive, accurate, and comprehensive method is needed to describe ARG profiles in complex metagenomic samples. In this study, we established a high-throughput sequencing method based on targeted amplification, which could simultaneously detect ARGs (n = 251), mobile genetic element genes (n = 8), and metal resistance genes (n = 19) in metagenomes. The performance of amplicon sequencing was compared with traditional metagenomic shotgun sequencing (MetaSeq). A total of 1421 primer pairs were designed, achieving extremely high coverage of target genes. The amplicon sequencing significantly improved the recovery of target ARGs (~9 × 104-fold), with higher sensitivity and diversity, less cost, and computation burden. Furthermore, targeted enrichment allows deep scanning of single nucleotide polymorphisms (SNPs), and elevated SNPs detection was shown in this study. We further performed this approach for 48 environmental samples (37 feces, 20 soils, and 7 sewage) and 16 clinical samples. All samples tested in this study showed high diversity and recovery of targeted genes. Our results demonstrated that the approach could be applied to various metagenomic samples and served as an efficient tool in the surveillance and evolution assessment of ARGs. Access to the resistome using the enrichment method validated in this study enabled the capture of low-abundance resistomes while being less costly and time-consuming, which can greatly advance our understanding of local and global resistome dynamics. IMPORTANCE ARGs, an increasing global threat to human health, can be transferred into health-related microorganisms in the environment by horizontal gene transfer, posing a serious threat to public health. Advancing profiling methods are needed for monitoring and predicting the potential risks of ARGs in metagenomes. Our study described a customized amplicon sequencing assay that could enable a high-throughput, targeted, in-depth analysis of ARGs and detect a low-abundance portion of resistomes. This method could serve as an efficient tool to assess the variation and evolution of specific ARGs in the clinical and natural environment.
Collapse
Affiliation(s)
- Yiming Li
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Xiaomin Shi
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Yang Zuo
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Tian Li
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Lu Liu
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Zhangqi Shen
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Jianzhong Shen
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| | - Rong Zhang
- The Second Affiliated Hospital of Zhejiang University, Zhejiang University, Hangzhou, China
| | - Shaolin Wang
- Beijing Key Laboratory of Detection Technology for Animal-Derived Food Safety, College of Veterinary Medicine, China Agricultural University, Beijing, China
- Guangdong Laboratory for Lingnan Modern Agriculture, Guangzhou, China
| |
Collapse
|
20
|
Nunes R, Storer C, Doleck T, Kawahara AY, Pierce NE, Lohman DJ. Predictors of sequence capture in a large-scale anchored phylogenomics project. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.943361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Abstract
Next-generation sequencing (NGS) technologies have revolutionized phylogenomics by decreasing the cost and time required to generate sequence data from multiple markers or whole genomes. Further, the fragmented DNA of biological specimens collected decades ago can be sequenced with NGS, reducing the need for collecting fresh specimens. Sequence capture, also known as anchored hybrid enrichment, is a method to produce reduced representation libraries for NGS sequencing. The technique uses single-stranded oligonucleotide probes that hybridize with pre-selected regions of the genome that are sequenced via NGS, culminating in a dataset of numerous orthologous loci from multiple taxa. Phylogenetic analyses using these sequences have the potential to resolve deep and shallow phylogenetic relationships. Identifying the factors that affect sequence capture success could save time, money, and valuable specimens that might be destructively sampled despite low likelihood of sequencing success. We investigated the impacts of specimen age, preservation method, and DNA concentration on sequence capture (number of captured sequences and sequence quality) while accounting for taxonomy and extracted tissue type in a large-scale butterfly phylogenomics project. This project used two probe sets to extract 391 loci or a subset of 13 loci from over 6,000 butterfly specimens. We found that sequence capture is a resilient method capable of amplifying loci in samples of varying age (0–111 years), preservation method (alcohol, papered, pinned), and DNA concentration (0.020 ng/μl - 316 ng/ul). Regression analyses demonstrate that sequence capture is positively correlated with DNA concentration. However, sequence capture and DNA concentration are negatively correlated with sample age and preservation method. Our findings suggest that sequence capture projects should prioritize the use of alcohol-preserved samples younger than 20 years old when available. In the absence of such specimens, dried samples of any age can yield sequence data, albeit with returns that diminish with increasing age.
Collapse
|
21
|
Crameri S, Fior S, Zoller S, Widmer A. A target capture approach for phylogenomic analyses at multiple evolutionary timescales in rosewoods (Dalbergia spp.) and the legume family (Fabaceae). Mol Ecol Resour 2022; 22:3087-3105. [PMID: 35689779 PMCID: PMC9796917 DOI: 10.1111/1755-0998.13666] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Revised: 03/29/2022] [Accepted: 06/01/2022] [Indexed: 01/07/2023]
Abstract
Understanding the genetic changes associated with the evolution of biological diversity is of fundamental interest to molecular ecologists. The assessment of genetic variation at hundreds or thousands of unlinked genetic loci forms a sound basis to address questions ranging from micro- to macroevolutionary timescales, and is now possible thanks to advances in sequencing technology. Major difficulties are associated with (i) the lack of genomic resources for many taxa, especially from tropical biodiversity hotspots; (ii) scaling the numbers of individuals analysed and loci sequenced; and (iii) building tools for reproducible bioinformatic analyses of such data sets. To address these challenges, we developed target capture probes for genomic studies of the highly diverse, pantropically distributed and economically significant rosewoods (Dalbergia spp.), explored the performance of an overlapping probe set for target capture across the legume family (Fabaceae), and built the general purpose bioinformatic pipeline CaptureAl. Phylogenomic analyses of Malagasy Dalbergia species yielded highly resolved and well supported hypotheses of evolutionary relationships. Population genomic analyses identified differences between closely related species and revealed the existence of a potentially new species, suggesting that the diversity of Malagasy Dalbergia species has been underestimated. Analyses at the family level corroborated previous findings by the recovery of monophyletic subfamilies and many well-known clades, as well as high levels of gene tree discordance, especially near the root of the family. The new genomic and bioinformatic resources, including the Fabaceae1005 and Dalbergia2396 probe sets, will hopefully advance systematics and ecological genetics research in legumes, and promote conservation of the highly diverse and endangered Dalbergia rosewoods.
Collapse
Affiliation(s)
- Simon Crameri
- Institute of Integrative BiologyETH ZurichZürichSwitzerland
| | - Simone Fior
- Institute of Integrative BiologyETH ZurichZürichSwitzerland
| | - Stefan Zoller
- Institute of Integrative BiologyETH ZurichZürichSwitzerland,Genetic Diversity Centre (GDC)ETH ZurichZürichSwitzerland
| | - Alex Widmer
- Institute of Integrative BiologyETH ZurichZürichSwitzerland
| |
Collapse
|
22
|
Pacheco C, Lobo D, Silva P, Álvares F, García EJ, Castro D, Layna JF, López-Bao JV, Godinho R. Assessing the performance of historical skins and bones for museomics using wolf specimens as a case study. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.970249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Advances in the field of museomics have promoted a high sampling demand for natural history collections (NHCs), eventually resulting in damage to invaluable resources to understand historical biodiversity. It is thus essential to achieve a consensus about which historical tissues present the best sources of DNA. In this study, we evaluated the performance of different historical tissues from Iberian wolf NHCs in genome-wide assessments. We targeted three tissues—bone (jaw and femur), maxilloturbinal bone, and skin—that have been favored by traditional taxidermy practices for mammalian carnivores. Specifically, we performed shotgun sequencing and target capture enrichment for 100,000 single nucleotide polymorphisms (SNPs) selected from the commercial Canine HD BeadChip across 103 specimens from 1912 to 2005. The performance of the different tissues was assessed using metrics based on endogenous DNA content, uniquely high-quality mapped reads after capture, and enrichment proportions. All samples succeeded as DNA sources, regardless of their collection year or sample type. Skin samples yielded significantly higher amounts of endogenous DNA compared to both bone types, which yielded equivalent amounts. There was no evidence for a direct effect of tissue type on capture efficiency; however, the number of genotyped SNPs was strictly associated with the starting amount of endogenous DNA. Evaluation of genotyping accuracy for distinct minimum read depths across tissue types showed a consistent overall low genotyping error rate (<7%), even at low (3x) coverage. We recommend the use of skins as reliable and minimally destructive sources of endogenous DNA for whole-genome and target enrichment approaches in mammalian carnivores. In addition, we provide a new 100,000 SNP capture array validated for historical DNA (hDNA) compatible to the Canine HD BeadChip for high-quality DNA. The increasing demand for NHCs as DNA sources should encourage the generation of genomic datasets comparable among studies.
Collapse
|
23
|
Li J, Liang D, Zhang P. Simultaneously collecting coding and non-coding phylogenomic data using homemade full-length cDNA probes, tested by resolving the high-level relationships of Colubridae. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.969581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Resolving intractable phylogenetic relationships often requires simultaneously analyzing a large number of coding and non-coding orthologous loci. To gather both coding and non-coding data, traditional sequence capture methods require custom-designed commercial probes. Here, we present a cost-effective sequence capture method based on homemade probes, to capture thousands of coding and non-coding orthologous loci simultaneously, suitable for all organisms. This approach, called “FLc-Capture,” synthesizes biotinylated full-length cDNAs from mRNA as capture probes, eliminates the need for costly commercial probe design and synthesis. To demonstrate the utility of FLc-Capture, we prepared full-length cDNA probes from mRNA extracted from a common colubrid snake. We performed capture experiments with these homemade cDNA probes and successfully obtained thousands of coding and non-coding genomic loci from 24 Colubridae species and 12 distantly related snake species of other families. The average capture specificity of FLc-Capture across all tested snake species is 35%, similar to the previously published EecSeq method. We constructed two phylogenomic data sets, one including 1,075 coding loci (∼817,000 bp) and the other including 1,948 non-coding loci (∼1,114,000 bp), to study the phylogeny of Colubridae. Both data sets yielded highly similar and well-resolved trees, with 85% of nodes having >95% bootstrap support. Our experimental tests show that FLc-Capture is a flexible, fast, and cost-effective sequence capture approach for simultaneously gathering coding and non-coding phylogenomic data sets to study intractable phylogenetic questions. We hope that this method will serve as a new data collection tool for evolutionary biologists working in the era of phylogenomics.
Collapse
|
24
|
Lv J, Wang Y, Ni P, Lin P, Hou H, Ding J, Chang Y, Hu J, Wang S, Bao Z. Development of a high-throughput SNP array for sea cucumber (Apostichopus japonicus) and its application in genomic selection with MCP regularized deep neural networks. Genomics 2022; 114:110426. [PMID: 35820495 DOI: 10.1016/j.ygeno.2022.110426] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 06/23/2022] [Accepted: 06/30/2022] [Indexed: 12/22/2022]
Abstract
High-throughput single nucleotide polymorphism (SNP) genotyping assays are powerful tools for genetic studies and genomic breeding applications for many species. Though large numbers of SNPs have been identified in sea cucumber (Apostichopus japonicus), but, as yet, no high-throughput genotyping platform is available for this species. In this study, we designed and developed a high-throughput 24 K SNP genotyping array named HaishenSNP24K for A. japonicus, based on the multi-objective-local optimization (MOLO) algorithm and HD-Marker genotyping method. The SNP array exhibited a relatively high genotyping call rate (> 96%), genotyping accuracy (>95%) and exhibited highly polymorphic in sea cucumber populations. In addition, we also assessed its application in genomic selection (GS). Deep neural networks (DNN) that can capture the complicated interactions of genes have been proposed as a promising tool in GS for SNP-based genomic prediction of complex traits in animal breeding. To overcome the problem of over-fitting when using the HaishenSNP24K array as high-dimensional DNN input, we developed minmax concave penalty (MCP) regularization for sparse deep neural networks (DNN-MCP) that finds an optimal sparse structure of a DNN by minimizing the square error subject to the non-convex penalty MCP on the parameters (weights and biases). Compared to two linear models, namely RR-GBLUP and Bayes B, and the nonlinear model DNN, DNN-MCP has greatly improved the genomic prediction ability for three quantitative traits (e.g., wet weight, dry weight and survival time) in the sea cucumber population. To the best of our knowledge, this is the first work to develop a high-throughput SNP array for A. japonicus and a new model DNN-MCP for genomic prediction of complex traits in GS. The present results provide evidence that supports the HaishenSNP24K array with DNN-MCP will be valuable for genetic studies and molecular breeding in A. japonicus.
Collapse
Affiliation(s)
- Jia Lv
- MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Yangfan Wang
- MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China.
| | - Ping Ni
- MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Ping Lin
- Division of Mathematics, University of Dundee, Dundee DD1 4HN, UK
| | - Hu Hou
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, China
| | - Jun Ding
- College of Fisheries and Life Science, Dalian Ocean University, Dalian 116023, China.
| | - Yaqing Chang
- College of Fisheries and Life Science, Dalian Ocean University, Dalian 116023, China.
| | - Jingjie Hu
- Ocean University China, Sanya Oceanog Inst, Lab Trop Marine Germplasm Res & Breeding Engn, Sanya 572000, China.
| | - Shi Wang
- MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| | - Zhenmin Bao
- MOE Key Laboratory of Marine Genetics and Breeding, College of Marine Life Sciences, Ocean University of China, Qingdao 266003, China
| |
Collapse
|
25
|
Jiménez‐Mena B, Flávio H, Henriques R, Manuzzi A, Ramos M, Meldrup D, Edson J, Pálsson S, Ásta Ólafsdóttir G, Ovenden JR, Nielsen EE. Fishing for DNA? Designing baits for population genetics in target enrichment experiments: Guidelines, considerations and the new tool supeRbaits. Mol Ecol Resour 2022; 22:2105-2119. [PMID: 35178874 PMCID: PMC9313901 DOI: 10.1111/1755-0998.13598] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 01/24/2022] [Accepted: 02/07/2022] [Indexed: 11/27/2022]
Abstract
Targeted sequencing is an increasingly popular next-generation sequencing (NGS) approach for studying populations that involves focusing sequencing efforts on specific parts of the genome of a species of interest. Methodologies and tools for designing targeted baits are scarce but in high demand. Here, we present specific guidelines and considerations for designing capture sequencing experiments for population genetics for both neutral genomic regions and regions subject to selection. We describe the bait design process for three diverse fish species: Atlantic salmon, Atlantic cod and tiger shark, which was carried out in our research group, and provide an evaluation of the performance of our approach across both historical and modern samples. The workflow used for designing these three bait sets has been implemented in the R-package supeRbaits, which encompasses our considerations and guidelines for bait design for the benefit of researchers and practitioners. The supeRbaits R-package is user-friendly and versatile. It is written in C++ and implemented in R. supeRbaits and its manual are available from Github: https://github.com/BelenJM/supeRbaits.
Collapse
Affiliation(s)
- Belén Jiménez‐Mena
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| | - Hugo Flávio
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| | - Romina Henriques
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| | - Alice Manuzzi
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| | | | - Dorte Meldrup
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| | - Janette Edson
- Queensland Brain InstituteThe University of QueenslandBrisbaneQueenslandAustralia
| | - Snæbjörn Pálsson
- Faculty of Life and Environmental SciencesUniversity of IcelandReykjavíkIceland
| | | | - Jennifer R. Ovenden
- Molecular Fisheries Laboratory, School of Biomedical SciencesThe University of QueenslandBrisbaneQueenslandAustralia
| | - Einar Eg Nielsen
- Section for Marine Living Resources, National Institute of Aquatic ResourcesTechnical University of DenmarkSilkeborgDenmark
| |
Collapse
|
26
|
Target Enrichment Approaches for Next-Generation Sequencing Applications in Oncology. Diagnostics (Basel) 2022; 12:diagnostics12071539. [PMID: 35885445 PMCID: PMC9318977 DOI: 10.3390/diagnostics12071539] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 06/20/2022] [Accepted: 06/21/2022] [Indexed: 11/29/2022] Open
Abstract
Screening for genomic sequence variants in genes of predictive and prognostic significance is an integral part of precision medicine. Next-generation sequencing (NGS) technologies are progressively becoming platforms of choice to facilitate this, owing to their massively parallel sequencing capability, which can be used to simultaneously screen multiple markers in multiple samples for a variety of variants (single nucleotide and multi nucleotide variants, insertions and deletions, gene copy number variations, and fusions). A crucial step in the workflow of targeted NGS is the enrichment of the genomic regions of interest to be sequenced, against the whole genomic background. This ensures that the NGS effort is focused to predominantly screen target regions of interest with minimal off-target sequencing, making it more accurate and economical. Polymerase chain reaction-based (PCR, or amplicon-based) and hybridization capture-based methodologies are the two prominent approaches employed for target enrichment. This review summarizes the basic principles of target enrichment utilized by these methods, their multiple variations that have evolved over time, automation approaches, overall comparison of their advantages and drawbacks, and commercially available choices for these methodologies.
Collapse
|
27
|
Rao WQ, Kalogeropoulos K, Allentoft ME, Gopalakrishnan S, Zhao WN, Workman CT, Knudsen C, Jiménez-Mena B, Seneci L, Mousavi-Derazmahalleh M, Jenkins TP, Rivera-de-Torre E, Liu SQ, Laustsen AH. The rise of genomics in snake venom research: recent advances and future perspectives. Gigascience 2022; 11:6562531. [PMID: 35365832 PMCID: PMC8975721 DOI: 10.1093/gigascience/giac024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 02/12/2022] [Accepted: 02/13/2022] [Indexed: 12/12/2022] Open
Abstract
Snake venoms represent a danger to human health, but also a gold mine of bioactive proteins that can be harnessed for drug discovery purposes. The evolution of snakes and their venom has been studied for decades, particularly via traditional morphological and basic genetic methods alongside venom proteomics. However, while the field of genomics has matured rapidly over the past 2 decades, owing to the development of next-generation sequencing technologies, snake genomics remains in its infancy. Here, we provide an overview of the state of the art in snake genomics and discuss its potential implications for studying venom evolution and toxinology. On the basis of current knowledge, gene duplication and positive selection are key mechanisms in the neofunctionalization of snake venom proteins. This makes snake venoms important evolutionary drivers that explain the remarkable venom diversification and adaptive variation observed in these reptiles. Gene duplication and neofunctionalization have also generated a large number of repeat sequences in snake genomes that pose a significant challenge to DNA sequencing, resulting in the need for substantial computational resources and longer sequencing read length for high-quality genome assembly. Fortunately, owing to constantly improving sequencing technologies and computational tools, we are now able to explore the molecular mechanisms of snake venom evolution in unprecedented detail. Such novel insights have the potential to affect the design and development of antivenoms and possibly other drugs, as well as provide new fundamental knowledge on snake biology and evolution.
Collapse
Affiliation(s)
- Wei-Qiao Rao
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark.,Department of Mass Spectrometry, Beijing Genomics Institute-Research, 518083, Shenzhen, China
| | - Konstantinos Kalogeropoulos
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Morten E Allentoft
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Kent Street, 6102, Bentley Perth, Australia.,Globe Institute, University of Copenhagen, Øster Voldgade 5, 1350, Copenhagen, Denmark
| | - Shyam Gopalakrishnan
- Globe Institute, University of Copenhagen, Øster Voldgade 5, 1350, Copenhagen, Denmark
| | - Wei-Ning Zhao
- Department of Mass Spectrometry, Beijing Genomics Institute-Research, 518083, Shenzhen, China
| | - Christopher T Workman
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Cecilie Knudsen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Belén Jiménez-Mena
- DTU Aqua, Technical University of Denmark, Vejlsøvej 39, 8600, Silkeborg, Denmark
| | - Lorenzo Seneci
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Mahsa Mousavi-Derazmahalleh
- Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Kent Street, 6102, Bentley Perth, Australia
| | - Timothy P Jenkins
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Esperanza Rivera-de-Torre
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| | - Si-Qi Liu
- Department of Mass Spectrometry, Beijing Genomics Institute-Research, 518083, Shenzhen, China
| | - Andreas H Laustsen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, Søltofts Plads 224, 2800 Kongens Lyngby, Denmark
| |
Collapse
|
28
|
Lott MJ, Wright BR, Neaves LE, Frankham GJ, Dennison S, Eldridge MDB, Potter S, Alquezar-Planas DE, Hogg CJ, Belov K, Johnson RN. Future-proofing the koala: synergising genomic and environmental data for effective species management. Mol Ecol 2022; 31:3035-3055. [PMID: 35344635 DOI: 10.1111/mec.16446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 02/19/2022] [Accepted: 03/04/2022] [Indexed: 11/30/2022]
Abstract
Climatic and evolutionary processes are inextricably linked to conservation. Avoiding extinction in rapidly changing environments often depends upon a species' capacity to adapt in the face of extreme selective pressures. Here, we employed exon capture and high-throughput next-generation sequencing to investigate the mechanisms underlying population structure and adaptive genetic variation in the koala (Phascolarctos cinereus), an iconic Australian marsupial that represents a unique conservation challenge because it is not uniformly threatened across its range. An examination of 250 specimens representing 91 wild source locations revealed that five major genetic clusters currently exist on a continental scale. The initial divergence of these clusters appears to have been concordant with the Mid-Brunhes Transition (∼ 430-300 kya), a major climatic reorganization that increased the amplitude of Pleistocene glacial-interglacial cycles. While signatures of polygenic selection and environmental adaptation were detected, strong evidence for repeated, climate-associated range contractions and demographic bottleneck events suggests that geographically isolated refugia may have played a more significant role in the survival of the koala through the Pleistocene glaciation than in situ adaptation. Consequently, the conservation of genome-wide genetic variation must be aligned with the protection of core koala habitat to increase the resilience of threatened populations to accelerating anthropogenic threats. Finally, we propose that the five major genetic clusters identified in this study should be accounted for in future koala conservation efforts (e.g. guiding translocations), as existing management divisions in the states of Queensland and New South Wales do not reflect historic or contemporary population structure.
Collapse
Affiliation(s)
- Matthew J Lott
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia
| | - Belinda R Wright
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia.,School of Life and Environmental Sciences, the University of Sydney, 2006, New South Wales, Australia.,Sydney School of Veterinary Sciences, Faculty of Science, the University of Sydney, 2006, New South Wales, Australia
| | - Linda E Neaves
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia.,Fenner School of Environment and Society, the Australian National University, Canberra, Australian Capital Territory, 2600, Australia
| | - Greta J Frankham
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia
| | - Siobhan Dennison
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia
| | - Mark D B Eldridge
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia
| | - Sally Potter
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia.,Division of Ecology & Evolution, Research School of Biology, the Australian National University, Australian Capital Territory, Canberra, 2600, Australia
| | - David E Alquezar-Planas
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia
| | - Carolyn J Hogg
- School of Life and Environmental Sciences, the University of Sydney, 2006, New South Wales, Australia
| | - Katherine Belov
- School of Life and Environmental Sciences, the University of Sydney, 2006, New South Wales, Australia
| | - Rebecca N Johnson
- Australian Museum Research Institute, Australian Museum, 1 William Street, 2010, New South Wales, Australia.,National Museum of Natural History, District of Columbia, Washington, 20560, United States
| |
Collapse
|
29
|
Comparative population genomics in Tabebuia alliance shows evidence of adaptation in Neotropical tree species. Heredity (Edinb) 2022; 128:141-153. [PMID: 35132209 PMCID: PMC8897506 DOI: 10.1038/s41437-021-00491-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 12/07/2021] [Accepted: 12/07/2021] [Indexed: 11/08/2022] Open
Abstract
The role of natural selection in shaping spatial patterns of genetic diversity in the Neotropics is still poorly understood. Here, we perform a genome scan with 24,751 probes targeting 11,026 loci in two Neotropical Bignoniaceae tree species: Handroanthus serratifolius from the seasonally dry tropical forest (SDTF) and Tabebuia aurea from savannas, and compared with the population genomics of H. impetiginosus from SDTF. OutFLANK detected 29 loci in 20 genes with selection signal in H. serratifolius and no loci in T. aurea. Using BayPass, we found evidence of selection in 335 loci in 312 genes in H. serratifolius, 101 loci in 92 genes in T. aurea, and 448 loci in 416 genes in H. impetiginosus. All approaches evidenced several genes affecting plant response to environmental stress and primary metabolic processes. The three species shared no SNPs with selection signal, but we found SNPs affecting the same gene in pair of species. Handroanthus serratifolius showed differences in allele frequencies at SNPs with selection signal among ecosystems, mainly between Caatinga/Cerrado and Atlantic Forest, while H. impetiginosus had one allele fixed across all populations, and T. aurea had similar allele frequency distribution among ecosystems and polymorphism across populations. Taken together, our results indicate that natural selection related to environmental stress shaped the spatial pattern of genetic diversity in the three species. However, the three species have different geographical distribution and niches, which may affect tolerances and adaption, and natural selection may lead to different signatures due to the differences in adaptive landscapes in different niches.
Collapse
|
30
|
de Aquino SO, Kiwuka C, Tournebize R, Gain C, Marraccini P, Mariac C, Bethune K, Couderc M, Cubry P, Andrade AC, Lepelley M, Darracq O, Crouzillat D, Anten N, Musoli P, Vigouroux Y, de Kochko A, Manel S, François O, Poncet V. Adaptive potential of
Coffea canephora
from Uganda in response to climate change. Mol Ecol 2022; 31:1800-1819. [DOI: 10.1111/mec.16360] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Revised: 11/12/2021] [Accepted: 01/06/2022] [Indexed: 11/28/2022]
Affiliation(s)
| | - Catherine Kiwuka
- NARO Kampala Uganda
- Centre for Crop Systems Analysis Wageningen Univ. Wageningen Netherlands
| | | | - Clément Gain
- U. Grenoble‐Alpes, TIMC‐IMAG, CNRS UMR 5525, Grenoble, France and LJK, Inria, CNRS UMR 5224 Grenoble France
| | | | - Cédric Mariac
- DIADE, Univ. Montpellier, CIRAD, IRD Montpellier France
| | - Kévin Bethune
- DIADE, Univ. Montpellier, CIRAD, IRD Montpellier France
| | - Marie Couderc
- DIADE, Univ. Montpellier, CIRAD, IRD Montpellier France
| | | | | | | | | | | | - Niels Anten
- Centre for Crop Systems Analysis Wageningen Univ. Wageningen Netherlands
| | | | | | | | - Stéphanie Manel
- CEFE, Univ Montpellier, CNRS, EPHE‐PSL University, IRD Montpellier France
| | - Olivier François
- U. Grenoble‐Alpes, TIMC‐IMAG, CNRS UMR 5525, Grenoble, France and LJK, Inria, CNRS UMR 5224 Grenoble France
| | | |
Collapse
|
31
|
AKHTAR MUHAMMADSHOAIB, ASHINO RYUICHI, OOTA HIROKI, ISHIDA HAJIME, NIIMURA YOSHIHITO, TOUHARA KAZUSHIGE, MELIN AMANDAD, KAWAMURA SHOJI. Genetic variation of olfactory receptor gene family in a Japanese population. ANTHROPOL SCI 2022. [DOI: 10.1537/ase.211024] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
- MUHAMMAD SHOAIB AKHTAR
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa
| | - RYUICHI ASHINO
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa
| | - HIROKI OOTA
- Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo
| | - HAJIME ISHIDA
- Department of Human Biology and Anatomy, Faculty of Medicine, University of the Ryukyus, Nishihara
| | - YOSHIHITO NIIMURA
- Department of Veterinary Sciences, Faculty of Agriculture, University of Miyazaki, Miyazaki
| | - KAZUSHIGE TOUHARA
- Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo
| | - AMANDA D. MELIN
- Department of Anthropology and Archaeology & Department of Medical Genetics, University of Calgary, Calgary
| | - SHOJI KAWAMURA
- Department of Integrated Biosciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa
| |
Collapse
|
32
|
Nürnberger B, Baird SJE, Čížková D, Bryjová A, Mudd AB, Blaxter ML, Szymura JM. A dense linkage map for a large repetitive genome: discovery of the sex-determining region in hybridizing fire-bellied toads (Bombina bombina and Bombina variegata). G3 (BETHESDA, MD.) 2021; 11:6353606. [PMID: 34849761 PMCID: PMC8664441 DOI: 10.1093/g3journal/jkab286] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 08/16/2021] [Indexed: 12/20/2022]
Abstract
Genomic analysis of hybrid zones offers unique insights into emerging reproductive isolation and the dynamics of introgression. Because hybrid genomes consist of blocks inherited from one or the other parental taxon, linkage information is essential. In most cases, the spectrum of local ancestry tracts can be efficiently uncovered from dense linkage maps. Here, we report the development of such a map for the hybridizing toads, Bombina bombina and Bombina variegata (Anura: Bombinatoridae). Faced with the challenge of a large (7–10 Gb), repetitive genome, we set out to identify a large number of Mendelian markers in the nonrepetitive portion of the genome that report B. bombina vs B. variegata ancestry with appropriately quantified statistical support. Bait sequences for targeted enrichment were selected from a draft genome assembly, after filtering highly repetitive sequences. We developed a novel approach to infer the most likely diplotype per sample and locus from the raw read mapping data, which is robust to over-merging and obviates arbitrary filtering thresholds. Validation of the resulting map with 4755 markers underscored the large-scale synteny between Bombina and Xenopus tropicalis. By assessing the sex of late-stage F2 tadpoles from histological sections, we identified the sex-determining region in the Bombina genome to 7 cM on LG5, which is homologous to X. tropicalis chromosome 5, and inferred male heterogamety. Interestingly, chromosome 5 has been repeatedly recruited as a sex chromosome in anurans with XY sex determination.
Collapse
Affiliation(s)
- Beate Nürnberger
- Research Facility Studenec, Institute of Vertebrate Biology, Czech Academy of Sciences, 603 65 Brno, Czech Republic
| | - Stuart J E Baird
- Research Facility Studenec, Institute of Vertebrate Biology, Czech Academy of Sciences, 603 65 Brno, Czech Republic
| | - Dagmar Čížková
- Research Facility Studenec, Institute of Vertebrate Biology, Czech Academy of Sciences, 603 65 Brno, Czech Republic
| | - Anna Bryjová
- Research Facility Studenec, Institute of Vertebrate Biology, Czech Academy of Sciences, 603 65 Brno, Czech Republic
| | - Austin B Mudd
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, 94720 CA, USA
| | - Mark L Blaxter
- Tree of Life Programme, Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, UK
| | - Jacek M Szymura
- Department of Comparative Anatomy, Jagiellonian University, 30-387 Kraków, Poland
| |
Collapse
|
33
|
Houston DD, Satler JD, Stack TK, Carroll HM, Bevan AM, Moya AL, Alexander KD. A phylogenomic perspective on the evolutionary history of the stonefly genus Suwallia (Plecoptera: Chloroperlidae) revealed by ultraconserved genomic elements. Mol Phylogenet Evol 2021; 166:107320. [PMID: 34626810 DOI: 10.1016/j.ympev.2021.107320] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2020] [Revised: 09/20/2021] [Accepted: 10/04/2021] [Indexed: 11/18/2022]
Abstract
Evolutionary biologists have long sought to disentangle phylogenetic relationships among taxa spanning the tree of life, an increasingly important task as anthropogenic influences accelerate population declines and species extinctions, particularly in insects. Phylogenetic analyses are commonly used to identify unique evolutionary lineages, to clarify taxonomic designations of the focal taxa, and to inform conservation decisions. Advances in DNA sequencing techniques have increasingly facilitated the ability of researchers to apply genomic methods to phylogenetic analyses, even for non-model organisms. Stoneflies are non-model insects that are important bioindicators of the quality of freshwater habitats and landscape disturbance as they spend the immature stages of their life cycles in fresh water, and the adult stages in terrestrial environments. Phylogenetic relationships within the stonefly genus Suwallia (Insecta: Plecoptera: Chloroperlidae) are poorly understood, and have never been assessed using molecular data. We used DNA sequence data from genome-wide ultraconserved element loci to generate the first molecular phylogeny for the group and assess its monophyly. We found that Palearctic and Nearctic Suwallia do not form reciprocally monophyletic clades, and that a biogeographic history including dispersal, vicariance, and founder event speciation via jump dispersal best explains the geographic distribution of this group. Our results also strongly suggest that Neaviperla forcipata (Neave, 1929) is nested within Suwallia, and the concept of the genus Suwallia should be revised to include it. Thus, we formally propose a new taxonomic combination wherein Neaviperla forcipata (Neave, 1929) is reclassified as Suwallia forcipata (Neave, 1929). Moreover, some Suwallia species (e.g., S. amoenacolens, S. kerzhneri, S. marginata, S. pallidula, and S. starki) exhibit pronounced cryptic diversity that is worthy of further investigation. These findings provide a first glimpse into the evolutionary history of Suwallia, improve our understanding of stonefly diversity in the tribe Suwallini, and highlight areas where additional research is needed.
Collapse
Affiliation(s)
- Derek D Houston
- Department of Natural and Environmental Sciences, Western Colorado University, Gunnison, CO, USA.
| | - Jordan D Satler
- Department of Ecology Evolution and Organismal Biology, Iowa State University, Ames, IA, USA.
| | - Taylor K Stack
- Department of Natural and Environmental Sciences, Western Colorado University, Gunnison, CO, USA.
| | - Hannah M Carroll
- Department of Ecology Evolution and Organismal Biology, Iowa State University, Ames, IA, USA; Department of Earth Planetary and Space Sciences, University of California-Los Angeles, CA, USA.
| | - Alissa M Bevan
- Department of Natural and Environmental Sciences, Western Colorado University, Gunnison, CO, USA.
| | - Autumn L Moya
- Department of Natural and Environmental Sciences, Western Colorado University, Gunnison, CO, USA.
| | - Kevin D Alexander
- Department of Natural and Environmental Sciences, Western Colorado University, Gunnison, CO, USA.
| |
Collapse
|
34
|
Yardeni G, Viruel J, Paris M, Hess J, Groot Crego C, de La Harpe M, Rivera N, Barfuss MHJ, Till W, Guzmán-Jacob V, Krömer T, Lexer C, Paun O, Leroy T. Taxon-specific or universal? Using target capture to study the evolutionary history of rapid radiations. Mol Ecol Resour 2021; 22:927-945. [PMID: 34606683 PMCID: PMC9292372 DOI: 10.1111/1755-0998.13523] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 09/09/2021] [Accepted: 09/22/2021] [Indexed: 12/20/2022]
Abstract
Target capture has emerged as an important tool for phylogenetics and population genetics in nonmodel taxa. Whereas developing taxon‐specific capture probes requires sustained efforts, available universal kits may have a lower power to reconstruct relationships at shallow phylogenetic scales and within rapidly radiating clades. We present here a newly developed target capture set for Bromeliaceae, a large and ecologically diverse plant family with highly variable diversification rates. The set targets 1776 coding regions, including genes putatively involved in key innovations, with the aim to empower testing of a wide range of evolutionary hypotheses. We compare the relative power of this taxon‐specific set, Bromeliad1776, to the universal Angiosperms353 kit. The taxon‐specific set results in higher enrichment success across the entire family; however, the overall performance of both kits to reconstruct phylogenetic trees is relatively comparable, highlighting the vast potential of universal kits for resolving evolutionary relationships. For more detailed phylogenetic or population genetic analyses, for example the exploration of gene tree concordance, nucleotide diversity or population structure, the taxon‐specific capture set presents clear benefits. We discuss the potential lessons that this comparative study provides for future phylogenetic and population genetic investigations, in particular for the study of evolutionary radiations.
Collapse
Affiliation(s)
- Gil Yardeni
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | | | - Margot Paris
- Unit of Ecology & Evolution, Department of Biology, University of Fribourg, Fribourg, Switzerland
| | - Jaqueline Hess
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Department of Soil Ecology, Helmholtz Centre for Environmental Research, UFZ, Halle (Saale), Germany
| | - Clara Groot Crego
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria.,Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Marylaure de La Harpe
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Norma Rivera
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Michael H J Barfuss
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Walter Till
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Valeria Guzmán-Jacob
- Biodiversity, Macroecology and Biogeography, University of Goettingen, Göttingen, Germany
| | - Thorsten Krömer
- Centro de Investigaciones Tropicales, Universidad Veracruzana, Xalapa, Mexico
| | - Christian Lexer
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Ovidiu Paun
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Thibault Leroy
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| |
Collapse
|
35
|
Hutter CR, Cobb KA, Portik DM, Travers SL, Wood PL, Brown RM. FrogCap: A modular sequence capture probe-set for phylogenomics and population genetics for all frogs, assessed across multiple phylogenetic scales. Mol Ecol Resour 2021; 22:1100-1119. [PMID: 34569723 DOI: 10.1111/1755-0998.13517] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/08/2021] [Accepted: 09/14/2021] [Indexed: 12/01/2022]
Abstract
Despite the prevalence of high-throughput sequencing in phylogenetics, many relationships remain difficult to resolve because of conflicting signal among genomic regions. Selection of different types of molecular markers from different genomic regions is required to overcome these challenges. For evolutionary studies in frogs, we introduce the publicly available FrogCap suite of genomic resources, which is a large collection of ~15,000 markers that unifies previous genetic sequencing efforts. FrogCap is designed to be modular, such that subsets of markers and SNPs can be selected based on the desired phylogenetic scale. FrogCap uses a variety of marker types that include exons and introns, ultraconserved elements, and previously sequenced Sanger markers, which span up to 10,000 bp in alignment lengths; in addition, we demonstrate potential for SNP-based analyses. We tested FrogCap using 121 samples distributed across five phylogenetic scales, comparing probes designed using a consensus- or exemplar genome-based approach. Using the consensus design is more resilient to issues with sensitivity, specificity, and missing data than picking an exemplar genome sequence. We also tested the impact of different bait kit sizes (20,020 vs. 40,040) on depth of coverage and found triple the depth for the 20,020 bait kit. We observed sequence capture success (i.e., missing data, sequenced markers/bases, marker length, and informative sites) across phylogenetic scales. The incorporation of different marker types is effective for deep phylogenetic relationships and shallow population genetics studies. Having demonstrated FrogCap's utility and modularity, we conclude that these new resources are efficacious for high-throughput sequencing projects across variable timescales.
Collapse
Affiliation(s)
- Carl R Hutter
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA
| | - Kerry A Cobb
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA
| | - Daniel M Portik
- California Academy of Sciences, San Francisco, California, USA
| | - Scott L Travers
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA.,Department of Biological Sciences, Rutgers University-Newark, Newark, New Jersey, USA
| | - Perry L Wood
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA
| | - Rafe M Brown
- Biodiversity Institute and Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas, USA
| |
Collapse
|
36
|
Yuan Y, Zhang H, Yi G, You Z, Zhao C, Yuan H, Wang K, Li J, Yang N, Lian L. Genetic Diversity of MHC B-F/B-L Region in 21 Chicken Populations. Front Genet 2021; 12:710770. [PMID: 34484301 PMCID: PMC8414643 DOI: 10.3389/fgene.2021.710770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 07/26/2021] [Indexed: 12/15/2022] Open
Abstract
The chicken major histocompatibility complex (MHC) on chromosome 16 is the most polymorphic region across the whole genome, and also an ideal model for genetic diversity investigation. The MHC B-F/B-L region is 92 kb in length with high GC content consisting of 18 genes and one pseudogene (Blec4), which plays important roles in immune response. To evaluate polymorphism of the Chinese indigenous chickens as well as to analyze the effect of selection to genetic diversity, we used WaferGen platform to identify sequence variants of the B-F/B-L region in 21 chicken populations, including the Red Jungle Fowl (RJF), Cornish (CS), White Leghorns (WLs), 16 Chinese domestic breeds, and two well-known inbred lines 63 and 72. A total of 3,319 single nucleotide polymorphism (SNPs) and 181 INDELs in the B-F/B-L region were identified among 21 populations, of which 2,057 SNPs (62%) and 159 INDELs (88%) were novel. Most of the variants were within the intron and the flanking regions. The average variation density was 36 SNPs and 2 INDELs per kb, indicating dramatical high diversity of this region. Furthermore, BF2 was identified as the hypervariable genes with 67 SNPs per kb. Chinese domestic populations showed higher diversity than the WLs and CS. The indigenous breeds, Nandan Yao (NY), Xishuangbanna Game (XG), Gushi (GS), and Xiayan (XY) chickens, were the top four with the highest density of SNPs and INDELs. The highly inbred lines 63 and 72 have the lowest diversity, which might be resulted from a long-term intense selection for decades. Collectively, we refined the genetic map of chicken MHC B-F/B-L region, and illustrated genetic diversity of 21 chicken populations. Abundant genetic variants were identified, which not only strikingly expanded the current Ensembl SNP database, but also provided comprehensive data for researchers to further investigate association between variants in MHC and immune traits.
Collapse
Affiliation(s)
- Yiming Yuan
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Huanmin Zhang
- United States Department of Agriculture, Agricultural Research Service, Avian Disease and Oncology Laboratory, East Lansing, MI, United States
| | - Guoqiang Yi
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China
| | - Zhen You
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Chunfang Zhao
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Haixu Yuan
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Kejun Wang
- College of Animal Science and Technology, Henan Agricultural University, Zhengzhou, China
| | - Junying Li
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Ning Yang
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| | - Ling Lian
- National Engineering Laboratory for Animal Breeding and MOA Key Laboratory of Animal Genetics and Breeding, College of Animal Science and Technology, China Agricultural University, Beijing, China
| |
Collapse
|
37
|
Tsai H, Kippes N, Firl A, Lieberman M, Comai L, Henry IM. Efficient construction of a linkage map and haplotypes for Mentha suaveolens using sequence capture. G3-GENES GENOMES GENETICS 2021; 11:6321234. [PMID: 34544134 PMCID: PMC8496254 DOI: 10.1093/g3journal/jkab232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 06/25/2021] [Indexed: 11/12/2022]
Abstract
The sustainability of many crops is hindered by the lack of genomic resources and a poor understanding of natural genetic diversity. Particularly, application of modern breeding requires high-density linkage maps that are integrated into a highly contiguous reference genome. Here, we present a rapid method for deriving haplotypes and developing linkage maps, and its application to Mentha suaveolens, one of the diploid progenitors of cultivated mints. Using sequence-capture via DNA hybridization to target single nucleotide polymorphisms (SNPs), we successfully genotyped ∼5000 SNPs within the genome of >400 individuals derived from a self cross. After stringent quality control, and identification of nonredundant SNPs, 1919 informative SNPs were retained for linkage map construction. The resulting linkage map defined a total genetic space of 942.17 cM divided among 12 linkage groups, ranging from 56.32 to 122.61 cM in length. The linkage map is in good agreement with pseudomolecules from our preliminary genome assembly, proving this resource effective for the correction and validation of the reference genome. We discuss the advantages of this method for the rapid creation of linkage maps.
Collapse
Affiliation(s)
- Helen Tsai
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Nestor Kippes
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Alana Firl
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Meric Lieberman
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Luca Comai
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| | - Isabelle M Henry
- Department of Plant Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
| |
Collapse
|
38
|
Wu Y, Yan Y, Zhao Y, Gu L, Wang S, Johnson DH. Genomic bases underlying the adaptive radiation of core landbirds. BMC Ecol Evol 2021; 21:162. [PMID: 34454438 PMCID: PMC8403425 DOI: 10.1186/s12862-021-01888-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 08/10/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Core landbirds undergo adaptive radiation with different ecological niches, but the genomic bases that underlie their ecological diversification remain unclear. RESULTS Here we used the genome-wide target enrichment sequencing of the genes related to vision, hearing, language, temperature sensation, beak shape, taste transduction, and carbohydrate, protein and fat digestion and absorption to examine the genomic bases underlying their ecological diversification. Our comparative molecular phyloecological analyses show that different core landbirds present adaptive enhancement in different aspects, and two general patterns emerge. First, all three raptorial birds (Accipitriformes, Strigiformes, and Falconiformes) show a convergent adaptive enhancement for fat digestion and absorption, while non-raptorial birds tend to exhibit a promoted capability for protein and carbohydrate digestion and absorption. Using this as a molecular marker, our results show relatively strong support for the raptorial lifestyle of the common ancestor of core landbirds, consequently suggesting a single origin of raptors, followed by two secondary losses of raptorial lifestyle within core landbirds. In addition to the dietary niche, we find at temporal niche that diurnal birds tend to exhibit an adaptive enhancement in bright-light vision, while nocturnal birds show an increased adaption in dim-light vision, in line with previous findings. CONCLUSIONS Our molecular phyloecological study reveals the genome-wide adaptive differentiations underlying the ecological diversification of core landbirds.
Collapse
Affiliation(s)
- Yonghua Wu
- School of Life Sciences, Northeast Normal University, Changchun, 130024, China.
| | - Yi Yan
- School of Life Sciences, Northeast Normal University, Changchun, 130024, China
| | - Yuanqin Zhao
- School of Life Sciences, Northeast Normal University, Changchun, 130024, China
| | - Li Gu
- School of Life Sciences, Northeast Normal University, Changchun, 130024, China
| | - Songbo Wang
- Bio-Intelligence Co. Ltd, Shenzhen, 518000, China
| | - David H Johnson
- Global Owl Project, 6504 Carriage Drive, Alexandria, VA, 22310, USA.
| |
Collapse
|
39
|
Castro RAD, Borrell S, Gagneux S. The within-host evolution of antimicrobial resistance in Mycobacterium tuberculosis. FEMS Microbiol Rev 2021; 45:fuaa071. [PMID: 33320947 PMCID: PMC8371278 DOI: 10.1093/femsre/fuaa071] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Accepted: 12/11/2020] [Indexed: 12/12/2022] Open
Abstract
Tuberculosis (TB) has been responsible for the greatest number of human deaths due to an infectious disease in general, and due to antimicrobial resistance (AMR) in particular. The etiological agents of human TB are a closely-related group of human-adapted bacteria that belong to the Mycobacterium tuberculosis complex (MTBC). Understanding how MTBC populations evolve within-host may allow for improved TB treatment and control strategies. In this review, we highlight recent works that have shed light on how AMR evolves in MTBC populations within individual patients. We discuss the role of heteroresistance in AMR evolution, and review the bacterial, patient and environmental factors that likely modulate the magnitude of heteroresistance within-host. We further highlight recent works on the dynamics of MTBC genetic diversity within-host, and discuss how spatial substructures in patients' lungs, spatiotemporal heterogeneity in antimicrobial concentrations and phenotypic drug tolerance likely modulates the dynamics of MTBC genetic diversity in patients during treatment. We note the general characteristics that are shared between how the MTBC and other bacterial pathogens evolve in humans, and highlight the characteristics unique to the MTBC.
Collapse
Affiliation(s)
- Rhastin A D Castro
- Swiss Tropical and Public Health Institute, Socinstrasse 57, 4051 Basel, Basel, Switzerland
- University of Basel, Petersplatz 1, 4001 Basel, Basel, Switzerland
| | - Sonia Borrell
- Swiss Tropical and Public Health Institute, Socinstrasse 57, 4051 Basel, Basel, Switzerland
- University of Basel, Petersplatz 1, 4001 Basel, Basel, Switzerland
| | - Sebastien Gagneux
- Swiss Tropical and Public Health Institute, Socinstrasse 57, 4051 Basel, Basel, Switzerland
- University of Basel, Petersplatz 1, 4001 Basel, Basel, Switzerland
| |
Collapse
|
40
|
Alda F, Ludt WB, Elías DJ, McMahan CD, Chakrabarty P. Comparing Ultraconserved Elements and Exons for Phylogenomic Analyses of Middle American Cichlids: When Data Agree to Disagree. Genome Biol Evol 2021; 13:evab161. [PMID: 34272856 PMCID: PMC8369075 DOI: 10.1093/gbe/evab161] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/05/2021] [Indexed: 12/20/2022] Open
Abstract
Choosing among types of genomic markers to be used in a phylogenomic study can have a major influence on the cost, design, and results of a study. Yet few attempts have been made to compare categories of next-generation sequence markers limiting our ability to compare the suitability of these different genomic fragment types. Here, we explore properties of different genomic markers to find if they vary in the accuracy of component phylogenetic trees and to clarify the causes of conflict obtained from different data sets or inference methods. As a test case, we explore the causes of discordance between phylogenetic hypotheses obtained using a novel data set of ultraconserved elements (UCEs) and a recently published exon data set of the cichlid tribe Heroini. Resolving relationships among heroine cichlids has historically been difficult, and the processes of colonization and diversification in Middle America and the Greater Antilles are not yet well understood. Despite differences in informativeness and levels of gene tree discordance between UCEs and exons, the resulting phylogenomic hypotheses generally agree on most relationships. The independent data sets disagreed in areas with low phylogenetic signal that were overwhelmed by incomplete lineage sorting and nonphylogenetic signals. For UCEs, high levels of incomplete lineage sorting were found to be the major cause of gene tree discordance, whereas, for exons, nonphylogenetic signal is most likely caused by a reduced number of highly informative loci. This paucity of informative loci in exons might be due to heterogeneous substitution rates that are problematic to model (i.e., computationally restrictive) resulting in systematic errors that UCEs (being less informative individually but more uniform) are less prone to. These results generally demonstrate the robustness of phylogenomic methods to accommodate genomic markers with different biological and phylogenetic properties. However, we identify common and unique pitfalls of different categories of genomic fragments when inferring enigmatic phylogenetic relationships.
Collapse
Affiliation(s)
- Fernando Alda
- Department of Biology, Geology and Environmental Science, University of Tennessee at Chattanooga, Tennessee, USA
| | - William B Ludt
- Department of Ichthyology, Natural History Museum of Los Angeles County, Los Angeles, California, USA
| | - Diego J Elías
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | | | - Prosanta Chakrabarty
- Museum of Natural Science, Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| |
Collapse
|
41
|
Xue CX, Lin H, Zhu XY, Liu J, Zhang Y, Rowley G, Todd JD, Li M, Zhang XH. DiTing: A Pipeline to Infer and Compare Biogeochemical Pathways From Metagenomic and Metatranscriptomic Data. Front Microbiol 2021; 12:698286. [PMID: 34408730 PMCID: PMC8367434 DOI: 10.3389/fmicb.2021.698286] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 07/05/2021] [Indexed: 12/15/2022] Open
Abstract
Metagenomics and metatranscriptomics are powerful methods to uncover key micro-organisms and processes driving biogeochemical cycling in natural ecosystems. Databases dedicated to depicting biogeochemical pathways (for example, metabolism of dimethylsulfoniopropionate (DMSP), which is an abundant organosulfur compound) from metagenomic/metatranscriptomic data are rarely seen. Additionally, a recognized normalization model to estimate the relative abundance and environmental importance of pathways from metagenomic and metatranscriptomic data has not been organized to date. These limitations impact the ability to accurately relate key microbial-driven biogeochemical processes to differences in environmental conditions. Thus, an easy-to-use, specialized tool that infers and visually compares the potential for biogeochemical processes, including DMSP cycling, is urgently required. To solve these issues, we developed DiTing, a tool wrapper to infer and compare biogeochemical pathways among a set of given metagenomic or metatranscriptomic reads in one step, based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) and a manually created DMSP cycling gene database. Accurate and specific formulae for over 100 pathways were developed to calculate their relative abundance. Output reports detail the relative abundance of biogeochemical pathways in both text and graphical format. DiTing was applied to simulated metagenomic data and resulted in consistent genetic features of simulated benchmark genomic data. Subsequently, when applied to natural metagenomic and metatranscriptomic data from hydrothermal vents and the Tara Ocean project, the functional profiles predicted by DiTing were correlated with environmental condition changes. DiTing can now be confidently applied to wider metagenomic and metatranscriptomic datasets, and it is available at https://github.com/xuechunxu/DiTing.
Collapse
Affiliation(s)
- Chun-Xu Xue
- College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China
- Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
| | - Heyu Lin
- School of Earth Sciences, University of Melbourne, Parkville, VIC, Australia
| | - Xiao-Yu Zhu
- College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China
| | - Jiwen Liu
- College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China
- Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao, China
| | - Yunhui Zhang
- College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China
| | - Gary Rowley
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Jonathan D. Todd
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, United Kingdom
| | - Meng Li
- Shenzhen Key Laboratory of Marine Microbiome Engineering, Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Xiao-Hua Zhang
- College of Marine Life Sciences, and Frontiers Science Center for Deep Ocean Multispheres and Earth System, Ocean University of China, Qingdao, China
- Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China
- Institute of Evolution & Marine Biodiversity, Ocean University of China, Qingdao, China
| |
Collapse
|
42
|
Towards a more healthy conservation paradigm: integrating disease and molecular ecology to aid biological conservation †. J Genet 2021. [PMID: 33622992 PMCID: PMC7371965 DOI: 10.1007/s12041-020-01225-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Parasites, and the diseases they cause, are important from an ecological and evolutionary perspective because they can negatively affect host fitness and can regulate host populations. Consequently, conservation biology has long recognized the vital role that parasites can play in the process of species endangerment and recovery. However, we are only beginning to understand how deeply parasites are embedded in ecological systems, and there is a growing recognition of the important ways in which parasites affect ecosystem structure and function. Thus, there is an urgent need to revisit how parasites are viewed from a conservation perspective and broaden the role that disease ecology plays in conservation-related research and outcomes. This review broadly focusses on the role that disease ecology can play in biological conservation. Our review specifically emphasizes on how the integration of tools and analytical approaches associated with both disease and molecular ecology can be leveraged to aid conservation biology. Our review first concentrates on disease-mediated extinctions and wildlife epidemics. We then focus on elucidating how host–parasite interactions has improved our understanding of the eco-evolutionary dynamics affecting hosts at the individual, population, community and ecosystem scales. We believe that the role of parasites as drivers and indicators of ecosystem health is especially an exciting area of research that has the potential to fundamentally alter our view of parasites and their role in biological conservation. The review concludes with a broad overview of the current and potential applications of modern genomic tools in disease ecology to aid biological conservation.
Collapse
|
43
|
Wenzell KE, McDonnell AJ, Wickett NJ, Fant JB, Skogen KA. Incomplete reproductive isolation and low genetic differentiation despite floral divergence across varying geographic scales in Castilleja. AMERICAN JOURNAL OF BOTANY 2021; 108:1270-1288. [PMID: 34289081 DOI: 10.1002/ajb2.1700] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Accepted: 05/18/2021] [Indexed: 06/13/2023]
Abstract
PREMISE Divergence depends on the strength of selection and frequency of gene flow between taxa, while reproductive isolation relies on mating barriers and geographic distance. Less is known about how these processes interact at early stages of speciation. Here, we compared population-level differentiation in floral phenotype and genetic sequence variation among recently diverged Castilleja to explore patterns of diversification under different scenarios of reproductive isolation. METHODS Using target enrichment enabled by the Angiosperms353 probe set, we assessed genetic distance among 50 populations of four Castilleja species. We investigated whether patterns of genetic divergence are explained by floral trait variation or geographic distance in two focal groups: the widespread C. sessiliflora and the more restricted C. purpurea species complex. RESULTS We document that C. sessiliflora and the C. purpurea complex are characterized by high diversity in floral color across varying geographic scales. Despite phenotypic divergence, groups were not well supported in phylogenetic analyses, and little genetic differentiation was found across targeted Angiosperms353 loci. Nonetheless, a principal coordinate analysis of single nucleotide polymorphisms revealed differentiation within C. sessiliflora across floral morphs and geography and less differentiation among species of the C. purpurea complex. CONCLUSIONS Patterns of genetic distance in C. sessiliflora suggest species cohesion maintained over long distances despite variation in floral traits. In the C. purpurea complex, divergence in floral color across narrow geographic clines may be driven by recent selection on floral color. These contrasting patterns of floral and genetic differentiation reveal that divergence can arise via multiple eco-evolutionary paths.
Collapse
Affiliation(s)
- Katherine E Wenzell
- Program in Plant Biology and Conservation, Northwestern University, O.T. Hogan Hall, Room 6-140B, 2205 Tech Drive, Evanston, IL, 60208, USA
- Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, Glencoe, IL, 60022, USA
- John Innes Centre, Norwich Research Park, Colney Lane, Norwich, NR4 7UH, UK
| | - Angela J McDonnell
- Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, Glencoe, IL, 60022, USA
| | - Norman J Wickett
- Program in Plant Biology and Conservation, Northwestern University, O.T. Hogan Hall, Room 6-140B, 2205 Tech Drive, Evanston, IL, 60208, USA
- Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, Glencoe, IL, 60022, USA
| | - Jeremie B Fant
- Program in Plant Biology and Conservation, Northwestern University, O.T. Hogan Hall, Room 6-140B, 2205 Tech Drive, Evanston, IL, 60208, USA
- Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, Glencoe, IL, 60022, USA
| | - Krissa A Skogen
- Program in Plant Biology and Conservation, Northwestern University, O.T. Hogan Hall, Room 6-140B, 2205 Tech Drive, Evanston, IL, 60208, USA
- Negaunee Institute for Plant Conservation Science and Action, Chicago Botanic Garden, Glencoe, IL, 60022, USA
| |
Collapse
|
44
|
Sánchez-Reyes LL, Kandziora M, McTavish EJ. Physcraper: a Python package for continually updated phylogenetic trees using the Open Tree of Life. BMC Bioinformatics 2021; 22:355. [PMID: 34187366 PMCID: PMC8244228 DOI: 10.1186/s12859-021-04274-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Accepted: 06/16/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Phylogenies are a key part of research in many areas of biology. Tools that automate some parts of the process of phylogenetic reconstruction, mainly molecular character matrix assembly, have been developed for the advantage of both specialists in the field of phylogenetics and non-specialists. However, interpretation of results, comparison with previously available phylogenetic hypotheses, and selection of one phylogeny for downstream analyses and discussion still impose difficulties to one that is not a specialist either on phylogenetic methods or on a particular group of study. RESULTS Physcraper is a command-line Python program that automates the update of published phylogenies by adding public DNA sequences to underlying alignments of previously published phylogenies. It also provides a framework for straightforward comparison of published phylogenies with their updated versions, by leveraging upon tools from the Open Tree of Life project to link taxonomic information across databases. The program can be used by the nonspecialist, as a tool to generate phylogenetic hypotheses based on publicly available expert phylogenetic knowledge. Phylogeneticists and taxonomic group specialists will find it useful as a tool to facilitate molecular dataset gathering and comparison of alternative phylogenetic hypotheses (topologies). CONCLUSION The Physcraper workflow showcases the benefits of doing open science for phylogenetics, encouraging researchers to strive for better scientific sharing practices. Physcraper can be used with any OS and is released under an open-source license. Detailed instructions for installation and usage are available at https://physcraper.readthedocs.io.
Collapse
Affiliation(s)
| | - Martha Kandziora
- School of Natural Sciences, University of California, Merced, USA.,Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic
| | | |
Collapse
|
45
|
Phylogenomic reconstruction addressing the Peltigeralean backbone (Lecanoromycetes, Ascomycota). FUNGAL DIVERS 2021. [DOI: 10.1007/s13225-021-00476-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
46
|
Ren C, Wang L, Nie ZL, Johnson G, Yang QE, Wen J. Development and phylogenetic utilities of a new set of single-/low-copy nuclear genes in Senecioneae (Asteraceae), with new insights into the tribal position and the relationships within subtribe Tussilagininae. Mol Phylogenet Evol 2021; 162:107202. [PMID: 33992786 DOI: 10.1016/j.ympev.2021.107202] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 04/13/2021] [Accepted: 05/06/2021] [Indexed: 11/26/2022]
Abstract
The tribe Senecioneae is one of the largest tribes in Asteraceae, with a nearly cosmopolitan distribution. Despite great efforts devoted to elucidate the evolution of Senecioneae, many questions still remain concerning the systematics of this group, from the tribal circumscription and position to species relationships in many genera. The hybridization-based target enrichment method of next-generation sequencing has been accepted as a promising approach to resolve phylogenetic problems. We herein develop a set of single-/low-copy genes for Senecioneae, and test their phylogenetic utilities. Our results demonstrate that these genes work highly efficiently for Senecioneae, with a high average gene recovery of 98.8% across the tribe and recovering robust phylogenetic hypotheses at different levels. In particular, the delimitation of the Senecioneae has been confirmed to include Abrotanella and exclude Doronicum, with the former sister to core Senecioneae and the latter shown to be more closely related to Calenduleae. Moreover, Doronicum and Calenduleae are inferred to be the closest relatives of Senecioneae, which is a new hypothesis well supported by statistical topology tests, morphological evidence, and the profile of pyrrolizidine alkaloids, a special kind of chemical characters generally used to define Senecioneae. Furthermore, this study suggests a complex reticulation history in the diversification of Senecioneae, accounting for the prevalence of polyploid groups in the tribe. With subtribe Tussilagininae s.str. as a case study showing a more evident pattern of gene duplication, we further explored reconstructing the phylogeny in the groups with high ploidy levels. Our results also demonstrate that tree topologies based on sorted paralogous copies are stable across different methods of phylogenetic inference, and more congruent with the morphological evidence and the results of previous phylogenetic studies.
Collapse
Affiliation(s)
- Chen Ren
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China
| | - Long Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China
| | - Ze-Long Nie
- Key Laboratory of Plant Resources Conservation and Utilization, College of Biology and Environmental Sciences, Jishou University, Jishou, Hunan 416000, China
| | - Gabriel Johnson
- Department of Botany, National Museum of Natural History, MRC 166, Smithsonian Institution, Washington, DC 20013-7012, USA
| | - Qin-Er Yang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China; Key Laboratory of Digital Botanical Garden of Guangdong Province, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, Guangdong 510650, China.
| | - Jun Wen
- Department of Botany, National Museum of Natural History, MRC 166, Smithsonian Institution, Washington, DC 20013-7012, USA.
| |
Collapse
|
47
|
Toussaint EFA, Gauthier J, Bilat J, Gillett CPDT, Gough HM, Lundkvist H, Blanc M, Muñoz-Ramírez CP, Alvarez N. HyRAD-X Exome Capture Museomics Unravels Giant Ground Beetle Evolution. Genome Biol Evol 2021; 13:6275686. [PMID: 33988685 PMCID: PMC8480185 DOI: 10.1093/gbe/evab112] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/10/2021] [Indexed: 12/27/2022] Open
Abstract
Advances in phylogenomics contribute toward resolving long-standing evolutionary questions. Notwithstanding, genetic diversity contained within more than a billion biological specimens deposited in natural history museums remains recalcitrant to analysis owing to challenges posed by its intrinsically degraded nature. Yet that tantalizing resource could be critical in overcoming taxon sampling constraints hindering our ability to address major evolutionary questions. We addressed this impediment by developing phyloHyRAD, a new bioinformatic pipeline enabling locus recovery at a broad evolutionary scale from HyRAD-X exome capture of museum specimens of low DNA integrity using a benchtop RAD-derived exome-complexity-reduction probe set developed from high DNA integrity specimens. Our new pipeline can also successfully align raw RNAseq transcriptomic and ultraconserved element reads with the RAD-derived probe catalog. Using this method, we generated a robust timetree for Carabinae beetles, the lack of which had precluded study of macroevolutionary trends pertaining to their biogeography and wing-morphology evolution. We successfully recovered up to 2,945 loci with a mean of 1,788 loci across the exome of specimens of varying age. Coverage was not significantly linked to specimen age, demonstrating the wide exploitability of museum specimens. We also recovered fragmentary mitogenomes compatible with Sanger-sequenced mtDNA. Our phylogenomic timetree revealed a Lower Cretaceous origin for crown group Carabinae, with the extinct Aplothorax Waterhouse, 1841 nested within the genus Calosoma Weber, 1801 demonstrating the junior synonymy of Aplothorax syn. nov., resulting in the new combination Calosoma burchellii (Waterhouse, 1841) comb. nov. This study compellingly illustrates that HyRAD-X and phyloHyRAD efficiently provide genomic-level data sets informative at deep evolutionary scales.
Collapse
Affiliation(s)
| | | | - Julia Bilat
- Natural History Museum of Geneva, Switzerland
| | - Conrad P D T Gillett
- University of Hawai'i Insect Museum, University of Hawai'i at Mānoa, Honolulu, Hawaii, USA
| | - Harlan M Gough
- Florida Natural History Museum, University of Florida, Gainesville, Florida, USA
| | | | | | - Carlos P Muñoz-Ramírez
- Instituto de Entomología, Universidad Metropolitana de Ciencias de la Educación, Santiago, Chile.,Centro de Investigación en Biodiversidad y Ambientes Sustentables (CIBAS), Universidad Católica de la Santísima Concepción, Chile
| | - Nadir Alvarez
- Natural History Museum of Geneva, Switzerland.,Department of Genetics and Evolution, University of Geneva, Switzerland
| |
Collapse
|
48
|
Jones MR, Winkler DE, Massatti R. The demographic and ecological factors shaping diversification among rare
Astragalus
species. DIVERS DISTRIB 2021. [DOI: 10.1111/ddi.13288] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Affiliation(s)
- Matthew R. Jones
- Southwest Biological Science Center U.S. Geological Survey Flagstaff AZ USA
| | - Daniel E. Winkler
- Southwest Biological Science Center U.S. Geological Survey Moab UT USA
| | - Rob Massatti
- Southwest Biological Science Center U.S. Geological Survey Flagstaff AZ USA
| |
Collapse
|
49
|
Cvetković T, Areces-Berazain F, Hinsinger DD, Thomas DC, Wieringa JJ, Ganesan SK, Strijk JS. Phylogenomics resolves deep subfamilial relationships in Malvaceae s.l. G3-GENES GENOMES GENETICS 2021; 11:6248091. [PMID: 33892500 PMCID: PMC8496235 DOI: 10.1093/g3journal/jkab136] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 04/11/2021] [Indexed: 11/22/2022]
Abstract
Malvaceae s.l., the most diverse family within Malvales, includes well-known species of great economic importance like cotton, cacao, and durian. Despite numerous phylogenetic analyses employing multiple markers, relationships between several of its nine subfamilies, particularly within the largest lineage /Malvadendrina, remain unclear. In this study, we attempted to resolve the relationships within the major clades of Malvaceae s.l. using plastid genomes of 48 accessions representing all subfamilies. Maximum likelihood and Bayesian analyses recovered a fully resolved and well-supported topology confirming the split of the family into /Byttneriina (/Grewioideae +/Byttnerioideae) and /Malvadendrina. Within /Malvadendrina, /Helicteroideae occupied the earliest branching position, followed by /Sterculioideae, /Brownlowioideae, /Tiliodeae, and /Dombeyoideae formed a clade sister to /Malvatheca (/Malvoideae +/Bombacoideae), a grouping morphologically supported by the lack of androgynophore. Results from dating analyses suggest that all subfamilies originated during hot or warm phases in the Late Cretaceous to Paleocene. This study presents a well-supported phylogenetic framework for Malvaceae s.l. that will aid downstream revisions and evolutionary studies of this economically important plant family.
Collapse
Affiliation(s)
- Tijana Cvetković
- Institute of Parasitology, Biology Centre CAS, Branišovská 1160/31, 37005 České Budějovice, Czech Republic.,Biodiversity Genomics Team, Guangxi University, Daxuedonglu 100, Nanning, Guangxi, 530005, China
| | - Fabiola Areces-Berazain
- Alliance for Conservation Tree Genomics, Pha Tad Ke Botanical Garden, PO Box 959, 06000 Luang Prabang, Laos.,Herbarium UPRRP, Department of Biology, University of Puerto Rico, San Juan PR 00925-2537, United States of America
| | - Damien D Hinsinger
- Alliance for Conservation Tree Genomics, Pha Tad Ke Botanical Garden, PO Box 959, 06000 Luang Prabang, Laos.,Université Paris-Saclay, INRAE, Etude du Polymorphisme des Génomes Végétaux (EPGV), 91000 Evry, France
| | - Daniel C Thomas
- National Parks Board, Singapore Botanic Gardens, 1 Cluny Road, Singapore 259569, Singapore
| | - Jan J Wieringa
- Naturalis Biodiversity Center, Darwinweg 2, 2333 CR, Leiden, the Netherlands
| | - Santhana K Ganesan
- National Parks Board, Singapore Botanic Gardens, 1 Cluny Road, Singapore 259569, Singapore
| | - Joeri S Strijk
- Institute of Parasitology, Biology Centre CAS, Branišovská 1160/31, 37005 České Budějovice, Czech Republic.,Alliance for Conservation Tree Genomics, Pha Tad Ke Botanical Garden, PO Box 959, 06000 Luang Prabang, Laos.,Institute for Biodiversity and Environmental Research, Universiti Brunei Darussalam, Jalan Tungku Link, BE1410, Brunei Darussalam
| |
Collapse
|
50
|
Schweizer RM, Saarman N, Ramstad KM, Forester BR, Kelley JL, Hand BK, Malison RL, Ackiss AS, Watsa M, Nelson TC, Beja-Pereira A, Waples RS, Funk WC, Luikart G. Big Data in Conservation Genomics: Boosting Skills, Hedging Bets, and Staying Current in the Field. J Hered 2021; 112:313-327. [PMID: 33860294 DOI: 10.1093/jhered/esab019] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 04/13/2021] [Indexed: 02/07/2023] Open
Abstract
A current challenge in the fields of evolutionary, ecological, and conservation genomics is balancing production of large-scale datasets with additional training often required to handle such datasets. Thus, there is an increasing need for conservation geneticists to continually learn and train to stay up-to-date through avenues such as symposia, meetings, and workshops. The ConGen meeting is a near-annual workshop that strives to guide participants in understanding population genetics principles, study design, data processing, analysis, interpretation, and applications to real-world conservation issues. Each year of ConGen gathers a diverse set of instructors, students, and resulting lectures, hands-on sessions, and discussions. Here, we summarize key lessons learned from the 2019 meeting and more recent updates to the field with a focus on big data in conservation genomics. First, we highlight classical and contemporary issues in study design that are especially relevant to working with big datasets, including the intricacies of data filtering. We next emphasize the importance of building analytical skills and simulating data, and how these skills have applications within and outside of conservation genetics careers. We also highlight recent technological advances and novel applications to conservation of wild populations. Finally, we provide data and recommendations to support ongoing efforts by ConGen organizers and instructors-and beyond-to increase participation of underrepresented minorities in conservation and eco-evolutionary sciences. The future success of conservation genetics requires both continual training in handling big data and a diverse group of people and approaches to tackle key issues, including the global biodiversity-loss crisis.
Collapse
Affiliation(s)
- Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, MT
| | - Norah Saarman
- Department of Biology, Utah State University, Logan, UT
| | - Kristina M Ramstad
- Department of Biology and Geology, University of South Carolina Aiken, Aiken, SC
| | | | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, WA
| | - Brian K Hand
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Rachel L Malison
- Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Amanda S Ackiss
- Wisconsin Cooperative Fishery Research Unit, University of Wisconsin Stevens Point, Stevens Point, WI
| | | | | | - Albano Beja-Pereira
- Centro de Investigação em Biodiversidade e Recursos Genéticos (CIBIO-UP), InBIO, Universidade do Porto, Vairão, Portugal.,DGAOT, Faculty of Sciences, University of Porto, Porto, Portugal.,Sustainable Agrifood Production Research Centre (GreenUPorto), Faculty of Sciences, University of Porto, Porto, Portugal
| | - Robin S Waples
- Northwest Fisheries Science Center, NOAA Fisheries, Seattle, WA
| | - W Chris Funk
- Department of Biology, Graduate Degree Program in Ecology, Colorado State University, Fort Collins, CO
| | - Gordon Luikart
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| |
Collapse
|