1
|
Thompson NF, Sutherland BJG, Green TJ, Delomas TA. A free lunch: microhaplotype discovery in an existing amplicon panel improves parentage assignment for the highly polymorphic Pacific oyster. G3 (BETHESDA, MD.) 2024:jkae280. [PMID: 39700397 DOI: 10.1093/g3journal/jkae280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Accepted: 11/12/2024] [Indexed: 12/21/2024]
Abstract
Amplicon panels using genotyping by sequencing methods are now common, but have focused on characterizing SNP markers. We investigate how microhaplotype (MH) discovery within a recently developed Pacific oyster (Magallana gigas) amplicon panel could increase the statistical power for relationship assignment. Trios (offspring and two parents) from three populations in a newly established breeding program were genotyped on a 592 locus panel. After processing, 92% of retained amplicons contained polymorphic MH variants and 85% of monomorphic SNP markers contained MH variation. The increased allelic richness resulted in substantially improved power for relationship assignment with much lower estimated false positive rates. No substantive differences in assignment accuracy occurred between SNP and MH datasets, but using MHs increased the separation in log-likelihood values between true parents and highly related potential parents (aunts and uncles). A high number of Mendelian incompatibilities among trios were observed, likely due to null alleles. Further development of a MH panel, including removing loci with high rates of null alleles, would enable high-throughput genotyping by reducing panel size and therefore cost for Pacific oyster research and breeding programs.
Collapse
Affiliation(s)
- Neil F Thompson
- Pacific Shellfish Research Unit, USDA Agricultural Research Service, 2030 SE Marine Science Drive, Newport, OR 97365, USA
| | - Ben J G Sutherland
- Sutherland Bioinformatics, Lantzville, BC V0R 2H0, Canada
- Faculty of Science and Technology, Vancouver Island University, 900 Fifth Street, Nanaimo, BC V9R 5S5, Canada
| | - Timothy J Green
- Faculty of Science and Technology, Vancouver Island University, 900 Fifth Street, Nanaimo, BC V9R 5S5, Canada
| | - Thomas A Delomas
- National Cold Water Marine Aquaculture Center, USDA Agricultural Research Service, 483 CBLS, 120 Flagg Road, Kingston, RI 02881, USA
| |
Collapse
|
2
|
Montana L, Bringloe TT, Bourret A, Sauvé C, Mosnier A, Ferguson SH, Postma L, Lesage V, Watt CA, Hammill MO, Parent GJ. Reduced Representation and Whole-Genome Sequencing Approaches Highlight Beluga Whale Populations Associated to Eastern Canada Summer Aggregations. Evol Appl 2024; 17:e70058. [PMID: 39703673 PMCID: PMC11655672 DOI: 10.1111/eva.70058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 11/01/2024] [Accepted: 11/24/2024] [Indexed: 12/21/2024] Open
Abstract
Effective conservation strategies inherently depend on preserving populations, which in turn requires accurate tools for their detection. Beluga whales (Delphinapterus leucas) inhabit the circumpolar Arctic and form discrete summer aggregations. Previous genetic studies using mitochondrial and microsatellite loci have delineated distinct populations associated to summer aggregations but the extent of dispersal and interbreeding among these populations remains largely unknown. Such information is essential for the conservation of populations in Canada as some are endangered and harvested for subsistence by Inuit communities. Here, we used reduced representation and whole-genome sequencing approaches to characterize population structure of beluga whales in eastern Canada and examine admixture between populations. A total of 905 beluga whales sampled between 1989 and 2021 were genotyped. Six main genomic clusters, with potential subclusters, were identified using multiple proxies for population structure. Most of the six main genomic clusters were consistent with previously identified populations, except in southeast Hudson Bay where two clusters were identified. Beluga summer aggregations may consequently be comprised of more than one distinct population. A low number of dispersers were identified between summer aggregations and limited interbreeding was detected between the six genomic clusters. Our work highlights the value of genomic approaches to improve our understanding of population structure and reproductive behavior in beluga whales, offering insights applicable to other cetacean species of conservation concern. An expansion of the geographical scope and increase in number of genotyped individuals will, however, be needed to improve the characterization of the finer scale structure and of the extent of admixture between populations.
Collapse
Affiliation(s)
- Luca Montana
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Trevor T. Bringloe
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Audrey Bourret
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Caroline Sauvé
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Arnaud Mosnier
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Steven H. Ferguson
- Freshwater InstituteFisheries and Oceans CanadaWinnipegManitobaCanada
- Department of Biological SciencesUniversity of ManitobaWinnipegManitobaCanada
| | - Lianne Postma
- Freshwater InstituteFisheries and Oceans CanadaWinnipegManitobaCanada
| | - Véronique Lesage
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Cortney A. Watt
- Freshwater InstituteFisheries and Oceans CanadaWinnipegManitobaCanada
- Department of Biological SciencesUniversity of ManitobaWinnipegManitobaCanada
| | - Mike O. Hammill
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| | - Geneviève J. Parent
- Maurice Lamontagne InstituteFisheries and Oceans CanadaMont‐JoliQuebecCanada
| |
Collapse
|
3
|
Parsons KM, May SA, Gold Z, Dahlheim M, Gabriele C, Straley JM, Moran JR, Goetz K, Zerbini AN, Park L, Morin PA. Using eDNA to Supplement Population Genetic Analyses for Cryptic Marine Species: Identifying Population Boundaries for Alaska Harbour Porpoises. Mol Ecol 2024:e17563. [PMID: 39450613 DOI: 10.1111/mec.17563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 09/10/2024] [Accepted: 10/07/2024] [Indexed: 10/26/2024]
Abstract
Isolation by distance and biogeographical boundaries define patterns of population genetic structure for harbour porpoise along the Pacific coast from California to British Columbia. Until recently, inadequate sample sizes in many regions constrained efforts to characterise population genetic structure throughout the coastal waters of Alaska. Here, tissue samples from beachcast strandings and fisheries bycatch were supplemented with targeted environmental DNA (eDNA) samples in key regions of Alaska coastal and inland waters. Using a geographically explicit, hierarchical approach, we examined the genetic structure of Alaska harbour porpoises, using both mitochondrial DNA (mtDNA) sequence data and multilocus SNP genotypes. Despite a lack of evidence of genetic differentiation from nuclear SNP loci, patterns of relatedness and genetic differentiation from mtDNA suggest natal philopatry at multiple geographic scales, with limited gene flow among sites possibly mediated by male dispersal. A priori clustering of sampled areas at an intermediate scale (eastern and western Bering Sea, Gulf of Alaska and Southeast Alaska) best explained the genetic variance (12.37%) among regions. In addition, mtDNA differentiation between the Gulf of Alaska and eastern Bering Sea, and among regions within the Gulf of Alaska, indicated significant genetic structuring of harbour porpoise populations in Southeast Alaska. The targeted collection of eDNA samples from strata within Southeast Alaska was key for elevating the statistical power of our mtDNA dataset, and findings indicate limited dispersal between neighbouring strata within coastal and inland waters. These results provide evidence supporting a population boundary within the currently recognised Southeast Alaska Stock. Together, these findings will prove useful for ongoing management efforts to reduce fisheries conflict and conserve genetic diversity in this iconic coastal species.
Collapse
Affiliation(s)
- Kim M Parsons
- Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
| | - Samuel A May
- College of Fisheries and Ocean Sciences, University of Alaska Fairbanks, Fairbanks, Alaska, USA
| | - Zachary Gold
- Ocean Molecular Ecology Group, Pacific Marine Environmental Laboratory, Oceanic and Atmospheric Research, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
| | - Marilyn Dahlheim
- Marine Mammal Laboratory, Alaska Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
- Retired
| | | | - Janice M Straley
- Retired
- Applied Fisheries, University of Alaska Southeast, Sitka, Alaska, USA
| | - John R Moran
- Auke Bay Laboratories, Alaska Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Juneau, Alaska, USA
| | - Kimberly Goetz
- Marine Mammal Laboratory, Alaska Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
| | - Alexandre N Zerbini
- Cooperative Institute for Climate, Ocean and Ecosystem Studies (CICOES), University of Washington, Seattle, Washington, USA
- Marine Mammal Laboratory, Alaska Fisheries Science Center, NOAA, Seattle, Washington, USA
- Marine Ecology and Telemetry Research, Seabeck, Washington, USA
| | - Linda Park
- Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, Seattle, Washington, USA
- Retired
| | - Phillip A Morin
- Marine Mammal Genetics Program, Southwest Fisheries Science Center, National Marine Fisheries Service, National Oceanic and Atmospheric Administration, La Jolla, California, USA
| |
Collapse
|
4
|
Hargrove JS, Delomas TA, Powell JH, Hess JE, Narum SR, Campbell MR. Efficient population representation with more genetic markers increases performance of a steelhead (Oncorhynchus mykiss) genetic stock identification baseline. Evol Appl 2024; 17:e13610. [PMID: 38343774 PMCID: PMC10853585 DOI: 10.1111/eva.13610] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 10/04/2023] [Accepted: 10/16/2023] [Indexed: 10/28/2024] Open
Abstract
Genetic stock identification (GSI) is an important fisheries management tool to identify the origin of fish harvested in mixed stock fisheries. Periodic updates of genetic baselines can improve performance via the addition of unsampled or under-sampled populations and the inclusion of more informative markers. We used a combination of baselines to evaluate how population representation, marker number, and marker type affected the performance and accuracy of genetic stock assignments (self-assignment, bias, and holdout group tests) for steelhead (Oncorhynchus mykiss) in the Snake River basin. First, we compared the performance of an existing genetic baseline with a newly developed one which had a reduced number of individuals from more populations using the same set of markers. Self-assignment rates were significantly higher (p < 0.001; +5.4%) for the older, larger baseline, bias did not differ significantly between the two, but there was a significant improvement in performance for the new baseline in holdout results (p < 0.001; mean increase of 25.0%). Second, we compared the performance of the new baseline with increased numbers of genetic markers (~2x increase of single-nucleotide polymorphisms; SNPs) for the same set of baseline individuals. In this comparison, results produced significantly higher rates of self-assignment (p < 0.001; +9.7%) but neither bias nor leave-one-out were significantly affected. Third, we compared 334 SNPs versus opportunistically discovered microhaplotypes from the same amplicons for the new baseline, and showed the latter produced significantly higher rates of self-assignment (p < 0.01; +2.6%), similar bias, but slightly lower holdout performance (-0.1%). Combined, we show the performance of genetic baselines can be improved via representative and efficient sampling, that increased marker number consistently improved performance over the original baseline, and that opportunistic discovery of microhaplotypes can lead to small improvements in GSI performance.
Collapse
Affiliation(s)
| | - Thomas A. Delomas
- U.S. Department of AgricultureAgricultural Research Service National Cold Water Marine Aquaculture CenterKingstonRhode IslandUSA
| | | | - Jon E. Hess
- Columbia River Inter‐Tribal Fish CommissionPortlandOregonUSA
| | - Shawn R. Narum
- Columbia River Inter‐Tribal Fish CommissionHagermanIdahoUSA
| | | |
Collapse
|
5
|
Hopken MW, Piaggio AJ, Abdo Z, Chipman RB, Mankowski CP, Nelson KM, Hilton MS, Thurber C, Tsuchiya MTN, Maldonado JE, Gilbert AT. Are rabid raccoons ( Procyon lotor) ready for the rapture? Determining the geographic origin of rabies virus-infected raccoons using RADcapture and microhaplotypes. Evol Appl 2023; 16:1937-1955. [PMID: 38143904 PMCID: PMC10739080 DOI: 10.1111/eva.13613] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 09/06/2023] [Accepted: 10/18/2023] [Indexed: 12/26/2023] Open
Abstract
North America is recognized for the exceptional richness of rabies virus (RV) wildlife reservoir species. Management of RV is accomplished through vaccination targeting mesocarnivore reservoir populations, such as the raccoon (Procyon lotor) in Eastern North America. Raccoons are a common generalist species, and populations may reach high densities in developed areas, which can result in contact with humans and pets with potential exposures to the raccoon variant of RV throughout the eastern United States. Understanding the spatial movement of RV by raccoon populations is important for monitoring and refining strategies supporting the landscape-level control and local elimination of this lethal zoonosis. We developed a high-throughput genotyping panel for raccoons based on hundreds of microhaplotypes to identify population structure and genetic diversity relevant to rabies management programs. Throughout the eastern United States, we identified hierarchical population genetic structure with clusters that were connected through isolation-by-distance. We also illustrate that this genotyping approach can be used to support real-time management priorities by identifying the geographic origin of a rabid raccoon that was collected in an area of the United States that had been raccoon RV-free for 8 years. The results from this study and the utility of the microhaplotype panel and genotyping method will provide managers with information on raccoon ecology that can be incorporated into future management decisions.
Collapse
Affiliation(s)
- Matthew W. Hopken
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Wildlife Research CenterFort CollinsColoradoUSA
- Department of Microbiology, Immunology, and PathologyColorado State UniversityFort CollinsColoradoUSA
| | - Antoinette J. Piaggio
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Wildlife Research CenterFort CollinsColoradoUSA
| | - Zaid Abdo
- Department of Microbiology, Immunology, and PathologyColorado State UniversityFort CollinsColoradoUSA
| | - Richard B. Chipman
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Rabies Management ProgramConcordNew HampshireUSA
| | - Clara P. Mankowski
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Wildlife Research CenterFort CollinsColoradoUSA
- Department of Microbiology, Immunology, and PathologyColorado State UniversityFort CollinsColoradoUSA
| | - Kathleen M. Nelson
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Rabies Management ProgramConcordNew HampshireUSA
| | - Mikaela Samsel Hilton
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Wildlife Research CenterFort CollinsColoradoUSA
| | - Christine Thurber
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Rabies Management ProgramConcordNew HampshireUSA
| | - Mirian T. N. Tsuchiya
- Data Science Lab, Office of the Chief Information OfficerSmithsonian InstitutionWashingtonDCUSA
- Center for Conservation GenomicsSmithsonian National Zoo and Conservation Biology InstituteWashingtonDCUSA
| | - Jesús E. Maldonado
- Center for Conservation GenomicsSmithsonian National Zoo and Conservation Biology InstituteWashingtonDCUSA
| | - Amy T. Gilbert
- United States Department of Agriculture, Animal and Plant Health Inspection Service, Wildlife ServicesNational Wildlife Research CenterFort CollinsColoradoUSA
| |
Collapse
|
6
|
Delomas TA, Willis SC. Estimating microhaplotype allele frequencies from low-coverage or pooled sequencing data. BMC Bioinformatics 2023; 24:415. [PMID: 37923981 PMCID: PMC10623847 DOI: 10.1186/s12859-023-05554-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 10/30/2023] [Indexed: 11/06/2023] Open
Abstract
BACKGROUND Microhaplotypes have the potential to be more cost-effective than SNPs for applications that require genetic panels of highly variable loci. However, development of microhaplotype panels is hindered by a lack of methods for estimating microhaplotype allele frequency from low-coverage whole genome sequencing or pooled sequencing (pool-seq) data. RESULTS We developed new methods for estimating microhaplotype allele frequency from low-coverage whole genome sequence and pool-seq data. We validated these methods using datasets from three non-model organisms. These methods allowed estimation of allele frequency and expected heterozygosity at depths routinely achieved from pooled sequencing. CONCLUSIONS These new methods will allow microhaplotype panels to be designed using low-coverage WGS and pool-seq data to discover and evaluate candidate loci. The python script implementing the two methods and documentation are available at https://www.github.com/delomast/mhFromLowDepSeq .
Collapse
Affiliation(s)
- Thomas A Delomas
- Agricultural Research Service, United States Department of Agriculture, National Cold Water Marine Aquaculture Center, 483 CBLS, 120 Flagg Road, Kingston, RI, 02881, USA.
| | - Stuart C Willis
- Hagerman Genetics Laboratory, Columbia River Inter-Tribal Fish Commission, Hagerman, ID, USA
| |
Collapse
|
7
|
Penaud B, Laurent B, Milhes M, Noüs C, Ehrenmann F, Dutech C. SNP4OrphanSpecies: A bioinformatics pipeline to isolate molecular markers for studying genetic diversity of orphan species. Biodivers Data J 2022; 10:e85587. [PMID: 36761595 PMCID: PMC9848450 DOI: 10.3897/bdj.10.e85587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 06/23/2022] [Indexed: 11/12/2022] Open
Abstract
Background For several decades, an increase in disease or pest emergences due to anthropogenic introduction or environmental changes has been recorded. This increase leads to serious threats to the genetic and species diversity of numerous ecosystems. Many of these events involve species with poor or no genomic resources (called here "orphan species"). This lack of resources is a serious limitation to our understanding of the origin of emergent populations, their ability to adapt to new environments and to predict future consequences to biodiversity. Analyses of genetic diversity are an efficient method to obtain this information rapidly, but require available polymorphic genetic markers. New information We developed a generic bioinformatics pipeline to rapidly isolate such markers with the goal for the pipeline to be applied in studies of invasive taxa from different taxonomic groups, with a special focus on forest fungal pathogens and insect pests. This pipeline is based on: 1) an automated de novo genome assembly obtained from shotgun whole genome sequencing using paired-end Illumina technology; 2) the isolation of single-copy genes conserved in species related to the studied emergent organisms; 3) primer development for multiplexed short sequences obtained from these conserved genes. Previous studies have shown that intronic regions of these conserved genes generally contain several single nucleotide polymorphisms within species. The pipeline's functionality was evaluated with sequenced genomes of five invasive or expanding pathogen and pest species in Europe (Armillariaostoyae (Romagn.) Herink 1973, Bursaphelenchusxylophilus Steiner & Buhrer 1934, Sphaeropsissapinea (fr.) Dicko & B. Sutton 1980, Erysiphealphitoides (Griffon & Maubl.) U. Braun & S. Takam. 2000, Thaumetopoeapityocampa Denis & Schiffermüller, 1775). We successfully isolated several pools of one hundred short gene regions for each assembled genome, which can be amplified in multiplex. The bioinformatics pipeline is user-friendly and requires little computational resources. This easy-to-set-up and run method for genetic marker identification will be useful for numerous laboratories studying biological invasions, but with limited resources and expertise in bioinformatics.
Collapse
Affiliation(s)
- Benjamin Penaud
- BIOGECO, INRAE, Univ. Bordeaux, 33610 Cestas, FranceBIOGECO, INRAE, Univ. Bordeaux33610 CestasFrance
| | - Benoit Laurent
- BIOGECO, INRAE, Univ. Bordeaux, 33610 Cestas, FranceBIOGECO, INRAE, Univ. Bordeaux33610 CestasFrance
| | - Marine Milhes
- INRAE, US 1426, GeT-PlaGe, Genotoul, Castanet-Tolosan, FranceINRAE, US 1426, GeT-PlaGe, GenotoulCastanet-TolosanFrance
| | - Camille Noüs
- Laboratoire Cogitamus, Bordeaux, FranceLaboratoire CogitamusBordeauxFrance
| | - François Ehrenmann
- BIOGECO, INRAE, Univ. Bordeaux, 33610 Cestas, FranceBIOGECO, INRAE, Univ. Bordeaux33610 CestasFrance
| | - Cyril Dutech
- BIOGECO, INRAE, Univ. Bordeaux, 33610 Cestas, FranceBIOGECO, INRAE, Univ. Bordeaux33610 CestasFrance
| |
Collapse
|
8
|
Capblancq T, Forester BR. Redundancy analysis: A Swiss Army Knife for landscape genomics. Methods Ecol Evol 2021. [DOI: 10.1111/2041-210x.13722] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
9
|
Pakstis AJ, Gandotra N, Speed WC, Murtha M, Scharfe C, Kidd KK. The population genetics characteristics of a 90 locus panel of microhaplotypes. Hum Genet 2021; 140:1753-1773. [PMID: 34643790 PMCID: PMC8553733 DOI: 10.1007/s00439-021-02382-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 09/30/2021] [Indexed: 12/26/2022]
Abstract
Single-nucleotide polymorphisms (SNPs) and small genomic regions with multiple SNPs (microhaplotypes, MHs) are rapidly emerging as novel forensic investigative tools to assist in individual identification, kinship analyses, ancestry inference, and deconvolution of DNA mixtures. Here, we analyzed information for 90 microhaplotype loci in 4009 individuals from 79 world populations in 6 major biogeographic regions. The study included multiplex microhaplotype sequencing (mMHseq) data analyzed for 524 individuals from 16 populations and genotype data for 3485 individuals from 63 populations curated from public repositories. Analyses of the 79 populations revealed excellent characteristics for this 90-plex MH panel for various forensic applications achieving an overall average effective number of allele values (Ae) of 4.55 (range 1.04–19.27) for individualization and mixture deconvolution. Population-specific random match probabilities ranged from a low of 10–115 to a maximum of 10–66. Mean informativeness (In) for ancestry inference was 0.355 (range 0.117–0.883). 65 novel SNPs were detected in 39 of the MHs using mMHseq. Of the 3018 different microhaplotype alleles identified, 1337 occurred at frequencies > 5% in at least one of the populations studied. The 90-plex MH panel enables effective differentiation of population groupings for major biogeographic regions as well as delineation of distinct subgroupings within regions. Open-source, web-based software is available to support validation of this technology for forensic case work analysis and to tailor MH analysis for specific geographical regions.
Collapse
Affiliation(s)
- Andrew J Pakstis
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Neeru Gandotra
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - William C Speed
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Michael Murtha
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Curt Scharfe
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA
| | - Kenneth K Kidd
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06520, USA.
| |
Collapse
|
10
|
Schweizer RM, Saarman N, Ramstad KM, Forester BR, Kelley JL, Hand BK, Malison RL, Ackiss AS, Watsa M, Nelson TC, Beja-Pereira A, Waples RS, Funk WC, Luikart G. Big Data in Conservation Genomics: Boosting Skills, Hedging Bets, and Staying Current in the Field. J Hered 2021; 112:313-327. [PMID: 33860294 DOI: 10.1093/jhered/esab019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 04/13/2021] [Indexed: 02/07/2023] Open
Abstract
A current challenge in the fields of evolutionary, ecological, and conservation genomics is balancing production of large-scale datasets with additional training often required to handle such datasets. Thus, there is an increasing need for conservation geneticists to continually learn and train to stay up-to-date through avenues such as symposia, meetings, and workshops. The ConGen meeting is a near-annual workshop that strives to guide participants in understanding population genetics principles, study design, data processing, analysis, interpretation, and applications to real-world conservation issues. Each year of ConGen gathers a diverse set of instructors, students, and resulting lectures, hands-on sessions, and discussions. Here, we summarize key lessons learned from the 2019 meeting and more recent updates to the field with a focus on big data in conservation genomics. First, we highlight classical and contemporary issues in study design that are especially relevant to working with big datasets, including the intricacies of data filtering. We next emphasize the importance of building analytical skills and simulating data, and how these skills have applications within and outside of conservation genetics careers. We also highlight recent technological advances and novel applications to conservation of wild populations. Finally, we provide data and recommendations to support ongoing efforts by ConGen organizers and instructors-and beyond-to increase participation of underrepresented minorities in conservation and eco-evolutionary sciences. The future success of conservation genetics requires both continual training in handling big data and a diverse group of people and approaches to tackle key issues, including the global biodiversity-loss crisis.
Collapse
Affiliation(s)
- Rena M Schweizer
- Division of Biological Sciences, University of Montana, Missoula, MT
| | - Norah Saarman
- Department of Biology, Utah State University, Logan, UT
| | - Kristina M Ramstad
- Department of Biology and Geology, University of South Carolina Aiken, Aiken, SC
| | | | - Joanna L Kelley
- School of Biological Sciences, Washington State University, Pullman, WA
| | - Brian K Hand
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Rachel L Malison
- Flathead Lake Biological Station, University of Montana, Polson, MT
| | - Amanda S Ackiss
- Wisconsin Cooperative Fishery Research Unit, University of Wisconsin Stevens Point, Stevens Point, WI
| | | | | | - Albano Beja-Pereira
- Centro de Investigação em Biodiversidade e Recursos Genéticos (CIBIO-UP), InBIO, Universidade do Porto, Vairão, Portugal.,DGAOT, Faculty of Sciences, University of Porto, Porto, Portugal.,Sustainable Agrifood Production Research Centre (GreenUPorto), Faculty of Sciences, University of Porto, Porto, Portugal
| | - Robin S Waples
- Northwest Fisheries Science Center, NOAA Fisheries, Seattle, WA
| | - W Chris Funk
- Department of Biology, Graduate Degree Program in Ecology, Colorado State University, Fort Collins, CO
| | - Gordon Luikart
- Division of Biological Sciences, University of Montana, Missoula, MT.,Flathead Lake Biological Station, University of Montana, Polson, MT
| |
Collapse
|