1
|
O'Reilly GD, Manlik O, Vardeh S, Sinclair J, Cannell B, Lawler ZP, Sherwin WB. A new method for ecologists to estimate heterozygote excess and deficit for multi-locus gene families. Ecol Evol 2024; 14:e11561. [PMID: 39045501 PMCID: PMC11264353 DOI: 10.1002/ece3.11561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 05/25/2024] [Accepted: 05/31/2024] [Indexed: 07/25/2024] Open
Abstract
The fixation index, F IS, has been a staple measure to detect selection, or departures from random mating in populations. However, current Next Generation Sequencing (NGS) cannot easily estimate F IS, in multi-locus gene families that contain multiple loci having similar or identical arrays of variant sequences of ≥1 kilobase (kb), which differ at multiple positions. In these families, high-quality short-read NGS data typically identify variants, but not the genomic location, which is required to calculate F IS (based on locus-specific observed and expected heterozygosity). Thus, to assess assortative mating, or selection on heterozygotes, from NGS of multi-locus gene families, we need a method that does not require knowledge of which variants are alleles at which locus in the genome. We developed such a method. Like F IS, our novel measure, 1 H IS, is based on the principle that positive assortative mating, or selection against heterozygotes, and some other processes reduce within-individual variability relative to the population. We demonstrate high accuracy of 1 H IS on a wide range of simulated scenarios and two datasets from natural populations of penguins and dolphins. 1 H IS is important because multi-locus gene families are often involved in assortative mating or selection on heterozygotes. 1 H IS is particularly useful for multi-locus gene families, such as toll-like receptors, the major histocompatibility complex in animals, homeobox genes in fungi and self-incompatibility genes in plants.
Collapse
Affiliation(s)
- Gabe D. O'Reilly
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
- Department of BioinformaticsUniversity of North Carolina at CharlotteCharlotteNorth CarolinaUSA
| | - Oliver Manlik
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
- Biology DepartmentUnited Arab Emirates UniversityAl Ain, Abu DhabiUAE
| | - Sandra Vardeh
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
- Bundesamt für NaturschutzBonnNordrhein‐WestfalenGermany
| | - Jennifer Sinclair
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
- Cape Bernier VineyardBream CreekTasmaniaAustralia
| | - Belinda Cannell
- Oceans Institute/School of Biological SciencesUniversity of Western AustraliaCrawleyWestern AustraliaAustralia
- School of Environmental and Conservation SciencesMurdoch UniversityMurdochWestern AustraliaAustralia
| | - Zachary P. Lawler
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
- The University of NewcastleNewcastleNew South WalesAustralia
| | - William B. Sherwin
- Evolution and Ecology Research Centre, School of Biological Earth and Environmental ScienceUniversity of New South WalesSydneyNew South WalesAustralia
| |
Collapse
|
2
|
Nielsen ES, Walkes S, Sones JL, Fenberg PB, Paz-García DA, Cameron BB, Grosberg RK, Sanford E, Bay RA. Pushed waves, trailing edges, and extreme events: Eco-evolutionary dynamics of a geographic range shift in the owl limpet, Lottia gigantea. GLOBAL CHANGE BIOLOGY 2024; 30:e17414. [PMID: 39044553 DOI: 10.1111/gcb.17414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Revised: 06/21/2024] [Accepted: 06/21/2024] [Indexed: 07/25/2024]
Abstract
As climatic variation re-shapes global biodiversity, understanding eco-evolutionary feedbacks during species range shifts is of increasing importance. Theory on range expansions distinguishes between two different forms: "pulled" and "pushed" waves. Pulled waves occur when the source of the expansion comes from low-density peripheral populations, while pushed waves occur when recruitment to the expanding edge is supplied by high-density populations closer to the species' core. How extreme events shape pushed/pulled wave expansion events, as well as trailing-edge declines/contractions, remains largely unexplored. We examined eco-evolutionary responses of a marine invertebrate (the owl limpet, Lottia gigantea) that increased in abundance during the 2014-2016 marine heatwaves near the poleward edge of its geographic range in the northeastern Pacific. We used whole-genome sequencing from 19 populations across >11 degrees of latitude to characterize genomic variation, gene flow, and demographic histories across the species' range. We estimated present-day dispersal potential and past climatic stability to identify how contemporary and historical seascape features shape genomic characteristics. Consistent with expectations of a pushed wave, we found little genomic differentiation between core and leading-edge populations, and higher genomic diversity at range edges. A large and well-mixed population in the northern edge of the species' range is likely a result of ocean current anomalies increasing larval settlement and high-dispersal potential across biogeographic boundaries. Trailing-edge populations have higher differentiation from core populations, possibly driven by local selection and limited gene flow, as well as high genomic diversity likely as a result of climatic stability during the Last Glacial Maximum. Our findings suggest that extreme events can drive poleward range expansions that carry the adaptive potential of core populations, while also cautioning that trailing-edge extirpations may threaten unique evolutionary variation. This work highlights the importance of understanding how both trailing and leading edges respond to global change and extreme events.
Collapse
Affiliation(s)
- Erica S Nielsen
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
| | - Samuel Walkes
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
- Bodega Marine Laboratory, University of California Davis, Bodega Bay, California, USA
| | - Jacqueline L Sones
- Bodega Marine Reserve, University of California Davis, Bodega Bay, California, USA
| | - Phillip B Fenberg
- School of Ocean and Earth Sciences, National Oceanography Centre Southampton, University of Southampton, Southampton, UK
| | - David A Paz-García
- Laboratorio de Genética para la Conservación, Centro de Investigaciones Biológicas del Noroeste (CIBNOR), La Paz, Baja California Sur, Mexico
| | - Brenda B Cameron
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
| | - Richard K Grosberg
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
| | - Eric Sanford
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
- Bodega Marine Laboratory, University of California Davis, Bodega Bay, California, USA
| | - Rachael A Bay
- Department of Evolution and Ecology, University of California Davis, Davis, California, USA
| |
Collapse
|
3
|
Lawson DJ, Howard-McCombe J, Beaumont M, Senn H. How admixed captive breeding populations could be rescued using local ancestry information. Mol Ecol 2024:e17349. [PMID: 38634332 DOI: 10.1111/mec.17349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 12/21/2023] [Accepted: 02/26/2024] [Indexed: 04/19/2024]
Abstract
This paper asks the question: can genomic information be used to recover a species that is already on the pathway to extinction due to genetic swamping from a related and more numerous population? We show that a breeding strategy in a captive breeding program can use whole genome sequencing to identify and remove segments of DNA introgressed through hybridisation. The proposed policy uses a generalized measure of kinship or heterozygosity accounting for local ancestry, that is, whether a specific genetic location was inherited from the target of conservation. We then show that optimizing these measures would minimize undesired ancestry while also controlling kinship and/or heterozygosity, in a simulated breeding population. The process is applied to real data representing the hybridized Scottish wildcat breeding population, with the result that it should be possible to breed out domestic cat ancestry. The ability to reverse introgression is a powerful tool brought about through the combination of sequencing with computational advances in ancestry estimation. Since it works best when applied early in the process, important decisions need to be made about which genetically distinct populations should benefit from it and which should be left to reform into a single population.
Collapse
Affiliation(s)
- Daniel J Lawson
- Institute of Statistical Sciences, School of Mathematics, University of Bristol, Bristol, UK
| | - Jo Howard-McCombe
- RZSS WildGenes Laboratory, Conservation Department, Royal Zoological Society of Scotland, Edinburgh, UK
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, Bristol, UK
| | - Helen Senn
- RZSS WildGenes Laboratory, Conservation Department, Royal Zoological Society of Scotland, Edinburgh, UK
| |
Collapse
|
4
|
Naji MM, Gualdrón Duarte JL, Forneris NS, Druet T. Inbreeding depression is associated with recent homozygous-by-descent segments in Belgian Blue beef cattle. Genet Sel Evol 2024; 56:10. [PMID: 38297209 PMCID: PMC10832232 DOI: 10.1186/s12711-024-00878-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 01/19/2024] [Indexed: 02/02/2024] Open
Abstract
BACKGROUND Cattle populations harbor generally high inbreeding levels that can lead to inbreeding depression (ID). Here, we study ID with different estimators of the inbreeding coefficient F, evaluate their sensitivity to used allele frequencies (founder versus sample allele frequencies), and compare effects from recent and ancient inbreeding. METHODS We used data from 14,205 Belgian Blue beef cattle genotyped cows that were phenotyped for 11 linear classification traits. We computed estimators of F based on the pedigree information (FPED), on the correlation between uniting gametes (FUNI), on the genomic relationship matrix (FGRM), on excess homozygosity (FHET), or on homozygous-by-descent (HBD) segments (FHBD). RESULTS FUNI and FGRM were sensitive to used allele frequencies, whereas FHET and FHBD were more robust. We detected significant ID for four traits related to height and length; FHBD and FUNI presenting the strongest associations. Then, we took advantage of the classification of HBD segments in different age-related classes (the length of an HBD segment being inversely related to the number of generations to the common ancestors) to determine that recent HBD classes (common ancestors present approximately up to 15 generations in the past) presented stronger ID than more ancient HBD classes. We performed additional analyses to check whether these observations could result from a lower level of variation in ancient HBD classes, or from a reduced precision to identify these shorter segments. CONCLUSIONS Overall, our results suggest that mutational load decreases with haplotype age, and that mating plans should consider mainly the levels of recent inbreeding.
Collapse
Affiliation(s)
- Maulana Mughitz Naji
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Quartier Hôpital, Avenue de l'Hôpital, 11, 4000, Liege, Belgium.
| | - José Luis Gualdrón Duarte
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Quartier Hôpital, Avenue de l'Hôpital, 11, 4000, Liege, Belgium
- Walloon Breeders Association (awe groupe), 5590, Ciney, Belgium
| | - Natalia Soledad Forneris
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Quartier Hôpital, Avenue de l'Hôpital, 11, 4000, Liege, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Quartier Hôpital, Avenue de l'Hôpital, 11, 4000, Liege, Belgium
| |
Collapse
|
5
|
Talavera A, Palmada-Flores M, Burriel-Carranza B, Valbuena-Ureña E, Mochales-Riaño G, Adams DC, Tejero-Cicuéndez H, Soler-Membrives A, Amat F, Guinart D, Carbonell F, Obon E, Marquès-Bonet T, Carranza S. Genomic insights into the Montseny brook newt ( Calotriton arnoldi), a Critically Endangered glacial relict. iScience 2024; 27:108665. [PMID: 38226169 PMCID: PMC10788218 DOI: 10.1016/j.isci.2023.108665] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/09/2023] [Accepted: 12/05/2023] [Indexed: 01/17/2024] Open
Abstract
The Montseny brook newt (Calotriton arnoldi), considered the most endangered amphibian in Europe, is a relict salamandrid species endemic to a small massif located in northeastern Spain. Although conservation efforts should always be guided by genomic studies, those are yet scarce among urodeles, hampered by the extreme sizes of their genomes. Here, we present the third available genome assembly for the order Caudata, and the first genomic study of the species and its sister taxon, the Pyrenean brook newt (Calotriton asper), combining whole-genome and ddRADseq data. Our results reveal significant demographic oscillations which accurately mirrored Europe's climatic history. Although severe bottlenecks have led to depauperate genomic diversity and long runs of homozygosity along a gigantic genome, inbreeding might have been avoided by assortative mating strategies. Other life history traits, however, seem to have been less advantageous, and the lack of land dispersal has driven to exceptional levels of population fragmentation.
Collapse
Affiliation(s)
- Adrián Talavera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Marc Palmada-Flores
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| | - Bernat Burriel-Carranza
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- Museu de Ciències Naturals de Barcelona, Pº Picasso s/n, Parc Ciutadella, 08003 Barcelona, Spain
| | | | | | - Dean C. Adams
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, IA 50010, USA
| | - Héctor Tejero-Cicuéndez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- Department of Biodiversity, Ecology and Evolution, Faculty of Biology, Universidad Complutense de Madrid, 28040 Madrid, Spain
| | - Anna Soler-Membrives
- Departament de Biologia Animal, de Biologia Vegetal i d'Ecologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Fèlix Amat
- Àrea d’Herpetologia, BiBIO, Museu de Granollers – Ciències Naturals. Palaudàries 102, Granollers, Barcelona, Spain
| | - Daniel Guinart
- Servei de Gestió de Parcs Naturals, Diputació de Barcelona, Spain
| | - Francesc Carbonell
- Centre de fauna salvatge de Torreferrussa (Forestal Catalana, SA), Santa Perpètua de Mogoda, Spain
| | - Elena Obon
- Centre de fauna salvatge de Torreferrussa (Forestal Catalana, SA), Santa Perpètua de Mogoda, Spain
| | - Tomàs Marquès-Bonet
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain
| | - Salvador Carranza
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona, Spain
| |
Collapse
|
6
|
Ringbauer H, Huang Y, Akbari A, Mallick S, Olalde I, Patterson N, Reich D. Accurate detection of identity-by-descent segments in human ancient DNA. Nat Genet 2024; 56:143-151. [PMID: 38123640 PMCID: PMC10786714 DOI: 10.1038/s41588-023-01582-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 10/20/2023] [Indexed: 12/23/2023]
Abstract
Long DNA segments shared between two individuals, known as identity-by-descent (IBD), reveal recent genealogical connections. Here we introduce ancIBD, a method for identifying IBD segments in ancient human DNA (aDNA) using a hidden Markov model and imputed genotype probabilities. We demonstrate that ancIBD accurately identifies IBD segments >8 cM for aDNA data with an average depth of >0.25× for whole-genome sequencing or >1× for 1240k single nucleotide polymorphism capture data. Applying ancIBD to 4,248 ancient Eurasian individuals, we identify relatives up to the sixth degree and genealogical connections between archaeological groups. Notably, we reveal long IBD sharing between Corded Ware and Yamnaya groups, indicating that the Yamnaya herders of the Pontic-Caspian Steppe and the Steppe-related ancestry in various European Corded Ware groups share substantial co-ancestry within only a few hundred years. These results show that detecting IBD segments can generate powerful insights into the growing aDNA record, both on a small scale relevant to life stories and on a large scale relevant to major cultural-historical events.
Collapse
Affiliation(s)
- Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
| | - Yilei Huang
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, Universität Leipzig, Leipzig, Germany
| | - Ali Akbari
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Iñigo Olalde
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- BIOMICs Research Group, University of the Basque Country, Vitoria-Gasteiz, Spain
- Ikerbasque-Basque Foundation of Science, Bilbao, Spain
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
- Department of Genetics, Harvard Medical School, Boston, MA, USA.
- Broad Institute of Harvard and MIT, Cambridge, MA, USA.
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
7
|
Luna LW, Williams LM, Duren K, Tyl R, Toews DPL, Avery JD. Whole genome assessment of a declining game bird reveals cryptic genetic structure and insights for population management. Mol Ecol 2023; 32:5498-5513. [PMID: 37688483 DOI: 10.1111/mec.17129] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 08/23/2023] [Accepted: 08/30/2023] [Indexed: 09/11/2023]
Abstract
Population genomics applied to game species conservation can help delineate management units, ensure appropriate harvest levels and identify populations needing genetic rescue to safeguard their adaptive potential. The ruffed grouse (Bonasa umbellus) is rapidly declining in much of the eastern USA due to a combination of forest maturation and habitat fragmentation. More recently, mortality from West Nile Virus may have affected connectivity of local populations; however, genetic approaches have never explicitly investigated this issue. In this study, we sequenced 54 individual low-coverage (~5X) grouse genomes to characterize population structure, assess migration rates across the landscape to detect potential barriers to gene flow and identify genomic regions with high differentiation. We identified two genomic clusters with no clear geographic correlation, with large blocks of genomic differentiation associated with chromosomes 4 and 20, likely due to chromosomal inversions. After excluding these putative inversions from the data set, we found weak but nonsignificant signals of population subdivision. Estimated gene flow revealed reduced rates of migration in areas with extensive habitat fragmentation and increased genetic connectivity in areas with less habitat fragmentation. Our findings provide a benchmark for wildlife managers to compare and scale the genetic diversity and structure of ruffed grouse populations in Pennsylvania and across the eastern USA, and we also reveal structural variation in the grouse genome that requires further study to understand its possible effects on individual fitness and population distribution.
Collapse
Affiliation(s)
- Leilton W Luna
- Department of Ecosystem Science and Management, Penn State University, University Park, Pennsylvania, USA
| | - Lisa M Williams
- Bureau of Wildlife Management, Pennsylvania Game Commission, Harrisburg, Pennsylvania, USA
| | - Kenneth Duren
- Bureau of Wildlife Management, Pennsylvania Game Commission, Harrisburg, Pennsylvania, USA
| | - Reina Tyl
- Bureau of Wildlife Management, Pennsylvania Game Commission, Harrisburg, Pennsylvania, USA
| | - David P L Toews
- Department of Biology, Penn State University, University Park, Pennsylvania, USA
| | - Julian D Avery
- Department of Ecosystem Science and Management, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
8
|
Burnett HA, Bieker VC, Le Moullec M, Peeters B, Rosvold J, Pedersen ÅØ, Dalén L, Loe LE, Jensen H, Hansen BB, Martin MD. Contrasting genomic consequences of anthropogenic reintroduction and natural recolonization in high-arctic wild reindeer. Evol Appl 2023; 16:1531-1548. [PMID: 37752961 PMCID: PMC10519417 DOI: 10.1111/eva.13585] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 07/27/2023] [Accepted: 08/01/2023] [Indexed: 09/28/2023] Open
Abstract
Anthropogenic reintroduction can supplement natural recolonization in reestablishing a species' distribution and abundance. However, both reintroductions and recolonizations can give rise to founder effects that reduce genetic diversity and increase inbreeding, potentially causing the accumulation of genetic load and reduced fitness. Most current populations of the endemic high-arctic Svalbard reindeer (Rangifer tarandus platyrhynchus) originate from recent reintroductions or recolonizations following regional extirpations due to past overharvesting. We investigated and compared the genomic consequences of these two paths to reestablishment using whole-genome shotgun sequencing of 100 Svalbard reindeer across their range. We found little admixture between reintroduced and natural populations. Two reintroduced populations, each founded by 12 individuals around four decades (i.e. 8 reindeer generations) ago, formed two distinct genetic clusters. Compared to the source population, these populations showed only small decreases in genome-wide heterozygosity and increases in inbreeding and lengths of runs of homozygosity. In contrast, the two naturally recolonized populations without admixture possessed much lower heterozygosity, higher inbreeding and longer runs of homozygosity, possibly caused by serial population founder effects and/or fewer or more genetically related founders than in the reintroduction events. Naturally recolonized populations can thus be more vulnerable to the accumulation of genetic load than reintroduced populations. This suggests that in some organisms even small-scale reintroduction programs based on genetically diverse source populations can be more effective than natural recolonization in establishing genetically diverse populations. These findings warrant particular attention in the conservation and management of populations and species threatened by habitat fragmentation and loss.
Collapse
Affiliation(s)
- Hamish A. Burnett
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
- Department of Natural History, NTNU University MuseumNorwegian University of Science and Technology (NTNU)TrondheimNorway
| | - Vanessa C. Bieker
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
- Department of Natural History, NTNU University MuseumNorwegian University of Science and Technology (NTNU)TrondheimNorway
| | - Mathilde Le Moullec
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
| | - Bart Peeters
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
| | - Jørgen Rosvold
- Department of Terrestrial BiodiversityNorwegian Institute for Nature Research (NINA)TrondheimNorway
| | | | - Love Dalén
- Centre for PalaeogeneticsStockholmSweden
- Department of Bioinformatics and GeneticsSwedish Museum of Natural HistoryStockholmSweden
- Department of ZoologyStockholm UniversityStockholmSweden
| | - Leif Egil Loe
- Faculty of Environmental Sciences and Natural Resource ManagementNorwegian University of Life SciencesAasNorway
| | - Henrik Jensen
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
| | - Brage B. Hansen
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
- Department of Terrestrial EcologyNorwegian Institute for Nature Research (NINA)TrondheimNorway
| | - Michael D. Martin
- Centre for Biodiversity Dynamics, Department of BiologyNorwegian University of Science and Technology (NTNU)TrondheimNorway
- Department of Natural History, NTNU University MuseumNorwegian University of Science and Technology (NTNU)TrondheimNorway
| |
Collapse
|
9
|
Ringbauer H, Huang Y, Akbari A, Mallick S, Patterson N, Reich D. ancIBD - Screening for identity by descent segments in human ancient DNA. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.08.531671. [PMID: 36945531 PMCID: PMC10028887 DOI: 10.1101/2023.03.08.531671] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Long DNA sequences shared between two individuals, known as Identical by descent (IBD) segments, are a powerful signal for identifying close and distant biological relatives because they only arise when the pair shares a recent common ancestor. Existing methods to call IBD segments between present-day genomes cannot be straightforwardly applied to ancient DNA data (aDNA) due to typically low coverage and high genotyping error rates. We present ancIBD, a method to identify IBD segments for human aDNA data implemented as a Python package. Our approach is based on a Hidden Markov Model, using as input genotype probabilities imputed based on a modern reference panel of genomic variation. Through simulation and downsampling experiments, we demonstrate that ancIBD robustly identifies IBD segments longer than 8 centimorgan for aDNA data with at least either 0.25x average whole-genome sequencing (WGS) coverage depth or at least 1x average depth for in-solution enrichment experiments targeting a widely used aDNA SNP set ('1240k'). This application range allows us to screen a substantial fraction of the aDNA record for IBD segments and we showcase two downstream applications. First, leveraging the fact that biological relatives up to the sixth degree are expected to share multiple long IBD segments, we identify relatives between 10,156 ancient Eurasian individuals and document evidence of long-distance migration, for example by identifying a pair of two approximately fifth-degree relatives who were buried 1410km apart in Central Asia 5000 years ago. Second, by applying ancIBD, we reveal new details regarding the spread of ancestry related to Steppe pastoralists into Europe starting 5000 years ago. We find that the first individuals in Central and Northern Europe carrying high amounts of Steppe-ancestry, associated with the Corded Ware culture, share high rates of long IBD (12-25 cM) with Yamnaya herders of the Pontic-Caspian steppe, signaling a strong bottleneck and a recent biological connection on the order of only few hundred years, providing evidence that the Yamnaya themselves are a main source of Steppe ancestry in Corded Ware people. We also detect elevated sharing of long IBD segments between Corded Ware individuals and people associated with the Globular Amphora culture (GAC) from Poland and Ukraine, who were Copper Age farmers not yet carrying Steppe-like ancestry. These IBD links appear for all Corded Ware groups in our analysis, indicating that individuals related to GAC contexts must have had a major demographic impact early on in the genetic admixtures giving rise to various Corded Ware groups across Europe. These results show that detecting IBD segments in aDNA can generate new insights both on a small scale, relevant to understanding the life stories of people, and on the macroscale, relevant to large-scale cultural-historical events.
Collapse
Affiliation(s)
- Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Yilei Huang
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
- Bioinformatics Group, Institute of Computer Science, Universität Leipzig, Leipzig, Germanÿ
| | - Ali Akbari
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nick Patterson
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - David Reich
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
10
|
Söylev A, Çokoglu SS, Koptekin D, Alkan C, Somel M. CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data. PLoS Comput Biol 2022; 18:e1010788. [PMID: 36516232 PMCID: PMC9873172 DOI: 10.1371/journal.pcbi.1010788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Revised: 01/24/2023] [Accepted: 12/03/2022] [Indexed: 12/15/2022] Open
Abstract
To date, ancient genome analyses have been largely confined to the study of single nucleotide polymorphisms (SNPs). Copy number variants (CNVs) are a major contributor of disease and of evolutionary adaptation, but identifying CNVs in ancient shotgun-sequenced genomes is hampered by typical low genome coverage (<1×) and short fragments (<80 bps), precluding standard CNV detection software to be effectively applied to ancient genomes. Here we present CONGA, tailored for genotyping CNVs at low coverage. Simulations and down-sampling experiments suggest that CONGA can genotype deletions >1 kbps with F-scores >0.75 at ≥1×, and distinguish between heterozygous and homozygous states. We used CONGA to genotype 10,002 outgroup-ascertained deletions across a heterogenous set of 71 ancient human genomes spanning the last 50,000 years, produced using variable experimental protocols. A fraction of these (21/71) display divergent deletion profiles unrelated to their population origin, but attributable to technical factors such as coverage and read length. The majority of the sample (50/71), despite originating from nine different laboratories and having coverages ranging from 0.44×-26× (median 4×) and average read lengths 52-121 bps (median 69), exhibit coherent deletion frequencies. Across these 50 genomes, inter-individual genetic diversity measured using SNPs and CONGA-genotyped deletions are highly correlated. CONGA-genotyped deletions also display purifying selection signatures, as expected. CONGA thus paves the way for systematic CNV analyses in ancient genomes, despite the technical challenges posed by low and variable genome coverage.
Collapse
Affiliation(s)
- Arda Söylev
- Department of Computer Engineering, Konya Food and Agriculture University, Konya, Turkey
- Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University, Düsseldorf, Germany
- * E-mail: (AS); (MS)
| | | | - Dilek Koptekin
- Department of Health Informatics, Graduate School of Informatics, Middle East Technical University, Ankara, Turkey
| | - Can Alkan
- Department of Computer Engineering, Bilkent University, Ankara, Turkey
| | - Mehmet Somel
- Department of Biology, Middle East Technical University, Ankara, Turkey
- * E-mail: (AS); (MS)
| |
Collapse
|
11
|
Todd ET, Tonasso-Calvière L, Chauvey L, Schiavinato S, Fages A, Seguin-Orlando A, Clavel P, Khan N, Pérez Pardal L, Patterson Rosa L, Librado P, Ringbauer H, Verdugo M, Southon J, Aury JM, Perdereau A, Vila E, Marzullo M, Prato O, Tecchiati U, Bagnasco Gianni G, Tagliacozzo A, Tinè V, Alhaique F, Cardoso JL, Valente MJ, Telles Antunes M, Frantz L, Shapiro B, Bradley DG, Boulbes N, Gardeisen A, Horwitz LK, Öztan A, Arbuckle BS, Onar V, Clavel B, Lepetz S, Vahdati AA, Davoudi H, Mohaseb A, Mashkour M, Bouchez O, Donnadieu C, Wincker P, Brooks SA, Beja-Pereira A, Wu DD, Orlando L. The genomic history and global expansion of domestic donkeys. Science 2022; 377:1172-1180. [PMID: 36074859 DOI: 10.1126/science.abo3503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Donkeys transformed human history as essential beasts of burden for long-distance movement, especially across semi-arid and upland environments. They remain insufficiently studied despite globally expanding and providing key support to low- to middle-income communities. To elucidate their domestication history, we constructed a comprehensive genome panel of 207 modern and 31 ancient donkeys, as well as 15 wild equids. We found a strong phylogeographic structure in modern donkeys that supports a single domestication in Africa ~5000 BCE, followed by further expansions in this continent and Eurasia and ultimately returning to Africa. We uncover a previously unknown genetic lineage in the Levant ~200 BCE, which contributed increasing ancestry toward Asia. Donkey management involved inbreeding and the production of giant bloodlines at a time when mules were essential to the Roman economy and military.
Collapse
Affiliation(s)
- Evelyn T Todd
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Laure Tonasso-Calvière
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Loreleï Chauvey
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Stéphanie Schiavinato
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Antoine Fages
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Andaine Seguin-Orlando
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Pierre Clavel
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Naveed Khan
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France.,Department of Biotechnology, Abdul Wali Khan University, Mardan 23200, Pakistan
| | - Lucía Pérez Pardal
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Vairão 4485-661, Portugal.,BIOPOLIS Program in Genomics, Biodiversity and Land Planning, Campus de Vairão, Universidade do Porto, Vairão 4485-661, Portugal
| | | | - Pablo Librado
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| | - Harald Ringbauer
- Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
| | - Marta Verdugo
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin D02 PN40, Ireland
| | - John Southon
- Earth System Science Department, University of California, Irvine, CA 92697, USA
| | - Jean-Marc Aury
- Genoscope, Institut de biologie François Jacob, CEA, Université d'Evry, Université Paris-Saclay, Evry 91042, France
| | - Aude Perdereau
- Genoscope, Institut de biologie François Jacob, CEA, Université d'Evry, Université Paris-Saclay, Evry 91042, France
| | - Emmanuelle Vila
- Laboratoire Archéorient, Université Lyon 2, Lyon 69007, France
| | - Matilde Marzullo
- Dipartimento di Beni Culturali e Ambientali, Università degli Studi di Milano, Milan 20122, Italy
| | - Ornella Prato
- Dipartimento di Beni Culturali e Ambientali, Università degli Studi di Milano, Milan 20122, Italy
| | - Umberto Tecchiati
- Dipartimento di Beni Culturali e Ambientali, Università degli Studi di Milano, Milan 20122, Italy
| | - Giovanna Bagnasco Gianni
- Dipartimento di Beni Culturali e Ambientali, Università degli Studi di Milano, Milan 20122, Italy
| | | | - Vincenzo Tinè
- Soprintendenza archeologia belle arti e paesaggio per le province di Verona, Rovigo e Vicenza, Verona 37121, Italy
| | | | - João Luís Cardoso
- ICArEHB, Campus de Gambelas, University of Algarve, Faro 8005-139, Portugal.,Universidade Aberta, Lisbon 1269-001, Portugal
| | - Maria João Valente
- Faculdade de Ciências Humanas e Sociais, Centro de Estudos de Arqueologia, Artes e Ciências do Património, Universidade do Algarve, Faro 8000-117, Portugal
| | - Miguel Telles Antunes
- Centre for Research on Science and Geological Engineering, Universidade NOVA de Lisboa, Lisbon 1099-085, Portugal
| | - Laurent Frantz
- Palaeogenomics Group, Department of Veterinary Sciences, Ludwig Maximilian University, Munich 80539, Germany.,School of Biological and Behavioural Sciences, Queen Mary University of London, London E1 4DQ, United Kingdom
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA 95064, USA.,Howard Hughes Medical Institute, University of California, Santa Cruz, CA 95064, USA
| | - Daniel G Bradley
- Smurfit Institute of Genetics, Trinity College Dublin, Dublin D02 PN40, Ireland
| | - Nicolas Boulbes
- Institut de Paléontologie Humaine, Fondation Albert Ier, Paris / UMR 7194 HNHP, MNHN-CNRS-UPVD / EPCC Centre Européen de Recherche Préhistorique, Tautavel 66720, France
| | - Armelle Gardeisen
- Archéologie des Sociétés Méditéranéennes, Université Paul Valéry - Site Saint-Charles 2, Montpellier 34090, France
| | - Liora Kolska Horwitz
- National Natural History Collections, Edmond J. Safra Campus, Givat Ram, The Hebrew University, Jerusalem 9190401, Israel
| | - Aliye Öztan
- Archaeology Department, Ankara University, Ankara 06100, Turkey
| | - Benjamin S Arbuckle
- Department of Anthropology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Vedat Onar
- Osteoarchaeology Practice and Research Center and Department of Anatomy, Faculty of Veterinary Medicine, Istanbul University-Cerrahpaşa, Istanbul 34320, Turkey
| | - Benoît Clavel
- Archéozoologie, Archéobotanique, Sociétés, Pratiques et Environnements, Muséum National d'Histoire Naturelle, Paris 75005, France
| | - Sébastien Lepetz
- Archéozoologie, Archéobotanique, Sociétés, Pratiques et Environnements, Muséum National d'Histoire Naturelle, Paris 75005, France
| | - Ali Akbar Vahdati
- Provincial Office of the Iranian Center for Cultural Heritage, Handicrafts and Tourism Organisation, North Khorassan, Bojnord 9416745775, Iran
| | - Hossein Davoudi
- Archaezoology section, Bioarchaeology Laboratory of the Central Laboratory, University of Tehran, Tehran CP1417634934, Iran
| | - Azadeh Mohaseb
- Archéozoologie, Archéobotanique, Sociétés, Pratiques et Environnements, Muséum National d'Histoire Naturelle, Paris 75005, France.,Archaezoology section, Bioarchaeology Laboratory of the Central Laboratory, University of Tehran, Tehran CP1417634934, Iran
| | - Marjan Mashkour
- Archéozoologie, Archéobotanique, Sociétés, Pratiques et Environnements, Muséum National d'Histoire Naturelle, Paris 75005, France.,Archaezoology section, Bioarchaeology Laboratory of the Central Laboratory, University of Tehran, Tehran CP1417634934, Iran.,Department of Osteology, National Museum of Iran, Tehran 1136918111, Iran
| | - Olivier Bouchez
- GeT-PlaGe - Génome et Transcriptome - Plateforme Génomique, GET - Plateforme Génome & Transcriptome, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Castaneet-Tolosan Cedex 31326, France
| | - Cécile Donnadieu
- GeT-PlaGe - Génome et Transcriptome - Plateforme Génomique, GET - Plateforme Génome & Transcriptome, Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement, Castaneet-Tolosan Cedex 31326, France
| | - Patrick Wincker
- Genoscope, Institut de biologie François Jacob, CEA, Université d'Evry, Université Paris-Saclay, Evry 91042, France
| | - Samantha A Brooks
- Department of Animal Science, UF Genetics Institute, University of Florida, Gainesville, FL 32610, USA
| | - Albano Beja-Pereira
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO Laboratório Associado, Campus de Vairão, Universidade do Porto, Vairão 4485-661, Portugal.,BIOPOLIS Program in Genomics, Biodiversity and Land Planning, Campus de Vairão, Universidade do Porto, Vairão 4485-661, Portugal.,DGAOT, Faculty of Sciences, Universidade do Porto, Porto 4169-007, Portugal.,Sustainable Agrifood Production Research Centre (GreenUPorto), Universidade do Porto, Vairão 4485-646, Portugal
| | - Dong-Dong Wu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650201, China.,Kunming Natural History Museum of Zoology, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan 650223, China
| | - Ludovic Orlando
- Centre d'Anthropobiologie et de Génomique de Toulouse (CAGT), CNRS UMR 5288, Université Paul Sabatier, Toulouse 31000, France
| |
Collapse
|
12
|
Mas-Sandoval A, Pope NS, Nielsen KN, Altinkaya I, Fumagalli M, Korneliussen TS. Fast and accurate estimation of multidimensional site frequency spectra from low-coverage high-throughput sequencing data. Gigascience 2022; 11:giac032. [PMID: 35579549 PMCID: PMC9112775 DOI: 10.1093/gigascience/giac032] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 12/16/2021] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND The site frequency spectrum summarizes the distribution of allele frequencies throughout the genome, and it is widely used as a summary statistic to infer demographic parameters and to detect signals of natural selection. The use of high-throughput low-coverage DNA sequencing data can lead to biased estimates of the site frequency spectrum due to high levels of uncertainty in genotyping. RESULTS Here we design and implement a method to efficiently and accurately estimate the multidimensional joint site frequency spectrum for large numbers of haploid or diploid individuals across an arbitrary number of populations, using low-coverage sequencing data. The method maximizes a likelihood function that represents the probability of the sequencing data observed given a multidimensional site frequency spectrum using genotype likelihoods. Notably, it uses an advanced binning heuristic paired with an accelerated expectation-maximization algorithm for a fast and memory-efficient computation, and can generate both unfolded and folded spectra and bootstrapped replicates for haploid and diploid genomes. On the basis of extensive simulations, we show that the new method requires remarkably less storage and is faster than previous implementations whilst retaining the same accuracy. When applied to low-coverage sequencing data from the fungal pathogen Neonectria neomacrospora, results recapitulate the patterns of population differentiation generated using the original high-coverage data. CONCLUSION The new implementation allows for accurate estimation of population genetic parameters from arbitrarily large, low-coverage datasets, thus facilitating cost-effective sequencing experiments in model and non-model organisms.
Collapse
Affiliation(s)
- Alex Mas-Sandoval
- Department of Life Sciences, Silwood Park campus, Imperial College London, SL5 7PY, Ascot, UK
| | - Nathaniel S Pope
- Department of Entomology, The Pennsylvania State University, 201 Old Main, University Park, PA 16802, USA
| | - Knud Nor Nielsen
- Department of Plant and Environmental Sciences, University of Copenhagen, Thorvaldsensvej 40, 1871 Frederiksberg C, Denmark
| | - Isin Altinkaya
- GLOBE, Section for Geogenetics, Øster Voldgade 5-7, 1350, Copenhagen, Denmark
| | - Matteo Fumagalli
- Department of Life Sciences, Silwood Park campus, Imperial College London, SL5 7PY, Ascot, UK
- School of Biological and Behavioural Sciences, Queen Mary University of London, London, UK
| | | |
Collapse
|
13
|
Druet T, Gautier M. An hidden Markov model to estimate homozygous-by-descent probabilities associated with nested layers of ancestors. Theor Popul Biol 2022; 145:38-51. [DOI: 10.1016/j.tpb.2022.03.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2021] [Revised: 03/02/2022] [Accepted: 03/03/2022] [Indexed: 10/18/2022]
|
14
|
Bazzicalupo E, Lucena‐Perez M, Kleinman‐Ruiz D, Pavlov A, Trajçe A, Hoxha B, Sanaja B, Gurielidze Z, Kerdikoshvili N, Mamuchadze J, Yarovenko YA, Akkiev MI, Ratkiewicz M, Saveljev AP, Melovski D, Gavashelishvili A, Schmidt K, Godoy JA. History, demography and genetic status of Balkan and Caucasian
Lynx lynx
(Linnaeus, 1758) populations revealed by genome‐wide variation. DIVERS DISTRIB 2021. [DOI: 10.1111/ddi.13439] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Affiliation(s)
- Enrico Bazzicalupo
- Department of Integrative Ecology Estación Biológica de Doñana (CSIC) Seville Spain
| | - Maria Lucena‐Perez
- Department of Integrative Ecology Estación Biológica de Doñana (CSIC) Seville Spain
| | - Daniel Kleinman‐Ruiz
- Department of Integrative Ecology Estación Biológica de Doñana (CSIC) Seville Spain
| | | | - Aleksandër Trajçe
- Protection and Preservation of Natural Environment in Albania (PPNEA) Tirana Albania
| | - Bledi Hoxha
- Protection and Preservation of Natural Environment in Albania (PPNEA) Tirana Albania
| | - Bardh Sanaja
- Environmentally Responsible Action (ERA) Peje Montenegro
| | - Zurab Gurielidze
- Program for the Ecology and Conservation of Large Mammals Institute of Ecology Ilia State University Tbilisi Georgia
| | - Niko Kerdikoshvili
- Program for the Ecology and Conservation of Large Mammals Institute of Ecology Ilia State University Tbilisi Georgia
| | | | - Yuriy A. Yarovenko
- Pre‐Caspian Institute of Biological Resources Dagestan Federal Scientific Centre of RAS Makhachkala Russian Federation
| | - Muzigit I. Akkiev
- Kabardino‐Balkarian State High‐Mountain Nature Reserve Kashkhatau Russian Federation
| | | | - Alexander P. Saveljev
- Russian Research Institute of Game Management and Fur Farming Kirov Russian Federation
| | - Dime Melovski
- Macedonian Ecological Society (MES) Skopje Macedonia
- Wildlife Sciences Georg‐August University Göttingen Germany
| | | | - Krzysztof Schmidt
- Mammal Research Institute Polish Academy of Sciences Białowieża Poland
| | - José A. Godoy
- Department of Integrative Ecology Estación Biológica de Doñana (CSIC) Seville Spain
| |
Collapse
|
15
|
Parental relatedness through time revealed by runs of homozygosity in ancient DNA. Nat Commun 2021; 12:5425. [PMID: 34521843 PMCID: PMC8440622 DOI: 10.1038/s41467-021-25289-w] [Citation(s) in RCA: 81] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 07/21/2021] [Indexed: 02/08/2023] Open
Abstract
Parental relatedness of present-day humans varies substantially across the globe, but little is known about the past. Here we analyze ancient DNA, leveraging that parental relatedness leaves genomic traces in the form of runs of homozygosity. We present an approach to identify such runs in low-coverage ancient DNA data aided by haplotype information from a modern phased reference panel. Simulation and experiments show that this method robustly detects runs of homozygosity longer than 4 centimorgan for ancient individuals with at least 0.3 × coverage. Analyzing genomic data from 1,785 ancient humans who lived in the last 45,000 years, we detect low rates of first cousin or closer unions across most ancient populations. Moreover, we find a marked decay in background parental relatedness co-occurring with or shortly after the advent of sedentary agriculture. We observe this signal, likely linked to increasing local population sizes, across several geographic transects worldwide.
Collapse
|
16
|
Lou RN, Jacobs A, Wilder A, Therkildsen NO. A beginner's guide to low-coverage whole genome sequencing for population genomics. Mol Ecol 2021; 30:5966-5993. [PMID: 34250668 DOI: 10.1111/mec.16077] [Citation(s) in RCA: 93] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 06/30/2021] [Accepted: 07/01/2021] [Indexed: 11/26/2022]
Abstract
Low-coverage whole genome sequencing (lcWGS) has emerged as a powerful and cost-effective approach for population genomic studies in both model and non-model species. However, with read depths too low to confidently call individual genotypes, lcWGS requires specialized analysis tools that explicitly account for genotype uncertainty. A growing number of such tools have become available, but it can be difficult to get an overview of what types of analyses can be performed reliably with lcWGS data, and how the distribution of sequencing effort between the number of samples analyzed and per-sample sequencing depths affects inference accuracy. In this introductory guide to lcWGS, we first illustrate how the per-sample cost for lcWGS is now comparable to RAD-seq and Pool-seq in many systems. We then provide an overview of software packages that explicitly account for genotype uncertainty in different types of population genomic inference. Next, we use both simulated and empirical data to assess the accuracy of allele frequency and genetic diversity estimation, detection of population structure, and selection scans under different sequencing strategies. Our results show that spreading a given amount of sequencing effort across more samples with lower depth per sample consistently improves the accuracy of most types of inference, with a few notable exceptions. Finally, we assess the potential for using imputation to bolster inference from lcWGS data in non-model species, and discuss current limitations and future perspectives for lcWGS-based population genomics research. With this overview, we hope to make lcWGS more approachable and stimulate its broader adoption.
Collapse
Affiliation(s)
- Runyang Nicolas Lou
- Department of Natural Resources and the Environment, Cornell University, Ithaca, NY, 14853, USA
| | - Arne Jacobs
- Department of Natural Resources and the Environment, Cornell University, Ithaca, NY, 14853, USA.,Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Glasgow, G12 8QQ, UK
| | - Aryn Wilder
- San Diego Zoo Wildlife Alliance, Escondido, CA, 92027, USA
| | - Nina O Therkildsen
- Department of Natural Resources and the Environment, Cornell University, Ithaca, NY, 14853, USA
| |
Collapse
|
17
|
de Jager D, Glanzmann B, Möller M, Hoal E, van Helden P, Harper C, Bloomer P. High diversity, inbreeding and a dynamic Pleistocene demographic history revealed by African buffalo genomes. Sci Rep 2021; 11:4540. [PMID: 33633171 PMCID: PMC7907399 DOI: 10.1038/s41598-021-83823-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 02/04/2021] [Indexed: 12/30/2022] Open
Abstract
Genomes retain records of demographic changes and evolutionary forces that shape species and populations. Remnant populations of African buffalo (Syncerus caffer) in South Africa, with varied histories, provide an opportunity to investigate signatures left in their genomes by past events, both recent and ancient. Here, we produce 40 low coverage (7.14×) genome sequences of Cape buffalo (S. c. caffer) from four protected areas in South Africa. Genome-wide heterozygosity was the highest for any mammal for which these data are available, while differences in individual inbreeding coefficients reflected the severity of historical bottlenecks and current census sizes in each population. PSMC analysis revealed multiple changes in Ne between approximately one million and 20 thousand years ago, corresponding to paleoclimatic changes and Cape buffalo colonisation of southern Africa. The results of this study have implications for buffalo management and conservation, particularly in the context of the predicted increase in aridity and temperature in southern Africa over the next century as a result of climate change.
Collapse
Affiliation(s)
- Deon de Jager
- Molecular Ecology and Evolution Programme, Department of Biochemistry, Genetics and Microbiology, Faculty of Natural and Agricultural Sciences, University of Pretoria, Pretoria, South Africa.
| | - Brigitte Glanzmann
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Marlo Möller
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Eileen Hoal
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Paul van Helden
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa.,Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Cindy Harper
- Veterinary Genetics Laboratory, Faculty of Veterinary Science, University of Pretoria, Pretoria, South Africa
| | - Paulette Bloomer
- Molecular Ecology and Evolution Programme, Department of Biochemistry, Genetics and Microbiology, Faculty of Natural and Agricultural Sciences, University of Pretoria, Pretoria, South Africa
| |
Collapse
|
18
|
Heller R, Nursyifa C, Garcia-Erill G, Salmona J, Chikhi L, Meisner J, Korneliussen TS, Albrechtsen A. A reference-free approach to analyse RADseq data using standard next generation sequencing toolkits. Mol Ecol Resour 2021; 21:1085-1097. [PMID: 33434329 DOI: 10.1111/1755-0998.13324] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 12/18/2020] [Accepted: 01/05/2021] [Indexed: 12/29/2022]
Abstract
Genotyping-by-sequencing methods such as RADseq are popular for generating genomic and population-scale data sets from a diverse range of organisms. These often lack a usable reference genome, restricting users to RADseq specific software for processing. However, these come with limitations compared to generic next generation sequencing (NGS) toolkits. Here, we describe and test a simple pipeline for reference-free RADseq data processing that blends de novo elements from STACKS with the full suite of state-of-the art NGS tools. Specifically, we use the de novo RADseq assembly employed by STACKS to create a catalogue of RAD loci that serves as a reference for read mapping, variant calling and site filters. Using RADseq data from 28 zebra sequenced to ~8x depth-of-coverage we evaluate our approach by comparing the site frequency spectra (SFS) to those from alternative pipelines. Most pipelines yielded similar SFS at 8x depth, but only a genotype likelihood based pipeline performed similarly at low sequencing depth (2-4x). We compared the RADseq SFS with medium-depth (~13x) shotgun sequencing of eight overlapping samples, revealing that the RADseq SFS was persistently slightly skewed towards rare and invariant alleles. Using simulations and human data we confirm that this is expected when there is allelic dropout (AD) in the RADseq data. AD in the RADseq data caused a heterozygosity deficit of ~16%, which dropped to ~5% after filtering AD. Hence, AD was the most important source of bias in our RADseq data.
Collapse
Affiliation(s)
- Rasmus Heller
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Casia Nursyifa
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Genís Garcia-Erill
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Jordi Salmona
- CNRS, Université Paul Sabatier, ENFA, UMR 5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France
| | - Lounes Chikhi
- CNRS, Université Paul Sabatier, ENFA, UMR 5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France.,Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | - Jonas Meisner
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | | | - Anders Albrechtsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| |
Collapse
|
19
|
Beyond broad strokes: sociocultural insights from the study of ancient genomes. Nat Rev Genet 2020; 21:355-366. [DOI: 10.1038/s41576-020-0218-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/04/2020] [Indexed: 01/01/2023]
|
20
|
Origin and Evolution of Deleterious Mutations in Horses. Genes (Basel) 2019; 10:genes10090649. [PMID: 31466279 PMCID: PMC6769756 DOI: 10.3390/genes10090649] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 08/24/2019] [Accepted: 08/26/2019] [Indexed: 01/04/2023] Open
Abstract
Domestication has changed the natural evolutionary trajectory of horses by favoring the reproduction of a limited number of animals showing traits of interest. Reduced breeding stocks hampered the elimination of deleterious variants by means of negative selection, ultimately inflating mutational loads. However, ancient genomics revealed that mutational loads remained steady during most of the domestication history until a sudden burst took place some 250 years ago. To identify the factors underlying this trajectory, we gather an extensive dataset consisting of 175 modern and 153 ancient genomes previously published, and carry out the most comprehensive characterization of deleterious mutations in horses. We confirm that deleterious variants segregated at low frequencies during the last 3500 years, and only spread and incremented their occurrence in the homozygous state during modern times, owing to inbreeding. This independently happened in multiple breeds, following both the development of closed studs and purebred lines, and the deprecation of horsepower in the 20th century, which brought many draft breeds close to extinction. Our work illustrates the paradoxical effect of some conservation and improvement programs, which reduced the overall genomic fitness and viability.
Collapse
|
21
|
Renaud G, Hanghøj K, Korneliussen TS, Willerslev E, Orlando L. Joint Estimates of Heterozygosity and Runs of Homozygosity for Modern and Ancient Samples. Genetics 2019; 212:587-614. [PMID: 31088861 PMCID: PMC6614887 DOI: 10.1534/genetics.119.302057] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 05/01/2019] [Indexed: 11/18/2022] Open
Abstract
Both the total amount and the distribution of heterozygous sites within individual genomes are informative about the genetic diversity of the population they belong to. Detecting true heterozygous sites in ancient genomes is complicated by the generally limited coverage achieved and the presence of post-mortem damage inflating sequencing errors. Additionally, large runs of homozygosity found in the genomes of particularly inbred individuals and of domestic animals can skew estimates of genome-wide heterozygosity rates. Current computational tools aimed at estimating runs of homozygosity and genome-wide heterozygosity levels are generally sensitive to such limitations. Here, we introduce ROHan, a probabilistic method which substantially improves the estimate of heterozygosity rates both genome-wide and for genomic local windows. It combines a local Bayesian model and a Hidden Markov Model at the genome-wide level and can work both on modern and ancient samples. We show that our algorithm outperforms currently available methods for predicting heterozygosity rates for ancient samples. Specifically, ROHan can delineate large runs of homozygosity (at megabase scales) and produce a reliable confidence interval for the genome-wide rate of heterozygosity outside of such regions from modern genomes with a depth of coverage as low as 5-6× and down to 7-8× for ancient samples showing moderate DNA damage. We apply ROHan to a series of modern and ancient genomes previously published and revise available estimates of heterozygosity for humans, chimpanzees and horses.
Collapse
Affiliation(s)
- Gabriel Renaud
- Lundbeck Foundation GeoGenetics Center, Globe Institute, University of Copenhagen, 1350K, Denmark
| | - Kristian Hanghøj
- Lundbeck Foundation GeoGenetics Center, Globe Institute, University of Copenhagen, 1350K, Denmark
- Laboratoire d'Anthropobiologie Moléculaire et d'Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, 31000, France
| | | | - Eske Willerslev
- Lundbeck Foundation GeoGenetics Center, Globe Institute, University of Copenhagen, 1350K, Denmark
- Department of Zoology, University of Cambridge, CB2 3EJ, UK
- The Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
- The Danish Institute for Advanced Study at The University of Southern Denmark, DK-5230 Odense M, Denmark
| | - Ludovic Orlando
- Lundbeck Foundation GeoGenetics Center, Globe Institute, University of Copenhagen, 1350K, Denmark
- Laboratoire d'Anthropobiologie Moléculaire et d'Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, 31000, France
| |
Collapse
|
22
|
Wang B, Mojica JP, Perera N, Lee CR, Lovell JT, Sharma A, Adam C, Lipzen A, Barry K, Rokhsar DS, Schmutz J, Mitchell-Olds T. Ancient polymorphisms contribute to genome-wide variation by long-term balancing selection and divergent sorting in Boechera stricta. Genome Biol 2019; 20:126. [PMID: 31227026 PMCID: PMC6587263 DOI: 10.1186/s13059-019-1729-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2018] [Accepted: 06/04/2019] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Genomic variation is widespread, and both neutral and selective processes can generate similar patterns in the genome. These processes are not mutually exclusive, so it is difficult to infer the evolutionary mechanisms that govern population and species divergence. Boechera stricta is a perennial relative of Arabidopsis thaliana native to largely undisturbed habitats with two geographic and ecologically divergent subspecies. Here, we delineate the evolutionary processes driving the genetic diversity and population differentiation in this species. RESULTS Using whole-genome re-sequencing data from 517 B. stricta accessions, we identify four genetic groups that diverged around 30-180 thousand years ago, with long-term small effective population sizes and recent population expansion after the Last Glacial Maximum. We find three genomic regions with elevated nucleotide diversity, totaling about 10% of the genome. These three regions of elevated nucleotide diversity show excess of intermediate-frequency alleles, higher absolute divergence (dXY), and lower relative divergence (FST) than genomic background, and significant enrichment in immune-related genes, reflecting long-term balancing selection. Scattered across the genome, we also find regions with both high FST and dXY among the groups, termed FST-islands. Population genetic signatures indicate that FST-islands with elevated divergence, which have experienced directional selection, are derived from divergent sorting of ancient polymorphisms. CONCLUSIONS Our results suggest that long-term balancing selection on disease resistance genes may have maintained ancestral haplotypes across different geographical lineages, and unequal sorting of balanced polymorphisms may have generated genomic regions with elevated divergence. This study highlights the importance of ancestral balanced polymorphisms as crucial components of genome-wide variation.
Collapse
Affiliation(s)
- Baosheng Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou, 510650, China.
- Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA.
| | - Julius P Mojica
- Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA
| | - Nadeesha Perera
- Department of Biology, Duke University, Box 90338, Durham, NC, 27708, USA
| | - Cheng-Ruei Lee
- Institute of Ecology and Evolutionary Biology and Institute of Plant Biology, National Taiwan University, Taipei, 10617, Taiwan, ROC
| | - John T Lovell
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
| | - Aditi Sharma
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Catherine Adam
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Anna Lipzen
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Kerrie Barry
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Daniel S Rokhsar
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | - Jeremy Schmutz
- HudsonAlpha Institute for Biotechnology, Huntsville, AL, 35806, USA
- Department of Energy Joint Genome Institute, Walnut Creek, CA, 94598, USA
| | | |
Collapse
|
23
|
White LC, Fontsere C, Lizano E, Hughes DA, Angedakin S, Arandjelovic M, Granjon AC, Hans JB, Lester JD, Rabanus-Wallace MT, Rowney C, Städele V, Marques-Bonet T, Langergraber KE, Vigilant L. A roadmap for high-throughput sequencing studies of wild animal populations using noninvasive samples and hybridization capture. Mol Ecol Resour 2019; 19:609-622. [PMID: 30637963 DOI: 10.1111/1755-0998.12993] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Revised: 12/14/2018] [Accepted: 01/08/2019] [Indexed: 11/30/2022]
Abstract
Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although noninvasive samples, such as faeces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help to overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) faecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2%-3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus, our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.
Collapse
Affiliation(s)
- Lauren C White
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Claudia Fontsere
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Barcelona, Spain
| | - Esther Lizano
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Barcelona, Spain
| | - David A Hughes
- MRC Integrative Epidemiology Unit at University of Bristol, Bristol, UK.,Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Samuel Angedakin
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Mimi Arandjelovic
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Anne-Céline Granjon
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Jörg B Hans
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Jack D Lester
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | | | - Carolyn Rowney
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Veronika Städele
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | - Tomas Marques-Bonet
- Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Barcelona, Spain.,Catalan Institution of Research and Advanced Studies (ICREA), Barcelona, Spain.,CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain.,Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Kevin E Langergraber
- School of Human Evolution and Social Change, Arizona State University, Tempe, Arizona.,Institute of Human Origins, Arizona State University, Tempe, Arizona
| | - Linda Vigilant
- Department of Primatology, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| |
Collapse
|
24
|
Nandolo W, Mészáros G, Banda LJ, Gondwe TN, Lamuno D, Mulindwa HA, Nakimbugwe HN, Wurzinger M, Utsunomiya YT, Woodward-Greene MJ, Liu M, Liu G, Van Tassell CP, Curik I, Rosen BD, Sölkner J. Timing and Extent of Inbreeding in African Goats. Front Genet 2019; 10:537. [PMID: 31214253 PMCID: PMC6558083 DOI: 10.3389/fgene.2019.00537] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 05/17/2019] [Indexed: 11/13/2022] Open
Abstract
Genetic characterization of African goats is one of the current priorities in the improvement of goats in the continent. This study contributes to the characterization effort by determining the levels and number of generations to common ancestors ("age") associated with inbreeding in African goat breeds and identifies regions that contain copy number variation mistyped as being homozygous. Illumina 50k single nucleotide polymorphism genotype data for 608 goats from 31 breeds were used to compute the level and age of inbreeding at both local (marker) and global levels (FG) using a model-based approach based on a hidden Markov model. Runs of homozygosity (ROH) segments detected using the Viterbi algorithm led to ROH-based inbreeding coefficients for all ROH (FROH) and for ROH longer than 2 Mb (FROH > 2Mb). Some of the genomic regions identified as having ROH are likely to be hemizygous regions (copy number deletions) mistyped as homozygous regions. Although the proportion of these miscalled ROH is small and does not substantially affect estimates of levels of inbreeding for individual animals, the inbreeding metrics were adjusted by removing these regions from the ROH. All the inbreeding metrics varied widely across breeds, with overall means of 0.0408, 0.0370, and 0.0691 and medians of 0.0125, 0.0098, and 0.0366 for FROH, FROH > 2Mb, and FG, respectively. Several breeds (including Menabe and Sofia from Madagascar) had high proportions of recent inbreeding, while Small East African, Ethiopian, and most of the West African breeds (including West African Dwarf) had more ancient inbreeding.
Collapse
Affiliation(s)
- Wilson Nandolo
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences, Vienna, Vienna, Austria.,Department of Animal Science, Faculty of Agriculture, Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi.,Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - Gábor Mészáros
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences, Vienna, Vienna, Austria
| | - Liveness Jessica Banda
- Department of Animal Science, Faculty of Agriculture, Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi
| | - Timothy N Gondwe
- Department of Animal Science, Faculty of Agriculture, Lilongwe University of Agriculture and Natural Resources, Lilongwe, Malawi
| | - Doreen Lamuno
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences, Vienna, Vienna, Austria
| | | | | | - Maria Wurzinger
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences, Vienna, Vienna, Austria
| | - Yuri T Utsunomiya
- School of Agricultural and Veterinarian Sciences, Jaboticabal, São Paulo State University (UNESP), São Paulo, Brazil
| | - M Jennifer Woodward-Greene
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - Mei Liu
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - George Liu
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - Curtis P Van Tassell
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - Ino Curik
- Department of Animal Science, Faculty of Agriculture, University of Zagreb, Zagreb, Croatia
| | - Benjamin D Rosen
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agriculture Research Service, Beltsville, MD, United States
| | - Johann Sölkner
- Division of Livestock Sciences, Department of Sustainable Agricultural Systems, University of Natural Resources and Life Sciences, Vienna, Vienna, Austria
| |
Collapse
|
25
|
Bertrand AR, Kadri NK, Flori L, Gautier M, Druet T. RZooRoH: An R package to characterize individual genomic autozygosity and identify homozygous‐by‐descent segments. Methods Ecol Evol 2019. [DOI: 10.1111/2041-210x.13167] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Amandine R. Bertrand
- Unit of Animal Genomics, GIGA‐R & Faculty of Veterinary MedicineUniversity of Liège Liège Belgium
| | - Naveen K. Kadri
- Unit of Animal Genomics, GIGA‐R & Faculty of Veterinary MedicineUniversity of Liège Liège Belgium
| | - Laurence Flori
- SELMET, INRA, CIRAD, Montpellier SupagroUniversity of Montpellier Montpellier France
| | - Mathieu Gautier
- INRA, UMR CBGP (INRA – IRD – Cirad – Montpellier SupAgro) Montferrier‐sur‐Lez France
| | - Tom Druet
- Unit of Animal Genomics, GIGA‐R & Faculty of Veterinary MedicineUniversity of Liège Liège Belgium
| |
Collapse
|
26
|
Abstract
"Conservation genomics" encompasses the idea that genome-scale data will improve the capacity of resource managers to protect species. Although genetic approaches have long been used in conservation research, it has only recently become tractable to generate genome-wide data at a scale that is useful for conservation. In this Review, we discuss how genome-scale data can inform species delineation in the face of admixture, facilitate evolution through the identification of adaptive alleles, and enhance evolutionary rescue based on genomic patterns of inbreeding. As genomic approaches become more widely adopted in conservation, we expect that they will have a positive impact on management and policy decisions.
Collapse
Affiliation(s)
- Megan A Supple
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, 95060, USA.
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, CA, 95060, USA.
- UCSC Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, 95060, USA.
| |
Collapse
|
27
|
Carroll EL, Bruford MW, DeWoody JA, Leroy G, Strand A, Waits L, Wang J. Genetic and genomic monitoring with minimally invasive sampling methods. Evol Appl 2018; 11:1094-1119. [PMID: 30026800 PMCID: PMC6050181 DOI: 10.1111/eva.12600] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 01/02/2018] [Indexed: 12/12/2022] Open
Abstract
The decreasing cost and increasing scope and power of emerging genomic technologies are reshaping the field of molecular ecology. However, many modern genomic approaches (e.g., RAD-seq) require large amounts of high-quality template DNA. This poses a problem for an active branch of conservation biology: genetic monitoring using minimally invasive sampling (MIS) methods. Without handling or even observing an animal, MIS methods (e.g., collection of hair, skin, faeces) can provide genetic information on individuals or populations. Such samples typically yield low-quality and/or quantities of DNA, restricting the type of molecular methods that can be used. Despite this limitation, genetic monitoring using MIS is an effective tool for estimating population demographic parameters and monitoring genetic diversity in natural populations. Genetic monitoring is likely to become more important in the future as many natural populations are undergoing anthropogenically driven declines, which are unlikely to abate without intensive adaptive management efforts that often include MIS approaches. Here, we profile the expanding suite of genomic methods and platforms compatible with producing genotypes from MIS, considering factors such as development costs and error rates. We evaluate how powerful new approaches will enhance our ability to investigate questions typically answered using genetic monitoring, such as estimating abundance, genetic structure and relatedness. As the field is in a period of unusually rapid transition, we also highlight the importance of legacy data sets and recommend how to address the challenges of moving between traditional and next-generation genetic monitoring platforms. Finally, we consider how genetic monitoring could move beyond genotypes in the future. For example, assessing microbiomes or epigenetic markers could provide a greater understanding of the relationship between individuals and their environment.
Collapse
Affiliation(s)
- Emma L. Carroll
- Scottish Oceans Institute and Sea Mammal Research UnitUniversity of St AndrewsSt AndrewsUK
| | - Mike W. Bruford
- Cardiff School of Biosciences and Sustainable Places Research InstituteCardiff UniversityCardiff, WalesUK
| | - J. Andrew DeWoody
- Department of Forestry and Natural Resources and Department of Biological SciencesPurdue UniversityWest LafayetteINUSA
| | - Gregoire Leroy
- Animal Production and Health DivisionFood and Agriculture Organization of the United NationsRomeItaly
| | - Alan Strand
- Grice Marine LaboratoryDepartment of BiologyCollege of CharlestonCharlestonSCUSA
| | - Lisette Waits
- Department of Fish and Wildlife SciencesUniversity of IdahoMoscowIDUSA
| | - Jinliang Wang
- Institute of ZoologyZoological Society of LondonLondonUK
| |
Collapse
|
28
|
Monroy Kuhn JM, Jakobsson M, Günther T. Estimating genetic kin relationships in prehistoric populations. PLoS One 2018; 13:e0195491. [PMID: 29684051 PMCID: PMC5912749 DOI: 10.1371/journal.pone.0195491] [Citation(s) in RCA: 151] [Impact Index Per Article: 25.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2017] [Accepted: 03/23/2018] [Indexed: 12/21/2022] Open
Abstract
Archaeogenomic research has proven to be a valuable tool to trace migrations of historic and prehistoric individuals and groups, whereas relationships within a group or burial site have not been investigated to a large extent. Knowing the genetic kinship of historic and prehistoric individuals would give important insights into social structures of ancient and historic cultures. Most archaeogenetic research concerning kinship has been restricted to uniparental markers, while studies using genome-wide information were mainly focused on comparisons between populations. Applications which infer the degree of relationship based on modern-day DNA information typically require diploid genotype data. Low concentration of endogenous DNA, fragmentation and other post-mortem damage to ancient DNA (aDNA) makes the application of such tools unfeasible for most archaeological samples. To infer family relationships for degraded samples, we developed the software READ (Relationship Estimation from Ancient DNA). We show that our heuristic approach can successfully infer up to second degree relationships with as little as 0.1x shotgun coverage per genome for pairs of individuals. We uncover previously unknown relationships among prehistoric individuals by applying READ to published aDNA data from several human remains excavated from different cultural contexts. In particular, we find a group of five closely related males from the same Corded Ware culture site in modern-day Germany, suggesting patrilocality, which highlights the possibility to uncover social structures of ancient populations by applying READ to genome-wide aDNA data. READ is publicly available from https://bitbucket.org/tguenther/read.
Collapse
Affiliation(s)
- Jose Manuel Monroy Kuhn
- Uppsala University, Evolutionary Biology Centre, Department of Organismal Biology, Norbyvägen 18C, SE-752 36 Uppsala, Sweden
| | - Mattias Jakobsson
- Uppsala University, Evolutionary Biology Centre, Department of Organismal Biology, Norbyvägen 18C, SE-752 36 Uppsala, Sweden
- Uppsala University, SciLifeLab, Norbyvägen 18C, SE-752 36 Uppsala, Sweden
- * E-mail: (MJ); (TG)
| | - Torsten Günther
- Uppsala University, Evolutionary Biology Centre, Department of Organismal Biology, Norbyvägen 18C, SE-752 36 Uppsala, Sweden
- * E-mail: (MJ); (TG)
| |
Collapse
|
29
|
Herzig AF, Nutile T, Babron MC, Ciullo M, Bellenguez C, Leutenegger AL. Strategies for phasing and imputation in a population isolate. Genet Epidemiol 2018; 42:201-213. [PMID: 29319195 DOI: 10.1002/gepi.22109] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Revised: 11/16/2017] [Accepted: 11/16/2017] [Indexed: 11/05/2022]
Abstract
In the search for genetic associations with complex traits, population isolates offer the advantage of reduced genetic and environmental heterogeneity. In addition, cost-efficient next-generation association approaches have been proposed in these populations where only a subsample of representative individuals is sequenced and then genotypes are imputed into the rest of the population. Gene mapping in such populations thus requires high-quality genetic imputation and preliminary phasing. To identify an effective study design, we compare by simulation a range of phasing and imputation software and strategies. We simulated 1,115,604 variants on chromosome 10 for 477 members of the large complex pedigree of Campora, a village within the established isolate of Cilento in southern Italy. We assessed the phasing performance of identical by descent based software ALPHAPHASE and SLRP, LD-based software SHAPEIT2, SHAPEIT3, and BEAGLE, and new software EAGLE that combines both methodologies. For imputation we compared IMPUTE2, IMPUTE4, MINIMAC3, BEAGLE, and new software PBWT. Genotyping errors and missing genotypes were simulated to observe their effects on the performance of each software. Highly accurate phased data were achieved by all software with SHAPEIT2, SHAPEIT3, and EAGLE2 providing the most accurate results. MINIMAC3, IMPUTE4, and IMPUTE2 all performed strongly as imputation software and our study highlights the considerable gain in imputation accuracy provided by a genome sequenced reference panel specific to the population isolate.
Collapse
Affiliation(s)
- Anthony Francis Herzig
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| | - Teresa Nutile
- Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy
| | - Marie-Claude Babron
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| | - Marina Ciullo
- Institute of Genetics and Biophysics A. Buzzati-Traverso-CNR, Naples, Italy.,IRCCS Neuromed, Pozzilli, Isernia, Italy
| | - Céline Bellenguez
- Inserm, U1167, RID-AGE-Risk Factors and Molecular Determinants of Aging-Related Diseases, Lille, France.,Institut Pasteur de Lille, Lille, France.,Université de Lille, U1167-Excellence Laboratory LabEx DISTALZ, Lille, France
| | - Anne-Louise Leutenegger
- Université Paris-Diderot, Sorbonne Paris Cité, U946, Paris, France.,Inserm, U946, Genetic Variation and Human Diseases, Paris, France
| |
Collapse
|
30
|
Solé M, Gori AS, Faux P, Bertrand A, Farnir F, Gautier M, Druet T. Age-based partitioning of individual genomic inbreeding levels in Belgian Blue cattle. Genet Sel Evol 2017; 49:92. [PMID: 29273000 PMCID: PMC5741860 DOI: 10.1186/s12711-017-0370-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2017] [Accepted: 12/13/2017] [Indexed: 11/21/2022] Open
Abstract
Background
Inbreeding coefficients can be estimated either from pedigree data or from genomic data, and with genomic data, they are either global or local (when the linkage map is used). Recently, we developed a new hidden Markov model (HMM) that estimates probabilities of homozygosity-by-descent (HBD) at each marker position and automatically partitions autozygosity in multiple age-related classes (based on the length of HBD segments). Our objectives were to: (1) characterize inbreeding with our model in an intensively selected population such as the Belgian Blue Beef (BBB) cattle breed; (2) compare the properties of the model at different marker densities; and (3) compare our model with other methods.
Results When using 600 K single nucleotide polymorphisms (SNPs), the inbreeding coefficient (probability of sampling an HBD locus in an individual) was on average 0.303 (ranging from 0.258 to 0.375). HBD-classes associated to historical ancestors (with small segments ≤ 200 kb) accounted for 21.6% of the genome length (71.4% of the total length of the genome in HBD segments), whereas classes associated to more recent ancestors accounted for only 22.6% of the total length of the genome in HBD segments. However, these recent classes presented more individual variation than more ancient classes. Although inbreeding coefficients obtained with low SNP densities (7 and 32 K) were much lower (0.060 and 0.093), they were highly correlated with those obtained at higher density (r = 0.934 and 0.975, respectively), indicating that they captured most of the individual variation. At higher SNP density, smaller HBD segments are identified and, thus, more past generations can be explored. We observed very high correlations between our estimates and those based on homozygosity (r = 0.95) or on runs-of-homozygosity (r = 0.95). As expected, pedigree-based estimates were mainly correlated with recent HBD-classes (r = 0.56). Conclusions Although we observed high levels of autozygosity associated with small HBD segments in BBB cattle, recent inbreeding accounted for most of the individual variation. Recent autozygosity can be captured efficiently with low-density SNP arrays and relatively simple models (e.g., two HBD classes). The HMM framework provides local HBD probabilities that are still useful at lower SNP densities. Electronic supplementary material The online version of this article (10.1186/s12711-017-0370-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marina Solé
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, B34 (+1) Avenue de l'Hôpital 1, 4000, Liège, Belgium.
| | - Ann-Stephan Gori
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, B34 (+1) Avenue de l'Hôpital 1, 4000, Liège, Belgium.,Awé Coopérative (Association Wallonne de l'Élevage) - Recherche et Développement, Rue des Champs Elysées 4, 5590, Ciney, Belgium
| | - Pierre Faux
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, B34 (+1) Avenue de l'Hôpital 1, 4000, Liège, Belgium
| | - Amandine Bertrand
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, B34 (+1) Avenue de l'Hôpital 1, 4000, Liège, Belgium
| | - Frédéric Farnir
- BBASV, FARAH-PAD & Faculty of Veterinary Medicine, University of Liège, Quartier Vallée 2, Avenue de Cureghem, (B43 +3), 4000, Liège, Belgium
| | - Mathieu Gautier
- INRA, UMR CBGP (Centre de Biologie pour la Gestion des Populations), Campus International de Baillarguet, 34988, Montferrier sur Lez, France.,IBD (Institut de Biologie Computationnelle), 34095, Montpellier, France
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, B34 (+1) Avenue de l'Hôpital 1, 4000, Liège, Belgium
| |
Collapse
|
31
|
Druet T, Gautier M. A model-based approach to characterize individual inbreeding at both global and local genomic scales. Mol Ecol 2017; 26:5820-5841. [PMID: 28815918 DOI: 10.1111/mec.14324] [Citation(s) in RCA: 58] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Revised: 07/31/2017] [Accepted: 08/07/2017] [Indexed: 01/05/2023]
Abstract
Inbreeding results from the mating of related individuals and may be associated with reduced fitness because it brings together deleterious variants in one individual. In general, inbreeding is estimated with respect to an arbitrary base population consisting of ancestors that are assumed unrelated. We herein propose a model-based approach to estimate and characterize individual inbreeding at both global and local genomic scales by assuming the individual genome is a mosaic of homozygous-by-descent (HBD) and non-HBD segments. The HBD segments may originate from ancestors tracing back to different periods in the past defining distinct age-related classes. The lengths of the HBD segments are exponentially distributed with class-specific parameters reflecting that inbreeding of older origin generates on average shorter stretches of observed homozygous markers. The model is implemented in a hidden Markov model framework that uses marker allele frequencies, genetic distances, genotyping error rates and the sequences of observed genotypes. Note that genotyping errors, low-fold sequencing or genotype-by-sequencing data are easily accommodated under this framework. Based on simulations under the inference model, we show that the genomewide inbreeding coefficients and the parameters of the model are accurately estimated. In addition, when several inbreeding classes are simulated, the model captures them if their ages are sufficiently different. Complementary analyses, either on data sets simulated under more realistic models or on human, dog and sheep real data, illustrate the range of applications of the approach and how it can reveal recent demographic histories among populations (e.g., very recent bottlenecks or founder effects). The method also allows to clearly identify individuals resulting from extreme consanguineous matings.
Collapse
Affiliation(s)
- T Druet
- Unit of Animal Genomics, GIGA-R & Faculty of Veterinary Medicine, University of Liège, Liège, Belgium
| | - M Gautier
- INRA, UMR CBGP (INRA - IRD - Cirad - Montpellier SupAgro), Montferrier-sur-Lez, France.,Institut de Biologie Computationnelle, Montpellier, France
| |
Collapse
|
32
|
Fuentes-Pardo AP, Ruzzante DE. Whole-genome sequencing approaches for conservation biology: Advantages, limitations and practical recommendations. Mol Ecol 2017; 26:5369-5406. [PMID: 28746784 DOI: 10.1111/mec.14264] [Citation(s) in RCA: 158] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2017] [Revised: 06/23/2017] [Accepted: 06/28/2017] [Indexed: 12/14/2022]
Abstract
Whole-genome resequencing (WGR) is a powerful method for addressing fundamental evolutionary biology questions that have not been fully resolved using traditional methods. WGR includes four approaches: the sequencing of individuals to a high depth of coverage with either unresolved or resolved haplotypes, the sequencing of population genomes to a high depth by mixing equimolar amounts of unlabelled-individual DNA (Pool-seq) and the sequencing of multiple individuals from a population to a low depth (lcWGR). These techniques require the availability of a reference genome. This, along with the still high cost of shotgun sequencing and the large demand for computing resources and storage, has limited their implementation in nonmodel species with scarce genomic resources and in fields such as conservation biology. Our goal here is to describe the various WGR methods, their pros and cons and potential applications in conservation biology. WGR offers an unprecedented marker density and surveys a wide diversity of genetic variations not limited to single nucleotide polymorphisms (e.g., structural variants and mutations in regulatory elements), increasing their power for the detection of signatures of selection and local adaptation as well as for the identification of the genetic basis of phenotypic traits and diseases. Currently, though, no single WGR approach fulfils all requirements of conservation genetics, and each method has its own limitations and sources of potential bias. We discuss proposed ways to minimize such biases. We envision a not distant future where the analysis of whole genomes becomes a routine task in many nonmodel species and fields including conservation biology.
Collapse
|