1
|
Milesi P, Kastally C, Dauphin B, Cervantes S, Bagnoli F, Budde KB, Cavers S, Fady B, Faivre-Rampant P, González-Martínez SC, Grivet D, Gugerli F, Jorge V, Lesur Kupin I, Ojeda DI, Olsson S, Opgenoorth L, Pinosio S, Plomion C, Rellstab C, Rogier O, Scalabrin S, Scotti I, Vendramin GG, Westergren M, Lascoux M, Pyhäjärvi T. Resilience of genetic diversity in forest trees over the Quaternary. Nat Commun 2024; 15:8538. [PMID: 39402024 PMCID: PMC11473659 DOI: 10.1038/s41467-024-52612-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 09/11/2024] [Indexed: 10/17/2024] Open
Abstract
The effect of past environmental changes on the demography and genetic diversity of natural populations remains a contentious issue and has rarely been investigated across multiple, phylogenetically distant species. Here, we perform comparative population genomic analyses and demographic inferences for seven widely distributed and ecologically contrasting European forest tree species based on concerted sampling of 164 populations across their natural ranges. For all seven species, the effective population size, Ne, increased or remained stable over many glacial cycles and up to 15 million years in the most extreme cases. Surprisingly, the drastic environmental changes associated with the Pleistocene glacial cycles have had little impact on the level of genetic diversity of dominant forest tree species, despite major shifts in their geographic ranges. Based on their trajectories of Ne over time, the seven tree species can be divided into three major groups, highlighting the importance of life history and range size in determining synchronous variation in genetic diversity over time. Altogether, our results indicate that forest trees have been able to retain their evolutionary potential over very long periods of time despite strong environmental changes.
Collapse
Affiliation(s)
- Pascal Milesi
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
- SciLifeLab, Uppsala University, Uppsala, Sweden.
| | - Chedly Kastally
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Benjamin Dauphin
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | - Sandra Cervantes
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Francesca Bagnoli
- Institute of Biosciences and Bioresources, National Research Council of Italy (IBBR-CNR), Sesto Fiorentino, Italy
| | - Katharina B Budde
- Department of Forest Genetics and Forest Tree Breeding, Georg-August-University Goettingen, Göttingen, Germany
- Department of Forest Genetic Resources, Northwest German Forest Research Institute, Hann. Münden, Germany
| | - Stephen Cavers
- UK Centre for Ecology & Hydrology (UKCEH), Bush Estate, UK
| | - Bruno Fady
- INRAE, URFM, Ecology of Mediterranean Forests, Avignon, France
| | | | | | - Delphine Grivet
- Institute of Forest Sciences (ICIFOR), INIA-CSIC, Madrid, Spain
| | - Felix Gugerli
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | | | - Isabelle Lesur Kupin
- University of Bordeaux, INRAE, BIOGECO, Cestas, France
- Helix Venture, Mérignac, France
| | - Dario I Ojeda
- Department of Forest Biodiversity, Norwegian Institute of Bioeconomy Research (NIBIO), Aas, Norway
| | - Sanna Olsson
- Institute of Forest Sciences (ICIFOR), INIA-CSIC, Madrid, Spain
| | - Lars Opgenoorth
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
- Plant Ecology and Geobotany, Philipps-Universität Marburg, Marburg, Germany
| | - Sara Pinosio
- Institute of Biosciences and Bioresources, National Research Council of Italy (IBBR-CNR), Sesto Fiorentino, Italy
- Institute of Applied Genomics (IGA), Udine, Italy
| | | | - Christian Rellstab
- Biodiversity and Conservation Biology, Swiss Federal Research Institute WSL, Birmensdorf, Switzerland
| | | | | | - Ivan Scotti
- INRAE, URFM, Ecology of Mediterranean Forests, Avignon, France
| | - Giovanni G Vendramin
- Institute of Biosciences and Bioresources, National Research Council of Italy (IBBR-CNR), Sesto Fiorentino, Italy
| | | | - Martin Lascoux
- Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
- SciLifeLab, Uppsala University, Uppsala, Sweden.
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland.
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.
| |
Collapse
|
2
|
Mora-Carrera E, Stubbs RL, Potente G, Yousefi N, Aeschbacher S, Keller B, Choudhury RR, Celep F, Kochjarová J, de Vos JM, Szövényi P, Conti E. Unveiling the Genome-Wide Consequences of Range Expansion and Mating System Transitions in Primula vulgaris. Genome Biol Evol 2024; 16:evae208. [PMID: 39340447 PMCID: PMC11469071 DOI: 10.1093/gbe/evae208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 09/16/2024] [Accepted: 09/23/2024] [Indexed: 09/30/2024] Open
Abstract
Genetic diversity is heterogeneously distributed among populations of the same species, due to the joint effects of multiple demographic processes, including range contractions and expansions, and mating systems shifts. Here, we ask how both processes shape genomic diversity in space and time in the classical Primula vulgaris model. This perennial herb originated in the Caucasus region and was hypothesized to have expanded westward following glacial retreat in the Quaternary. Moreover, this species is a long-standing model for mating system transitions, exemplified by shifts from heterostyly to homostyly. Leveraging a high-quality reference genome of the closely related Primula veris and whole-genome resequencing data from both heterostylous and homostylous individuals from populations encompassing a wide distribution of P. vulgaris, we reconstructed the demographic history of P. vulgaris. Results are compatible with the previously proposed hypothesis of range expansion from the Caucasus region approximately 79,000 years ago and suggest later shifts to homostyly following rather than preceding postglacial colonization of England. Furthermore, in accordance with population genetic theoretical predictions, both processes are associated with reduced genetic diversity, increased linkage disequilibrium, and reduced efficacy of purifying selection. A novel result concerns the contrasting effects of range expansion versus shift to homostyly on transposable elements, for the former, process is associated with changes in transposable element genomic content, while the latter is not. Jointly, our results elucidate how the interactions among range expansion, transitions to selfing, and Quaternary climatic oscillations shape plant evolution.
Collapse
Affiliation(s)
- Emiliano Mora-Carrera
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Rebecca L Stubbs
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Giacomo Potente
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Narjes Yousefi
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Simon Aeschbacher
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Barbara Keller
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Rimjhim Roy Choudhury
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Ferhat Celep
- Department of Biology, Faculty of Engineering and Natural Sciences, Kırıkkale University, Kırıkkale, Turkey
| | - Judita Kochjarová
- Department of Phytology, Faculty of Forestry, Technical University in Zvolen, Zvolen, Slovak Republic
| | - Jurriaan M de Vos
- Department of Environmental Sciences—Botany, University of Basel, Basel, Switzerland
| | - Peter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Elena Conti
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| |
Collapse
|
3
|
Liang X, Yang S, Wang D, Knief U. Characterization and distribution of de novo mutations in the zebra finch. Commun Biol 2024; 7:1243. [PMID: 39358581 PMCID: PMC11447093 DOI: 10.1038/s42003-024-06945-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 09/24/2024] [Indexed: 10/04/2024] Open
Abstract
Germline de novo mutations (DNMs) provide the raw material for evolution. The DNM rate varies considerably between species, sexes and chromosomes. Here, we identify DNMs in the zebra finch (Taeniopygia guttata) across 16 parent-offspring trios using two genome assemblies of different quality. Using an independent genotyping assay, we validate 82% of the 150 candidate DNMs. DNM rates are consistent between both assemblies, with estimates of 6.14 × 10-9 and 6.36 × 10-9 per site per generation. We observe a strong paternal bias in DNM rates (male-to-female ratio ɑ ≈ 4), but this bias is in transition mutations only, leading to a transition-to-transversion ratio of 3.18 and 3.57. Finally, we find that DNMs tend to be randomly distributed across chromosomes, not associated with recombination hotspots or genic regions. However, the sex chromosome chrZ shows a roughly fourfold increased DNM rate compared to autosomes, which is more than the expected increase due to chrZ spending two-thirds of its time in males. Overall, our results further enhance our understanding of DNMs in passerine songbirds.
Collapse
Affiliation(s)
- Xixi Liang
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Shuai Yang
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Daiping Wang
- Key Laboratory of Animal Ecology and Conservation Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Ulrich Knief
- Evolutionary Biology & Ecology, Faculty of Biology, University of Freiburg, Freiburg, Germany
| |
Collapse
|
4
|
Modica A, Lalagüe H, Muratorio S, Scotti I. Rolling down that mountain: microgeographical adaptive divergence during a fast population expansion along a steep environmental gradient in European beech. Heredity (Edinb) 2024; 133:99-112. [PMID: 38890557 PMCID: PMC11286953 DOI: 10.1038/s41437-024-00696-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 05/23/2024] [Accepted: 05/23/2024] [Indexed: 06/20/2024] Open
Abstract
Forest tree populations harbour high genetic diversity thanks to large effective population sizes and strong gene flow, allowing them to diversify through adaptation to local environmental pressures within dispersal distance. Many tree populations also experienced historical demographic fluctuations, including spatial population contraction or expansions at various temporal scales, which may constrain their ability to adapt to environmental variations. Our aim is to investigate how recent contraction and expansion events interfere with local adaptation, by studying patterns of adaptive divergence between closely related stands undergoing environmentally contrasted conditions, and having or not recently expanded. To investigate genome-wide signatures of local adaptation while accounting for demography, we analysed divergence in a European beech population by testing pairwise differentiation among four tree stands at ~35k Single Nucleotide Polymorphisms from ~9k genomic regions. We applied three divergence outlier search methods resting on different assumptions and targeting either single SNPs or contiguous genomic regions, while accounting for the effect of population size variations on genetic divergence. We found 27 signals of selective signatures in 19 target regions. Putatively adaptive divergence involved all stand pairs. We retrieved signals both when comparing old-growth stands and recently colonised areas and when comparing stands within the old-growth area. Therefore, adaptive divergence processes have taken place both over short time spans, under strong environmental contrasts, and over short ecological gradients, in populations that have been stable in the long term. This suggests that standing genetic variation supports local, microgeographic divergence processes, which can maintain genetic diversity at the landscape level.
Collapse
Affiliation(s)
- Andrea Modica
- INRAE, URFM, 228, Route de l'Aérodrome, 84914, Avignon, France
| | - Hadrien Lalagüe
- INRAE, EcoFoG, Campus agronomique, 97310, Kourou, French Guiana
| | - Sylvie Muratorio
- INRAE, EcoBioP, 173, Route de Saint-Jean-de-Luz RD 918, 64310, Saint-Pée-sur-Nivelle, France
| | - Ivan Scotti
- INRAE, URFM, 228, Route de l'Aérodrome, 84914, Avignon, France.
| |
Collapse
|
5
|
Serradell JM, Lorenzo-Salazar JM, Flores C, Lao O, Comas D. Modelling the demographic history of human North African genomes points to a recent soft split divergence between populations. Genome Biol 2024; 25:201. [PMID: 39080715 PMCID: PMC11290046 DOI: 10.1186/s13059-024-03341-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 07/22/2024] [Indexed: 08/02/2024] Open
Abstract
BACKGROUND North African human populations present a complex demographic scenario due to the presence of an autochthonous genetic component and population substructure, plus extensive gene flow from the Middle East, Europe, and sub-Saharan Africa. RESULTS We conducted a comprehensive analysis of 364 genomes to construct detailed demographic models for the North African region, encompassing its two primary ethnic groups, the Arab and Amazigh populations. This was achieved through an Approximate Bayesian Computation with Deep Learning (ABC-DL) framework and a novel algorithm called Genetic Programming for Population Genetics (GP4PG). This innovative approach enabled us to effectively model intricate demographic scenarios, utilizing a subset of 16 whole genomes at > 30X coverage. The demographic model suggested by GP4PG exhibited a closer alignment with the observed data compared to the ABC-DL model. Both point to a back-to-Africa origin of North African individuals and a close relationship with Eurasian populations. Results support different origins for Amazigh and Arab populations, with Amazigh populations originating back in Epipaleolithic times, while GP4PG supports Arabization as the main source of Middle Eastern ancestry. The GP4PG model includes population substructure in surrounding populations (sub-Saharan Africa and Middle East) with continuous decaying gene flow after population split. Contrary to ABC-DL, the best GP4PG model does not require pulses of admixture from surrounding populations into North Africa pointing to soft splits as drivers of divergence in North Africa. CONCLUSIONS We have built a demographic model on North Africa that points to a back-to-Africa expansion and a differential origin between Arab and Amazigh populations.
Collapse
Affiliation(s)
- Jose M Serradell
- Departament de Medicina i Ciències de la Vida, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Carrer del Doctor Aiguader 88, Barcelona, 08003, Spain
| | - Jose M Lorenzo-Salazar
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Granadilla de Abona s/n, Santa Cruz de Tenerife, 38600, Spain
| | - Carlos Flores
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Granadilla de Abona s/n, Santa Cruz de Tenerife, 38600, Spain
- Plataforma Genómica de Alto Rendimiento para el Estudio de la Biodiversidad, Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas, San Cristóbal de La Laguna, Santa Cruz de Tenerife, 38206, Spain
- Research Unit, Hospital Universitario Nuestra Señora de Candelaria, Carretera del Rosario 145, Santa Cruz de Tenerife, 38010, Spain
- CIBER de Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Av. de Monforte de Lemos, 3-5, Madrid, 28029, Spain
- Facultad de Ciencias de la Salud, Universidad Fernando de Pessoa Canarias, Calle de La Juventud S/N, Santa María de Guía, Las Palmas de Gran Canaria, 35450, Spain
| | - Oscar Lao
- Departament de Medicina i Ciències de la Vida, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Carrer del Doctor Aiguader 88, Barcelona, 08003, Spain.
| | - David Comas
- Departament de Medicina i Ciències de la Vida, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Carrer del Doctor Aiguader 88, Barcelona, 08003, Spain.
| |
Collapse
|
6
|
Tengstedt ANB, Liu S, Jacobsen MW, Gundlund C, Møller PR, Berg S, Bekkevold D, Hansen MM. Genomic insights on conservation priorities for North Sea houting and European lake whitefish (Coregonus spp.). Mol Ecol 2024:e17367. [PMID: 38686435 DOI: 10.1111/mec.17367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Revised: 04/08/2024] [Accepted: 04/12/2024] [Indexed: 05/02/2024]
Abstract
Population genomics analysis holds great potential for informing conservation of endangered populations. We focused on a controversial case of European whitefish (Coregonus spp.) populations. The endangered North Sea houting is the only coregonid fish that tolerates oceanic salinities and was previously considered a species (C. oxyrhinchus) distinct from European lake whitefish (C. lavaretus). However, no firm evidence for genetic-based salinity adaptation has been available. Also, studies based on microsatellite and mitogenome data suggested surprisingly recent divergence (c. 2500 years bp) between houting and lake whitefish. These data types furthermore have provided no evidence for possible inbreeding. Finally, a controversial taxonomic revision recently classified all whitefish in the region as C. maraena, calling conservation priorities of houting into question. We used whole-genome and ddRAD sequencing to analyse six lake whitefish populations and the only extant indigenous houting population. Demographic inference indicated post-glacial expansion and divergence between lake whitefish and houting occurring not long after the Last Glaciation, implying deeper population histories than previous analyses. Runs of homozygosity analysis suggested not only high inbreeding (FROH up to 30.6%) in some freshwater populations but also FROH up to 10.6% in the houting prompting conservation concerns. Finally, outlier scans provided evidence for adaptation to high salinities in the houting. Applying a framework for defining conservation units based on current and historical reproductive isolation and adaptive divergence led us to recommend that the houting be treated as a separate conservation unit regardless of species status. In total, the results underscore the potential of genomics to inform conservation practices, in this case clarifying conservation units and highlighting populations of concern.
Collapse
Affiliation(s)
| | - Shenglin Liu
- Department of Biology, Aarhus University, Aarhus C, Denmark
| | - Magnus W Jacobsen
- National Institute of Aquatic Resources, Technical University of Denmark, Silkeborg, Denmark
| | | | - Peter Rask Møller
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Søren Berg
- National Institute of Aquatic Resources, Technical University of Denmark, Silkeborg, Denmark
| | - Dorte Bekkevold
- National Institute of Aquatic Resources, Technical University of Denmark, Silkeborg, Denmark
| | | |
Collapse
|
7
|
Kessler C, Shafer ABA. Genomic Analyses Capture the Human-Induced Demographic Collapse and Recovery in a Wide-Ranging Cervid. Mol Biol Evol 2024; 41:msae038. [PMID: 38378172 PMCID: PMC10917209 DOI: 10.1093/molbev/msae038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 02/08/2024] [Accepted: 02/14/2024] [Indexed: 02/22/2024] Open
Abstract
The glacial cycles of the Quaternary heavily impacted species through successions of population contractions and expansions. Similarly, populations have been intensely shaped by human pressures such as unregulated hunting and land use changes. White-tailed and mule deer survived in different refugia through the Last Glacial Maximum, and their populations were severely reduced after the European colonization. Here, we analyzed 73 resequenced deer genomes from across their North American range to understand the consequences of climatic and anthropogenic pressures on deer demographic and adaptive history. We found strong signals of climate-induced vicariance and demographic decline; notably, multiple sequentially Markovian coalescent recovers a severe decline in mainland white-tailed deer effective population size (Ne) at the end of the Last Glacial Maximum. We found robust evidence for colonial overharvest in the form of a recent and dramatic drop in Ne in all analyzed populations. Historical census size and restocking data show a clear parallel to historical Ne estimates, and temporal Ne/Nc ratio shows patterns of conservation concern for mule deer. Signatures of selection highlight genes related to temperature, including a cold receptor previously highlighted in woolly mammoth. We also detected immune genes that we surmise reflect the changing land use patterns in North America. Our study provides a detailed picture of anthropogenic and climatic-induced decline in deer diversity and clues to understanding the conservation concerns of mule deer and the successful demographic recovery of white-tailed deer.
Collapse
Affiliation(s)
- Camille Kessler
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, Ontario, Canada
| | - Aaron B A Shafer
- Environmental and Life Sciences Graduate Program, Trent University, Peterborough, Ontario, Canada
- Department of Forensic Science, Trent University, Peterborough, Ontario, Canada
| |
Collapse
|
8
|
Teixeira H, Le Corre M, Michon L, Nicoll MAC, Jaeger A, Nikolic N, Pinet P, Couzi FX, Humeau L. Past volcanic activity predisposes an endemic threatened seabird to negative anthropogenic impacts. Sci Rep 2024; 14:1960. [PMID: 38263429 PMCID: PMC10805739 DOI: 10.1038/s41598-024-52556-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 01/19/2024] [Indexed: 01/25/2024] Open
Abstract
Humans are regularly cited as the main driver of current biodiversity extinction, but the impact of historic volcanic activity is often overlooked. Pre-human evidence of wildlife abundance and diversity are essential for disentangling anthropogenic impacts from natural events. Réunion Island, with its intense and well-documented volcanic activity, endemic biodiversity, long history of isolation and recent human colonization, provides an opportunity to disentangle these processes. We track past demographic changes of a critically endangered seabird, the Mascarene petrel Pseudobulweria aterrima, using genome-wide SNPs. Coalescent modeling suggested that a large ancestral population underwent a substantial population decline in two distinct phases, ca. 125,000 and 37,000 years ago, coinciding with periods of major eruptions of Piton des Neiges. Subsequently, the ancestral population was fragmented into the two known colonies, ca. 1500 years ago, following eruptions of Piton de la Fournaise. In the last century, both colonies declined significantly due to anthropogenic activities, and although the species was initially considered extinct, it was rediscovered in the 1970s. Our findings suggest that the current conservation status of wildlife on volcanic islands should be firstly assessed as a legacy of historic volcanic activity, and thereafter by the increasing anthropogenic impacts, which may ultimately drive species towards extinction.
Collapse
Affiliation(s)
- Helena Teixeira
- UMR ENTROPIE (Université de La Réunion, IRD, CNRS, IFREMER, Université de Nouvelle-Calédonie), 15 Avenue René Cassin, CS 92003, 97744, Saint Denis Cedex 9, Ile de La Réunion, France.
| | - Matthieu Le Corre
- UMR ENTROPIE (Université de La Réunion, IRD, CNRS, IFREMER, Université de Nouvelle-Calédonie), 15 Avenue René Cassin, CS 92003, 97744, Saint Denis Cedex 9, Ile de La Réunion, France
| | - Laurent Michon
- Université de La Réunion, Laboratoire Géosciences Réunion, 97744, Saint Denis, France
- Université Paris Cité, Institut de physique du globe de Paris, CNRS, 75005, Paris, France
| | - Malcolm A C Nicoll
- Institute of Zoology, Zoological Society of London, Regent's Park, London, NW1 4RY, UK
| | - Audrey Jaeger
- UMR ENTROPIE (Université de La Réunion, IRD, CNRS, IFREMER, Université de Nouvelle-Calédonie), 15 Avenue René Cassin, CS 92003, 97744, Saint Denis Cedex 9, Ile de La Réunion, France
| | | | - Patrick Pinet
- Parc National de La Réunion, Life+ Pétrels, 258 Rue de la République, 97431, Plaine des Palmistes, Réunion Island, France
| | - François-Xavier Couzi
- Société d'Etudes Ornithologiques de La Réunion (SEOR), 13 ruelle des Orchidées, 97440, Saint André, Réunion Island, France
| | - Laurence Humeau
- UMR PVBMT (Université de La Réunion, CIRAD), 15 Avenue René Cassin, CS 92003, 97744, Saint Denis Cedex 9, Ile de La Réunion, France
| |
Collapse
|
9
|
Marchi N, Kapopoulou A, Excoffier L. Demogenomic inference from spatially and temporally heterogeneous samples. Mol Ecol Resour 2024; 24:e13877. [PMID: 37819677 DOI: 10.1111/1755-0998.13877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Revised: 09/15/2023] [Accepted: 09/27/2023] [Indexed: 10/13/2023]
Abstract
Modern and ancient genomes are not necessarily drawn from homogeneous populations, as they may have been collected from different places and at different times. This heterogeneous sampling can be an issue for demographic inferences and results in biased demographic parameters and incorrect model choice if not properly considered. When explicitly accounted for, it can result in very complex models and high data dimensionality that are difficult to analyse. In this paper, we formally study the impact of such spatial and temporal sampling heterogeneity on demographic inference, and we introduce a way to circumvent this problem. To deal with structured samples without increasing the dimensionality of the site frequency spectrum (SFS), we introduce a new structured approach to the existing program fastsimcoal2. We assess the efficiency and relevance of this methodological update with simulated and modern human genomic data. We particularly focus on spatial and temporal heterogeneities to evidence the interest of this new SFS-based approach, which can be especially useful when handling scattered and ancient DNA samples, as in conservation genetics or archaeogenetics.
Collapse
Affiliation(s)
- Nina Marchi
- CMPG, Institute for Ecology and Evolution, University of Berne, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Adamandia Kapopoulou
- CMPG, Institute for Ecology and Evolution, University of Berne, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Laurent Excoffier
- CMPG, Institute for Ecology and Evolution, University of Berne, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
10
|
Mah JC, Lohmueller KE, Garud N. Inference of the demographic histories and selective effects of human gut commensal microbiota over the course of human history. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.09.566454. [PMID: 38014007 PMCID: PMC10680615 DOI: 10.1101/2023.11.09.566454] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Despite the importance of gut commensal microbiota to human health, there is little knowledge about their evolutionary histories, including their population demographic histories and their distributions of fitness effects (DFE) of new mutations. Here, we infer the demographic histories and DFEs of 27 of the most highly prevalent and abundant commensal gut microbial species in North Americans over timescales exceeding human generations using a collection of lineages inferred from a panel of healthy hosts. We find overall reductions in genetic variation among commensal gut microbes sampled from a Western population relative to an African rural population. Additionally, some species in North American microbiomes display contractions in population size and others expansions, potentially occurring at several key historical moments in human history. DFEs across species vary from highly to mildly deleterious, with accessory genes experiencing more drift compared to core genes. Within genera, DFEs tend to be more congruent, reflective of underlying phylogenetic relationships. Taken together, these findings suggest that human commensal gut microbes have distinct evolutionary histories, possibly reflecting the unique roles of individual members of the microbiome.
Collapse
Affiliation(s)
- Jonathan C. Mah
- Bioinformatics Interdepartmental Program, University of California, Los Angeles
| | - Kirk E. Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles
- Department of Human Genetics, University of California, Los Angeles
| | - Nandita Garud
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles
- Department of Human Genetics, University of California, Los Angeles
| |
Collapse
|
11
|
Fan C, Cahoon JL, Dinh BL, Ortega-Del Vecchyo D, Huber C, Edge MD, Mancuso N, Chiang CWK. A likelihood-based framework for demographic inference from genealogical trees. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.10.561787. [PMID: 37873208 PMCID: PMC10592779 DOI: 10.1101/2023.10.10.561787] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
The demographic history of a population drives the pattern of genetic variation and is encoded in the gene-genealogical trees of the sampled alleles. However, existing methods to infer demographic history from genetic data tend to use relatively low-dimensional summaries of the genealogy, such as allele frequency spectra. As a step toward capturing more of the information encoded in the genome-wide sequence of genealogical trees, here we propose a novel framework called the genealogical likelihood (gLike), which derives the full likelihood of a genealogical tree under any hypothesized demographic history. Employing a graph-based structure, gLike summarizes across independent trees the relationships among all lineages in a tree with all possible trajectories of population memberships through time and efficiently computes the exact marginal probability under a parameterized demographic model. Through extensive simulations and empirical applications on populations that have experienced multiple admixtures, we showed that gLike can accurately estimate dozens of demographic parameters when the true genealogy is known, including ancestral population sizes, admixture timing, and admixture proportions. Moreover, when using genealogical trees inferred from genetic data, we showed that gLike outperformed conventional demographic inference methods that leverage only the allele-frequency spectrum and yielded parameter estimates that align with established historical knowledge of the past demographic histories for populations like Latino Americans and Native Hawaiians. Furthermore, our framework can trace ancestral histories by analyzing a sample from the admixed population without proxies for its source populations, removing the need to sample ancestral populations that may no longer exist. Taken together, our proposed gLike framework harnesses underutilized genealogical information to offer exceptional sensitivity and accuracy in inferring complex demographies for humans and other species, particularly as estimation of genome-wide genealogies improves.
Collapse
Affiliation(s)
- Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Jordan L Cahoon
- Department of Quantitative and Computational Biology, University of Southern California
- Department of Computer Science, University of Southern California
| | - Bryan L Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, México
| | - Christian Huber
- Department of Biology, Penn State University, University Park, PA, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| |
Collapse
|
12
|
Wakeley J, Fan WT(L, Koch E, Sunyaev S. Recurrent mutation in the ancestry of a rare variant. Genetics 2023; 224:iyad049. [PMID: 36967220 PMCID: PMC10324944 DOI: 10.1093/genetics/iyad049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 01/30/2023] [Accepted: 03/08/2023] [Indexed: 03/28/2023] Open
Abstract
Recurrent mutation produces multiple copies of the same allele which may be co-segregating in a population. Yet, most analyses of allele-frequency or site-frequency spectra assume that all observed copies of an allele trace back to a single mutation. We develop a sampling theory for the number of latent mutations in the ancestry of a rare variant, specifically a variant observed in relatively small count in a large sample. Our results follow from the statistical independence of low-count mutations, which we show to hold for the standard neutral coalescent or diffusion model of population genetics as well as for more general coalescent trees. For populations of constant size, these counts are distributed like the number of alleles in the Ewens sampling formula. We develop a Poisson sampling model for populations of varying size and illustrate it using new results for site-frequency spectra in an exponentially growing population. We apply our model to a large data set of human SNPs and use it to explain dramatic differences in site-frequency spectra across the range of mutation rates in the human genome.
Collapse
Affiliation(s)
- John Wakeley
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
| | - Wai-Tong (Louis) Fan
- Department of Mathematics, Indiana University, Bloomington, IN 47405, USA
- Center of Mathematical Sciences and Applications, Harvard University, Cambridge, MA 02138, USA
| | - Evan Koch
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Shamil Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA
- Division of Genetics, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
13
|
Germain RR, Feng S, Chen G, Graves GR, Tobias JA, Rahbek C, Lei F, Fjeldså J, Hosner PA, Gilbert MTP, Zhang G, Nogués-Bravo D. Species-specific traits mediate avian demographic responses under past climate change. Nat Ecol Evol 2023:10.1038/s41559-023-02055-3. [PMID: 37106156 DOI: 10.1038/s41559-023-02055-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Accepted: 03/30/2023] [Indexed: 04/29/2023]
Abstract
Anticipating species' responses to environmental change is a pressing mission in biodiversity conservation. Despite decades of research investigating how climate change may affect population sizes, historical context is lacking, and the traits that mediate demographic sensitivity to changing climate remain elusive. We use whole-genome sequence data to reconstruct the demographic histories of 263 bird species over the past million years and identify networks of interacting morphological and life history traits associated with changes in effective population size (Ne) in response to climate warming and cooling. Our results identify direct and indirect effects of key traits representing dispersal, reproduction and survival on long-term demographic responses to climate change, thereby highlighting traits most likely to influence population responses to ongoing climate warming.
Collapse
Affiliation(s)
- Ryan R Germain
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark.
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Shaohong Feng
- Center for Evolutionary and Organismal Biology, Zhejiang University School of Medicine, Hangzhou, China
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, China
- Department of General Surgery of Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou, China
- Innovation Center of Yangtze River Delta, Zhejiang University, Hangzhou, China
| | - Guangji Chen
- BGI Shenzhen, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Gary R Graves
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Department of Vertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC, USA
| | - Joseph A Tobias
- Department of Life Sciences, Imperial College London, Ascot, UK
| | - Carsten Rahbek
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Department of Life Sciences, Imperial College London, Ascot, UK
- Center for Global Mountain Biodiversity, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Danish Institute for Advanced Study, University of Southern Denmark, Odense, Denmark
| | - Fumin Lei
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
| | - Jon Fjeldså
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Peter A Hosner
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Center for Global Mountain Biodiversity, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - M Thomas P Gilbert
- Center for Evolutionary Hologenomics, The Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Department of Natural History, University Museum, Norwegian University of Science and Technology, Trondheim, Norway
| | - Guojie Zhang
- Villum Centre for Biodiversity Genomics, Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
- Center for Evolutionary and Organismal Biology, Zhejiang University School of Medicine, Hangzhou, China.
- Liangzhu Laboratory, Zhejiang University Medical Center, Hangzhou, China.
- Innovation Center of Yangtze River Delta, Zhejiang University, Hangzhou, China.
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China.
| | - David Nogués-Bravo
- Center for Macroecology, Evolution, and Climate, The Globe Institute, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
14
|
Freund F, Kerdoncuff E, Matuszewski S, Lapierre M, Hildebrandt M, Jensen JD, Ferretti L, Lambert A, Sackton TB, Achaz G. Interpreting the pervasive observation of U-shaped Site Frequency Spectra. PLoS Genet 2023; 19:e1010677. [PMID: 36952570 PMCID: PMC10072462 DOI: 10.1371/journal.pgen.1010677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 04/04/2023] [Accepted: 02/22/2023] [Indexed: 03/25/2023] Open
Abstract
The standard neutral model of molecular evolution has traditionally been used as the null model for population genomics. We gathered a collection of 45 genome-wide site frequency spectra from a diverse set of species, most of which display an excess of low and high frequency variants compared to the expectation of the standard neutral model, resulting in U-shaped spectra. We show that multiple merger coalescent models often provide a better fit to these observations than the standard Kingman coalescent. Hence, in many circumstances these under-utilized models may serve as the more appropriate reference for genomic analyses. We further discuss the underlying evolutionary processes that may result in the widespread U-shape of frequency spectra.
Collapse
Affiliation(s)
- Fabian Freund
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
- Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Elise Kerdoncuff
- Department of Genetics, University of California, Berkeley, California, United States of America
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | | | - Marguerite Lapierre
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | | | - Jeffrey D Jensen
- Center for Evolution & Medicine, School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Luca Ferretti
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Amaury Lambert
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, Paris, France
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
| | - Timothy B Sackton
- Éco-anthropologie, Muséum National d'Histoire Naturelle, Université Paris-Cité, Paris, France
| | - Guillaume Achaz
- Informatics Group, Harvard University, Cambridge, Massachusetts, United States of America
- SMILE group, Center for Interdisciplinary Research in Biology (CIRB), Collège de France, Paris, France
| |
Collapse
|
15
|
Tiley GP, van Elst T, Teixeira H, Schüßler D, Salmona J, Blanco MB, Ralison JM, Randrianambinina B, Rasoloarison RM, Stahlke AR, Hohenlohe PA, Chikhi L, Louis EE, Radespiel U, Yoder AD. Population genomic structure in Goodman's mouse lemur reveals long-standing separation of Madagascar's Central Highlands and eastern rainforests. Mol Ecol 2022; 31:4901-4918. [PMID: 35880414 DOI: 10.1111/mec.16632] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Revised: 06/25/2022] [Accepted: 07/08/2022] [Indexed: 11/28/2022]
Abstract
Madagascar's Central Highlands are largely composed of grasslands, interspersed with patches of forest. The historical perspective was that Madagascar's grasslands had anthropogenic origins, but emerging evidence suggests that grasslands were a component of the pre-human Central Highlands vegetation. Consequently, there is now vigorous debate regarding the extent to which these grasslands have expanded due to anthropogenic pressures. Here, we shed light on the temporal dynamics of Madagascar's vegetative composition by conducting a population genomic investigation of Goodman's mouse lemur (Microcebus lehilahytsara; Cheirogaleidae). These small-bodied primates occur both in Madagascar's eastern rainforests and in the Central Highlands, making them a valuable indicator species. Population divergences among forest-dwelling mammals will reflect changes to their habitat, including fragmentation, whereas patterns of post-divergence gene flow can reveal formerly wooded migration corridors. To explore these patterns, we used RADseq data to infer population genetic structure, demographic models of post-divergence gene flow, and population size change through time. The results offer evidence that open habitats are an ancient component of the Central Highlands, and that wide-spread forest fragmentation occurred naturally during a period of decreased precipitation near the Last Glacial Maximum. Models of gene flow suggest that migration across the Central Highlands has been possible from the Pleistocene through the recent Holocene via riparian corridors. Though our findings support the hypothesis that Central Highland grasslands predate human arrival, we also find evidence for human-mediated population declines. This highlights the extent to which species imminently threatened by human-mediated deforestation may already be vulnerable from paleoclimatic conditions.
Collapse
Affiliation(s)
- George P Tiley
- Department of Biology, Duke University, Durham, NC, USA.,Royal Botanic Gardens Kew, Richmond, UK
| | - Tobias van Elst
- Institute of Zoology, University of Veterinary Medicine Hannover, Hannover, Germany
| | - Helena Teixeira
- Institute of Zoology, University of Veterinary Medicine Hannover, Hannover, Germany
| | - Dominik Schüßler
- Research Group Vegetation Ecology and Nature Conservation, Institute of Biology and Chemistry, University of Hildesheim, Hildesheim, Germany
| | - Jordi Salmona
- CNRS, Université Paul Sabatier, IRD; UMR5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France
| | | | - José M Ralison
- Département de Biologie Animale, Université d'Antananarivo, Antananarivo, Madagascar
| | - Blanchard Randrianambinina
- Group d'Etude et de Recherche sur les Primates de Madagascar (GERP), Antananarivo, Madagascar.,Faculté des Sciences, University of Mahajanga, Mahajanga, Madagascar
| | - Rodin M Rasoloarison
- Département de Biologie Animale, Université d'Antananarivo, Antananarivo, Madagascar.,Group d'Etude et de Recherche sur les Primates de Madagascar (GERP), Antananarivo, Madagascar
| | - Amanda R Stahlke
- Institute for Bioinformatics and Evolutionary Studies, Department of Biological Sciences, University of Idaho, Moscow, ID, USA
| | - Paul A Hohenlohe
- Institute for Bioinformatics and Evolutionary Studies, Department of Biological Sciences, University of Idaho, Moscow, ID, USA
| | - Lounès Chikhi
- CNRS, Université Paul Sabatier, IRD; UMR5174 EDB (Laboratoire Évolution & Diversité Biologique), Toulouse, France.,Instituto Gulbenkian de Ciência, Rua da Quinta Grande, Oeiras, Portugal
| | - Edward E Louis
- Grewcock Center for Conservation and Research, Omaha's Henry Doorly Zoo and Aquarium, Omaha, NE, USA
| | - Ute Radespiel
- Institute of Zoology, University of Veterinary Medicine Hannover, Hannover, Germany
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, NC, USA
| |
Collapse
|
16
|
Carson J, Ledda A, Ferretti L, Keeling M, Didelot X. The bounded coalescent model: Conditioning a genealogy on a minimum root date. J Theor Biol 2022; 548:111186. [PMID: 35697144 DOI: 10.1016/j.jtbi.2022.111186] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 05/05/2022] [Accepted: 06/02/2022] [Indexed: 01/27/2023]
Abstract
The coalescent model represents how individuals sampled from a population may have originated from a last common ancestor. The bounded coalescent model is obtained by conditioning the coalescent model such that the last common ancestor must have existed after a certain date. This conditioned model arises in a variety of applications, such as speciation, horizontal gene transfer or transmission analysis, and yet the bounded coalescent model has not been previously analysed in detail. Here we describe a new algorithm to simulate from this model directly, without resorting to rejection sampling. We show that this direct simulation algorithm is more computationally efficient than the rejection sampling approach. We also show how to calculate the probability of the last common ancestor occurring after a given date, which is required to compute the probability density of realisations under the bounded coalescent model. Our results are applicable in both the isochronous (when all samples have the same date) and heterochronous (where samples can have different dates) settings. We explore the effect of setting a bound on the date of the last common ancestor, and show that it affects a number of properties of the resulting phylogenies. All our methods are implemented in a new R package called BoundedCoalescent which is freely available online.
Collapse
Affiliation(s)
- Jake Carson
- Mathematics Institute, University of Warwick, United Kingdom
| | - Alice Ledda
- HCAI, Fungal, AMR, AMU & Sepsis Division, UK Health Security Agency, United Kingdom
| | - Luca Ferretti
- Big Data Institute, University of Oxford, United Kingdom
| | - Matt Keeling
- Mathematics Institute, University of Warwick, United Kingdom
| | - Xavier Didelot
- Department of Statistics and School of Life Sciences, University of Warwick, United Kingdom
| |
Collapse
|
17
|
Jasper RJ, McDonald TK, Singh P, Lu M, Rougeux C, Lind BM, Yeaman S. Evaluating the accuracy of variant calling methods using the frequency of parent-offspring genotype mismatch. Mol Ecol Resour 2022; 22:2524-2533. [PMID: 35510784 PMCID: PMC9544674 DOI: 10.1111/1755-0998.13628] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 03/11/2022] [Accepted: 03/29/2022] [Indexed: 11/28/2022]
Abstract
The use of next‐generation sequencing (NGS) data sets has increased dramatically over the last decade, but there have been few systematic analyses quantifying the accuracy of the commonly used variant caller programs. Here we used a familial design consisting of diploid tissue from a single lodgepole pine (Pinus contorta) parent and the maternally derived haploid tissue from 106 full‐sibling offspring, where mismatches could only arise due to mutation or bioinformatic error. Given the rarity of mutation, we used the rate of mismatches between parent and offspring genotype calls to infer the single nucleotide polymorphism (SNP) genotyping error rates of FreeBayes, HaplotypeCaller, SAMtools, UnifiedGenotyper, and VarScan. With baseline filtering HaplotypeCaller and UnifiedGenotyper yielded more SNPs and higher error rates by one to two orders of magnitude, whereas FreeBayes, SAMtools and VarScan yielded lower numbers of SNPs and more modest error rates. To facilitate comparison between variant callers we standardized each SNP set to the same number of SNPs using additional filtering, where UnifiedGenotyper consistently produced the smallest proportion of genotype errors, followed by HaplotypeCaller, VarScan, SAMtools, and FreeBayes. Additionally, we found that error rates were minimized for SNPs called by more than one variant caller. Finally, we evaluated the performance of various commonly used filtering metrics on SNP calling. Our analysis provides a quantitative assessment of the accuracy of five widely used variant calling programs and offers valuable insights into both the choice of variant caller program and the choice of filtering metrics, especially for researchers using non‐model study systems.
Collapse
Affiliation(s)
- Russ J Jasper
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | | | - Pooja Singh
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada.,Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland.,3EAWAG, Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland
| | - Mengmeng Lu
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Clément Rougeux
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| | - Brandon M Lind
- Centre for Forest Conservation Genetics and Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, BC, Canada
| | - Sam Yeaman
- Department of Biological Sciences, University of Calgary, Calgary, AB, Canada
| |
Collapse
|
18
|
Johri P, Aquadro CF, Beaumont M, Charlesworth B, Excoffier L, Eyre-Walker A, Keightley PD, Lynch M, McVean G, Payseur BA, Pfeifer SP, Stephan W, Jensen JD. Recommendations for improving statistical inference in population genomics. PLoS Biol 2022; 20:e3001669. [PMID: 35639797 PMCID: PMC9154105 DOI: 10.1371/journal.pbio.3001669] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The field of population genomics has grown rapidly in response to the recent advent of affordable, large-scale sequencing technologies. As opposed to the situation during the majority of the 20th century, in which the development of theoretical and statistical population genetic insights outpaced the generation of data to which they could be applied, genomic data are now being produced at a far greater rate than they can be meaningfully analyzed and interpreted. With this wealth of data has come a tendency to focus on fitting specific (and often rather idiosyncratic) models to data, at the expense of a careful exploration of the range of possible underlying evolutionary processes. For example, the approach of directly investigating models of adaptive evolution in each newly sequenced population or species often neglects the fact that a thorough characterization of ubiquitous nonadaptive processes is a prerequisite for accurate inference. We here describe the perils of these tendencies, present our consensus views on current best practices in population genomic data analysis, and highlight areas of statistical inference and theory that are in need of further attention. Thereby, we argue for the importance of defining a biologically relevant baseline model tuned to the details of each new analysis, of skepticism and scrutiny in interpreting model fitting results, and of carefully defining addressable hypotheses and underlying uncertainties.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Charles F. Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, Bristol, United Kingdom
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Peter D. Keightley
- Institute of Ecology and Evolution, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Michael Lynch
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | - Gil McVean
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Bret A. Payseur
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Susanne P. Pfeifer
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| | | | - Jeffrey D. Jensen
- School of Life Sciences, Arizona State University, Tempe, Arizona, United States of America
| |
Collapse
|
19
|
Montinaro F, Pankratov V, Yelmen B, Pagani L, Mondal M. Revisiting the out of Africa event with a deep-learning approach. Am J Hum Genet 2021; 108:2037-2051. [PMID: 34626535 PMCID: PMC8595897 DOI: 10.1016/j.ajhg.2021.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 09/09/2021] [Indexed: 10/20/2022] Open
Abstract
Anatomically modern humans evolved around 300 thousand years ago in Africa. They started to appear in the fossil record outside of Africa as early as 100 thousand years ago, although other hominins existed throughout Eurasia much earlier. Recently, several studies argued in favor of a single out of Africa event for modern humans on the basis of whole-genome sequence analyses. However, the single out of Africa model is in contrast with some of the findings from fossil records, which support two out of Africa events, and uniparental data, which propose a back to Africa movement. Here, we used a deep-learning approach coupled with approximate Bayesian computation and sequential Monte Carlo to revisit these hypotheses from the whole-genome sequence perspective. Our results support the back to Africa model over other alternatives. We estimated that there are two sequential separations between Africa and out of African populations happening around 60-90 thousand years ago and separated by 13-15 thousand years. One of the populations resulting from the more recent split has replaced the older West African population to a large extent, while the other one has founded the out of Africa populations.
Collapse
Affiliation(s)
- Francesco Montinaro
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Department of Biology-Genetics, University of Bari, Bari 70124, Italy
| | - Vasili Pankratov
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia
| | - Burak Yelmen
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia; Université Paris-Saclay, CNRS UMR 9015, INRIA, Laboratoire Interdisciplinaire des Sciences du Numérique, 91400 Orsay, France
| | - Luca Pagani
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia; Department of Biology, University of Padova, Padova 35121, Italy
| | - Mayukh Mondal
- Institute of Genomics, University of Tartu, Tartu 51010, Estonia.
| |
Collapse
|
20
|
Teixeira H, Salmona J, Arredondo A, Mourato B, Manzi S, Rakotondravony R, Mazet O, Chikhi L, Metzger J, Radespiel U. Impact of model assumptions on demographic inferences: the case study of two sympatric mouse lemurs in northwestern Madagascar. BMC Ecol Evol 2021; 21:197. [PMID: 34727890 PMCID: PMC8561976 DOI: 10.1186/s12862-021-01929-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 10/18/2021] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Quaternary climate fluctuations have been acknowledged as major drivers of the geographical distribution of the extraordinary biodiversity observed in tropical biomes, including Madagascar. The main existing framework for Pleistocene Malagasy diversification assumes that forest cover was strongly shaped by warmer Interglacials (leading to forest expansion) and by cooler and arid glacials (leading to forest contraction), but predictions derived from this scenario for forest-dwelling animals have rarely been tested with genomic datasets. RESULTS We generated genomic data and applied three complementary demographic approaches (Stairway Plot, PSMC and IICR-simulations) to infer population size and connectivity changes for two forest-dependent primate species (Microcebus murinus and M. ravelobensis) in northwestern Madagascar. The analyses suggested major demographic changes in both species that could be interpreted in two ways, depending on underlying model assumptions (i.e., panmixia or population structure). Under panmixia, the two species exhibited larger population sizes across the Last Glacial Maximum (LGM) and towards the African Humid Period (AHP). This peak was followed by a population decline in M. ravelobensis until the present, while M. murinus may have experienced a second population expansion that was followed by a sharp decline starting 3000 years ago. In contrast, simulations under population structure suggested decreasing population connectivity between the Last Interglacial and the LGM for both species, but increased connectivity during the AHP exclusively for M. murinus. CONCLUSION Our study shows that closely related species may differ in their responses to climatic events. Assuming that Pleistocene climatic conditions in the lowlands were similar to those in the Malagasy highlands, some demographic dynamics would be better explained by changes in population connectivity than in population size. However, changes in connectivity alone cannot be easily reconciled with a founder effect that was shown for M. murinus during its colonization of the northwestern Madagascar in the late Pleistocene. To decide between the two alternative models, more knowledge about historic forest dynamics in lowland habitats is necessary. Altogether, our study stresses that demographic inferences strongly depend on the underlying model assumptions. Final conclusions should therefore be based on a comparative evaluation of multiple approaches.
Collapse
Affiliation(s)
- Helena Teixeira
- Institute of Zoology, University of Veterinary Medicine Hannover, Foundation, Bünteweg 17, 30559, Hannover, Germany.
| | - Jordi Salmona
- Laboratoire Évolution and Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS, 118 Route de Narbonne, Bât. 4R1, 31062, Toulouse cedex 9, France
| | - Armando Arredondo
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal
- Université de Toulouse, Institut National des Sciences Appliquées, Institut de Mathématiques de Toulouse, Toulouse, France
| | - Beatriz Mourato
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal
| | - Sophie Manzi
- Laboratoire Évolution and Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS, 118 Route de Narbonne, Bât. 4R1, 31062, Toulouse cedex 9, France
| | - Romule Rakotondravony
- Ecole Doctorale Ecosystèmes Naturels (EDEN), University of Mahajanga, 5 Rue Georges V - Immeuble KAKAL, Mahajanga Be, B.P. 652, 401, Mahajanga, Madagascar
- Faculté des Sciences, de Technologies et de l'Environnement, University of Mahajanga, 5 Rue Georges V - Immeuble KAKAL, Mahajanga Be, B.P. 652, 401, Mahajanga, Madagascar
| | - Olivier Mazet
- Université de Toulouse, Institut National des Sciences Appliquées, Institut de Mathématiques de Toulouse, Toulouse, France
| | - Lounès Chikhi
- Laboratoire Évolution and Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS, 118 Route de Narbonne, Bât. 4R1, 31062, Toulouse cedex 9, France
- Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, 2780-156, Oeiras, Portugal
| | - Julia Metzger
- Institute of Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Foundation, Bünteweg 17p, 30559, Hannover, Germany
- Veterinary Functional Genomics, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195, Berlin, Germany
| | - Ute Radespiel
- Institute of Zoology, University of Veterinary Medicine Hannover, Foundation, Bünteweg 17, 30559, Hannover, Germany.
| |
Collapse
|
21
|
Barley AJ, Cordes JE, Walker JM, Thomson RC. Genetic diversity and the origins of parthenogenesis in the teiid lizard Aspidoscelis laredoensis. Mol Ecol 2021; 31:266-278. [PMID: 34614250 DOI: 10.1111/mec.16213] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 09/14/2021] [Accepted: 09/30/2021] [Indexed: 01/25/2023]
Abstract
Unisexual vertebrates typically form through hybridization events between sexual species in which reproductive mode transitions occur in the hybrid offspring. This evolutionary history is thought to have important consequences for the ecology of unisexual lineages and their interactions with congeners in natural communities. However, these consequences have proven challenging to study owing to uncertainty about patterns of population genetic diversity in unisexual lineages. Of particular interest is resolving the contribution of historical hybridization events versus post formational mutation to patterns of genetic diversity in nature. Here we use restriction site associated DNA genotyping to evaluate genetic diversity and demographic history in Aspidoscelis laredoensis, a diploid unisexual lizard species from the vicinity of the Rio Grande River in southern Texas and northern Mexico. The sexual progenitor species from which one or more lineages are derived also occur in the Rio Grande Valley region, although patterns of distribution across individual sites are quite variable. Results from population genetic and phylogenetic analyses resolved the major axes of genetic variation in this species and highlight how these match predictions based on historical patterns of hybridization. We also found discordance between results of demographic modelling using different statistical approaches with the genomic data. We discuss these insights within the context of the ecological and evolutionary mechanisms that generate and maintain lineage diversity in unisexual species. As one of the most dynamic, intriguing, and geographically well investigated groups of whiptail lizards, these species hold substantial promise for future studies on the constraints of diversification in unisexual vertebrates.
Collapse
Affiliation(s)
- Anthony J Barley
- School of Life Sciences, University of Hawai'i, Honolulu, Hawai'I, USA
| | - James E Cordes
- Division of Sciences and Mathematics, Louisiana State University Eunice, Eunice, Louisiana, USA
| | - James M Walker
- Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas, USA
| | - Robert C Thomson
- School of Life Sciences, University of Hawai'i, Honolulu, Hawai'I, USA
| |
Collapse
|
22
|
Cleary AC, Hoffman JI, Forcada J, Lydersen C, Lowther AD, Kovacs KM. 50,000 years of ice and seals: Impacts of the Last Glacial Maximum on Antarctic fur seals. Ecol Evol 2021; 11:14003-14011. [PMID: 34707834 PMCID: PMC8525082 DOI: 10.1002/ece3.8104] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 08/22/2021] [Accepted: 08/26/2021] [Indexed: 11/11/2022] Open
Abstract
Ice is one of the most important drivers of population dynamics in polar organisms, influencing the locations, sizes, and connectivity of populations. Antarctic fur seals, Arctocephalus gazella, are particularly interesting in this regard, as they are concomitantly reliant on both ice-associated prey and ice-free coastal breeding areas. We reconstructed the history of this species through the Last Glacial Maximum (LGM) using genomic sequence data from seals across their range. Population size trends and divergence events were investigated using continuous-time size estimation analysis and divergence time estimation models. The combined results indicated that a panmictic population present prior to the LGM split into two small refugial populations during peak ice extent. Following ice decline, the western refugial population founded colonies at the South Shetlands, South Georgia, and Bouvetøya, while the eastern refugial population founded the colony on Iles Kerguelen. Postglacial population divergence times closely match geological estimates of when these coastal breeding areas became ice free. Given the predictions regarding continued future warming in polar oceans, these responses of Antarctic fur seals to past climate variation suggest it may be worthwhile giving conservation consideration to potential future breeding locations, such as areas further south along the Antarctic Peninsula, in addition to present colony areas.
Collapse
Affiliation(s)
- Alison C. Cleary
- Department of Natural SciencesUniversity of AgderKristiansandNorway
- Norwegian Polar InstituteFram CentreTromsøNorway
| | - Joseph I. Hoffman
- Department of Animal BehaviourUniversity of BielefeldBielefeldGermany
| | | | | | | | | |
Collapse
|
23
|
Teixeira H, Montade V, Salmona J, Metzger J, Bremond L, Kasper T, Daut G, Rouland S, Ranarilalatiana S, Rakotondravony R, Chikhi L, Behling H, Radespiel U. Past environmental changes affected lemur population dynamics prior to human impact in Madagascar. Commun Biol 2021; 4:1084. [PMID: 34526636 PMCID: PMC8443640 DOI: 10.1038/s42003-021-02620-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Accepted: 08/31/2021] [Indexed: 02/08/2023] Open
Abstract
Quaternary climatic changes have been invoked as important drivers of species diversification worldwide. However, the impact of such changes on vegetation and animal population dynamics in tropical regions remains debated. To overcome this uncertainty, we integrated high-resolution paleoenvironmental reconstructions from a sedimentary record covering the past 25,000 years with demographic inferences of a forest-dwelling primate species (Microcebus arnholdi), in northern Madagascar. Result comparisons suggest that climate changes through the African Humid Period (15.2 - 5.5 kyr) strongly affected the demographic dynamics of M. arnholdi. We further inferred a population decline in the last millennium which was likely shaped by the combination of climatic and anthropogenic impacts. Our findings demonstrate that population fluctuations in Malagasy wildlife were substantial prior to a significant human impact. This provides a critical knowledge of climatically driven, environmental and ecological changes in the past, which is essential to better understand the dynamics and resilience of current biodiversity.
Collapse
Affiliation(s)
- Helena Teixeira
- grid.412970.90000 0001 0126 6191Institute of Zoology, University of Veterinary Medicine Hannover, Bünteweg 17, 30559 Hannover, Germany
| | - Vincent Montade
- grid.7450.60000 0001 2364 4210University of Goettingen, Department of Palynology and Climate Dynamics, Untere Karspüle 2, 37073 Goettingen, Germany ,grid.462058.d0000 0001 2188 7059ISEM, Université Montpellier, CNRS, IRD, EPHE, Place Eugène Bataillon, Montpellier, France
| | - Jordi Salmona
- grid.15781.3a0000 0001 0723 035XCNRS-UPS-IRD, UMR5174, Laboratoire Évolution & Diversité Biologique, Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse, France
| | - Julia Metzger
- grid.412970.90000 0001 0126 6191Institute of Animal Breeding and Genetics, University of Veterinary Medicine Hannover, Bünteweg 17p, 30559 Hannover, Germany ,grid.419538.20000 0000 9071 0620Veterinary Functional Genomics, Max Planck Institute for Molecular Genetics, Ihnestrasse 73, 14195 Berlin, Germany
| | - Laurent Bremond
- grid.462058.d0000 0001 2188 7059ISEM, Université Montpellier, CNRS, IRD, EPHE, Place Eugène Bataillon, Montpellier, France
| | - Thomas Kasper
- grid.9613.d0000 0001 1939 2794Friedrich-Schiller-University Jena, Department of Physical Geography, Loebdergraben 32, 07743 Jena, Germany
| | - Gerhard Daut
- grid.9613.d0000 0001 1939 2794Friedrich-Schiller-University Jena, Department of Physical Geography, Loebdergraben 32, 07743 Jena, Germany
| | - Sylvie Rouland
- grid.462058.d0000 0001 2188 7059ISEM, Université Montpellier, CNRS, IRD, EPHE, Place Eugène Bataillon, Montpellier, France
| | - Sandratrinirainy Ranarilalatiana
- grid.440419.c0000 0001 2165 5629Université d’Antananarivo, Faculté des Sciences, Mention Biologie et Ecologie Végétale, Laboratoire de Palynologie Appliquée, B.P 905 - 101, Antananarivo, Madagascar
| | - Romule Rakotondravony
- Ecole Doctorale Ecosystèmes Naturels (EDEN), University of Mahajanga, 5 Rue Georges V - Immeuble KAKAL, Mahajanga Be, B.P. 652, Mahajanga, 401 Madagascar ,Faculté des Sciences, de Technologies et de l’Environnement, University of Mahajanga, 5 Rue Georges V - Immeuble KAKAL, Mahajanga Be, B.P. 652, Mahajanga, 401 Madagascar
| | - Lounès Chikhi
- grid.418346.c0000 0001 2191 3202Instituto Gulbenkian de Ciência, Rua da Quinta Grande, 6, P-2780-156 Oeiras, Portugal ,grid.4399.70000000122879528Laboratoire Évolution & Diversité Biologique (EDB UMR 5174), Université de Toulouse Midi-Pyrénées, CNRS, IRD, UPS, 118 route de Narbonne, Bât. 4R1, 31062 Toulouse cedex 9, France
| | - Hermann Behling
- grid.7450.60000 0001 2364 4210University of Goettingen, Department of Palynology and Climate Dynamics, Untere Karspüle 2, 37073 Goettingen, Germany
| | - Ute Radespiel
- grid.412970.90000 0001 0126 6191Institute of Zoology, University of Veterinary Medicine Hannover, Bünteweg 17, 30559 Hannover, Germany
| |
Collapse
|
24
|
Daron J, Bravo IG. Variability in Codon Usage in Coronaviruses Is Mainly Driven by Mutational Bias and Selective Constraints on CpG Dinucleotide. Viruses 2021; 13:v13091800. [PMID: 34578381 PMCID: PMC8473333 DOI: 10.3390/v13091800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/18/2022] Open
Abstract
The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.
Collapse
Affiliation(s)
- Josquin Daron
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Correspondence:
| | - Ignacio G. Bravo
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France;
- Center for Research on the Ecology and Evolution of Diseases (CREES), 34394 Montpellier, France
| |
Collapse
|
25
|
Lesturgie P, Planes S, Mona S. Coalescence times, life history traits and conservation concerns: An example from four coastal shark species from the Indo-Pacific. Mol Ecol Resour 2021; 22:554-566. [PMID: 34407294 DOI: 10.1111/1755-0998.13487] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Revised: 07/27/2021] [Accepted: 08/12/2021] [Indexed: 11/30/2022]
Abstract
Dispersal abilities play a crucial role in shaping the extent of population genetic structure, with more mobile species being panmictic over large geographical ranges and less mobile ones organized in metapopulations exchanging migrants to different degrees. In turn, population structure directly influences the coalescence pattern of the sampled lineages, but the consequences on the estimated variation of the effective population size (Ne ) over time obtained by means of unstructured demographic models remain poorly understood. However, this knowledge is crucial for biologically interpreting the observed Ne trajectory and further devising conservation strategies in endangered species. Here we investigated the demographic history of four shark species (Carharhinus melanopterus, Carharhinus limbatus, Carharhinus amblyrhynchos, Galeocerdo cuvier) with different degrees of endangered status and life history traits related to dispersal distributed in the Indo-Pacific and sampled off New Caledonia. We compared several evolutionary scenarios representing both structured (metapopulation) and unstructured models and then inferred the Ne variation through time. By performing extensive coalescent simulations, we provided a general framework relating the underlying population structure and the observed Ne dynamics. On this basis, we concluded that the recent decline observed in three out of the four considered species when assuming unstructured demographic models can be explained by the presence of population structure. Furthermore, we also demonstrated the limits of the inferences based on the sole site frequency spectrum and warn that statistics based on linkage disequilibrium will be needed to exclude recent demographic events affecting meta-populations.
Collapse
Affiliation(s)
- Pierre Lesturgie
- Institut de Systématique, Evolution, Biodiversité, ISYEB (UMR 7205), Muséum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France
| | - Serge Planes
- PSL Research University: EPHE-UPVD-CNRS, USR 3278 CRIOBE, Université de Perpignan, Perpignan, France.,Laboratoire d'Excellence CORAIL, Papetoai, French Polynesia
| | - Stefano Mona
- Institut de Systématique, Evolution, Biodiversité, ISYEB (UMR 7205), Muséum National d'Histoire Naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France.,Laboratoire d'Excellence CORAIL, Papetoai, French Polynesia.,EPHE, PSL Research University, Paris, France
| |
Collapse
|
26
|
Clark RD, Aardema ML, Andolfatto P, Barber PH, Hattori A, Hoey JA, Montes HR, Pinsky ML. Genomic signatures of spatially divergent selection at clownfish range margins. Proc Biol Sci 2021; 288:20210407. [PMID: 34102891 PMCID: PMC8187997 DOI: 10.1098/rspb.2021.0407] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 05/11/2021] [Indexed: 01/25/2023] Open
Abstract
Understanding how evolutionary forces interact to drive patterns of selection and distribute genetic variation across a species' range is of great interest in ecology and evolution, especially in an era of global change. While theory predicts how and when populations at range margins are likely to undergo local adaptation, empirical evidence testing these models remains sparse. Here, we address this knowledge gap by investigating the relationship between selection, gene flow and genetic drift in the yellowtail clownfish, Amphiprion clarkii, from the core to the northern periphery of the species range. Analyses reveal low genetic diversity at the range edge, gene flow from the core to the edge and genomic signatures of local adaptation at 56 single nucleotide polymorphisms in 25 candidate genes, most of which are significantly correlated with minimum annual sea surface temperature. Several of these candidate genes play a role in functions that are upregulated during cold stress, including protein turnover, metabolism and translation. Our results illustrate how spatially divergent selection spanning the range core to the periphery can occur despite the potential for strong genetic drift at the range edge and moderate gene flow from the core populations.
Collapse
Affiliation(s)
- René D. Clark
- Department of Ecology, Evolution and Natural Resources, Rutgers University, 14 College Farm Road, New Brunswick, NJ 08901, USA
| | - Matthew L. Aardema
- Department of Biology, Montclair State University, 1 Normal Avenue, Montclair, NJ 07043, USA
- Sackler Institute for Comparative Genomics, American Museum of Natural History, 200 Central Park West, New York, NY 10024-5102, USA
| | - Peter Andolfatto
- Department of Biological Sciences, Columbia University, New York, NY 10026, USA
| | - Paul H. Barber
- Department of Ecology and Evolutionary Biology, University of California-Los Angeles, Los Angeles, CA 90095, USA
| | - Akihisa Hattori
- Faculty of Liberal Arts and Education, Shiga University, 2-5-1 Hiratsu, Otsu, Shiga 520-0862, Japan
| | - Jennifer A. Hoey
- Department of Ecology, Evolution and Natural Resources, Rutgers University, 14 College Farm Road, New Brunswick, NJ 08901, USA
- Department of Ecology and Evolutionary Biology, University of California-Santa Cruz, 130 McAllister Way, Santa Cruz, CA 95060, USA
| | | | - Malin L. Pinsky
- Department of Ecology, Evolution and Natural Resources, Rutgers University, 14 College Farm Road, New Brunswick, NJ 08901, USA
| |
Collapse
|
27
|
Bergeron LA, Besenbacher S, Bakker J, Zheng J, Li P, Pacheco G, Sinding MHS, Kamilari M, Gilbert MTP, Schierup MH, Zhang G. The germline mutational process in rhesus macaque and its implications for phylogenetic dating. Gigascience 2021; 10:giab029. [PMID: 33954793 PMCID: PMC8099771 DOI: 10.1093/gigascience/giab029] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 01/05/2021] [Accepted: 03/29/2021] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Understanding the rate and pattern of germline mutations is of fundamental importance for understanding evolutionary processes. RESULTS Here we analyzed 19 parent-offspring trios of rhesus macaques (Macaca mulatta) at high sequencing coverage of ∼76× per individual and estimated a mean rate of 0.77 × 10-8de novo mutations per site per generation (95% CI: 0.69 × 10-8 to 0.85 × 10-8). By phasing 50% of the mutations to parental origins, we found that the mutation rate is positively correlated with the paternal age. The paternal lineage contributed a mean of 81% of the de novo mutations, with a trend of an increasing male contribution for older fathers. Approximately 3.5% of de novo mutations were shared between siblings, with no parental bias, suggesting that they arose from early development (postzygotic) stages. Finally, the divergence times between closely related primates calculated on the basis of the yearly mutation rate of rhesus macaque generally reconcile with divergence estimated with molecular clock methods, except for the Cercopithecoidea/Hominoidea molecular divergence dated at 58 Mya using our new estimate of the yearly mutation rate. CONCLUSIONS When compared to the traditional molecular clock methods, new estimated rates from pedigree samples can provide insights into the evolution of well-studied groups such as primates.
Collapse
Affiliation(s)
- Lucie A Bergeron
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - Søren Besenbacher
- Department of Molecular Medicine, Aarhus University, Brendstrupgårdsvej 21A, 8200 Aarhus N, Denmark
| | - Jaco Bakker
- Animal Science Department, Biomedical Primate Research Centre, Lange Kleiweg 161, 2288 GJ Rijswijk, Netherlands
| | - Jiao Zheng
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, Guangdong, China
| | - Panyi Li
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
| | - George Pacheco
- Section for Evolutionary Genomics, The GLOBE Institute, University of Copenhagen, Oester Voldgade 5-7, 1350 Copenhagen K, Denmark
| | - Mikkel-Holger S Sinding
- Department of genetics, Trinity College Dublin, 2 college green, D02 VF25, Dublin, Ireland
- Greenland Institute of Natural Resources, Kivioq 2, 3900 Nuuk, Greenland
| | - Maria Kamilari
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
| | - M Thomas P Gilbert
- Section for Evolutionary Genomics, The GLOBE Institute, University of Copenhagen, Oester Voldgade 5-7, 1350 Copenhagen K, Denmark
- Department of Natural History, NTNU University Museum, Norwegian University of Science and Technology (NTNU), NO-7491 Trondheim, Norway
| | - Mikkel H Schierup
- Bioinformatics Research Centre, Aarhus University, C.F.Møllers Allé 8, 8000, Aarhus C, Denmark
| | - Guojie Zhang
- Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen Ø, Denmark
- BGI-Shenzhen, Shenzhen 518083, Guangdong, China
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
| |
Collapse
|
28
|
Arredondo A, Mourato B, Nguyen K, Boitard S, Rodríguez W, Noûs C, Mazet O, Chikhi L. Inferring number of populations and changes in connectivity under the n-island model. Heredity (Edinb) 2021; 126:896-912. [PMID: 33846579 PMCID: PMC8178352 DOI: 10.1038/s41437-021-00426-9] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 03/11/2021] [Accepted: 03/12/2021] [Indexed: 12/11/2022] Open
Abstract
Inferring the demographic history of species is one of the greatest challenges in populations genetics. This history is often represented as a history of size changes, ignoring population structure. Alternatively, when structure is assumed, it is defined a priori as a population tree and not inferred. Here we propose a framework based on the IICR (Inverse Instantaneous Coalescence Rate). The IICR can be estimated for a single diploid individual using the PSMC method of Li and Durbin (2011). For an isolated panmictic population, the IICR matches the population size history, and this is how the PSMC outputs are generally interpreted. However, it is increasingly acknowledged that the IICR is a function of the demographic model and sampling scheme with limited connection to population size changes. Our method fits observed IICR curves of diploid individuals with IICR curves obtained under piecewise stationary symmetrical island models. In our models we assume a fixed number of time periods during which gene flow is constant, but gene flow is allowed to change between time periods. We infer the number of islands, their sizes, the periods at which connectivity changes and the corresponding rates of connectivity. Validation with simulated data showed that the method can accurately recover most of the scenario parameters. Our application to a set of five human PSMCs yielded demographic histories that are in agreement with previous studies using similar methods and with recent research suggesting ancient human structure. They are in contrast with the view of human evolution consisting of one ancestral population branching into three large continental and panmictic populations with varying degrees of connectivity and no population structure within each continent.
Collapse
Affiliation(s)
- Armando Arredondo
- Université de Toulouse, Institut National des Sciences Appliquées, Institut de Mathématiques de Toulouse, Toulouse, France. .,Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse, Toulouse, France.
| | - Beatriz Mourato
- Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse, Toulouse, France.,Instituto Gulbenkian de Ciência, Oeiras, Portugal
| | - Khoa Nguyen
- Université de Toulouse, Institut National des Sciences Appliquées, Institut de Mathématiques de Toulouse, Toulouse, France
| | - Simon Boitard
- CBGP, Université de Montpellier, CIRAD, INRAE, Institut Agro, IRD, Montpellier, France
| | - Willy Rodríguez
- Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse, Toulouse, France.,ENAC - Ecole Nationale de l'Aviation Civile, Université de Toulouse, Toulouse, France
| | | | - Olivier Mazet
- Université de Toulouse, Institut National des Sciences Appliquées, Institut de Mathématiques de Toulouse, Toulouse, France.,Institut de Mathématiques de Toulouse; UMR5219. Université de Toulouse, Toulouse, France
| | - Lounès Chikhi
- Instituto Gulbenkian de Ciência, Oeiras, Portugal. .,Laboratoire Évolution & Diversité Biologique (EDB UMR 5174), CNRS, IRD, UPS, Université de Toulouse Midi-Pyrénées, Toulouse, France.
| |
Collapse
|
29
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
30
|
Imai R, Tsuda Y, Ebihara A, Matsumoto S, Tezuka A, Nagano AJ, Ootsuki R, Watano Y. Mating system evolution and genetic structure of diploid sexual populations of Cyrtomium falcatum in Japan. Sci Rep 2021; 11:3124. [PMID: 33542454 PMCID: PMC7862634 DOI: 10.1038/s41598-021-82731-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Accepted: 01/12/2021] [Indexed: 11/09/2022] Open
Abstract
Evolution of mating systems has become one of the most important research areas in evolutionary biology. Cyrtomium falcatum is a homosporous fern species native to eastern Asia. Two subspecies belonging to a sexual diploid race of C. falcatum are recognized: subsp. littorale and subsp. australe. Subspecies littorale shows intermediate selfing rates, while subsp. australe is an obligate outcrosser. We aimed to evaluate the process of mating system evolution and divergence for the two subspecies using restriction site associated DNA sequencing (RAD-seq). The results showed that subsp. littorale had lower genetic diversity and stronger genetic drift than subsp. australe. Fluctuations in the effective population size over time were evaluated by extended Bayesian skyline plot and Stairway plot analyses, both of which revealed a severe population bottleneck about 20,000 years ago in subsp. littorale. This bottleneck and the subsequent range expansion after the LGM appear to have played an important role in the divergence of the two subspecies and the evolution of selfing in subsp. littorale. These results shed new light on the relationship between mating system evolution and past demographic change in fern species.
Collapse
Affiliation(s)
- Ryosuke Imai
- Sugadaira Research Station, Mountain Science Center, University of Tsukuba, Sugadaira, Ueda , Nagano, 386-2204, Japan.
| | - Yoshiaki Tsuda
- Sugadaira Research Station, Mountain Science Center, University of Tsukuba, Sugadaira, Ueda , Nagano, 386-2204, Japan
| | - Atsushi Ebihara
- Department of Botany, National Museum of Nature and Science, Tsukuba, Ibaraki, 305-0005, Japan
| | - Sadamu Matsumoto
- Department of Botany, National Museum of Nature and Science, Tsukuba, Ibaraki, 305-0005, Japan
| | - Ayumi Tezuka
- Faculty of Agriculture, Ryukoku University, Otsu, Shiga, 520-2194, Japan
| | - Atsushi J Nagano
- Faculty of Agriculture, Ryukoku University, Otsu, Shiga, 520-2194, Japan
| | - Ryo Ootsuki
- Department of Natural Sciences, Faculty of Arts and Sciences, Komazawa University, 1-23-1 Komazawa, Setagaya-ku, Tokyo, 154-8525, Japan
| | - Yasuyuki Watano
- Department of Biology, Graduate School of Science, Chiba University, Inage, Chiba, Chiba, 263-8522, Japan
| |
Collapse
|
31
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The impact of purifying and background selection on the inference of population history: problems and prospects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 33501439 PMCID: PMC7836109 DOI: 10.1101/2020.04.28.066365] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Current procedures for inferring population history generally assume complete neutrality - that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects (DFE) and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the DFE as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne 3012, Switzerland.,Swiss Institute of Bioinformatics, Lausanne 1015, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
32
|
Jackson FLC. So many Nigerians: why is Nigeria overrepresented as the ancestral genetic homeland of Legacy African North Americans? Am J Hum Genet 2021; 108:202-208. [PMID: 33321100 PMCID: PMC7820629 DOI: 10.1016/j.ajhg.2020.10.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The genetics of African North Americans are complex amalgamations of various West and Central African peoples with modest gene flow from specific European and Amerindian peoples. A comprehensive understanding of African North American biohistory is a prerequisite for accurate interpretations of the ancestral genetics of this population. Too often, genetic interpretations falter with ahistorical reconstructions. The recently reported overrepresentation of Nigerian lineages in African North Americans reflects pronounced limitations in the African genomic database, the artificiality of the colonial maps of Africa, the contributions of multiple African empires and kingdoms into the transatlantic trade in enslaved Africans, and the overrepresentation of Yoruba peoples in the existing limited representation of West Africans in public genomic databases. This Matters Arising paper is in response to Micheletti et al. (2020), published in The American Journal of Human Genetics. See also the response by Micheletti et al. (2020), published in this issue.
Collapse
|
33
|
Liu X, Fu YX. Stairway Plot 2: demographic history inference with folded SNP frequency spectra. Genome Biol 2020; 21:280. [PMID: 33203475 PMCID: PMC7670622 DOI: 10.1186/s13059-020-02196-9] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 11/05/2020] [Indexed: 01/27/2023] Open
Abstract
Inferring the demographic histories of populations has wide applications in population, ecological, and conservation genomics. We present Stairway Plot 2, a cross-platform program package for this task using SNP frequency spectra. It is based on a nonparametric method with the capability of handling folded SNP frequency spectra (that is, when the ancestral alleles of the SNPs are unknown) of thousands of samples produced with genotyping-by-sequencing technologies; therefore, it is particularly suitable for nonmodel organisms.
Collapse
Affiliation(s)
- Xiaoming Liu
- USF Genomics & College of Public Health, University of South Florida, Tampa, FL, USA.
| | - Yun-Xin Fu
- Department of Biostatistics and Data Science, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
| |
Collapse
|
34
|
Ghirotto S, Vizzari MT, Tassi F, Barbujani G, Benazzo A. Distinguishing among complex evolutionary models using unphased whole-genome data through random forest approximate Bayesian computation. Mol Ecol Resour 2020; 21:2614-2628. [PMID: 33000507 DOI: 10.1111/1755-0998.13263] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 08/28/2020] [Accepted: 09/07/2020] [Indexed: 01/25/2023]
Abstract
Inferring past demographic histories is crucial in population genetics, and the amount of complete genomes now available should in principle facilitate this inference. In practice, however, the available inferential methods suffer from severe limitations. Although hundreds complete genomes can be simultaneously analysed, complex demographic processes can easily exceed computational constraints, and the procedures to evaluate the reliability of the estimates contribute to increase the computational effort. Here we present an approximate Bayesian computation framework based on the random forest algorithm (ABC-RF), to infer complex past population processes using complete genomes. To this aim, we propose to summarize the data by the full genomic distribution of the four mutually exclusive categories of segregating sites (FDSS), a statistic fast to compute from unphased genome data and that does not require the ancestral state of alleles to be known. We constructed an efficient ABC pipeline and tested how accurately it allows one to recognize the true model among models of increasing complexity, using simulated data and taking into account different sampling strategies in terms of number of individuals analysed, number and size of the genetic loci considered. We also compared the FDSS with the unfolded and folded site frequency spectrum (SFS), and for these statistics we highlighted the experimental conditions maximizing the inferential power of the ABC-RF procedure. We finally analysed real data sets, testing models on the dispersal of anatomically modern humans out of Africa and exploring the evolutionary relationships of the three species of Orangutan inhabiting Borneo and Sumatra.
Collapse
Affiliation(s)
- Silvia Ghirotto
- Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy
| | - Maria Teresa Vizzari
- Department of Mathematics and Computer Science, University of Ferrara, Ferrara, Italy
| | - Francesca Tassi
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| | - Guido Barbujani
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| | - Andrea Benazzo
- Department of Life Sciences and Biotechnology, University of Ferrara, Ferrara, Italy
| |
Collapse
|
35
|
Marchi N, Excoffier L. Gene flow as a simple cause for an excess of high-frequency-derived alleles. Evol Appl 2020; 13:2254-2263. [PMID: 33005222 PMCID: PMC7513730 DOI: 10.1111/eva.12998] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Revised: 04/30/2020] [Accepted: 05/04/2020] [Indexed: 01/19/2023] Open
Abstract
Most human populations exhibit an excess of high-frequency variants, leading to a U-shaped site-frequency spectrum (uSFS). This pattern has been generally interpreted as a signature of ongoing episodes of positive selection, or as evidence for a mis-assignment of ancestral/derived allelic states, but uSFS has also been observed in populations receiving gene flow from a ghost population, in structured populations, or after range expansions. In order to better explain the prevalence of high-frequency variants in humans and other populations, we describe here which patterns of gene flow and population demography can lead to uSFS by using extensive coalescent simulations. We find that uSFS can often be observed in a population if gene flow brings a few ancestral alleles from a well-differentiated population. Gene flow can either consist in single pulses of admixture or continuous immigration, but different demographic conditions are necessary to observe uSFS in these two scenarios. Indeed, an extremely low and recent gene flow is required in the case of single admixture events, while with continuous immigration, uSFS occurs only if gene flow started recently at a high rate or if it lasted for a long time at a low rate. Overall, we find that a neutral uSFS occurs under more restrictive conditions in populations having received single pulses of gene flow than in populations exposed to continuous gene flow. We also show that the uSFS observed in human populations from the 1000 Genomes Project can easily be explained by gene flow from surrounding populations without requiring past episodes of positive selection. These results imply that uSFS should be common in non-isolated populations, such as most wild or domesticated plants and animals.
Collapse
Affiliation(s)
- Nina Marchi
- CMPGInstitute of Ecology and EvolutionUniversity of BerneBerneSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
| | - Laurent Excoffier
- CMPGInstitute of Ecology and EvolutionUniversity of BerneBerneSwitzerland
- Swiss Institute of BioinformaticsLausanneSwitzerland
| |
Collapse
|
36
|
Liu D, Zhang L, Wang J, Ma Y. Conservation Genomics of a Threatened Rhododendron: Contrasting Patterns of Population Structure Revealed From Neutral and Selected SNPs. Front Genet 2020; 11:757. [PMID: 33101354 PMCID: PMC7500208 DOI: 10.3389/fgene.2020.00757] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 06/25/2020] [Indexed: 12/13/2022] Open
Abstract
Though it is well-acknowledged that next generation sequencing (NGS) technologies can provide further insights into plant conservation management than traditional molecular markers, studies employing NGS to address conservation genomics and subsequent conservation strategies for threatened plants are still rare. Rhododendron is the largest genus of woody plants in China, and many species are threatened, however, to date there has been no conservation genetic research using NGS in this genus. In the present study, we investigated the conservation genetics of R. cyanocarpum, a threatened species endemic to the Cangshan Mountains in Yunnan, China, using a double digest restriction-site-associated DNA-sequencing (ddRAD-seq) approach. Due to the availability of sufficient SNPs, we were able to distinguish between neutral and putatively selected SNPs and were able to further investigate the genetic diversity, population structure, and differentiation in R. cyanocarpum, as well as make an estimation of its demographic history. A total of 6,584 SNPs were obtained, of which 5,729 were neutral (detected using Tajima’s D). In terms of the 5,729 neutral SNPs, R. cyanocarpum had a higher genetic diversity (π = 0.0702 ± 0.0017, He = 0.0675 ± 0.0016) than other plant species assessed using Rad-seq methods, while population differentiation (Fst from 0.0314 to 0.0452) was weak. Interestingly, contrasting patterns of population structure were revealed from all neutral and selected SNPs, with distinct genetic clusters forming for all SNPs and neutral SNPs, but no distinct subgroups for selected ones. Moreover, we were able to detect changes in effective population size (Ne) of R. cyanocarpum from 150,000 years ago, including a bottleneck event ca. 60,000 years ago, followed by recovery of Ne over a short period, and a subsequent gradual decline in Ne to date. Implications for conserving R. cyanocarpum based on these main results are then discussed.
Collapse
Affiliation(s)
- Detuan Liu
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Lu Zhang
- Flower Research Institute, Yunnan Academy of Agriculture Sciences, Kunming, China
| | - Jihua Wang
- Flower Research Institute, Yunnan Academy of Agriculture Sciences, Kunming, China
| | - Yongpeng Ma
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Populations, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
37
|
Pinzari CA, Kang L, Michalak P, Jermiin LS, Price DK, Bonaccorso FJ. Analysis of Genomic Sequence Data Reveals the Origin and Evolutionary Separation of Hawaiian Hoary Bat Populations. Genome Biol Evol 2020; 12:1504-1514. [PMID: 32853363 PMCID: PMC7543519 DOI: 10.1093/gbe/evaa137] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
We examine the genetic history and population status of Hawaiian hoary bats (Lasiurus semotus), the most isolated bats on Earth, and their relationship to northern hoary bats (Lasiurus cinereus), through whole-genome analysis of single-nucleotide polymorphisms mapped to a de novo-assembled reference genome. Profiles of genomic diversity and divergence indicate that Hawaiian hoary bats are distinct from northern hoary bats, and form a monophyletic group, indicating a single ancestral colonization event 1.34 Ma, followed by substantial divergence between islands beginning 0.51 Ma. Phylogenetic analysis indicates Maui is central to the radiation across the archipelago, with the southward expansion to Hawai'i and westward to O'ahu and Kaua'i. Because this endangered species is of conservation concern, a clearer understanding of the population genetic structure of this bat in the Hawaiian Islands is of timely importance.
Collapse
Affiliation(s)
| | - Lin Kang
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia
- Department of Biomedical Sciences and Pathobiology, Virginia-Maryland College of Veterinary Medicine, Blacksburg, Virginia
| | - Pawel Michalak
- Edward Via College of Osteopathic Medicine, Blacksburg, Virginia
- Center for One Health Research, Virginia-Maryland College of Veterinary Medicine, Blacksburg, Virginia
- Institute of Evolution, University of Haifa, Israel
| | - Lars S Jermiin
- Research School of Biology, Australian National University, Acton, Australian Capital Territory, Australia
- School of Biology & Environmental Science, University College Dublin, Ireland
- Earth Institute, University College Dublin, Ireland
| | - Donald K Price
- School of Life Sciences, University of Nevada, Las Vegas
| | - Frank J Bonaccorso
- U.S. Geological Survey, Pacific Island Ecosystems Research Center, Hawai‘i National Park, HI
| |
Collapse
|
38
|
Genetic Structure and Population Demography of White-Spotted Charr in the Upstream Watershed of a Large Dam. WATER 2020. [DOI: 10.3390/w12092406] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
White-spotted charr (Salvelinus leucomaenis leucomaenis) is an anadromous fish that has been severely harmed by human land-use development, particularly through habitat fragmentation. However, the anthropogenic impacts on populations of this species have not been evaluated, except those on small dammed-off populations. Using multiplexed ISSR genotyping by sequencing, we investigated the genetic structure of white-spotted charr in four tributaries in the upper section of the Kanayama Dam in the Sorachi River, Hokkaido Island, Japan. There were no distinct genetic structures (FST = 0.014), probably because some active individuals migrate frequently among tributaries. By model-flexible demographic simulation, historical changes in the effective population size were inferred. The result indicates that the population size has decreased since the end of the last glacial period, with three major population decline events, including recent declines that were probably associated with recent human activities. Nevertheless, populations in the watershed upstream of the Kanayama Dam are still expected to be at low risk of immediate extinction, owing to the large watershed size and the limited number of small check dams. An effective conservation measure for sustaining the white-spotted charr population is to maintain high connectivity between tributaries, such as by providing fishways in check dams during construction.
Collapse
|
39
|
Rougemont Q, Moore JS, Leroy T, Normandeau E, Rondeau EB, Withler RE, Van Doornik DM, Crane PA, Naish KA, Garza JC, Beacham TD, Koop BF, Bernatchez L. Demographic history shaped geographical patterns of deleterious mutation load in a broadly distributed Pacific Salmon. PLoS Genet 2020; 16:e1008348. [PMID: 32845885 PMCID: PMC7478589 DOI: 10.1371/journal.pgen.1008348] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 09/08/2020] [Accepted: 06/24/2020] [Indexed: 12/24/2022] Open
Abstract
A thorough reconstruction of historical processes is essential for a comprehensive understanding of the mechanisms shaping patterns of genetic diversity. Indeed, past and current conditions influencing effective population size have important evolutionary implications for the efficacy of selection, increased accumulation of deleterious mutations, and loss of adaptive potential. Here, we gather extensive genome-wide data that represent the extant diversity of the Coho salmon (Oncorhynchus kisutch) to address two objectives. We demonstrate that a single glacial refugium is the source of most of the present-day genetic diversity, with detectable inputs from a putative secondary micro-refugium. We found statistical support for a scenario whereby ancestral populations located south of the ice sheets expanded recently, swamping out most of the diversity from other putative micro-refugia. Demographic inferences revealed that genetic diversity was also affected by linked selection in large parts of the genome. Moreover, we demonstrate that the recent demographic history of this species generated regional differences in the load of deleterious mutations among populations, a finding that mirrors recent results from human populations and provides increased support for models of expansion load. We propose that insights from these historical inferences should be better integrated in conservation planning of wild organisms, which currently focuses largely on neutral genetic diversity and local adaptation, with the role of potentially maladaptive variation being generally ignored. Reconstruction of a species’ past demographic history from genetic data can highlight historical factors that have shaped the distribution of genetic diversity along its genome and its geographic range. Here, we combine genotyping-by-sequencing with demographic modelling to address these issues in the Coho salmon, a Pacific salmon of conservation concern in some parts of its range, notably in the south. Our demographic reconstructions reveal a linear decrease in genetic diversity toward the north of the species range, supporting the hypothesis of a northern route of postglacial recolonization from a single major southern refugium. As predicted by theory, we also observed a higher proportion of deleterious mutations in the most distant populations from this refugium. Beyond this general pattern, among-site variation in the proportion of deleterious mutations is consistent with different local trends in effective population sizes. Our results highlight the potential importance of understanding historical factors that have shaped geographic patterns of the distribution of deleterious mutations in order to implement effective management programs for the conservation of wild populations. Such fundamental knowledge of human historical demography is now having major impacts on health sciences, and we argue it is time to integrate such approaches in conservation science as well.
Collapse
Affiliation(s)
- Quentin Rougemont
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
- * E-mail:
| | - Jean-Sébastien Moore
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| | - Thibault Leroy
- ISEM, Univ. Montpellier, CNRS, EPHE, IRD, Montpellier, France
- Department of Botany & Biodiversity Research, University of Vienna, Vienna, Austria
| | - Eric Normandeau
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| | - Eric B. Rondeau
- Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
- Department of Biology, University of Victoria, Victoria, BC, Canada
| | - Ruth E. Withler
- Department of Fisheries and Ocean, Pacific Biological Station, Nanaimo, British Columbia, Canada
| | - Donald M. Van Doornik
- National Oceanic and Atmospheric Administration, National Marine Fisheries Service, Northwest Fisheries Science Center, Manchester Research Station, Port Orchard, Washington, United States of America
| | - Penelope A. Crane
- Conservation Genetics Laboratory, U.S. Fish and Wildlife Service, Anchorage, Alaska, United States of America
| | - Kerry A. Naish
- School of Aquatic and Fishery Sciences, University of Washington, Seattle, WA, United States of America
| | - John Carlos Garza
- Fisheries Ecology Division, Southwest Fisheries Science Center, National Marine Fisheries Service and Institute of Marine Sciences, University of California–Santa Cruz, Santa Cruz, California, United States of America
| | - Terry D. Beacham
- Department of Fisheries and Ocean, Pacific Biological Station, Nanaimo, British Columbia, Canada
| | - Ben F. Koop
- Centre for Biomedical Research, University of Victoria, Victoria, BC, Canada
- Department of Biology, University of Victoria, Victoria, BC, Canada
| | - Louis Bernatchez
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Québec, Canada
| |
Collapse
|
40
|
Kerdoncuff E, Lambert A, Achaz G. Testing for population decline using maximal linkage disequilibrium blocks. Theor Popul Biol 2020; 134:171-181. [DOI: 10.1016/j.tpb.2020.03.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2019] [Revised: 03/26/2020] [Accepted: 03/29/2020] [Indexed: 02/02/2023]
|
41
|
Stoltz M, Baeumer B, Bouckaert R, Fox C, Hiscott G, Bryant D. Bayesian Inference of Species Trees using Diffusion Models. Syst Biol 2020; 70:145-161. [PMID: 33005955 DOI: 10.1093/sysbio/syaa051] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2019] [Revised: 06/19/2020] [Accepted: 06/23/2020] [Indexed: 11/13/2022] Open
Abstract
We describe a new and computationally efficient Bayesian methodology for inferring species trees and demographics from unlinked binary markers. Likelihood calculations are carried out using diffusion models of allele frequency dynamics combined with novel numerical algorithms. The diffusion approach allows for analysis of data sets containing hundreds or thousands of individuals. The method, which we call Snapper, has been implemented as part of the BEAST2 package. We conducted simulation experiments to assess numerical error, computational requirements, and accuracy recovering known model parameters. A reanalysis of soybean SNP data demonstrates that the models implemented in Snapp and Snapper can be difficult to distinguish in practice, a characteristic which we tested with further simulations. We demonstrate the scale of analysis possible using a SNP data set sampled from 399 fresh water turtles in 41 populations. [Bayesian inference; diffusion models; multi-species coalescent; SNP data; species trees; spectral methods.].
Collapse
Affiliation(s)
- Marnus Stoltz
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
| | - Boris Baeumer
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
| | - Remco Bouckaert
- Centre for Computational Evolution, University of Auckland, Auckland 1142, New Zealand
| | - Colin Fox
- Department of Physics, University of Otago, Dunedin 9054, New Zealand
| | - Gordon Hiscott
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
| | - David Bryant
- Department of Mathematics and Statistics, University of Otago, Dunedin 9054, New Zealand
| |
Collapse
|
42
|
Delrieu-Trottin E, Hubert N, Giles EC, Chifflet-Belle P, Suwalski A, Neglia V, Rapu-Edmunds C, Mona S, Saenz-Agudelo P. Coping with Pleistocene climatic fluctuations: Demographic responses in remote endemic reef fishes. Mol Ecol 2020; 29:2218-2233. [PMID: 32428327 DOI: 10.1111/mec.15478] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2020] [Revised: 05/11/2020] [Accepted: 05/14/2020] [Indexed: 12/28/2022]
Abstract
Elucidating demographic history during the settlement of ecological communities is crucial for properly inferring the mechanisms that shape patterns of species diversity and their persistence through time. Here, we used genomic data and coalescent-based approaches to elucidate for the first time the demographic dynamics associated with the settlement by endemic reef fish fauna of one of the most remote peripheral islands of the Pacific Ocean, Rapa Nui (Easter Island). We compared the demographic history of nine endemic species in order to explore their demographic responses to Pleistocene climatic fluctuations. We found that species endemic to Rapa Nui share a common demographic history, as signatures of population expansions were retrieved for almost all of the species studied here, and synchronous demographic expansions initiated during the last glacial period were recovered for more than half of the studied species. These results suggest that eustatic fluctuations associated with Milankovitch cycles have played a central role in species demographic histories and in the final stage of the community assembly of many Rapa Nui reef fishes. Specifically, sea level lowstands resulted in the maximum reef habitat extension for Rapa Nui endemic species; we discuss the potential role of seamounts in allowing endemic species to cope with Pleistocene climatic fluctuations, and we highlight the importance of local historical processes over regional ones. Overall, our results shed light on the mechanisms by which endemism arises and is maintained in peripheral reef fish fauna.
Collapse
Affiliation(s)
- Erwan Delrieu-Trottin
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile.,Institut de Recherche pour le Développement, UMR 5554 (UM-CNRS-IRD-EPHE), ISEM, Montpellier, France.,Museum für Naturkunde, Leibniz-Institut für Evolutions-und Biodiversitätsforschung an der, Humboldt-Universität zu Berlin, Berlin, Germany
| | - Nicolas Hubert
- Institut de Recherche pour le Développement, UMR 5554 (UM-CNRS-IRD-EPHE), ISEM, Montpellier, France
| | - Emily C Giles
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile.,Programa de Doctorado en Ciencias mención Ecología y Evolución, Escuela de Graduados, Facultad de Ciencias, Universidad Austral de Chile, Valdivia, Chile
| | - Pascaline Chifflet-Belle
- Institut de Systématique, Evolution, Biodiversité, ISYEB, Muséum National d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France.,EPHE, PSL Research University, Paris, France
| | - Arnaud Suwalski
- Institut de Systématique, Evolution, Biodiversité, ISYEB, Muséum National d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France.,EPHE, PSL Research University, Paris, France
| | - Valentina Neglia
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
| | | | - Stefano Mona
- Institut de Systématique, Evolution, Biodiversité, ISYEB, Muséum National d'Histoire naturelle, CNRS, Sorbonne Université, EPHE, Université des Antilles, Paris, France.,EPHE, PSL Research University, Paris, France.,Laboratoire d'Excellence CORAIL, Papetoai, French Polynesia
| | - Pablo Saenz-Agudelo
- Instituto de Ciencias Ambientales y Evolutivas, Universidad Austral de Chile, Valdivia, Chile
| |
Collapse
|
43
|
Gauthier J, Pajkovic M, Neuenschwander S, Kaila L, Schmid S, Orlando L, Alvarez N. Museomics identifies genetic erosion in two butterfly species across the 20th century in Finland. Mol Ecol Resour 2020; 20:1191-1205. [PMID: 32304133 PMCID: PMC7540272 DOI: 10.1111/1755-0998.13167] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Revised: 03/06/2020] [Accepted: 03/30/2020] [Indexed: 12/12/2022]
Abstract
Erosion of biodiversity generated by anthropogenic activities has been studied for decades and in many areas at the species level, using taxa monitoring. In contrast, genetic erosion within species has rarely been tracked, and is often studied by inferring past population dynamics from contemporaneous estimators. An alternative to such inferences is the direct examination of past genes, by analysing museum collection specimens. While providing direct access to genetic variation over time, historical DNA is usually not optimally preserved, and it is necessary to apply genotyping methods based on hybridization-capture to unravel past genetic variation. In this study, we apply such a method (i.e., HyRAD), to large time series of two butterfly species in Finland, and present a new bioinformatic pipeline, namely PopHyRAD, that standardizes and optimizes the analysis of HyRAD data at the within-species level. In the localities for which the data retrieved have sufficient power to accurately examine genetic dynamics through time, we show that genetic erosion has increased across the last 100 years, as revealed by signatures of allele extinctions and heterozygosity decreases, despite local variations. In one of the two butterflies (Erebia embla), isolation by distance also increased through time, revealing the effect of greater habitat fragmentation over time.
Collapse
Affiliation(s)
| | - Mila Pajkovic
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Samuel Neuenschwander
- Vital-IT, Swiss Institute of Bioinformatics, University of Lausanne, Lausanne, Switzerland
| | - Lauri Kaila
- Zoology Unit, Finnish Museum of Natural History, University of Helsinki, Finland
| | - Sarah Schmid
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland.,Department of Computational Biology, University of Lausanne, Lausanne, Switzerland
| | - Ludovic Orlando
- Laboratoire AMIS CNRS UMR 5288, Faculté de Médecine de Purpan, Toulouse, France.,Globe Institut, Lundbeck Foundation GeoGenetics Centre, University of Copenhagen, Copenhagen, Denmark
| | - Nadir Alvarez
- Geneva Natural History Museum, Geneva, Switzerland.,Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| |
Collapse
|
44
|
Patton AH, Margres MJ, Stahlke AR, Hendricks S, Lewallen K, Hamede RK, Ruiz-Aravena M, Ryder O, McCallum HI, Jones ME, Hohenlohe PA, Storfer A. Contemporary Demographic Reconstruction Methods Are Robust to Genome Assembly Quality: A Case Study in Tasmanian Devils. Mol Biol Evol 2020; 36:2906-2921. [PMID: 31424552 PMCID: PMC6878949 DOI: 10.1093/molbev/msz191] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Reconstructing species’ demographic histories is a central focus of molecular ecology and evolution. Recently, an expanding suite of methods leveraging either the sequentially Markovian coalescent (SMC) or the site-frequency spectrum has been developed to reconstruct population size histories from genomic sequence data. However, few studies have investigated the robustness of these methods to genome assemblies of varying quality. In this study, we first present an improved genome assembly for the Tasmanian devil using the Chicago library method. Compared with the original reference genome, our new assembly reduces the number of scaffolds (from 35,975 to 10,010) and increases the scaffold N90 (from 0.101 to 2.164 Mb). Second, we assess the performance of four contemporary genomic methods for inferring population size history (PSMC, MSMC, SMC++, Stairway Plot), using the two devil genome assemblies as well as simulated, artificially fragmented genomes that approximate the hypothesized demographic history of Tasmanian devils. We demonstrate that each method is robust to assembly quality, producing similar estimates of Ne when simulated genomes were fragmented into up to 5,000 scaffolds. Overall, methods reliant on the SMC are most reliable between ∼300 generations before present (gbp) and 100 kgbp, whereas methods exclusively reliant on the site-frequency spectrum are most reliable between the present and 30 gbp. Our results suggest that when used in concert, genomic methods for reconstructing species’ effective population size histories 1) can be applied to nonmodel organisms without highly contiguous reference genomes, and 2) are capable of detecting independently documented effects of historical geological events.
Collapse
Affiliation(s)
- Austin H Patton
- School of Biological Sciences, Washington State University, Pullman, WA
| | - Mark J Margres
- School of Biological Sciences, Washington State University, Pullman, WA.,Department of Organismic and Evolutionary Biology, Harvard University, MA
| | - Amanda R Stahlke
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID
| | - Sarah Hendricks
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID
| | - Kevin Lewallen
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID
| | - Rodrigo K Hamede
- School of Natural Sciences, University of Tasmania, Hobart, Australia
| | | | - Oliver Ryder
- Institute for Conservation Research, San Diego, CA
| | | | - Menna E Jones
- School of Natural Sciences, University of Tasmania, Hobart, Australia
| | - Paul A Hohenlohe
- Institute for Bioinformatics and Evolutionary Studies, University of Idaho, Moscow, ID
| | - Andrew Storfer
- School of Biological Sciences, Washington State University, Pullman, WA
| |
Collapse
|
45
|
Jay F, Boitard S, Austerlitz F. An ABC Method for Whole-Genome Sequence Data: Inferring Paleolithic and Neolithic Human Expansions. Mol Biol Evol 2020; 36:1565-1579. [PMID: 30785202 DOI: 10.1093/molbev/msz038] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Species generally undergo a complex demographic history consisting, in particular, of multiple changes in population size. Genome-wide sequencing data are potentially highly informative for reconstructing this demographic history. A crucial point is to extract the relevant information from these very large data sets. Here, we design an approach for inferring past demographic events from a moderate number of fully sequenced genomes. Our new approach uses Approximate Bayesian Computation, a simulation-based statistical framework that allows 1) identifying the best demographic scenario among several competing scenarios and 2) estimating the best-fitting parameters under the chosen scenario. Approximate Bayesian Computation relies on the computation of summary statistics. Using a cross-validation approach, we show that statistics such as the lengths of haplotypes shared between individuals, or the decay of linkage disequilibrium with distance, can be combined with classical statistics (e.g., heterozygosity and Tajima's D) to accurately infer complex demographic scenarios including bottlenecks and expansion periods. We also demonstrate the importance of simultaneously estimating the genotyping error rate. Applying our method on genome-wide human-sequence databases, we finally show that a model consisting in a bottleneck followed by a Paleolithic and a Neolithic expansion is the most relevant for Eurasian populations.
Collapse
Affiliation(s)
- Flora Jay
- Laboratoire EcoAnthropologie et Ethnobiologie, CNRS/MNHN/Université Paris Diderot, Paris, France.,Laboratoire de Recherche en Informatique, CNRS/Université Paris-Sud/Université Paris-Saclay, Orsay, France
| | - Simon Boitard
- GenPhySE, Université de Toulouse, INRA, INPT, INP-ENVT, Castanet Tolosan, France
| | - Frédéric Austerlitz
- Laboratoire EcoAnthropologie et Ethnobiologie, CNRS/MNHN/Université Paris Diderot, Paris, France
| |
Collapse
|
46
|
Khimoun A, Doums C, Molet M, Kaufmann B, Peronnet R, Eyer PA, Mona S. Urbanization without isolation: the absence of genetic structure among cities and forests in the tiny acorn ant Temnothorax nylanderi. Biol Lett 2020; 16:20190741. [PMID: 31992150 DOI: 10.1098/rsbl.2019.0741] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Urban alteration of neutral and adaptive evolutionary processes is still underexplored. Using a genome-wide SNP dataset, we investigated (i) urban-induced modifications of population demography, genetic diversity and population structure and (ii) signature of divergent selection between urban and forest populations in the ant species, Temnothorax nylanderi. Our results did not reveal an impact of urbanization on neutral processes since we observed: (i) analogous genetic diversity among paired urban/forest sites and two control populations; (ii) weak population genetic structure explained neither by habitat (urban versus forest) nor by geography; (iii) a remarkably similar demographic history across populations with an ancestral growth followed by a recent decline, regardless of their current habitat or geographical location. The micro-geographical home range of ants may explain their resilience to urbanization. Finally, we detected 19 candidate loci discriminating urban/forest populations and associated with core cellular components, molecular function or biological process. Two of these loci were associated with a gene ontology term that was previously found to belong to a module of co-expressed genes related to caste phenotype. These results call for transcriptomics analyses to identify genes associated with ant social traits and to infer their potential role in urban adaptation.
Collapse
Affiliation(s)
- A Khimoun
- Biogéosciences, UMR 6282 CNRS, Université Bourgogne Franche-Comté, 6 Boulevard Gabriel, 21000 Dijon, France
| | - C Doums
- Institut de Systématique, Évolution, Biodiversité (ISYEB), UMR 7205, MNHN, CNRS, EPHE, Sorbonne Université, 75005 Paris, France.,EPHE, PSL University, 75005 Paris, France
| | - M Molet
- Institut d'Ecologie et des Sciences de l'Environnement de Paris (iEES-Paris), UMR 7618, Sorbonne Université, Université Paris Est Créteil, Université Paris Diderot, CNRS, INRA, IRD, 75005 Paris, France
| | - B Kaufmann
- Université de Lyon, UMR5023 Ecologie des Hydrosystèmes Naturels et Anthropisés, Université Lyon 1, ENTPE, CNRS, Villeurbanne 69622, France
| | - R Peronnet
- Institut d'Ecologie et des Sciences de l'Environnement de Paris (iEES-Paris), UMR 7618, Sorbonne Université, Université Paris Est Créteil, Université Paris Diderot, CNRS, INRA, IRD, 75005 Paris, France
| | - P A Eyer
- Department of Entomology, Texas A&M University, 2143 TAMU, College Station, TX 77843-2143, USA
| | - S Mona
- Institut de Systématique, Évolution, Biodiversité (ISYEB), UMR 7205, MNHN, CNRS, EPHE, Sorbonne Université, 75005 Paris, France.,EPHE, PSL University, 75005 Paris, France
| |
Collapse
|
47
|
Inference of Historical Population-Size Changes with Allele-Frequency Data. G3-GENES GENOMES GENETICS 2020; 10:211-223. [PMID: 31699776 PMCID: PMC6945023 DOI: 10.1534/g3.119.400854] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
With up to millions of nearly neutral polymorphisms now being routinely sampled in population-genomic surveys, it is possible to estimate the site-frequency spectrum of such sites with high precision. Each frequency class reflects a mixture of potentially unique demographic histories, which can be revealed using theory for the probability distributions of the starting and ending points of branch segments over all possible coalescence trees. Such distributions are completely independent of past population history, which only influences the segment lengths, providing the basis for estimating average population sizes separating tree-wide coalescence events. The history of population-size change experienced by a sample of polymorphisms can then be dissected in a model-flexible fashion, and extension of this theory allows estimation of the mean and full distribution of long-term effective population sizes and ages of alleles of specific frequencies. Here, we outline the basic theory underlying the conceptual approach, develop and test an efficient statistical procedure for parameter estimation, and apply this to multiple population-genomic datasets for the microcrustacean Daphnia pulex.
Collapse
|
48
|
Gilbert KJ, Pouyet F, Excoffier L, Peischl S. Transition from Background Selection to Associative Overdominance Promotes Diversity in Regions of Low Recombination. Curr Biol 2019; 30:101-107.e3. [PMID: 31866368 DOI: 10.1016/j.cub.2019.11.063] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 10/16/2019] [Accepted: 11/21/2019] [Indexed: 12/16/2022]
Abstract
Linked selection is a major driver of genetic diversity. Selection against deleterious mutations removes linked neutral diversity (background selection [BGS]) [1], creating a positive correlation between recombination rates and genetic diversity. Purifying selection against recessive variants, however, can also lead to associative overdominance (AOD) [2, 3], due to an apparent heterozygote advantage at linked neutral loci that opposes the loss of neutral diversity by BGS. Zhao and Charlesworth [3] identified the conditions under which AOD should dominate over BGS in a single-locus model and suggested that the effect of AOD could become stronger if multiple linked deleterious variants co-segregate. We present a model describing how and under which conditions multi-locus dynamics can amplify the effects of AOD. We derive the conditions for a transition from BGS to AOD due to pseudo-overdominance [4], i.e., a form of balancing selection that maintains complementary deleterious haplotypes that mask the effect of recessive deleterious mutations. Simulations confirm these findings and show that multi-locus AOD can increase diversity in low-recombination regions much more strongly than previously appreciated. While BGS is known to drive genome-wide diversity in humans [5], the observation of a resurgence of genetic diversity in regions of very low recombination is indicative of AOD. We identify 22 such regions in the human genome consistent with multi-locus AOD. Our results demonstrate that AOD may play an important role in the evolution of low-recombination regions of many species.
Collapse
Affiliation(s)
- Kimberly J Gilbert
- Institute of Ecology and Evolution, Baltzerstrasse 6, University of Bern, 3012 Bern, Switzerland; Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland.
| | - Fanny Pouyet
- Institute of Ecology and Evolution, Baltzerstrasse 6, University of Bern, 3012 Bern, Switzerland; Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Laurent Excoffier
- Institute of Ecology and Evolution, Baltzerstrasse 6, University of Bern, 3012 Bern, Switzerland; Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Stephan Peischl
- Swiss Institute of Bioinformatics, Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland; Interfaculty Bioinformatics Unit, Baltzerstrasse 6, University of Bern, 3012 Bern, Switzerland
| |
Collapse
|
49
|
Satta Y, Zheng W, Nishiyama KV, Iwasaki RL, Hayakawa T, Fujito NT, Takahata N. Two-dimensional site frequency spectrum for detecting, classifying and dating incomplete selective sweeps. Genes Genet Syst 2019; 94:283-300. [DOI: 10.1266/ggs.19-00012] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Affiliation(s)
- Yoko Satta
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Wanjing Zheng
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Kumiko V. Nishiyama
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Risa L. Iwasaki
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| | - Toshiyuki Hayakawa
- Graduate School of Systems Life Sciences and Faculty of Arts and Science, Kyushu University
| | - Naoko T. Fujito
- Institute for Human Genetics and Department of Epidemiology and Biostatistics, University of California
| | - Naoyuki Takahata
- School of Advanced Sciences, SOKENDAI (The Graduate University for Advanced Studies)
| |
Collapse
|
50
|
Lohse K. Come on feel the noise - from metaphors to null models. J Evol Biol 2019; 30:1506-1508. [PMID: 28786185 DOI: 10.1111/jeb.13109] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 04/17/2017] [Accepted: 04/19/2017] [Indexed: 01/03/2023]
Affiliation(s)
- K Lohse
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, UK
| |
Collapse
|