51
Corbett-Detig R, Nielsen R. A Hidden Markov Model Approach for Simultaneously Estimating Local Ancestry and Admixture Time Using Next Generation Sequence Data in Samples of Arbitrary Ploidy. PLoS Genet 2017; 13:e1006529. [PMID: 28045893 PMCID: PMC5242547 DOI: 10.1371/journal.pgen.1006529] [Citation(s) in RCA: 75] [Impact Index Per Article: 10.7] [Received: 07/15/2016] [Revised: 01/18/2017] [Accepted: 12/08/2016] [Indexed: 12/19/2022] Open
Abstract
Admixture, the mixing of genomes from divergent populations, is increasingly appreciated as a central process in evolution. To characterize and quantify patterns of admixture across the genome, a number of methods have been developed for local ancestry inference. However, existing approaches have a number of shortcomings. First, all local ancestry inference methods require some prior assumption about the expected ancestry tract lengths. Second, existing methods generally require genotypes, which are not feasible to obtain for many next-generation sequencing projects. Third, many methods assume samples are diploid; however, a wide variety of sequencing applications will fail to meet this assumption. To address these issues, we introduce a novel hidden Markov model for estimating local ancestry that models the read pileup data rather than genotypes, is generalized to arbitrary ploidy, and can estimate the time since admixture during local ancestry inference. We demonstrate that our method can simultaneously estimate the time since admixture and local ancestry with good accuracy, and that it performs well on samples of high ploidy, i.e. 100 or more chromosomes. As this method is very general, we expect it will be useful for local ancestry inference in a wider variety of populations than has previously been possible. We then applied our method to pooled sequencing data derived from populations of Drosophila melanogaster on an ancestry cline on the east coast of North America. We find that local recombination rates are negatively correlated with the proportion of African ancestry, suggesting that selection against foreign ancestry is the least efficient in low recombination regions. Finally, we show that clinal outlier loci are enriched for genes associated with gene regulatory functions, consistent with a role of regulatory evolution in ecological adaptation of admixed D. melanogaster populations. Our results illustrate the potential of local ancestry inference for elucidating fundamental evolutionary processes.
Affiliation(s)
- Russell Corbett-Detig
- Genomics Institute and Department of Biomolecular Engineering, UC Santa Cruz, Santa Cruz, CA, United States of America
- Department of Integrative Biology, UC Berkeley, Berkeley, CA, United States of America
- Rasmus Nielsen
- Department of Integrative Biology, UC Berkeley, Berkeley, CA, United States of America
- The Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
52
Schrider DR, Shanku AG, Kern AD. Effects of Linked Selective Sweeps on Demographic Inference and Model Selection. Genetics 2016; 204:1207-1223. [PMID: 27605051 PMCID: PMC5105852 DOI: 10.1534/genetics.116.190223] [Citation(s) in RCA: 90] [Impact Index Per Article: 11.3] [Received: 04/07/2016] [Accepted: 09/02/2016] [Indexed: 01/06/2023] Open
Abstract
The availability of large-scale population genomic sequence data has resulted in an explosion in efforts to infer the demographic histories of natural populations across a broad range of organisms. As demographic events alter coalescent genealogies, they leave detectable signatures in patterns of genetic variation within and between populations. Accordingly, a variety of approaches have been designed to leverage population genetic data to uncover the footprints of demographic change in the genome. The vast majority of these methods make the simplifying assumption that the measures of genetic variation used as their input are unaffected by natural selection. However, natural selection can dramatically skew patterns of variation not only at selected sites, but at linked, neutral loci as well. Here we assess the impact of recent positive selection on demographic inference by characterizing the performance of three popular methods through extensive simulation of data sets with varying numbers of linked selective sweeps. In particular, we examined three different demographic models relevant to a number of species, finding that positive selection can bias parameter estimates of each of these models, often severely. We find that selection can lead to incorrect inferences of population size changes when none have occurred. Moreover, we show that linked selection can lead to incorrect demographic model selection when multiple demographic scenarios are compared. We argue that natural populations may experience the amount of recent positive selection required to skew inferences. These results suggest that demographic studies conducted in many species to date may have exaggerated the extent and frequency of population size changes.
Affiliation(s)
- Daniel R Schrider
- Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey 08854
- Alexander G Shanku
- Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
- Institute for Quantitative Biomedicine, Rutgers University, Piscataway, New Jersey 08854
- Andrew D Kern
- Department of Genetics, Rutgers University, Piscataway, New Jersey 08854
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey 08854
53
Librado P, Rozas J. Weak Polygenic Selection Drives the Rapid Adaptation of the Chemosensory System: Lessons from the Upstream Regions of the Major Gene Families. Genome Biol Evol 2016; 8:2493-504. [PMID: 27503297 PMCID: PMC5010915 DOI: 10.1093/gbe/evw191] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Accepted: 07/18/2016] [Indexed: 12/12/2022] Open
Abstract
The animal chemosensory system is involved in essential biological processes, most of them mediated by proteins encoded in multigene families. These multigene families have been fundamental for the adaptation to new environments, significantly contributing to phenotypic variation. This adaptive potential contrasts, however, with the lack of studies at their upstream regions, especially taking into account the evidence linking their transcriptional changes to certain phenotypic effects. Here, we explicitly characterize the contribution of the upstream sequences of the major chemosensory gene families to rapid adaptive processes. For that, we analyze the genome sequences of 158 lines from a population of Drosophila melanogaster that recently colonized North America, and integrate functional and transcriptional data available for this species. We find that both strong negative and strong positive selection shape transcriptional evolution at the genome-wide level. The chemosensory upstream regions, however, exhibit a distinctive adaptive landscape, including multiple mutations of small beneficial effect and a reduced number of cis-regulatory elements. Together, our results suggest that the promiscuous and partially redundant transcription and function of the chemosensory genes provide evolutionary opportunities for rapid adaptive episodes through weak polygenic selection.
Affiliation(s)
- Pablo Librado
- Departament de Genètica, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
- Julio Rozas
- Departament de Genètica, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
55
Flatt T. Genomics of clinal variation in Drosophila: disentangling the interactions of selection and demography. Mol Ecol 2016; 25:1023-6. [PMID: 26919307 DOI: 10.1111/mec.13534] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Received: 12/02/2015] [Accepted: 01/09/2016] [Indexed: 12/23/2022]
Abstract
Clines in phenotypes and genotype frequencies across environmental gradients are commonly taken as evidence for spatially varying selection. Classical examples include the latitudinal clines in various species of Drosophila, which often occur in parallel fashion on multiple continents. Today, genomewide analysis of such clinal systems provides a fantastic opportunity for unravelling the genetics of adaptation, yet major challenges remain. A well-known but often neglected problem is that demographic processes can also generate clinality, independent of or coincident with selection. A closely related issue is how to identify true genic targets of clinal selection. In this issue of Molecular Ecology, three studies illustrate these challenges and how they might be met. Bergland et al. report evidence suggesting that the well-known parallel latitudinal clines in North American and Australian D. melanogaster are confounded by admixture from Africa and Europe, highlighting the importance of distinguishing demographic from adaptive clines. In a companion study, Machado et al. provide the first genomic comparison of latitudinal differentiation in D. melanogaster and its sister species D. simulans. While D. simulans is less clinal than D. melanogaster, a significant fraction of clinal genes is shared between both species, suggesting the existence of convergent adaptation to clinally varying selection pressures. Finally, by drawing on several independent sources of evidence, Božičević et al. identify a functional network of eight clinal genes that are likely involved in cold adaptation. Together, these studies remind us that clinality does not necessarily imply selection and that separating adaptive signal from demographic noise requires great effort and care.
Affiliation(s)
- Thomas Flatt
- Department of Ecology and Evolution, University of Lausanne, Lausanne, CH-1015, Switzerland
56
Elyashiv E, Sattath S, Hu TT, Strutsovsky A, McVicker G, Andolfatto P, Coop G, Sella G. A Genomic Map of the Effects of Linked Selection in Drosophila. PLoS Genet 2016; 12:e1006130. [PMID: 27536991 PMCID: PMC4990265 DOI: 10.1371/journal.pgen.1006130] [Citation(s) in RCA: 88] [Impact Index Per Article: 11.0] [Received: 01/05/2015] [Accepted: 05/26/2016] [Indexed: 01/23/2023] Open
Abstract
Natural selection at one site shapes patterns of genetic variation at linked sites. Quantifying the effects of "linked selection" on levels of genetic diversity is key to making reliable inference about demography, building a null model in scans for targets of adaptation, and learning about the dynamics of natural selection. Here, we introduce the first method that jointly infers parameters of distinct modes of linked selection, notably background selection and selective sweeps, from genome-wide diversity data, functional annotations and genetic maps. The central idea is to calculate the probability that a neutral site is polymorphic given local annotations, substitution patterns, and recombination rates. Information is then combined across sites and samples using composite likelihood in order to estimate genome-wide parameters of distinct modes of selection. In addition to parameter estimation, this approach yields a map of the expected neutral diversity levels along the genome. To illustrate the utility of our approach, we apply it to genome-wide resequencing data from 125 lines in Drosophila melanogaster and reliably predict diversity levels at the 1Mb scale. Our results corroborate estimates of a high fraction of beneficial substitutions in proteins and untranslated regions (UTR). They allow us to distinguish between the contribution of sweeps and other modes of selection around amino acid substitutions and to uncover evidence for pervasive sweeps in untranslated regions (UTRs). Our inference further suggests a substantial effect of other modes of linked selection and of adaptation in particular. More generally, we demonstrate that linked selection has had a larger effect in reducing diversity levels and increasing their variance in D. melanogaster than previously appreciated.
Affiliation(s)
- Eyal Elyashiv
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
- Shmuel Sattath
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
- Tina T. Hu
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Alon Strutsovsky
- Department of Ecology, Evolution, and Behavior, Hebrew University of Jerusalem, Jerusalem, Israel
- Graham McVicker
- The Laboratory of Genetics and The Integrative Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California, United States of America
- Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Graham Coop
- Department of Evolution and Ecology, University of California, Davis, Davis, California, United States of America
- Guy Sella
- Department of Biological Sciences, Columbia University, New York, New York, United States of America
57
Papadantonakis S, Poirazi P, Pavlidis P. CoMuS: simulating coalescent histories and polymorphic data from multiple species. Mol Ecol Resour 2016; 16:1435-1448. [PMID: 27238297 DOI: 10.1111/1755-0998.12544] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Received: 06/08/2015] [Revised: 04/30/2016] [Accepted: 05/06/2016] [Indexed: 01/25/2023]
Abstract
The simultaneous analysis of intra- and interspecies variation is challenging mainly because our knowledge about patterns of polymorphisms where both intra- and interspecies samples coexist is limited. In this study, we present CoMuS (Coalescent of Multiple Species), a multispecies coalescent software that can simulate intra- and interspecies polymorphisms. CoMuS supports a variety of speciation models and demographic scenarios related to the history of each species. In CoMuS, speciation can be accompanied by either instant or gradual isolation between sister species. Sampling may also occur in the past, and thus, we can study simultaneously extinct and extant species. Our software supports both the infinite- and the finite-site model, with substitution rate heterogeneity among sites and a user-defined proportion of invariable sites. We demonstrate the usage of CoMuS in various applications: species delimitation, software testing, model selection and parameter inference involving present-day and ancestral samples, comparison between gradual and instantaneous isolation models, estimation of speciation time between human and chimpanzee using both intra- and interspecies variation. We expect that CoMuS will be particularly useful for studies where species have been separated recently from their common ancestor and phenomena such as incomplete lineage sorting or introgression still occur.
Affiliation(s)
- S Papadantonakis
- Department of Biology, University of Crete, PO Box 2208, 71409, Heraklio, Greece
- P Poirazi
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology-Hellas (FORTH), 70013, Heraklio, Greece
- P Pavlidis
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology-Hellas (FORTH), 70013, Heraklio, Greece
58
Beissinger TM, Wang L, Crosby K, Durvasula A, Hufford MB, Ross-Ibarra J. Recent demography drives changes in linked selection across the maize genome. Nat Plants 2016; 2:16084. [PMID: 27294617 DOI: 10.1038/nplants.2016.84] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Received: 01/05/2016] [Accepted: 05/12/2016] [Indexed: 05/14/2023]
Abstract
Genetic diversity is shaped by the interaction of drift and selection, but the details of this interaction are not well understood. The impact of genetic drift in a population is largely determined by its demographic history, typically summarized by its long-term effective population size (Ne). Rapidly changing population demographics complicate this relationship, however. To better understand how changing demography impacts selection, we used whole-genome sequencing data to investigate patterns of linked selection in domesticated and wild maize (teosinte). We produce the first whole-genome estimate of the demography of maize domestication, showing that maize was reduced to approximately 5% of the population size of teosinte before it experienced rapid expansion post-domestication to population sizes much larger than its ancestor. Evaluation of patterns of nucleotide diversity in and near genes shows little evidence of selection on beneficial amino acid substitutions, and that the domestication bottleneck led to a decline in the efficiency of purifying selection in maize. Young alleles, however, show evidence of much stronger purifying selection in maize, reflecting the much larger effective size of present day populations. Our results demonstrate that recent demographic change, a hallmark of many species including both humans and crops, can have immediate and wide-ranging impacts on diversity that conflict with expectations based on long-term Ne alone.
Affiliation(s)
- Timothy M Beissinger
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- US Department of Agriculture, Agricultural Research Service, Columbia, Missouri 65211, USA
- Division of Plant Sciences, University of Missouri, Columbia, Missouri 65211, USA
- Li Wang
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
- Kate Crosby
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- Arun Durvasula
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- Matthew B Hufford
- Department of Ecology, Evolution, and Organismal Biology, Iowa State University, Ames, Iowa 50011, USA
- Jeffrey Ross-Ibarra
- Department of Plant Sciences, University of California, Davis, California 95616, USA
- Genome Center and Center for Population Biology, University of California, Davis, California 95616, USA
59
Hemmer LW, Blumenstiel JP. Holding it together: rapid evolution and positive selection in the synaptonemal complex of Drosophila. BMC Evol Biol 2016; 16:91. [PMID: 27150275 PMCID: PMC4857336 DOI: 10.1186/s12862-016-0670-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Received: 01/26/2016] [Accepted: 04/27/2016] [Indexed: 11/21/2022] Open
Abstract
Background: The synaptonemal complex (SC) is a highly conserved meiotic structure that functions to pair homologs and facilitate meiotic recombination in most eukaryotes. Five Drosophila SC proteins have been identified and localized within the complex: C(3)G, C(2)M, CONA, ORD, and the newly identified Corolla. The SC is required for meiotic recombination in Drosophila and absence of these proteins leads to reduced crossing over and chromosomal nondisjunction. Despite the conserved nature of the SC and the key role that these five proteins have in meiosis in D. melanogaster, they display little apparent sequence conservation outside the genus. To identify factors that explain this lack of apparent conservation, we performed a molecular evolutionary analysis of these genes across the Drosophila genus.
Results: For the five SC components, gene sequence similarity declines rapidly with increasing phylogenetic distance and only ORD and C(2)M are identifiable outside of the Drosophila genus. SC gene sequences have a higher dN/dS (ω) rate ratio than the genome wide average and this can in part be explained by the action of positive selection in almost every SC component. Across the genus, there is significant variation in ω for each protein. It further appears that ω estimates for the five SC components are in accordance with their physical position within the SC. Components interacting with chromatin evolve slowest and components comprising the central elements evolve the most rapidly. Finally, using population genetic approaches, we demonstrate that positive selection on SC components is ongoing.
Conclusions: SC components within Drosophila show little apparent sequence homology to those identified in other model organisms due to their rapid evolution. We propose that the Drosophila SC is evolving rapidly due to two combined effects. First, we propose that a high rate of evolution can be partly explained by low purifying selection on protein components whose function is to simply hold chromosomes together. We also propose that positive selection in the SC is driven by its sex-specificity combined with its role in facilitating both recombination and centromere clustering in the face of recurrent bouts of drive in female meiosis.
Affiliation(s)
- Lucas W Hemmer
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, 66045, USA
- Justin P Blumenstiel
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, KS, 66045, USA
60
Catalán A, Glaser-Schmitt A, Argyridou E, Duchen P, Parsch J. An Indel Polymorphism in the MtnA 3' Untranslated Region Is Associated with Gene Expression Variation and Local Adaptation in Drosophila melanogaster. PLoS Genet 2016; 12:e1005987. [PMID: 27120580 PMCID: PMC4847869 DOI: 10.1371/journal.pgen.1005987] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Received: 09/07/2015] [Accepted: 03/22/2016] [Indexed: 11/18/2022] Open
Abstract
Insertions and deletions (indels) are a major source of genetic variation within species and may result in functional changes to coding or regulatory sequences. In this study we report that an indel polymorphism in the 3' untranslated region (UTR) of the metallothionein gene MtnA is associated with gene expression variation in natural populations of Drosophila melanogaster. A derived allele of MtnA with a 49-bp deletion in the 3' UTR segregates at high frequency in populations outside of sub-Saharan Africa. The frequency of the deletion increases with latitude across multiple continents and approaches 100% in northern Europe. Flies with the deletion have more than 4-fold higher MtnA expression than flies with the ancestral sequence. Using reporter gene constructs in transgenic flies, we show that the 3' UTR deletion significantly contributes to the observed expression difference. Population genetic analyses uncovered signatures of a selective sweep in the MtnA region within populations from northern Europe. We also find that the 3' UTR deletion is associated with increased oxidative stress tolerance. These results suggest that the 3' UTR deletion has been a target of selection for its ability to confer increased levels of MtnA expression in northern European populations, likely due to a local adaptive advantage of increased oxidative stress tolerance.
Although molecular variation is abundant in natural populations, understanding how this variation affects organismal phenotypes that are subject to natural selection remains a major challenge in the field of evolutionary genetics. Here we show that a deletion mutation in a noncoding region of the Drosophila melanogaster Metallothionein A gene leads to a significant increase in gene expression and increases survival under oxidative stress. The deletion is in high frequency in three distinct geographic regions: in northern European populations, in northern populations along the east coast of North America, and in southern populations along the east coast of Australia. In northern European populations the deletion shows population genetic signatures of recent positive selection. Thus, we provide evidence for a regulatory polymorphism that underlies local adaptation in natural populations.
Affiliation(s)
- Ana Catalán
- Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
- Department of Ecology and Evolutionary Biology, University of California, Irvine, Irvine, California, United States of America
- Eliza Argyridou
- Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
- Pablo Duchen
- Department of Biology and Biochemistry, University of Fribourg, Fribourg, Switzerland
- John Parsch
- Faculty of Biology, Ludwig-Maximilians-Universität München, Planegg, Germany
61
Elevated Linkage Disequilibrium and Signatures of Soft Sweeps Are Common in Drosophila melanogaster. Genetics 2016; 203:863-80. [PMID: 27098909 DOI: 10.1534/genetics.115.184002] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 10/25/2015] [Accepted: 03/25/2016] [Indexed: 12/20/2022] Open
Abstract
The extent to which selection and demography impact patterns of genetic diversity in natural populations of Drosophila melanogaster is yet to be fully understood. We previously observed that linkage disequilibrium (LD) at scales of ∼10 kb in the Drosophila Genetic Reference Panel (DGRP), consisting of 145 inbred strains from Raleigh, North Carolina, measured both between pairs of sites and as haplotype homozygosity, is elevated above neutral demographic expectations. We also demonstrated that signatures of strong and recent soft sweeps are abundant. However, the extent to which these patterns are specific to this derived and admixed population is unknown. It is also unclear whether these patterns are a consequence of the extensive inbreeding performed to generate the DGRP data. Here we analyze LD statistics in a sample of >100 fully-sequenced strains from Zambia, an ancestral population to the Raleigh population that has experienced little to no admixture; the sample was generated by sequencing haploid embryos rather than inbred strains. We find an elevation in long-range LD and haplotype homozygosity compared to neutral expectations in the Zambian sample, thus showing the elevation in LD is not specific to the DGRP data set. This elevation in LD and haplotype structure remains even after controlling for possible confounders including genomic inversions, admixture, population substructure, close relatedness of individual strains, and recombination rate variation. Furthermore, signatures of partial soft sweeps similar to those found in the DGRP as well as partial hard sweeps are common in Zambia. These results suggest that while the selective forces and sources of adaptive mutations may differ in Zambia and Raleigh, elevated long-range LD and signatures of soft sweeps are generic in D. melanogaster.
62
Croze M, Živković D, Stephan W, Hutter S. Balancing selection on immunity genes: review of the current literature and new analysis in Drosophila melanogaster. Zoology 2016; 119:322-9. [PMID: 27106015 DOI: 10.1016/j.zool.2016.03.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Received: 10/30/2015] [Revised: 02/11/2016] [Accepted: 03/16/2016] [Indexed: 12/18/2022]
Abstract
Balancing selection has been widely assumed to be an important evolutionary force, yet even today little is known about its abundance and its impact on the patterns of genetic diversity. Several studies have shown examples of balancing selection in humans, plants or parasites, and many genes under balancing selection are involved in immunity. It has been proposed that host-parasite coevolution is one of the main forces driving immune genes to evolve under balancing selection. In this paper, we review the literature on balancing selection on immunity genes in several organisms, including Drosophila. Furthermore, we performed a genome scan for balancing selection in an African population of Drosophila melanogaster using coalescent simulations of a demographic model with and without selection. We find very few genes under balancing selection and only one novel candidate gene related to immunity. Finally, we discuss the possible causes of the low number of genes under balancing selection.
Affiliation(s)
- Myriam Croze
- Department of Biology II, Ludwig Maximilian University Munich, Großhaderner Str. 2, D-82152 Planegg-Martinsried, Germany
- Daniel Živković
- Department of Biology II, Ludwig Maximilian University Munich, Großhaderner Str. 2, D-82152 Planegg-Martinsried, Germany
- Wolfgang Stephan
- Department of Biology II, Ludwig Maximilian University Munich, Großhaderner Str. 2, D-82152 Planegg-Martinsried, Germany
- Stephan Hutter
- Department of Biology II, Ludwig Maximilian University Munich, Großhaderner Str. 2, D-82152 Planegg-Martinsried, Germany
63
Sheehan S, Song YS. Deep Learning for Population Genetic Inference. PLoS Comput Biol 2016; 12:e1004845. [PMID: 27018908 PMCID: PMC4809617 DOI: 10.1371/journal.pcbi.1004845] [Citation(s) in RCA: 139] [Impact Index Per Article: 17.4] [Received: 10/02/2015] [Accepted: 03/02/2016] [Indexed: 02/05/2023] Open
Abstract
Given genomic variation data from multiple individuals, computing the likelihood of complex population genetic models is often infeasible. To circumvent this problem, we introduce a novel likelihood-free inference framework by applying deep learning, a powerful modern technique in machine learning. Deep learning makes use of multilayer neural networks to learn a feature-based function from the input (e.g., hundreds of correlated summary statistics of data) to the output (e.g., population genetic parameters of interest). We demonstrate that deep learning can be effectively employed for population genetic inference and learning informative features of data. As a concrete application, we focus on the challenging problem of jointly inferring natural selection and demography (in the form of a population size change history). Our method is able to separate the global nature of demography from the local nature of selection, without sequential steps for these two factors. Studying demography and selection jointly is motivated by Drosophila, where pervasive selection confounds demographic analysis. We apply our method to 197 African Drosophila melanogaster genomes from Zambia to infer both their overall demography, and regions of their genome under selection. We find many regions of the genome that have experienced hard sweeps, and fewer under selection on standing variation (soft sweep) or balancing selection. Interestingly, we find that soft sweeps and balancing selection occur more frequently closer to the centromere of each chromosome. In addition, our demographic inference suggests that previously estimated bottlenecks for African Drosophila melanogaster are too extreme.
Affiliation(s)
- Sara Sheehan
  - Department of Computer Science, Smith College, Northampton, Massachusetts, United States of America
  - Computer Science Division, UC Berkeley, Berkeley, California, United States of America
- Yun S. Song
  - Computer Science Division, UC Berkeley, Berkeley, California, United States of America
  - Department of Statistics, UC Berkeley, Berkeley, California, United States of America
  - Department of Integrative Biology, UC Berkeley, Berkeley, California, United States of America
  - Department of Mathematics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
  - Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
|
64
|
Cogni R, Kuczynski K, Lavington E, Koury S, Behrman EL, O'Brien KR, Schmidt PS, Eanes WF. Variation in Drosophila melanogaster central metabolic genes appears driven by natural selection both within and between populations. Proc Biol Sci 2016; 282:20142688. [PMID: 25520361 DOI: 10.1098/rspb.2014.2688] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Indexed: 01/01/2023] Open
Abstract
In this report, we examine the hypothesis that the drivers of latitudinal selection observed in the eastern US Drosophila melanogaster populations are reiterated within seasons in a temperate orchard population in Pennsylvania, USA. Specifically, we ask whether alleles that are apparently favoured in northern populations are also favoured early in the spring, and decrease in frequency from the spring to autumn with the population expansion. We use SNP data collected for 46 metabolic genes and 128 SNPs representing the central metabolic pathway and examine for the aggregate SNP allele frequencies whether the association of allele change with latitude and that with increasing days of spring-autumn season are reversed. Testing by random permutation, we observe a highly significant negative correlation between these associations that is consistent with this expectation. This correlation is stronger when we confine our analysis to only those alleles that show significant latitudinal changes. This pattern is not caused by association with chromosomal inversions. When data are resampled using SNPs for amino acid change the relationship is not significant but is supported when SNPs associated with cis-expression are only considered. Our results suggest that climate factors driving latitudinal molecular variation in a metabolic pathway are related to those operating on a seasonal level within populations.
Affiliation(s)
- Rodrigo Cogni
  - Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
- Kate Kuczynski
  - Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
- Erik Lavington
  - Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
- Spencer Koury
  - Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
- Emily L Behrman
  - Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
- Paul S Schmidt
  - Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
- Walter F Eanes
  - Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794, USA
|
65
|
Sato MP, Makino T, Kawata M. Natural selection in a population of Drosophila melanogaster explained by changes in gene expression caused by sequence variation in core promoter regions. BMC Evol Biol 2016; 16:35. [PMID: 26860869 PMCID: PMC4748610 DOI: 10.1186/s12862-016-0606-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Received: 11/28/2015] [Accepted: 01/29/2016] [Indexed: 11/29/2022] Open
Abstract
Background: Understanding the evolutionary forces that influence variation in gene regulatory regions in natural populations is an important challenge for evolutionary biology because natural selection for such variations could promote adaptive phenotypic evolution. Recently, whole-genome sequence analyses have identified regulatory regions subject to natural selection. However, these studies could not identify the relationship between sequence variation in the detected regions and change in gene expression levels. We analyzed sequence variations in core promoter regions, which are critical regions for gene regulation in higher eukaryotes, in a natural population of Drosophila melanogaster, and identified core promoter sequence variations associated with differences in gene expression levels subjected to natural selection.
Results: Among the core promoter regions whose sequence variation could change transcription factor binding sites and explain differences in expression levels, three core promoter regions were detected as candidates associated with purifying selection or selective sweep and seven as candidates associated with balancing selection, excluding the possibility of linkage between these regions and core promoter regions. CHKov1, which confers resistance to the sigma virus and related insecticides, was identified as a core promoter region that has been subject to selective sweep, although it could not be denied that selection for variation in core promoter regions was due to linked single nucleotide polymorphisms in the regulatory region outside core promoter regions. Nucleotide changes in core promoter regions of CHKov1 caused the loss of two basal transcription factor binding sites and acquisition of one transcription factor binding site, resulting in decreased gene expression levels. Of nine core promoter regions associated with balancing selection, brat and CG9044 are associated with neuromuscular junction development, and Nmda1 is associated with learning, behavioral plasticity, and memory. Diversity of neural and behavioral traits may have been maintained by balancing selection.
Conclusions: Our results revealed the evolutionary process occurring by natural selection for differences in gene expression levels caused by sequence variation in core promoter regions in a natural population. The sequences of core promoter regions were diverse even within the population, possibly providing a source for natural selection.
Affiliation(s)
- Mitsuhiko P Sato
  - Department of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-ku, Sendai, 980-8578, Japan.
- Takashi Makino
  - Department of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-ku, Sendai, 980-8578, Japan.
- Masakado Kawata
  - Department of Ecology and Evolutionary Biology, Graduate School of Life Sciences, Tohoku University, 6-3, Aramaki Aza Aoba, Aoba-ku, Sendai, 980-8578, Japan.
|
66
|
Schrider DR, Hahn MW, Begun DJ. Parallel Evolution of Copy-Number Variation across Continents in Drosophila melanogaster. Mol Biol Evol 2016; 33:1308-16. [PMID: 26809315 DOI: 10.1093/molbev/msw014] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Indexed: 12/17/2022] Open
Abstract
Genetic differentiation across populations that is maintained in the presence of gene flow is a hallmark of spatially varying selection. In Drosophila melanogaster, the latitudinal clines across the eastern coasts of Australia and North America appear to be examples of this type of selection, with recent studies showing that a substantial portion of the D. melanogaster genome exhibits allele frequency differentiation with respect to latitude on both continents. As of yet there has been no genome-wide examination of differentiated copy-number variants (CNVs) in these geographic regions, despite their potential importance for phenotypic variation in Drosophila and other taxa. Here, we present an analysis of geographic variation in CNVs in D. melanogaster. We also present the first genomic analysis of geographic variation for copy-number variation in the sister species, D. simulans, in order to investigate patterns of parallel evolution in these close relatives. In D. melanogaster we find hundreds of CNVs, many of which show parallel patterns of geographic variation on both continents, lending support to the idea that they are influenced by spatially varying selection. These findings support the idea that polymorphic CNVs contribute to local adaptation in D. melanogaster. In contrast, we find very few CNVs in D. simulans that are geographically differentiated in parallel on both continents, consistent with earlier work suggesting that clinal patterns are weaker in this species.
Affiliation(s)
- Matthew W Hahn
  - Department of Biology and School of Informatics and Computing, Indiana University, Bloomington
- David J Begun
  - Department of Evolution and Ecology, University of California, Davis
|
67
|
Kapun M, Fabian DK, Goudet J, Flatt T. Genomic Evidence for Adaptive Inversion Clines in Drosophila melanogaster. Mol Biol Evol 2016; 33:1317-36. [PMID: 26796550 DOI: 10.1093/molbev/msw016] [Citation(s) in RCA: 106] [Impact Index Per Article: 13.3] [Indexed: 02/07/2023] Open
Abstract
Clines in chromosomal inversion polymorphisms, presumably driven by climatic gradients, are common but there is surprisingly little evidence for selection acting on them. Here we address this long-standing issue in Drosophila melanogaster by using diagnostic single nucleotide polymorphism (SNP) markers to estimate inversion frequencies from 28 whole-genome Pool-seq samples collected from 10 populations along the North American east coast. Inversions In(3L)P, In(3R)Mo, and In(3R)Payne showed clear latitudinal clines, and for In(2L)t, In(2R)NS, and In(3R)Payne the steepness of the clinal slopes changed between summer and fall. Consistent with an effect of seasonality on inversion frequencies, we detected small but stable seasonal fluctuations of In(2R)NS and In(3R)Payne in a temperate Pennsylvanian population over 4 years. In support of spatially varying selection, we observed that the cline in In(3R)Payne has remained stable for >40 years and that the frequencies of In(2L)t and In(3R)Payne are strongly correlated with climatic factors that vary latitudinally, independent of population structure. To test whether these patterns are adaptive, we compared the amount of genetic differentiation of inversions versus neutral SNPs and found that the clines in In(2L)t and In(3R)Payne are maintained nonneutrally and independent of admixture. We also identified numerous clinal inversion-associated SNPs, many of which exhibit parallel differentiation along the Australian cline and reside in genes known to affect fitness-related traits. Together, our results provide strong evidence that inversion clines are maintained by spatially (and perhaps also temporally) varying selection. We interpret our data in light of current hypotheses about how inversions are established and maintained.
Affiliation(s)
- Martin Kapun
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Daniel K Fabian
  - Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Jérôme Goudet
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
- Thomas Flatt
  - Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
|
68
|
Machado HE, Bergland AO, O'Brien KR, Behrman EL, Schmidt PS, Petrov DA. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster. Mol Ecol 2016; 25:723-40. [PMID: 26523848 DOI: 10.1111/mec.13446] [Citation(s) in RCA: 111] [Impact Index Per Article: 13.9] [Received: 04/28/2015] [Revised: 10/26/2015] [Accepted: 10/28/2015] [Indexed: 12/15/2022]
Abstract
Examples of clinal variation in phenotypes and genotypes across latitudinal transects have served as important models for understanding how spatially varying selection and demographic forces shape variation within species. Here, we examine the selective and demographic contributions to latitudinal variation through the largest comparative genomic study to date of Drosophila simulans and Drosophila melanogaster, with genomic sequence data from 382 individual fruit flies, collected across a spatial transect of 19 degrees latitude and at multiple time points over 2 years. Consistent with phenotypic studies, we find less clinal variation in D. simulans than D. melanogaster, particularly for the autosomes. Moreover, we find that clinally varying loci in D. simulans are less stable over multiple years than comparable clines in D. melanogaster. D. simulans shows a significantly weaker pattern of isolation by distance than D. melanogaster and we find evidence for a stronger contribution of migration to D. simulans population genetic structure. While population bottlenecks and migration can plausibly explain the differences in stability of clinal variation between the two species, we also observe a significant enrichment of shared clinal genes, suggesting that the selective forces associated with climate are acting on the same genes and phenotypes in D. simulans and D. melanogaster.
Affiliation(s)
- Heather E Machado
  - Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
- Alan O Bergland
  - Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
- Katherine R O'Brien
  - School of Biological Sciences, University of Nebraska-Lincoln, 348 Manter Hall, Lincoln, NE, 68588, USA
  - Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
- Emily L Behrman
  - Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
- Paul S Schmidt
  - Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
- Dmitri A Petrov
  - Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
|
69
|
Bergland AO, Tobler R, González J, Schmidt P, Petrov D. Secondary contact and local adaptation contribute to genome-wide patterns of clinal variation in Drosophila melanogaster. Mol Ecol 2016; 25:1157-74. [PMID: 26547394 DOI: 10.1111/mec.13455] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Received: 12/19/2014] [Revised: 10/29/2015] [Accepted: 11/02/2015] [Indexed: 12/12/2022]
Abstract
Populations arrayed along broad latitudinal gradients often show patterns of clinal variation in phenotype and genotype. Such population differentiation can be generated and maintained by both historical demographic events and local adaptation. These evolutionary forces are not mutually exclusive and can in some cases produce nearly identical patterns of genetic differentiation among populations. Here, we investigate the evolutionary forces that generated and maintain clinal variation genome-wide among populations of Drosophila melanogaster sampled in North America and Australia. We contrast patterns of clinal variation in these continents with patterns of differentiation among ancestral European and African populations. Using established and novel methods we derive here, we show that recently derived North America and Australia populations were likely founded by both European and African lineages and that this hybridization event likely contributed to genome-wide patterns of parallel clinal variation between continents. The pervasive effects of admixture mean that differentiation at only several hundred loci can be attributed to the operation of spatially varying selection using an FST outlier approach. Our results provide novel insight into the well-studied system of clinal differentiation in D. melanogaster and provide a context for future studies seeking to identify loci contributing to local adaptation in a wide variety of organisms, including other invasive species as well as temperate endemics.
Affiliation(s)
- Alan O Bergland
  - Department of Biology, Stanford University, Stanford, CA, 94305-5020, USA
- Ray Tobler
  - Department of Biology, Stanford University, Stanford, CA, 94305-5020, USA
  - Institut für Populationsgenetik, Vetmeduni Vienna, Veterinärplatz 1, Vienna, A-1210, Austria
- Josefa González
  - Department of Biology, Stanford University, Stanford, CA, 94305-5020, USA
  - Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Maritim de la Barceloneta 37-49, 08003 Barcelona, Spain
- Paul Schmidt
  - Department of Biology, The University of Pennsylvania, Philadelphia, PA, 19104, USA
- Dmitri Petrov
  - Department of Biology, Stanford University, Stanford, CA, 94305-5020, USA
|
70
|
Božičević V, Hutter S, Stephan W, Wollstein A. Population genetic evidence for cold adaptation in European Drosophila melanogaster populations. Mol Ecol 2016; 25:1175-91. [PMID: 26558479 DOI: 10.1111/mec.13464] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Received: 07/27/2015] [Revised: 11/03/2015] [Accepted: 11/05/2015] [Indexed: 01/05/2023]
Abstract
We studied Drosophila melanogaster populations from Europe (the Netherlands and France) and Africa (Rwanda and Zambia) to uncover genetic evidence of adaptation to cold. We present here four lines of evidence for genes involved in cold adaptation from four perspectives: (i) the frequency of SNPs at genes previously known to be associated with chill-coma recovery time (CCRT), startle reflex (SR) and resistance to starvation stress (RSS) vary along environmental gradients and therefore among populations; (ii) SNPs of genes that correlate significantly with latitude and altitude in African and European populations overlap with SNPs that correlate with a latitudinal cline from North America; (iii) at the genomewide level, the top candidate genes are enriched in gene ontology (GO) terms that are related to cold tolerance; (iv) GO enriched terms from North American clinal genes overlap significantly with those from Africa and Europe. Each SNP was tested in 10 independent runs of Bayenv2, using the median Bayes factors to ascertain candidate genes. None of the candidate genes were found close to the breakpoints of cosmopolitan inversions, and only four candidate genes were linked to QTLs related to CCRT. To overcome the limitation that we used only four populations to test correlations with environmental gradients, we performed simulations to estimate the power of our approach for detecting selection. Based on our results, we propose a novel network of genes that is involved in cold adaptation.
Affiliation(s)
- Vedran Božičević
  - Section of Evolutionary Biology, Department of Biology II, University of Munich, D-82152, Planegg-Martinsried, Germany
- Stephan Hutter
  - Section of Evolutionary Biology, Department of Biology II, University of Munich, D-82152, Planegg-Martinsried, Germany
- Wolfgang Stephan
  - Section of Evolutionary Biology, Department of Biology II, University of Munich, D-82152, Planegg-Martinsried, Germany
- Andreas Wollstein
  - Section of Evolutionary Biology, Department of Biology II, University of Munich, D-82152, Planegg-Martinsried, Germany
|
71
|
Edwards SV, Shultz AJ, Campbell-Staton SC. Next-generation sequencing and the expanding domain of phylogeography. FOLIA ZOOLOGICA 2015. [DOI: 10.25225/fozo.v64.i3.a2.2015] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.2] [Indexed: 11/03/2022]
Affiliation(s)
- Scott V. Edwards
  - Department of Organismic and Evolutionary Biology, and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, U.S.A.
- Allison J. Shultz
  - Department of Organismic and Evolutionary Biology, and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, U.S.A.
- Shane C. Campbell-Staton
  - Department of Organismic and Evolutionary Biology, and Museum of Comparative Zoology, Harvard University, Cambridge, MA 02138, U.S.A.
|
72
|
Choi JY, Aquadro CF. Molecular Evolution of Drosophila Germline Stem Cell and Neural Stem Cell Regulating Genes. Genome Biol Evol 2015; 7:3097-114. [PMID: 26507797 PMCID: PMC4994752 DOI: 10.1093/gbe/evv207] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Indexed: 12/13/2022] Open
Abstract
Here, we study the molecular evolution of a near complete set of genes that had functional evidence in the regulation of the Drosophila germline and neural stem cell. Some of these genes have previously been shown to be rapidly evolving by positive selection, raising the possibility that stem cell genes as a group have elevated signatures of positive selection. Using recent Drosophila comparative genome sequences and population genomic sequences of Drosophila melanogaster, we have investigated both long- and short-term evolution occurring across these two different stem cell systems, and compared them with a carefully chosen random set of genes to represent the background rate of evolution. Our results showed an excess of genes with evidence of a recent selective sweep in both germline and neural stem cells in D. melanogaster. However, compared with their control genes, both stem cell systems had no significant excess of genes with long-term recurrent positive selection in D. melanogaster, or across orthologous sequences from the melanogaster group. The evidence of long-term positive selection was limited to a subset of genes with specific functions in both the germline and neural stem cell system.
Affiliation(s)
- Jae Young Choi
  - Department of Molecular Biology and Genetics, Cornell University
|
73
|
The Effects of Background and Interference Selection on Patterns of Genetic Variation in Subdivided Populations. Genetics 2015; 201:1539-54. [PMID: 26434720 DOI: 10.1534/genetics.115.178558] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Received: 06/04/2015] [Accepted: 09/24/2015] [Indexed: 11/18/2022] Open
Abstract
It is well known that most new mutations that affect fitness exert deleterious effects and that natural populations are often composed of subpopulations (demes) connected by gene flow. To gain a better understanding of the joint effects of purifying selection and population structure, we focus on a scenario where an ancestral population splits into multiple demes and study neutral diversity patterns in regions linked to selected sites. In the background selection regime of strong selection, we first derive analytic equations for pairwise coalescent times and FST as a function of time after the ancestral population splits into two demes and then construct a flexible coalescent simulator that can generate samples under complex models such as those involving multiple demes or nonconservative migration. We have carried out extensive forward simulations to show that the new methods can accurately predict diversity patterns both in the nonequilibrium phase following the split of the ancestral population and in the equilibrium between mutation, migration, drift, and selection. In the interference selection regime of many tightly linked selected sites, forward simulations provide evidence that neutral diversity patterns obtained from both the nonequilibrium and equilibrium phases may be virtually indistinguishable for models that have identical variance in fitness, but are nonetheless different with respect to the number of selected sites and the strength of purifying selection. This equivalence in neutral diversity patterns suggests that data collected from subdivided populations may have limited power for differentiating among the selective pressures to which closely linked selected sites are subject.
|
74
|
Pool JE. The Mosaic Ancestry of the Drosophila Genetic Reference Panel and the D. melanogaster Reference Genome Reveals a Network of Epistatic Fitness Interactions. Mol Biol Evol 2015; 32:3236-51. [PMID: 26354524 PMCID: PMC4652625 DOI: 10.1093/molbev/msv194] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Indexed: 12/25/2022] Open
Abstract
North American populations of Drosophila melanogaster derive from both European and African source populations, but despite their importance for genetic research, patterns of ancestry along their genomes are largely undocumented. Here, I infer geographic ancestry along genomes of the Drosophila Genetic Reference Panel (DGRP) and the D. melanogaster reference genome, which may have implications for reference alignment, association mapping, and population genomic studies in Drosophila. Overall, the proportion of African ancestry was estimated to be 20% for the DGRP and 9% for the reference genome. Combining my estimate of admixture timing with historical records, I provide the first estimate of natural generation time for this species (approximately 15 generations per year). Ancestry levels were found to vary strikingly across the genome, with less African introgression on the X chromosome, in regions of high recombination, and at genes involved in specific processes (e.g., circadian rhythm). An important role for natural selection during the admixture process was further supported by evidence that many unlinked pairs of loci showed a deficiency of Africa–Europe allele combinations between them. Numerous epistatic fitness interactions may therefore exist between African and European genotypes, leading to ongoing selection against incompatible variants. By focusing on hubs in this network of fitness interactions, I identified a set of interacting loci that include genes with roles in sensation and neuropeptide/hormone reception. These findings suggest that admixed D. melanogaster samples could become an important study system for the genetics of early-stage isolation between populations.
Affiliation(s)
- John E Pool
  - Laboratory of Genetics, University of Wisconsin-Madison
|
75
|
Kao JY, Lymer S, Hwang SH, Sung A, Nuzhdin SV. Postmating reproductive barriers contribute to the incipient sexual isolation of the United States and Caribbean Drosophila melanogaster. Ecol Evol 2015; 5:3171-82. [PMID: 26357543 PMCID: PMC4559059 DOI: 10.1002/ece3.1596] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 04/17/2015] [Revised: 05/22/2015] [Accepted: 05/26/2015] [Indexed: 02/01/2023] Open
Abstract
The nascent stages of speciation start with the emergence of sexual isolation. Understanding the influence of reproductive barriers in this evolutionary process is an ongoing effort. We present a study of Drosophila melanogaster admixed populations from the southeast United States and the Caribbean islands known to be a secondary contact zone of European- and African-derived populations undergoing incipient sexual isolation. The existence of premating reproductive barriers has been previously established, but these types of barriers are not the only source shaping sexual isolation. To assess the influence of postmating barriers, we investigated putative postmating barriers of female remating and egg-laying behavior, as well as hatchability of eggs laid and female longevity after mating. In the central region of our putative hybrid zone of American and Caribbean populations, we observed lower hatchability of eggs laid accompanied by increased resistance to harm after mating to less-related males. These results illustrate that postmating reproductive barriers act alongside premating barriers and genetic admixture such as hybrid incompatibilities and influence early phases of sexual isolation.
Affiliation(s)
- Joyce Y Kao
  - Section of Molecular and Computational Biology, Department of Biology, University of Southern California, Los Angeles, California, 90089
  - Department of Biology, New York University, 29 Washington Pl, New York City, New York, 10003
- Seana Lymer
  - Department of Biology, New York University, 29 Washington Pl, New York City, New York, 10003
- Sea H Hwang
  - Section of Molecular and Computational Biology, Department of Biology, University of Southern California, Los Angeles, California, 90089
- Albert Sung
  - Section of Molecular and Computational Biology, Department of Biology, University of Southern California, Los Angeles, California, 90089
- Sergey V Nuzhdin
  - Section of Molecular and Computational Biology, Department of Biology, University of Southern California, Los Angeles, California, 90089
  - St. Petersburg State Polytechnical University, St. Petersburg, Russia
|
76
|
Survival Rate and Transcriptional Response upon Infection with the Generalist Parasite Beauveria bassiana in a World-Wide Sample of Drosophila melanogaster. PLoS One 2015; 10:e0132129. [PMID: 26154519 PMCID: PMC4495925 DOI: 10.1371/journal.pone.0132129] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 03/28/2015] [Accepted: 06/10/2015] [Indexed: 01/22/2023] Open
Abstract
The ability to cope with infection by a parasite is one of the major challenges for any host species and is a major driver of evolution. Parasite pressure differs between habitats. It is thought to be higher in tropical regions compared to temperate ones. We infected Drosophila melanogaster from two tropical (Malaysia and Zimbabwe) and two temperate populations (the Netherlands and North Carolina) with the generalist entomopathogenic fungus Beauveria bassiana to examine if adaptation to local parasite pressures led to differences in resistance. Contrary to previous findings we observed increased survival in temperate populations. This, however, is not due to increased resistance to infection per se, but rather the consequence of a higher general vigor of the temperate populations. We also assessed transcriptional response to infection within these flies eight and 24 hours after infection. Only a few genes were induced at the earlier time point, most of which are involved in detoxification. In contrast, we identified more than 4,000 genes that changed their expression state after 24 hours. This response was generally conserved over all populations, with only a few genes being uniquely regulated in the temperate populations. We furthermore found that the American population was transcriptionally highly diverged from all other populations concerning basal levels of gene expression. This was particularly true for stress and immune response genes, which might be the genetic basis for their elevated vigor.
|
77
|
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel. G3-GENES GENOMES GENETICS 2015; 5:1695-701. [PMID: 26068573 PMCID: PMC4528326 DOI: 10.1534/g3.115.019554] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Indexed: 11/18/2022]
Abstract
We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r² ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epistatic selection during the inbreeding and maintenance of the DGRP lines.
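As an illustration of the r² cutoff mentioned in this abstract, the following is a minimal Python sketch of the standard pairwise r² statistic for two biallelic sites. It assumes haplotype-style calls coded 0/1 with no missing data, which is a simplification; the function name `r_squared` and the toy inputs are hypothetical and are not taken from the paper.

```python
import math


def r_squared(site_a, site_b):
    """Return r^2 between two biallelic sites coded as 0/1 per haplotype.

    Illustrative only: assumes equal-length inputs and no missing calls.
    """
    n = len(site_a)
    if n == 0 or n != len(site_b):
        raise ValueError("sites must be non-empty and of equal length")
    p_a = sum(site_a) / n                      # frequency of allele 1 at site A
    p_b = sum(site_b) / n                      # frequency of allele 1 at site B
    p_ab = sum(1 for x, y in zip(site_a, site_b) if x == 1 and y == 1) / n
    d = p_ab - p_a * p_b                       # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return math.nan                        # undefined if either site is monomorphic
    return d * d / denom


# Two sites would count as "highly correlated" under the abstract's cutoff
# when r_squared(site_a, site_b) >= 0.5.
```

Under the abstract's threshold, a pair of sites would be flagged as highly correlated when this value is at least 0.5.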
|
78
|
Zhao L, Wit J, Svetec N, Begun DJ. Parallel Gene Expression Differences between Low and High Latitude Populations of Drosophila melanogaster and D. simulans. PLoS Genet 2015; 11:e1005184. [PMID: 25950438 PMCID: PMC4423912 DOI: 10.1371/journal.pgen.1005184] [Citation(s) in RCA: 68] [Impact Index Per Article: 7.6] [Received: 10/09/2014] [Accepted: 03/27/2015] [Indexed: 11/19/2022]
Abstract
Gene expression variation within species is relatively common; however, the role of natural selection in the maintenance of this variation is poorly understood. Here we investigate low and high latitude populations of Drosophila melanogaster and its sister species, D. simulans, to determine whether the two species show similar patterns of population differentiation, consistent with a role for spatially varying selection in maintaining gene expression variation. We compared at two temperatures the whole male transcriptome of D. melanogaster and D. simulans sampled from Panama City (Panama) and Maine (USA). We observed a significant excess of genes exhibiting differential expression in both species, consistent with parallel adaptation to heterogeneous environments. Moreover, the majority of genes showing parallel expression differentiation showed the same direction of differential expression in the two species and the magnitudes of expression differences between high and low latitude populations were correlated across species, further bolstering the conclusion that parallelism for expression phenotypes results from spatially varying selection. However, the species also exhibited important differences in expression phenotypes. For example, genotype × environment interaction was much more common across the genome in D. melanogaster. Highly differentiated SNPs between low and high latitudes were enriched in the 3’ UTRs and CDS of the geographically differently expressed genes in both species, consistent with an important role for cis-acting variants in driving local adaptation for expression-related phenotypes. While gene expression variation in natural populations is common, the population genetic processes responsible for the maintenance of this variation remain obscure. Here we study geographic differences in gene expression in recently established low and high latitude populations of two closely related species of Drosophila.
We observe substantial parallelism in expression differences and expression plasticity between populations, which supports the idea that spatially varying selection correlated with latitude contributes to the maintenance of gene expression variation in these species. Comparison of inter-population sequence differentiation and expression differentiation suggests that cis-acting variants play a role in geographic expression differentiation.
Affiliation(s)
- Li Zhao (corresponding author)
  - Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
- Janneke Wit
  - Department of Bioscience, Section of Integrative Ecology and Evolution, Aarhus University, Aarhus C, Denmark
- Nicolas Svetec
  - Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
- David J. Begun
  - Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
|
79
|
Transition Densities and Sample Frequency Spectra of Diffusion Processes with Selection and Variable Population Size. Genetics 2015; 200:601-17. [PMID: 25873633 DOI: 10.1534/genetics.115.175265] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Received: 01/29/2015] [Accepted: 04/09/2015] [Indexed: 11/18/2022]
Abstract
Advances in empirical population genetics have made apparent the need for models that simultaneously account for selection and demography. To address this need, we here study the Wright-Fisher diffusion under selection and variable effective population size. In the case of genic selection and piecewise-constant effective population sizes, we obtain the transition density by extending a recently developed method for computing an accurate spectral representation for a constant population size. Utilizing this extension, we show how to compute the sample frequency spectrum in the presence of genic selection and an arbitrary number of instantaneous changes in the effective population size. We also develop an alternate, efficient algorithm for computing the sample frequency spectrum using a moment-based approach. We apply these methods to answer the following questions: If neutrality is incorrectly assumed when there is selection, what effects does it have on demographic parameter estimation? Can the impact of negative selection be observed in populations that undergo strong exponential growth?
|
80
|
Ullastres A, Petit N, González J. Exploring the Phenotypic Space and the Evolutionary History of a Natural Mutation in Drosophila melanogaster. Mol Biol Evol 2015; 32:1800-14. [PMID: 25862139 PMCID: PMC4476160 DOI: 10.1093/molbev/msv061] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Indexed: 12/26/2022]
Abstract
A major challenge of modern Biology is elucidating the functional consequences of natural mutations. Although we have a good understanding of the effects of laboratory-induced mutations on the molecular- and organismal-level phenotypes, the study of natural mutations has lagged behind. In this work, we explore the phenotypic space and the evolutionary history of a previously identified adaptive transposable element insertion. We first combined several tests that capture different signatures of selection to show that there is evidence of positive selection in the regions flanking FBti0019386 insertion. We then explored several phenotypes related to known phenotypic effects of nearby genes, and having plausible connections to fitness variation in nature. We found that flies with FBti0019386 insertion had a shorter developmental time and were more sensitive to stress, which are likely to be the adaptive effect and the cost of selection of this mutation, respectively. Interestingly, these phenotypic effects are not consistent with a role of FBti0019386 in temperate adaptation as has been previously suggested. Indeed, a global analysis of the population frequency of FBti0019386 showed that climatic variables explain well the FBti0019386 frequency patterns only in Australia. Finally, although FBti0019386 insertion could be inducing the formation of heterochromatin by recruiting HP1a (Heterochromatin Protein 1a) protein, the insertion is associated with upregulation of sra in adult females. Overall, our integrative approach allowed us to shed light on the evolutionary history, the relevant fitness effects, and the likely molecular mechanisms of an adaptive mutation and highlights the complexity of natural genetic variants.
Affiliation(s)
- Anna Ullastres
  - Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
- Natalia Petit
  - Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
- Josefa González
  - Institute of Evolutionary Biology, CSIC-Universitat Pompeu Fabra, Barcelona, Spain
|
81
|
Pratdesaba R, Segarra C, Aguadé M. Inferring the demographic history of Drosophila subobscura from nucleotide variation at regions not affected by chromosomal inversions. Mol Ecol 2015; 24:1729-41. [PMID: 25776124 DOI: 10.1111/mec.13155] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Received: 08/01/2014] [Revised: 03/09/2015] [Accepted: 03/11/2015] [Indexed: 11/29/2022]
Abstract
Drosophila subobscura presents a rich and complex chromosomal inversion polymorphism. It can thus be considered a model system (i) to study the mechanisms originating inversions and how inversions affect the levels and patterns of variation in the inverted regions and (ii) to study adaptation at both the single-gene and chromosomal inversion levels. It is therefore important to infer its demographic history, as previous information indicated that its nucleotide variation is not at mutation-drift equilibrium. For that purpose, we sequenced 16 noncoding regions distributed across those parts of the J chromosome not affected by inversions in the studied population and possibly not by other selective events either. The pattern of variation detected in these 16 regions is similar to that previously reported within different chromosomal arrangements, suggesting that the latter results would thus mainly reflect recent demographic events rather than the partial selective sweep imposed by the origin and frequency increase of inversions. Among the simple demographic models considered in our Approximate Bayesian Computation analysis of variation at the 16 regions, the model best supported by the data implies a population size expansion soon after the penultimate glacial period. This model constitutes a better null model, and it is therefore an important resource for subsequent studies aiming, among other goals, to uncover selective events across the species' genome. Our results also highlight the importance of introducing the possibility of multiple hits in the coalescent simulations with an outgroup.
Affiliation(s)
- Roser Pratdesaba
  - Departament de Genètica, Facultat de Biologia and Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Diagonal 643, 08028, Barcelona, Spain
|
82
|
Kao JY, Zubair A, Salomon MP, Nuzhdin SV, Campo D. Population genomic analysis uncovers African and European admixture in Drosophila melanogaster populations from the south-eastern United States and Caribbean Islands. Mol Ecol 2015; 24:1499-509. [DOI: 10.1111/mec.13137] [Citation(s) in RCA: 65] [Impact Index Per Article: 7.2] [Received: 09/14/2014] [Revised: 02/23/2015] [Accepted: 02/25/2015] [Indexed: 01/19/2023]
Affiliation(s)
- Joyce Y. Kao
  - Section of Molecular and Computational Biology; Department of Biology; University of Southern California; 1050 Childs Way Los Angeles CA 90089 USA
- Asif Zubair
  - Section of Molecular and Computational Biology; Department of Biology; University of Southern California; 1050 Childs Way Los Angeles CA 90089 USA
- Matthew P. Salomon
  - Section of Molecular and Computational Biology; Department of Biology; University of Southern California; 1050 Childs Way Los Angeles CA 90089 USA
- Sergey V. Nuzhdin
  - Section of Molecular and Computational Biology; Department of Biology; University of Southern California; 1050 Childs Way Los Angeles CA 90089 USA
- Daniel Campo
  - Section of Molecular and Computational Biology; Department of Biology; University of Southern California; 1050 Childs Way Los Angeles CA 90089 USA
|
83
|
Garud NR, Messer PW, Buzbas EO, Petrov DA. Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps. PLoS Genet 2015; 11:e1005004. [PMID: 25706129 PMCID: PMC4338236 DOI: 10.1371/journal.pgen.1005004] [Citation(s) in RCA: 257] [Impact Index Per Article: 28.6] [Received: 09/15/2014] [Accepted: 01/14/2015] [Indexed: 11/18/2022]
Abstract
Adaptation from standing genetic variation or recurrent de novo mutation in large populations should commonly generate soft rather than hard selective sweeps. In contrast to a hard selective sweep, in which a single adaptive haplotype rises to high population frequency, in a soft selective sweep multiple adaptive haplotypes sweep through the population simultaneously, producing distinct patterns of genetic variation in the vicinity of the adaptive site. Current statistical methods were expressly designed to detect hard sweeps and most lack power to detect soft sweeps. This is particularly unfortunate for the study of adaptation in species such as Drosophila melanogaster, where all three confirmed cases of recent adaptation resulted in soft selective sweeps and where there is evidence that the effective population size relevant for recent and strong adaptation is large enough to generate soft sweeps even when adaptation requires mutation at a specific single site at a locus. Here, we develop a statistical test based on a measure of haplotype homozygosity (H12) that is capable of detecting both hard and soft sweeps with similar power. We use H12 to identify multiple genomic regions that have undergone recent and strong adaptation in a large population sample of fully sequenced Drosophila melanogaster strains from the Drosophila Genetic Reference Panel (DGRP). Visual inspection of the top 50 candidates reveals that in all cases multiple haplotypes are present at high frequencies, consistent with signatures of soft sweeps. We further develop a second haplotype homozygosity statistic (H2/H1) that, in combination with H12, is capable of differentiating hard from soft sweeps. Surprisingly, we find that the H12 and H2/H1 values for all top 50 peaks are much more easily generated by soft rather than hard sweeps. We discuss the implications of these results for the study of adaptation in Drosophila and in species with large census population sizes. 
Evolutionary adaptation is a process in which beneficial mutations increase in frequency in response to selective pressures. If these mutations were previously rare or absent from the population, adaptation should generate a characteristic signature in the genetic diversity around the adaptive locus, known as a selective sweep. Such selective sweeps can be distinguished into hard selective sweeps, where only a single adaptive mutation rises in frequency, or soft selective sweeps, where multiple adaptive mutations at the same locus sweep through the population simultaneously. Here we design a new statistical method that can identify both hard and soft sweeps in population genomic data and apply this method to a Drosophila melanogaster population genomic dataset consisting of 145 sequenced strains collected in North Carolina. We find that selective sweeps were abundant in the recent history of this population. Interestingly, we also find that practically all of the strongest and most recent sweeps show patterns that are more consistent with soft rather than hard sweeps. We discuss the implications of these findings for the discovery and quantification of adaptation from population genomic data in Drosophila and other species with large population sizes.
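The haplotype homozygosity statistics named in this abstract can be sketched in a few lines of Python. The abstract does not state the formulas, so the definitions used here are an assumption based on how these statistics are commonly described: with haplotype frequencies sorted so that p1 ≥ p2 ≥ ..., H1 is the sum of squared frequencies, H12 pools the two most common haplotypes before squaring, and H2 is H1 without the contribution of the most common haplotype. The function name and toy inputs are hypothetical.

```python
from collections import Counter


def haplotype_homozygosity(haplotypes):
    """Return (H1, H12, H2/H1) for a list of hashable haplotype labels.

    Assumed definitions (not stated in the abstract):
      H1  = sum_i p_i^2
      H12 = (p1 + p2)^2 + sum_{i>2} p_i^2 = H1 + 2*p1*p2
      H2  = H1 - p1^2
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    # Haplotype frequencies, most common first.
    freqs = sorted((c / n for c in Counter(haplotypes).values()), reverse=True)
    p1 = freqs[0]
    p2 = freqs[1] if len(freqs) > 1 else 0.0
    h1 = sum(p * p for p in freqs)
    h12 = h1 + 2 * p1 * p2
    h2 = h1 - p1 * p1
    return h1, h12, h2 / h1


# Toy window with one dominant haplotype (hard-sweep-like).
hard_like = ["A"] * 9 + ["B"]
# Toy window with two common haplotypes (soft-sweep-like).
soft_like = ["A"] * 4 + ["B"] * 4 + ["C"] * 2
```

In these toy windows both cases give an elevated H12, while H2/H1 is close to zero only when a single haplotype dominates, which is the intuition behind using the two statistics together.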
Affiliation(s)
- Nandita R. Garud (corresponding author)
  - Department of Genetics, Stanford University, Stanford, California, United States of America
  - Department of Biology, Stanford University, Stanford, California, United States of America
- Philipp W. Messer
  - Department of Biology, Stanford University, Stanford, California, United States of America
  - Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- Erkan O. Buzbas
  - Department of Biology, Stanford University, Stanford, California, United States of America
  - Department of Statistical Science, University of Idaho, Moscow, Idaho, United States of America
- Dmitri A. Petrov (corresponding author)
  - Department of Biology, Stanford University, Stanford, California, United States of America
|
84
|
Adaptive evolution of genes involved in the regulation of germline stem cells in Drosophila melanogaster and D. simulans. G3-GENES GENOMES GENETICS 2015; 5:583-92. [PMID: 25670770 PMCID: PMC4390574 DOI: 10.1534/g3.114.015875] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Indexed: 01/08/2023]
Abstract
Population genetic and comparative analyses in diverse taxa have shown that numerous genes involved in reproduction are adaptively evolving. Two genes involved in germline stem cell regulation, bag of marbles (bam) and benign gonial cell neoplasm (bgcn), have been shown previously to experience recurrent, adaptive evolution in both Drosophila melanogaster and D. simulans. Here we report a population genetic survey on eight additional genes involved in germline stem cell regulation in D. melanogaster and D. simulans that reveals all eight of these genes reject a neutral model of evolution in at least one test and one species after correction for multiple testing using a false-discovery rate of 0.05. These genes play diverse roles in the regulation of germline stem cells, suggesting that positive selection in response to several evolutionary pressures may be acting to drive the adaptive evolution of these genes.
|
85
|
The Drosophila genome nexus: a population genomic resource of 623 Drosophila melanogaster genomes, including 197 from a single ancestral range population. Genetics 2015; 199:1229-41. [PMID: 25631317 PMCID: PMC4391556 DOI: 10.1534/genetics.115.174664] [Citation(s) in RCA: 167] [Impact Index Per Article: 18.6] [Received: 10/02/2014] [Accepted: 01/23/2015] [Indexed: 12/30/2022]
Abstract
Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.
|
86
|
Population- and sex-biased gene expression in the excretion organs of Drosophila melanogaster. G3-GENES GENOMES GENETICS 2014; 4:2307-15. [PMID: 25246242 PMCID: PMC4267927 DOI: 10.1534/g3.114.013417] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Indexed: 12/17/2022]
Abstract
Within species, levels of gene expression typically vary greatly between tissues, sexes, individuals, and populations. To investigate gene expression variation between sexes and populations in a single somatic tissue, we performed a quantitative analysis of the Malpighian tubule transcriptome in adult males and females of Drosophila melanogaster derived from two distinct populations (one from sub-Saharan Africa and one from northern Europe). We identified 2308 genes that differed in expression between the sexes and 2474 genes that differed in expression between populations at a false discovery rate of 5%. We also identified more than 1000 genes that showed a sex-by-population interaction in their expression. The genes that differed in expression between sexes showed enrichment for a wide variety of functions, although only 55% of them overlapped with sex-biased genes identified in whole-fly studies. The genes expressed differentially between populations included several that were previously implicated in adaptive regulatory evolution, an excess of cytochrome P450 genes, and many genes that were not detected in previous studies of whole flies. Our results demonstrate that there is abundant intraspecific gene expression variation within a single somatic tissue and uncover new candidates for adaptive regulatory evolution between populations.
|
87
|
Jackson BC, Campos JL, Zeng K. The effects of purifying selection on patterns of genetic differentiation between Drosophila melanogaster populations. Heredity (Edinb) 2014; 114:163-74. [PMID: 25227256 PMCID: PMC4270736 DOI: 10.1038/hdy.2014.80] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Received: 02/21/2014] [Revised: 06/16/2014] [Accepted: 07/22/2014] [Indexed: 01/21/2023]
Abstract
Using the data provided by the Drosophila Population Genomics Project, we investigate factors that affect the genetic differentiation between Rwandan and French populations of D. melanogaster. By examining within-population polymorphisms, we show that sites in long introns (especially those >2000 bp) have significantly lower π (nucleotide diversity) and more low-frequency variants (as measured by Tajima's D, minor allele frequencies, and prevalence of variants that are private to one of the two populations) than short introns, suggesting a positive relationship between intron length and selective constraint. A similar analysis of protein-coding polymorphisms shows that 0-fold (degenerate) sites in more conserved genes are under stronger purifying selection than those in less conserved genes. There is limited evidence that selection on codon bias has an effect on differentiation (as measured by FST) at 4-fold (degenerate) sites, and 4-fold sites and sites in 8–30 bp of short introns ⩽65 bp have comparable FST values. Consistent with the expected effect of purifying selection, sites in long introns and 0-fold sites in conserved genes are less differentiated than those in short introns and less conserved genes, respectively. Genes in non-crossover regions (for example, the fourth chromosome) have very high FST values at both 0-fold and 4-fold degenerate sites, which is probably because of the large reduction in within-population diversity caused by tight linkage between many selected sites. Our analyses also reveal subtle statistical properties of FST, which arise when information from multiple single nucleotide polymorphisms is combined and can lead to the masking of important signals of selection.
Affiliation(s)
- B C Jackson
  - Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
- J L Campos
  - Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
- K Zeng
  - Department of Animal and Plant Sciences, University of Sheffield, Sheffield, UK
|
88
|
Fine-mapping and selective sweep analysis of QTL for cold tolerance in Drosophila melanogaster. G3-GENES GENOMES GENETICS 2014; 4:1635-45. [PMID: 24970882 PMCID: PMC4169155 DOI: 10.1534/g3.114.012757] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Indexed: 01/26/2023]
Abstract
There is a growing interest in investigating the relationship between genes with signatures of natural selection and genes identified in QTL mapping studies using combined population and quantitative genetics approaches. We dissected an X-linked interval of 6.2 Mb, which contains two QTL underlying variation in chill coma recovery time (CCRT) in Drosophila melanogaster from temperate (European) and tropical (African) regions. This resulted in two relatively small regions of 131 kb and 124 kb. The latter one co-localizes with a very strong selective sweep in the European population. We examined the genes within and near the sweep region individually using gene expression analysis and P-element insertion lines. Of the genes overlapping with the sweep, none appears to be related to CCRT. However, we have identified a new candidate gene of CCRT, brinker, which is located just outside the sweep region and is inducible by cold stress. We discuss these results in light of recent population genetics theories on quantitative traits.
|
89
|
Rius M, Darling JA. How important is intraspecific genetic admixture to the success of colonising populations? Trends Ecol Evol 2014; 29:233-42. [DOI: 10.1016/j.tree.2014.02.003] [Citation(s) in RCA: 329] [Impact Index Per Article: 32.9] [Received: 07/23/2013] [Revised: 02/06/2014] [Accepted: 02/07/2014] [Indexed: 11/16/2022]
|
90
|
Abstract
Drosophila melanogaster, an ancestrally African species, has recently spread throughout the world, associated with human activity. The species has served as the focus of many studies investigating local adaptation relating to latitudinal variation in non-African populations, especially those from the United States and Australia. These studies have documented the existence of shared, genetically determined phenotypic clines for several life history and morphological traits. However, there are no studies designed to formally address the degree of shared latitudinal differentiation at the genomic level. Here we present our comparative analysis of such differentiation. Not surprisingly, we find evidence of substantial, shared selection responses on the two continents, probably resulting from selection on standing ancestral variation. The polymorphic inversion In(3R)P has an important effect on this pattern, but considerable parallelism is also observed across the genome in regions not associated with inversion polymorphism. Interestingly, parallel latitudinal differentiation is observed even for variants that are not particularly strongly differentiated, which suggests that very large numbers of polymorphisms are targets of spatially varying selection in this species.
|
91
|
Blumenstiel JP, Chen X, He M, Bergman CM. An age-of-allele test of neutrality for transposable element insertions. Genetics 2014; 196:523-38. [PMID: 24336751 PMCID: PMC3914624 DOI: 10.1534/genetics.113.158147] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Received: 09/30/2013] [Accepted: 12/06/2013] [Indexed: 01/31/2023]
Abstract
How natural selection acts to limit the proliferation of transposable elements (TEs) in genomes has been of interest to evolutionary biologists for many years. To describe TE dynamics in populations, previous studies have used models of transposition-selection equilibrium that assume a constant rate of transposition. However, since TE invasions are known to happen in bursts through time, this assumption may not be reasonable. Here we propose a test of neutrality for TE insertions that does not rely on the assumption of a constant transposition rate. We consider the case of TE insertions that have been ascertained from a single haploid reference genome sequence. By conditioning on the age of an individual TE insertion allele (inferred by the number of unique substitutions that have occurred within the particular TE sequence since insertion), we determine the probability distribution of the insertion allele frequency in a population sample under neutrality. Taking models of varying population size into account, we then evaluate predictions of our model against allele frequency data from 190 retrotransposon insertions sampled from North American and African populations of Drosophila melanogaster. Using this nonequilibrium neutral model, we are able to explain ∼ 80% of the variance in TE insertion allele frequencies based on age alone. Controlling for both nonequilibrium dynamics of transposition and host demography, we provide evidence for negative selection acting against most TEs as well as for positive selection acting on a small subset of TEs. Our work establishes a new framework for the analysis of the evolutionary forces governing large insertion mutations like TEs, gene duplications, or other copy number variants.
Affiliation(s)
- Justin P. Blumenstiel
  - Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66049
- Xi Chen
  - Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66049
- Miaomiao He
  - Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, United Kingdom
- Casey M. Bergman
  - Faculty of Life Sciences, University of Manchester, Manchester M21 0RG, United Kingdom
|
92
|
Glaser-Schmitt A, Catalán A, Parsch J. Adaptive divergence of a transcriptional enhancer between populations of Drosophila melanogaster. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130024. [PMID: 24218636 DOI: 10.1098/rstb.2013.0024] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 11/12/2022] Open
Abstract
As species colonize new habitats they must adapt to the local environment. Much of this adaptation is thought to occur at the regulatory level; however, the relationships among genetic polymorphism, expression variation and adaptation are poorly understood. Drosophila melanogaster, which expanded from an ancestral range in sub-Saharan Africa around 15 000 years ago, represents an excellent model system for studying regulatory evolution. Here, we focus on the gene CG9509, which differs in expression between an African and a European population of D. melanogaster. The expression difference is caused by variation within a transcriptional enhancer adjacent to the CG9509 coding sequence. Patterns of sequence variation indicate that this enhancer was the target of recent positive selection, suggesting that the expression difference is adaptive. Analysis of the CG9509 enhancer in new population samples from Europe, Asia, northern Africa and sub-Saharan Africa revealed that sequence polymorphism is greatly reduced outside the ancestral range. A derived haplotype absent in sub-Saharan Africa is at high frequency in all other populations. These observations are consistent with a selective sweep accompanying the range expansion of the species. The new data help identify the sequence changes responsible for the difference in enhancer activity.
Affiliation(s)
- Amanda Glaser-Schmitt
- Department of Biology II, University of Munich (LMU), Grosshaderner Strasse 2, 82152 Planegg-Martinsried, Germany
|
93
|
Robinson MC, Stone EA, Singh ND. Population genomic analysis reveals no evidence for GC-biased gene conversion in Drosophila melanogaster. Mol Biol Evol 2013; 31:425-33. [PMID: 24214536 DOI: 10.1093/molbev/mst220] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Indexed: 12/19/2022] Open
Abstract
Gene conversion is the nonreciprocal exchange of genetic material between homologous chromosomes. Multiple lines of evidence from a variety of taxa strongly suggest that gene conversion events are biased toward GC-bearing alleles. However, in Drosophila, the data have largely been indirect and unclear, with some studies supporting the predictions of a GC-biased gene conversion model and other data showing contradictory findings. Here, we test whether gene conversion events are GC-biased in Drosophila melanogaster using whole-genome polymorphism and divergence data. Our results provide no support for GC-biased gene conversion and thus suggest that this process is unlikely to significantly contribute to patterns of polymorphism and divergence in this system.
Affiliation(s)
- Matthew C Robinson
- Department of Biological Sciences, Program in Genetics, North Carolina State University
|
94
|
Cogni R, Kuczynski C, Koury S, Lavington E, Behrman EL, O'Brien KR, Schmidt PS, Eanes WF. The intensity of selection acting on the couch potato gene: spatial-temporal variation in a diapause cline. Evolution 2013; 68:538-48. [DOI: 10.1111/evo.12291] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.9] [Received: 06/28/2013] [Accepted: 09/26/2013] [Indexed: 11/27/2022]
Affiliation(s)
- Rodrigo Cogni
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York
- Caitlin Kuczynski
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York
- Spencer Koury
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York
- Erik Lavington
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York
- Emily L. Behrman
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania
- Paul S. Schmidt
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania
- Walter F. Eanes
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York
|
95
|
Nadachowska-Brzyska K, Burri R, Olason PI, Kawakami T, Smeds L, Ellegren H. Demographic divergence history of pied flycatcher and collared flycatcher inferred from whole-genome re-sequencing data. PLoS Genet 2013; 9:e1003942. [PMID: 24244198 PMCID: PMC3820794 DOI: 10.1371/journal.pgen.1003942] [Citation(s) in RCA: 94] [Impact Index Per Article: 8.5] [Received: 03/21/2013] [Accepted: 09/23/2013] [Indexed: 01/05/2023] Open
Abstract
Profound knowledge of demographic history is a prerequisite for the understanding and inference of processes involved in the evolution of population differentiation and speciation. Together with new coalescent-based methods, the recent availability of genome-wide data enables investigation of differentiation and divergence processes at unprecedented depth. We combined two powerful approaches, full Approximate Bayesian Computation analysis (ABC) and pairwise sequentially Markovian coalescent modeling (PSMC), to reconstruct the demographic history of the split between two avian speciation model species, the pied flycatcher and collared flycatcher. Using whole-genome re-sequencing data from 20 individuals, we investigated 15 demographic models including different levels and patterns of gene flow, and changes in effective population size over time. ABC provided high support for recent (mode 0.3 my, range <0.7 my) species divergence, declines in effective population size of both species since their initial divergence, and unidirectional recent gene flow from pied flycatcher into collared flycatcher. The estimated divergence time and population size changes, supported by PSMC results, suggest that the ancestral species persisted through one of the glacial periods of middle Pleistocene and then split into two large populations that first increased in size before going through severe bottlenecks and expanding into their current ranges. Secondary contact appears to have been established after the last glacial maximum. The severity of the bottlenecks at the last glacial maximum is indicated by the discrepancy between current effective population sizes (20,000-80,000) and census sizes (5-50 million birds) of the two species. The recent divergence time challenges the supposition that avian speciation is a relatively slow process with extended times for intrinsic postzygotic reproductive barriers to evolve. 
Our study emphasizes the importance of using genome-wide data to unravel tangled demographic histories. Moreover, it constitutes one of the first examples of the inference of divergence history from genome-wide data in non-model species.
Affiliation(s)
- Reto Burri
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Pall I. Olason
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Takeshi Kawakami
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Linnéa Smeds
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
|
96
|
Campo D, Lehmann K, Fjeldsted C, Souaiaia T, Kao J, Nuzhdin SV. Whole-genome sequencing of two North American Drosophila melanogaster populations reveals genetic differentiation and positive selection. Mol Ecol 2013; 22:5084-97. [PMID: 24102956 DOI: 10.1111/mec.12468] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Received: 02/01/2012] [Revised: 07/15/2013] [Accepted: 07/16/2013] [Indexed: 11/29/2022]
Abstract
The prevailing demographic model for Drosophila melanogaster suggests that the colonization of North America occurred very recently from a subset of European flies that rapidly expanded across the continent. This model implies a sudden population growth and range expansion consistent with very low or no population subdivision. As flies adapt to new environments, local adaptation events may be expected. To describe demographic and selective events during North American colonization, we have generated a data set of 35 individual whole-genome sequences from inbred lines of D. melanogaster from a west coast US population (Winters, California, USA) and compared them with a public genome data set from Raleigh (Raleigh, North Carolina, USA). We analysed nuclear and mitochondrial genomes and described levels of variation and divergence within and between these two North American D. melanogaster populations. Both populations exhibit negative values of Tajima's D across the genome, a common signature of demographic expansion. We also detected a low but significant level of genome-wide differentiation between the two populations, as well as multiple allele surfing events, which can be the result of gene drift in local subpopulations on the edge of an expansion wave. In contrast to this genome-wide pattern, we uncovered a 50-kilobase segment in chromosome arm 3L that showed all the hallmarks of a soft selective sweep in both populations. A comparison of allele frequencies within this divergent region among six populations from three continents allowed us to cluster these populations in two differentiated groups, providing evidence for the action of natural selection on a global scale.
Affiliation(s)
- D Campo
- Molecular and Computational Biology, University of Southern California, Los Angeles, CA, 90089, USA
|
97
|
Werzner A, Pavlidis P, Ometto L, Stephan W, Laurent S. Selective sweep in the Flotillin-2 region of European Drosophila melanogaster. PLoS One 2013; 8:e56629. [PMID: 23437190 PMCID: PMC3578937 DOI: 10.1371/journal.pone.0056629] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 10/02/2012] [Accepted: 01/11/2013] [Indexed: 11/18/2022] Open
Abstract
Localizing genes that are subject to recent positive selection is a major goal of evolutionary biology. In the model organism Drosophila melanogaster many attempts have been made in recent years to identify such genes by conducting so-called genome scans of selection. These analyses consisted in typing a large number of genetic markers along the genomes of a sample of individuals and then identifying those loci that harbor patterns of genetic variation, which are compatible with the ones generated by a selective sweep. In this study we conduct an in-depth analysis of a genomic region located on the X chromosome of D. melanogaster that was identified as a potential target of recent positive selection by a previous genome scan of selection. To this end we re-sequenced 20 kilobases around the Flotillin-2 gene (Flo-2) and conducted a detailed analysis of the allele frequencies and linkage disequilibria observed in this new dataset. The results of this analysis reveal eight genetic novelties that are specific to temperate populations of D. melanogaster and that may have arisen during the expansion of the species outside its ancestral sub-Saharan habitat since about 16,000 years ago.
Affiliation(s)
- Annegret Werzner
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany
- Pavlos Pavlidis
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany
- Lino Ometto
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany
- Wolfgang Stephan
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany
- Stefan Laurent
- Section of Evolutionary Biology, Department of Biology II, University of Munich, Planegg-Martinsried, Germany
|