1
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. Evolution 2023; 77:2113-2127. [PMID: 37395482 PMCID: PMC10547124 DOI: 10.1093/evolut/qpad120] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 06/15/2023] [Accepted: 06/30/2023] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
2
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.15.545166. [PMID: 37398347 PMCID: PMC10312679 DOI: 10.1101/2023.06.15.545166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modelled by a realistic mutation rate and as part of a realistic distribution of fitness effects (DFE), as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modelled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false positive rates are in excess of true positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong. Teaser Text Outlier-based genomic scans have proven a popular approach for identifying loci that have potentially experienced recent positive selection. However, it has previously been shown that an evolutionarily appropriate baseline model that incorporates non-equilibrium population histories, purifying and background selection, and variation in mutation and recombination rates is necessary to reduce often extreme false positive rates when performing genomic scans. Here we evaluate the power to detect recurrent selective sweeps using common SFS-based and haplotype-based methods under these increasingly realistic models. We find that while these appropriate evolutionary baselines are essential to reduce false positive rates, the power to accurately detect recurrent selective sweeps is generally low across much of the biologically relevant parameter space.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Present address: Department of Biology, Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
3
|
Harris M, Garud NR. Enrichment of Hard Sweeps on the X Chromosome in Drosophila melanogaster. Mol Biol Evol 2022; 40:6955808. [PMID: 36546413 PMCID: PMC9825254 DOI: 10.1093/molbev/msac268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 11/11/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
The characteristic properties of the X chromosome, such as male hemizygosity and its unique inheritance pattern, expose it to natural selection in a way that can be different from the autosomes. Here, we investigate the differences in the tempo and mode of adaptation on the X chromosome and autosomes in a population of Drosophila melanogaster. Specifically, we test the hypothesis that due to hemizygosity and a lower effective population size on the X, the relative proportion of hard sweeps, which are expected when adaptation is gradual, compared with soft sweeps, which are expected when adaptation is rapid, is greater on the X than on the autosomes. We quantify the incidence of hard versus soft sweeps in North American D. melanogaster population genomic data with haplotype homozygosity statistics and find an enrichment of the proportion of hard versus soft sweeps on the X chromosome compared with the autosomes, confirming predictions we make from simulations. Understanding these differences may enable a deeper understanding of how important phenotypes arise as well as the impact of fundamental evolutionary parameters on adaptation, such as dominance, sex-specific selection, and sex-biased demography.
Collapse
Affiliation(s)
- Mariana Harris
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA
| | | |
Collapse
|
4
|
Abstract
Drosophila melanogaster, a small dipteran of African origin, represents one of the best-studied model organisms. Early work in this system has uniquely shed light on the basic principles of genetics and resulted in a versatile collection of genetic tools that allow to uncover mechanistic links between genotype and phenotype. Moreover, given its worldwide distribution in diverse habitats and its moderate genome-size, Drosophila has proven very powerful for population genetics inference and was one of the first eukaryotes whose genome was fully sequenced. In this book chapter, we provide a brief historical overview of research in Drosophila and then focus on recent advances during the genomic era. After describing different types and sources of genomic data, we discuss mechanisms of neutral evolution including the demographic history of Drosophila and the effects of recombination and biased gene conversion. Then, we review recent advances in detecting genome-wide signals of selection, such as soft and hard selective sweeps. We further provide a brief introduction to background selection, selection of noncoding DNA and codon usage and focus on the role of structural variants, such as transposable elements and chromosomal inversions, during the adaptive process. Finally, we discuss how genomic data helps to dissect neutral and adaptive evolutionary mechanisms that shape genetic and phenotypic variation in natural populations along environmental gradients. In summary, this book chapter serves as a starting point to Drosophila population genomics and provides an introduction to the system and an overview to data sources, important population genetic concepts and recent advances in the field.
Collapse
|
5
|
Comeron JM. Background selection as null hypothesis in population genomics: insights and challenges from Drosophila studies. Philos Trans R Soc Lond B Biol Sci 2018; 372:rstb.2016.0471. [PMID: 29109230 PMCID: PMC5698629 DOI: 10.1098/rstb.2016.0471] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/04/2017] [Indexed: 12/11/2022] Open
Abstract
The consequences of selection at linked sites are multiple and widespread across the genomes of most species. Here, I first review the main concepts behind models of selection and linkage in recombining genomes, present the difficulty in parametrizing these models simply as a reduction in effective population size (Ne) and discuss the predicted impact of recombination rates on levels of diversity across genomes. Arguments are then put forward in favour of using a model of selection and linkage with neutral and deleterious mutations (i.e. the background selection model, BGS) as a sensible null hypothesis for investigating the presence of other forms of selection, such as balancing or positive. I also describe and compare two studies that have generated high-resolution landscapes of the predicted consequences of selection at linked sites in Drosophila melanogaster. Both studies show that BGS can explain a very large fraction of the observed variation in diversity across the whole genome, thus supporting its use as null model. Finally, I identify and discuss a number of caveats and challenges in studies of genetic hitchhiking that have been often overlooked, with several of them sharing a potential bias towards overestimating the evidence supporting recent selective sweeps to the detriment of a BGS explanation. One potential source of bias is the analysis of non-equilibrium populations: it is precisely because models of selection and linkage predict variation in Ne across chromosomes that demographic dynamics are not expected to be equivalent chromosome- or genome-wide. Other challenges include the use of incomplete genome annotations, the assumption of temporally stable recombination landscapes, the presence of genes under balancing selection and the consequences of ignoring non-crossover (gene conversion) recombination events. This article is part of the themed issue ‘Evolutionary causes and consequences of recombination rate variation in sexual organisms’.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, IA 52242, USA .,Interdisciplinary Program in Genetics, University of Iowa, Iowa City, IA 52242, USA
| |
Collapse
|
6
|
|
7
|
Stephan W. Genetic hitchhiking versus background selection: the controversy and its implications. Philos Trans R Soc Lond B Biol Sci 2010; 365:1245-53. [PMID: 20308100 DOI: 10.1098/rstb.2009.0278] [Citation(s) in RCA: 107] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The controversy on the relative importance of background selection (BGS; against deleterious mutations) and genetic hitchhiking (associated with positive directional selection) in explaining patterns of nucleotide variation in natural populations stimulated research activities for almost a decade. Despite efforts from many theorists and empiricists, fundamental questions are still open, in particular, for the population genetics of regions of reduced recombination. On the other hand, the development of the BGS and hitchhiking models and the long struggle to distinguish them, all of which seem to be a purely academic exercise, led to quite practical advances that are useful for the identification of genes involved in adaptation and domestication.
Collapse
Affiliation(s)
- Wolfgang Stephan
- Section of Evolutionary Biology, Department of Biology II, Ludwig-Maximilians University Munich, , Grosshaderner Strasse 2, 82152 Planegg, Germany.
| |
Collapse
|
8
|
Orengo DJ, Aguadé M. Uncovering the footprint of positive selection on the X chromosome of Drosophila melanogaster. Mol Biol Evol 2010; 27:153-60. [PMID: 19759233 DOI: 10.1093/molbev/msp220] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
A usual approach to detect the spatial footprint left by recent adaptive events has been to follow up putative candidates emerging from multilocus scans of variation by sequencing additional fragments. We have used a similar experimental and analytical approach to study variation at 15 independently evolving and randomly chosen regions of the X chromosome of Drosophila melanogaster. These incompletely sequenced regions, each extending over approximately 40 kb, were subjected to two tests of positive selection that take into account the spatial distribution of nucleotide variation. Our analysis of variation at these genomic regions in a European population of D. melanogaster has allowed us to uncover a candidate region for positive selection and to empirically evaluate the comparative performance of the two tests of selection under a bottleneck scenario. Moreover, the boundaries here estimated for both the rate of adaptive substitution (delta) and the average selection coefficient (s) would support previous estimates obtained by maximum likelihood that suggest rather strong but uncommon positive selection.
Collapse
Affiliation(s)
- Dorcas J Orengo
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Barcelona, Spain.
| | | |
Collapse
|
9
|
Abstract
A major focus of modern population genetics involves using polymorphism data in order to identify regions impacted by recent positive selection (so-called genomic scans). Recently, methodology has been proposed not to identify individual loci, but rather to quantify genomic recurrent hitchhiking (RHH) parameters using this same type of polymorphism data. I here examine to what extent genomic scans for adaptively important loci may be informed by recently estimated RHH parameters (and vice versa). I find that published results are largely incompatible with one another, with approximately an order of magnitude more sweeps being empirically identified than would be predicted under RHH estimates. Results demonstrate that making this connection between SHH and RHH models is crucial for a more complete and accurate characterization of adaptive evolution.
Collapse
|
10
|
González J, Petrov DA. The adaptive role of transposable elements in the Drosophila genome. Gene 2009; 448:124-33. [PMID: 19555747 DOI: 10.1016/j.gene.2009.06.008] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2009] [Revised: 06/11/2009] [Accepted: 06/12/2009] [Indexed: 11/25/2022]
Abstract
Transposable elements (TEs) are short DNA sequences with the capacity to move between different sites in the genome. This ability provides them with the capacity to mutate the genome in many different ways, from subtle regulatory mutations to gross genomic rearrangements. The potential adaptive significance of TEs was recognized by those involved in their initial discovery although it was hotly debated afterwards. For more than two decades, TEs were considered to be intragenomic parasites leading to almost exclusively detrimental effects to the host genome. The sequencing of the Drosophila melanogaster genome provided an unprecedented opportunity to study TEs and led to the identification of the first TE-induced adaptations in this species. These studies were followed by a systematic genome-wide search for adaptive insertions that allowed for the first time to infer that TEs contribute substantially to adaptive evolution. This study also revealed that there are at least twice as many TE-induced adaptations that remain to be identified. To gain a better understanding of the adaptive role of TEs in the genome we clearly need to (i) identify as many adaptive TEs as possible in a range of Drosophila species as well as (ii) carry out in-depth investigations of the effects of adaptive TEs on as many phenotypes as possible.
Collapse
Affiliation(s)
- Josefa González
- Department of Biology, 371 Serra St. Stanford University, Stanford, CA 94305-3020, USA.
| | | |
Collapse
|
11
|
Tian F, Stevens NM, Buckler ES. Tracking footprints of maize domestication and evidence for a massive selective sweep on chromosome 10. Proc Natl Acad Sci U S A 2009; 106 Suppl 1:9979-86. [PMID: 19528660 PMCID: PMC2702805 DOI: 10.1073/pnas.0901122106] [Citation(s) in RCA: 93] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Maize domestication is one of the greatest feats of artificial selection and evolution, wherein a weedy plant in Central Mexico was converted through human-mediated selection into the most productive crop in the world. In fact, the changes were so astounding that it took much of the last century to identify modern maize's true ancestor. Through modern genetic studies, the molecular basis of this evolution is being unraveled. Maize's new morphology and adaptation to diverse environments required selection at thousands of loci, and we are beginning to understand the magnitude and rates of these genetic changes. Most of the known major genes have experienced strong selection, but only small regions surrounding the selected genes exhibit substantially reduced genetic diversity. Here, we report the discovery of a large region on chromosome 10 involved in adaptation or domestication that has been the target of strong selection during maize domestication. Unlike previously described regions in the maize genome, 1.1 Mb and >15 genes lost genetic diversity during selection at this region. Finally, the prospects of a detailed understanding of maize evolution are discussed with consideration of both top-down and bottom-up approaches.
Collapse
Affiliation(s)
| | | | - Edward S. Buckler
- Institute for Genomic Diversity and
- Department of Plant Breeding and Genetics, Cornell University, Ithaca, NY 14850; and
- United States Department of Agriculture, Agricultural Research Service, Ithaca, NY 14850
| |
Collapse
|
12
|
Species-wide genetic variation and demographic history of Drosophila sechellia, a species lacking population structure. Genetics 2009; 182:1197-206. [PMID: 19506309 DOI: 10.1534/genetics.108.092080] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Long-term persistence of species characterized by a reduced effective population size is still a matter of debate that would benefit from the description of new relevant biological models. The island endemic specialist Drosophila sechellia has received considerable attention in evolutionary genetic studies. On the basis of the analysis of a limited number of strains, a handful of studies have reported a strikingly depleted level of genetic variation but little is known about its demographic history. We extended analyses of nucleotide polymorphism in D. sechellia to a species-wide level using 10 nuclear genes sequenced in 10 populations. We confirmed that D. sechellia exhibits little nucleotide-sequence variation. It is characterized by a low effective population size, >10-fold lower than that of D. simulans, which ranks D. sechellia as the least genetically diverse Drosophila species. No obvious population subdivision was detected despite its fragmented geographic distribution on different islands. We used approximate Bayesian computation (ABC) to test for demographic scenarios compatible with the geological history of the Seychelles and the ecology of D. sechellia. We found that while bottlenecks cannot account for the pattern of molecular evolution observed in this species, scenarios close to the null hypothesis of a constant population size are well supported. We discuss these findings with regard to adaptive features specific to D. sechellia and its life-history strategy.
Collapse
|
13
|
Singh ND, Aquadro CF, Clark AG. Estimation of fine-scale recombination intensity variation in the white-echinus interval of D. melanogaster. J Mol Evol 2009; 69:42-53. [PMID: 19504037 DOI: 10.1007/s00239-009-9250-5] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2008] [Revised: 04/27/2009] [Accepted: 05/15/2009] [Indexed: 01/19/2023]
Abstract
Accurate assessment of local recombination rate variation is crucial for understanding the recombination process and for determining the impact of natural selection on linked sites. In Drosophila, local recombination intensity has been estimated primarily by statistical approaches, by estimating the local slope of the relationship between the physical and genetic maps. However, these estimates are limited in resolution and, as a result, the physical scale at which recombination intensity varies in Drosophila is largely unknown. Although there is some evidence suggesting as much as a 40-fold variation in crossover rate at a local scale in D. pseudoobscura, little is known about the fine-scale structure of recombination rate variation in D. melanogaster. Here we experimentally examine the fine-scale distribution of crossover events in a 1.2-Mb region on the D. melanogaster X chromosome using a classic genetic mapping approach. Our results show that crossover frequency is significantly heterogeneous within this region, varying approximately 3.5-fold. Simulations suggest that this degree of heterogeneity is sufficient to affect levels of standing nucleotide diversity, although the magnitude of this effect is small. We recover no statistical association between empirical estimates of nucleotide diversity and recombination intensity, which is likely due to the limited number of loci sampled in our population genetic data set. However, codon bias is significantly negatively correlated with fine-scale recombination intensity estimates, as expected. Our results shed light on the relevant physical scale to consider in evolutionary analyses relating to recombination rate and highlight the motivations to increase the resolution of the recombination map in Drosophila.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| | | | | |
Collapse
|
14
|
Loywyck V, Bed'hom B, Pinard-van der LAAN MH, Pitel F, Verrier É, Bijma P. Evolution of the polymorphism at molecular markers in QTL and non-QTL regions in selected chicken lines ( Open Access publication). Genet Sel Evol 2008. [DOI: 10.1051/gse:2008025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
15
|
Chapman MA, Pashley CH, Wenzler J, Hvala J, Tang S, Knapp SJ, Burke JM. A genomic scan for selection reveals candidates for genes involved in the evolution of cultivated sunflower (Helianthus annuus). THE PLANT CELL 2008; 20:2931-45. [PMID: 19017747 PMCID: PMC2613673 DOI: 10.1105/tpc.108.059808] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Revised: 10/22/2008] [Accepted: 11/04/2008] [Indexed: 05/18/2023]
Abstract
Genomic scans for selection are a useful tool for identifying genes underlying phenotypic transitions. In this article, we describe the results of a genome scan designed to identify candidates for genes targeted by selection during the evolution of cultivated sunflower. This work involved screening 492 loci derived from ESTs on a large panel of wild, primitive (i.e., landrace), and improved sunflower (Helianthus annuus) lines. This sampling strategy allowed us to identify candidates for selectively important genes and investigate the likely timing of selection. Thirty-six genes showed evidence of selection during either domestication or improvement based on multiple criteria, and a sequence-based test of selection on a subset of these loci confirmed this result. In view of what is known about the structure of linkage disequilibrium across the sunflower genome, these genes are themselves likely to have been targeted by selection, rather than being merely linked to the actual targets. While the selection candidates showed a broad range of putative functions, they were enriched for genes involved in amino acid synthesis and protein catabolism. Given that a similar pattern has been detected in maize (Zea mays), this finding suggests that selection on amino acid composition may be a general feature of the evolution of crop plants. In terms of genomic locations, the selection candidates were significantly clustered near quantitative trait loci (QTL) that contribute to phenotypic differences between wild and cultivated sunflower, and specific instances of QTL colocalization provide some clues as to the roles that these genes may have played during sunflower evolution.
Collapse
Affiliation(s)
- Mark A Chapman
- Department of Plant Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | |
Collapse
|
16
|
Pavlidis P, Hutter S, Stephan W. A population genomic approach to map recent positive selection in model species. Mol Ecol 2008; 17:3585-98. [PMID: 18627454 DOI: 10.1111/j.1365-294x.2008.03852.x] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Based on nearly complete genome sequences from a variety of organisms data on naturally occurring genetic variation on the scale of hundreds of loci to entire genomes have been collected in recent years. In parallel, new statistical tests have been developed to infer evidence of recent positive selection from these data and to localize the target regions of selection in the genome. These methods have now been successfully applied to Drosophila melanogaster, humans, mice and a few plant species. In genomic regions of normal recombination rates, the targets of positive selection have been mapped down to the level of individual genes.
Collapse
Affiliation(s)
- P Pavlidis
- Department of Biology, Section of Evolutionary Biology, University of Munich, Grosshaderner Strasse 2, 82152 Planegg-Martinsried, Germany.
| | | | | |
Collapse
|
17
|
Hitchhiking mapping reveals a candidate genomic region for natural selection in three-spined stickleback chromosome VIII. Genetics 2008; 178:453-65. [PMID: 18202387 DOI: 10.1534/genetics.107.078782] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Identification of genes and genomic regions under directional natural selection has become one of the major goals in evolutionary genetics, but relatively little work to this end has been done by applying hitchhiking mapping to wild populations. Hitchhiking mapping starts from a genome scan using a randomly spaced set of molecular markers followed by a fine-scale analysis in the flanking regions of the candidate regions under selection. We used the hitchhiking mapping approach to narrow down a selective sweep in the genomic region flanking a candidate locus (Stn90) in chromosome VIII in the three-spined stickleback (Gasterosteus aculeatus). Twenty-four microsatellite markers were screened in an approximately 800-kb region around the candidate locus in three marine and four freshwater populations. The patterns of genetic diversity and differentiation in the candidate region were compared to those of a putatively neutral set of markers. The Bayesian FST-test indicated an elevated genetic differentiation, deviating significantly from neutral expectations, at a continuous region of approximately 20 kb upstream from the candidate locus. Furthermore, a method developed for an array of microsatellite markers rejected neutrality in a region of approximately 90 kb flanking the candidate locus supporting the selective sweep hypothesis. Likewise, the genomewide pattern of genetic diversity differed from the candidate region in a bottleneck analysis suggesting that selection, rather than demography, explains the reduced genetic diversity at the candidate interval. The neutrality tests suggest that the selective sweep had occurred mainly in the Lake Pulmanki population, but the results from bottleneck analyses indicate that selection might have operated in other populations as well. These results suggest that the narrow interval around locus Stn90 has likely been under directional selection, but the region contains several predicted genes, each of which can be the actual targets of selection. Understanding of the functional significance of this genomic region in an ecological context will require a more detailed sequence analysis.
Collapse
|
18
|
Ojeda A, Huang LS, Ren J, Angiolillo A, Cho IC, Soto H, Lemús-Flores C, Makuza SM, Folch JM, Pérez-Enciso M. Selection in the making: a worldwide survey of haplotypic diversity around a causative mutation in porcine IGF2. Genetics 2008; 178:1639-52. [PMID: 18245828 PMCID: PMC2278092 DOI: 10.1534/genetics.107.084269] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2007] [Accepted: 01/09/2008] [Indexed: 11/18/2022] Open
Abstract
Domestic species allow us to study dramatic evolutionary changes at an accelerated rate due to the effectiveness of modern breeding techniques and the availability of breeds that have undergone distinct selection pressures. We present a worldwide survey of haplotype variability around a known causative mutation in porcine gene IGF2, which increases lean content. We genotyped 34 SNPs spanning 27 kb in 237 domestic pigs and 162 wild boars. Although the selective process had wiped out variability for at least 27 kb in the haplotypes carrying the mutation, there was no indication of an overall reduction in genetic variability of international vs. European local breeds; there was also no evidence of a reduction in variability caused by domestication. The haplotype structure and a plot of Tajima's D against the frequency of the causative mutation across breeds suggested a temporal pattern, where each breed corresponded to a different selective stage. This was observed comparing the haplotype neighbor-joining (NJ) trees of breeds that have undergone increasing selection pressures for leanness, e.g., European local breeds vs. Pietrain. These results anticipate that comparing current domestic breeds will decisively help to recover the genetic history of domestication and contemporary selective processes.
Collapse
Affiliation(s)
- A Ojeda
- Institut Català de Recerca i Estudis Avançats, 08010 Barcelona, Spain
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Jensen JD, Thornton KR, Aquadro CF. Inferring selection in partially sequenced regions. Mol Biol Evol 2007; 25:438-46. [PMID: 18165259 DOI: 10.1093/molbev/msm273] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A common approach for identifying loci influenced by positive selection involves scanning large portions of the genome for regions that are inconsistent with the neutral equilibrium model or represent outliers relative to the empirical distribution of some aspect of the data. Once identified, partial sequence is generated spanning this more localized region in order to quantify the site-frequency spectrum and evaluate the data with tests of neutrality and selection. This method is widely used as partial sequencing is less expensive with regard to both time and money. Here, we demonstrate that this approach can lead to biased maximum likelihood estimates of selection parameters and reduced rejection rates, with some parameter combinations resulting in clearly misleading results. Most significantly, for a commonly used sample size in Drosophila population genetics (i.e., n = 12), the estimate of the target of selection has a large mean square error and the strength of selection is severely under estimated when the true selected site has not been sampled. We propose sequencing approaches that are much more likely to accurately localize the target and estimate the strength of selection. Additionally, we examine the performance of a commonly used test of selection under a variety of recurrent and single sweep models.
Collapse
Affiliation(s)
- Jeffrey D Jensen
- Department of Molecular Biology and Genetics, Cornell University, USA.
| | | | | |
Collapse
|
20
|
Derome N, Baudry E, Ogereau D, Veuille M, Montchamp-Moreau C. Selective sweeps in a 2-locus model for sex-ratio meiotic drive in Drosophila simulans. Mol Biol Evol 2007; 25:409-16. [PMID: 18071199 DOI: 10.1093/molbev/msm269] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A way to identify loci subject to positive selection is to detect the signature of selective sweeps in given chromosomal regions. It is revealed by the departure of DNA polymorphism patterns from the neutral equilibrium predicted by coalescent theory. We surveyed DNA sequence variation in a region formerly identified as causing "sex-ratio" meiotic drive in Drosophila simulans. We found evidence that this system evolved by positive selection at 2 neighboring loci, which thus appear to be required simultaneously for meiotic drive to occur. The 2 regions are approximately 150-kb distant, corresponding to a genetic distance of 0.1 cM. The presumably large transmission advantage of chromosomes carrying meiotic drive alleles at both loci has not erased the individual signature of selection at each locus. This chromosome fragment combines a high level of linkage disequilibrium between the 2 critical regions with a high recombination rate. As a result, 2 characteristic traits of selective sweeps--the reduction of variation and the departure from selective neutrality in haplotype tests--show a bimodal pattern. Linkage disequilibrium level indicates that, in the natural population from Madagascar used in this study, the selective sweep may be as recent as 100 years.
Collapse
|
21
|
Macpherson JM, Sella G, Davis JC, Petrov DA. Genomewide spatial correspondence between nonsynonymous divergence and neutral polymorphism reveals extensive adaptation in Drosophila. Genetics 2007; 177:2083-99. [PMID: 18073425 PMCID: PMC2219485 DOI: 10.1534/genetics.107.080226] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2007] [Accepted: 09/18/2007] [Indexed: 11/18/2022] Open
Abstract
The effect of recurrent selective sweeps is a spatially heterogeneous reduction in neutral polymorphism throughout the genome. The pattern of reduction depends on the selective advantage and recurrence rate of the sweeps. Because many adaptive substitutions responsible for these sweeps also contribute to nonsynonymous divergence, the spatial distribution of nonsynonymous divergence also reflects the distribution of adaptive substitutions. Thus, the spatial correspondence between neutral polymorphism and nonsynonymous divergence may be especially informative about the process of adaptation. Here we study this correspondence using genomewide polymorphism data from Drosophila simulans and the divergence between D. simulans and D. melanogaster. Focusing on highly recombining portions of the autosomes, at a spatial scale appropriate to the study of selective sweeps, we find that neutral polymorphism is both lower and, as measured by a new statistic Q(S), less homogeneous where nonsynonymous divergence is higher and that the spatial structure of this correlation is best explained by the action of strong recurrent selective sweeps. We introduce a method to infer, from the spatial correspondence between polymorphism and divergence, the rate and selective strength of adaptation. Our results independently confirm a high rate of adaptive substitution (approximately 1/3000 generations) and newly suggest that many adaptations are of surprisingly great selective effect (approximately 1%), reducing the effective population size by approximately 15% even in highly recombining regions of the genome.
Collapse
|
22
|
Jensen JD, Wong A, Aquadro CF. Approaches for identifying targets of positive selection. Trends Genet 2007; 23:568-77. [PMID: 17959267 DOI: 10.1016/j.tig.2007.08.009] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2007] [Revised: 08/20/2007] [Accepted: 08/20/2007] [Indexed: 01/25/2023]
Abstract
Despite significant advancements in both empirical and theoretical population genetics throughout the past century, fundamental questions about the evolutionary forces that shape genomic diversity remain unresolved. Perhaps foremost among these are the strength and frequency of adaptive evolution. To quantify these parameters, statistical tools are needed that are capable of effectively identifying targets of positive selection throughout the genome in an unbiased manner, and functional approaches are needed that are capable of connecting these identified genotypes with the resulting adaptively significant phenotypes. Here we review recent advancements in both statistical and empirical methodology, and discuss important challenges and opportunities that remain as researchers continue to uncouple the relative importance of stochastic and deterministic factors in the evolution of natural populations.
Collapse
Affiliation(s)
- Jeffrey D Jensen
- Department of Molecular Biology and Genetics, Biotechnology Building, Cornell University, Ithaca, NY 14853, USA
| | | | | |
Collapse
|
23
|
Jensen JD, Bauer DuMont VL, Ashmore AB, Gutierrez A, Aquadro CF. Patterns of sequence variability and divergence at the diminutive gene region of Drosophila melanogaster: complex patterns suggest an ancestral selective sweep. Genetics 2007; 177:1071-85. [PMID: 17720938 PMCID: PMC2034614 DOI: 10.1534/genetics.106.069468] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2006] [Accepted: 08/19/2007] [Indexed: 11/18/2022] Open
Abstract
To identify putatively swept regions of the Drosophila melanogaster genome, we performed a microsatellite screen spanning a 260-kb region of the X chromosome in populations from Zimbabwe, Ecuador, the United States, and China. Among the regions identified by this screen as showing a complex pattern of reduced heterozygosity and a skewed frequency spectrum was the gene diminutive (dm). To investigate the microsatellite findings, nucleotide sequence polymorphism data were generated in populations from both China and Zimbabwe spanning a 25-kb region and encompassing dm. Analysis of the sequence data reveals strongly reduced nucleotide variation across the entire gene region in both the non-African and the African populations, an extended haplotype pattern, and structured linkage disequilibrium, as well as a rejection of neutrality in favor of selection using a composite likelihood-ratio test. Additionally, unusual patterns of synonymous site evolution were observed at the second exon of this locus. On the basis of simulation studies as well as recently proposed methods for distinguishing between selection and nonequilibrium demography, we find that this "footprint" is best explained by a selective sweep in the ancestral population, the signal of which has been somewhat blurred via founder effects in the non-African samples.
Collapse
Affiliation(s)
- Jeffrey D Jensen
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | | | |
Collapse
|
24
|
Kane NC, Rieseberg LH. Genetics and evolution of weedy Helianthus annuus populations: adaptation of an agricultural weed. Mol Ecol 2007; 17:384-94. [PMID: 17725567 DOI: 10.1111/j.1365-294x.2007.03467.x] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Agricultural weeds are a major cost to economies throughout the world, and have evolved from numerous plant species in many different plant families. Despite their ubiquity, we do not yet know how easily or often weeds evolve from their wild ancestors or the kinds of genes underlying their evolution. Here we report on the evolution of weedy populations of the common sunflower Helianthus annuus. We analysed 106 microsatellites in 48 individuals from each of six wild and four weed populations of the species. The statistical tests lnRV and lnRH were used to test for significant reductions in genetic variability at each locus in weedy populations compared to nearby wild populations. Between 1% and 6% of genes were significant outliers with reduced variation in weedy populations, implying that a small but not insignificant fraction of the genome may be under selection and involved in adaptation of weedy sunflowers. However, there did not appear to be a substantial reduction in variation across the genome, suggesting that effective population sizes have remained very large during the recent evolution of these weedy populations. Additional analyses showed that weedy populations are more closely related to nearby wild populations than to each other, implying that weediness likely evolved multiple times within the species, although a single origin followed by gene flow with local populations cannot be ruled out. Together, our results point to the relative ease with which weedy forms of this species can evolve and persist despite the potentially high levels of geneflow with nearby wild populations.
Collapse
Affiliation(s)
- Nolan C Kane
- Department of Biology, Jordan Hall, 1001 E. Third Street, Indiana University, Bloomington, Indiana 47405, USA.
| | | |
Collapse
|
25
|
Thornton KR, Jensen JD, Becquet C, Andolfatto P. Progress and prospects in mapping recent selection in the genome. Heredity (Edinb) 2007; 98:340-8. [PMID: 17473869 DOI: 10.1038/sj.hdy.6800967] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
One of the central goals of evolutionary biology is to understand the genetic basis of adaptive evolution. The availability of nearly complete genome sequences from a variety of organisms has facilitated the collection of data on naturally occurring genetic variation on the scale of hundreds of loci to whole genomes. Such data have changed the focus of molecular population genetics from making inferences about adaptive evolution at single loci to identifying which loci, out of hundreds to thousands, have been recent targets of natural selection. A major challenge in this effort is distinguishing the effects of selection from those of the demographic history of populations. Here we review some current progress and remaining challenges in the field.
Collapse
Affiliation(s)
- K R Thornton
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | | | | | | |
Collapse
|
26
|
Macdonald SJ, Long AD. Joint estimates of quantitative trait locus effect and frequency using synthetic recombinant populations of Drosophila melanogaster. Genetics 2007; 176:1261-81. [PMID: 17435224 PMCID: PMC1894589 DOI: 10.1534/genetics.106.069641] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We develop and implement a strategy to map QTL in two synthetic populations of Drosophila melanogaster each initiated with eight inbred founder strains. These recombinant populations allow simultaneous estimates of QTL location, effect, and frequency. Five X-linked QTL influencing bristle number were resolved to intervals of approximately 1.3 cM. We confirm previous observations of bristle number QTL distal to 4A at the tip of the chromosome and identify two novel QTL in 7F-8C, an interval that does not include any classic bristle number candidate genes. If QTL at the tip of the X are biallelic they appear to be intermediate in frequency, although there is evidence that these QTL may reside in multiallelic haplotypes. Conversely, the two QTL mapping to the middle of the X chromosome are likely rare: in each case the minor allele is observed in only 1 of the 16 founders. Assuming additivity and biallelism we estimate that identified QTL contribute 1.0 and 8.7%, respectively, to total phenotypic variation in male abdominal and sternopleural bristle number in nature. Models that seek to explain the maintenance of genetic variation make different predictions about the population frequency of QTL alleles. Thus, mapping QTL in eight-way recombinant populations can distinguish between these models.
Collapse
Affiliation(s)
- Stuart J Macdonald
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697, USA.
| | | |
Collapse
|
27
|
Thornton KR, Jensen JD. Controlling the false-positive rate in multilocus genome scans for selection. Genetics 2007; 175:737-50. [PMID: 17110489 PMCID: PMC1800626 DOI: 10.1534/genetics.106.064642] [Citation(s) in RCA: 138] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2006] [Accepted: 10/18/2006] [Indexed: 11/18/2022] Open
Abstract
Rapid typing of genetic variation at many regions of the genome is an efficient way to survey variability in natural populations in an effort to identify segments of the genome that have experienced recent natural selection. Following such a genome scan, individual regions may be chosen for further sequencing and a more detailed analysis of patterns of variability, often to perform a parametric test for selection and to estimate the strength of a recent selective sweep. We show here that not accounting for the ascertainment of loci in such analyses leads to false inference of natural selection when the true model is selective neutrality, because the procedure of choosing unusual loci (in comparison to the rest of the genome-scan data) selects regions of the genome with genealogies similar to those expected under models of recent directional selection. We describe a simple and efficient correction for this ascertainment bias, which restores the false-positive rate to near-nominal levels. For the parameters considered here, we find that obtaining a test with the expected distribution of P-values depends on accurately accounting both for ascertainment of regions and for demography. Finally, we use simulations to explore the utility of relying on outlier loci to detect recent selective sweeps. We find that measures of diversity and of population differentiation are more effective than summaries of the site-frequency spectrum and that sequencing larger regions (2.5 kbp) in genome-scan studies leads to more power to detect recent selective sweeps.
Collapse
Affiliation(s)
- Kevin R Thornton
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | |
Collapse
|
28
|
Kane NC, Rieseberg LH. Selective sweeps reveal candidate genes for adaptation to drought and salt tolerance in common sunflower, Helianthus annuus. Genetics 2007; 175:1823-34. [PMID: 17237516 PMCID: PMC1855101 DOI: 10.1534/genetics.106.067728] [Citation(s) in RCA: 104] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Here we report the results of an analysis of variation at 128 EST-based microsatellites in wild Helianthus annuus, using populations from the species' typical plains habitat in Kansas and Colorado, as well as two arid desert and two distinct brackish marsh areas in Utah. The test statistics lnRV and lnRH were used to find regions of the genome that were significantly less variable in one population relative to the others and thus are likely to contain genes under selection. A small but detectable percentage (1.5-6%) of genes showed evidence for selection from both statistics in any particular environment, and a total of 17 loci showed evidence of selection in at least one environment. Distance-based measures provided additional evidence of selection for 15 of the 17 loci. Global F(ST)-values were significantly higher for candidate loci, as expected under divergent selection. However, pairwise F(ST)-values were lower for populations that shared a selective sweep. Moreover, while spatially separated populations undergoing similar selective pressures showed evidence of divergence at some loci, they evolved in concert at other loci. Thus, this study illustrates how selective sweeps might contribute both to the integration of conspecific populations and to the differentiation of races or species.
Collapse
Affiliation(s)
- Nolan C Kane
- Department of Biology, Indiana University, Bloomington, Indiana 47405, USA.
| | | |
Collapse
|
29
|
Harr B, Voolstra C, Heinen TJAJ, Baines JF, Rottscheidt R, Ihle S, Müller W, Bonhomme F, Tautz D. A change of expression in the conserved signaling gene MKK7 is associated with a selective sweep in the western house mouse Mus musculus domesticus. J Evol Biol 2006; 19:1486-96. [PMID: 16910979 DOI: 10.1111/j.1420-9101.2006.01130.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Changes in gene expression are known to occur between closely related species, but it is not yet clear how many of these are due to random fixation of allelic variants or due to adaptive events. In a microarray survey between subspecies of the Mus musculus complex, we identified the mitogen-activated protein-kinase-kinase MKK7 as a candidate for change in gene expression. Quantitative PCR experiments with multiple individuals from each subspecies confirmed a specific and significant up-regulation in the testis of M. m. domesticus. Northern blot analysis shows that this is due to a new transcript that is not found in other tissues, nor in M. m. musculus. A cis-trans test via allele specific expression analysis of the MKK7 gene in F1 hybrids between domesticus and musculus shows that the expression change is mainly caused by a mutation located in cis. Nucleotide diversity was found to be significantly reduced in a window of at least 20 kb around the MKK7 locus in domesticus, indicative of a selective sweep. Because the MKK7 gene is involved in modulating a kinase signalling cascade in a stress response pathway, it seems a plausible target for adaptive differences between subspecies, although the functional role of the new testis-specific transcripts will need to be further studied.
Collapse
Affiliation(s)
- B Harr
- Institute for Genetics, Department of Evolutionary Genetics, Köln, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Hoffmann AA, Daborn PJ. Towards genetic markers in animal populations as biomonitors for human-induced environmental change. Ecol Lett 2006; 10:63-76. [PMID: 17204118 DOI: 10.1111/j.1461-0248.2006.00985.x] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Genetic markers provide potentially sensitive indicators of changes in environmental conditions because the genetic constitution of populations is normally altered well before populations become extinct. Genetic indicators in populations include overall genetic diversity, genetic changes in traits measured at the phenotypic level, and evolution at specific loci under selection. While overall genetic diversity has rarely been successfully related to environmental conditions, genetically based changes in traits have now been linked to the presence of toxins and both local and global temperature shifts. Candidate loci for monitoring stressors are emerging from information on how specific genes influence traits, and from screens of random loci across environmental gradients. Drosophila research suggests that chromosomal regions under recent intense selection can be identified from patterns of molecular variation and a high frequency of transposable element insertions. Allele frequency changes at candidate loci have been linked to pesticides, pollutants and climate change. Nevertheless, there are challenges in interpreting allele frequencies in populations, particularly when a large number of loci control a trait and when interactions between alleles influence trait expression. To meet these challenges, population samples should be collected for longitudinal studies, and experimental programmes should be undertaken to link variation at candidate genes to ecological processes.
Collapse
Affiliation(s)
- Ary A Hoffmann
- Department of Genetics, Centre of Environmental Stress and Adaptation Research, The University of Melbourne, Melbourne, Vic. 3010, Australia.
| | | |
Collapse
|
31
|
Pool JE, Aquadro CF. History and structure of sub-Saharan populations of Drosophila melanogaster. Genetics 2006; 174:915-29. [PMID: 16951082 PMCID: PMC1602083 DOI: 10.1534/genetics.106.058693] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2006] [Accepted: 08/10/2006] [Indexed: 11/18/2022] Open
Abstract
Drosophila melanogaster is an important model organism in evolutionary genetics, yet little is known about the population structure and the demographic history of this species within sub-Saharan Africa, which is thought to contain its ancestral range. We surveyed nucleotide variation at four 1-kb fragments in 240 individual lines representing 21 sub-Saharan and 4 Palearctic population samples of D. melanogaster. In agreement with recent studies, we find a small but significant level of genetic differentiation within sub-Saharan Africa. A clear geographic pattern is observed, with eastern and western African populations composing two genetically distinct groups. This pattern may have resulted from a relatively recent establishment of D. melanogaster in western Africa. Eastern populations show greater evidence for long-term stability, consistent with the hypothesis that eastern Africa contains the ancestral range of the species. Three sub-Saharan populations show evidence for cosmopolitan introgression. Apart from those cases, the closest relationships between Palearctic and sub-Saharan populations involve a sample from the rift zone (Uganda), suggesting that the progenitors of Palearctic D. melanogaster might have come from this region. Finally, we find a large excess of singleton polymorphisms in the full data set, which is best explained by a combination of population growth and purifying selection.
Collapse
Affiliation(s)
- John E Pool
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | |
Collapse
|
32
|
Glinka S, De Lorenzo D, Stephan W. Evidence of gene conversion associated with a selective sweep in Drosophila melanogaster. Mol Biol Evol 2006; 23:1869-78. [PMID: 16868022 DOI: 10.1093/molbev/msl069] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Since Drosophila melanogaster colonized Europe from tropical Africa 10 to 15 thousand years ago, it is expected that adaptation has played a major role in this species in recent times. A previously conducted multilocus scan of noncoding DNA sequences on the X chromosome in an ancestral and a derived population of D. melanogaster revealed that some loci have been affected by directional selection in the European population. We investigated if the pattern of DNA sequence polymorphism in a region surrounding one of these loci can be explained by a hitchhiking event. We found strong evidence that the studied region around the gene unc-119 was shaped by a recent selective sweep, including a valley of reduced heterozygosity of 83.4 kb, a skew in the frequency spectrum, and significant linkage disequilibrium on one side of the valley. This region, however, was interrupted by gene conversion events leading to a strong haplotype structure in the center of the valley of reduced variation.
Collapse
Affiliation(s)
- Sascha Glinka
- Section of Evolutionary Biology, Department of Biology II, Ludwig-Maximilians University, Planegg-Martinsried, Germany.
| | | | | |
Collapse
|
33
|
Quesada H, Ramos-Onsins SE, Rozas J, Aguadé M. Positive selection versus demography: evolutionary inferences based on an unusual haplotype structure in Drosophila simulans. Mol Biol Evol 2006; 23:1643-7. [PMID: 16757657 DOI: 10.1093/molbev/msl031] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Coalescent simulations were used to investigate the possible role of population subdivision and history in shaping nucleotide variation in a recombining 88-kb genomic fragment of Drosophila simulans displaying an unusual large-scale haplotype structure. The multilocus analysis, based on summary statistics using specific demographic null models under recombination, indicates that the observed levels of linkage disequilibrium differed significantly from the values expected under different bottleneck and population admixture scenarios. These results indicate that demography alone may not account for the observed pattern of variation and support the previous claim that the data are better described by a model in which an adaptive mutation has not yet gone to fixation.
Collapse
|
34
|
Hamblin MT, Casa AM, Sun H, Murray SC, Paterson AH, Aquadro CF, Kresovich S. Challenges of detecting directional selection after a bottleneck: lessons from Sorghum bicolor. Genetics 2006; 173:953-64. [PMID: 16547110 PMCID: PMC1526520 DOI: 10.1534/genetics.105.054312] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2005] [Accepted: 03/13/2006] [Indexed: 11/18/2022] Open
Abstract
Multilocus surveys of sequence variation can be used to identify targets of directional selection, which are expected to have reduced levels of variation. Following a population bottleneck, the signal of directional selection may be hard to detect because many loci may have low variation by chance and the frequency spectrum of variation may be perturbed in ways that resemble the effects of selection. Cultivated Sorghum bicolor contains a subset of the genetic diversity found in its wild ancestor(s) due to the combined effects of a domestication bottleneck and human selection on traits associated with agriculture. As a framework for distinguishing between the effects of demography and selection, we sequenced 204 loci in a diverse panel of 17 cultivated S. bicolor accessions. Genomewide patterns of diversity depart strongly from equilibrium expectations with regard to the variance of the number of segregating sites, the site frequency spectrum, and haplotype configuration. Furthermore, gene genealogies of most loci with an excess of low frequency variants and/or an excess of segregating sites do not show the characteristic signatures of directional and diversifying selection, respectively. A simple bottleneck model provides an improved but inadequate fit to the data, suggesting the action of other population-level factors, such as population structure and migration. Despite a known history of recent selection, we find little evidence for directional selection, likely due to low statistical power and lack of an appropriate null model.
Collapse
Affiliation(s)
- Martha T Hamblin
- Institute for Genomic Diversity, Cornell University, Ithaca, New York 14853, USA
| | | | | | | | | | | | | |
Collapse
|
35
|
Abstract
The beneficial substitution of an allele shapes patterns of genetic variation at linked sites. Thus, in principle, adaptations can be mapped by looking for the signature of directional selection in polymorphism data. In practice, such efforts are hampered by the need for an accurate characterization of the demographic history of the species and of the effects of positive selection. In an attempt to circumvent these difficulties, researchers are increasingly taking a purely empirical approach, in which a large number of genomic regions are ordered by summaries of the polymorphism data, and loci with extreme values are considered to be likely targets of positive selection. We evaluated the reliability of the "empirical" approach, focusing on applications to human data and to maize. To do so, we considered a coalescent model of directional selection in a sensible demographic setting, allowing for selection on standing variation as well as on a new mutation. Our simulations suggest that while empirical approaches will identify several interesting candidates, they will also miss many--in some cases, most--loci of interest. The extent of the trade-off depends on the mode of positive selection and the demographic history of the population. Specifically, the false-discovery rate is higher when directional selection involves a recessive rather than a co-dominant allele, when it acts on a previously neutral rather than a new allele, and when the population has experienced a population bottleneck rather than maintained a constant size. One implication of these results is that, insofar as attributes of the beneficial mutation (e.g., the dominance coefficient) affect the power to detect targets of selection, genomic scans will yield an unrepresentative subset of loci that contribute to adaptations.
Collapse
Affiliation(s)
- Kosuke M. Teshima
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| | - Graham Coop
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| |
Collapse
|
36
|
Pool JE, Bauer DuMont V, Mueller JL, Aquadro CF. A scan of molecular variation leads to the narrow localization of a selective sweep affecting both Afrotropical and cosmopolitan populations of Drosophila melanogaster. Genetics 2006; 172:1093-105. [PMID: 16322515 PMCID: PMC1456208 DOI: 10.1534/genetics.105.049973] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2005] [Accepted: 11/10/2005] [Indexed: 11/18/2022] Open
Abstract
Drosophila melanogaster originated in tropical Africa but has achieved a cosmopolitan distribution in association with human habitation. Cosmopolitan populations of D. melanogaster are known to have reduced genetic variation, particularly on the X chromosome. However, the relative importance of population bottlenecks and selective sweeps in explaining this reduction is uncertain. We surveyed variation at 31 microsatellites across a 330-kb section of the X chromosome located between the white and kirre genes. Two linked clusters of loci were observed with reduced variation and a skew toward rare alleles in both an Ecuador and a Zimbabwe population sample. Examining Zimbabwe DNA sequence polymorphism within one of these regions allowed us to localize a selective sweep to a 361-bp window within the 5' regulatory region of the roughest gene, with one nucleotide substitution representing the best candidate for the target of selection. Estimates of sweep age suggested that this fixation event occurred prior to the expansion of D. melanogaster from sub-Saharan Africa. For both putative sweep regions in our data set, cosmopolitan populations showed wider footprints of selection compared to those in Zimbabwe. This pattern appears consistent with the demographic amplification of preexisting sweep signals due to one or more population bottlenecks.
Collapse
Affiliation(s)
- John E Pool
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | |
Collapse
|