151
|
Schmidt H, Greshake B, Feldmeyer B, Hankeln T, Pfenninger M. Genomic basis of ecological niche divergence among cryptic sister species of non-biting midges. BMC Genomics 2013; 14:384. [PMID: 23758757 PMCID: PMC3685581 DOI: 10.1186/1471-2164-14-384] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2013] [Accepted: 05/30/2013] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND There is a lack of understanding the evolutionary forces driving niche segregation of closely related organisms. In addition, pinpointing the genes driving ecological divergence is a key goal in molecular ecology. Here, larval transcriptome sequences obtained by next-generation-sequencing are used to address these issues in a morphologically cryptic sister species pair of non-biting midges (Chironomus riparius and C. piger). RESULTS More than eight thousand orthologous open reading frames were screened for interspecific divergence and intraspecific polymorphisms. Despite a small mean sequence divergence of 1.53% between the sister species, 25.1% of 18,115 observed amino acid substitutions were inferred by α statistics to be driven by positive selection. Applying McDonald-Kreitman tests to 715 alignments of gene orthologues identified eleven (1.5%) genes driven by positive selection. CONCLUSIONS Three candidate genes were identified as potentially responsible for the observed niche segregation concerning nitrite concentration, habitat temperature and water conductivity. Additionally, signs of positive selection in the hydrogen sulfide detoxification pathway were detected, providing a new plausible hypothesis for the species' ecological differentiation. Finally, a divergently selected, nuclear encoded mitochondrial ribosomal protein may contribute to reproductive isolation due to cytonuclear coevolution.
Collapse
Affiliation(s)
- Hanno Schmidt
- Molecular Ecology Group, Biodiversity and Climate Research Centre (BiK-F) by Senckenberg Gesellschaft für Naturforschung and Goethe University, Biocampus Siesmayerstraße, Frankfurt am Main, 60054, Germany
- Institute of Molecular Genetics, Biosafety Research and Consulting, Johannes Gutenberg-University, Becherweg 30a, Mainz, 55128, Germany
| | - Bastian Greshake
- Molecular Ecology Group, Biodiversity and Climate Research Centre (BiK-F) by Senckenberg Gesellschaft für Naturforschung and Goethe University, Biocampus Siesmayerstraße, Frankfurt am Main, 60054, Germany
| | - Barbara Feldmeyer
- Molecular Ecology Group, Biodiversity and Climate Research Centre (BiK-F) by Senckenberg Gesellschaft für Naturforschung and Goethe University, Biocampus Siesmayerstraße, Frankfurt am Main, 60054, Germany
- Current address: Department of Evolutionary Biology, Johannes Gutenberg-University, Johannes-von-Müller-Weg 6, Mainz, 55128, Germany
| | - Thomas Hankeln
- Institute of Molecular Genetics, Biosafety Research and Consulting, Johannes Gutenberg-University, Becherweg 30a, Mainz, 55128, Germany
| | - Markus Pfenninger
- Molecular Ecology Group, Biodiversity and Climate Research Centre (BiK-F) by Senckenberg Gesellschaft für Naturforschung and Goethe University, Biocampus Siesmayerstraße, Frankfurt am Main, 60054, Germany
| |
Collapse
|
152
|
Strong purifying selection at synonymous sites in D. melanogaster. PLoS Genet 2013; 9:e1003527. [PMID: 23737754 PMCID: PMC3667748 DOI: 10.1371/journal.pgen.1003527] [Citation(s) in RCA: 143] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 04/08/2013] [Indexed: 11/19/2022] Open
Abstract
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated.
Collapse
|
153
|
Abstract
SLiM is an efficient forward population genetic simulation designed for studying the effects of linkage and selection on a chromosome-wide scale. The program can incorporate complex scenarios of demography and population substructure, various models for selection and dominance of new mutations, arbitrary gene structure, and user-defined recombination maps.
Collapse
|
154
|
Watt WB. Specific-gene studies of evolutionary mechanisms in an age of genome-wide surveying. Ann N Y Acad Sci 2013; 1289:1-17. [PMID: 23679204 DOI: 10.1111/nyas.12139] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
The molecular tools of genomics have great power to reveal patterns of genetic difference within or among species, but must be complemented by the mechanistic study of the genetic variants found if these variants' evolutionary meaning is to be well understood. Central to this purpose is knowledge of the organisms' genotype-phenotype-environment interactions, which embody biological adaptation and constraint and thus drive natural selection. The history of this approach is briefly reviewed. Strategies embracing the complementarity of genomics and specific-gene studies in evolution are considered. Implementation of these strategies, and examples showing their feasibility and power, are discussed. Initial generalizations emphasize: (1) reproducibility of adaptive mechanisms; (2) evolutionary co-importance of variation in protein sequences and expression; (3) refinement of rudimentary molecular functions as an origin of evolutionary innovations; (4) identification of specific-gene mechanisms as underpinnings of genomic or quantitative genetic variation; and (5) multiple forms of adaptive or constraining epistasis among genes. Progress along these lines will advance understanding of evolution and support its use in addressing urgent medical and environmental applications.
Collapse
Affiliation(s)
- Ward B Watt
- Department of Biology, Stanford University, Stanford, California and Rocky Mountain Biological Laboratory, Crested Butte, Colorado, USA.
| |
Collapse
|
155
|
Leushkin EV, Bazykin GA. Short indels are subject to insertion-biased gene conversion. Evolution 2013; 67:2604-13. [PMID: 24033170 DOI: 10.1111/evo.12129] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 04/05/2013] [Indexed: 11/29/2022]
Abstract
Recombination between homologous loci is accompanied by formation of heteroduplexes. Repairing mismatches in heteroduplexes often leads to single nucleotide substitutions in a process known as gene conversion. Gene conversion was shown to be GC-biased in different organisms; that is, a W(A or T)→S(G or C) substitution is more likely in this process than a S→W substitution. Here, we show that the insertion/deletion ratio for short noncoding indels that reach fixation between species is positively correlated with the recombination rate in Drosophila melanogaster, Homo sapiens, and Saccharomyces cerevisiae. This correlation is both due to an increase of the fixation rate of insertions and decrease of the fixation rate of deletions in regions of high recombination. Whole-genome data on indel polymorphism and divergence in D. melanogaster rule out mutation biases and selection as the cause of this trend, pointing to insertion-biased gene conversion as the most likely explanation. The bias toward insertions is the strongest for single-nucleotide indels, and decreases with indel length. In regions of high recombination rate this bias leads to an up to ∼5-fold excess of fixed short insertions over deletions, and substantially affects the evolution of DNA segments.
Collapse
Affiliation(s)
- Evgeny V Leushkin
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Leninskye Gory 1-73, Moscow, 119992, Russia; Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Bolshoi Karetny pereulok, 19, Moscow, 127994, Russia.
| | | |
Collapse
|
156
|
Abstract
Population genomic studies have shown that genetic draft and background selection can profoundly affect the genome-wide patterns of molecular variation. We performed forward simulations under realistic gene-structure and selection scenarios to investigate whether such linkage effects impinge on the ability of the McDonald-Kreitman (MK) test to infer the rate of positive selection (α) from polymorphism and divergence data. We find that in the presence of slightly deleterious mutations, MK estimates of α severely underestimate the true rate of adaptation even if all polymorphisms with population frequencies under 50% are excluded. Furthermore, already under intermediate rates of adaptation, genetic draft substantially distorts the site frequency spectra at neutral and functional sites from the expectations under mutation-selection-drift balance. MK-type approaches that first infer demography from synonymous sites and then use the inferred demography to correct the estimation of α obtain almost the correct α in our simulations. However, these approaches typically infer a severe past population expansion although there was no such expansion in the simulations, casting doubt on the accuracy of methods that infer demography from synonymous polymorphism data. We propose a simple asymptotic extension of the MK test that yields accurate estimates of α in our simulations and should provide a fruitful direction for future studies.
Collapse
|
157
|
Simola DF, Wissler L, Donahue G, Waterhouse RM, Helmkampf M, Roux J, Nygaard S, Glastad KM, Hagen DE, Viljakainen L, Reese JT, Hunt BG, Graur D, Elhaik E, Kriventseva EV, Wen J, Parker BJ, Cash E, Privman E, Childers CP, Muñoz-Torres MC, Boomsma JJ, Bornberg-Bauer E, Currie CR, Elsik CG, Suen G, Goodisman MAD, Keller L, Liebig J, Rawls A, Reinberg D, Smith CD, Smith CR, Tsutsui N, Wurm Y, Zdobnov EM, Berger SL, Gadau J. Social insect genomes exhibit dramatic evolution in gene composition and regulation while preserving regulatory features linked to sociality. Genome Res 2013; 23:1235-47. [PMID: 23636946 PMCID: PMC3730098 DOI: 10.1101/gr.155408.113] [Citation(s) in RCA: 166] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Genomes of eusocial insects code for dramatic examples of phenotypic plasticity and social organization. We compared the genomes of seven ants, the honeybee, and various solitary insects to examine whether eusocial lineages share distinct features of genomic organization. Each ant lineage contains ∼4000 novel genes, but only 64 of these genes are conserved among all seven ants. Many gene families have been expanded in ants, notably those involved in chemical communication (e.g., desaturases and odorant receptors). Alignment of the ant genomes revealed reduced purifying selection compared with Drosophila without significantly reduced synteny. Correspondingly, ant genomes exhibit dramatic divergence of noncoding regulatory elements; however, extant conserved regions are enriched for novel noncoding RNAs and transcription factor–binding sites. Comparison of orthologous gene promoters between eusocial and solitary species revealed significant regulatory evolution in both cis (e.g., Creb) and trans (e.g., fork head) for nearly 2000 genes, many of which exhibit phenotypic plasticity. Our results emphasize that genomic changes can occur remarkably fast in ants, because two recently diverged leaf-cutter ant species exhibit faster accumulation of species-specific genes and greater divergence in regulatory elements compared with other ants or Drosophila. Thus, while the “socio-genomes” of ants and the honeybee are broadly characterized by a pervasive pattern of divergence in gene composition and regulation, they preserve lineage-specific regulatory features linked to eusociality. We propose that changes in gene regulation played a key role in the origins of insect eusociality, whereas changes in gene composition were more relevant for lineage-specific eusocial adaptations.
Collapse
Affiliation(s)
- Daniel F Simola
- Department of Cell and Developmental Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
158
|
Andrew RL, Bernatchez L, Bonin A, Buerkle CA, Carstens BC, Emerson BC, Garant D, Giraud T, Kane NC, Rogers SM, Slate J, Smith H, Sork VL, Stone GN, Vines TH, Waits L, Widmer A, Rieseberg LH. A road map for molecular ecology. Mol Ecol 2013; 22:2605-26. [DOI: 10.1111/mec.12319] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 03/16/2013] [Indexed: 12/18/2022]
Affiliation(s)
- Rose L. Andrew
- Department of Botany; University of British Columbia; 3529-6270 University Blvd Vancouver BC V6T 1Z4 Canada
| | - Louis Bernatchez
- DInstitut de Biologie Intégrative et des Systémes; Département de Biologie; 1030, Avenue de la Médecine Université Laval; Québec QC G1V 0A6 Canada
| | - Aurélie Bonin
- Laboratoire d'Ecologie Alpine; CNRS UMR 5553 Université Joseph Fourier; BP 53, 38041 Grenoble Cedex 9 France
| | - C. Alex. Buerkle
- Department of Botany; University of Wyoming; 1000 E. University Ave. Laramie WY 82071 USA
| | - Bryan C. Carstens
- Department of Evolution, Ecology and Organismal Biology; 318 W. 12th Ave. The Ohio State University; Columbus OH 43210 USA
| | - Brent C. Emerson
- Island Ecology and Evolution Research Group; Instituto de Productos Naturales y Agrobiología (IPNA-CSIC) C/Astrofísico Francisco Sánchez 3 La Laguna Tenerife; Canary Islands 38206 Spain
| | - Dany Garant
- Département de Biologie; Université de Sherbrooke; Sherbrooke QC J1K 2R1 Canada
| | - Tatiana Giraud
- Laboratoire Ecologie, Systématique et Evolution; UMR 8079 CNRS-UPS-AgroParisTech, Bâtiment 360 Univ. Paris Sud; 91405 Orsay cedex France
| | - Nolan C. Kane
- Department of Botany; University of British Columbia; 3529-6270 University Blvd Vancouver BC V6T 1Z4 Canada
| | - Sean M. Rogers
- Department of Biological Sciences; University of Calgary; 2500 University Drive N.W., Calgary AB T2N 1N4 Canada
| | - Jon Slate
- Department of Animal and Plant Sciences; University of Sheffield; Sheffield S10 2TN UK
| | - Harry Smith
- 79 Melton Road Burton-on-the-Wolds Loughborough LE12 5TQ UK
| | - Victoria L. Sork
- Department of Ecology and Evolutionary Biology; University of California Los Angeles; 4139 Terasaki Life Sciences Building, 610 Charles E. Young Drive East Los Angeles CA 90095 USA
| | - Graham N. Stone
- Institute of Evolutionary Biology; University of Edinburgh; The King's Buildings, West Mains Road, Edinburgh EH9 3JT UK
| | - Timothy H. Vines
- Molecular Ecology Editorial Office; 6270 University Blvd Vancouver BC V6T 1Z4 Canada
| | - Lisette Waits
- Department of Fish and Wildlife Sciences; University of Idaho; 875 Perimeter Drive MS 1136 Moscow ID 83844 USA
| | - Alex Widmer
- ETH Zurich; Institute of Integrative Biology; Universitätstrasse 16 Zurich 8092 Switzerland
| | - Loren H. Rieseberg
- Department of Botany; University of British Columbia; 3529-6270 University Blvd Vancouver BC V6T 1Z4 Canada
- Department of Biology; Indiana University; 1001 E. 3 St., Bloomington IN 47405 USA
| |
Collapse
|
159
|
ZRT1 Harbors an Excess of Nonsynonymous Polymorphism and Shows Evidence of Balancing Selection in Saccharomyces cerevisiae. G3-GENES GENOMES GENETICS 2013; 3:665-673. [PMID: 23550117 PMCID: PMC3618353 DOI: 10.1534/g3.112.005082] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
Estimates of the fraction of nucleotide substitutions driven by positive selection vary widely across different species. Accounting for different estimates of positive selection has been difficult, in part because selection on polymorphism within a species is known to obscure a signal of positive selection among species. While methods have been developed to control for the confounding effects of negative selection against deleterious polymorphism, the impact of balancing selection on estimates of positive selection has not been assessed. In Saccharomyces cerevisiae, there is no signal of positive selection within protein coding sequences as the ratio of nonsynonymous to synonymous polymorphism is higher than that of divergence. To investigate the impact of balancing selection on estimates of positive selection, we examined five genes with high rates of nonsynonymous polymorphism in S. cerevisiae relative to divergence from S. paradoxus. One of the genes, the high-affinity zinc transporter ZRT1 showed an elevated rate of synonymous polymorphism indicative of balancing selection. The high rate of synonymous polymorphism coincided with nonsynonymous divergence among three haplotype groups, among which we found no detectable differences in ZRT1 function. Our results implicate balancing selection in one of five genes exhibiting a large excess of nonsynonymous polymorphism in yeast. We conclude that balancing selection is a potentially important factor in estimating the frequency of positive selection across the yeast genome.
Collapse
|
160
|
Cutter AD, Jovelin R, Dey A. Molecular hyperdiversity and evolution in very large populations. Mol Ecol 2013; 22:2074-95. [PMID: 23506466 PMCID: PMC4065115 DOI: 10.1111/mec.12281] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2012] [Revised: 01/24/2013] [Accepted: 01/29/2013] [Indexed: 02/06/2023]
Abstract
The genomic density of sequence polymorphisms critically affects the sensitivity of inferences about ongoing sequence evolution, function and demographic history. Most animal and plant genomes have relatively low densities of polymorphisms, but some species are hyperdiverse with neutral nucleotide heterozygosity exceeding 5%. Eukaryotes with extremely large populations, mimicking bacterial and viral populations, present novel opportunities for studying molecular evolution in sexually reproducing taxa with complex development. In particular, hyperdiverse species can help answer controversial questions about the evolution of genome complexity, the limits of natural selection, modes of adaptation and subtleties of the mutation process. However, such systems have some inherent complications and here we identify topics in need of theoretical developments. Close relatives of the model organisms Caenorhabditis elegans and Drosophila melanogaster provide known examples of hyperdiverse eukaryotes, encouraging functional dissection of resulting molecular evolutionary patterns. We recommend how best to exploit hyperdiverse populations for analysis, for example, in quantifying the impact of noncrossover recombination in genomes and for determining the identity and micro-evolutionary selective pressures on noncoding regulatory elements.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
161
|
Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet 2013; 14:262-74. [PMID: 23478346 DOI: 10.1038/nrg3425] [Citation(s) in RCA: 311] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Population genetics theory supplies powerful predictions about how natural selection interacts with genetic linkage to sculpt the genomic landscape of nucleotide polymorphism. Both the spread of beneficial mutations and the removal of deleterious mutations act to depress polymorphism levels, especially in low-recombination regions. However, empiricists have documented extreme disparities among species. Here we characterize the dominant features that could drive differences in linked selection among species--including roles for selective sweeps being 'hard' or 'soft'--and the concealing effects of demography and confounding genomic variables. We advocate targeted studies of closely related species to unify our understanding of how selection and linkage interact to shape genome evolution.
Collapse
|
162
|
Abstract
Knowing the distribution of fitness effects (DFE) of new mutations is important for several topics in evolutionary genetics. Existing computational methods with which to infer the DFE based on DNA polymorphism data have frequently assumed that the DFE can be approximated by a unimodal distribution, such as a lognormal or a gamma distribution. However, if the true DFE departs substantially from the assumed distribution (e.g., if the DFE is multimodal), this could lead to misleading inferences about its properties. We conducted simulations to test the performance of parametric and nonparametric discretized distribution models to infer the properties of the DFE for cases in which the true DFE is unimodal, bimodal, or multimodal. We found that lognormal and gamma distribution models can perform poorly in recovering the properties of the distribution if the true DFE is bimodal or multimodal, whereas discretized distribution models perform better. If there is a sufficient amount of data, the discretized models can detect a multimodal DFE and can accurately infer the mean effect and the average fixation probability of a new deleterious mutation. We fitted several models for the DFE of amino acid-changing mutations using whole-genome polymorphism data from Drosophila melanogaster and the house mouse subspecies Mus musculus castaneus. A lognormal DFE best explains the data for D. melanogaster, whereas we find evidence for a bimodal DFE in M. m. castaneus.
Collapse
|
163
|
|
164
|
Watt WB, Hudson RR, Wang B, Wang E. A genetic polymorphism evolving in parallel in two cell compartments and in two clades. BMC Evol Biol 2013; 13:9. [PMID: 23311980 PMCID: PMC3556304 DOI: 10.1186/1471-2148-13-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 12/12/2012] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND The enzyme phosphoenolpyruvate carboxykinase, PEPCK, occurs in its guanosine-nucleotide-using form in animals and a few prokaryotes. We study its natural genetic variation in Colias (Lepidoptera, Pieridae). PEPCK offers a route, alternative to pyruvate kinase, for carbon skeletons to move between cytosolic glycolysis and mitochondrial Krebs cycle reactions. RESULTS PEPCK is expressed in both cytosol and mitochondrion, but differently in diverse animal clades. In vertebrates and independently in Drosophila, compartment-specific paralogous genes occur. In a contrasting expression strategy, compartment-specific PEPCKs of Colias and of the silkmoth, Bombyx, differ only in their first, 5', exons; these are alternatively spliced onto a common series of following exons. In two Colias species from distinct clades, PEPCK sequence is highly variable at nonsynonymous and synonymous sites, mainly in its common exons. Three major amino acid polymorphisms, Gly 335 ↔ Ser, Asp 503 ↔ Glu, and Ile 629 ↔ Val occur in both species, and in the first two cases are similar in frequency between species. Homology-based structural modelling shows that the variants can alter hydrogen bonding, salt bridging, or van der Waals interactions of amino acid side chains, locally or at one another's sites which are distant in PEPCK's structure, and thus may affect its enzyme function. We ask, using coalescent simulations, if these polymorphisms' cross-species similarities are compatible with neutral evolution by genetic drift, but find the probability of this null hypothesis is 0.001 ≤ P ≤ 0.006 under differing scenarios. CONCLUSION Our results make the null hypothesis of neutrality of these PEPCK polymorphisms quite unlikely, but support an alternative hypothesis that they are maintained by natural selection in parallel in the two species. This alternative can now be justifiably tested further via studies of PEPCK genotypes' effects on function, organismal performance, and fitness. This case emphasizes the importance, for evolutionary insight, of studying gene-specific mechanisms affected by natural genetic variation as an essential complement to surveys of such variation.
Collapse
Affiliation(s)
- Ward B Watt
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA
- Rocky Mountain Biological Laboratory, Crested Butte, CO 81224, USA
| | - Richard R Hudson
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, 60637, USA
| | - Baiqing Wang
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA
| | - Eddie Wang
- Department of Biology, Stanford University, Stanford, CA 94305-5020, USA
- Present address: Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA, 0213, USA
| |
Collapse
|
165
|
|
166
|
Chan AH, Jenkins PA, Song YS. Genome-wide fine-scale recombination rate variation in Drosophila melanogaster. PLoS Genet 2012; 8:e1003090. [PMID: 23284288 PMCID: PMC3527307 DOI: 10.1371/journal.pgen.1003090] [Citation(s) in RCA: 172] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 09/29/2012] [Indexed: 01/18/2023] Open
Abstract
Estimating fine-scale recombination maps of Drosophila from population genomic data is a challenging problem, in particular because of the high background recombination rate. In this paper, a new computational method is developed to address this challenge. Through an extensive simulation study, it is demonstrated that the method allows more accurate inference, and exhibits greater robustness to the effects of natural selection and noise, compared to a well-used previous method developed for studying fine-scale recombination rate variation in the human genome. As an application, a genome-wide analysis of genetic variation data is performed for two Drosophila melanogaster populations, one from North America (Raleigh, USA) and the other from Africa (Gikongoro, Rwanda). It is shown that fine-scale recombination rate variation is widespread throughout the D. melanogaster genome, across all chromosomes and in both populations. At the fine-scale, a conservative, systematic search for evidence of recombination hotspots suggests the existence of a handful of putative hotspots each with at least a tenfold increase in intensity over the background rate. A wavelet analysis is carried out to compare the estimated recombination maps in the two populations and to quantify the extent to which recombination rates are conserved. In general, similarity is observed at very broad scales, but substantial differences are seen at fine scales. The average recombination rate of the X chromosome appears to be higher than that of the autosomes in both populations, and this pattern is much more pronounced in the African population than the North American population. The correlation between various genomic features—including recombination rates, diversity, divergence, GC content, gene content, and sequence quality—is examined using the wavelet analysis, and it is shown that the most notable difference between D. melanogaster and humans is in the correlation between recombination and diversity. Recombination is a process by which chromosomes exchange genetic material during meiosis. It is important in evolution because it provides offspring with new combinations of genes, and so estimating the rate of recombination is of fundamental importance in various population genomic inference problems. In this paper, we develop a new statistical method to enable robust estimation of fine-scale recombination maps of Drosophila, a genus of common fruit flies, in which the background recombination rate is high and natural selection has been prevalent. We apply our method to produce fine-scale recombination maps for a North American population and an African population of D. melanogaster. For both populations, we find extensive fine-scale variation in recombination rate throughout the genome. We provide a quantitative characterization of the similarities and differences between the recombination maps of the two populations; our study reveals high correlation at broad scales and low correlation at fine scales, as has been documented among human populations. We also examine the correlation between various genomic features. Furthermore, using a conservative approach, we find a handful of putative recombination “hotspot” regions with solid statistical support for a local elevation of at least 10 times the background recombination rate.
Collapse
Affiliation(s)
- Andrew H. Chan
- Computer Science Division, University of California Berkeley, Berkeley, California, United States of America
| | - Paul A. Jenkins
- Computer Science Division, University of California Berkeley, Berkeley, California, United States of America
| | - Yun S. Song
- Computer Science Division, University of California Berkeley, Berkeley, California, United States of America
- Department of Statistics, University of California Berkeley, Berkeley, California, United States of America
- * E-mail:
| |
Collapse
|
167
|
Pool JE, Corbett-Detig RB, Sugino RP, Stevens KA, Cardeno CM, Crepeau MW, Duchen P, Emerson JJ, Saelao P, Begun DJ, Langley CH. Population Genomics of sub-saharan Drosophila melanogaster: African diversity and non-African admixture. PLoS Genet 2012; 8:e1003080. [PMID: 23284287 PMCID: PMC3527209 DOI: 10.1371/journal.pgen.1003080] [Citation(s) in RCA: 227] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2012] [Accepted: 09/27/2012] [Indexed: 11/25/2022] Open
Abstract
Drosophila melanogaster has played a pivotal role in the development of modern population genetics. However, many basic questions regarding the demographic and adaptive history of this species remain unresolved. We report the genome sequencing of 139 wild-derived strains of D. melanogaster, representing 22 population samples from the sub-Saharan ancestral range of this species, along with one European population. Most genomes were sequenced above 25X depth from haploid embryos. Results indicated a pervasive influence of non-African admixture in many African populations, motivating the development and application of a novel admixture detection method. Admixture proportions varied among populations, with greater admixture in urban locations. Admixture levels also varied across the genome, with localized peaks and valleys suggestive of a non-neutral introgression process. Genomes from the same location differed starkly in ancestry, suggesting that isolation mechanisms may exist within African populations. After removing putatively admixed genomic segments, the greatest genetic diversity was observed in southern Africa (e.g. Zambia), while diversity in other populations was largely consistent with a geographic expansion from this potentially ancestral region. The European population showed different levels of diversity reduction on each chromosome arm, and some African populations displayed chromosome arm-specific diversity reductions. Inversions in the European sample were associated with strong elevations in diversity across chromosome arms. Genomic scans were conducted to identify loci that may represent targets of positive selection within an African population, between African populations, and between European and African populations. A disproportionate number of candidate selective sweep regions were located near genes with varied roles in gene regulation. Outliers for Europe-Africa F(ST) were found to be enriched in genomic regions of locally elevated cosmopolitan admixture, possibly reflecting a role for some of these loci in driving the introgression of non-African alleles into African populations.
Collapse
Affiliation(s)
- John E Pool
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
168
|
Abstract
Positive selection distorts the structure of genealogies and hence alters patterns of genetic variation within a population. Most analyses of these distortions focus on the signatures of hitchhiking due to hard or soft selective sweeps at a single genetic locus. However, in linked regions of rapidly adapting genomes, multiple beneficial mutations at different loci can segregate simultaneously within the population, an effect known as clonal interference. This leads to a subtle interplay between hitchhiking and interference effects, which leads to a unique signature of rapid adaptation on genetic variation both at the selected sites and at linked neutral loci. Here, we introduce an effective coalescent theory (a "fitness-class coalescent") that describes how positive selection at many perfectly linked sites alters the structure of genealogies. We use this theory to calculate several simple statistics describing genetic variation within a rapidly adapting population and to implement efficient backward-time coalescent simulations, which can be used to predict how clonal interference alters the expected patterns of molecular evolution.
Collapse
|
169
|
Olson-Manning CF, Wagner MR, Mitchell-Olds T. Adaptive evolution: evaluating empirical support for theoretical predictions. Nat Rev Genet 2012; 13:867-77. [PMID: 23154809 PMCID: PMC3748133 DOI: 10.1038/nrg3322] [Citation(s) in RCA: 148] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Adaptive evolution is shaped by the interaction of population genetics, natural selection and underlying network and biochemical constraints. Variation created by mutation, the raw material for evolutionary change, is translated into phenotypes by flux through metabolic pathways and by the topography and dynamics of molecular networks. Finally, the retention of genetic variation and the efficacy of selection depend on population genetics and demographic history. Emergent high-throughput experimental methods and sequencing technologies allow us to gather more evidence and to move beyond the theory in different systems and populations. Here we review the extent to which recent evidence supports long-established theoretical principles of adaptation.
Collapse
Affiliation(s)
- Carrie F. Olson-Manning
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| | - Maggie R. Wagner
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| | - Thomas Mitchell-Olds
- Department of Biology, Box 90338, Program in Genetics and Genomics, Institute for Genome Sciences and Policy, Duke University, Durham, NC 27708
| |
Collapse
|
170
|
Schoville SD, Bonin A, François O, Lobreaux S, Melodelima C, Manel S. Adaptive Genetic Variation on the Landscape: Methods and Cases. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2012. [DOI: 10.1146/annurev-ecolsys-110411-160248] [Citation(s) in RCA: 217] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Sean D. Schoville
- Laboratoire TIMC-IMAG, UMR-CNRS 5525, Université Joseph Fourier, 38041 Grenoble, France; ,
- Laboratoire d'Ecologie Alpine, UMR-CNRS 5553, Université Joseph Fourier, 38041 Grenoble, France; , , ,
| | - Aurélie Bonin
- Laboratoire d'Ecologie Alpine, UMR-CNRS 5553, Université Joseph Fourier, 38041 Grenoble, France; , , ,
| | - Olivier François
- Laboratoire TIMC-IMAG, UMR-CNRS 5525, Université Joseph Fourier, 38041 Grenoble, France; ,
| | - Stéphane Lobreaux
- Laboratoire d'Ecologie Alpine, UMR-CNRS 5553, Université Joseph Fourier, 38041 Grenoble, France; , , ,
| | - Christelle Melodelima
- Laboratoire d'Ecologie Alpine, UMR-CNRS 5553, Université Joseph Fourier, 38041 Grenoble, France; , , ,
| | - Stéphanie Manel
- Laboratoire d'Ecologie Alpine, UMR-CNRS 5553, Université Joseph Fourier, 38041 Grenoble, France; , , ,
- Laboratoire Population Environnement et Développement, UMR-IRD 151, Université Aix-Marseille, 13331 Marseille, France
| |
Collapse
|
171
|
Campos JL, Zeng K, Parker DJ, Charlesworth B, Haddrill PR. Codon usage bias and effective population sizes on the X chromosome versus the autosomes in Drosophila melanogaster. Mol Biol Evol 2012. [PMID: 23204387 PMCID: PMC3603305 DOI: 10.1093/molbev/mss222] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Codon usage bias (CUB) in Drosophila is higher for X-linked genes than for autosomal genes. One possible explanation is that the higher effective recombination rate for genes on the X chromosome compared with the autosomes reduces their susceptibility to Hill–Robertson effects, and thus enhances the efficacy of selection on codon usage. The genome sequence of D. melanogaster was used to test this hypothesis. Contrary to expectation, it was found that, after correcting for the effective recombination rate, CUB remained higher on the X than on the autosomes. In contrast, an analysis of polymorphism data from a Rwandan population showed that mean nucleotide site diversity at 4-fold degenerate sites for genes on the X is approximately three-quarters of the autosomal value after correcting for the effective recombination rate, compared with approximate equality before correction. In addition, these data show that selection for preferred versus unpreferred synonymous variants is stronger on the X than the autosomes, which accounts for the higher CUB of genes on the X chromosome. This difference in the strength of selection does not appear to reflect the effects of dominance of mutations affecting codon usage, differences in gene expression levels between X and autosomes, or differences in mutational bias. Its cause therefore remains unexplained. The stronger selection on CUB on the X chromosome leads to a lower rate of synonymous site divergence compared with the autosomes; this will cause a stronger upward bias for X than A in estimates of the proportion of nonsynonymous mutations fixed by positive selection, for methods based on the McDonald–Kreitman test.
Collapse
Affiliation(s)
- Jose L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | | | | | |
Collapse
|
172
|
A coalescent model of background selection with recombination, demography and variation in selection coefficients. Heredity (Edinb) 2012. [PMID: 23188176 DOI: 10.1038/hdy.2012.102] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
There is increasing evidence that background selection, the effects of the elimination of recurring deleterious mutations by natural selection on variability at linked sites, may be a major factor shaping genome-wide patterns of genetic diversity. To accurately quantify the importance of background selection, it is vital to have computationally efficient models that include essential biological features. To this end, a structured coalescent procedure is used to construct a model of background selection that takes into account the effects of recombination, recent changes in population size and variation in selection coefficients against deleterious mutations across sites. Furthermore, this model allows a flexible organization of selected and neutral sites in the region concerned, and has the ability to generate sequence variability at both selected and neutral sites, allowing the correlation between these two types of sites to be studied. The accuracy of the model is verified by checking against the results of forward simulations. These simulations also reveal several patterns of diversity that are in qualitative agreement with observations reported in recent studies of DNA sequence polymorphisms. These results suggest that the model should be useful for data analysis.
Collapse
|
173
|
Abstract
The most common models of sequence evolution used to make inferences about adaptation rely on the assumption that selective pressures at a site remain constant through time. Instead, one might plausibly imagine that a change in the environment renders an allele beneficial and that when it fixes, the site is now constrained-until another change in the environment occurs that affects the selective pressures at that site. With this view in mind, we introduce a simple dynamic model for the evolution of coding regions, in which non-synonymous sites alternate between being fixed for the favored allele and being neutral with respect to other alleles. We use the pruning algorithm to derive closed forms for observable patterns of polymorphism and divergence in terms of the model parameters. Using our model, estimates of the fraction of beneficial substitutions α would remain similar to those obtained from existing approaches. In this framework, however, it becomes natural to ask how often adaptive substitutions originate from previously constrained or previously neutral sites, i.e., about the source of adaptive substitutions. We show that counts of coding sites that are both polymorphic in a sample from one species and divergent between two others carry information about this parameter. We also extend the basic model to include the effects of weakly deleterious mutations and discuss the importance of assumptions about the distribution of deleterious mutations among constrained non-synonymous sites. Finally, we derive a likelihood function for the parameters and apply it to a toy example, variation data for coding regions from chromosome 2 of the Drosophila melanogaster subgroup. This modeling work underscores how restrictive assumptions about adaptation have been to date, and how further work in this area will help to reveal unexplored and yet basic characteristics of adaptation.
Collapse
|
174
|
McGaugh SE, Heil CSS, Manzano-Winkler B, Loewe L, Goldstein S, Himmel TL, Noor MAF. Recombination modulates how selection affects linked sites in Drosophila. PLoS Biol 2012; 10:e1001422. [PMID: 23152720 PMCID: PMC3496668 DOI: 10.1371/journal.pbio.1001422] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 10/05/2012] [Indexed: 11/18/2022] Open
Abstract
Recombination rate in Drosophila species shapes the impact of selection in the genome and is positively correlated with nucleotide diversity. One of the most influential observations in molecular evolution has been a strong association between local recombination rate and nucleotide polymorphisms across the genome. This is interpreted as evidence for ubiquitous natural selection. The alternative explanation, that recombination is mutagenic, has been rejected by the absence of a similar association between local recombination rate and nucleotide divergence between species. However, many recent studies show that recombination rates are often very different even in closely related species, questioning whether an association between recombination rate and divergence between species has been tested satisfactorily. To circumvent this problem, we directly surveyed recombination across approximately 43% of the D. pseudoobscura physical genome in two separate recombination maps and 31% of the D. miranda physical genome, and we identified both global and local differences in recombination rate between these two closely related species. Using only regions with conserved recombination rates between and within species and accounting for multiple covariates, our data support the conclusion that recombination is positively related to diversity because recombination modulates Hill–Robertson effects in the genome and not because recombination is predominately mutagenic. Finally, we find evidence for dips in diversity around nonsynonymous substitutions. We infer that at least some of this reduction in diversity resulted from selective sweeps and examine these dips in the context of recombination rate. Individuals within a species differ in the DNA sequences of their genes. This sequence variation affects how well individuals survive or reproduce and is transmitted to their offspring. Genes near each other on individual chromosomes tend to be passed to offspring together—neighboring genes are unlikely to be separated by exchanges of genetic material derived from different parents during meiotic recombination. When genes are inherited together, however, the evolutionary forces acting on one gene can interfere with variation at its neighbors. Thus, variation at multiple genes can be lost if natural selection acts on one gene in close proximity. Recombination can prevent or reduce this loss of variation, but previous tests of this phenomenon failed to account for recombination rate differences between species. In this study, we show that some parts of the genome differ in recombination rate between two species of fruit fly, Drosophila pseudoobscura and D. miranda. Avoiding an assumption made in previous studies, we then examine sequence variation within and between fly species in those parts of the genome that have conserved recombination rates. Based on the results, we conclude that recombination indeed preserves variation within species that would otherwise have been eliminated by natural selection.
Collapse
Affiliation(s)
- Suzanne E McGaugh
- Biology Department, Duke University, Durham, North Carolina, United States of America.
| | | | | | | | | | | | | |
Collapse
|
175
|
Inferences of demography and selection in an African population of Drosophila melanogaster. Genetics 2012; 193:215-28. [PMID: 23105013 DOI: 10.1534/genetics.112.145318] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It remains a central problem in population genetics to infer the past action of natural selection, and these inferences pose a challenge because demographic events will also substantially affect patterns of polymorphism and divergence. Thus it is imperative to explicitly model the underlying demographic history of the population whenever making inferences about natural selection. In light of the considerable interest in adaptation in African populations of Drosophila melanogaster, which are considered ancestral to the species, we generated a large polymorphism data set representing 2.1 Mb from each of 20 individuals from a Ugandan population of D. melanogaster. In contrast to previous inferences of a simple population expansion in eastern Africa, our demographic modeling of this ancestral population reveals a strong signature of a population bottleneck followed by population expansion, which has significant implications for future demographic modeling of derived populations of this species. Taking this more complex underlying demographic history into account, we also estimate a mean X-linked region-wide rate of adaptation of 6 × 10(-11)/site/generation and a mean selection coefficient of beneficial mutations of 0.0009. These inferences regarding the rate and strength of selection are largely consistent with most other estimates from D. melanogaster and indicate a relatively high rate of adaptation driven by weakly beneficial mutations.
Collapse
|
176
|
Classical genetics meets next-generation sequencing: uncovering a genome-wide recombination map in Drosophila melanogaster. PLoS Genet 2012; 8:e1003024. [PMID: 23071461 PMCID: PMC3469414 DOI: 10.1371/journal.pgen.1003024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
177
|
Comeron JM, Ratnappan R, Bailin S. The many landscapes of recombination in Drosophila melanogaster. PLoS Genet 2012; 8:e1002905. [PMID: 23071443 PMCID: PMC3469467 DOI: 10.1371/journal.pgen.1002905] [Citation(s) in RCA: 326] [Impact Index Per Article: 27.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2012] [Accepted: 07/02/2012] [Indexed: 01/06/2023] Open
Abstract
Recombination is a fundamental biological process with profound evolutionary implications. Theory predicts that recombination increases the effectiveness of selection in natural populations. Yet, direct tests of this prediction have been restricted to qualitative trends due to the lack of detailed characterization of recombination rate variation across genomes and within species. The use of imprecise recombination rates can also skew population genetic analyses designed to assess the presence and mode of selection across genomes. Here we report the first integrated high-resolution description of genomic and population variation in recombination, which also distinguishes between the two outcomes of meiotic recombination: crossing over (CO) and gene conversion (GC). We characterized the products of 5,860 female meioses in Drosophila melanogaster by genotyping a total of 139 million informative SNPs and mapped 106,964 recombination events at a resolution down to 2 kilobases. This approach allowed us to generate whole-genome CO and GC maps as well as a detailed description of variation in recombination among individuals of this species. We describe many levels of variation in recombination rates. At a large-scale (100 kb), CO rates exhibit extreme and highly punctuated variation along chromosomes, with hot and coldspots. We also show extensive intra-specific variation in CO landscapes that is associated with hotspots at low frequency in our sample. GC rates are more uniformly distributed across the genome than CO rates and detectable in regions with reduced or absent CO. At a local scale, recombination events are associated with numerous sequence motifs and tend to occur within transcript regions, thus suggesting that chromatin accessibility favors double-strand breaks. All these non-independent layers of variation in recombination across genomes and among individuals need to be taken into account in order to obtain relevant estimates of recombination rates, and should be included in a new generation of population genetic models of the interaction between selection and linkage.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, Iowa, USA.
| | | | | |
Collapse
|
178
|
Hunt BG, Ometto L, Keller L, Goodisman MAD. Evolution at two levels in fire ants: the relationship between patterns of gene expression and protein sequence evolution. Mol Biol Evol 2012; 30:263-71. [PMID: 23051842 DOI: 10.1093/molbev/mss234] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Variation in protein sequence and gene expression each contribute to phenotypic diversity, and may be subject to similar selective pressures. Eusocial insects are particularly useful for investigating the evolutionary link between protein sequence and condition-dependent patterns of gene expression because gene expression plays a central role in determining differences between eusocial insect sexes and castes. We investigated the relationship between protein coding sequence evolution and gene expression patterns in the fire ants Solenopsis invicta, S. richteri, and their hybrids to gain greater insight into how selection jointly operates on gene expression and coding sequence. We found that genes with high expression variability within castes and sexes were frequently differentially expressed between castes and sexes, as well as between species and hybrids. These results indicate that genes showing high variation in expression in one context also tend to show high variation in expression in other contexts. Our analyses further revealed that variation in both intra- and interspecific gene expression was positively associated with rate of protein sequence evolution in Solenopsis. This suggests that selective constraints on a gene operate both at the level of protein sequence and at the level of gene expression regulation. Overall, our study provides one of the strongest demonstrations that selective constraints mediate both protein sequence evolution and gene expression variability across different biological contexts and timescales.
Collapse
Affiliation(s)
- Brendan G Hunt
- School of Biology, Georgia Institute of Technology, USA.
| | | | | | | |
Collapse
|
179
|
Coevolution within and between regulatory loci can preserve promoter function despite evolutionary rate acceleration. PLoS Genet 2012; 8:e1002961. [PMID: 23028368 PMCID: PMC3447958 DOI: 10.1371/journal.pgen.1002961] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2012] [Accepted: 08/06/2012] [Indexed: 11/19/2022] Open
Abstract
Phenotypes that appear to be conserved could be maintained not only by strong purifying selection on the underlying genetic systems, but also by stabilizing selection acting via compensatory mutations with balanced effects. Such coevolution has been invoked to explain experimental results, but has rarely been the focus of study. Conserved expression driven by the unc-47 promoters of Caenorhabditis elegans and C. briggsae persists despite divergence within a cis-regulatory element and between this element and the trans-regulatory environment. Compensatory changes in cis and trans are revealed when these promoters are used to drive expression in the other species. Functional changes in the C. briggsae promoter, which has experienced accelerated sequence evolution, did not lead to alteration of gene expression in its endogenous environment. Coevolution among promoter elements suggests that complex epistatic interactions within cis-regulatory elements may facilitate their divergence. Our results offer a detailed picture of regulatory evolution in which subtle, lineage-specific, and compensatory modifications of interacting cis and trans regulators together maintain conserved gene expression patterns. Some phenotypes, including gene expression patterns, are conserved between distantly related species. However, the molecular bases of those phenotypes are not necessarily conserved. Instead, regulatory DNA sequences and the proteins with which they interact can change over time with balanced effects, preserving expression patterns and concealing regulatory divergence. Coevolution between interacting molecules makes gene regulation highly species-specific, and it can be detected when the cis-regulatory DNA of one species is used to drive expression in another species. In this way, we identified regions of the C. elegans and C. briggsae unc-47 promoters that have coevolved with the lineage-specific trans-regulatory environments of these organisms. The C. briggsae promoter experienced accelerated sequence change relative to related species. All of this evolution occurred without changing the expression pattern driven by the promoter in its endogenous environment.
Collapse
|
180
|
Leffler EM, Bullaughey K, Matute DR, Meyer WK, Ségurel L, Venkat A, Andolfatto P, Przeworski M. Revisiting an old riddle: what determines genetic diversity levels within species? PLoS Biol 2012; 10:e1001388. [PMID: 22984349 PMCID: PMC3439417 DOI: 10.1371/journal.pbio.1001388] [Citation(s) in RCA: 328] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Understanding why some species have more genetic diversity than others is central to the study of ecology and evolution, and carries potentially important implications for conservation biology. Yet not only does this question remain unresolved, it has largely fallen into disregard. With the rapid decrease in sequencing costs, we argue that it is time to revive it.
Collapse
Affiliation(s)
- Ellen M. Leffler
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (EML); (MP)
| | - Kevin Bullaughey
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, United States of America
| | - Daniel R. Matute
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Wynn K. Meyer
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Laure Ségurel
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, United States of America
| | - Aarti Venkat
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, United States of America
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (EML); (MP)
| |
Collapse
|
181
|
Coop G, Ralph P. Patterns of neutral diversity under general models of selective sweeps. Genetics 2012; 192:205-24. [PMID: 22714413 PMCID: PMC3430537 DOI: 10.1534/genetics.112.141861] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2012] [Accepted: 06/01/2012] [Indexed: 11/18/2022] Open
Abstract
Two major sources of stochasticity in the dynamics of neutral alleles result from resampling of finite populations (genetic drift) and the random genetic background of nearby selected alleles on which the neutral alleles are found (linked selection). There is now good evidence that linked selection plays an important role in shaping polymorphism levels in a number of species. One of the best-investigated models of linked selection is the recurrent full-sweep model, in which newly arisen selected alleles fix rapidly. However, the bulk of selected alleles that sweep into the population may not be destined for rapid fixation. Here we develop a general model of recurrent selective sweeps in a coalescent framework, one that generalizes the recurrent full-sweep model to the case where selected alleles do not sweep to fixation. We show that in a large population, only the initial rapid increase of a selected allele affects the genealogy at partially linked sites, which under fairly general assumptions are unaffected by the subsequent fate of the selected allele. We also apply the theory to a simple model to investigate the impact of recurrent partial sweeps on levels of neutral diversity and find that for a given reduction in diversity, the impact of recurrent partial sweeps on the frequency spectrum at neutral sites is determined primarily by the frequencies rapidly achieved by the selected alleles. Consequently, recurrent sweeps of selected alleles to low frequencies can have a profound effect on levels of diversity but can leave the frequency spectrum relatively unperturbed. In fact, the limiting coalescent model under a high rate of sweeps to low frequency is identical to the standard neutral model. The general model of selective sweeps we describe goes some way toward providing a more flexible framework to describe genomic patterns of diversity than is currently available.
Collapse
Affiliation(s)
- Graham Coop
- Department of Evolution and Ecology and Center for Population Biology, University of California, Davis, California 95616, USA.
| | | |
Collapse
|
182
|
Chang HH, Park DJ, Galinsky KJ, Schaffner SF, Ndiaye D, Ndir O, Mboup S, Wiegand RC, Volkman SK, Sabeti PC, Wirth DF, Neafsey DE, Hartl DL. Genomic sequencing of Plasmodium falciparum malaria parasites from Senegal reveals the demographic history of the population. Mol Biol Evol 2012; 29:3427-39. [PMID: 22734050 DOI: 10.1093/molbev/mss161] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Malaria is a deadly disease that causes nearly one million deaths each year. To develop methods to control and eradicate malaria, it is important to understand the genetic basis of Plasmodium falciparum adaptations to antimalarial treatments and the human immune system while taking into account its demographic history. To study the demographic history and identify genes under selection more efficiently, we sequenced the complete genomes of 25 culture-adapted P. falciparum isolates from three sites in Senegal. We show that there is no significant population structure among these Senegal sampling sites. By fitting demographic models to the synonymous allele-frequency spectrum, we also estimated a major 60-fold population expansion of this parasite population ∼20,000-40,000 years ago. Using inferred demographic history as a null model for coalescent simulation, we identified candidate genes under selection, including genes identified before, such as pfcrt and PfAMA1, as well as new candidate genes. Interestingly, we also found selection against G/C to A/T changes that offsets the large mutational bias toward A/T, and two unusual patterns: similar synonymous and nonsynonymous allele-frequency spectra, and 18% of genes having a nonsynonymous-to-synonymous polymorphism ratio >1.
Collapse
Affiliation(s)
- Hsiao-Han Chang
- Department of Organismic and Evolutionary Biology, Harvard University.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
183
|
Weissman DB, Barton NH. Limits to the rate of adaptive substitution in sexual populations. PLoS Genet 2012; 8:e1002740. [PMID: 22685419 PMCID: PMC3369949 DOI: 10.1371/journal.pgen.1002740] [Citation(s) in RCA: 88] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2011] [Accepted: 04/16/2012] [Indexed: 12/31/2022] Open
Abstract
In large populations, many beneficial mutations may be simultaneously available and may compete with one another, slowing adaptation. By finding the probability of fixation of a favorable allele in a simple model of a haploid sexual population, we find limits to the rate of adaptive substitution, Λ, that depend on simple parameter combinations. When variance in fitness is low and linkage is loose, the baseline rate of substitution is Λ₀ = 2NU , where N is the population size, U is the rate of beneficial mutations per genome, and is their mean selective advantage. Heritable variance v in log fitness due to unlinked loci reduces Λ by e⁻⁴(v) under polygamy and e⁻⁸ (v) under monogamy. With a linear genetic map of length R Morgans, interference is yet stronger. We use a scaling argument to show that the density of adaptive substitutions depends on s, N, U, and R only through the baseline density: Λ/R = F (Λ₀/R). Under the approximation that the interference due to different sweeps adds up, we show that Λ/R ~(Λ₀/R) / (1 +2Λ₉/R) , implying that interference prevents the rate of adaptive substitution from exceeding one per centimorgan per 200 generations. Simulations and numerical calculations confirm the scaling argument and confirm the additive approximation for Λ₀/R ~ 1; for higher Λ₀/R , the rate of adaptation grows above R/2, but only very slowly. We also consider the effect of sweeps on neutral diversity and show that, while even occasional sweeps can greatly reduce neutral diversity, this effect saturates as sweeps become more common-diversity can be maintained even in populations experiencing very strong interference. Our results indicate that for some organisms the rate of adaptive substitution may be primarily recombination-limited, depending only weakly on the mutation supply and the strength of selection.
Collapse
Affiliation(s)
- Daniel B Weissman
- Institute of Science and Technology Austria, Klosterneuburg, Austria.
| | | |
Collapse
|
184
|
Bazykin GA, Kondrashov AS. Major role of positive selection in the evolution of conservative segments of Drosophila proteins. Proc Biol Sci 2012; 279:3409-17. [PMID: 22673359 PMCID: PMC3396909 DOI: 10.1098/rspb.2012.0776] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Slow evolution of conservative segments of coding and non-coding DNA is caused by the action of negative selection, which removes new mutations. However, the mode of selection that affects the few substitutions that do occur within such segments remains unclear. Here, we show that the fraction of allele replacements that were driven by positive selection, and the strength of this selection, is the highest within the conservative segments of Drosophila protein-coding genes. The McDonald–Kreitman test, applied to the data on variation in Drosophila melanogaster and in Drosophila simulans, indicates that within the most conservative protein segments, approximately 72 per cent (approx. 80%) of allele replacements were driven by positive selection, as opposed to only approximately 44 per cent (approx. 53%) at rapidly evolving segments. Data on multiple non-synonymous substitutions at a codon lead to the same conclusion and additionally indicate that positive selection driving allele replacements at conservative sites is the strongest, as it accelerates evolution by a factor of approximately 40, as opposed to a factor of approximately 5 at rapidly evolving sites. Thus, random drift plays only a minor role in the evolution of conservative DNA segments, and those relatively rare allele replacements that occur within such segments are mostly driven by substantial positive selection.
Collapse
Affiliation(s)
- Georgii A Bazykin
- Department of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Vorbyevy Gory 1-73, Moscow 119992, Russia
| | | |
Collapse
|
185
|
Pavlidis P, Jensen JD, Stephan W, Stamatakis A. A critical assessment of storytelling: gene ontology categories and the importance of validating genomic scans. Mol Biol Evol 2012; 29:3237-48. [PMID: 22617950 DOI: 10.1093/molbev/mss136] [Citation(s) in RCA: 159] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
In the age of whole-genome population genetics, so-called genomic scan studies often conclude with a long list of putatively selected loci. These lists are then further scrutinized to annotate these regions by gene function, corresponding biological processes, expression levels, or gene networks. Such annotations are often used to assess and/or verify the validity of the genome scan and the statistical methods that have been used to perform the analyses. Furthermore, these results are frequently considered to validate "true-positives" if the identified regions make biological sense a posteriori. Here, we show that this approach can be potentially misleading. By simulating neutral evolutionary histories, we demonstrate that it is possible not only to obtain an extremely high false-positive rate but also to make biological sense out of the false-positives and construct a sensible biological narrative. Results are compared with a recent polymorphism data set from Drosophila melanogaster.
Collapse
Affiliation(s)
- Pavlos Pavlidis
- The Exelixis Lab, Scientific Computing Group, Heidelberg Institute for Theoretical Studies (HITS gGmbH), Heidelberg, Germany.
| | | | | | | |
Collapse
|
186
|
|
187
|
Phifer-Rixey M, Bonhomme F, Boursot P, Churchill GA, Piálek J, Tucker PK, Nachman MW. Adaptive evolution and effective population size in wild house mice. Mol Biol Evol 2012; 29:2949-55. [PMID: 22490822 DOI: 10.1093/molbev/mss105] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Estimates of the proportion of amino acid substitutions that have been fixed by selection (α) vary widely among taxa, ranging from zero in humans to over 50% in Drosophila. This wide range may reflect differences in the efficacy of selection due to differences in the effective population size (N(e)). However, most comparisons have been made among distantly related organisms that differ not only in N(e) but also in many other aspects of their biology. Here, we estimate α in three closely related lineages of house mice that have a similar ecology but differ widely in N(e): Mus musculus musculus (N(e) ∼ 25,000-120,000), M. m. domesticus (N(e) ∼ 58,000-200,000), and M. m. castaneus (N(e) ∼ 200,000-733,000). Mice were genotyped using a high-density single nucleotide polymorphism array, and the proportions of replacement and silent mutations within subspecies were compared with those fixed between each subspecies and an outgroup, Mus spretus. There was significant evidence of positive selection in M. m. castaneus, the lineage with the largest N(e), with α estimated to be approximately 40%. In contrast, estimates of α for M. m. domesticus (α = 13%) and for M. m. musculus (α = 12 %) were much smaller. Interestingly, the higher estimate of α for M. m. castaneus appears to reflect not only more adaptive fixations but also more effective purifying selection. These results support the hypothesis that differences in N(e) contribute to differences among species in the efficacy of selection.
Collapse
Affiliation(s)
- Megan Phifer-Rixey
- Department of Ecology and Evolutionary Biology, University of Arizona, USA.
| | | | | | | | | | | | | |
Collapse
|
188
|
Gleixner E, Herlyn H, Zimmerling S, Burmester T, Hankeln T. Testes-specific hemoglobins in Drosophila evolved by a combination of sub- and neofunctionalization after gene duplication. BMC Evol Biol 2012; 12:34. [PMID: 22429626 PMCID: PMC3361466 DOI: 10.1186/1471-2148-12-34] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2011] [Accepted: 03/19/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND For a long time the presence of respiratory proteins in most insects has been considered unnecessary. However, in recent years it has become evident that globins belong to the standard repertoire of the insect genome. Like most other insect globins, the glob1 gene of Drosophila melanogaster displays a conserved expression pattern in the tracheae, the fat body and the Malpighian tubules. RESULTS Here we show that the recently discovered D. melanogaster globin genes glob2 and glob3 both display an unusual male-specific expression in the reproductive tract during spermatogenesis. Both paralogs are transcribed at equivalent mRNA levels and largely overlap in their cellular expression patterns during spermatogenesis. Phylogenetic analyses showed that glob2 and glob3 reflect a gene duplication event that occurred in the ancestor of the Sophophora subgenus at least 40 million years ago. Therefore, flies of the Drosophila subgenus harbor only one glob2/3-like gene. CONCLUSIONS Phylogenetic and sequence analyses indicate an evolution of the glob2 and glob3 duplicates by a combination of sub- and neofunctionalization. Considering their restricted, testes-specific expression, an involvement of both globins in alleviating oxidative stress during spermatogenesis is conceivable.
Collapse
Affiliation(s)
- Eva Gleixner
- Institute of Molecular Genetics, University of Mainz, 55099 Mainz, Germany
- Center for Systems Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Holger Herlyn
- Institute of Anthropology, University of Mainz, 55099 Mainz, Germany
| | - Stefan Zimmerling
- Institute of Molecular Genetics, University of Mainz, 55099 Mainz, Germany
| | - Thorsten Burmester
- Biocenter Grindel and Zoological Museum, University of Hamburg, 20146 Hamburg, Germany
| | - Thomas Hankeln
- Institute of Molecular Genetics, University of Mainz, 55099 Mainz, Germany
| |
Collapse
|
189
|
Young RS, Marques AC, Tibbit C, Haerty W, Bassett AR, Liu JL, Ponting CP. Identification and properties of 1,119 candidate lincRNA loci in the Drosophila melanogaster genome. Genome Biol Evol 2012; 4:427-42. [PMID: 22403033 PMCID: PMC3342871 DOI: 10.1093/gbe/evs020] [Citation(s) in RCA: 158] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The functional repertoire of long intergenic noncoding RNA (lincRNA) molecules has begun to be elucidated in mammals. Determining the biological relevance and potential gene regulatory mechanisms of these enigmatic molecules would be expedited in a more tractable model organism, such as Drosophila melanogaster. To this end, we defined a set of 1,119 putative lincRNA genes in D. melanogaster using modENCODE whole transcriptome (RNA-seq) data. A large majority (1.1 of 1.3 Mb; 85%) of these bases were not previously reported by modENCODE as being transcribed. Significant selective constraint on the sequences of these loci predicts that virtually all have sustained functionality across the Drosophila clade. We observe biases in lincRNA genomic locations and expression profiles that are consistent with some of these lincRNAs being involved in the regulation of neighboring protein-coding genes with developmental functions. We identify lincRNAs that may be important in the developing nervous system and in male-specific organs, such as the testes. LincRNA loci were also identified whose positions, relative to nearby protein-coding loci, are equivalent between D. melanogaster and mouse. This study predicts that the genomes of not only vertebrates, such as mammals, but also an invertebrate (fruit fly) harbor large numbers of lincRNA loci. Our findings now permit exploitation of Drosophila genetics for the investigation of lincRNA mechanisms, including lincRNAs with potential functional analogues in mammals.
Collapse
|
190
|
da Silva J. The dynamics of HIV-1 adaptation in early infection. Genetics 2012; 190:1087-99. [PMID: 22209906 PMCID: PMC3296244 DOI: 10.1534/genetics.111.136366] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2011] [Accepted: 12/18/2011] [Indexed: 11/18/2022] Open
Abstract
Human immunodeficiency virus type 1 (HIV-1) undergoes a severe population bottleneck during sexual transmission and yet adapts extremely rapidly to the earliest immune responses. The bottleneck has been inferred to typically consist of a single genome, and typically eight amino acid mutations in viral proteins spread to fixation by the end of the early chronic phase of infection in response to selection by CD8(+) T cells. Stochastic simulation was used to examine the effects of the transmission bottleneck and of potential interference among spreading immune-escape mutations on the adaptive dynamics of the virus in early infection. If major viral population genetic parameters are assigned realistic values that permit rapid adaptive evolution, then a bottleneck of a single genome is not inconsistent with the observed pattern of adaptive fixations. One requirement is strong selection by CD8(+) T cells that decreases over time. Such selection may reduce effective population sizes at linked loci through genetic hitchhiking. However, this effect is predicted to be minor in early infection because the transmission bottleneck reduces the effective population size to such an extent that the resulting strong selection and weak mutation cause beneficial mutations to fix sequentially and thus avoid interference.
Collapse
Affiliation(s)
- Jack da Silva
- School of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia.
| |
Collapse
|
191
|
The role of background selection in shaping patterns of molecular evolution and variation: evidence from variability on the Drosophila X chromosome. Genetics 2012; 191:233-46. [PMID: 22377629 DOI: 10.1534/genetics.111.138073] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In the putatively ancestral population of Drosophila melanogaster, the ratio of silent DNA sequence diversity for X-linked loci to that for autosomal loci is approximately one, instead of the expected "null" value of 3/4. One possible explanation is that background selection (the hitchhiking effect of deleterious mutations) is more effective on the autosomes than on the X chromosome, because of the lack of crossing over in male Drosophila. The expected effects of background selection on neutral variability at sites in the middle of an X chromosome or an autosomal arm were calculated for different models of chromosome organization and methods of approximation, using current estimates of the deleterious mutation rate and distributions of the fitness effects of deleterious mutations. The robustness of the results to different distributions of fitness effects, dominance coefficients, mutation rates, mapping functions, and chromosome size was investigated. The predicted ratio of X-linked to autosomal variability is relatively insensitive to these variables, except for the mutation rate and map length. Provided that the deleterious mutation rate per genome is sufficiently large, it seems likely that background selection can account for the observed X to autosome ratio of variability in the ancestral population of D. melanogaster. The fact that this ratio is much less than one in D. pseudoobscura is also consistent with the model's predictions, since this species has a high rate of crossing over. The results suggest that background selection may play a major role in shaping patterns of molecular evolution and variation.
Collapse
|
192
|
Abstract
A major challenge of biology is understanding the relationship between molecular genetic variation and variation in quantitative traits, including fitness. This relationship determines our ability to predict phenotypes from genotypes and to understand how evolutionary forces shape variation within and between species. Previous efforts to dissect the genotype-phenotype map were based on incomplete genotypic information. Here, we describe the Drosophila melanogaster Genetic Reference Panel (DGRP), a community resource for analysis of population genomics and quantitative traits. The DGRP consists of fully sequenced inbred lines derived from a natural population. Population genomic analyses reveal reduced polymorphism in centromeric autosomal regions and the X chromosome, evidence for positive and negative selection, and rapid evolution of the X chromosome. Many variants in novel genes, most at low frequency, are associated with quantitative traits and explain a large fraction of the phenotypic variance. The DGRP facilitates genotype-phenotype mapping using the power of Drosophila genetics.
Collapse
|
193
|
Grath S, Parsch J. Rate of amino acid substitution is influenced by the degree and conservation of male-biased transcription over 50 myr of Drosophila evolution. Genome Biol Evol 2012; 4:346-59. [PMID: 22321769 PMCID: PMC3318448 DOI: 10.1093/gbe/evs012] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/03/2012] [Indexed: 12/18/2022] Open
Abstract
Sex-biased gene expression (i.e., the differential expression of genes between males and females) is common among sexually reproducing species. However, genes often differ in their sex-bias classification or degree of sex bias between species. There is also an unequal distribution of sex-biased genes (especially male-biased genes) between the X chromosome and the autosomes. We used whole-genome expression data and evolutionary rate estimates for two different Drosophilid lineages, melanogaster and obscura, spanning an evolutionary time scale of around 50 Myr to investigate the influence of sex-biased gene expression and chromosomal location on the rate of molecular evolution. In both lineages, the rate of protein evolution correlated positively with the male/female expression ratio. Genes with highly male-biased expression, genes expressed specifically in male reproductive tissues, and genes with conserved male-biased expression over long evolutionary time scales showed the fastest rates of evolution. An analysis of sex-biased gene evolution in both lineages revealed evidence for a "fast-X" effect in which the rate of evolution was greater for X-linked than for autosomal genes. This pattern was particularly pronounced for male-biased genes. Genes located on the obscura "neo-X" chromosome, which originated from a recent X-autosome fusion, showed rates of evolution that were intermediate between genes located on the ancestral X-chromosome and the autosomes. This suggests that the shift to X-linkage led to an increase in the rate of molecular evolution.
Collapse
Affiliation(s)
- Sonja Grath
- Institute for Evolution and Biodiversity, University of Muenster (WWU), Germany.
| | | |
Collapse
|
194
|
Carneiro M, Albert FW, Melo-Ferreira J, Galtier N, Gayral P, Blanco-Aguiar JA, Villafuerte R, Nachman MW, Ferrand N. Evidence for widespread positive and purifying selection across the European rabbit (Oryctolagus cuniculus) genome. Mol Biol Evol 2012; 29:1837-49. [PMID: 22319161 DOI: 10.1093/molbev/mss025] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The nearly neutral theory of molecular evolution predicts that the efficacy of both positive and purifying selection is a function of the long-term effective population size (N(e)) of a species. Under this theory, the efficacy of natural selection should increase with N(e). Here, we tested this simple prediction by surveying ~1.5 to 1.8 Mb of protein coding sequence in the two subspecies of the European rabbit (Oryctolagus cuniculus algirus and O. c. cuniculus), a mammal species characterized by high levels of nucleotide diversity and N(e) estimates for each subspecies on the order of 1 × 10(6). When the segregation of slightly deleterious mutations and demographic effects were taken into account, we inferred that >60% of amino acid substitutions on the autosomes were driven to fixation by positive selection. Moreover, we inferred that a small fraction of new amino acid mutations (<4%) are effectively neutral (defined as 0 < N(e)s < 1) and that this fraction was negatively correlated with a gene's expression level. Consistent with models of recurrent adaptive evolution, we detected a negative correlation between levels of synonymous site polymorphism and the rate of protein evolution, although the correlation was weak and nonsignificant. No systematic X chromosome-autosome difference was found in the efficacy of selection. For example, the proportion of adaptive substitutions was significantly higher on the X chromosome compared with the autosomes in O. c. algirus but not in O. c. cuniculus. Our findings support widespread positive and purifying selection in rabbits and add to a growing list of examples suggesting that differences in N(e) among taxa play a substantial role in determining rates and patterns of protein evolution.
Collapse
Affiliation(s)
- Miguel Carneiro
- Centro de Investigação em Biodiversidade e Recursos Genéticos, Vairão, Portugal.
| | | | | | | | | | | | | | | | | |
Collapse
|
195
|
Wills C. Rapid recent human evolution and the accumulation of balanced genetic polymorphisms. High Alt Med Biol 2012; 12:149-55. [PMID: 21718163 DOI: 10.1089/ham.2010.1089] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
All evolutionary change can be traced to alterations in allele frequencies in populations over time. DNA sequencing on a massive scale now permits us to follow the genetic consequences as our species has diverged from our close relatives and as we have colonized different parts of the world and adapted to them. But it has been difficult to disentangle natural selection from many other factors that alter frequencies. These factors include mutation and intragenic reciprocal recombination, gene conversion, segregation distortion, random drift, and gene flow between populations (these last two are greatly influenced by splits and coalescences of populations over time). The first part of this review examines recent studies that have had some success in dissecting out the role of natural selection, especially in humans and Drosophila. Among many examples, these studies include those that have followed the rapid evolution of traits that may permit adaptation to high altitude in Tibetan and Andean populations. In some cases, directional selection has been so strong that it may have swept alleles close to fixation in the span of a few thousand years, a rapidity of change that is also sometimes encountered in other organisms. The second part of the review summarizes data showing that remarkably few alleles have been carried completely to fixation during our recent evolution. Some of the alleles that have not reached fixation may be approaching new internal equilibria, which would indicate polymorphisms that are maintained by balancing selection. Finally, the review briefly examines why genetic polymorphisms, particularly those that are maintained by negative frequency dependence, are likely to have played an important role in the evolution of our species. A method is suggested for measuring the contribution of these polymorphisms to our gene pool. Such polymorphisms may add to the ability of our species to adapt to our increasingly complex and challenging environment.
Collapse
Affiliation(s)
- Christopher Wills
- University of California, San Diego, Biological Sciences Department, 9500 Gilman Drive, La Jolla, CA 92093-0116, USA.
| |
Collapse
|
196
|
Inter- and intraspecific variation in Drosophila genes with sex-biased expression. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:963976. [PMID: 22315698 PMCID: PMC3270394 DOI: 10.1155/2012/963976] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Revised: 10/06/2011] [Accepted: 10/06/2011] [Indexed: 02/04/2023]
Abstract
Genes with sexually dimorphic expression (sex-biased genes) often evolve rapidly and are thought to make an important contribution to reproductive isolation between species. We examined the molecular evolution of sex-biased genes in Drosophila melanogaster and D. ananassae, which represent two independent lineages within the melanogaster group. We find that strong purifying selection limits protein sequence variation within species, but that a considerable fraction of divergence between species can be attributed to positive selection. In D. melanogaster, the proportion of adaptive substitutions between species is greatest for male-biased genes and is especially high for those on the X chromosome. In contrast, male-biased genes do not show unusually high variation within or between populations. A similar pattern is seen at the level of gene expression, where sex-biased genes show high expression divergence between species, but low divergence between populations. In D. ananassae, there is no increased rate of adaptation of male-biased genes, suggesting that the type or strength of selection acting on sex-biased genes differs between lineages.
Collapse
|
197
|
Crisci JL, Poh YP, Bean A, Simkin A, Jensen JD. Recent progress in polymorphism-based population genetic inference. ACTA ACUST UNITED AC 2012; 103:287-96. [PMID: 22246406 DOI: 10.1093/jhered/esr128] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The recent availability of whole-genome sequencing data affords tremendous power for statistical inference. With this, there has been great interest in the development of polymorphism-based approaches for the estimation of population genetic parameters. These approaches seek to estimate, for example, recently fixed or sweeping beneficial mutations, the rate of recurrent positive selection, the distribution of selection coefficients, and the demographic history of the population. Yet despite estimating similar parameters using similar data sets, results between methodologies are far from consistent. We here summarize the current state of the field, compare existing approaches, and attempt to reconcile emerging discrepancies. We also discuss the biases in selection estimators introduced by ignoring the demographic history of the population, discuss the biases in demographic estimators introduced by assuming neutrality, and highlight the important challenge to the field of achieving a true joint estimation procedure to circumvent these confounding effects.
Collapse
Affiliation(s)
- Jessica L Crisci
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | | | | | | | | |
Collapse
|
198
|
Abstract
Vast tracts of noncoding DNA contain elements that regulate gene expression in higher eukaryotes. Describing these regulatory elements and understanding how they evolve represent major challenges for biologists. Advances in the ability to survey genome-scale DNA sequence data are providing unprecedented opportunities to use evolutionary models and computational tools to identify functionally important elements and the mode of selection acting on them in multiple species. This chapter reviews some of the current methods that have been developed and applied on noncoding DNA, what they have shown us, and how they are limited. Results of several recent studies reveal that a significantly larger fraction of noncoding DNA in eukaryotic organisms is likely to be functional than previously believed, implying that the functional annotation of most noncoding DNA in these organisms is largely incomplete. In Drosophila, recent studies have further suggested that a large fraction of noncoding DNA divergence observed between species may be the product of recurrent adaptive substitution. Similar studies in humans have revealed a more complex pattern, with signatures of recurrent positive selection being largely concentrated in conserved noncoding DNA elements. Understanding these patterns and the extent to which they generalize to other organisms awaits the analysis of forthcoming genome-scale polymorphism and divergence data from more species.
Collapse
Affiliation(s)
- Ying Zhen
- Department of Ecology and Evolutionary Biology, The Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | | |
Collapse
|
199
|
Charlesworth B. The effects of deleterious mutations on evolution at linked sites. Genetics 2012; 190:5-22. [PMID: 22219506 PMCID: PMC3249359 DOI: 10.1534/genetics.111.134288] [Citation(s) in RCA: 215] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2011] [Accepted: 11/04/2011] [Indexed: 01/14/2023] Open
Abstract
The process of evolution at a given site in the genome can be influenced by the action of selection at other sites, especially when these are closely linked to it. Such selection reduces the effective population size experienced by the site in question (the Hill-Robertson effect), reducing the level of variability and the efficacy of selection. In particular, deleterious variants are continually being produced by mutation and then eliminated by selection at sites throughout the genome. The resulting reduction in variability at linked neutral or nearly neutral sites can be predicted from the theory of background selection, which assumes that deleterious mutations have such large effects that their behavior in the population is effectively deterministic. More weakly selected mutations can accumulate by Muller's ratchet after a shutdown of recombination, as in an evolving Y chromosome. Many functionally significant sites are probably so weakly selected that Hill-Robertson interference undermines the effective strength of selection upon them, when recombination is rare or absent. This leads to large departures from deterministic equilibrium and smaller effects on linked neutral sites than under background selection or Muller's ratchet. Evidence is discussed that is consistent with the action of these processes in shaping genome-wide patterns of variation and evolution.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| |
Collapse
|
200
|
Callahan BJ. The length scale of selection in protein evolution. Fly (Austin) 2011; 6:16-20. [PMID: 22198524 DOI: 10.4161/fly.18305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Central to the study of molecular evolution, and an area of long-standing debate, is the appropriate model for the fitness landscape of proteins. Much of this debate has focused on the strength and frequency of positive and purifying selection, but the form and frequency of selective correlations is also a vital element. The constituent amino acids within a protein generically interact and share selective pressures in predictable ways, which conflicts with the selective independence assumed by common caricatures of the fitness landscape. Here, I discuss a recent study by myself and coauthors that used whole-genome comparisons of orthologous molecular sequences from closely related Drosophilids to explore the form of the selective correlations and selective interactions (epistasis) between the amino acids within a protein. I outline our results and highlight our finding of a selective length scale of ten amino acids within which individual amino acids are substantially and generically more likely to share selective pressures and interact epistatically. I then focus on the evidence presented in our study supporting a substantial role for epistasis in the process of molecular evolution, and discuss further the implications of this widespread epistasis on the overdispersion of the molecular clock and the efficacy of common tests for positive selection.
Collapse
|