101
|
Wallis LJ, Wallis GP. Extreme positive selection on a new highly-expressed larval glycoprotein (LGP) gene in Galaxias fishes (Osmeriformes: Galaxiidae). Mol Biol Evol 2010; 28:399-406. [PMID: 20696791 DOI: 10.1093/molbev/msq208] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We describe the intron-exon structure and DNA/protein sequences of a new larval glycoprotein (LGP) gene from nine species of galaxiid fish. The gene has a distant similarity to Danio THP (Tamm-Horsfall urinary glycoprotein; uromodulin) and cichlid SPP120 (seminal plasma glycoprotein) due to conserved features of its zona pellucida (ZP) domain, including eight highly conserved cysteines and a consensus furin cleavage site. Using a combination of 454 sequencing of cDNA and exon-primed intron-spanning sequencing of genomic DNA, we obtained full sequences of the coding region (996 bp) and its intervening sequences (1,459 bp). LGP shows an exceptionally strong signal of positive selection over the entire coding region, as evidenced by d(N)/d(S) values >1. Across nine species of Galaxias, 87/332 (26%) amino acid residues are variable, compared with 9/386 (2%) for mitochondrial cytochrome b (cytb) in the same group of species. Across 36 interspecific pairwise comparisons, genetic distances are in all cases larger for coding region than for introns, by a factor of 2.4-fold on average. Reading frame, gene structure, splice sites, and many ZP motifs are conserved across all species. Together with the fact that the gene is expressed in all species, these results argue clearly against the possibility of a pseudogene. We show by 454 sequencing and quantitative polymerase chain reaction that the transcript is abundant (ca. 0.5%) in newly hatched larvae and appears to be almost absent from a range of adult tissues. We postulate that the strong Darwinian evolution exhibited by this protein may reflect some type of immunoprotection at this vulnerable larval stage.
Collapse
Affiliation(s)
- Lise J Wallis
- Department of Zoology, University of Otago, Dunedin, New Zealand.
| | | |
Collapse
|
102
|
Altrock PM, Gokhale CS, Traulsen A. Stochastic slowdown in evolutionary processes. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2010; 82:011925. [PMID: 20866666 DOI: 10.1103/physreve.82.011925] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2010] [Revised: 07/12/2010] [Indexed: 05/29/2023]
Abstract
We examine birth-death processes with state dependent transition probabilities and at least one absorbing boundary. In evolution, this describes selection acting on two different types in a finite population where reproductive events occur successively. If the two types have equal fitness the system performs a random walk. If one type has a fitness advantage it is favored by selection, which introduces a bias (asymmetry) in the transition probabilities. How long does it take until advantageous mutants have invaded and taken over? Surprisingly, we find that the average time of such a process can increase, even if the mutant type always has a fitness advantage. We discuss this finding for the Moran process and develop a simplified model which allows a more intuitive understanding. We show that this effect can occur for weak but nonvanishing bias (selection) in the state dependent transition rates and infer the scaling with system size. We also address the Wright-Fisher model commonly used in population genetics, which shows that this stochastic slowdown is not restricted to birth-death processes.
Collapse
Affiliation(s)
- Philipp M Altrock
- Emmy-Noether Group for Evolutionary Dynamics, Department of Evolutionary Ecology, Max-Planck-Institute for Evolutionary Biology, August-Thienemann-Str 2, D-24306 Plön, Germany.
| | | | | |
Collapse
|
103
|
Protein coding gene nucleotide substitution pattern in the apicomplexan protozoa Cryptosporidium parvum and Cryptosporidium hominis. Comp Funct Genomics 2010:879023. [PMID: 18551181 PMCID: PMC2413048 DOI: 10.1155/2008/879023] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2007] [Accepted: 03/29/2008] [Indexed: 01/06/2023] Open
Abstract
Cryptosporidium parvum and C. hominis are related protozoan pathogens which infect the intestinal epithelium of humans and other vertebrates. To explore the evolution of these parasites, and identify genes under positive selection, we performed a pairwise whole-genome comparison between all orthologous protein coding genes in C. parvum and C. hominis. Genome-wide calculation of the ratio of nonsynonymous versus synonymous nucleotide substitutions (dN/dS) was performed to detect the impact of positive and purifying selection. Of 2465 pairs of orthologous genes, a total of 27 (1.1%) showed a high ratio of nonsynonymous substitutions, consistent with positive selection. A majority of these genes were annotated as hypothetical proteins. In addition, proteins with transmembrane and signal peptide domains are significantly more frequent in the high dN/dS group.
Collapse
|
104
|
Estimating the parameters of selection on nonsynonymous mutations in Drosophila pseudoobscura and D. miranda. Genetics 2010; 185:1381-96. [PMID: 20516497 DOI: 10.1534/genetics.110.117614] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present the results of surveys of diversity in sets of >40 X-linked and autosomal loci in samples from natural populations of Drosophila miranda and D. pseudoobscura, together with their sequence divergence from D. affinis. Mean silent site diversity in D. miranda is approximately one-quarter of that in D. pseudoobscura; mean X-linked silent diversity is about three-quarters of that for the autosomes in both species. Estimates of the distribution of selection coefficients against heterozygous, deleterious nonsynonymous mutations from two different methods suggest a wide distribution, with coefficients of variation greater than one, and with the average segregating amino acid mutation being subject to only very weak selection. Only a small fraction of new amino acid mutations behave as effectively neutral, however. A large fraction of amino acid differences between D. pseudoobscura and D. affinis appear to have been fixed by positive natural selection, using three different methods of estimation; estimates between D. miranda and D. affinis are more equivocal. Sources of bias in the estimates, especially those arising from selection on synonymous mutations and from the choice of genes, are discussed and corrections for these applied. Overall, the results show that both purifying selection and positive selection on nonsynonymous mutations are pervasive.
Collapse
|
105
|
Gossmann TI, Song BH, Windsor AJ, Mitchell-Olds T, Dixon CJ, Kapralov MV, Filatov DA, Eyre-Walker A. Genome wide analyses reveal little evidence for adaptive evolution in many plant species. Mol Biol Evol 2010; 27:1822-32. [PMID: 20299543 DOI: 10.1093/molbev/msq079] [Citation(s) in RCA: 178] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The relative contribution of advantageous and neutral mutations to the evolutionary process is a central problem in evolutionary biology. Current estimates suggest that whereas Drosophila, mice, and bacteria have undergone extensive adaptive evolution, hominids show little or no evidence of adaptive evolution in protein-coding sequences. This may be a consequence of differences in effective population size. To study the matter further, we have investigated whether plants show evidence of adaptive evolution using an extension of the McDonald-Kreitman test that explicitly models slightly deleterious mutations by estimating the distribution of fitness effects of new mutations. We apply this method to data from nine pairs of species. Altogether more than 2,400 loci with an average length of approximately 280 nucleotides were analyzed. We observe very similar results in all species; we find little evidence of adaptive amino acid substitution in any comparison except sunflowers. This may be because many plant species have modest effective population sizes.
Collapse
Affiliation(s)
- Toni I Gossmann
- Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
106
|
Klaunig JE, Kamendulis LM, Hocevar BA. Oxidative stress and oxidative damage in carcinogenesis. Toxicol Pathol 2009; 38:96-109. [PMID: 20019356 DOI: 10.1177/0192623309356453] [Citation(s) in RCA: 618] [Impact Index Per Article: 41.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Carcinogenesis is a multistep process involving mutation and the subsequent selective clonal expansion of the mutated cell. Chemical and physical agents including those that induce reative oxygen species can induce and/or modulate this multistep process. Several modes of action by which carcinogens induce cancer have been identified, including through production of reactive oxygen species (ROS). Oxidative damage to cellular macromolecules can arise through overproduction of ROS and faulty antioxidant and/or DNA repair mechanisms. In addition, ROS can stimulate signal transduction pathways and lead to activation of key transcription factors such as Nrf2 and NF-kappaB. The resultant altered gene expression patterns evoked by ROS contribute to the carcinogenesis process. Recent evidence demonstrates an association between a number of single nucleotide polymorphisms (SNPs) in oxidative DNA repair genes and antioxidant genes with human cancer susceptibility. These aspects of ROS biology will be discussed in the context of their relationship to carcinogenesis.
Collapse
Affiliation(s)
- James E Klaunig
- Indiana University School of Medicine, Indianapolis, IN 46202, USA.
| | | | | |
Collapse
|
107
|
Flowers JM, Hanzawa Y, Hall MC, Moore RC, Purugganan MD. Population Genomics of the Arabidopsis thaliana Flowering Time Gene Network. Mol Biol Evol 2009; 26:2475-86. [DOI: 10.1093/molbev/msp161] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
|
108
|
Abstract
Each human carries a large number of deleterious mutations. Together, these mutations make a significant contribution to human disease. Identification of deleterious mutations within individual genome sequences could substantially impact an individual's health through personalized prevention and treatment of disease. Yet, distinguishing deleterious mutations from the massive number of nonfunctional variants that occur within a single genome is a considerable challenge. Using a comparative genomics data set of 32 vertebrate species we show that a likelihood ratio test (LRT) can accurately identify a subset of deleterious mutations that disrupt highly conserved amino acids within protein-coding sequences, which are likely to be unconditionally deleterious. The LRT is also able to identify known human disease alleles and performs as well as two commonly used heuristic methods, SIFT and PolyPhen. Application of the LRT to three human genomes reveals 796-837 deleterious mutations per individual, approximately 40% of which are estimated to be at <5% allele frequency. However, the overlap between predictions made by the LRT, SIFT, and PolyPhen, is low; 76% of predictions are unique to one of the three methods, and only 5% of predictions are shared across all three methods. Our results indicate that only a small subset of deleterious mutations can be reliably identified, but that this subset provides the raw material for personalized medicine.
Collapse
|
109
|
González J, Petrov DA. The adaptive role of transposable elements in the Drosophila genome. Gene 2009; 448:124-33. [PMID: 19555747 DOI: 10.1016/j.gene.2009.06.008] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2009] [Revised: 06/11/2009] [Accepted: 06/12/2009] [Indexed: 11/25/2022]
Abstract
Transposable elements (TEs) are short DNA sequences with the capacity to move between different sites in the genome. This ability provides them with the capacity to mutate the genome in many different ways, from subtle regulatory mutations to gross genomic rearrangements. The potential adaptive significance of TEs was recognized by those involved in their initial discovery although it was hotly debated afterwards. For more than two decades, TEs were considered to be intragenomic parasites leading to almost exclusively detrimental effects to the host genome. The sequencing of the Drosophila melanogaster genome provided an unprecedented opportunity to study TEs and led to the identification of the first TE-induced adaptations in this species. These studies were followed by a systematic genome-wide search for adaptive insertions that allowed for the first time to infer that TEs contribute substantially to adaptive evolution. This study also revealed that there are at least twice as many TE-induced adaptations that remain to be identified. To gain a better understanding of the adaptive role of TEs in the genome we clearly need to (i) identify as many adaptive TEs as possible in a range of Drosophila species as well as (ii) carry out in-depth investigations of the effects of adaptive TEs on as many phenotypes as possible.
Collapse
Affiliation(s)
- Josefa González
- Department of Biology, 371 Serra St. Stanford University, Stanford, CA 94305-3020, USA.
| | | |
Collapse
|
110
|
Measuring the rates of spontaneous mutation from deep and large-scale polymorphism data. Genetics 2009; 182:1219-32. [PMID: 19528323 DOI: 10.1534/genetics.109.105692] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The rates and patterns of spontaneous mutation are fundamental parameters of molecular evolution. Current methodology either tries to measure such rates and patterns directly in mutation-accumulation experiments or tries to infer them indirectly from levels of divergence or polymorphism. While experimental approaches are constrained by the low rate at which new mutations occur, indirect approaches suffer from their underlying assumption that mutations are effectively neutral. Here I present a maximum-likelihood approach to estimate mutation rates from large-scale polymorphism data. It is demonstrated that the method is not sensitive to demography and the distribution of selection coefficients among mutations when applied to mutations at sufficiently low population frequencies. With the many large-scale sequencing projects currently underway, for instance, the 1000 genomes project in humans, plenty of the required low-frequency polymorphism data will shortly become available. My method will allow for an accurate and unbiased inference of mutation rates and patterns from such data sets at high spatial resolution. I discuss how the assessment of several long-standing problems of evolutionary biology would benefit from the availability of accurate mutation rate estimates.
Collapse
|
111
|
Abstract
Over the past four decades, the predominant view of molecular evolution saw little connection between natural selection and genome evolution, assuming that the functionally constrained fraction of the genome is relatively small and that adaptation is sufficiently infrequent to play little role in shaping patterns of variation within and even between species. Recent evidence from Drosophila, reviewed here, suggests that this view may be invalid. Analyses of genetic variation within and between species reveal that much of the Drosophila genome is under purifying selection, and thus of functional importance, and that a large fraction of coding and noncoding differences between species are adaptive. The findings further indicate that, in Drosophila, adaptations may be both common and strong enough that the fate of neutral mutations depends on their chance linkage to adaptive mutations as much as on the vagaries of genetic drift. The emerging evidence has implications for a wide variety of fields, from conservation genetics to bioinformatics, and presents challenges to modelers and experimentalists alike.
Collapse
|
112
|
Strasburg JL, Scotti-Saintagne C, Scotti I, Lai Z, Rieseberg LH. Genomic patterns of adaptive divergence between chromosomally differentiated sunflower species. Mol Biol Evol 2009; 26:1341-55. [PMID: 19276154 PMCID: PMC2727376 DOI: 10.1093/molbev/msp043] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/05/2009] [Indexed: 01/13/2023] Open
Abstract
Understanding the genetic mechanisms of speciation and basis of species differences is among the most important challenges in evolutionary biology. Two questions of particular interest are what roles divergent selection and chromosomal differentiation play in these processes. A number of recently proposed theories argue that chromosomal rearrangements can facilitate the development and maintenance of reproductive isolation and species differences by suppressing recombination within rearranged regions. Reduced recombination permits the accumulation of alleles contributing to isolation and adaptive differentiation and protects existing differences from the homogenizing effects of introgression between incipient species. Here, we examine patterns of genetic diversity and divergence in rearranged versus collinear regions in two widespread, extensively hybridizing sunflower species, Helianthus annuus and Helianthus petiolaris, using sequence data from 77 loci distributed throughout the genomes of the two species. We find weak evidence for increased genetic divergence near chromosomal break points but not within rearranged regions overall. We find no evidence for increased rates of adaptive divergence on rearranged chromosomes; in fact, collinear chromosomes show a far greater excess of fixed amino acid differences between the two species. A comparison with a third sunflower species indicates that much of the nonsynonymous divergence between H. annuus and H. petiolaris probably occurred during or soon after their formation. Our results suggest a limited role for chromosomal rearrangements in genetic divergence, but they do document substantial adaptive divergence and provide further evidence of how species integrity and genetic identity can be maintained at many loci in the face of extensive hybridization and gene flow.
Collapse
Affiliation(s)
- Jared L Strasburg
- Department of Biology, Indiana University, Bloomington, Indiana, USA.
| | | | | | | | | |
Collapse
|
113
|
González J, Macpherson JM, Petrov DA. A recent adaptive transposable element insertion near highly conserved developmental loci in Drosophila melanogaster. Mol Biol Evol 2009; 26:1949-61. [PMID: 19458110 DOI: 10.1093/molbev/msp107] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A recent genomewide screen identified 13 transposable elements that are likely to have been adaptive during or after the spread of Drosophila melanogaster out of Africa. One of these insertions, Bari-Juvenile hormone epoxy hydrolase (Bari-Jheh), was associated with the selective sweep of its flanking neutral variation and with reduction of expression of one of its neighboring genes: Jheh3. Here, we provide further evidence that Bari-Jheh insertion is adaptive. We delimit the extent of the selective sweep and show that Bari-Jheh is the only mutation linked to the sweep. Bari-Jheh also lowers the expression of its other flanking gene, Jheh2. Subtle consequences of Bari-Jheh insertion on life-history traits are consistent with the effects of reduced expression of the Jheh genes. Finally, we analyze molecular evolution of Jheh genes in both the long- and the short-term and conclude that Bari-Jheh appears to be a very rare adaptive event in the history of these genes. We discuss the implications of these findings for the detection and understanding of adaptation.
Collapse
|
114
|
Jakubowska A, Korona R. Lack of evolutionary conservation at positions important for thermal stability in the yeast ODCase protein. Mol Biol Evol 2009; 26:1431-4. [PMID: 19349645 DOI: 10.1093/molbev/msp066] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Mutations destabilizing the spatial structure of proteins can persist in populations if they are fixed by drift or compensated by other mutations. The prevalence and dynamics of these processes remain largely unrecognized. A suitable target to screen for both deleterious and compensatory mutations is the URA3 gene in yeast. We identified 13 positions in which a single missense substitution causes substantially strong thermal sensitivity. We then applied mild mutagenesis resulting in roughly one base substitution per gene and found that only reversions to an original amino acid can compensate for the thermal instability. However, the 13 positions are not visibly conserved across 53 species of Ascomycota, despite that the gene product is an enzyme of stable function and high efficiency. This shows how much fitness penalties for amino acid substitutions are background dependent, underscoring the role of complex intragenic interactions in the evolution of proteins.
Collapse
|
115
|
Ellegren H. A selection model of molecular evolution incorporating the effective population size. Evolution 2009; 63:301-5. [PMID: 19215289 DOI: 10.1111/j.1558-5646.2008.00560.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden.
| |
Collapse
|
116
|
Hurst LD. Fundamental concepts in genetics: genetics and the understanding of selection. Nat Rev Genet 2009; 10:83-93. [PMID: 19119264 DOI: 10.1038/nrg2506] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
At first sight selection is a simple notion, and some consider it the most important evolutionary force. But how important is selection, is it really so trivial to understand and what are the alternatives? Here I discuss how genetics is crucial for addressing all of these questions: genetics allowed the concept of natural selection to become viable, it contributed to our understanding of the complexities of selection and it spurred the development of competing models of evolution. Understanding how and why selection acts has important potential applications, from understanding the mechanisms of disease and microbial resistance, to improving the design of transgenes and drugs.
Collapse
Affiliation(s)
- Laurence D Hurst
- Department of Biology and Biochemistry, University of Bath, Somerset, BA2 7AY, UK.
| |
Collapse
|
117
|
Mustonen V, Lässig M. From fitness landscapes to seascapes: non-equilibrium dynamics of selection and adaptation. Trends Genet 2009; 25:111-9. [PMID: 19232770 DOI: 10.1016/j.tig.2009.01.002] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2008] [Revised: 01/01/2009] [Accepted: 01/07/2009] [Indexed: 10/21/2022]
Abstract
Evolution is a quest for innovation. Organisms adapt to changing natural selection by evolving new phenotypes. Can we read this dynamics in their genomes? Not every mutation under positive selection responds to a change in selection: beneficial changes also occur at evolutionary equilibrium, repairing previous deleterious changes and restoring existing functions. Adaptation, by contrast, is viewed here as a non-equilibrium phenomenon: the genomic response to time-dependent selection. Our approach extends the static concept of fitness landscapes to dynamic fitness seascapes. It shows that adaptation requires a surplus of beneficial substitutions over deleterious ones. Here, we focus on the evolution of yeast and Drosophila genomes, providing examples where adaptive evolution can and cannot be inferred, despite the presence of positive selection.
Collapse
Affiliation(s)
- Ville Mustonen
- Institut für Theoretische Physik, Universität zu Köln, Zülpicherstrasse 77, 50937 Köln, Germany.
| | | |
Collapse
|
118
|
|
119
|
Pervasive hitchhiking at coding and regulatory sites in humans. PLoS Genet 2009; 5:e1000336. [PMID: 19148272 PMCID: PMC2613029 DOI: 10.1371/journal.pgen.1000336] [Citation(s) in RCA: 101] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2008] [Accepted: 12/11/2008] [Indexed: 01/28/2023] Open
Abstract
Much effort and interest have focused on assessing the importance of natural selection, particularly positive natural selection, in shaping the human genome. Although scans for positive selection have identified candidate loci that may be associated with positive selection in humans, such scans do not indicate whether adaptation is frequent in general in humans. Studies based on the reasoning of the MacDonald-Kreitman test, which, in principle, can be used to evaluate the extent of positive selection, suggested that adaptation is detectable in the human genome but that it is less common than in Drosophila or Escherichia coli. Both positive and purifying natural selection at functional sites should affect levels and patterns of polymorphism at linked nonfunctional sites. Here, we search for these effects by analyzing patterns of neutral polymorphism in humans in relation to the rates of recombination, functional density, and functional divergence with chimpanzees. We find that the levels of neutral polymorphism are lower in the regions of lower recombination and in the regions of higher functional density or divergence. These correlations persist after controlling for the variation in GC content, density of simple repeats, selective constraint, mutation rate, and depth of sequencing coverage. We argue that these results are most plausibly explained by the effects of natural selection at functional sites -- either recurrent selective sweeps or background selection -- on the levels of linked neutral polymorphism. Natural selection at both coding and regulatory sites appears to affect linked neutral polymorphism, reducing neutral polymorphism by 6% genome-wide and by 11% in the gene-rich half of the human genome. These findings suggest that the effects of natural selection at linked sites cannot be ignored in the study of neutral human polymorphism.
Collapse
|
120
|
Abstract
Human genes responsible for inherited diseases are important for the understanding of human disease. We investigated the degree of polymorphism and divergence in the human disease genes to elucidate the effect of natural selection on human disease genes. In particular, the effect of disease dominance was incorporated into the analysis. Both dominant disease genes (DDG) and recessive disease genes (RDG) had a higher mutation rate per site and encoded longer proteins than the nondisease genes, which exposed the disease genes to a faster flux of new mutations. Using an unbiased polymorphism dataset, we found that, proportionally, RDG harbor more nonsynonymous polymorphisms compared with DDG. We estimated the selection intensity on the disease genes using polymorphism and divergence data and determined whether the different patterns of polymorphism and divergence between DDG and RDG could be explained by the difference in only dominance. Even after the dominance effect was considered, the selection intensity on RDG was significantly different from DDG, suggesting that the deleterious effect of the dominant and recessive disease mutations are fundamentally different.
Collapse
|
121
|
Abstract
In recent years, the importance of horizontal gene transfer (HGT) in bacterial evolution has been elevated to such a degree that many bacteriologists now question the very existence of bacterial species. If gene transfer is as rampant as comparative genomic studies have suggested, how could bacterial species survive such genomic fluidity? And yet, most bacteriologists recognize, and name, as species, clusters of bacterial isolates that share complex phenotypic properties. The Core Genome Hypothesis (CGH) has been proposed to explain this apparent paradox of fluid bacterial genomes associated with stable phenotypic clusters. It posits that there is a core of genes responsible for maintaining the species-specific phenotypic clusters observed throughout bacterial diversity and argues that, even in the face of substantial genomic fluidity, bacterial species can be rationally identified and named.
Collapse
Affiliation(s)
- Margaret A Riley
- Department of Biology, University of Massachusetts, Amherst, MA, USA
| | | |
Collapse
|
122
|
Petit N, Barbadilla A. Selection efficiency and effective population size in Drosophila species. J Evol Biol 2008; 22:515-26. [PMID: 19170822 DOI: 10.1111/j.1420-9101.2008.01672.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
A corollary of the nearly neutral theory of molecular evolution is that the efficiency of natural selection depends on effective population size. In this study, we evaluated the differences in levels of synonymous polymorphism among Drosophila species and showed that these differences can be explained by differences in effective population size. The differences can have implications for the molecular evolution of the Drosophila species, as is suggested by our results showing that the levels of codon bias and the proportion of adaptive substitutions are both higher in species with higher levels of synonymous polymorphism. Moreover, species with lower synonymous polymorphism have higher levels of nonsynonymous polymorphism and larger content of repetitive sequences in their genomes, suggesting a diminished efficiency of selection in species with smaller effective population size.
Collapse
Affiliation(s)
- N Petit
- Group of Genomics, Bioinformatics and Evolution, Departament de Genètica i Microbiologia, Facultat de Biociències, Universitat Autònoma de Barcelona, Bellaterra, Spain.
| | | |
Collapse
|
123
|
Bachtrog D. Similar rates of protein adaptation in Drosophila miranda and D. melanogaster, two species with different current effective population sizes. BMC Evol Biol 2008; 8:334. [PMID: 19091130 PMCID: PMC2633301 DOI: 10.1186/1471-2148-8-334] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2008] [Accepted: 12/18/2008] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Adaptive protein evolution is common in several Drosophila species investigated. Some studies point to very weak selection operating on amino-acid mutations, with average selection intensities on the order of Nes approximately in D. melanogaster and D. simulans. Species with lower effective population sizes should undergo less adaptation since they generate fewer mutations and selection is ineffective on a greater proportion of beneficial mutations. RESULTS Here I study patterns of polymorphism and divergence at 91 X-linked loci in D. miranda, a species with a roughly 5-fold smaller effective population size than D. melanogaster. Surprisingly, I find a similar fraction of amino-acid mutations being driven to fixation by positive selection in D. miranda and D. melanogaster. Genes with higher rates of amino-acid evolution show lower levels of neutral diversity, a pattern predicted by recurrent adaptive protein evolution. I fit a hitchhiking model to patterns of polymorphism in D. miranda and D. melanogaster and estimate an order of magnitude higher selection coefficients for beneficial mutations in D. miranda. CONCLUSION This analysis suggests that effective population size may not be a major determinant in rates of protein adaptation. Instead, adaptation may not be mutation-limited, or the distribution of fitness effects for beneficial mutations might differ vastly between different species or populations. Alternative explanation such as biases in estimating the fraction of beneficial mutations or slightly deleterious mutation models are also discussed.
Collapse
Affiliation(s)
- Doris Bachtrog
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
124
|
Abstract
Neutralism and selectionism are extremes of an explanatory spectrum for understanding patterns of molecular evolution and the emergence of evolutionary innovation. Although recent genome-scale data from protein-coding genes argue against neutralism, molecular engineering and protein evolution data argue that neutral mutations and mutational robustness are important for evolutionary innovation. Here I propose a reconciliation in which neutral mutations prepare the ground for later evolutionary adaptation. Key to this perspective is an explicit understanding of molecular phenotypes that has only become accessible in recent years.
Collapse
|
125
|
Wright SI, Andolfatto P. The Impact of Natural Selection on the Genome: Emerging Patterns inDrosophilaandArabidopsis. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2008. [DOI: 10.1146/annurev.ecolsys.39.110707.173342] [Citation(s) in RCA: 87] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Stephen I. Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, 25 Willcocks St., Toronto, Ontario, M5S 3B2 Canada,
| | - Peter Andolfatto
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544,
| |
Collapse
|
126
|
|
127
|
An exact steady state solution of Fisher's geometric model and other models. Theor Popul Biol 2008; 75:30-4. [PMID: 18996138 DOI: 10.1016/j.tpb.2008.10.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Revised: 09/23/2008] [Accepted: 10/09/2008] [Indexed: 11/23/2022]
Abstract
Because nearly neutral substitutions are thought to contribute substantially to molecular evolution, and much of our insight about the workings of nearly neutral evolution relies on theory, solvable models of this process are of particular interest. Here, I present an analytical method for solving models of nearly neutral evolution at steady state. The steady state solution applies to any constant fitness landscape under a dynamic of successive fixations, each of which occurs on the background of the population's most recent common ancestor. Because this dynamic neglects the effects of polymorphism in the population beyond the mutant allele under consideration, the steady state solution provides a decent approximation of evolutionary dynamics when the population mutation rate is low (Nu<<1). To demonstrate the method, I apply it to two examples: Fisher's geometric model (FGM), and a simple model of molecular evolution. Since recent papers have studied the steady state behavior of FGM under this dynamic, I analyze its behavior in detail and compare the results with previous work.
Collapse
|
128
|
Takahashi KH, Tanaka K, Itoh M, Takano-Shimizu T. Reduced X-linked rare polymorphism in males in comparison to females of Drosophila melanogaster. J Hered 2008; 100:97-105. [PMID: 18836147 DOI: 10.1093/jhered/esn078] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Natural selection is assumed to act more strongly on X-linked loci than on autosomal loci because the fitness effect of a recessive mutation on the X chromosome is fully expressed in hemizygous males. Therefore, selection is expected to fix or remove recessive mutations on the X chromosome more efficiently than those on autosomes. However, the assumption that hemizygosity of the X chromosome selectively accelerates changes in allele frequency has not been confirmed directly. To examine this assumption, we investigated current natural selection on X-linked chemoreceptor genes in a natural population of Drosophila melanogaster by comparing nucleotide diversity, linkage disequilibrium (LD), and departure from the neutrality in 4 chemoreceptor genes on 100 X chromosomes each from female and male flies. The general pattern of nucleotide diversity and LD for the genes investigated was similar in females and males. In contrast, males harbored significantly fewer rare polymorphisms defined as singletons and doubletons. When all the gene sequences were concatenated, Tajima's D showed a significant departure from the neutrality in both females and males, whereas Fu and Li's F* value revealed departure only in males. These results suggest that some rare polymorphisms on the X chromosome from females are recessively deleterious and are removed by stronger purifying selection when transferred to hemizygous males.
Collapse
Affiliation(s)
- Kazuo H Takahashi
- Department of Population Genetics, National Institute of Genetics, Yata 1111, Mishima, Shizuoka 411-8540, Japan
| | | | | | | |
Collapse
|
129
|
Oliveira DCSG, Raychoudhury R, Lavrov DV, Werren JH. Rapidly evolving mitochondrial genome and directional selection in mitochondrial genes in the parasitic wasp nasonia (hymenoptera: pteromalidae). Mol Biol Evol 2008; 25:2167-80. [PMID: 18653734 PMCID: PMC2727384 DOI: 10.1093/molbev/msn159] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2008] [Indexed: 11/13/2022] Open
Abstract
We sequenced the nearly complete mtDNA of 3 species of parasitic wasps, Nasonia vitripennis (2 strains), Nasonia giraulti, and Nasonia longicornis, including all 13 protein-coding genes and the 2 rRNAs, and found unusual patterns of mitochondrial evolution. The Nasonia mtDNA has a unique gene order compared with other insect mtDNAs due to multiple rearrangements. The mtDNAs of these wasps also show nucleotide substitution rates over 30 times faster than nuclear protein-coding genes, indicating among the highest substitution rates found in animal mitochondria (normally <10 times faster). A McDonald and Kreitman test shows that the between-species frequency of fixed replacement sites relative to silent sites is significantly higher compared with within-species polymorphisms in 2 mitochondrial genes of Nasonia, atp6 and atp8, indicating directional selection. Consistent with this interpretation, the Ka/Ks (nonsynonymous/synonymous substitution rates) ratios are higher between species than within species. In contrast, cox1 shows a signature of purifying selection for amino acid sequence conservation, although rates of amino acid substitutions are still higher than for comparable insects. The mitochondrial-encoded polypeptides atp6 and atp8 both occur in F0F1ATP synthase of the electron transport chain. Because malfunction in this fundamental protein severely affects fitness, we suggest that the accelerated accumulation of replacements is due to beneficial mutations necessary to compensate mild-deleterious mutations fixed by random genetic drift or Wolbachia sweeps in the fast evolving mitochondria of Nasonia. We further propose that relatively high rates of amino acid substitution in some mitochondrial genes can be driven by a "Compensation-Draft Feedback"; increased fixation of mildly deleterious mutations results in selection for compensatory mutations, which lead to fixation of additional deleterious mutations in nonrecombining mitochondrial genomes, thus accelerating the process of amino acid substitutions.
Collapse
|
130
|
Positive selection at the binding sites of the male-specific lethal complex involved in dosage compensation in Drosophila. Genetics 2008; 180:1123-9. [PMID: 18780755 DOI: 10.1534/genetics.107.084244] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In many taxa, males and females differ with respect to their sex chromosomes, and dosage compensation mechanisms have evolved to equalize X-linked gene transcription. In Drosophila, the male-specific lethal (MSL) complex binds to hundreds of sites along the male X chromosome and mediates twofold hypertranscription of the single male X. Two recent studies found evidence for lineage-specific adaptive evolution in all five core protein-coding genes of the MSL complex in Drosophila melanogaster. In particular, dramatic positive selection was detected in domains shown to be responsible for their specific targeting to the X chromosome. Here I use population genetics to show that three previously characterized MSL-binding DNA segments on the X themselves underwent adaptive evolution in D. melanogaster, but not in its close relatives D. simulans and D. yakuba. MSL components have been shown to not correctly target the D. melanogaster X chromosome in hybrids between D. melanogaster and D. simulans. My finding supports the idea of selection-driven coevolution among DNA-protein interactions of the dosage compensation machinery and suggests that misregulated dosage compensation could contribute to male hybrid inviability in Drosophila.
Collapse
|
131
|
A catalog of neutral and deleterious polymorphism in yeast. PLoS Genet 2008; 4:e1000183. [PMID: 18769710 PMCID: PMC2515631 DOI: 10.1371/journal.pgen.1000183] [Citation(s) in RCA: 173] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2008] [Accepted: 07/30/2008] [Indexed: 11/30/2022] Open
Abstract
The abundance and identity of functional variation segregating in natural populations is paramount to dissecting the molecular basis of quantitative traits as well as human genetic diseases. Genome sequencing of multiple organisms of the same species provides an efficient means of cataloging rearrangements, insertion, or deletion polymorphisms (InDels) and single-nucleotide polymorphisms (SNPs). While inbreeding depression and heterosis imply that a substantial amount of polymorphism is deleterious, distinguishing deleterious from neutral polymorphism remains a significant challenge. To identify deleterious and neutral DNA sequence variation within Saccharomyces cerevisiae, we sequenced the genome of a vineyard and oak tree strain and compared them to a reference genome. Among these three strains, 6% of the genome is variable, mostly attributable to variation in genome content that results from large InDels. Out of the 88,000 polymorphisms identified, 93% are SNPs and a small but significant fraction can be attributed to recent interspecific introgression and ectopic gene conversion. In comparison to the reference genome, there is substantial evidence for functional variation in gene content and structure that results from large InDels, frame-shifts, and polymorphic start and stop codons. Comparison of polymorphism to divergence reveals scant evidence for positive selection but an abundance of evidence for deleterious SNPs. We estimate that 12% of coding and 7% of noncoding SNPs are deleterious. Based on divergence among 11 yeast species, we identified 1,666 nonsynonymous SNPs that disrupt conserved amino acids and 1,863 noncoding SNPs that disrupt conserved noncoding motifs. The deleterious coding SNPs include those known to affect quantitative traits, and a subset of the deleterious noncoding SNPs occurs in the promoters of genes that show allele-specific expression, implying that some cis-regulatory SNPs are deleterious. Our results show that the genome sequences of both closely and distantly related species provide a means of identifying deleterious polymorphisms that disrupt functionally conserved coding and noncoding sequences. DNA sequence variation makes an important contribution to most traits that vary in natural populations. However, mapping mutations that underlie a trait of interest is a significant challenge. Genome sequencing of multiple organisms provides a complete list of DNA sequence differences responsible for any trait that differs among the organisms. Yet, distinguishing those DNA sequence variants that contribute to a trait from all other variants is not easy. Here, we sequence the genomes of two strains of yeast and, through comparisons with a reference genome, we catalog multiple types of DNA sequence variation among the three strains. Using a variety of comparative genomics methods, we show that a substantial fraction of DNA sequence variations has deleterious effects on fitness. Finally, we show that a subset of deleterious mutations is associated with changes in gene expression levels. Our results imply that comparative genomics methods will be a valuable approach to identifying DNA sequence changes underlying numerous traits of interest.
Collapse
|
132
|
Abstract
Rapid and inexpensive sequencing technologies are making it possible to collect whole genome sequence data on multiple individuals from a population. This type of data can be used to quickly identify genes that control important ecological and evolutionary phenotypes by finding the targets of adaptive natural selection, and we therefore refer to such approaches as "reverse ecology." To quantify the power gained in detecting positive selection using population genomic data, we compare three statistical methods for identifying targets of selection: the McDonald-Kreitman test, the mkprf method, and a likelihood implementation for detecting d(N)/d(S) > 1. Because the first two methods use polymorphism data we expect them to have more power to detect selection. However, when applied to population genomic datasets from human, fly, and yeast, the tests using polymorphism data were actually weaker in two of the three datasets. We explore reasons why the simpler comparative method has identified more genes under selection, and suggest that the different methods may really be detecting different signals from the same sequence data. Finally, we find several statistical anomalies associated with the mkprf method, including an almost linear dependence between the number of positively selected genes identified and the prior distributions used. We conclude that interpreting the results produced by this method should be done with some caution.
Collapse
Affiliation(s)
- Yong Fuga Li
- School of Informatics, Indiana University, Bloomington, IN, USA
| | | | | | | |
Collapse
|
133
|
Rokas A, Chatzimanolis S. From gene-scale to genome-scale phylogenetics: the data flood in, but the challenges remain. Methods Mol Biol 2008; 422:1-12. [PMID: 18629657 DOI: 10.1007/978-1-59745-581-7_1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
An important goal of phylogenetics is to be able to consistently and accurately reconstruct the historical patterns of cladogenesis among major organismic groups. Gene-scale phylogenetics is insufficient to attain this goal owing to the presence of poor resolution and incongruence in single--and few--gene phylogenies. The increasing availability of genome-scale amounts of data promises to overcome the insufficiency of gene-scale phylogenetics and uncover the genealogical tapestry uniting all living organisms with unprecedented accuracy. Here, we argue that a vast increase in data size alone--although necessary--may not be sufficient to achieve the desired accuracy for three reasons: (i) the existence of short stems in the tree of life, (ii) the saturation of phylogenetic signal in molecular sequences, and (iii) the effect of systematic error on phylogenetic inference. Devising strategies to ameliorate the effect of such challenges on sequence evolution will be critical to the success of current efforts to reconstruct the tree of life.
Collapse
Affiliation(s)
- Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, TN, USA
| | | |
Collapse
|
134
|
Adaptive and slightly deleterious evolution in a conifer, Cryptomeria japonica. J Mol Evol 2008; 67:201-10. [PMID: 18629428 DOI: 10.1007/s00239-008-9140-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2007] [Revised: 06/18/2008] [Accepted: 06/23/2008] [Indexed: 10/21/2022]
Abstract
In order to evaluate effects of the population structure and natural selection on organisms having long generation times, we surveyed DNA polymorphisms at five loci encoding 9-cis-epoxycarotenoid dioxygenase (NCED), ammonium transporter, calmodulin, aquaporin, and the second major allergen with polymethylgalacturonase enzyme activity in the pollen (Cryj2) in a conifer, Cryptomeria japonica. The average nucleotide diversity at silent sites across 12 loci including the previously analyzed seven loci was 0.0044. The population recombination rate (4Nr, where N and r are the effective population size and recombination rate per base per generation, respectively) was estimated as 0.00046 and a slow reduction in the population size was indicated, according to the maximum likelihood method implemented in LAMARC. At NCED, the McDonald-Kreitman (MK) test revealed an excess of replacement polymorphisms, suggesting contributions of slightly deleterious mutations. In contrast, the MK test revealed an excess of replacement divergence at Cryj2 and a maximum likelihood approach using the PAML package revealed that certain amino acid sites had a nonsynonymous/synonymous substitution rate ratio (omega) > 4.0, indicating adaptive evolution at this locus. The overall analysis of the 12 loci suggested that adaptive, neutral, and slightly deleterious evolution played important roles in the evolution of C. japonica.
Collapse
|
135
|
Ramensky VE, Nurtdinov RN, Neverov AD, Mironov AA, Gelfand MS. Positive selection in alternatively spliced exons of human genes. Am J Hum Genet 2008; 83:94-8. [PMID: 18571144 DOI: 10.1016/j.ajhg.2008.05.017] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2008] [Revised: 04/08/2008] [Accepted: 05/30/2008] [Indexed: 10/21/2022] Open
Abstract
Alternative splicing is a well-recognized mechanism of accelerated genome evolution. We have studied single-nucleotide polymorphisms and human-chimpanzee divergence in the exons of 6672 alternatively spliced human genes, with the aim of understanding the forces driving the evolution of alternatively spliced sequences. Here, we show that alternatively spliced exons and exon fragments (alternative exons) from minor isoforms experience lower selective pressure at the amino acid level, accompanied by selection against synonymous sequence variation. The results of the McDonald-Kreitman test suggest that alternatively spliced exons, unlike exons constitutively included in the mRNA, are also subject to positive selection, with up to 27% of amino acids fixed by positive selection.
Collapse
|
136
|
Hill MM, Broman KW, Stupka E, Smith WC, Jiang D, Sidow A. The C. savignyi genetic map and its integration with the reference sequence facilitates insights into chordate genome evolution. Genome Res 2008; 18:1369-79. [PMID: 18519652 DOI: 10.1101/gr.078576.108] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The urochordate Ciona savignyi is an emerging model organism for the study of chordate evolution, development, and gene regulation. The extreme level of polymorphism in its population has inspired novel approaches in genome assembly, which we here continue to develop. Specifically, we present the reconstruction of all of C. savignyi's chromosomes via the development of a comprehensive genetic map, without a physical map intermediate. The resulting genetic map is complete, having one linkage group for each one of the 14 chromosomes. Eighty-three percent of the reference genome sequence is covered. The chromosomal reconstruction allowed us to investigate the evolution of genome structure in highly polymorphic species, by comparing the genome of C. savignyi to its divergent sister species, Ciona intestinalis. Both genomes have been extensively reshaped by intrachromosomal rearrangements. Interchromosomal changes have been extremely rare. This is in striking contrast to what has been observed in vertebrates, where interchromosomal events are commonplace. These results, when considered in light of the neutral theory, suggest fundamentally different modes of evolution of animal species with large versus small population sizes.
Collapse
Affiliation(s)
- Matthew M Hill
- Department of Pathology, SUMC, Stanford, CA 94305-5324, USA
| | | | | | | | | | | |
Collapse
|
137
|
Egea R, Casillas S, Barbadilla A. Standard and generalized McDonald-Kreitman test: a website to detect selection by comparing different classes of DNA sites. Nucleic Acids Res 2008; 36:W157-62. [PMID: 18515345 PMCID: PMC2447769 DOI: 10.1093/nar/gkn337] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The McDonald and Kreitman test (MKT) is one of the most powerful and extensively used tests to detect the signature of natural selection at the molecular level. Here, we present the standard and generalized MKT website, a novel website that allows performing MKTs not only for synonymous and nonsynonymous changes, as the test was initially described, but also for other classes of regions and/or several loci. The website has three different interfaces: (i) the standard MKT, where users can analyze several types of sites in a coding region, (ii) the advanced MKT, where users can compare two closely linked regions in the genome that can be either coding or noncoding, and (iii) the multi-locus MKT, where users can analyze many separate loci in a single multi-locus test. The website has already been used to show that selection efficiency is positively correlated with effective population size in the Drosophila genus and it has been applied to include estimates of selection in DPDB. This website is a timely resource, which will presumably be widely used by researchers in the field and will contribute to enlarge the catalogue of cases of adaptive evolution. It is available at http://mkt.uab.es.
Collapse
Affiliation(s)
- Raquel Egea
- Genomics, Bioinformatics and Evolution Group, Departament de Genètica i Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain.
| | | | | |
Collapse
|
138
|
Babbitt GA, Kim Y. Inferring natural selection on fine-scale chromatin organization in yeast. Mol Biol Evol 2008; 25:1714-27. [PMID: 18515262 DOI: 10.1093/molbev/msn127] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Despite its potential role in the evolution of complex phenotypes, the detection of negative (purifying) and positive selection on noncoding regulatory sequence has been elusive because of the inherent difficulty in predicting the functional consequences of mutations on noncoding sequence. Because the functioning of regulatory sequence depends upon both chromatin configuration and cis-regulatory factor binding, we investigate the idea that the functional conservation of regulatory regions should be associated with the conservation of sequence-dependent bending properties of DNA that determine its affinity for the nucleosome. Recent advances in the computational prediction of sequence-dependent affinity to nucleosomes provide an opportunity to distinguish between neutral and nonneutral evolution of fine-scale chromatin organization. Here, a statistical test is presented for detecting evolutionary conservation and/or adaptive evolution of nucleosome affinity from interspecies comparisons of DNA sequences. Local nucleosome affinities of homologous sequences were calculated using 2 recently published methods. A randomization test was applied to sites of mutation to evaluate the similarity of DNA-nucleosome affinity between several closely related species of Saccharomyces yeast. For most of the genes we analyzed, the conservation of local nucleosome affinity was detected at a few distinct locations in the upstream noncoding region. Our results also demonstrate that different patterns of chromatin evolution have shaped DNA-nucleosome interaction at the core promoters of TATA-containing and TATA-less genes and that elevated purifying selection has maintained low affinity for nucleosome in the core promoters of the latter group. Across the entire yeast genome, DNA-nucleosome interaction was also discovered to be significantly more conserved in TATA-less genes compared with TATA-containing genes.
Collapse
Affiliation(s)
- G A Babbitt
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University, USA.
| | | |
Collapse
|
139
|
Haddrill PR, Bachtrog D, Andolfatto P. Positive and negative selection on noncoding DNA in Drosophila simulans. Mol Biol Evol 2008; 25:1825-34. [PMID: 18515263 DOI: 10.1093/molbev/msn125] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
There is now a wealth of evidence that some of the most important regions of the genome are found outside those that encode proteins, and noncoding regions of the genome have been shown to be subject to substantial levels of selective constraint, particularly in Drosophila. Recent work has suggested that these regions may also have been subject to the action of positive selection, with large fractions of noncoding divergence having been driven to fixation by adaptive evolution. However, this work has focused on Drosophila melanogaster, which is thought to have experienced a reduction in effective population size (N(e)), and thus a reduction in the efficacy of selection, compared with its closest relative Drosophila simulans. Here, we examine patterns of evolution at several classes of noncoding DNA in D. simulans and find that all noncoding DNA is subject to the action of negative selection, indicated by reduced levels of polymorphism and divergence and a skew in the frequency spectrum toward rare variants. We find that the signature of negative selection on noncoding DNA and nonsynonymous sites is obscured to some extent by purifying selection acting on preferred to unpreferred synonymous codon mutations. We investigate the extent to which divergence in noncoding DNA is inferred to be the product of positive selection and to what extent these inferences depend on selection on synonymous sites and demography. Based on patterns of polymorphism and divergence for different classes of synonymous substitution, we find the divergence excess inferred in noncoding DNA and nonsynonymous sites in the D. simulans lineage difficult to reconcile with demographic explanations.
Collapse
Affiliation(s)
- Penelope R Haddrill
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | |
Collapse
|
140
|
|
141
|
Foxe JP, Dar VUN, Zheng H, Nordborg M, Gaut BS, Wright SI. Selection on amino acid substitutions in Arabidopsis. Mol Biol Evol 2008; 25:1375-83. [PMID: 18390851 DOI: 10.1093/molbev/msn079] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Studies of nucleotide diversity have found an excess of low-frequency amino acid polymorphisms segregating in Arabidopsis thaliana, suggesting a predominance of weak purifying selection acting on amino acid polymorphism in this inbreeding species. Here, we investigate levels of diversity and divergence at synonymous and nonsynonymous sites in 6 circumpolar populations of the outbreeding Arabidopsis lyrata and compare these results with A. thaliana, to test for differences in mutation and selection parameters across genes, populations, and species. We find that A. lyrata shows an excess of low-frequency nonsynonymous polymorphisms both within populations and species wide, consistent with weak purifying selection similar to the patterns observed in A. thaliana. Furthermore, nonsynonymous polymorphisms tend to be more restricted in their population distribution in A. lyrata, consistent with purifying selection preventing their geographic spread. Highly expressed genes show a reduced ratio of amino acid to synonymous change for both polymorphism and fixed differences, suggesting a general pattern of stronger purifying selection on high-expression proteins.
Collapse
Affiliation(s)
- John Paul Foxe
- Department of Biology, York University, Toronto, Ontario, Canada
| | | | | | | | | | | |
Collapse
|
142
|
Lu J, Fu Y, Kumar S, Shen Y, Zeng K, Xu A, Carthew R, Wu CI. Adaptive evolution of newly emerged micro-RNA genes in Drosophila. Mol Biol Evol 2008; 25:929-38. [PMID: 18296702 DOI: 10.1093/molbev/msn040] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
How often micro-RNA (miRNA) genes emerged and how fast they evolved soon after their emergence are some of the central questions in the evolution of miRNAs. Because most known miRNA genes are ancient and highly conserved, these questions can be best answered by identifying newly emerged miRNA genes. Among the 78 miRNA genes in Drosophila reported before 2007, only 5 are confirmed to be newly emerged in the genus (although many more can be found in the newly reported data set; e.g., Ruby et al. 2007; Stark et al. 2007; Lu et al. 2008). These new miRNA genes have undergone numerous changes, even in the normally invariant mature sequences. Four of them (the miR-310/311/312/313 cluster, denoted miR-310s) were duplicated from other conserved miRNA genes. The fifth one (miR-303) appears to be a very young gene, originating de novo from a non-miRNA sequence recently. We sequenced these 5 miRNA genes and their neighboring regions from a worldwide collection of Drosophila melanogaster lines. The levels of divergence and polymorphism in these miRNA genes, vis-à-vis those of the neighboring DNA sequences, suggest that these 5 genes are evolving adaptively. Furthermore, the polymorphism pattern of miR-310s in D. melanogaster is indicative of hitchhiking under positive selection. Thus, a large number of adaptive changes over a long period of time may be essential for the evolution of newly emerged miRNA genes.
Collapse
Affiliation(s)
- Jian Lu
- Department of Ecology and Evolution, University of Chicago, USA
| | | | | | | | | | | | | | | |
Collapse
|
143
|
The molecular basis of host adaptation in cactophilic Drosophila: molecular evolution of a glutathione S-transferase gene (GstD1) in Drosophila mojavensis. Genetics 2008; 178:1073-83. [PMID: 18245335 DOI: 10.1534/genetics.107.083287] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Drosophila mojavensis is a cactophilic fly endemic to the northwestern deserts of North America. This species includes four genetically isolated cactus host races each individually specializing on the necrotic tissues of a different cactus species. The necrosis of each cactus species provides the resident D. mojavensis populations with a distinct chemical environment. A previous investigation of the role of transcriptional variation in the adaptation of D. mojavensis to its hosts produced a set of candidate loci that are differentially expressed in response to host shifts, and among them was glutathione S-transferase D1 (GstD1). In both D. melanogaster and Anopheles gambiae, GstD1 has been implicated in the resistance of these species to the insecticide dichloro-diphenyl-trichloroethane (DDT). The pattern of sequence variation of the GstD1 locus from all four D. mojavensis populations, D. arizonae (sister species), and D. navojoa (outgroup) has been examined. The data suggest that in two populations of D. mojavensis GstD1 has gone through a period of adaptive amino acid evolution. Further analyses indicate that of the seven amino acid fixations that occurred in the D. mojavensis lineage, two of them occur in the active site pocket, potentially having a significant effect on substrate specificity and in the adaptation to alternative cactus hosts.
Collapse
|
144
|
|
145
|
Camps M, Herman A, Loh E, Loeb LA. Genetic constraints on protein evolution. Crit Rev Biochem Mol Biol 2008; 42:313-26. [PMID: 17917869 DOI: 10.1080/10409230701597642] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Evolution requires the generation and optimization of new traits ("adaptation") and involves the selection of mutations that improve cellular function. These mutations were assumed to arise by selection of neutral mutations present at all times in the population. Here we review recent evidence that indicates that deleterious mutations are more frequent in the population than previously recognized and that these mutations play a significant role in protein evolution through continuous positive selection. Positively selected mutations include adaptive mutations, i.e. mutations that directly affect enzymatic function, and compensatory mutations, which suppress the pleiotropic effects of adaptive mutations. Compensatory mutations are by far the most frequent of the two and would allow potentially adaptive but deleterious mutations to persist long enough in the population to be positively selected during episodes of adaptation. Compensatory mutations are, by definition, context-dependent and thus constrain the paths available for evolution. This provides a mechanistic basis for the examples of highly constrained evolutionary landscapes and parallel evolution reported in natural and experimental populations. The present review article describes these recent advances in the field of protein evolution and discusses their implications for understanding the genetic basis of disease and for protein engineering in vitro.
Collapse
Affiliation(s)
- Manel Camps
- Department of Environmental Toxicology, University of California at Santa Cruz, Santa Cruz, California 95064, USA.
| | | | | | | |
Collapse
|
146
|
Charlesworth J, Eyre-Walker A. The McDonald-Kreitman test and slightly deleterious mutations. Mol Biol Evol 2008; 25:1007-15. [PMID: 18195052 DOI: 10.1093/molbev/msn005] [Citation(s) in RCA: 138] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It is possible to estimate the proportion of substitutions that are due to adaptive evolution using the numbers of silent and nonsilent polymorphisms and substitutions in a McDonald and Kreitman-type analysis. Unfortunately, this estimate of adaptive evolution is biased downward by the segregation of slightly deleterious mutations. It has been suggested that 1 way to cope with the effects of these slightly deleterious mutations is to remove low-frequency polymorphisms from the analysis. We investigate the performance of this method theoretically. We show that although removing low-frequency polymorphisms does indeed reduce the bias in the estimate of adaptive evolution, the estimate is always downwardly biased, often to the extent that one would not be able to detect adaptive evolution, even if it existed. The method is reasonably satisfactory, only if the rate of adaptive evolution is high and the distribution of fitness effects for slightly deleterious mutations is very leptokurtic. Our analysis suggests that adaptive evolution could be quite prevalent in humans (>8%) and still not be detectable using current methodologies. Our analysis also suggests that the level of adaptive evolution has probably been underestimated, possibly substantially, in both bacteria and Drosophila.
Collapse
Affiliation(s)
- Jane Charlesworth
- Centre for the Study of Evolution, University of Sussex, Brighton, United Kingdom
| | | |
Collapse
|
147
|
Macpherson JM, Sella G, Davis JC, Petrov DA. Genomewide spatial correspondence between nonsynonymous divergence and neutral polymorphism reveals extensive adaptation in Drosophila. Genetics 2007; 177:2083-99. [PMID: 18073425 PMCID: PMC2219485 DOI: 10.1534/genetics.107.080226] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2007] [Accepted: 09/18/2007] [Indexed: 11/18/2022] Open
Abstract
The effect of recurrent selective sweeps is a spatially heterogeneous reduction in neutral polymorphism throughout the genome. The pattern of reduction depends on the selective advantage and recurrence rate of the sweeps. Because many adaptive substitutions responsible for these sweeps also contribute to nonsynonymous divergence, the spatial distribution of nonsynonymous divergence also reflects the distribution of adaptive substitutions. Thus, the spatial correspondence between neutral polymorphism and nonsynonymous divergence may be especially informative about the process of adaptation. Here we study this correspondence using genomewide polymorphism data from Drosophila simulans and the divergence between D. simulans and D. melanogaster. Focusing on highly recombining portions of the autosomes, at a spatial scale appropriate to the study of selective sweeps, we find that neutral polymorphism is both lower and, as measured by a new statistic Q(S), less homogeneous where nonsynonymous divergence is higher and that the spatial structure of this correlation is best explained by the action of strong recurrent selective sweeps. We introduce a method to infer, from the spatial correspondence between polymorphism and divergence, the rate and selective strength of adaptation. Our results independently confirm a high rate of adaptive substitution (approximately 1/3000 generations) and newly suggest that many adaptations are of surprisingly great selective effect (approximately 1%), reducing the effective population size by approximately 15% even in highly recombining regions of the genome.
Collapse
|
148
|
Andolfatto P. Hitchhiking effects of recurrent beneficial amino acid substitutions in the Drosophila melanogaster genome. Genome Res 2007; 17:1755-62. [PMID: 17989248 DOI: 10.1101/gr.6691007] [Citation(s) in RCA: 150] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Several recent studies have estimated that a large fraction of amino acid divergence between species of Drosophila was fixed by positive selection, using statistical approaches based on the McDonald-Kreitman test. However, little is known about associated selection coefficients of beneficial amino acid mutations. Recurrent selective sweeps associated with adaptive substitutions should leave a characteristic signature in genome variability data that contains information about the frequency and strength of selection. Here, I document a significant negative correlation between the level and the frequency of synonymous site polymorphism and the rate of protein evolution in highly recombining regions of the X chromosome of D. melanogaster. This pattern is predicted by recurrent adaptive protein evolution and suggests that adaptation is an important determinant of patterns of neutral variation genome-wide. Using a maximum likelihood approach, I estimate the product of the rate and strength of selection under a recurrent genetic hitchhiking model, lambda2N(e)s approximately 3 x 10(-8). Using an approach based on the McDonald-Kreitman test, I estimate that approximately 50% of divergent amino acids were driven to fixation by positive selection, implying that beneficial amino acid substitutions are of weak effect on average, on the order of 10(-5) (i.e., 2N(e)s approximately 40). Two implications of these results are that most adaptive substitutions will be difficult to detect in genome scans of selection and that population size (and genetic drift) may be an important determinant of the evolutionary dynamics of protein adaptation.
Collapse
Affiliation(s)
- Peter Andolfatto
- Division of Biological Sciences, University of California San Diego, La Jolla, California 92093, USA.
| |
Collapse
|
149
|
Charlesworth J, Eyre-Walker A. The other side of the nearly neutral theory, evidence of slightly advantageous back-mutations. Proc Natl Acad Sci U S A 2007; 104:16992-7. [PMID: 17940029 PMCID: PMC2040392 DOI: 10.1073/pnas.0705456104] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2007] [Indexed: 11/18/2022] Open
Abstract
We argue that if there is a category of slightly deleterious mutations, then there should be a category of slightly advantageous back-mutations. We show that when there are both slightly deleterious and advantageous back-mutations, there is likely to be an increase in the rate of evolution after a population size expansion. This increase in the rate of evolution is short-lived. However, we show how its signature can be captured by comparing the rate of evolution in species that have undergone population size expansion versus contraction. We test our model by comparing the pattern of evolution in pairs of island and mainland species in which the colonization event was either island-to-mainland (population size expansion) or mainland-to-island (contraction). We show that the predicted pattern of evolution is observed.
Collapse
Affiliation(s)
- Jane Charlesworth
- Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
| | - Adam Eyre-Walker
- Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
| |
Collapse
|
150
|
Anisimova M, Liberles DA. The quest for natural selection in the age of comparative genomics. Heredity (Edinb) 2007; 99:567-79. [PMID: 17848974 DOI: 10.1038/sj.hdy.6801052] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Continued genome sequencing has fueled progress in statistical methods for understanding the action of natural selection at the molecular level. This article reviews various statistical techniques (and their applicability) for detecting adaptation events and the functional divergence of proteins. As large-scale automated studies become more frequent, they provide a useful resource for generating biological null hypotheses for further experimental and statistical testing. Furthermore, they shed light on typical patterns of lineage-specific evolution of organisms, on the functional and structural evolution of protein families and on the interplay between the two. More complex models are being developed to better reflect the underlying biological and chemical processes and to complement simpler statistical models. Linking molecular processes to their statistical signatures in genomes can be demanding, and the proper application of statistical models is discussed.
Collapse
Affiliation(s)
- M Anisimova
- Department of Biology, University College London, London, UK
| | | |
Collapse
|