251
|
Xu S, Omilian AR, Cristescu ME. High Rate of Large-Scale Hemizygous Deletions in Asexually Propagating Daphnia: Implications for the Evolution of Sex. Mol Biol Evol 2010; 28:335-42. [DOI: 10.1093/molbev/msq199] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
252
|
Detection of heterozygous mutations in the genome of mismatch repair defective diploid yeast using a Bayesian approach. Genetics 2010; 186:493-503. [PMID: 20660644 DOI: 10.1534/genetics.110.120105] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
DNA replication errors that escape polymerase proofreading and mismatch repair (MMR) can lead to base substitution and frameshift mutations. Such mutations can disrupt gene function, reduce fitness, and promote diseases such as cancer and are also the raw material of molecular evolution. To analyze with limited bias genomic features associated with DNA polymerase errors, we performed a genome-wide analysis of mutations that accumulate in MMR-deficient diploid lines of Saccharomyces cerevisiae. These lines were derived from a common ancestor and were grown for 160 generations, with bottlenecks reducing the population to one cell every 20 generations. We sequenced to between 8- and 20-fold coverage one wild-type and three mutator lines using Illumina Solexa 36-bp reads. Using an experimentally aware Bayesian genotype caller developed to pool experimental data across sequencing runs for all strains, we detected 28 heterozygous single-nucleotide polymorphisms (SNPs) and 48 single-nt insertion/deletions (indels) from the data set. This method was evaluated on simulated data sets and found to have a very low false-positive rate (∼6 × 10(-5)) and a false-negative rate of 0.08 within the unique mapping regions of the genome that contained at least sevenfold coverage. The heterozygous mutations identified by the Bayesian genotype caller were confirmed by Sanger sequencing. All of the mutations were unique to a given line, except for a single-nt deletion mutation which occurred independently in two lines. All 48 indels, composed of 46 deletions and two insertions, occurred in homopolymer (HP) tracts [i.e., 47 poly(A) or (T) tracts, 1 poly(G) or (C) tract] between 5 and 13 bp long. Our findings are of interest because HP tracts are present at high levels in the yeast genome (>77,400 for 5- to 20-nt HP tracts), and frameshift mutations in these regions are likely to disrupt gene function. In addition, they demonstrate that the mutation pattern seen previously in mismatch repair defective strains using a limited number of reporters holds true for the entire genome.
Collapse
|
253
|
Mackay TFC. Mutations and quantitative genetic variation: lessons from Drosophila. Philos Trans R Soc Lond B Biol Sci 2010; 365:1229-39. [PMID: 20308098 DOI: 10.1098/rstb.2009.0315] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
A central issue in evolutionary quantitative genetics is to understand how genetic variation for quantitative traits is maintained in natural populations. Estimates of genetic variation and of genetic correlations and pleiotropy among multiple traits, inbreeding depression, mutation rates for fitness and quantitative traits and of the strength and nature of selection are all required to evaluate theoretical models of the maintenance of genetic variation. Studies in Drosophila melanogaster have shown that a substantial fraction of segregating variation for fitness-related traits in Drosophila is due to rare deleterious alleles maintained by mutation-selection balance, with a smaller but significant fraction attributable to intermediate frequency alleles maintained by alleles with antagonistic pleiotropic effects, and late-age-specific effects. However, the nature of segregating variation for traits under stabilizing selection is less clear and requires more detailed knowledge of the loci, mutation rates, allelic effects and frequencies of molecular polymorphisms affecting variation in suites of pleiotropically connected traits. Recent studies in D. melanogaster have revealed unexpectedly complex genetic architectures of many quantitative traits, with large numbers of pleiotropic genes and alleles with sex-, environment- and genetic background-specific effects. Future genome wide association analyses of many quantitative traits on a common panel of fully sequenced Drosophila strains will provide much needed empirical data on the molecular genetic basis of quantitative traits.
Collapse
Affiliation(s)
- Trudy F C Mackay
- Department of Genetics, W. M. Keck Center for Behavioral Biology, North Carolina State University, , Campus Box 7614, Raleigh, NC 27697, USA.
| |
Collapse
|
254
|
Evolution of the mutation rate. Trends Genet 2010; 26:345-52. [PMID: 20594608 DOI: 10.1016/j.tig.2010.05.003] [Citation(s) in RCA: 657] [Impact Index Per Article: 46.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Revised: 05/16/2010] [Accepted: 05/21/2010] [Indexed: 11/20/2022]
Abstract
Understanding the mechanisms of evolution requires information on the rate of appearance of new mutations and their effects at the molecular and phenotypic levels. Although procuring such data has been technically challenging, high-throughput genome sequencing is rapidly expanding knowledge in this area. With information on spontaneous mutations now available in a variety of organisms, general patterns have emerged for the scaling of mutation rate with genome size and for the likely mechanisms that drive this pattern. Support is presented for the hypothesis that natural selection pushes mutation rates down to a lower limit set by the power of random genetic drift rather than by intrinsic physiological limitations, and that this has resulted in reduced levels of replication, transcription, and translation fidelity in eukaryotes relative to prokaryotes.
Collapse
|
255
|
Karasov T, Messer PW, Petrov DA. Evidence that adaptation in Drosophila is not limited by mutation at single sites. PLoS Genet 2010; 6:e1000924. [PMID: 20585551 PMCID: PMC2887467 DOI: 10.1371/journal.pgen.1000924] [Citation(s) in RCA: 189] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2009] [Accepted: 03/24/2010] [Indexed: 11/18/2022] Open
Abstract
Adaptation in eukaryotes is generally assumed to be mutation-limited because of small effective population sizes. This view is difficult to reconcile, however, with the observation that adaptation to anthropogenic changes, such as the introduction of pesticides, can occur very rapidly. Here we investigate adaptation at a key insecticide resistance locus (Ace) in Drosophila melanogaster and show that multiple simple and complex resistance alleles evolved quickly and repeatedly within individual populations. Our results imply that the current effective population size of modern D. melanogaster populations is likely to be substantially larger (> or = 100-fold) than commonly believed. This discrepancy arises because estimates of the effective population size are generally derived from levels of standing variation and thus reveal long-term population dynamics dominated by sharp--even if infrequent--bottlenecks. The short-term effective population sizes relevant for strong adaptation, on the other hand, might be much closer to census population sizes. Adaptation in Drosophila may therefore not be limited by waiting for mutations at single sites, and complex adaptive alleles can be generated quickly without fixation of intermediate states. Adaptive events should also commonly involve the simultaneous rise in frequency of independently generated adaptive mutations. These so-called soft sweeps have very distinct effects on the linked neutral polymorphisms compared to the standard hard sweeps in mutation-limited scenarios. Methods for the mapping of adaptive mutations or association mapping of evolutionarily relevant mutations may thus need to be reconsidered.
Collapse
Affiliation(s)
| | | | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
256
|
Garcia AM, Calder RB, Dollé MET, Lundell M, Kapahi P, Vijg J. Age- and temperature-dependent somatic mutation accumulation in Drosophila melanogaster. PLoS Genet 2010; 6:e1000950. [PMID: 20485564 PMCID: PMC2869313 DOI: 10.1371/journal.pgen.1000950] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2009] [Accepted: 04/13/2010] [Indexed: 02/07/2023] Open
Abstract
Using a transgenic mouse model harboring a mutation reporter gene that can be efficiently recovered from genomic DNA, we previously demonstrated that mutations accumulate in aging mice in a tissue-specific manner. Applying a recently developed, similar reporter-based assay in Drosophila melanogaster, we now show that the mutation frequency at the lacZ locus in somatic tissue of flies is about three times as high as in mouse tissues, with a much higher fraction of large genome rearrangements. Similar to mice, somatic mutations in the fly also accumulate as a function of age, but they do so much more quickly at higher temperature, a condition which in invertebrates is associated with decreased life span. Most mutations were found to accumulate in the thorax and less in abdomen, suggesting the highly oxidative flight muscles as a possible source of genotoxic stress. These results show that somatic mutation loads in short-lived flies are much more severe than in the much longer-lived mice, with the mutation rate in flies proportional to biological rather than chronological aging.
Collapse
Affiliation(s)
- Ana Maria Garcia
- Department of Biology, University of Texas at San Antonio, San Antonio, Texas, United States of America
| | - R. Brent Calder
- Albert Einstein College of Medicine, Department of Genetics, New York, New York, United States of America
| | - Martijn E. T. Dollé
- National Institute of Public Health and the Environment, Bilthoven, The Netherlands
| | - Martha Lundell
- Department of Biology, University of Texas at San Antonio, San Antonio, Texas, United States of America
| | - Pankaj Kapahi
- Buck Institute for Age Research, Novato, California, United States of America
| | - Jan Vijg
- Albert Einstein College of Medicine, Department of Genetics, New York, New York, United States of America
| |
Collapse
|
257
|
Abstract
Under the classical view, selection depends more or less directly on mutation: standing genetic variance is maintained by a balance between selection and mutation, and adaptation is fuelled by new favourable mutations. Recombination is favoured if it breaks negative associations among selected alleles, which interfere with adaptation. Such associations may be generated by negative epistasis, or by random drift (leading to the Hill-Robertson effect). Both deterministic and stochastic explanations depend primarily on the genomic mutation rate, U. This may be large enough to explain high recombination rates in some organisms, but seems unlikely to be so in general. Random drift is a more general source of negative linkage disequilibria, and can cause selection for recombination even in large populations, through the chance loss of new favourable mutations. The rate of species-wide substitutions is much too low to drive this mechanism, but local fluctuations in selection, combined with gene flow, may suffice. These arguments are illustrated by comparing the interaction between good and bad mutations at unlinked loci under the infinitesimal model.
Collapse
Affiliation(s)
- N H Barton
- Institute of Science and Technology, , Am Campus 1, A-3400 Klosterneuburg, Austria.
| |
Collapse
|
258
|
Kondrashov FA, Kondrashov AS. Measurements of spontaneous rates of mutations in the recent past and the near future. Philos Trans R Soc Lond B Biol Sci 2010; 365:1169-76. [PMID: 20308091 PMCID: PMC2871817 DOI: 10.1098/rstb.2009.0286] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The rate of spontaneous mutation in natural populations is a fundamental parameter for many evolutionary phenomena. Because the rate of mutation is generally low, most of what is currently known about mutation has been obtained through indirect, complex and imprecise methodological approaches. However, in the past few years genome-wide sequencing of closely related individuals has made it possible to estimate the rates of mutation directly at the level of the DNA, avoiding most of the problems associated with using indirect methods. Here, we review the methods used in the past with an emphasis on next generation sequencing, which may soon make the accurate measurement of spontaneous mutation rates a matter of routine.
Collapse
Affiliation(s)
- Fyodor A Kondrashov
- Bioinformatics and Genomics Programme, Centre for Genomic Regulation, , C/Dr. Aiguader 88, Barcelona Biomedical Research Park Building 08003, Barcelona, Spain.
| | | |
Collapse
|
259
|
Hartfield M, Otto SP, Keightley PD. The role of advantageous mutations in enhancing the evolution of a recombination modifier. Genetics 2010; 184:1153-64. [PMID: 20139345 PMCID: PMC2865915 DOI: 10.1534/genetics.109.112920] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2009] [Accepted: 02/03/2010] [Indexed: 11/18/2022] Open
Abstract
Although the evolution of recombination is still a major problem in evolutionary genetics, recent theoretical studies have shown that recombination can evolve by breaking down interference ("Hill-Robertson effects") among multiple loci. This leads to selection on a recombination modifier in a population subject to recurrent deleterious mutation. Here, we use computer simulations to investigate the evolution of a recombination modifier under three different scenarios of recurrent mutation in a finite population: (1) mutations are deleterious only, (2) mutations are advantageous only, and (3) there is a mixture of deleterious and advantageous mutations. We also investigate how linkage disequilibrium, the strength of selection acting on a modifier, and effective population size change under the different scenarios. We observe that adding even a small number of advantageous mutations increases the fixation rate of modifiers that increase recombination, especially if the effects of deleterious mutations are weak. However, the strength of selection on a modifier is less than the summed strengths had there been deleterious mutations only and advantageous mutations only.
Collapse
Affiliation(s)
- Matthew Hartfield
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | |
Collapse
|
260
|
Crawford JE, Lazzaro BP. The demographic histories of the M and S molecular forms of Anopheles gambiae s.s. Mol Biol Evol 2010; 27:1739-44. [PMID: 20223855 DOI: 10.1093/molbev/msq070] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Anopheles gambiae is a primary vector of Plasmodium falciparum, a human malaria parasite that causes over a million deaths each year in sub-Saharan Africa. Population genetic tests have been employed to detect natural selection at suspected A. gambiae antimalaria genes, but these tests have generally been compromised by the lack of demographically correct null models. Here, we used a coalescent simulation approach within a maximum likelihood framework to fit population growth, bottleneck, and migration models to polymorphism data from Cameroonian A. gambiae. The best-fit models for both the "M" and the "S" molecular forms of A. gambiae included ancient population growth and a high rate of migration from an unsampled subpopulation. After correcting for differences in effective population size, our models suggest that the molecular forms expanded at different times and both expansions significantly predate the advent of agriculture. We show that correcting null models for demography increases the power to detect natural selection in A. gambiae.
Collapse
|
261
|
Zeng K. A Simple Multiallele Model and Its Application to Identifying Preferred-Unpreferred Codons Using Polymorphism Data. Mol Biol Evol 2010; 27:1327-37. [DOI: 10.1093/molbev/msq023] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
262
|
Abstract
Although mutation provides the fuel for phenotypic evolution, it also imposes a substantial burden on fitness through the production of predominantly deleterious alleles, a matter of concern from a human-health perspective. Here, recently established databases on de novo mutations for monogenic disorders are used to estimate the rate and molecular spectrum of spontaneously arising mutations and to derive a number of inferences with respect to eukaryotic genome evolution. Although the human per-generation mutation rate is exceptionally high, on a per-cell division basis, the human germline mutation rate is lower than that recorded for any other species. Comparison with data from other species demonstrates a universal mutational bias toward A/T composition, and leads to the hypothesis that genome-wide nucleotide composition generally evolves to the point at which the power of selection in favor of G/C is approximately balanced by the power of random genetic drift, such that variation in equilibrium genome-wide nucleotide composition is largely defined by variation in mutation biases. Quantification of the hazards associated with introns reveals that mutations at key splice-site residues are a major source of human mortality. Finally, a consideration of the long-term consequences of current human behavior for deleterious-mutation accumulation leads to the conclusion that a substantial reduction in human fitness can be expected over the next few centuries in industrialized societies unless novel means of genetic intervention are developed.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.
| |
Collapse
|
263
|
Pool JE, Hellmann I, Jensen JD, Nielsen R. Population genetic inference from genomic sequence variation. Genome Res 2010; 20:291-300. [PMID: 20067940 DOI: 10.1101/gr.079509.108] [Citation(s) in RCA: 147] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Population genetics has evolved from a theory-driven field with little empirical data into a data-driven discipline in which genome-scale data sets test the limits of available models and computational analysis methods. In humans and a few model organisms, analyses of whole-genome sequence polymorphism data are currently under way. And in light of the falling costs of next-generation sequencing technologies, such studies will soon become common in many other organisms as well. Here, we assess the challenges to analyzing whole-genome sequence polymorphism data, and we discuss the potential of these data to yield new insights concerning population history and the genomic prevalence of natural selection.
Collapse
Affiliation(s)
- John E Pool
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California 94720, USA
| | | | | | | |
Collapse
|
264
|
Ossowski S, Schneeberger K, Lucas-Lledó JI, Warthmann N, Clark RM, Shaw RG, Weigel D, Lynch M. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 2010; 327:92-4. [PMID: 20044577 PMCID: PMC3878865 DOI: 10.1126/science.1180677] [Citation(s) in RCA: 761] [Impact Index Per Article: 54.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
To take complete advantage of information on within-species polymorphism and divergence from close relatives, one needs to know the rate and the molecular spectrum of spontaneous mutations. To this end, we have searched for de novo spontaneous mutations in the complete nuclear genomes of five Arabidopsis thaliana mutation accumulation lines that had been maintained by single-seed descent for 30 generations. We identified and validated 99 base substitutions and 17 small and large insertions and deletions. Our results imply a spontaneous mutation rate of 7 x 10(-9) base substitutions per site per generation, the majority of which are G:C-->A:T transitions. We explain this very biased spectrum of base substitution mutations as a result of two main processes: deamination of methylated cytosines and ultraviolet light-induced mutagenesis.
Collapse
Affiliation(s)
- Stephan Ossowski
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Korbinian Schneeberger
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | | | - Norman Warthmann
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Richard M. Clark
- Department of Biology, University of Utah, Salt Lake City, UT 84112, USA
| | - Ruth G. Shaw
- Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN 55108, USA
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Michael Lynch
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| |
Collapse
|
265
|
Halligan DL, Keightley PD. Spontaneous Mutation Accumulation Studies in Evolutionary Genetics. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2009. [DOI: 10.1146/annurev.ecolsys.39.110707.173437] [Citation(s) in RCA: 320] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Daniel L. Halligan
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom; ,
| | - Peter D. Keightley
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom; ,
| |
Collapse
|
266
|
Gilad Y, Pritchard JK, Thornton K. Characterizing natural variation using next-generation sequencing technologies. Trends Genet 2009; 25:463-71. [PMID: 19801172 DOI: 10.1016/j.tig.2009.09.003] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2009] [Revised: 09/08/2009] [Accepted: 09/09/2009] [Indexed: 01/22/2023]
Abstract
Progress in evolutionary genomics is tightly coupled with the development of new technologies to collect high-throughput data. The availability of next-generation sequencing technologies has the potential to revolutionize genomic research and enable us to focus on a large number of outstanding questions that previously could not be addressed effectively. Indeed, we are now able to study genetic variation on a genome-wide scale, characterize gene regulatory processes at unprecedented resolution, and soon, we expect that individual laboratories might be able to rapidly sequence new genomes. However, at present, the analysis of next-generation sequencing data is challenging, in particular because most sequencing platforms provide short reads, which are difficult to align and assemble. In addition, only little is known about sources of variation that are associated with next-generation sequencing study designs. A better understanding of the sources of error and bias in sequencing data is essential, especially in the context of studies of variation at dynamic quantitative traits.
Collapse
Affiliation(s)
- Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | | | | |
Collapse
|
267
|
A genome-wide view of Caenorhabditis elegans base-substitution mutation processes. Proc Natl Acad Sci U S A 2009; 106:16310-4. [PMID: 19805298 DOI: 10.1073/pnas.0904895106] [Citation(s) in RCA: 191] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Knowledge of mutation processes is central to understanding virtually all evolutionary phenomena and the underlying nature of genetic disorders and cancers. However, the limitations of standard molecular mutation detection methods have historically precluded a genome-wide understanding of mutation rates and spectra in the nuclear genomes of multicellular organisms. We applied two high-throughput DNA sequencing technologies to identify and characterize hundreds of spontaneously arising base-substitution mutations in 10 Caenorhabditis elegans mutation-accumulation (MA)-line nuclear genomes. C. elegans mutation rate estimates were similar to previous calculations based on smaller numbers of mutations. Mutations were distributed uniformly within and among chromosomes and were not associated with recombination rate variation in the MA lines, suggesting that intragenomic variation in genetic hitchhiking and/or background selection are primarily responsible for the chromosomal distribution patterns of polymorphic nucleotides in C. elegans natural populations. A strong mutational bias from G/C to A/T nucleotides was detected in the MA lines, implicating oxidative DNA damage as a major endogenous mutagenic force in C. elegans. The observed mutational bias also suggests that the C. elegans nuclear genome cannot be at equilibrium because of mutation alone. Transversions dominate the spectrum of spontaneous mutations observed here, whereas transitions dominate patterns of allegedly neutral polymorphism in natural populations of C. elegans and many other animal species; this observation challenges the assumption that natural patterns of molecular variation in noncoding regions of the nuclear genome accurately reflect underlying mutation processes.
Collapse
|
268
|
Estimating selection intensity on synonymous codon usage in a nonequilibrium population. Genetics 2009; 183:651-62, 1SI-23SI. [PMID: 19620398 DOI: 10.1534/genetics.109.101782] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Codon usage bias is the nonrandom use of synonymous codons for the same amino acid. Most population genetic models of codon usage evolution assume that the population is at mutation-selection-drift equilibrium. Natural populations, however, frequently deviate from equilibrium, often because of recent demographic changes. Here, we construct a matrix model that includes the effects of a recent change in population size on estimates of selection on preferred vs. unpreferred codons. Our results suggest that patterns of synonymous polymorphisms affecting codon usage can be quite erratic after such a change; statistical methods that fail to take demographic effects into account can then give incorrect estimates of important parameters. We propose a new method that can accurately estimate both demographic and codon usage parameters. The method also provides a simple way of testing for the effects of covariates such as gene length and level of gene expression on the intensity of selection, which we apply to a large Drosophila melanogaster polymorphism data set. Our analyses of twofold degenerate codons reveal that (i) selection acts in favor of preferred codons, (ii) there is mutational bias in favor of unpreferred codons, (iii) shorter genes and genes with higher expression levels are under stronger selection, and (iv) there is little evidence for a recent change in population size in the Zimbabwe population of D. melanogaster.
Collapse
|