51
|
Abstract
Y chromosomes originate from ordinary autosomes and degenerate by accumulating deleterious mutations. This accumulation results from a lack of recombination on the Y and is driven by interference among deleterious mutations (Muller's ratchet and background selection) and the fixation of beneficial alleles (genetic hitchhiking). Here I show that the relative importance of these processes is expected to vary over the course of Y chromosome evolution due to changes in the number of active genes. The dominant mode of degeneration on a newly formed gene-rich Y chromosome is expected to be Muller's ratchet and/or background selection due to the large numbers of deleterious mutations arising in active genes. However, the relative importance of these modes of degeneration declines rapidly as active genes are lost. In contrast, the rate of degeneration due to hitchhiking is predicted to be highest on Y chromosomes containing an intermediate number of active genes. The temporal dynamics of these processes imply that a gradual restriction of recombination, as inferred in mammals, will increase the importance of genetic hitchhiking relative to Muller's ratchet and background selection.
Collapse
|
52
|
Boyko AR, Williamson SH, Indap AR, Degenhardt JD, Hernandez RD, Lohmueller KE, Adams MD, Schmidt S, Sninsky JJ, Sunyaev SR, White TJ, Nielsen R, Clark AG, Bustamante CD. Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet 2008; 4:e1000083. [PMID: 18516229 PMCID: PMC2377339 DOI: 10.1371/journal.pgen.1000083] [Citation(s) in RCA: 460] [Impact Index Per Article: 28.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2007] [Accepted: 04/29/2008] [Indexed: 11/19/2022] Open
Abstract
Quantifying the distribution of fitness effects among newly arising mutations in the human genome is key to resolving important debates in medical and evolutionary genetics. Here, we present a method for inferring this distribution using Single Nucleotide Polymorphism (SNP) data from a population with non-stationary demographic history (such as that of modern humans). Application of our method to 47,576 coding SNPs found by direct resequencing of 11,404 protein coding-genes in 35 individuals (20 European Americans and 15 African Americans) allows us to assess the relative contribution of demographic and selective effects to patterning amino acid variation in the human genome. We find evidence of an ancient population expansion in the sample with African ancestry and a relatively recent bottleneck in the sample with European ancestry. After accounting for these demographic effects, we find strong evidence for great variability in the selective effects of new amino acid replacing mutations. In both populations, the patterns of variation are consistent with a leptokurtic distribution of selection coefficients (e.g., gamma or log-normal) peaked near neutrality. Specifically, we predict 27-29% of amino acid changing (nonsynonymous) mutations are neutral or nearly neutral (|s|<0.01%), 30-42% are moderately deleterious (0.01%<|s|<1%), and nearly all the remainder are highly deleterious or lethal (|s|>1%). Our results are consistent with 10-20% of amino acid differences between humans and chimpanzees having been fixed by positive selection with the remainder of differences being neutral or nearly neutral. Our analysis also predicts that many of the alleles identified via whole-genome association mapping may be selectively neutral or (formerly) positively selected, implying that deleterious genetic variation affecting disease phenotype may be missed by this widely used approach for mapping genes underlying complex traits.
Collapse
Affiliation(s)
- Adam R. Boyko
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Scott H. Williamson
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Amit R. Indap
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Jeremiah D. Degenhardt
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Ryan D. Hernandez
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Kirk E. Lohmueller
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Mark D. Adams
- Department of Genetics, BRB-624, Case Western Reserve University, Cleveland, Ohio, United States of America
| | - Steffen Schmidt
- Division of Genetics, Department of Medicine, Brigham & Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - John J. Sninsky
- Celera Diagnostics, Alameda, California, United States of America
| | - Shamil R. Sunyaev
- Division of Genetics, Department of Medicine, Brigham & Women's Hospital and Harvard Medical School, Boston, Massachusetts, United States of America
| | - Thomas J. White
- Celera Diagnostics, Alameda, California, United States of America
| | - Rasmus Nielsen
- Center for Comparative Genomics, University of Copenhagen, Copenhagen, Denmark
| | - Andrew G. Clark
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Carlos D. Bustamante
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| |
Collapse
|
53
|
Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies. Genetics 2008; 177:2251-61. [PMID: 18073430 DOI: 10.1534/genetics.107.080663] [Citation(s) in RCA: 261] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The distribution of fitness effects of new mutations (DFE) is important for addressing several questions in genetics, including the nature of quantitative variation and the evolutionary fate of small populations. Properties of the DFE can be inferred by comparing the distributions of the frequencies of segregating nucleotide polymorphisms at selected and neutral sites in a population sample, but demographic changes alter the spectrum of allele frequencies at both neutral and selected sites, so can bias estimates of the DFE if not accounted for. We have developed a maximum-likelihood approach, based on the expected allele-frequency distribution generated by transition matrix methods, to estimate parameters of the DFE while simultaneously estimating parameters of a demographic model that allows a population size change at some time in the past. We tested the method using simulations and found that it accurately recovers simulated parameter values, even if the simulated demography differs substantially from that assumed in our analysis. We use our method to estimate parameters of the DFE for amino acid-changing mutations in humans and Drosophila melanogaster. For a model of unconditionally deleterious mutations, with effects sampled from a gamma distribution, the mean estimate for the distribution shape parameter is approximately 0.2 for human populations, which implies that the DFE is strongly leptokurtic. For Drosophila populations, we estimate that the shape parameter is approximately 0.35. Differences in the shape of the distribution and the mean selection coefficient between humans and Drosophila result in significantly more strongly deleterious mutations in Drosophila than in humans, and, conversely, nearly neutral mutations are significantly less frequent.
Collapse
|
54
|
Charlesworth J, Eyre-Walker A. The McDonald-Kreitman test and slightly deleterious mutations. Mol Biol Evol 2008; 25:1007-15. [PMID: 18195052 DOI: 10.1093/molbev/msn005] [Citation(s) in RCA: 138] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It is possible to estimate the proportion of substitutions that are due to adaptive evolution using the numbers of silent and nonsilent polymorphisms and substitutions in a McDonald and Kreitman-type analysis. Unfortunately, this estimate of adaptive evolution is biased downward by the segregation of slightly deleterious mutations. It has been suggested that 1 way to cope with the effects of these slightly deleterious mutations is to remove low-frequency polymorphisms from the analysis. We investigate the performance of this method theoretically. We show that although removing low-frequency polymorphisms does indeed reduce the bias in the estimate of adaptive evolution, the estimate is always downwardly biased, often to the extent that one would not be able to detect adaptive evolution, even if it existed. The method is reasonably satisfactory, only if the rate of adaptive evolution is high and the distribution of fitness effects for slightly deleterious mutations is very leptokurtic. Our analysis suggests that adaptive evolution could be quite prevalent in humans (>8%) and still not be detectable using current methodologies. Our analysis also suggests that the level of adaptive evolution has probably been underestimated, possibly substantially, in both bacteria and Drosophila.
Collapse
Affiliation(s)
- Jane Charlesworth
- Centre for the Study of Evolution, University of Sussex, Brighton, United Kingdom
| | | |
Collapse
|
55
|
Mitchell-Olds T, Willis JH, Goldstein DB. Which evolutionary processes influence natural genetic variation for phenotypic traits? Nat Rev Genet 2007; 8:845-56. [DOI: 10.1038/nrg2207] [Citation(s) in RCA: 384] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
56
|
Bazykin GA, Kondrashov FA, Brudno M, Poliakov A, Dubchak I, Kondrashov AS. Extensive parallelism in protein evolution. Biol Direct 2007; 2:20. [PMID: 17705846 PMCID: PMC2020468 DOI: 10.1186/1745-6150-2-20] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2007] [Accepted: 08/16/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Independently evolving lineages mostly accumulate different changes, which leads to their gradual divergence. However, parallel accumulation of identical changes is also common, especially in traits with only a small number of possible states. RESULTS We characterize parallelism in evolution of coding sequences in three four-species sets of genomes of mammals, Drosophila, and yeasts. Each such set contains two independent evolutionary paths, which we call paths I and II. An amino acid replacement which occurred along path I also occurs along path II with the probability 50-80% of that expected under selective neutrality. Thus, the per site rate of parallel evolution of proteins is several times higher than their average rate of evolution, but still lower than the rate of evolution of neutral sequences. This deficit may be caused by changes in the fitness landscape, leading to a replacement being possible along path I but not along path II. However, constant, weak selection assumed by the nearly neutral model of evolution appears to be a more likely explanation. Then, the average coefficient of selection associated with an amino acid replacement, in the units of the effective population size, must exceed approximately 0.4, and the fraction of effectively neutral replacements must be below approximately 30%. At a majority of evolvable amino acid sites, only a relatively small number of different amino acids is permitted. CONCLUSION High, but below-neutral, rates of parallel amino acid replacements suggest that a majority of amino acid replacements that occur in evolution are subject to weak, but non-trivial, selection, as predicted by Ohta's nearly-neutral theory.
Collapse
Affiliation(s)
- Georgii A Bazykin
- Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Bolshoi Karetny pereulok 19, Moscow, 127994, Russia
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA
| | - Fyodor A Kondrashov
- Section on Ecology, Behavior and Evolution, University of California at San Diego, La Jolla, CA 92093, USA
| | - Michael Brudno
- Department of Computer Science and Banting & Best Department of Medical Research, University of Toronto, Toronto, ON M5S 3J4, Canada
| | - Alexander Poliakov
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
| | - Inna Dubchak
- Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Berkeley, CA 94720, USA
- Department of Energy Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Alexey S Kondrashov
- Life Sciences Institute and Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109-2216, USA
| |
Collapse
|
57
|
Popadin K, Polishchuk LV, Mamirova L, Knorre D, Gunbin K. Accumulation of slightly deleterious mutations in mitochondrial protein-coding genes of large versus small mammals. Proc Natl Acad Sci U S A 2007; 104:13390-5. [PMID: 17679693 PMCID: PMC1948928 DOI: 10.1073/pnas.0701256104] [Citation(s) in RCA: 141] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2007] [Indexed: 11/18/2022] Open
Abstract
After the effective size of a population, N(e), declines, some slightly deleterious amino acid replacements which were initially suppressed by purifying selection become effectively neutral and can reach fixation. Here we investigate this phenomenon for a set of all 13 mitochondrial protein-coding genes from 110 mammalian species. By using body mass as a proxy for N(e), we show that large mammals (i.e., those with low N(e)) as compared with small ones (in our sample these are, on average, 369.5 kg and 275 g, respectively) have a 43% higher rate of accumulation of nonsynonymous nucleotide substitutions relative to synonymous substitutions, and an 8-40% higher rate of accumulation of radical amino acid substitutions relative to conservative substitutions, depending on the type of amino acid classification. These higher rates result in a 6% greater amino acid dissimilarity between modern species and their most recent reconstructed ancestors in large versus small mammals. Because nonsynonymous substitutions are likely to be more harmful than synonymous substitutions, and radical amino acid substitutions are likely to be more harmful than conservative ones, our results suggest that large mammals experience less efficient purifying selection than small mammals. Furthermore, because in the course of mammalian evolution body size tends to increase and, consequently, N(e) tends to decline, evolution of mammals toward large body size may involve accumulation of slightly deleterious mutations in mitochondrial protein-coding genes, which may contribute to decline or extinction of large mammals.
Collapse
Affiliation(s)
| | - Leonard V. Polishchuk
- General Ecology, Biological Faculty of M.V. Lomonosov Moscow State University, Vorobyevy Gory 1-12, Moscow 119992, Russia
| | - Leila Mamirova
- Institute for Information Transmission Problems RAS, Bolshoi Karetny pereulok 19, Moscow 127994, Russia
| | - Dmitry Knorre
- A.N. Belozersky Institute of Physico-Chemical Biology, M.V. Lomonosov Moscow State University, Building A, Moscow 119899, Russia; and
| | - Konstantin Gunbin
- **Institute of Cytology and Genetics SB RAS, Lavrentiev aven. 10, Novosibirsk 630090, Russia
| |
Collapse
|
58
|
Hughes AL. Looking for Darwin in all the wrong places: the misguided quest for positive selection at the nucleotide sequence level. Heredity (Edinb) 2007; 99:364-73. [PMID: 17622265 DOI: 10.1038/sj.hdy.6801031] [Citation(s) in RCA: 189] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Recent years have seen an explosion of interest in evidence for positive Darwinian selection at the molecular level. This quest has been hampered by the use of statistical methods that fail adequately to rule out alternative hypotheses, particularly the relaxation of purifying selection and the effects of population bottlenecks, during which the effectiveness of purifying selection is reduced. A further problem has been the assumption that positive selection will generally involve repeated amino-acid changes to a single protein. This model was derived from the case of the vertebrate major histocompatibility complex (MHC), but the MHC proteins are unusual in being involved in protein-protein recognition and in a co-evolutionary process of pathogens. There is no reason to suppose that repeated amino-acid changes to a single protein are involved in selectively advantageous phenotypes in general. Rather adaptive phenotypes are much more likely to result from other causes, including single amino-acid changes; deletion or silencing of genes or changes in the pattern of gene expression.
Collapse
Affiliation(s)
- A L Hughes
- Department of Biological Sciences, University of South Carolina, Columbia, SC 29208, USA.
| |
Collapse
|
59
|
Kryukov GV, Pennacchio LA, Sunyaev SR. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am J Hum Genet 2007; 80:727-39. [PMID: 17357078 PMCID: PMC1852724 DOI: 10.1086/513473] [Citation(s) in RCA: 444] [Impact Index Per Article: 26.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2006] [Accepted: 01/30/2007] [Indexed: 12/31/2022] Open
Abstract
The accumulation of mildly deleterious missense mutations in individual human genomes has been proposed to be a genetic basis for complex diseases. The plausibility of this hypothesis depends on quantitative estimates of the prevalence of mildly deleterious de novo mutations and polymorphic variants in humans and on the intensity of selective pressure against them. We combined analysis of mutations causing human Mendelian diseases, of human-chimpanzee divergence, and of systematic data on human genetic variation and found that ~20% of new missense mutations in humans result in a loss of function, whereas ~27% are effectively neutral. Thus, the remaining 53% of new missense mutations have mildly deleterious effects. These mutations give rise to many low-frequency deleterious allelic variants in the human population, as is evident from a new data set of 37 genes sequenced in >1,500 individual human chromosomes. Surprisingly, up to 70% of low-frequency missense alleles are mildly deleterious and are associated with a heterozygous fitness loss in the range 0.001-0.003. Thus, the low allele frequency of an amino acid variant can, by itself, serve as a predictor of its functional significance. Several recent studies have reported a significant excess of rare missense variants in candidate genes or pathways in individuals with extreme values of quantitative phenotypes. These studies would be unlikely to yield results if most rare variants were neutral or if rare variants were not a significant contributor to the genetic component of phenotypic inheritance. Our results provide a justification for these types of candidate-gene (pathway) association studies and imply that mutation-selection balance may be a feasible evolutionary mechanism underlying some common diseases.
Collapse
Affiliation(s)
- Gregory V Kryukov
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Boston, MA 02125, USA
| | | | | |
Collapse
|
60
|
Loewe L, Charlesworth B. Inferring the distribution of mutational effects on fitness in Drosophila. Biol Lett 2007; 2:426-30. [PMID: 17148422 PMCID: PMC1686194 DOI: 10.1098/rsbl.2006.0481] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The properties of the distribution of deleterious mutational effects on fitness (DDME) are of fundamental importance for evolutionary genetics. Since it is extremely difficult to determine the nature of this distribution, several methods using various assumptions about the DDME have been developed, for the purpose of parameter estimation. We apply a newly developed method to DNA sequence polymorphism data from two Drosophila species and compare estimates of the parameters of the distribution of the heterozygous fitness effects of amino acid mutations for several different distribution functions. The results exclude normal and gamma distributions, since these predict too few effectively lethal mutations and power-law distributions as a result of predicting too many lethals. Only the lognormal distribution appears to fit both the diversity data and the frequency of lethals. This DDME arises naturally in complex systems when independent factors contribute multiplicatively to an increase in fitness-reducing damage. Several important parameters, such as the fraction of effectively neutral non-synonymous mutations and the harmonic mean of non-neutral selection coefficients, are robust to the form of the DDME. Our results suggest that the majority of non-synonymous mutations in Drosophila are under effective purifying selection.
Collapse
Affiliation(s)
- Laurence Loewe
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, UK.
| | | |
Collapse
|
61
|
Nackley AG, Shabalina SA, Tchivileva IE, Satterfield K, Korchynskyi O, Makarov SS, Maixner W, Diatchenko L. Human catechol-O-methyltransferase haplotypes modulate protein expression by altering mRNA secondary structure. Science 2007; 314:1930-3. [PMID: 17185601 DOI: 10.1126/science.1131262] [Citation(s) in RCA: 660] [Impact Index Per Article: 38.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Catechol-O-methyltransferase (COMT) is a key regulator of pain perception, cognitive function, and affective mood. Three common haplotypes of the human COMT gene, divergent in two synonymous and one nonsynonymous position, code for differences in COMT enzymatic activity and are associated with pain sensitivity. Haplotypes divergent in synonymous changes exhibited the largest difference in COMT enzymatic activity, due to a reduced amount of translated protein. The major COMT haplotypes varied with respect to messenger RNA local stem-loop structures, such that the most stable structure was associated with the lowest protein levels and enzymatic activity. Site-directed mutagenesis that eliminated the stable structure restored the amount of translated protein. These data highlight the functional significance of synonymous variations and suggest the importance of haplotypes over single-nucleotide polymorphisms for analysis of genetic variations.
Collapse
Affiliation(s)
- A G Nackley
- Center for Neurosensory Disorders, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | | | | | | | | | | | | |
Collapse
|
62
|
Gu X. Stabilizing selection of protein function and distribution of selection coefficient among sites. Genetica 2006; 130:93-7. [PMID: 17077960 DOI: 10.1007/s10709-006-0022-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2006] [Accepted: 05/24/2006] [Indexed: 10/24/2022]
Abstract
In this study, I take a new approach to modeling the evolutionary constraint of protein sequence, introducing the stabilizing selection of protein function into the nearly-neutral theory. In other words, protein function under stabilizing selection generates the evolutionary conservation at the sequence level. With the help of random mutational effects of nucleotides on protein function, I have derived the distribution of selection coefficient among sites, called the S-distribution whose parameters have clear biological interpretations. Moreover, I have studied the inverse relationship between the evolutionary rate and the effective population size, showing that the number of molecular phenotypes of protein function, i.e., independent components in the fitness of the organism, may play a key role for the molecular clock under the nearly-neutral theory. These results are helpful for having a better understanding of the underlying evolutionary mechanism of protein sequences, as well as human disease-related mutations.
Collapse
Affiliation(s)
- Xun Gu
- Department of Genetics, Development and Cell Biology Center for Bioinformatics and Biological Statistics, Iowa State University, 536 Science II Hall, Ames, IA 50011, USA.
| |
Collapse
|
63
|
Subramanian S, Kumar S. Higher intensity of purifying selection on >90% of the human genes revealed by the intrinsic replacement mutation rates. Mol Biol Evol 2006; 23:2283-7. [PMID: 16982819 PMCID: PMC3072915 DOI: 10.1093/molbev/msl123] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
For over 3 decades, the rate of replacement mutations has been assumed to be equal to, and estimated from, the rate of "strictly" neutral sequence divergence in noncoding regions and in silent-codon positions where mutations do not alter the amino acid encoded. This assumption is fundamental to estimating the fraction of harmful protein mutations and to identifying adaptive evolution at individual codons and proteins. We show that the assumption is not justifiable because a much larger fraction of codon positions is involved in hypermutable CpG dinucleotides as compared with the introns, leading to a higher expected replacement mutation rate per site in a vast majority of the genes. Consideration of this difference reveals a higher intensity of purifying natural selection than previously inferred in human genes. We also show that a much smaller number of genes are expected to be evolving with positive selection than that predicted using sequence divergence at intron and silent positions in the human genome. These patterns indicate the need for using new approaches for estimating rates of amino acid-altering mutations in order to find positively selected genes and codons in genomes that contain hypermutable CpG's.
Collapse
Affiliation(s)
- Sankar Subramanian
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| | - Sudhir Kumar
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| |
Collapse
|
64
|
Abstract
The beneficial substitution of an allele shapes patterns of genetic variation at linked sites. Thus, in principle, adaptations can be mapped by looking for the signature of directional selection in polymorphism data. In practice, such efforts are hampered by the need for an accurate characterization of the demographic history of the species and of the effects of positive selection. In an attempt to circumvent these difficulties, researchers are increasingly taking a purely empirical approach, in which a large number of genomic regions are ordered by summaries of the polymorphism data, and loci with extreme values are considered to be likely targets of positive selection. We evaluated the reliability of the "empirical" approach, focusing on applications to human data and to maize. To do so, we considered a coalescent model of directional selection in a sensible demographic setting, allowing for selection on standing variation as well as on a new mutation. Our simulations suggest that while empirical approaches will identify several interesting candidates, they will also miss many--in some cases, most--loci of interest. The extent of the trade-off depends on the mode of positive selection and the demographic history of the population. Specifically, the false-discovery rate is higher when directional selection involves a recessive rather than a co-dominant allele, when it acts on a previously neutral rather than a new allele, and when the population has experienced a population bottleneck rather than maintained a constant size. One implication of these results is that, insofar as attributes of the beneficial mutation (e.g., the dominance coefficient) affect the power to detect targets of selection, genomic scans will yield an unrepresentative subset of loci that contribute to adaptations.
Collapse
Affiliation(s)
- Kosuke M. Teshima
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| | - Graham Coop
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| |
Collapse
|
65
|
Eyre-Walker A, Woolfit M, Phelps T. The distribution of fitness effects of new deleterious amino acid mutations in humans. Genetics 2006; 173:891-900. [PMID: 16547091 PMCID: PMC1526495 DOI: 10.1534/genetics.106.057570] [Citation(s) in RCA: 274] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The distribution of fitness effects of new mutations is a fundamental parameter in genetics. Here we present a new method by which the distribution can be estimated. The method is fairly robust to changes in population size and admixture, and it can be corrected for any residual effects if a model of the demography is available. We apply the method to extensively sampled single-nucleotide polymorphism data from humans and estimate the distribution of fitness effects for amino acid changing mutations. We show that a gamma distribution with a shape parameter of 0.23 provides a good fit to the data and we estimate that >50% of mutations are likely to have mild effects, such that they reduce fitness by between one one-thousandth and one-tenth. We also infer that <15% of new mutations are likely to have strongly deleterious effects. We estimate that on average a nonsynonymous mutation reduces fitness by a few percent and that the average strength of selection acting against a nonsynonymous polymorphism is approximately 9 x 10(-5). We argue that the relaxation of natural selection due to modern medicine and reduced variance in family size is not likely to lead to a rapid decline in genetic quality, but that it will be very difficult to locate most of the genes involved in complex genetic diseases.
Collapse
|