101
|
Inferring the Frequency Spectrum of Derived Variants to Quantify Adaptive Molecular Evolution in Protein-Coding Genes of Drosophila melanogaster. Genetics 2016; 203:975-84. [PMID: 27098912 DOI: 10.1534/genetics.116.188102] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 04/18/2014] [Indexed: 11/18/2022] Open
Abstract
Many approaches for inferring adaptive molecular evolution analyze the unfolded site frequency spectrum (SFS), a vector of counts of sites with different numbers of copies of derived alleles in a sample of alleles from a population. Accurate inference of the high-copy-number elements of the SFS is difficult, however, because of misassignment of alleles as derived vs. ancestral. This is a known problem with parsimony using outgroup species. Here we show that the problem is particularly serious if there is variation in the substitution rate among sites brought about by variation in selective constraint levels. We present a new method for inferring the SFS using one or two outgroups that attempts to overcome the problem of misassignment. We show that two outgroups are required for accurate estimation of the SFS if there is substantial variation in selective constraints, which is expected to be the case for nonsynonymous sites in protein-coding genes. We apply the method to estimate unfolded SFSs for synonymous and nonsynonymous sites in a population of Drosophila melanogaster from phase 2 of the Drosophila Population Genomics Project. We use the unfolded spectra to estimate the frequency and strength of advantageous and deleterious mutations and estimate that ∼50% of amino acid substitutions are positively selected but that <0.5% of new amino acid mutations are beneficial, with a scaled selection strength of Nes ≈ 12.
Collapse
|
102
|
Elevated Linkage Disequilibrium and Signatures of Soft Sweeps Are Common in Drosophila melanogaster. Genetics 2016; 203:863-80. [PMID: 27098909 DOI: 10.1534/genetics.115.184002] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2015] [Accepted: 03/25/2016] [Indexed: 12/20/2022] Open
Abstract
The extent to which selection and demography impact patterns of genetic diversity in natural populations of Drosophila melanogaster is yet to be fully understood. We previously observed that linkage disequilibrium (LD) at scales of ∼10 kb in the Drosophila Genetic Reference Panel (DGRP), consisting of 145 inbred strains from Raleigh, North Carolina, measured both between pairs of sites and as haplotype homozygosity, is elevated above neutral demographic expectations. We also demonstrated that signatures of strong and recent soft sweeps are abundant. However, the extent to which these patterns are specific to this derived and admixed population is unknown. It is also unclear whether these patterns are a consequence of the extensive inbreeding performed to generate the DGRP data. Here we analyze LD statistics in a sample of >100 fully-sequenced strains from Zambia; an ancestral population to the Raleigh population that has experienced little to no admixture and was generated by sequencing haploid embryos rather than inbred strains. We find an elevation in long-range LD and haplotype homozygosity compared to neutral expectations in the Zambian sample, thus showing the elevation in LD is not specific to the DGRP data set. This elevation in LD and haplotype structure remains even after controlling for possible confounders including genomic inversions, admixture, population substructure, close relatedness of individual strains, and recombination rate variation. Furthermore, signatures of partial soft sweeps similar to those found in the DGRP as well as partial hard sweeps are common in Zambia. These results suggest that while the selective forces and sources of adaptive mutations may differ in Zambia and Raleigh, elevated long-range LD and signatures of soft sweeps are generic in D. melanogaster.
Collapse
|
103
|
Nakashima Y, Higashiyama A, Ushimaru A, Nagoda N, Matsuo Y. Evolution of GC content in the histone gene repeating units from Drosophila lutescens, D. takahashii and D. pseudoobscura. Genes Genet Syst 2016; 91:27-36. [PMID: 27021916 DOI: 10.1266/ggs.15-00018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
A subset of histone genes (H1, H2A, H2B and H4), which are encoded along with H3 within repeating units, were analyzed in Drosophila lutescens, D. takahashii and D. pseudoobscura to investigate the evolutionary mechanisms influencing this multigene family and its GC content. Nucleotide divergence among species was more marked in the less functional regions. A strong inverse relationship was observed between the extent of evolutionary divergence and GC content within the repeating units; this finding indicated that the functional constraint on a region must be associated with both divergence and GC content. The GC content at 3(rd) codon positions in the histone genes from D. lutescens and D. takahashii was higher than that from D. melanogaster, while that from D. pseudoobscura was similar. These evolutionary patterns were similar to those of H3 gene regions. Based on these findings, we propose that the evolutionary mechanisms governing nucleotide content at 3(rd) codon positions tend to eliminate A and T nucleotides more frequently than G and C nucleotides. These changes might be the consequence of negative selection and would result in GC-rich 3(rd) codon positions. In addition, interspecific differences in GC content, which exhibited the same pattern for all histone genes, could be explained by different selection efficiencies that result from changes in population size.
Collapse
Affiliation(s)
- Yuko Nakashima
- Laboratory of Adaptive Evolution, Institute of Socio-Arts and Sciences, Tokushima University
| | | | | | | | | |
Collapse
|
104
|
Fischer I, Diévart A, Droc G, Dufayard JF, Chantret N. Evolutionary Dynamics of the Leucine-Rich Repeat Receptor-Like Kinase (LRR-RLK) Subfamily in Angiosperms. PLANT PHYSIOLOGY 2016; 170:1595-610. [PMID: 26773008 PMCID: PMC4775120 DOI: 10.1104/pp.15.01470] [Citation(s) in RCA: 74] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/16/2015] [Accepted: 01/14/2016] [Indexed: 05/20/2023]
Abstract
Gene duplications are an important factor in plant evolution, and lineage-specific expanded (LSE) genes are of particular interest. Receptor-like kinases expanded massively in land plants, and leucine-rich repeat receptor-like kinases (LRR-RLK) constitute the largest receptor-like kinases family. Based on the phylogeny of 7,554 LRR-RLK genes from 31 fully sequenced flowering plant genomes, the complex evolutionary dynamics of this family was characterized in depth. We studied the involvement of selection during the expansion of this family among angiosperms. LRR-RLK subgroups harbor extremely contrasting rates of duplication, retention, or loss, and LSE copies are predominantly found in subgroups involved in environmental interactions. Expansion rates also differ significantly depending on the time when rounds of expansion or loss occurred on the angiosperm phylogenetic tree. Finally, using a dN/dS-based test in a phylogenetic framework, we searched for selection footprints on LSE and single-copy LRR-RLK genes. Selective constraint appeared to be globally relaxed at LSE genes, and codons under positive selection were detected in 50% of them. Moreover, the leucine-rich repeat domains, and specifically four amino acids in them, were found to be the main targets of positive selection. Here, we provide an extensive overview of the expansion and evolution of this very large gene family.
Collapse
Affiliation(s)
- Iris Fischer
- Institut National de la Recherche Agronomique, Unité Mixte de Recherche Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34060 Montpellier, France (I.F., N.C.); andCentre de Coopération Internationale en Recherche Agronomique Pour le Développement, Unité Mixte de Recherche AGAP, F-34398 Montpellier, France (A.D., G.D., J.-F.D.)
| | - Anne Diévart
- Institut National de la Recherche Agronomique, Unité Mixte de Recherche Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34060 Montpellier, France (I.F., N.C.); andCentre de Coopération Internationale en Recherche Agronomique Pour le Développement, Unité Mixte de Recherche AGAP, F-34398 Montpellier, France (A.D., G.D., J.-F.D.)
| | - Gaetan Droc
- Institut National de la Recherche Agronomique, Unité Mixte de Recherche Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34060 Montpellier, France (I.F., N.C.); andCentre de Coopération Internationale en Recherche Agronomique Pour le Développement, Unité Mixte de Recherche AGAP, F-34398 Montpellier, France (A.D., G.D., J.-F.D.)
| | - Jean-François Dufayard
- Institut National de la Recherche Agronomique, Unité Mixte de Recherche Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34060 Montpellier, France (I.F., N.C.); andCentre de Coopération Internationale en Recherche Agronomique Pour le Développement, Unité Mixte de Recherche AGAP, F-34398 Montpellier, France (A.D., G.D., J.-F.D.)
| | - Nathalie Chantret
- Institut National de la Recherche Agronomique, Unité Mixte de Recherche Amélioration Génétique et Adaptation des Plantes Méditerranéennes et Tropicales, F-34060 Montpellier, France (I.F., N.C.); andCentre de Coopération Internationale en Recherche Agronomique Pour le Développement, Unité Mixte de Recherche AGAP, F-34398 Montpellier, France (A.D., G.D., J.-F.D.)
| |
Collapse
|
105
|
Noh S, Marshall JL. Sorted gene genealogies and species-specific nonsynonymous substitutions point to putative postmating prezygotic isolation genes in Allonemobius crickets. PeerJ 2016; 4:e1678. [PMID: 26893965 PMCID: PMC4756749 DOI: 10.7717/peerj.1678] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2015] [Accepted: 01/14/2016] [Indexed: 12/19/2022] Open
Abstract
In the Allonemobius socius complex of crickets, reproductive isolation is primarily accomplished via postmating prezygotic barriers. We tested seven protein-coding genes expressed in the male ejaculate for patterns of evolution consistent with a putative role as postmating prezygotic isolation genes. Our recently diverged species generally lacked sequence variation. As a result, ω-based tests were only mildly successful. Some of our genes showed evidence of elevated ω values on the internal branches of gene trees. In a couple of genes, these internal branches coincided with both species branching events of the species tree, between A. fasciatus and the other two species, and between A. socius and A. sp. nov. Tex. In comparison, more successful approaches were those that took advantage of the varying degrees of lineage sorting and allele sharing among our young species. These approaches were particularly powerful within the contact zone. Among the genes we tested we found genes with genealogies that indicated relatively advanced degrees of lineage sorting across both allopatric and contact zone alleles. Within a contact zone between two members of the species complex, only a subset of genes maintained allelic segregation despite evidence of ongoing gene flow in other genes. The overlap in these analyses was arginine kinase (AK) and apolipoprotein A-1 binding protein (APBP). These genes represent two of the first examples of sperm maturation, capacitation, and motility proteins with fixed non-synonymous substitutions between species-specific alleles that may lead to postmating prezygotic isolation. Both genes express ejaculate proteins transferred to females during copulation and were previously identified through comparative proteomics. We discuss the potential function of these genes in the context of the specific postmating prezygotic isolation phenotype among our species, namely conspecific sperm precedence and the superior ability of conspecific males to induce oviposition in females.
Collapse
Affiliation(s)
- Suegene Noh
- Department of Biology, Washington University in St. Louis , St. Louis, MO , United States
| | - Jeremy L Marshall
- Department of Entomology, Kansas State University , Manhattan, KS , United States
| |
Collapse
|
106
|
Pang E, Wu X, Lin K. Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences. Mol Genet Genomics 2016; 291:1127-36. [PMID: 26833483 PMCID: PMC4875946 DOI: 10.1007/s00438-016-1170-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 01/18/2016] [Indexed: 11/30/2022]
Abstract
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Collapse
Affiliation(s)
- Erli Pang
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| | - Xiaomei Wu
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, 310036, China
| | - Kui Lin
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| |
Collapse
|
107
|
Machado HE, Bergland AO, O'Brien KR, Behrman EL, Schmidt PS, Petrov DA. Comparative population genomics of latitudinal variation in Drosophila simulans and Drosophila melanogaster. Mol Ecol 2016; 25:723-40. [PMID: 26523848 DOI: 10.1111/mec.13446] [Citation(s) in RCA: 111] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Revised: 10/26/2015] [Accepted: 10/28/2015] [Indexed: 12/15/2022]
Abstract
Examples of clinal variation in phenotypes and genotypes across latitudinal transects have served as important models for understanding how spatially varying selection and demographic forces shape variation within species. Here, we examine the selective and demographic contributions to latitudinal variation through the largest comparative genomic study to date of Drosophila simulans and Drosophila melanogaster, with genomic sequence data from 382 individual fruit flies, collected across a spatial transect of 19 degrees latitude and at multiple time points over 2 years. Consistent with phenotypic studies, we find less clinal variation in D. simulans than D. melanogaster, particularly for the autosomes. Moreover, we find that clinally varying loci in D. simulans are less stable over multiple years than comparable clines in D. melanogaster. D. simulans shows a significantly weaker pattern of isolation by distance than D. melanogaster and we find evidence for a stronger contribution of migration to D. simulans population genetic structure. While population bottlenecks and migration can plausibly explain the differences in stability of clinal variation between the two species, we also observe a significant enrichment of shared clinal genes, suggesting that the selective forces associated with climate are acting on the same genes and phenotypes in D. simulans and D. melanogaster.
Collapse
Affiliation(s)
- Heather E Machado
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| | - Alan O Bergland
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| | - Katherine R O'Brien
- School of Biological Sciences, University of Nebraska-Lincoln, 348 Manter Hall, Lincoln, NE, 68588, USA.,Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Emily L Behrman
- Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Paul S Schmidt
- Department of Biology, University of Pennsylvania, 102 Leidy Laboratories, Philadelphia, PA, 19104-6313, USA
| | - Dmitri A Petrov
- Department of Biology, Stanford University, 371 Serra Mall, Stanford, CA, 94305-5020, USA
| |
Collapse
|
108
|
Cagan A, Blass T. Identification of genomic variants putatively targeted by selection during dog domestication. BMC Evol Biol 2016; 16:10. [PMID: 26754411 PMCID: PMC4710014 DOI: 10.1186/s12862-015-0579-7] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2015] [Accepted: 12/22/2015] [Indexed: 01/16/2023] Open
Abstract
BACKGROUND Dogs [Canis lupus familiaris] were the first animal species to be domesticated and continue to occupy an important place in human societies. Recent studies have begun to reveal when and where dog domestication occurred. While much progress has been made in identifying the genetic basis of phenotypic differences between dog breeds we still know relatively little about the genetic changes underlying the phenotypes that differentiate all dogs from their wild progenitors, wolves [Canis lupus]. In particular, dogs generally show reduced aggression and fear towards humans compared to wolves. Therefore, selection for tameness was likely a necessary prerequisite for dog domestication. With the increasing availability of whole-genome sequence data it is possible to try and directly identify the genetic variants contributing to the phenotypic differences between dogs and wolves. RESULTS We analyse the largest available database of genome-wide polymorphism data in a global sample of dogs 69 and wolves 7. We perform a scan to identify regions of the genome that are highly differentiated between dogs and wolves. We identify putatively functional genomic variants that are segregating or at high frequency [> = 0.75 Fst] for alternative alleles between dogs and wolves. A biological pathways analysis of the genes containing these variants suggests that there has been selection on the 'adrenaline and noradrenaline biosynthesis pathway', well known for its involvement in the fight-or-flight response. We identify 11 genes with putatively functional variants fixed for alternative alleles between dogs and wolves. The segregating variants in these genes are strong candidates for having been targets of selection during early dog domestication. CONCLUSIONS We present the first genome-wide analysis of the different categories of putatively functional variants that are fixed or segregating at high frequency between a global sampling of dogs and wolves. We find evidence that selection has been strongest around non-synonymous variants. Strong selection in the initial stages of dog domestication appears to have occurred on multiple genes involved in the fight-or-flight response, particularly in the catecholamine synthesis pathway. Different alleles in some of these genes have been associated with behavioral differences between modern dog breeds, suggesting an important role for this pathway at multiple stages in the domestication process.
Collapse
Affiliation(s)
- Alex Cagan
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103, Leipzig, Germany.
| | - Torsten Blass
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Platz 6, 04103, Leipzig, Germany.
| |
Collapse
|
109
|
Dettman JR, Sztepanacz JL, Kassen R. The properties of spontaneous mutations in the opportunistic pathogen Pseudomonas aeruginosa. BMC Genomics 2016; 17:27. [PMID: 26732503 PMCID: PMC4702332 DOI: 10.1186/s12864-015-2244-3] [Citation(s) in RCA: 56] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 11/25/2015] [Indexed: 12/23/2022] Open
Abstract
Background Natural genetic variation ultimately arises from the process of mutation. Knowledge of how the raw material for evolution is produced is necessary for a full understanding of several fundamental evolutionary concepts. We performed a mutation accumulation experiment with wild-type and mismatch-repair deficient, mutator lines of the pathogenic bacterium Pseudomonas aeruginosa, and used whole-genome sequencing to reveal the genome-wide rate, spectrum, distribution, leading/lagging bias, and context-dependency of spontaneous mutations. Results Wild-type base-pair mutation and indel rates were ~10−10 and ~10−11 per nucleotide per generation, respectively, and deficiencies in the mismatch-repair system caused rates to increase by over two orders of magnitude. A universal bias towards AT was observed in wild-type lines, but was reversed in mutator lines to a bias towards GC. Biases for which types of mutations occurred during replication of the leading versus lagging strand were detected reciprocally in both replichores. The distribution of mutations along the chromosome was non-random, with peaks near the terminus of replication and at positions intermediate to the replication origin and terminus. A similar distribution bias was observed along the chromosome in natural populations of P. aeruginosa. Site-specific mutation rates were higher when the focal nucleotide was immediately flanked by C:G pairings. Conclusions Whole-genome sequencing of mutation accumulation lines allowed the comprehensive identification of mutations and revealed what factors of molecular and genomic architecture affect the mutational process. Our study provides a more complete view of how several mechanisms of mutation, mutation repair, and bias act simultaneously to produce the raw material for evolution. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2244-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jeremy R Dettman
- Department of Biology and Centre for Advanced Research in Environmental Genomics, University of Ottawa, Ottawa, ON, K1N 6N5, Canada.
| | | | - Rees Kassen
- Department of Biology and Centre for Advanced Research in Environmental Genomics, University of Ottawa, Ottawa, ON, K1N 6N5, Canada.
| |
Collapse
|
110
|
Hauber DJ, Grogan DW, DeBry RW. Mutations to Less-Preferred Synonymous Codons in a Highly Expressed Gene of Escherichia coli: Fitness and Epistatic Interactions. PLoS One 2016; 11:e0146375. [PMID: 26727272 PMCID: PMC4699635 DOI: 10.1371/journal.pone.0146375] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 12/16/2015] [Indexed: 01/11/2023] Open
Abstract
Codon-tRNA coevolution to maximize protein production has been, until recently, the dominant hypothesis to explain codon-usage bias in highly expressed bacterial genes. Two predictions of this hypothesis are 1) selection is weak; and 2) similar silent replacements at different codons should have similar fitness consequence. We used an allele-replacement strategy to change five specific 3rd-codon-position (silent) sites in the highly expressed Escherichia coli ribosomal protein gene rplQ from the wild type to a less-preferred alternative. We introduced the five mutations within a 10-codon region. Four of the silent sites were chosen to test the second prediction, with a CTG to CTA mutation being introduced at two closely linked leucine codons and an AAA to AAG mutation being introduced at two closely linked lysine codons. We also introduced a fifth silent mutation, a GTG to GTA mutation at a valine codon in the same genic region. We measured the fitness effect of the individual mutations by competing each single-mutant strain against the parental wild-type strain, using a disrupted form of the araA gene as a selectively neutral phenotypic marker to distinguish between strains in direct competition experiments. Three of the silent mutations had a fitness effect of |s| > 0.02, which is contradictory to the prediction that selection will be weak. The two leucine mutations had significantly different fitness effects, as did the two lysine mutations, contradictory to the prediction that similar mutations at different codons should have similar fitness effects. We also constructed a strain carrying all five silent mutations in combination. Its fitness effect was greater than that predicted from the individual fitness values, suggesting that negative synergistic epistasis acts on the combination allele.
Collapse
Affiliation(s)
- David J. Hauber
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Dennis W. Grogan
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
| | - Ronald W. DeBry
- Department of Biological Sciences, University of Cincinnati, Cincinnati, Ohio, United States of America
| |
Collapse
|
111
|
Price N, Graur D. Are Synonymous Sites in Primates and Rodents Functionally Constrained? J Mol Evol 2015; 82:51-64. [PMID: 26563252 DOI: 10.1007/s00239-015-9719-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 11/04/2015] [Indexed: 11/28/2022]
Abstract
It has been claimed that synonymous sites in mammals are under selective constraint. Furthermore, in many studies the selective constraint at such sites in primates was claimed to be more stringent than that in rodents. Given the larger effective population sizes in rodents than in primates, the theoretical expectation is that selection in rodents would be more effective than that in primates. To resolve this contradiction between expectations and observations, we used processed pseudogenes as a model for strict neutral evolution, and estimated selective constraint on synonymous sites using the rate of substitution at pseudosynonymous and pseudononsynonymous sites in pseudogenes as the neutral expectation. After controlling for the effects of GC content, our results were similar to those from previous studies, i.e., synonymous sites in primates exhibited evidence for higher selective constraint that those in rodents. Specifically, our results indicated that in primates up to 24% of synonymous sites could be under purifying selection, while in rodents synonymous sites evolved neutrally. To further control for shifts in GC content, we estimated selective constraint at fourfold degenerate sites using a maximum parsimony approach. This allowed us to estimate selective constraint using mutational patterns that cause a shift in GC content (GT ↔ TG, CT ↔ TC, GA ↔ AG, and CA ↔ AC) and ones that do not (AT ↔ TA and CG ↔ GC). Using this approach, we found that synonymous sites evolve neutrally in both primates and rodents. Apparent deviations from neutrality were caused by a higher rate of C → A and C → T mutations in pseudogenes. Such differences are most likely caused by the shift in GC content experienced by pseudogenes. We conclude that previous estimates according to which 20-40% of synonymous sites in primates were under selective constraint were most likely artifacts of the biased pattern of mutation.
Collapse
Affiliation(s)
- Nicholas Price
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO, 80523, USA.
| | - Dan Graur
- Department of Biology and Biochemistry, University of Houston, Houston, TX, 77204-5001, USA
| |
Collapse
|
112
|
Carlini DB, Makowski M. Codon bias and gene ontology in holometabolous and hemimetabolous insects. JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION 2015; 324:686-98. [DOI: 10.1002/jez.b.22647] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2015] [Accepted: 08/10/2015] [Indexed: 01/25/2023]
|
113
|
Castellano D, Coronado-Zamora M, Campos JL, Barbadilla A, Eyre-Walker A. Adaptive Evolution Is Substantially Impeded by Hill-Robertson Interference in Drosophila. Mol Biol Evol 2015; 33:442-55. [PMID: 26494843 PMCID: PMC4794616 DOI: 10.1093/molbev/msv236] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Hill-Robertson interference (HRi) is expected to reduce the efficiency of natural selection when two or more linked selected sites do not segregate freely, but no attempt has been done so far to quantify the overall impact of HRi on the rate of adaptive evolution for any given genome. In this work, we estimate how much HRi impedes the rate of adaptive evolution in the coding genome of Drosophila melanogaster. We compiled a data set of 6,141 autosomal protein-coding genes from Drosophila, from which polymorphism levels in D. melanogaster and divergence out to D. yakuba were estimated. The rate of adaptive evolution was calculated using a derivative of the McDonald-Kreitman test that controls for slightly deleterious mutations. We find that the rate of adaptive amino acid substitution at a given position of the genome is positively correlated to both the rate of recombination and the mutation rate, and negatively correlated to the gene density of the region. These correlations are robust to controlling for each other, for synonymous codon bias and for gene functions related to immune response and testes. We show that HRi diminishes the rate of adaptive evolution by approximately 27%. Interestingly, genes with low mutation rates embedded in gene poor regions lose approximately 17% of their adaptive substitutions whereas genes with high mutation rates embedded in gene rich regions lose approximately 60%. We conclude that HRi hampers the rate of adaptive evolution in Drosophila and that the variation in recombination, mutation, and gene density along the genome affects the HRi effect.
Collapse
Affiliation(s)
- David Castellano
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Marta Coronado-Zamora
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Jose L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Antonio Barbadilla
- Genomics, Bioinformatics and Evolution Group, Institut de Biotecnologia i de Biomedicina (IBB) and Department de Genètica i Microbiologia, Campus Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
| | - Adam Eyre-Walker
- Centre for the Study of Evolution, School of Life Sciences, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
114
|
Gu W, Gurguis CI, Zhou JJ, Zhu Y, Ko EA, Ko JH, Wang T, Zhou T. Functional and Structural Consequence of Rare Exonic Single Nucleotide Polymorphisms: One Story, Two Tales. Genome Biol Evol 2015; 7:2929-40. [PMID: 26454016 PMCID: PMC4684694 DOI: 10.1093/gbe/evv191] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/05/2015] [Indexed: 01/01/2023] Open
Abstract
Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases.
Collapse
Affiliation(s)
- Wanjun Gu
- Research Center for Learning Sciences, Southeast University, Nanjing, Jiangsu, China
| | | | - Jin J Zhou
- Department of Epidemiology and Biostatistics, The University of Arizona
| | - Yihua Zhu
- School of Biological Science and Medical Engineering, Southeast University, Nanjing, Jiangsu, China College of Information Science and Technology, Nanjing Agricultural University, Nanjing, Jiangsu, China
| | - Eun-A Ko
- Department of Pharmacology, The University of Nevada School of Medicine, Reno
| | - Jae-Hong Ko
- Department of Physiology, College of Medicine, Chung-Ang University, Seoul, South Korea
| | - Ting Wang
- Department of Medicine, The University of Arizona
| | - Tong Zhou
- Department of Medicine, The University of Arizona
| |
Collapse
|
115
|
Maddamsetti R, Hatcher PJ, Cruveiller S, Médigue C, Barrick JE, Lenski RE. Synonymous Genetic Variation in Natural Isolates of Escherichia coli Does Not Predict Where Synonymous Substitutions Occur in a Long-Term Experiment. Mol Biol Evol 2015. [PMID: 26199375 PMCID: PMC4651231 DOI: 10.1093/molbev/msv161] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Synonymous genetic differences vary by more than 20-fold among genes in natural isolates of Escherichia coli. One hypothesis to explain this heterogeneity is that genes with high levels of synonymous variation mutate at higher rates than genes with low synonymous variation. If so, then one would expect to observe similar mutational patterns in evolution experiments. In fact, however, the pattern of synonymous substitutions in a long-term evolution experiment with E. coli does not support this hypothesis. In particular, the extent of synonymous variation across genes in that experiment does not reflect the variation observed in natural isolates of E. coli. Instead, gene length alone predicts with high accuracy the prevalence of synonymous changes in the experimental populations. We hypothesize that patterns of synonymous variation in natural E. coli populations are instead caused by differences across genomic regions in their effective population size that, in turn, reflect different histories of recombination, horizontal gene transfer, selection, and population structure.
Collapse
Affiliation(s)
- Rohan Maddamsetti
- Ecology, Evolutionary Biology, and Behavior Program, Michigan State University BEACON Center for the Study of Evolution in Action, Michigan State University
| | | | - Stéphane Cruveiller
- CNRS-UMR 8030 and Commissariat à l'Energie Atomique CEA/DSV/IG/Genoscope LABGeM, Evry, France
| | - Claudine Médigue
- CNRS-UMR 8030 and Commissariat à l'Energie Atomique CEA/DSV/IG/Genoscope LABGeM, Evry, France
| | - Jeffrey E Barrick
- BEACON Center for the Study of Evolution in Action, Michigan State University Department of Molecular Biosciences, Institute for Cellular and Molecular Biology, Center for Systems and Synthetic Biology, The University of Texas at Austin
| | - Richard E Lenski
- Ecology, Evolutionary Biology, and Behavior Program, Michigan State University BEACON Center for the Study of Evolution in Action, Michigan State University
| |
Collapse
|
116
|
Choi KS, Park S. The complete chloroplast genome sequence of Aster spathulifolius (Asteraceae); genomic features and relationship with Asteraceae. Gene 2015; 572:214-21. [PMID: 26164759 DOI: 10.1016/j.gene.2015.07.020] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 06/02/2015] [Accepted: 07/06/2015] [Indexed: 11/26/2022]
Abstract
Aster spathulifolius, a member of the Asteraceae family, is distributed along the coast of Japan and Korea. This plant is used for medicinal and ornamental purposes. The complete chloroplast (cp) genome of A. sphathulifolius consists of 149,473 bp that include a pair of inverted repeats of 24,751 bp separated by a large single copy region of 81,998 bp and a small single copy region of 17,973 bp. The chloroplast genome contains 78 coding genes, four rRNA genes and 29 tRNA genes. When compared to other cpDNA sequences of Asteraceae, A. spathulifolius showed the closest relationship with Jacobaea vulgaris, and its atpB gene was found to be a pseudogene, unlike J. vulgaris. Furthermore, evaluation of the gene compositions of J. vulgaris, Helianthus annuus, Guizotia abyssinica and A. spathulifolius revealed that 13.6-kb showed inversion from ndhF to rps15, unlike Lactuca of Asteraceae. Comparison of the synonymous (Ks) and nonsynonymous (Ka) substitution rates with J. vulgaris revealed that synonymous genes related to a small subunit of the ribosome showed the highest value (0.1558), while nonsynonymous rates of genes related to ATP synthase genes were highest (0.0118). These findings revealed that substitution has occurred at similar rates in most genes, and the substitution rates suggested that most genes is a purified selection.
Collapse
Affiliation(s)
- Kyoung Su Choi
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongsangbuk-do 712-749, South Korea
| | - SeonJoo Park
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongsangbuk-do 712-749, South Korea.
| |
Collapse
|
117
|
Koufopanou V, Lomas S, Tsai IJ, Burt A. Estimating the Fitness Effects of New Mutations in the Wild Yeast Saccharomyces paradoxus. Genome Biol Evol 2015; 7:1887-95. [PMID: 26085542 PMCID: PMC4524479 DOI: 10.1093/gbe/evv112] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The nature of selection acting on a population is in large measure determined by the distribution of fitness effects of new mutations. In this study, we use DNA sequences from four closely related clades of Saccharomyces paradoxus and Saccharomyces cerevisiae to identify and polarize new mutations and estimate their fitness effects. By progressively restricting the analyses to narrower categories of sites, we further seek to characterize sites with predictable mutational effects, that is, unconditionally deleterious, neutral or beneficial. Consistent with previous studies on S. paradoxus, we have failed to find evidence for mutations with beneficial effects, even in regions that were divergent in two outgroup clades, perhaps a consequence of the relatively unchallenged, predominantly asexual and highly inbred lifestyle of this species. On the other hand, there is abundant evidence of deleterious mutations, varying in severity of effect from strongly deleterious to very mild, particularly in regions conserved in the outgroup taxa, indicating a history of persistent purifying selection. Narrowing the analysis down to individual amino acids reduces further the range of effects: for example, mutations changing cysteine are predicted to be nearly always strongly deleterious, whereas those changing arginine, serine, and tyrosine are expected to be nearly neutral. The proportion of mutations with deleterious effects for a particular amino acid is correlated with long-term stasis of that amino acid among highly divergent sequences from a variety of organisms, showing that functionality of sites tends to persist through the diversification of clades and that our findings are also relevant to longer evolutionary times and other taxa.
Collapse
Affiliation(s)
- Vassiliki Koufopanou
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, Berks, United Kingdom
| | - Susan Lomas
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, Berks, United Kingdom
| | - Isheng J Tsai
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, Berks, United Kingdom Present address: Biodiversity Research Center, Academia Sinica, Taipei, Taiwan
| | - Austin Burt
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, Berks, United Kingdom
| |
Collapse
|
118
|
Platt A, Gugger PF, Pellegrini M, Sork VL. Genome-wide signature of local adaptation linked to variable CpG methylation in oak populations. Mol Ecol 2015; 24:3823-30. [DOI: 10.1111/mec.13230] [Citation(s) in RCA: 84] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Revised: 04/22/2015] [Accepted: 04/23/2015] [Indexed: 02/04/2023]
Affiliation(s)
- Alexander Platt
- Ecology and Evolutionary Biology; University of California; 610 Charles E. Young Dr. E. Los Angeles CA 90095 USA
| | - Paul F. Gugger
- Ecology and Evolutionary Biology; University of California; 610 Charles E. Young Dr. E. Los Angeles CA 90095 USA
| | - Matteo Pellegrini
- Ecology and Evolutionary Biology; University of California; 610 Charles E. Young Dr. E. Los Angeles CA 90095 USA
| | - Victoria L. Sork
- Ecology and Evolutionary Biology; University of California; 610 Charles E. Young Dr. E. Los Angeles CA 90095 USA
| |
Collapse
|
119
|
Gilchrist MA, Chen WC, Shah P, Landerer CL, Zaretzki R. Estimating Gene Expression and Codon-Specific Translational Efficiencies, Mutation Biases, and Selection Coefficients from Genomic Data Alone. Genome Biol Evol 2015; 7:1559-79. [PMID: 25977456 PMCID: PMC4494061 DOI: 10.1093/gbe/evv087] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Extracting biologically meaningful information from the continuing flood of genomic data is a major challenge in the life sciences. Codon usage bias (CUB) is a general feature of most genomes and is thought to reflect the effects of both natural selection for efficient translation and mutation bias. Here we present a mechanistically interpretable, Bayesian model (ribosome overhead costs Stochastic Evolutionary Model of Protein Production Rate [ROC SEMPPR]) to extract meaningful information from patterns of CUB within a genome. ROC SEMPPR is grounded in population genetics and allows us to separate the contributions of mutational biases and natural selection against translational inefficiency on a gene-by-gene and codon-by-codon basis. Until now, the primary disadvantage of similar approaches was the need for genome scale measurements of gene expression. Here, we demonstrate that it is possible to both extract accurate estimates of codon-specific mutation biases and translational efficiencies while simultaneously generating accurate estimates of gene expression, rather than requiring such information. We demonstrate the utility of ROC SEMPPR using the Saccharomyces cerevisiae S288c genome. When we compare our model fits with previous approaches we observe an exceptionally high agreement between estimates of both codon-specific parameters and gene expression levels ([Formula: see text] in all cases). We also observe strong agreement between our parameter estimates and those derived from alternative data sets. For example, our estimates of mutation bias and those from mutational accumulation experiments are highly correlated ([Formula: see text]). Our estimates of codon-specific translational inefficiencies and tRNA copy number-based estimates of ribosome pausing time ([Formula: see text]), and mRNA and ribosome profiling footprint-based estimates of gene expression ([Formula: see text]) are also highly correlated, thus supporting the hypothesis that selection against translational inefficiency is an important force driving the evolution of CUB. Surprisingly, we find that for particular amino acids, codon usage in highly expressed genes can still be largely driven by mutation bias and that failing to take mutation bias into account can lead to the misidentification of an amino acid's "optimal" codon. In conclusion, our method demonstrates that an enormous amount of biologically important information is encoded within genome scale patterns of codon usage, accessing this information does not require gene expression measurements, but instead carefully formulated biologically interpretable models.
Collapse
Affiliation(s)
- Michael A Gilchrist
- Department of Ecology & Evolutionary Biology, University of Tennessee, Knoxville National Institute for Mathematical and Biological Synthesis, Knoxville, Tennessee
| | - Wei-Chen Chen
- Department of Ecology & Evolutionary Biology, University of Tennessee, Knoxville Center for Biologics Evaluation and Research, Food and Drug Administration, Silver Spring, Maryland
| | - Premal Shah
- Department of Biology, University of Pennsylvania
| | - Cedric L Landerer
- Department of Ecology & Evolutionary Biology, University of Tennessee, Knoxville
| | - Russell Zaretzki
- National Institute for Mathematical and Biological Synthesis, Knoxville, Tennessee Department of Business Analytics and Statistics, University of Tennessee, Knoxville
| |
Collapse
|
120
|
Santpere G, Carnero-Montoro E, Petit N, Serra F, Hvilsom C, Rambla J, Heredia-Genestar JM, Halligan DL, Dopazo H, Navarro A, Bosch E. Analysis of Five Gene Sets in Chimpanzees Suggests Decoupling between the Action of Selection on Protein-Coding and on Noncoding Elements. Genome Biol Evol 2015; 7:1490-505. [PMID: 25977458 PMCID: PMC4494068 DOI: 10.1093/gbe/evv082] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
We set out to investigate potential differences and similarities between the selective forces acting upon the coding and noncoding regions of five different sets of genes defined according to functional and evolutionary criteria: 1) two reference gene sets presenting accelerated and slow rates of protein evolution (the Complement and Actin pathways); 2) a set of genes with evidence of accelerated evolution in at least one of their introns; and 3) two gene sets related to neurological function (Parkinson’s and Alzheimer’s diseases). To that effect, we combine human–chimpanzee divergence patterns with polymorphism data obtained from target resequencing 20 central chimpanzees, our closest relatives with largest long-term effective population size. By using the distribution of fitness effect-alpha extension of the McDonald–Kreitman test, we reproduce inferences of rates of evolution previously based only on divergence data on both coding and intronic sequences and also obtain inferences for other classes of genomic elements (untranslated regions, promoters, and conserved noncoding sequences). Our results suggest that 1) the distribution of fitness effect-alpha method successfully helps distinguishing different scenarios of accelerated divergence (adaptation or relaxed selective constraints) and 2) the adaptive history of coding and noncoding sequences within the gene sets analyzed is decoupled.
Collapse
Affiliation(s)
- Gabriel Santpere
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| | - Elena Carnero-Montoro
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| | - Natalia Petit
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| | - François Serra
- Structural Genomics Team, Genome Biology Group, Centre Nacional d'Anàlisi Genòmica (CNAG), Barcelona, Spain
| | | | - Jordi Rambla
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| | - Jose Maria Heredia-Genestar
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| | - Daniel L Halligan
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Hernan Dopazo
- Biomedical Genomics & Evolution Laboratory, Departamento de Ecología, Genética y Evolución, IEGEBA (CONICET-UBA), Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Argentina
| | - Arcadi Navarro
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain National Institute for Bioinformatics (INB), PRBB, Barcelona, Spain Institució Catalana de Recerca i Estudis Avançats (ICREA), PRBB, Barcelona, Spain Center for Genomic Regulation (CRG), PRBB, Barcelona, Spain
| | - Elena Bosch
- Departament de Ciències Experimentals i la Salut, Institute of Evolutionary Biology (UPF-CSIC), Universitat Pompeu Fabra, PRBB, Barcelona, Spain
| |
Collapse
|
121
|
Abstract
Transposable elements (TEs) are an important factor shaping eukaryotic genomes. Although a significant body of research has been conducted on the abundance of TEs in nuclear genomes, TEs in mitochondrial genomes remain elusive. In this study, we successfully assembled 28 complete yeast mitochondrial genomes and took advantage of the power of population genomics to determine mobile DNAs and their propensity. We have observed compelling evidence of GC clusters propagating within the mitochondrial genome and being horizontally transferred between species. These mitochondrial TEs experience rapid diversification by nucleotide substitution and, more importantly, undergo dynamic merger and shuffling to form new TEs. Given the hyper mobile and transformable nature of mitochondrial TEs, our findings open the door to a deeper understanding of eukaryotic mitochondrial genome evolution and the origin of nonautonomous TEs.
Collapse
|
122
|
Abstract
Numerous computational methods exist to assess the mode and strength of natural selection in protein-coding sequences, yet how distinct methods relate to one another remains largely unknown. Here, we elucidate the relationship between two widely used phylogenetic modeling frameworks: dN/dS models and mutation-selection (MutSel) models. We derive a mathematical relationship between dN/dS and scaled selection coefficients, the focal parameters of MutSel models, and use this relationship to gain deeper insight into the behaviors, limitations, and applicabilities of these two modeling frameworks. We prove that, if all synonymous changes are neutral, standard MutSel models correspond to dN/dS ≤ 1. However, if synonymous codons differ in fitness, dN/dS can take on arbitrarily high values even if all selection is purifying. Thus, the MutSel modeling framework cannot necessarily accommodate positive, diversifying selection, while dN/dS cannot distinguish between purifying selection on synonymous codons and positive selection on amino acids. We further propose a new benchmarking strategy of dN/dS inferences against MutSel simulations and demonstrate that the widely used Goldman-Yang-style dN/dS models yield substantially biased dN/dS estimates on realistic sequence data. In contrast, the less frequently used Muse-Gaut-style models display much less bias. Strikingly, the least-biased and most precise dN/dS estimates are never found in the models with the best fit to the data, measured through both AIC and BIC scores. Thus, selecting models based on goodness-of-fit criteria can yield poor parameter estimates if the models considered do not precisely correspond to the underlying mechanism that generated the data. In conclusion, establishing mathematical links among modeling frameworks represents a novel, powerful strategy to pinpoint previously unrecognized model limitations and strengths.
Collapse
Affiliation(s)
- Stephanie J Spielman
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute of Cellular and Molecular Biology, The University of Texas at Austin
| | - Claus O Wilke
- Department of Integrative Biology, Center for Computational Biology and Bioinformatics, and Institute of Cellular and Molecular Biology, The University of Texas at Austin
| |
Collapse
|
123
|
Corbett-Detig RB, Hartl DL, Sackton TB. Natural selection constrains neutral diversity across a wide range of species. PLoS Biol 2015; 13:e1002112. [PMID: 25859758 PMCID: PMC4393120 DOI: 10.1371/journal.pbio.1002112] [Citation(s) in RCA: 197] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Accepted: 02/20/2015] [Indexed: 11/19/2022] Open
Abstract
The neutral theory of molecular evolution predicts that the amount of neutral polymorphisms within a species will increase proportionally with the census population size (Nc). However, this prediction has not been borne out in practice: while the range of Nc spans many orders of magnitude, levels of genetic diversity within species fall in a comparatively narrow range. Although theoretical arguments have invoked the increased efficacy of natural selection in larger populations to explain this discrepancy, few direct empirical tests of this hypothesis have been conducted. In this work, we provide a direct test of this hypothesis using population genomic data from a wide range of taxonomically diverse species. To do this, we relied on the fact that the impact of natural selection on linked neutral diversity depends on the local recombinational environment. In regions of relatively low recombination, selected variants affect more neutral sites through linkage, and the resulting correlation between recombination and polymorphism allows a quantitative assessment of the magnitude of the impact of selection on linked neutral diversity. By comparing whole genome polymorphism data and genetic maps using a coalescent modeling framework, we estimate the degree to which natural selection reduces linked neutral diversity for 40 species of obligately sexual eukaryotes. We then show that the magnitude of the impact of natural selection is positively correlated with Nc, based on body size and species range as proxies for census population size. These results demonstrate that natural selection removes more variation at linked neutral sites in species with large Nc than those with small Nc and provides direct empirical evidence that natural selection constrains levels of neutral genetic diversity across many species. This implies that natural selection may provide an explanation for this longstanding paradox of population genetics.
Collapse
Affiliation(s)
- Russell B. Corbett-Detig
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge Massachusetts, United States of America
- Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Daniel L. Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge Massachusetts, United States of America
| | - Timothy B. Sackton
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge Massachusetts, United States of America
| |
Collapse
|
124
|
Solà E, Álvarez-Presas M, Frías-López C, Littlewood DTJ, Rozas J, Riutort M. Evolutionary analysis of mitogenomes from parasitic and free-living flatworms. PLoS One 2015; 10:e0120081. [PMID: 25793530 PMCID: PMC4368550 DOI: 10.1371/journal.pone.0120081] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Accepted: 01/19/2015] [Indexed: 11/23/2022] Open
Abstract
Mitochondrial genomes (mitogenomes) are useful and relatively accessible sources of molecular data to explore and understand the evolutionary history and relationships of eukaryotic organisms across diverse taxonomic levels. The availability of complete mitogenomes from Platyhelminthes is limited; of the 40 or so published most are from parasitic flatworms (Neodermata). Here, we present the mitogenomes of two free-living flatworms (Tricladida): the complete genome of the freshwater species Crenobia alpina (Planariidae) and a nearly complete genome of the land planarian Obama sp. (Geoplanidae). Moreover, we have reanotated the published mitogenome of the species Dugesia japonica (Dugesiidae). This contribution almost doubles the total number of mtDNAs published for Tricladida, a species-rich group including model organisms and economically important invasive species. We took the opportunity to conduct comparative mitogenomic analyses between available free-living and selected parasitic flatworms in order to gain insights into the putative effect of life cycle on nucleotide composition through mutation and natural selection. Unexpectedly, we did not find any molecular hallmark of a selective relaxation in mitogenomes of parasitic flatworms; on the contrary, three out of the four studied free-living triclad mitogenomes exhibit higher A+T content and selective relaxation levels. Additionally, we provide new and valuable molecular data to develop markers for future phylogenetic studies on planariids and geoplanids.
Collapse
Affiliation(s)
- Eduard Solà
- Institut de Recerca de la Biodiversitat and Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Catalonia, Spain
| | - Marta Álvarez-Presas
- Institut de Recerca de la Biodiversitat and Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Catalonia, Spain
| | - Cristina Frías-López
- Institut de Recerca de la Biodiversitat and Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Catalonia, Spain
| | | | - Julio Rozas
- Institut de Recerca de la Biodiversitat and Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Catalonia, Spain
| | - Marta Riutort
- Institut de Recerca de la Biodiversitat and Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Catalonia, Spain
- * E-mail: (MR)
| |
Collapse
|
125
|
Wu X, Hurst LD. Why Selection Might Be Stronger When Populations Are Small: Intron Size and Density Predict within and between-Species Usage of Exonic Splice Associated cis-Motifs. Mol Biol Evol 2015; 32:1847-61. [PMID: 25771198 PMCID: PMC4476162 DOI: 10.1093/molbev/msv069] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The nearly neutral theory predicts that small effective population size provides the conditions for weakened selection. This is postulated to explain why our genome is more “bloated” than that of, for example, yeast, ours having large introns and large intergene spacer. If a bloated genome is also an error prone genome might it, however, be the case that selection for error-mitigating properties is stronger in our genome? We examine this notion using splicing as an exemplar, not least because large introns can predispose to noisy splicing. We thus ask whether, owing to genomic decay, selection for splice error-control mechanisms is stronger, not weaker, in species with large introns and small populations. In humans much information defining splice sites is in cis-exonic motifs, most notably exonic splice enhancers (ESEs). These act as splice-error control elements. Here then we ask whether within and between-species intron size is a predictor of the commonality of exonic cis-splicing motifs. We show that, as predicted, the proportion of synonymous sites that are ESE-associated and under selection in humans is weakly positively correlated with the size of the flanking intron. In a phylogenetically controlled framework, we observe, also as expected, that mean intron size is both predicted by Ne.μ and is a good predictor of cis-motif usage across species, this usage coevolving with splice site definition. Unexpectedly, however, across taxa intron density is a better predictor of cis-motif usage than intron size. We propose that selection for splice-related motifs is driven by a need to avoid decoy splice sites that will be more common in genes with many and large introns. That intron number and density predict ESE usage within human genes is consistent with this, as is the finding of intragenic heterogeneity in ESE density. As intronic content and splice site usage across species is also well predicted by Ne.μ, the result also suggests an unusual circumstance in which selection (for cis-modifiers of splicing) might be stronger when population sizes are smaller, as here splicing is noisier, resulting in a greater need to control error-prone splicing.
Collapse
Affiliation(s)
- XianMing Wu
- Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
| | - Laurence D Hurst
- Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
| |
Collapse
|
126
|
Garud NR, Messer PW, Buzbas EO, Petrov DA. Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps. PLoS Genet 2015; 11:e1005004. [PMID: 25706129 PMCID: PMC4338236 DOI: 10.1371/journal.pgen.1005004] [Citation(s) in RCA: 257] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 01/14/2015] [Indexed: 11/18/2022] Open
Abstract
Adaptation from standing genetic variation or recurrent de novo mutation in large populations should commonly generate soft rather than hard selective sweeps. In contrast to a hard selective sweep, in which a single adaptive haplotype rises to high population frequency, in a soft selective sweep multiple adaptive haplotypes sweep through the population simultaneously, producing distinct patterns of genetic variation in the vicinity of the adaptive site. Current statistical methods were expressly designed to detect hard sweeps and most lack power to detect soft sweeps. This is particularly unfortunate for the study of adaptation in species such as Drosophila melanogaster, where all three confirmed cases of recent adaptation resulted in soft selective sweeps and where there is evidence that the effective population size relevant for recent and strong adaptation is large enough to generate soft sweeps even when adaptation requires mutation at a specific single site at a locus. Here, we develop a statistical test based on a measure of haplotype homozygosity (H12) that is capable of detecting both hard and soft sweeps with similar power. We use H12 to identify multiple genomic regions that have undergone recent and strong adaptation in a large population sample of fully sequenced Drosophila melanogaster strains from the Drosophila Genetic Reference Panel (DGRP). Visual inspection of the top 50 candidates reveals that in all cases multiple haplotypes are present at high frequencies, consistent with signatures of soft sweeps. We further develop a second haplotype homozygosity statistic (H2/H1) that, in combination with H12, is capable of differentiating hard from soft sweeps. Surprisingly, we find that the H12 and H2/H1 values for all top 50 peaks are much more easily generated by soft rather than hard sweeps. We discuss the implications of these results for the study of adaptation in Drosophila and in species with large census population sizes. Evolutionary adaptation is a process in which beneficial mutations increase in frequency in response to selective pressures. If these mutations were previously rare or absent from the population, adaptation should generate a characteristic signature in the genetic diversity around the adaptive locus, known as a selective sweep. Such selective sweeps can be distinguished into hard selective sweeps, where only a single adaptive mutation rises in frequency, or soft selective sweeps, where multiple adaptive mutations at the same locus sweep through the population simultaneously. Here we design a new statistical method that can identify both hard and soft sweeps in population genomic data and apply this method to a Drosophila melanogaster population genomic dataset consisting of 145 sequenced strains collected in North Carolina. We find that selective sweeps were abundant in the recent history of this population. Interestingly, we also find that practically all of the strongest and most recent sweeps show patterns that are more consistent with soft rather than hard sweeps. We discuss the implications of these findings for the discovery and quantification of adaptation from population genomic data in Drosophila and other species with large population sizes.
Collapse
Affiliation(s)
- Nandita R. Garud
- Department of Genetics, Stanford University, Stanford, California, United States of America
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (NRG); (DAP)
| | - Philipp W. Messer
- Department of Biology, Stanford University, Stanford, California, United States of America
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York, United States of America
| | - Erkan O. Buzbas
- Department of Biology, Stanford University, Stanford, California, United States of America
- Department of Statistical Science, University of Idaho, Moscow, Idaho, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail: (NRG); (DAP)
| |
Collapse
|
127
|
Zaborske JM, Bauer DuMont VL, Wallace EWJ, Pan T, Aquadro CF, Drummond DA. A nutrient-driven tRNA modification alters translational fidelity and genome-wide protein coding across an animal genus. PLoS Biol 2014; 12:e1002015. [PMID: 25489848 PMCID: PMC4260829 DOI: 10.1371/journal.pbio.1002015] [Citation(s) in RCA: 82] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 10/22/2014] [Indexed: 11/19/2022] Open
Abstract
Use of the nutrient queuine to modify tRNA anticodons can change the accuracy of certain codons during protein synthesis, resulting in evolutionary recoding of fruit fly genomes. Natural selection favors efficient expression of encoded proteins, but the causes, mechanisms, and fitness consequences of evolved coding changes remain an area of aggressive inquiry. We report a large-scale reversal in the relative translational accuracy of codons across 12 fly species in the Drosophila/Sophophora genus. Because the reversal involves pairs of codons that are read by the same genomically encoded tRNAs, we hypothesize, and show by direct measurement, that a tRNA anticodon modification from guanosine to queuosine has coevolved with these genomic changes. Queuosine modification is present in most organisms but its function remains unclear. Modification levels vary across developmental stages in D. melanogaster, and, consistent with a causal effect, genes maximally expressed at each stage display selection for codons that are most accurate given stage-specific queuosine modification levels. In a kinetic model, the known increased affinity of queuosine-modified tRNA for ribosomes increases the accuracy of cognate codons while reducing the accuracy of near-cognate codons. Levels of queuosine modification in D. melanogaster reflect bioavailability of the precursor queuine, which eukaryotes scavenge from the tRNAs of bacteria and absorb in the gut. These results reveal a strikingly direct mechanism by which recoding of entire genomes results from changes in utilization of a nutrient. Ribosomes translate mRNA into protein using tRNAs, and these tRNAs often translate multiple synonymous codons. Although synonymous codons specify the same amino acid, tRNAs read codons with differing speed and accuracy, and so some codons may be more accurately translated than their synonyms. Such variation in the efficiency of translation between synonymous codons can result in costs to cellular fitness. By favoring certain coding choices over evolutionary timescales, natural selection leaves signs of pressure for translational fidelity on evolved genomes. We have found that the way in which proteins are encoded has changed systematically across several closely related fruit fly species. Surprisingly, several of these changes involve two codons both read by the same tRNA. Here we confirm experimentally that the anticodons of these tRNAs are chemically modified—from guanine to queuosine—in vivo, and that the levels of this modification in different species track the differences in protein coding. Furthermore, queuosine modification levels are known to change during fruit fly development, and we find that genes expressed maximally during a given developmental stage have codings reflecting levels of modification at that stage. Remarkably, queuosine modification depends upon acquisition of its precursor, queuine, as a nutrient that eukaryotes must obtain from bacteria through the gut. We have thus elucidated a mechanism by which availability of a nutrient can shape the coding patterns of whole genomes.
Collapse
Affiliation(s)
- John M. Zaborske
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Vanessa L. Bauer DuMont
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - Edward W. J. Wallace
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Tao Pan
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
| | - Charles F. Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, United States of America
| | - D. Allan Drummond
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois, United States of America
- Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- * E-mail:
| |
Collapse
|
128
|
Sedeek KEM, Scopece G, Staedler YM, Schönenberger J, Cozzolino S, Schiestl FP, Schlüter PM. Genic rather than genome‐wide differences between sexually deceptive
O
phrys
orchids with different pollinators. Mol Ecol 2014; 23:6192-205. [DOI: 10.1111/mec.12992] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2014] [Revised: 10/25/2014] [Accepted: 10/29/2014] [Indexed: 01/12/2023]
Affiliation(s)
- Khalid E. M. Sedeek
- Institute of Systematic Botany University of Zurich Zollikerstr. 107 CH‐8008 Zurich Switzerland
| | - Giovanni Scopece
- Department of Biology University of Naples Federico II Complesso Universitario MSA Via Cinthia I‐80126 Naples Italy
| | - Yannick M. Staedler
- Department of Botany and Biodiversity Research University of Vienna Rennweg 14 A‐1030 Vienna Austria
| | - Jürg Schönenberger
- Department of Botany and Biodiversity Research University of Vienna Rennweg 14 A‐1030 Vienna Austria
| | - Salvatore Cozzolino
- Department of Biology University of Naples Federico II Complesso Universitario MSA Via Cinthia I‐80126 Naples Italy
| | - Florian P. Schiestl
- Institute of Systematic Botany University of Zurich Zollikerstr. 107 CH‐8008 Zurich Switzerland
| | - Philipp M. Schlüter
- Institute of Systematic Botany University of Zurich Zollikerstr. 107 CH‐8008 Zurich Switzerland
| |
Collapse
|
129
|
Bergland AO, Behrman EL, O'Brien KR, Schmidt PS, Petrov DA. Genomic evidence of rapid and stable adaptive oscillations over seasonal time scales in Drosophila. PLoS Genet 2014; 10:e1004775. [PMID: 25375361 PMCID: PMC4222749 DOI: 10.1371/journal.pgen.1004775] [Citation(s) in RCA: 329] [Impact Index Per Article: 32.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2014] [Accepted: 09/24/2014] [Indexed: 01/06/2023] Open
Abstract
In many species, genomic data have revealed pervasive adaptive evolution indicated by the fixation of beneficial alleles. However, when selection pressures are highly variable along a species' range or through time adaptive alleles may persist at intermediate frequencies for long periods. So called “balanced polymorphisms” have long been understood to be an important component of standing genetic variation, yet direct evidence of the strength of balancing selection and the stability and prevalence of balanced polymorphisms has remained elusive. We hypothesized that environmental fluctuations among seasons in a North American orchard would impose temporally variable selection on Drosophila melanogaster that would drive repeatable adaptive oscillations at balanced polymorphisms. We identified hundreds of polymorphisms whose frequency oscillates among seasons and argue that these loci are subject to strong, temporally variable selection. We show that these polymorphisms respond to acute and persistent changes in climate and are associated in predictable ways with seasonally variable phenotypes. In addition, our results suggest that adaptively oscillating polymorphisms are likely millions of years old, with some possibly predating the divergence between D. melanogaster and D. simulans. Taken together, our results are consistent with a model of balancing selection wherein rapid temporal fluctuations in climate over generational time promotes adaptive genetic diversity at loci underlying polygenic variation in fitness related phenotypes. Herein, we investigate the genomic basis of rapid adaptive evolution in response to seasonal fluctuations in the environment. We identify hundreds of polymorphisms (seasonal SNPs) that undergo dramatic shifts in allele frequency – on average between 40 and 60% – and oscillate between seasons repeatedly over multiple years, likely inducing high levels of genome-wide genetic differentiation. We provide evidence that seasonal SNPs are functional, being both sensitive to an acute frost event and associated with two stress tolerance traits. Finally, we show that some seasonal SNPs are possibly ancient balanced polymorphisms. Taken together, our results suggest that environmental heterogeneity can promote the long-term persistence of functional polymorphisms within populations that fuels fast directional adaptive response at any one time.
Collapse
Affiliation(s)
- Alan O. Bergland
- Department of Biology, Stanford University, Stanford, California, United States of America
- * E-mail:
| | - Emily L. Behrman
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Katherine R. O'Brien
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Paul S. Schmidt
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, California, United States of America
| |
Collapse
|
130
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022] Open
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Collapse
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| |
Collapse
|
131
|
Babbitt GA, Alawad MA, Schulze KV, Hudson AO. Synonymous codon bias and functional constraint on GC3-related DNA backbone dynamics in the prokaryotic nucleoid. Nucleic Acids Res 2014; 42:10915-26. [PMID: 25200075 PMCID: PMC4176184 DOI: 10.1093/nar/gku811] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
While mRNA stability has been demonstrated to control rates of translation, generating both global and local synonymous codon biases in many unicellular organisms, this explanation cannot adequately explain why codon bias strongly tracks neighboring intergene GC content; suggesting that structural dynamics of DNA might also influence codon choice. Because minor groove width is highly governed by 3-base periodicity in GC, the existence of triplet-based codons might imply a functional role for the optimization of local DNA molecular dynamics via GC content at synonymous sites (≈GC3). We confirm a strong association between GC3-related intrinsic DNA flexibility and codon bias across 24 different prokaryotic multiple whole-genome alignments. We develop a novel test of natural selection targeting synonymous sites and demonstrate that GC3-related DNA backbone dynamics have been subject to moderate selective pressure, perhaps contributing to our observation that many genes possess extreme DNA backbone dynamics for their given protein space. This dual function of codons may impose universal functional constraints affecting the evolution of synonymous and non-synonymous sites. We propose that synonymous sites may have evolved as an 'accessory' during an early expansion of a primordial genetic code, allowing for multiplexed protein coding and structural dynamic information within the same molecular context.
Collapse
Affiliation(s)
- Gregory A Babbitt
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| | - Mohammed A Alawad
- B. Thomas Golisano College of Computing and Information Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| | - Katharina V Schulze
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston TX, USA 77030
| | - André O Hudson
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, Rochester NY, USA 14623
| |
Collapse
|
132
|
Wnt pathway activation increases hypoxia tolerance during development. PLoS One 2014; 9:e103292. [PMID: 25093834 PMCID: PMC4122365 DOI: 10.1371/journal.pone.0103292] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2014] [Accepted: 06/27/2014] [Indexed: 11/19/2022] Open
Abstract
Adaptation to hypoxia, defined as a condition of inadequate oxygen supply, has enabled humans to successfully colonize high altitude regions. The mechanisms attempted by organisms to cope with short-term hypoxia include increased ATP production via anaerobic respiration and stabilization of Hypoxia Inducible Factor 1α (HIF-1α). However, less is known about the means through which populations adapt to chronic hypoxia during the process of development within a life time or over generations. Here we show that signaling via the highly conserved Wnt pathway impacts the ability of Drosophila melanogaster to complete its life cycle under hypoxia. We identify this pathway through analyses of genome sequencing and gene expression of a Drosophila melanogaster population adapted over >180 generations to tolerate a concentration of 3.5-4% O2 in air. We then show that genetic activation of the Wnt canonical pathway leads to increased rates of adult eclosion in low O2. Our results indicate that a previously unsuspected major developmental pathway, Wnt, plays a significant role in hypoxia tolerance.
Collapse
|
133
|
Arnold CD, Gerlach D, Spies D, Matts JA, Sytnikova YA, Pagani M, Lau NC, Stark A. Quantitative genome-wide enhancer activity maps for five Drosophila species show functional enhancer conservation and turnover during cis-regulatory evolution. Nat Genet 2014; 46:685-92. [PMID: 24908250 PMCID: PMC4250274 DOI: 10.1038/ng.3009] [Citation(s) in RCA: 115] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2014] [Accepted: 05/15/2014] [Indexed: 12/14/2022]
Abstract
Phenotypic differences between closely related species are thought to arise primarily from changes in gene expression due to mutations in cis-regulatory sequences (enhancers). However, it has remained unclear how frequently mutations alter enhancer activity or create functional enhancers de novo. Here we use STARR-seq, a recently developed quantitative enhancer assay, to determine genome-wide enhancer activity profiles for five Drosophila species in the constant trans-regulatory environment of Drosophila melanogaster S2 cells. We find that the functions of a large fraction of D. melanogaster enhancers are conserved for their orthologous sequences owing to selection and stabilizing turnover of transcription factor motifs. Moreover, hundreds of enhancers have been gained since the D. melanogaster-Drosophila yakuba split about 11 million years ago without apparent adaptive selection and can contribute to changes in gene expression in vivo. Our finding that enhancer activity is often deeply conserved and frequently gained provides functional insights into regulatory evolution.
Collapse
Affiliation(s)
- Cosmas D Arnold
- 1] Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria. [2]
| | - Daniel Gerlach
- 1] Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria. [2] [3]
| | - Daniel Spies
- Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria
| | - Jessica A Matts
- 1] Department of Biology, Brandeis University, Waltham, Massachusetts, USA. [2] Rosenstiel Basic Medical Science Research Center at Brandeis University, Waltham, Massachusetts, USA. [3]
| | - Yuliya A Sytnikova
- 1] Department of Biology, Brandeis University, Waltham, Massachusetts, USA. [2] Rosenstiel Basic Medical Science Research Center at Brandeis University, Waltham, Massachusetts, USA
| | - Michaela Pagani
- Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria
| | - Nelson C Lau
- 1] Department of Biology, Brandeis University, Waltham, Massachusetts, USA. [2] Rosenstiel Basic Medical Science Research Center at Brandeis University, Waltham, Massachusetts, USA
| | - Alexander Stark
- Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria
| |
Collapse
|
134
|
Singh ND, Koerich LB, Carvalho AB, Clark AG. Positive and purifying selection on the Drosophila Y chromosome. Mol Biol Evol 2014; 31:2612-23. [PMID: 24974375 DOI: 10.1093/molbev/msu203] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Y chromosomes, with their reduced effective population size, lack of recombination, and male-limited transmission, present a unique collection of constraints for the operation of natural selection. Male-limited transmission may greatly increase the efficacy of selection for male-beneficial mutations, but the reduced effective size also inflates the role of random genetic drift. Together, these defining features of the Y chromosome are expected to influence rates and patterns of molecular evolution on the Y as compared with X-linked or autosomal loci. Here, we use sequence data from 11 genes in 9 Drosophila species to gain insight into the efficacy of natural selection on the Drosophila Y relative to the rest of the genome. Drosophila is an ideal system for assessing the consequences of Y-linkage for molecular evolution in part because the gene content of Drosophila Y chromosomes is highly dynamic, with orthologous genes being Y-linked in some species whereas autosomal in others. Our results confirm the expectation that the efficacy of natural selection at weakly selected sites is reduced on the Y chromosome. In contrast, purifying selection on the Y chromosome for strongly deleterious mutations does not appear to be compromised. Finally, we find evidence of recurrent positive selection for 4 of the 11 genes studied here. Our results thus highlight the variable nature of the mode and impact of natural selection on the Drosophila Y chromosome.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Biological Sciences, North Carolina State University
| | - Leonardo B Koerich
- Departamento de Genética, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | | | - Andrew G Clark
- Department of Molecular Biology and Genetics, Cornell University
| |
Collapse
|
135
|
Adaptive synonymous mutations in an experimentally evolved Pseudomonas fluorescens population. Nat Commun 2014; 5:4076. [PMID: 24912567 DOI: 10.1038/ncomms5076] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 05/08/2014] [Indexed: 01/22/2023] Open
Abstract
Conventional wisdom holds that synonymous mutations, nucleotide changes that do not alter the encoded amino acid, have no detectable effect on phenotype or fitness. However, a growing body of evidence from both comparative and experimental studies suggests otherwise. Synonymous mutations have been shown to impact gene expression, protein folding and fitness, however, direct evidence that they can be positively selected, and so contribute to adaptation, is lacking. Here we report the recovery of two beneficial synonymous single base pair changes that arose spontaneously and independently in an experimentally evolved population of Pseudomonas fluorescens. We show experimentally that these mutations increase fitness by an amount comparable to non-synonymous mutations and that the fitness increases stem from increased gene expression. These results provide unequivocal evidence that synonymous mutations can drive adaptive evolution and suggest that this class of mutation may be underappreciated as a cause of adaptation and evolutionary dynamics.
Collapse
|
136
|
Abstract
The rates and properties of new mutations affecting fitness have implications for a number of outstanding questions in evolutionary biology. Obtaining estimates of mutation rates and effects has historically been challenging, and little theory has been available for predicting the distribution of fitness effects (DFE); however, there have been recent advances on both fronts. Extreme-value theory predicts the DFE of beneficial mutations in well-adapted populations, while phenotypic fitness landscape models make predictions for the DFE of all mutations as a function of the initial level of adaptation and the strength of stabilizing selection on traits underlying fitness. Direct experimental evidence confirms predictions on the DFE of beneficial mutations and favors distributions that are roughly exponential but bounded on the right. A growing number of studies infer the DFE using genomic patterns of polymorphism and divergence, recovering a wide range of DFE. Future work should be aimed at identifying factors driving the observed variation in the DFE. We emphasize the need for further theory explicitly incorporating the effects of partial pleiotropy and heterogeneity in the environment on the expected DFE.
Collapse
Affiliation(s)
- Thomas Bataillon
- Bioinformatics Research Center, Aarhus University, Aarhus, Denmark
| | | |
Collapse
|
137
|
Abstract
Evolutionary conservation has been an accurate predictor of functional elements across the first decade of metazoan genomics. More recently, there has been a move to define functional elements instead from biochemical annotations. Evolutionary methods are, however, more comprehensive than biochemical approaches can be and can assess quantitatively, especially for subtle effects, how biologically important--how injurious after mutation--different types of elements are. Evolutionary methods are thus critical for understanding the large fraction (up to 10%) of the human genome that does not encode proteins and yet might convey function. These methods can also capture the ephemeral nature of much noncoding functional sequence, with large numbers of functional elements having been gained and lost rapidly along each mammalian lineage. Here, we review how different strengths of purifying selection have impacted on protein-coding and non-protein-coding loci and on transcription factor binding sites in mammalian and fruit fly genomes.
Collapse
Affiliation(s)
- Wilfried Haerty
- MRC Functional Genomics Unit, Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford OX1 3PT, United Kingdom; ,
| | | |
Collapse
|
138
|
Lawrie DS, Petrov DA. Comparative population genomics: power and principles for the inference of functionality. Trends Genet 2014; 30:133-9. [PMID: 24656563 DOI: 10.1016/j.tig.2014.02.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 01/31/2014] [Accepted: 02/06/2014] [Indexed: 11/19/2022]
Abstract
The availability of sequenced genomes from multiple related organisms allows the detection and localization of functional genomic elements based on the idea that such elements evolve more slowly than neutral sequences. Although such comparative genomics methods have proven useful in discovering functional elements and ascertaining levels of functional constraint in the genome as a whole, here we outline limitations intrinsic to this approach that cannot be overcome by sequencing more species. We argue that it is essential to supplement comparative genomics with ultra-deep sampling of populations from closely related species to enable substantially more powerful genomic scans for functional elements. The convergence of sequencing technology and population genetics theory has made such projects feasible and has exciting implications for functional genomics.
Collapse
Affiliation(s)
- David S Lawrie
- Department of Genetics, Stanford University, Stanford, CA, USA; Department of Biology, Stanford University, Stanford, CA, USA.
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
139
|
Cancer evolution is associated with pervasive positive selection on globally expressed genes. PLoS Genet 2014; 10:e1004239. [PMID: 24603726 PMCID: PMC3945297 DOI: 10.1371/journal.pgen.1004239] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Accepted: 01/29/2014] [Indexed: 12/22/2022] Open
Abstract
Cancer is an evolutionary process in which cells acquire new transformative, proliferative and metastatic capabilities. A full understanding of cancer requires learning the dynamics of the cancer evolutionary process. We present here a large-scale analysis of the dynamics of this evolutionary process within tumors, with a focus on breast cancer. We show that the cancer evolutionary process differs greatly from organismal (germline) evolution. Organismal evolution is dominated by purifying selection (that removes mutations that are harmful to fitness). In contrast, in the cancer evolutionary process the dominance of purifying selection is much reduced, allowing for a much easier detection of the signals of positive selection (adaptation). We further show that, as a group, genes that are globally expressed across human tissues show a very strong signal of positive selection within tumors. Indeed, known cancer genes are enriched for global expression patterns. Yet, positive selection is prevalent even on globally expressed genes that have not yet been associated with cancer, suggesting that globally expressed genes are enriched for yet undiscovered cancer related functions. We find that the increased positive selection on globally expressed genes within tumors is not due to their expression in the tissue relevant to the cancer. Rather, such increased adaptation is likely due to globally expressed genes being enriched in important housekeeping and essential functions. Thus, our results suggest that tumor adaptation is most often mediated through somatic changes to those genes that are important for the most basic cellular functions. Together, our analysis reveals the uniqueness of the cancer evolutionary process and the particular importance of globally expressed genes in driving cancer initiation and progression.
Collapse
|
140
|
Tobler R, Franssen SU, Kofler R, Orozco-terWengel P, Nolte V, Hermisson J, Schlötterer C. Massive habitat-specific genomic response in D. melanogaster populations during experimental evolution in hot and cold environments. Mol Biol Evol 2014; 31:364-75. [PMID: 24150039 PMCID: PMC3907058 DOI: 10.1093/molbev/mst205] [Citation(s) in RCA: 124] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Experimental evolution in combination with whole-genome sequencing (evolve and resequence [E&R]) is a promising approach to define the genotype-phenotype map and to understand adaptation in evolving populations. Many previous studies have identified a large number of putative selected sites (i.e., candidate loci), but it remains unclear to what extent these loci are genuine targets of selection or experimental noise. To address this question, we exposed the same founder population to two different selection regimes-a hot environment and a cold environment-and quantified the genomic response in each. We detected large numbers of putative selected loci in both environments, albeit with little overlap between the two sets of candidates, indicating that most resulted from habitat-specific selection. By quantifying changes across multiple independent biological replicates, we demonstrate that most of the candidate SNPs were false positives that were linked to selected sites over distances much larger than the typical linkage disequilibrium range of Drosophila melanogaster. We show that many of these mid- to long-range associations were attributable to large segregating inversions and confirm by computer simulations that such patterns could be readily replicated when strong selection acts on rare haplotypes. In light of our findings, we outline recommendations to improve the performance of future Drosophila E&R studies which include using species with negligible inversion loads, such as D. mauritiana and D. simulans, instead of D. melanogaster.
Collapse
Affiliation(s)
- Ray Tobler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | | | - Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | | | - Viola Nolte
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | - Joachim Hermisson
- Mathematics and Biosciences Group, Department of Mathematics, University of Vienna, Vienna, Austria
- Max F. Perutz Laboratories, Vienna, Austria
| | | |
Collapse
|
141
|
Campos JL, Halligan DL, Haddrill PR, Charlesworth B. The relation between recombination rate and patterns of molecular evolution and variation in Drosophila melanogaster. Mol Biol Evol 2014; 31:1010-28. [PMID: 24489114 PMCID: PMC3969569 DOI: 10.1093/molbev/msu056] [Citation(s) in RCA: 123] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Genetic recombination associated with sexual reproduction increases the efficiency of natural selection by reducing the strength of Hill–Robertson interference. Such interference can be caused either by selective sweeps of positively selected alleles or by background selection (BGS) against deleterious mutations. Its consequences can be studied by comparing patterns of molecular evolution and variation in genomic regions with different rates of crossing over. We carried out a comprehensive study of the benefits of recombination in Drosophila melanogaster, both by contrasting five independent genomic regions that lack crossing over with the rest of the genome and by comparing regions with different rates of crossing over, using data on DNA sequence polymorphisms from an African population that is geographically close to the putatively ancestral population for the species, and on sequence divergence from a related species. We observed reductions in sequence diversity in noncrossover (NC) regions that are inconsistent with the effects of hard selective sweeps in the absence of recombination. Overall, the observed patterns suggest that the recombination rate experienced by a gene is positively related to an increase in the efficiency of both positive and purifying selection. The results are consistent with a BGS model with interference among selected sites in NC regions, and joint effects of BGS, selective sweeps, and a past population expansion on variability in regions of the genome that experience crossing over. In such crossover regions, the X chromosome exhibits a higher rate of adaptive protein sequence evolution than the autosomes, implying a Faster-X effect.
Collapse
Affiliation(s)
- José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | | | | | | |
Collapse
|
142
|
Fung KL, Pan J, Ohnuma S, Lund PE, Pixley JN, Kimchi-Sarfaty C, Ambudkar SV, Gottesman MM. MDR1 synonymous polymorphisms alter transporter specificity and protein stability in a stable epithelial monolayer. Cancer Res 2013; 74:598-608. [PMID: 24305879 DOI: 10.1158/0008-5472.can-13-2064] [Citation(s) in RCA: 94] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The drug efflux function of P-glycoprotein (P-gp) encoded by MDR1 can be influenced by genetic polymorphisms, including two synonymous changes in the coding region of MDR1. Here we report that the conformation of P-gp and its drug efflux activity can be altered by synonymous polymorphisms in stable epithelial monolayers expressing P-gp. Several cell lines with similar MDR1 DNA copy number were developed and termed LLC-MDR1-WT (expresses wild-type P-gp), LLC-MDR1-3H (expresses common haplotype P-gp), and LLC-MDR1-3HA (a mutant that carries a different valine codon in position 3435). These cell lines express similar levels of recombinant mRNA and protein. P-gp in each case is localized on the apical surface of polarized cells. However, the haplotype and its mutant P-gps fold differently from the wild-type, as determined by UIC2 antibody shift assays and limited proteolysis assays. Surface biotinylation experiments suggest that the non-wild-type P-gps have longer recycling times. Drug transport assays show that wild-type and haplotype P-gp respond differently to P-gp inhibitors that block efflux of rhodamine 123 or mitoxantrone. In addition, cytotoxicity assays show that the LLC-MDR1-3H cells are more resistant to mitoxantrone than the LLC-MDR1-WT cells after being treated with a P-gp inhibitor. Expression of polymorphic P-gp, however, does not affect the host cell's morphology, growth rate, or monolayer formation. Also, ATPase activity assays indicate that neither basal nor drug-stimulated ATPase activities are affected in the variant P-gps. Taken together, our findings indicate that "silent" polymorphisms significantly change P-gp function, which would be expected to affect interindividual drug disposition and response.
Collapse
Affiliation(s)
- King Leung Fung
- Authors' Affiliations: Laboratory of Cell Biology, Center for Cancer Research, National Cancer Institute, NIH; and Center for Biologics Evaluation and Research, Division of Hematology, Food and Drug Administration, Bethesda, Maryland
| | | | | | | | | | | | | | | |
Collapse
|
143
|
Forsdyke DR. Implications of HIV RNA structure for recombination, speciation, and the neutralism-selectionism controversy. Microbes Infect 2013; 16:96-103. [PMID: 24211872 DOI: 10.1016/j.micinf.2013.10.017] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2013] [Revised: 10/24/2013] [Accepted: 10/24/2013] [Indexed: 11/29/2022]
Abstract
The conflict between the needs to encode both a protein (impaired by non-synonymous mutation), and nucleic acid structure (impaired by synonymous or non-synonymous mutation), can sometimes be resolved in favour of the nucleic acid because its structure is critical for a selectively advantageous genome-wide activity--recombination. However, above a sequence difference threshold, recombination is impaired. It may then be advantageous for new species to arise. Building on the work of Grantham and others critical of the neutralist viewpoint, heuristic support for this hypothesis emerged from studies of the base composition and structure of retroviral genomes. The extreme enrichment in the purine A of the RNA of human immunodeficiency virus (HIV-1), parallels the mild purine-loading of the RNAs of most organisms, for which there is an adaptive explanation--immune evasion. However, human T cell leukaemia virus (HTLV-1), with the potential to invade the same host cell, shows extreme enrichment in the pyrimidine C. Assuming the low GC% HIV and the high GC% HTLV-1 to share a common ancestor, it was postulated that differences in GC% had arisen to prevent homologous recombination between these emerging lentiviral species. Sympatrically isolated by this intracellular reproductive barrier, prototypic HIV-1 seized the AU-rich (low GC%) high ground (thus committing to purine A rather than purine G). Prototypic HTLV-1 forwent this advantage and evolved an independent evolutionary strategy--similar to that of the GC%-rich Epstein-Barr virus--profound latency maintained by transcription of one purine-rich mRNA. The evidence supporting these interpretations is reviewed.
Collapse
Affiliation(s)
- Donald R Forsdyke
- Department of Biomedical and Molecular Sciences, Queen's University, Kingston, ON K7L3N6, Canada.
| |
Collapse
|
144
|
Smith JD, McManus KF, Fraser HB. A novel test for selection on cis-regulatory elements reveals positive and negative selection acting on mammalian transcriptional enhancers. Mol Biol Evol 2013; 30:2509-18. [PMID: 23904330 DOI: 10.1093/molbev/mst134] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Measuring natural selection on genomic elements involved in the cis-regulation of gene expression--such as transcriptional enhancers and promoters--is critical for understanding the evolution of genomes, yet it remains a major challenge. Many studies have attempted to detect positive or negative selection in these noncoding elements by searching for those with the fastest or slowest rates of evolution, but this can be problematic. Here, we introduce a new approach to this issue, and demonstrate its utility on three mammalian transcriptional enhancers. Using results from saturation mutagenesis studies of these enhancers, we classified all possible point mutations as upregulating, downregulating, or silent, and determined which of these mutations have occurred on each branch of a phylogeny. Applying a framework analogous to Ka/Ks in protein-coding genes, we measured the strength of selection on upregulating and downregulating mutations, in specific branches as well as entire phylogenies. We discovered distinct modes of selection acting on different enhancers: although all three have experienced negative selection against downregulating mutations, the selection pressures on upregulating mutations vary. In one case, we detected positive selection for upregulation, whereas the other two had no detectable selection on upregulating mutations. Our methodology is applicable to the growing number of saturation mutagenesis data sets, and provides a detailed picture of the mode and strength of natural selection acting on cis-regulatory elements.
Collapse
|