1
|
McShea H, Weibel C, Wehbi S, Goodman P, James JE, Wheeler AL, Masel J. The effectiveness of selection in a species affects the direction of amino acid frequency evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.01.526552. [PMID: 38948853 PMCID: PMC11212923 DOI: 10.1101/2023.02.01.526552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Nearly neutral theory predicts that species with higher effective population size (N e ) are better able to purge slightly deleterious mutations. We compare evolution in high-N e vs. low-N e vertebrates to reveal which amino acid frequencies are subject to subtle selective preferences. We take three complementary approaches, two measuring flux and one measuring outcomes. First, we fit non-stationary substitution models of amino acid flux using maximum likelihood, comparing the high-N e clade of rodents and lagomorphs to its low-N e sister clade of primates and colugos. Second, we compare evolutionary outcomes across a wider range of vertebrates, via correlations between amino acid frequencies and N e . Third, we dissect the details of flux in human, chimpanzee, mouse, and rat, as scored by parsimony - this also enables comparison to a historical paper. All three methods agree on which amino acids are preferred under more effective selection. Preferred amino acids tend to be smaller, less costly to synthesize, and to promote intrinsic structural disorder. Parsimony-induced bias in the historical study produces an apparent reduction in structural disorder, perhaps driven by slightly deleterious substitutions. Within highly exchangeable pairs of amino acids, arginine is strongly preferred over lysine, and valine over isoleucine, consistent with more effective selection preferring a marginally larger free energy of folding. These two preferences match differences between thermophiles and mesophilic relatives. These results reveal the biophysical consequences of mutation-selection-drift balance, and demonstrate the utility of nearly neutral theory for understanding protein evolution.
Collapse
Affiliation(s)
- Hanon McShea
- Department of Earth System Science, Stanford University
| | - Catherine Weibel
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Applied Physics, Stanford University
| | - Sawsan Wehbi
- Graduate Interdisciplinary Program in Genetics, University of Arizona
| | | | - Jennifer E James
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Ecology and Genetics, Uppsala University
| | - Andrew L Wheeler
- Graduate Interdisciplinary Program in Genetics, University of Arizona
| | - Joanna Masel
- Department of Ecology & Evolutionary Biology, University of Arizona
| |
Collapse
|
2
|
Gupta MK, Vadde R. Next-generation development and application of codon model in evolution. Front Genet 2023; 14:1091575. [PMID: 36777719 PMCID: PMC9911445 DOI: 10.3389/fgene.2023.1091575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 01/17/2023] [Indexed: 01/28/2023] Open
Abstract
To date, numerous nucleotide, amino acid, and codon substitution models have been developed to estimate the evolutionary history of any sequence/organism in a more comprehensive way. Out of these three, the codon substitution model is the most powerful. These models have been utilized extensively to detect selective pressure on a protein, codon usage bias, ancestral reconstruction and phylogenetic reconstruction. However, due to more computational demanding, in comparison to nucleotide and amino acid substitution models, only a few studies have employed the codon substitution model to understand the heterogeneity of the evolutionary process in a genome-scale analysis. Hence, there is always a question of how to develop more robust but less computationally demanding codon substitution models to get more accurate results. In this review article, the authors attempted to understand the basis of the development of different types of codon-substitution models and how this information can be utilized to develop more robust but less computationally demanding codon substitution models. The codon substitution model enables to detect selection regime under which any gene or gene region is evolving, codon usage bias in any organism or tissue-specific region and phylogenetic relationship between different lineages more accurately than nucleotide and amino acid substitution models. Thus, in the near future, these codon models can be utilized in the field of conservation, breeding and medicine.
Collapse
|
3
|
Majeed A, Kaur H, Bhardwaj P. Selection constraints determine preference for A/U-ending codons in Taxus contorta. Genome 2020; 63:215-224. [PMID: 31986060 DOI: 10.1139/gen-2019-0165] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Unequal utilization of synonymous codons is a well-known phenomenon among living organisms. This phenomenon plays a major role in the enhancement of the accuracy and efficiency of translation. Gymnosperms are rarely paid attention in this aspect. Understanding the degree of and determining the forces influencing codon usage bias (CUB) in Taxus contorta, an endangered Himalayan gymnosperm, will prove useful in interpreting the evolutionary characteristics of this species. Using RNAseq data, 93 790 assembled transcripts were clustered into 32 701 unigenes. Around 13 061 full-length sequences were utilized for the analysis of CUB. Compositional properties showed that GC-content ranged from 28.76% to 65.22%, with an average value of 44.28%, suggesting an AT-rich genome. The mean effective number of codons (ENC) value revealed that CUB is not strong in T. contorta. The preferred codons tended to be A/U ending, whereas the avoided codons tended to be G/C ending. A P2 index of 0.54 and a Mutation Responsive Index (MRI) value of -0.02 in addition to the results revealed by the neutrality, ENC, and parity plots showed that natural selection is a predominating factor governing CUB. Mutational pressure, gene length, hydropathiciy, aromaticity, and nucleotide composition influence CUB weakly.
Collapse
Affiliation(s)
- Aasim Majeed
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| | - Harpreet Kaur
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| | - Pankaj Bhardwaj
- Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India.,Molecular Genetics Laboratory, Department of Botany, Central University of Punjab, City Campus, Mansa Road, Bathinda-151001, India
| |
Collapse
|
4
|
Whittle CA, Extavour CG. Selection shapes turnover and magnitude of sex-biased expression in Drosophila gonads. BMC Evol Biol 2019; 19:60. [PMID: 30786879 PMCID: PMC6383255 DOI: 10.1186/s12862-019-1377-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 01/23/2019] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Sex-biased gene expression is thought to drive the phenotypic differences in males and females in metazoans. Drosophila has served as a primary model for studying male-female differences in gene expression, and its effects on protein sequence divergence. However, the forces shaping evolution of sex-biased expression remain largely unresolved, including the roles of selection and pleiotropy. Research on sex organs in Drosophila, employing original approaches and multiple-species contrasts, provides a means to gain insights into factors shaping the turnover and magnitude (fold-bias) of sex-biased expression. RESULTS Here, using recent RNA-seq data, we studied sex-biased gonadal expression in 10,740 protein coding sequences in four species of Drosophila, D. melanogaster, D. simulans, D. yakuba and D. ananassae (5 to 44 My divergence). Using an approach wherein we identified genes with lineage-specific transitions (LSTs) in sex-biased status (amongst testis-biased, ovary-biased and unbiased; thus, six transition types) standardized to the number of genes with the ancestral state (S-LSTs), and those with clade-wide expression bias status, we reveal several key findings. First, the six categorical types of S-LSTs in sex-bias showed disparate rates of turnover, consistent with differential selection pressures. Second, the turnover in sex-biased status was largely unrelated to cross-tissue expression breadth, suggesting pleiotropy does not restrict evolution of sex-biased expression. Third, the fold-sex-biased expression, for both testis-biased and ovary-biased genes, evolved directionally over time toward higher values, a crucial finding that could be interpreted as a selective advantage of greater sex-bias, and sexual antagonism. Fourth, in terms of protein divergence, genes with LSTs to testis-biased expression exhibited weak signals of elevated rates of evolution (than ovary-biased) in as little as 5 My, which strengthened over time. Moreover, genes with clade-wide testis-specific expression (44 My), a status not observed for any ovary-biased genes, exhibited striking acceleration of protein divergence, which was linked to low pleiotropy. CONCLUSIONS By studying LSTs and clade-wide sex-biased gonadal expression in a multi-species clade of Drosophila, we describe evidence that interspecies turnover and magnitude of sex-biased expression have been influenced by selection. Further, whilst pleiotropy was not connected to turnover in sex-biased gonadal expression, it likely explains protein sequence divergence.
Collapse
Affiliation(s)
- Carrie A Whittle
- Department of Organismic and Evolutionary Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA, 02138, USA
| | - Cassandra G Extavour
- Department of Organismic and Evolutionary Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA, 02138, USA.
- Department of Molecular and Cellular Biology, Harvard University, 16 Divinity Avenue, Cambridge, MA, 02138, USA.
| |
Collapse
|
5
|
Wang SP, Althoff DM. Phenotypic plasticity facilitates initial colonization of a novel environment. Evolution 2019; 73:303-316. [DOI: 10.1111/evo.13676] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 11/30/2018] [Accepted: 12/21/2018] [Indexed: 01/07/2023]
Affiliation(s)
- Sheng Pei Wang
- Department of Biology Syracuse University Syracuse NY 13244
| | - David M. Althoff
- Department of Biology Syracuse University Syracuse NY 13244
- Archbold Biological Station Venus FL 33960
| |
Collapse
|
6
|
Distinguishing Among Evolutionary Forces Acting on Genome-Wide Base Composition: Computer Simulation Analysis of Approximate Methods for Inferring Site Frequency Spectra of Derived Mutations. G3-GENES GENOMES GENETICS 2018; 8:1755-1769. [PMID: 29588382 PMCID: PMC5940166 DOI: 10.1534/g3.117.300512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Inferred ancestral nucleotide states are increasingly employed in analyses of within- and between -species genome variation. Although numerous studies have focused on ancestral inference among distantly related lineages, approaches to infer ancestral states in polymorphism data have received less attention. Recently developed approaches that employ complex transition matrices allow us to infer ancestral nucleotide sequence in various evolutionary scenarios of base composition. However, the requirement of a single gene tree to calculate a likelihood is an important limitation for conducting ancestral inference using within-species variation in recombining genomes. To resolve this problem, and to extend the applicability of ancestral inference in studies of base composition evolution, we first evaluate three previously proposed methods to infer ancestral nucleotide sequences among within- and between-species sequence variation data. The methods employ a single allele, bifurcating tree, or a star tree for within-species variation data. Using simulated nucleotide sequences, we employ ancestral inference to infer fixations and polymorphisms. We find that all three methods show biased inference. We modify the bifurcating tree method to include weights to adjust for an expected site frequency spectrum, “bifurcating tree with weighting” (BTW). Our simulation analysis show that the BTW method can substantially improve the reliability and robustness of ancestral inference in a range of scenarios that include non-neutral and/or non-stationary base composition evolution.
Collapse
|
7
|
Charlesworth B, Campos JL, Jackson BC. Faster-X evolution: Theory and evidence from Drosophila. Mol Ecol 2018; 27:3753-3771. [PMID: 29431881 DOI: 10.1111/mec.14534] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Revised: 01/31/2018] [Accepted: 02/06/2018] [Indexed: 12/13/2022]
Abstract
A faster rate of adaptive evolution of X-linked genes compared with autosomal genes can be caused by the fixation of recessive or partially recessive advantageous mutations, due to the full expression of X-linked mutations in hemizygous males. Other processes, including recombination rate and mutation rate differences between X chromosomes and autosomes, may also cause faster evolution of X-linked genes. We review population genetics theory concerning the expected relative values of variability and rates of evolution of X-linked and autosomal DNA sequences. The theoretical predictions are compared with data from population genomic studies of several species of Drosophila. We conclude that there is evidence for adaptive faster-X evolution of several classes of functionally significant nucleotides. We also find evidence for potential differences in mutation rates between X-linked and autosomal genes, due to differences in mutational bias towards GC to AT mutations. Many aspects of the data are consistent with the male hemizygosity model, although not all possible confounding factors can be excluded.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Benjamin C Jackson
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
8
|
Jackson BC, Campos JL, Haddrill PR, Charlesworth B, Zeng K. Variation in the Intensity of Selection on Codon Bias over Time Causes Contrasting Patterns of Base Composition Evolution in Drosophila. Genome Biol Evol 2017; 9:102-123. [PMID: 28082609 PMCID: PMC5381600 DOI: 10.1093/gbe/evw291] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/07/2016] [Indexed: 12/11/2022] Open
Abstract
Four-fold degenerate coding sites form a major component of the genome, and are often used to make inferences about selection and demography, so that understanding their evolution is important. Despite previous efforts, many questions regarding the causes of base composition changes at these sites in Drosophila remain unanswered. To shed further light on this issue, we obtained a new whole-genome polymorphism data set from D. simulans. We analyzed samples from the putatively ancestral range of D. simulans, as well as an existing polymorphism data set from an African population of D. melanogaster. By using D. yakuba as an outgroup, we found clear evidence for selection on 4-fold sites along both lineages over a substantial period, with the intensity of selection increasing with GC content. Based on an explicit model of base composition evolution, we suggest that the observed AT-biased substitution pattern in both lineages is probably due to an ancestral reduction in selection intensity, and is unlikely to be the result of an increase in mutational bias towards AT alone. By using two polymorphism-based methods for estimating selection coefficients over different timescales, we show that the selection intensity on codon usage has been rather stable in D. simulans in the recent past, but the long-term estimates in D. melanogaster are much higher than the short-term ones, indicating a continuing decline in selection intensity, to such an extent that the short-term estimates suggest that selection is only active in the most GC-rich parts of the genome. Finally, we provide evidence for complex evolutionary patterns in the putatively neutral short introns, which cannot be explained by the standard GC-biased gene conversion model. These results reveal a dynamic picture of base composition evolution.
Collapse
Affiliation(s)
- Benjamin C Jackson
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Penelope R Haddrill
- Centre for Forensic Science, Department of Pure and Applied Chemistry, University of Strathclyde, Glasgow, United Kingdom
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
9
|
Mahato NK, Gupta V, Singh P, Kumari R, Verma H, Tripathi C, Rani P, Sharma A, Singhvi N, Sood U, Hira P, Kohli P, Nayyar N, Puri A, Bajaj A, Kumar R, Negi V, Talwar C, Khurana H, Nagar S, Sharma M, Mishra H, Singh AK, Dhingra G, Negi RK, Shakarad M, Singh Y, Lal R. Microbial taxonomy in the era of OMICS: application of DNA sequences, computational tools and techniques. Antonie van Leeuwenhoek 2017; 110:1357-1371. [PMID: 28831610 DOI: 10.1007/s10482-017-0928-1] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/17/2017] [Accepted: 08/10/2017] [Indexed: 02/06/2023]
Abstract
The current prokaryotic taxonomy classifies phenotypically and genotypically diverse microorganisms using a polyphasic approach. With advances in the next-generation sequencing technologies and computational tools for analysis of genomes, the traditional polyphasic method is complemented with genomic data to delineate and classify bacterial genera and species as an alternative to cumbersome and error-prone laboratory tests. This review discusses the applications of sequence-based tools and techniques for bacterial classification and provides a scheme for more robust and reproducible bacterial classification based on genomic data. The present review highlights promising tools and techniques such as ortho-Average Nucleotide Identity, Genome to Genome Distance Calculator and Multi Locus Sequence Analysis, which can be validly employed for characterizing novel microorganisms and assessing phylogenetic relationships. In addition, the review discusses the possibility of employing metagenomic data to assess the phylogenetic associations of uncultured microorganisms. Through this article, we present a review of genomic approaches that can be included in the scheme of taxonomy of bacteria and archaea based on computational and in silico advances to boost the credibility of taxonomic classification in this genomic era.
Collapse
Affiliation(s)
| | - Vipin Gupta
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Priya Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Rashmi Kumari
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | | | - Charu Tripathi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Pooja Rani
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Anukriti Sharma
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Nirjara Singhvi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Utkarsh Sood
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Princy Hira
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Puneet Kohli
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Namita Nayyar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Akshita Puri
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Abhay Bajaj
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Roshan Kumar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Vivek Negi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Chandni Talwar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Himani Khurana
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Shekhar Nagar
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Monika Sharma
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Harshita Mishra
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Amit Kumar Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Gauri Dhingra
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Ram Krishan Negi
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | | | - Yogendra Singh
- Department of Zoology, University of Delhi, Delhi, 110007, India
| | - Rup Lal
- Department of Zoology, University of Delhi, Delhi, 110007, India.
| |
Collapse
|
10
|
Szövényi P, Ullrich KK, Rensing SA, Lang D, van Gessel N, Stenøien HK, Conti E, Reski R. Selfing in Haploid Plants and Efficacy of Selection: Codon Usage Bias in the Model Moss Physcomitrella patens. Genome Biol Evol 2017; 9:1528-1546. [PMID: 28549175 PMCID: PMC5507605 DOI: 10.1093/gbe/evx098] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2017] [Indexed: 12/15/2022] Open
Abstract
A long-term reduction in effective population size will lead to major shift in genome evolution. In particular, when effective population size is small, genetic drift becomes dominant over natural selection. The onset of self-fertilization is one evolutionary event considerably reducing effective size of populations. Theory predicts that this reduction should be more dramatic in organisms capable for haploid than for diploid selfing. Although theoretically well-grounded, this assertion received mixed experimental support. Here, we test this hypothesis by analyzing synonymous codon usage bias of genes in the model moss Physcomitrella patens frequently undergoing haploid selfing. In line with population genetic theory, we found that the effect of natural selection on synonymous codon usage bias is very weak. Our conclusion is supported by four independent lines of evidence: 1) Very weak or nonsignificant correlation between gene expression and codon usage bias, 2) no increased codon usage bias in more broadly expressed genes, 3) no evidence that codon usage bias would constrain synonymous and nonsynonymous divergence, and 4) predominant role of genetic drift on synonymous codon usage predicted by a model-based analysis. These findings show striking similarity to those observed in AT-rich genomes with weak selection for optimal codon usage and GC content overall. Our finding is in contrast to a previous study reporting adaptive codon usage bias in the moss P. patens.
Collapse
Affiliation(s)
- Péter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Kristian K. Ullrich
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- Present address: Max-Planck-Insitut für Evolutionsbiologie, Plön, Germany
| | - Stefan A. Rensing
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
| | - Daniel Lang
- Plant Genome and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nico van Gessel
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| | | | - Elena Conti
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Ralf Reski
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| |
Collapse
|
11
|
Choi JY, Aquadro CF. Recent and Long-Term Selection Across Synonymous Sites in Drosophila ananassae. J Mol Evol 2016; 83:50-60. [PMID: 27481397 DOI: 10.1007/s00239-016-9753-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2015] [Accepted: 07/23/2016] [Indexed: 11/28/2022]
Abstract
In Drosophila, many studies have examined the short- or long-term evolution occurring across synonymous sites. Few, however, have examined both the recent and long-term evolution to gain a complete view of this selection. Here we have analyzed Drosophila ananassae DNA polymorphism and divergence data using several different methods, and have identified evidence of positive selection favoring preferred codons in both recent and long-term evolutionary time scale. Further in D. ananassae, the strength of selection for preferred codons was stronger on the X chromosome compared to the autosomes. We show that this stronger selection is not due to higher gene expression of X-linked genes. Analysis of the selectively neutral introns indicated that the X chromosome also had a preference for GC over AT nucleotides, potentially from GC-biased gene conversions (gcBGCs) that can also affect the base composition of synonymous sites. Thus selection for preferred codons and gcBGC both seem to be partially responsible for shaping the D. ananassae synonymous site evolution.
Collapse
Affiliation(s)
- Jae Young Choi
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA.
| | - Charles F Aquadro
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
| |
Collapse
|
12
|
Matsumoto T, John A, Baeza-Centurion P, Li B, Akashi H. Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations. Mol Biol Evol 2016; 33:1580-9. [PMID: 26873577 DOI: 10.1093/molbev/msw027] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees.
Collapse
Affiliation(s)
- Tomotaka Matsumoto
- Division of Evolutionary Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan
| | - Anoop John
- Division of Evolutionary Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan
| | - Pablo Baeza-Centurion
- Division of Evolutionary Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan
| | - Boyang Li
- Division of Evolutionary Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan
| | - Hiroshi Akashi
- Division of Evolutionary Genetics, National Institute of Genetics, Yata, Mishima, Shizuoka, Japan Department of Genetics, The Graduate University for Advanced Studies (SOKENDAI), Yata, Mishima, Shizuoka, Japan
| |
Collapse
|
13
|
Evaluation of Ancestral Sequence Reconstruction Methods to Infer Nonstationary Patterns of Nucleotide Substitution. Genetics 2015; 200:873-90. [PMID: 25948563 DOI: 10.1534/genetics.115.177386] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Accepted: 04/28/2015] [Indexed: 01/07/2023] Open
Abstract
Inference of gene sequences in ancestral species has been widely used to test hypotheses concerning the process of molecular sequence evolution. However, the approach may produce spurious results, mainly because using the single best reconstruction while ignoring the suboptimal ones creates systematic biases. Here we implement methods to correct for such biases and use computer simulation to evaluate their performance when the substitution process is nonstationary. The methods we evaluated include parsimony and likelihood using the single best reconstruction (SBR), averaging over reconstructions weighted by the posterior probabilities (AWP), and a new method called expected Markov counting (EMC) that produces maximum-likelihood estimates of substitution counts for any branch under a nonstationary Markov model. We simulated base composition evolution on a phylogeny for six species, with different selective pressures on G+C content among lineages, and compared the counts of nucleotide substitutions recorded during simulation with the inference by different methods. We found that large systematic biases resulted from (i) the use of parsimony or likelihood with SBR, (ii) the use of a stationary model when the substitution process is nonstationary, and (iii) the use of the Hasegawa-Kishino-Yano (HKY) model, which is too simple to adequately describe the substitution process. The nonstationary general time reversible (GTR) model, used with AWP or EMC, accurately recovered the substitution counts, even in cases of complex parameter fluctuations. We discuss model complexity and the compromise between bias and variance and suggest that the new methods may be useful for studying complex patterns of nucleotide substitution in large genomic data sets.
Collapse
|
14
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022] Open
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Collapse
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| |
Collapse
|
15
|
Nucleosomes shape DNA polymorphism and divergence. PLoS Genet 2014; 10:e1004457. [PMID: 24991813 PMCID: PMC4081404 DOI: 10.1371/journal.pgen.1004457] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 05/12/2014] [Indexed: 11/30/2022] Open
Abstract
An estimated 80% of genomic DNA in eukaryotes is packaged as nucleosomes, which, together with the remaining interstitial linker regions, generate higher order chromatin structures [1]. Nucleosome sequences isolated from diverse organisms exhibit ∼10 bp periodic variations in AA, TT and GC dinucleotide frequencies. These sequence elements generate intrinsically curved DNA and help establish the histone-DNA interface. We investigated an important unanswered question concerning the interplay between chromatin organization and genome evolution: do the DNA sequence preferences inherent to the highly conserved histone core exert detectable natural selection on genomic divergence and polymorphism? To address this hypothesis, we isolated nucleosomal DNA sequences from Drosophila melanogaster embryos and examined the underlying genomic variation within and between species. We found that divergence along the D. melanogaster lineage is periodic across nucleosome regions with base changes following preferred nucleotides, providing new evidence for systematic evolutionary forces in the generation and maintenance of nucleosome-associated dinucleotide periodicities. Further, Single Nucleotide Polymorphism (SNP) frequency spectra show striking periodicities across nucleosomal regions, paralleling divergence patterns. Preferred alleles occur at higher frequencies in natural populations, consistent with a central role for natural selection. These patterns are stronger for nucleosomes in introns than in intergenic regions, suggesting selection is stronger in transcribed regions where nucleosomes undergo more displacement, remodeling and functional modification. In addition, we observe a large-scale (∼180 bp) periodic enrichment of AA/TT dinucleotides associated with nucleosome occupancy, while GC dinucleotide frequency peaks in linker regions. Divergence and polymorphism data also support a role for natural selection in the generation and maintenance of these super-nucleosomal patterns. Our results demonstrate that nucleosome-associated sequence periodicities are under selective pressure, implying that structural interactions between nucleosomes and DNA sequence shape sequence evolution, particularly in introns. In eukaryotic cells, the majority of DNA is packaged in nucleosomes comprised of ∼147 bp of DNA wound tightly around the highly conserved histone octamer. Nucleosomal DNA from diverse organisms shows an anti-correlated ∼10 bp periodicity of AT-rich and GC-rich dinucleotides. These sequence features influence DNA bending and shape, facilitating structural interactions. We asked whether natural selection mediated through the periodic sequence preferences of nucleosomes shapes the evolution of non-protein-coding regions of D. melanogaster by examining the inter- and intra-species genomic variation relative to these fundamental chromatin building blocks. The sequence changes across nucleosome-bound regions on the melanogaster lineage mirror the observed nucleosome dinucleotide periodicities. Importantly, we show that the frequencies of polymorphisms in natural populations vary across these regions, paralleling divergence, with higher frequencies of preferred alleles. These patterns are most evident for intronic regions and indicate that non-protein coding regions are evolving toward sequences that facilitate the canonical association with the histone core. This result is consistent with the hypothesis that interactions between DNA and the core have systematic impacts on function that are subject to natural selection and are not solely due to mutational bias. These ubiquitous interactions with the histone core partially account for the evolutionary constraint observed in unannotated genomic regions, and may drive broad changes in base composition.
Collapse
|
16
|
Adaptive synonymous mutations in an experimentally evolved Pseudomonas fluorescens population. Nat Commun 2014; 5:4076. [PMID: 24912567 DOI: 10.1038/ncomms5076] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 05/08/2014] [Indexed: 01/22/2023] Open
Abstract
Conventional wisdom holds that synonymous mutations, nucleotide changes that do not alter the encoded amino acid, have no detectable effect on phenotype or fitness. However, a growing body of evidence from both comparative and experimental studies suggests otherwise. Synonymous mutations have been shown to impact gene expression, protein folding and fitness, however, direct evidence that they can be positively selected, and so contribute to adaptation, is lacking. Here we report the recovery of two beneficial synonymous single base pair changes that arose spontaneously and independently in an experimentally evolved population of Pseudomonas fluorescens. We show experimentally that these mutations increase fitness by an amount comparable to non-synonymous mutations and that the fitness increases stem from increased gene expression. These results provide unequivocal evidence that synonymous mutations can drive adaptive evolution and suggest that this class of mutation may be underappreciated as a cause of adaptation and evolutionary dynamics.
Collapse
|
17
|
Lee YCG, Langley CH, Begun DJ. Differential strengths of positive selection revealed by hitchhiking effects at small physical scales in Drosophila melanogaster. Mol Biol Evol 2013; 31:804-16. [PMID: 24361994 DOI: 10.1093/molbev/mst270] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
The long time scale of adaptive evolution makes it difficult to directly observe the spread of most beneficial mutations through natural populations. Therefore, inferring attributes of beneficial mutations by studying the genomic signals left by directional selection is an important component of population genetics research. One kind of signal is a trough in nearby neutral genetic variation due to selective fixation of initially rare alleles, a phenomenon known as "genetic hitchhiking." Accumulated evidence suggests that a considerable fraction of substitutions in the Drosophila genome results from positive selection, most of which are expected to have small selection coefficients and influence the population genetics of sites in the immediate vicinity. Using Drosophila melanogaster population genomic data, we found that the heterogeneity in synonymous polymorphism surrounding different categories of coding fixations is readily observable even within 25 bp of focal substitutions, which we interpret as the result of small-scale hitchhiking effects. The strength of natural selection on different sites appears to be quite heterogeneous. Particularly, neighboring fixations that changed amino acid polarities in a way that maintained the overall polarities of a protein were under stronger selection than other categories of fixations. Interestingly, we found that substitutions in slow-evolving genes are associated with stronger hitchhiking effects. This is consistent with the idea that adaptive evolution may involve few substitutions with large effects or many substitutions with small effects. Because our approach only weakly depends on the numbers of recent nonsynonymous substitutions, it can provide a complimentary view to the adaptive evolution inferred by other divergence-based evolutionary genetic methods.
Collapse
Affiliation(s)
- Yuh Chwen G Lee
- Department of Evolution and Ecology and Center for Population Biology, University of California, Davis
| | | | | |
Collapse
|
18
|
Charlesworth B. Stabilizing selection, purifying selection, and mutational bias in finite populations. Genetics 2013; 194:955-71. [PMID: 23709636 PMCID: PMC3730922 DOI: 10.1534/genetics.113.151555] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2013] [Accepted: 05/18/2013] [Indexed: 12/16/2022] Open
Abstract
Genomic traits such as codon usage and the lengths of noncoding sequences may be subject to stabilizing selection rather than purifying selection. Mutations affecting these traits are often biased in one direction. To investigate the potential role of stabilizing selection on genomic traits, the effects of mutational bias on the equilibrium value of a trait under stabilizing selection in a finite population were investigated, using two different mutational models. Numerical results were generated using a matrix method for calculating the probability distribution of variant frequencies at sites affecting the trait, as well as by Monte Carlo simulations. Analytical approximations were also derived, which provided useful insights into the numerical results. A novel conclusion is that the scaled intensity of selection acting on individual variants is nearly independent of the effective population size over a wide range of parameter space and is strongly determined by the logarithm of the mutational bias parameter. This is true even when there is a very small departure of the mean from the optimum, as is usually the case. This implies that studies of the frequency spectra of DNA sequence variants may be unable to distinguish between stabilizing and purifying selection. A similar investigation of purifying selection against deleterious mutations was also carried out. Contrary to previous suggestions, the scaled intensity of purifying selection with synergistic fitness effects is sensitive to population size, which is inconsistent with the general lack of sensitivity of codon usage to effective population size.
Collapse
Affiliation(s)
- Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| |
Collapse
|
19
|
Strong purifying selection at synonymous sites in D. melanogaster. PLoS Genet 2013; 9:e1003527. [PMID: 23737754 PMCID: PMC3667748 DOI: 10.1371/journal.pgen.1003527] [Citation(s) in RCA: 143] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 04/08/2013] [Indexed: 11/19/2022] Open
Abstract
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated.
Collapse
|
20
|
Poh YP, Ting CT, Fu HW, Langley CH, Begun DJ. Population genomic analysis of base composition evolution in Drosophila melanogaster. Genome Biol Evol 2013; 4:1245-55. [PMID: 23160062 PMCID: PMC3542573 DOI: 10.1093/gbe/evs097] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The relative importance of mutation, selection, and biased gene conversion to patterns of base composition variation in Drosophila melanogaster, and to a lesser extent, D. simulans, has been investigated for many years. However, genomic data from sufficiently large samples to thoroughly characterize patterns of base composition polymorphism within species have been lacking. Here, we report a genome-wide analysis of coding and noncoding polymorphism in a large sample of inbred D. melanogaster strains from Raleigh, North Carolina. Consistent with previous results, we observed that AT mutations fix more frequently than GC mutations in D. melanogaster. Contrary to predictions of previous models of codon usage in D. melanogaster, we found that synonymous sites segregating for derived AT polymorphisms were less skewed toward low frequencies compared with sites segregating a derived GC polymorphism. However, no such pattern was observed for comparable base composition polymorphisms in noncoding DNA. These results suggest that AT-ending codons could currently be favored by natural selection in the D. melanogaster lineage.
Collapse
Affiliation(s)
- Yu-Ping Poh
- Institute of Molecular and Cellular Biology, National Tsing Hua University, Taiwan, Republic of China.
| | | | | | | | | |
Collapse
|
21
|
Shabalina SA, Spiridonov NA, Kashina A. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity. Nucleic Acids Res 2013; 41:2073-94. [PMID: 23293005 PMCID: PMC3575835 DOI: 10.1093/nar/gks1205] [Citation(s) in RCA: 187] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions.
Collapse
Affiliation(s)
- Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20984, USA.
| | | | | |
Collapse
|
22
|
Clemente F, Vogl C. Evidence for complex selection on four-fold degenerate sites in Drosophila melanogaster. J Evol Biol 2012; 25:2582-95. [PMID: 23020078 DOI: 10.1111/jeb.12003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Revised: 08/31/2012] [Accepted: 08/31/2012] [Indexed: 01/04/2023]
Abstract
We considered genome-wide four-fold degenerate sites from an African Drosophila melanogaster population and compared them to short introns. To include divergence and to polarize the data, we used its close relatives Drosophila simulans, Drosophila sechellia, Drosophila erecta and Drosophila yakuba as outgroups. In D. melanogaster, the GC content at four-fold degenerate sites is higher than in short introns; compared to its relatives, more AT than GC is fixed. The former has been explained by codon usage bias (CUB) favouring GC; the latter by decreased intensity of directional selection or by increased mutation bias towards AT. With a biallelic equilibrium model, evidence for directional selection comes mostly from the GC-rich ancestral base composition. Together with a slight mutation bias, it leads to an asymmetry of the unpolarized allele frequency spectrum, from which directional selection is inferred. Using a quasi-equilibrium model and polarized spectra, however, only purifying and no directional selection is detected. Furthermore, polarized spectra are proportional to those of the presumably unselected short introns. As we have no evidence for a decrease in effective population size, relaxed CUB must be due to a reduction in the selection coefficient. Going beyond the biallelic model and considering all four bases, signs of directional selection are stronger. In contrast to short introns, complementary bases show strand specificity and allele frequency spectra depend on mutation directions. Hence, the traditional biallelic model to describe the evolution of four-fold degenerate sites should be replaced by more complex models assuming only quasi-equilibrium and accounting for all four bases.
Collapse
Affiliation(s)
- F Clemente
- Institute of Population Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| | | |
Collapse
|
23
|
Langley CH, Stevens K, Cardeno C, Lee YCG, Schrider DR, Pool JE, Langley SA, Suarez C, Corbett-Detig RB, Kolaczkowski B, Fang S, Nista PM, Holloway AK, Kern AD, Dewey CN, Song YS, Hahn MW, Begun DJ. Genomic variation in natural populations of Drosophila melanogaster. Genetics 2012; 192:533-98. [PMID: 22673804 PMCID: PMC3454882 DOI: 10.1534/genetics.112.142018] [Citation(s) in RCA: 243] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Accepted: 05/24/2012] [Indexed: 02/07/2023] Open
Abstract
This report of independent genome sequences of two natural populations of Drosophila melanogaster (37 from North America and 6 from Africa) provides unique insight into forces shaping genomic polymorphism and divergence. Evidence of interactions between natural selection and genetic linkage is abundant not only in centromere- and telomere-proximal regions, but also throughout the euchromatic arms. Linkage disequilibrium, which decays within 1 kbp, exhibits a strong bias toward coupling of the more frequent alleles and provides a high-resolution map of recombination rate. The juxtaposition of population genetics statistics in small genomic windows with gene structures and chromatin states yields a rich, high-resolution annotation, including the following: (1) 5'- and 3'-UTRs are enriched for regions of reduced polymorphism relative to lineage-specific divergence; (2) exons overlap with windows of excess relative polymorphism; (3) epigenetic marks associated with active transcription initiation sites overlap with regions of reduced relative polymorphism and relatively reduced estimates of the rate of recombination; (4) the rate of adaptive nonsynonymous fixation increases with the rate of crossing over per base pair; and (5) both duplications and deletions are enriched near origins of replication and their density correlates negatively with the rate of crossing over. Available demographic models of X and autosome descent cannot account for the increased divergence on the X and loss of diversity associated with the out-of-Africa migration. Comparison of the variation among these genomes to variation among genomes from D. simulans suggests that many targets of directional selection are shared between these species.
Collapse
Affiliation(s)
- Charles H Langley
- Department of Evolution and Ecology, University of California, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Akashi H, Osada N, Ohta T. Weak selection and protein evolution. Genetics 2012; 192:15-31. [PMID: 22964835 PMCID: PMC3430532 DOI: 10.1534/genetics.112.140178] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2012] [Accepted: 06/11/2012] [Indexed: 01/23/2023] Open
Abstract
The "nearly neutral" theory of molecular evolution proposes that many features of genomes arise from the interaction of three weak evolutionary forces: mutation, genetic drift, and natural selection acting at its limit of efficacy. Such forces generally have little impact on allele frequencies within populations from generation to generation but can have substantial effects on long-term evolution. The evolutionary dynamics of weakly selected mutations are highly sensitive to population size, and near neutrality was initially proposed as an adjustment to the neutral theory to account for general patterns in available protein and DNA variation data. Here, we review the motivation for the nearly neutral theory, discuss the structure of the model and its predictions, and evaluate current empirical support for interactions among weak evolutionary forces in protein evolution. Near neutrality may be a prevalent mode of evolution across a range of functional categories of mutations and taxa. However, multiple evolutionary mechanisms (including adaptive evolution, linked selection, changes in fitness-effect distributions, and weak selection) can often explain the same patterns of genome variation. Strong parameter sensitivity remains a limitation of the nearly neutral model, and we discuss concave fitness functions as a plausible underlying basis for weak selection.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Division of Evolutionary Genetics, Department of Population Genetics, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan.
| | | | | |
Collapse
|
25
|
Hu TT, Eisen MB, Thornton KR, Andolfatto P. A second-generation assembly of the Drosophila simulans genome provides new insights into patterns of lineage-specific divergence. Genome Res 2012; 23:89-98. [PMID: 22936249 PMCID: PMC3530686 DOI: 10.1101/gr.141689.112] [Citation(s) in RCA: 116] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
We create a new assembly of the Drosophila simulans genome using 142 million paired short-read sequences and previously published data for strain w501. Our assembly represents a higher-quality genomic sequence with greater coverage, fewer misassemblies, and, by several indexes, fewer sequence errors. Evolutionary analysis of this genome reference sequence reveals interesting patterns of lineage-specific divergence that are different from those previously reported. Specifically, we find that Drosophila melanogaster evolves faster than D. simulans at all annotated classes of sites, including putatively neutrally evolving sites found in minimal introns. While this may be partly explained by a higher mutation rate in D. melanogaster, we also find significant heterogeneity in rates of evolution across classes of sites, consistent with historical differences in the effective population size for the two species. Also contrary to previous findings, we find that the X chromosome is evolving significantly faster than autosomes for nonsynonymous and most noncoding DNA sites and significantly slower for synonymous sites. The absence of a X/A difference for putatively neutral sites and the robustness of the pattern to Gene Ontology and sex-biased expression suggest that partly recessive beneficial mutations may comprise a substantial fraction of noncoding DNA divergence observed between species. Our results have more general implications for the interpretation of evolutionary analyses of genomes of different quality.
Collapse
Affiliation(s)
- Tina T Hu
- Department of Ecology and Evolutionary Biology and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, USA.
| | | | | | | |
Collapse
|
26
|
Clemente F, Vogl C. Unconstrained evolution in short introns? - an analysis of genome-wide polymorphism and divergence data from Drosophila. J Evol Biol 2012; 25:1975-1990. [PMID: 22901008 DOI: 10.1111/j.1420-9101.2012.02580.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Revised: 06/15/2012] [Accepted: 06/22/2012] [Indexed: 12/23/2022]
Abstract
An unconstrained reference sequence facilitates the detection of selection. In Drosophila, sequence variation in short introns seems to be least influenced by selection and dominated by mutation and drift. Here, we test this with genome-wide sequences using an African population (Malawi) of D. melanogaster and data from the related outgroup species D. simulans, D. sechellia, D. erecta and D. yakuba. The distribution of mutations deviates from equilibrium, and the content of A and T (AT) nucleotides shows an excess of variance among introns. We explain this by a complex mutational pattern: a shift in mutational bias towards AT, leading to a slight nonequilibrium in base composition and context-dependent mutation rates, with G or C (GC) sites mutating most frequently in AT-rich introns. By comparing the corresponding allele frequency spectra of AT-rich vs. GC-rich introns, we can rule out the influence of directional selection or biased gene conversion on the mutational pattern. Compared with neutral equilibrium expectations, polymorphism spectra show an excess of low frequency and a paucity of intermediate frequency variants, irrespective of the direction of mutation. Combining the information from different outgroups with the polymorphism data and using a generalized linear model, we find evidence for shared ancestral polymorphism between D. melanogaster and D. simulans, D. sechellia, arguing against a bottleneck in D. melanogaster. Generally, we find that short introns can be used as a neutral reference on a genome-wide level, if the spatially and temporally varying mutational pattern is accounted for.
Collapse
Affiliation(s)
- F Clemente
- Institute of Population Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| | - C Vogl
- Institute of Animal Breeding and Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| |
Collapse
|
27
|
Ma X, Zhang P, Song G, Chen Y, Wang Z, Yin Y, Kong D, Zhang S, Zhao Z, Ouyang H, Tang B, Li Z. The construction and expression of lysine-rich gene in the mammary gland of transgenic mice. DNA Cell Biol 2012; 31:1372-83. [PMID: 22577831 DOI: 10.1089/dna.2011.1599] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Lysine is the limiting amino acid in cereal grains, which represent a major source of human food and animal feed worldwide, and is considered the most important of the essential amino acids. In this study, β-casein, αS2-casein, and lactotransferrin cDNA clone fragments encoding lysine-rich peptides were fused together to generate a lysine-rich (LR) gene and the mammary gland-specific expression vector pBC1-LR-NEO(r) was constructed. Transgenic mice were generated by pronuclear microinjection of the linearized expression vectors harboring the LR transgene. The transgenic mice and their offspring were examined using multiplex polymerase chain reaction (PCR), Southern blotting, reverse transcriptase-PCR, in situ hybridization, and Western blotting techniques. Our results showed that the LR gene was successfully integrated into the mouse genome and was transmitted stably. The specific LR gene expression was restricted to the mammary gland, active alveoli of the transgenic female mice during lactation. The lysine level of the two transgenic lines was significantly higher than that of nontransgenic controls (p<0.05). In addition, the growth performance of transgenic pups was enhanced by directly feeding them the LR protein-enriched transgenic milk. Our results demonstrated that lysine-rich gene was successfully constructed and expressed in mammary gland of transgenic mice. This study will provide a better understanding of how mammary gland expression systems that increase the lysine content of milk can be applied to other mammals, such as cows.
Collapse
Affiliation(s)
- Xin Ma
- Jilin Provincial Key Laboratory of Animal Embryo Engineering, The Center for Animal Embryo Engineering of Jilin Province, College of Animal Science and Veterinary Medicine, Jilin University, Changchun, Jilin, China
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Gossmann TI, Keightley PD, Eyre-Walker A. The effect of variation in the effective population size on the rate of adaptive molecular evolution in eukaryotes. Genome Biol Evol 2012; 4:658-67. [PMID: 22436998 PMCID: PMC3381672 DOI: 10.1093/gbe/evs027] [Citation(s) in RCA: 110] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The role of adaptation is a fundamental question in molecular evolution. Theory predicts that species with large effective population sizes should undergo a higher rate of adaptive evolution than species with low effective population sizes if adaptation is limited by the supply of mutations. Previous analyses have appeared to support this conjecture because estimates of the proportion of nonsynonymous substitutions fixed by adaptive evolution, α, tend to be higher in species with large Ne. However, α is a function of both the number of advantageous and effectively neutral substitutions, either of which might depend on Ne. Here, we investigate the relationship between Ne and ωa, the rate of adaptive evolution relative to the rate of neutral evolution, using nucleotide polymorphism and divergence data from 13 independent pairs of eukaryotic species. We find a highly significant positive correlation between ωa and Ne. We also find some evidence that the rate of adaptive evolution varies between groups of organisms for a given Ne. The correlation between ωa and Ne does not appear to be an artifact of demographic change or selection on synonymous codon use. Our results suggest that adaptation is to some extent limited by the supply of mutations and that at least some adaptation depends on newly occurring mutations rather than on standing genetic variation. Finally, we show that the proportion of nearly neutral nonadaptive substitutions declines with increasing Ne. The low rate of adaptive evolution and the high proportion of effectively neutral substitution in species with small Ne are expected to combine to make it difficult to detect adaptive molecular evolution in species with small Ne.
Collapse
Affiliation(s)
- Toni I Gossmann
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | | | | |
Collapse
|
29
|
Whittle CA, Sun Y, Johannesson H. Genome-wide selection on codon usage at the population level in the fungal model organism Neurospora crassa. Mol Biol Evol 2012; 29:1975-86. [PMID: 22334579 DOI: 10.1093/molbev/mss065] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Many organisms exhibit biased codon usage in their genome, including the fungal model organism Neurospora crassa. The preferential use of subset of synonymous codons (optimal codons) at the macroevolutionary level is believed to result from a history of selection to promote translational efficiency. At present, few data are available about selection on optimal codons at the microevolutionary scale, that is, at the population level. Herein, we conducted a large-scale assessment of codon mutations at biallelic sites, spanning more than 5,100 genes, in 2 distinct populations of N. crassa: the Caribbean and Louisiana populations. Based on analysis of the frequency spectra of synonymous codon mutations at biallelic sites, we found that derived (nonancestral) optimal codon mutations segregate at a higher frequency than derived nonoptimal codon mutations in each population; this is consistent with natural selection favoring optimal codons. We also report that optimal codon variants were less frequent in longer genes and that the fixation of optimal codons was reduced in rapidly evolving long genes/proteins, trends suggestive of genetic hitchhiking (Hill-Robertson) altering codon usage variation. Notably, nonsynonymous codon mutations segregated at a lower frequency than synonymous nonoptimal codon mutations (which impair translational efficiency) in each N. crassa population, suggesting that changes in protein composition are more detrimental to fitness than mutations altering translation. Overall, the present data demonstrate that selection, and partly genetic interference, shapes codon variation across the genome in N. crassa populations.
Collapse
Affiliation(s)
- C A Whittle
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden
| | | | | |
Collapse
|
30
|
Campos JL, Charlesworth B, Haddrill PR. Molecular evolution in nonrecombining regions of the Drosophila melanogaster genome. Genome Biol Evol 2012; 4:278-88. [PMID: 22275518 PMCID: PMC3318434 DOI: 10.1093/gbe/evs010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
We study the evolutionary effects of reduced recombination on the Drosophila melanogaster genome, analyzing more than 200 new genes that lack crossing-over and employing a novel orthology search among species of the melanogaster subgroup. These genes are located in the heterochromatin of chromosomes other than the dot (fourth) chromosome. Noncrossover regions of the genome all exhibited an elevated level of evolutionary divergence from D. yakuba at nonsynonymous sites, lower codon usage bias, lower GC content in coding and noncoding regions, and longer introns. Levels of gene expression are similar for genes in regions with and without crossing-over, which rules out the possibility that the reduced level of adaptation that we detect is caused by relaxed selection due to lower levels of gene expression in the heterochromatin. The patterns observed are consistent with a reduction in the efficacy of selection in all regions of the genome of D. melanogaster that lack crossing-over, as a result of the effects of enhanced Hill-Robertson interference. However, we also detected differences among nonrecombining locations: The X chromosome seems to exhibit the weakest effects, whereas the fourth chromosome and the heterochromatic genes on the autosomes located most proximal to the centromere showed the largest effects. However, signatures of selection on both nonsynonymous mutations and on codon usage persist in all heterochromatic regions.
Collapse
Affiliation(s)
- José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, United Kingdom.
| | | | | |
Collapse
|
31
|
Vogl C, Clemente F. The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates. Theor Popul Biol 2012; 81:197-209. [PMID: 22269092 PMCID: PMC3315028 DOI: 10.1016/j.tpb.2012.01.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Revised: 12/23/2011] [Accepted: 01/04/2012] [Indexed: 01/22/2023]
Abstract
We analyze a decoupled Moran model with haploid population size N, a biallelic locus under mutation and drift with scaled forward and backward mutation rates θ1=μ1N and θ0=μ0N, and directional selection with scaled strength γ=sN. With small scaled mutation rates θ0 and θ1, which is appropriate for single nucleotide polymorphism data in highly recombining regions, we derive a simple approximate equilibrium distribution for polymorphic alleles with a constant of proportionality. We also put forth an even simpler model, where all mutations originate from monomorphic states. Using this model we derive the sojourn times, conditional on the ancestral and fixed allele, and under equilibrium the distributions of fixed and polymorphic alleles and fixation rates. Furthermore, we also derive the distribution of small samples in the diffusion limit and provide convenient recurrence relations for calculating this distribution. This enables us to give formulas analogous to the Ewens–Watterson estimator of θ for biased mutation rates and selection. We apply this theory to a polymorphism dataset of fourfold degenerate sites in Drosophila melanogaster.
Collapse
Affiliation(s)
- Claus Vogl
- Institute of Animal Breeding and Genetics, Veterinärmedizinische Universität Wien, Veterinärplatz 1, A-1210 Vienna, Austria.
| | | |
Collapse
|
32
|
HUANG QIANLI, LI YONG, LI-LING JESSE, HUANG HUIFANG, CHEN XUEPING, MA FEI. COMPARATIVE ANALYSIS OF DISEASE-ASSOCIATED MUTATIONS IN THE CODING REGIONS OF ALTERNATIVELY AND CONSTITUTIVELY SPLICED HUMAN GENES. J BIOL SYST 2011. [DOI: 10.1142/s0218339008002563] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
To better understand the evolutionary and molecular mechanisms of alternative splicing causing human diseases, we have systematically compared the pattern, the distribution and the density of disease-associated mutations as well as the influence of codon usage bias on the single mutation between alternatively and constitutively spliced genes through analysis of the large datasets from human disease genes. The results indicated that:1. The most common pattern of single mutation in alternatively and constitutively spliced genes are, respectively, C/T (25.17%), (22.81%) and G/A (21.54%), (22.73%), suggesting that the two types of disease genes are prone to C → T and G → A mutations.2. There is an overall preponderance for transitions over transversions in alternatively (62.88% versus 37.12%) and constitutively (64.41% versus 35.59%) spliced disease genes.3. For the second base of codons, there exist significant differences in transitions and transversions between the two types of genes.4. Our data indicated that the single mutation tends to occur preferentially when the upstream neighboring-nucleotide is C or G in human disease genes.5. Codon usage bias and synonymous codon usage have great influence on the single mutation in both alternatively and constitutively spliced genes. The GC content and gene length also have very evident influence on such mutations.Our results seem to imply that disease-associated mutations within the coding regions of alternatively spliced human disease genes have different mechanisms from constitutively spliced genes. Such findings may facilitate understanding the molecular mechanism of alternative splicing causing human diseases, and the development of gene therapies for such diseases.
Collapse
Affiliation(s)
- QIANLI HUANG
- Department of Chemistry, University of Science and Technology of China, Hefei 230031, China
- College of Life Sciences, Liaoning Normal University, No. 850, Huanghe Road, Dalian 116029, China
| | - YONG LI
- Department of Computer Science and Technology, Dalian Maritime University, Dalian 116026, China
| | - JESSE LI-LING
- Department of Medical Genetics, China Medical University, Shenyang 110001, China
| | - HUIFANG HUANG
- College of Life Sciences, Liaoning Normal University, No. 850, Huanghe Road, Dalian 116029, China
| | - XUEPING CHEN
- Department of Chemistry, University of Science and Technology of China, Hefei 230031, China
| | - FEI MA
- College of Life Sciences, Liaoning Normal University, No. 850, Huanghe Road, Dalian 116029, China
| |
Collapse
|
33
|
Verspoor RL, Haddrill PR. Genetic diversity, population structure and Wolbachia infection status in a worldwide sample of Drosophila melanogaster and D. simulans populations. PLoS One 2011; 6:e26318. [PMID: 22022599 PMCID: PMC3192181 DOI: 10.1371/journal.pone.0026318] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 09/23/2011] [Indexed: 12/03/2022] Open
Abstract
Drosophila melanogaster and its close relatives have been extremely important model species in the development of population genetic models that serve to explain patterns of diversity in natural populations, a major goal of evolutionary biology. A detailed picture of the evolutionary history of these species is beginning to emerge, as the relative importance of forces including demographic changes and natural selection is established. A continuing aim is to characterise levels of genetic diversity in a large number of populations of these species, covering a wide geographic area. We have used collections from five previously un-sampled wild populations of D. melanogaster and two of D. simulans, across three continents. We estimated levels of genetic diversity within, and divergence between, these populations, and looked for evidence of genetic structure both between ancestral and derived populations, and amongst derived populations. We also investigated the prevalence of infection with the bacterial endosymbiont Wolbachia. We found that D. melanogaster populations from Sub-Saharan Africa are the most diverse, and that divergence is highest between these and non-Sub-Saharan populations. There is strong evidence for structuring of populations between Sub-Saharan Africa and the rest of the world, and some evidence for weak structure amongst derived populations. Populations from Sub-Saharan Africa also differ in the prevalence of Wolbachia infection, with very low levels of infection compared to populations from the rest of the world.
Collapse
Affiliation(s)
- Rudi L. Verspoor
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Penelope R. Haddrill
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
- * E-mail:
| |
Collapse
|
34
|
Abstract
The effective population size (N(e)) is one of the most fundamental parameters in population genetics. It is thought to vary across the genome as a consequence of differences in the rate of recombination and the density of selected sites due to the processes of genetic hitchhiking and background selection. Although it is known that there is intragenomic variation in the effective population size in some species, it is not known whether this is widespread or how much variation in the effective population size there is. Here, we test whether the effective population size varies across the genome, between protein-coding genes, in 10 eukaryotic species by considering whether there is significant variation in neutral diversity, taking into account differences in the mutation rate between loci by using the divergence between species. In most species we find significant evidence of variation. We investigate whether the variation in N(e) is correlated to recombination rate and the density of selected sites in four species, for which these data are available. We find that N(e) is positively correlated to recombination rate in one species, Drosophila melanogaster, and negatively correlated to a measure of the density of selected sites in two others, humans and Arabidopsis thaliana. However, much of the variation remains unexplained. We use a hierarchical Bayesian analysis to quantify the amount of variation in the effective population size and show that it is quite modest in all species-most genes have an N(e) that is within a few fold of all other genes. Nonetheless we show that this modest variation in N(e) is sufficient to cause significant differences in the efficiency of natural selection across the genome, by demonstrating that the ratio of the number of nonsynonymous to synonymous polymorphisms is significantly correlated to synonymous diversity and estimates of N(e), even taking into account the obvious nonindependence between these measures.
Collapse
|
35
|
Castillo DM, Mell JC, Box KS, Blumenstiel JP. Molecular evolution under increasing transposable element burden in Drosophila: a speed limit on the evolutionary arms race. BMC Evol Biol 2011; 11:258. [PMID: 21917173 PMCID: PMC3185285 DOI: 10.1186/1471-2148-11-258] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2011] [Accepted: 09/14/2011] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Genome architecture is profoundly influenced by transposable elements (TEs), and natural selection against their harmful effects is a critical factor limiting their spread. Genome defense by the piRNA silencing pathway also plays a crucial role in limiting TE proliferation. How these two forces jointly determine TE abundance is not well understood. To shed light on the nature of factors that predict TE success, we test three distinct hypotheses in the Drosophila genus. First, we determine whether TE abundance and relaxed genome-wide purifying selection on protein sequences are positively correlated. This serves to test the hypothesis that variation in TE abundance in the Drosophila genus can be explained by the strength of natural selection, relative to drift, acting in parallel against mildly deleterious non-synonymous mutations. Second, we test whether increasing TE abundance is correlated with an increased rate of amino-acid evolution in genes encoding the piRNA machinery, as might be predicted by an evolutionary arms race model. Third, we test whether increasing TE abundance is correlated with greater codon bias in genes of the piRNA machinery. This is predicted if increasing TE abundance selects for increased efficiency in the machinery of genome defense. RESULTS Surprisingly, we find neither of the first two hypotheses to be true. Specifically, we found that genome-wide levels of purifying selection, measured by the ratio of non-synonymous to synonymous substitution rates (ω), were greater in species with greater TE abundance. In addition, species with greater TE abundance have greater levels of purifying selection in the piRNA machinery. In contrast, it appears that increasing TE abundance has primarily driven adaptation in the piRNA machinery by increasing codon bias. CONCLUSIONS These results indicate that within the Drosophila genus, a historically reduced strength of selection relative to drift is unlikely to explain patterns of increased TE success across species. Other factors, such as ecological exposure, are likely to contribute to variation in TE abundances within species. Furthermore, constraints on the piRNA machinery may temper the evolutionary arms race that would drive increasing rates of evolution at the amino acid level. In the face of these constraints, selection may act primarily by improving the translational efficiency of the machinery of genome defense through efficient codon usage.
Collapse
Affiliation(s)
- Dean M Castillo
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence Kansas 66045, USA
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Joshua Chang Mell
- Life Sciences Centre (Zoology), 2350 Health Sciences Mall, University of British Columbia, Vancouver, BC, V6T 3Z4, Canada
| | - Kimberly S Box
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence Kansas 66045, USA
| | - Justin P Blumenstiel
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence Kansas 66045, USA
| |
Collapse
|
36
|
Stoletzki N. The surprising negative correlation of gene length and optimal codon use--disentangling translational selection from GC-biased gene conversion in yeast. BMC Evol Biol 2011; 11:93. [PMID: 21481245 PMCID: PMC3096941 DOI: 10.1186/1471-2148-11-93] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2010] [Accepted: 04/11/2011] [Indexed: 02/06/2023] Open
Abstract
Background Surprisingly, in several multi-cellular eukaryotes optimal codon use correlates negatively with gene length. This contrasts with the expectation under selection for translational accuracy. While suggested explanations focus on variation in strength and efficiency of translational selection, it has rarely been noticed that the negative correlation is reported only in organisms whose optimal codons are biased towards codons that end with G or C (-GC). This raises the question whether forces that affect base composition - such as GC-biased gene conversion - contribute to the negative correlation between optimal codon use and gene length. Results Yeast is a good organism to study this as equal numbers of optimal codons end in -GC and -AT and one may hence compare frequencies of optimal GC- with optimal AT-ending codons to disentangle the forces. Results of this study demonstrate in yeast frequencies of GC-ending (optimal AND non-optimal) codons decrease with gene length and increase with recombination. A decrease of GC-ending codons along genes contributes to the negative correlation with gene length. Correlations with recombination and gene expression differentiate between GC-ending and optimal codons, and also substitution patterns support effects of GC-biased gene conversion. Conclusion While the general effect of GC-biased gene conversion is well known, the negative correlation of optimal codon use with gene length has not been considered in this context before. Initiation of gene conversion events in promoter regions and the presence of a gene conversion gradient most likely explain the observed decrease of GC-ending codons with gene length and gene position.
Collapse
Affiliation(s)
- Nina Stoletzki
- Ludwig-Maximilan Universität, Biocenter, Grosshadernerstr, 2, D-82152 Planegg-Martinsried, Germany.
| |
Collapse
|
37
|
Andolfatto P, Wong KM, Bachtrog D. Effective population size and the efficacy of selection on the X chromosomes of two closely related Drosophila species. Genome Biol Evol 2010; 3:114-28. [PMID: 21173424 PMCID: PMC3038356 DOI: 10.1093/gbe/evq086] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
The prevalence of natural selection relative to genetic drift is of central interest in evolutionary biology. Depending on the distribution of fitness effects of new mutations, the importance of these evolutionary forces may differ in species with different effective population sizes. Here, we survey population genetic variation at 105 orthologous X-linked protein coding regions in Drosophila melanogaster and its sister species D. simulans, two closely related species with distinct demographic histories. We observe significantly higher levels of polymorphism and evidence for stronger selection on codon usage bias in D. simulans, consistent with a larger historical effective population size on average for this species. Despite these differences, we estimate that <10% of newly arising nonsynonymous mutations have deleterious fitness effects in the nearly neutral range (i.e., −10 < Nes < 0) in both species. The inferred distributions of fitness effects and demographic models translate into surprisingly high estimates of the fraction of “adaptive” protein divergence in both species (∼85–90%). Despite evidence for different demographic histories, differences in population size have apparently played little role in the dynamics of protein evolution in these two species, and estimates of the adaptive fraction (α) of protein divergence in both species remain high even if we account for recent 10-fold growth. Furthermore, although several recent studies have noted strong signatures of recurrent adaptive protein evolution at genes involved in immunity, reproduction, sexual conflict, and intragenomic conflict, our finding of high levels of adaptive protein divergence at randomly chosen proteins (with respect to function) suggests that many other factors likely contribute to the adaptive protein divergence signature in Drosophila.
Collapse
Affiliation(s)
- Peter Andolfatto
- Department of Ecology and Evolutionary Biology and Lewis-Sigler Institute for Integrative Genomics, Princeton University, USA.
| | | | | |
Collapse
|
38
|
Analysis of Codon Usage Patterns in Toxic Dinoflagellate Alexandrium tamarense through Expressed Sequence Tag Data. Comp Funct Genomics 2010; 2010:138538. [PMID: 21052492 PMCID: PMC2967832 DOI: 10.1155/2010/138538] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2010] [Accepted: 10/01/2010] [Indexed: 11/29/2022] Open
Abstract
We have analyzed synonymous codon usage in the genome of A. tamarense CCMP 1598 for protein-coding sequences from 10865 expressed sequence tags (ESTs). We reconstructed a total of 4284 unigenes, including 74 ribosomal protein and 40 plastid-related genes, from ESTs using FrameDP, an open reading frame (ORF) prediction program. Correspondence analysis of A. tamarense genes based on codon usage showed that the GC content at the third base of synonymous codons (GC3s) was strongly correlated with the first axis (r = 0.93 with P < .001). On the other hand, the second axis discriminated between presumed highly and low expressed genes, with expression levels being confirmed by the analysis of EST frequencies (r = −0.89 with P < .001). Our results suggest that mutational bias is the major factor in shaping codon usage in A. tamarense genome, but other factors, namely, translational selection, hydropathy, and aromaticity, also appear to influence the selection of codon usage in this species.
Collapse
|
39
|
Estimating the parameters of selection on nonsynonymous mutations in Drosophila pseudoobscura and D. miranda. Genetics 2010; 185:1381-96. [PMID: 20516497 DOI: 10.1534/genetics.110.117614] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present the results of surveys of diversity in sets of >40 X-linked and autosomal loci in samples from natural populations of Drosophila miranda and D. pseudoobscura, together with their sequence divergence from D. affinis. Mean silent site diversity in D. miranda is approximately one-quarter of that in D. pseudoobscura; mean X-linked silent diversity is about three-quarters of that for the autosomes in both species. Estimates of the distribution of selection coefficients against heterozygous, deleterious nonsynonymous mutations from two different methods suggest a wide distribution, with coefficients of variation greater than one, and with the average segregating amino acid mutation being subject to only very weak selection. Only a small fraction of new amino acid mutations behave as effectively neutral, however. A large fraction of amino acid differences between D. pseudoobscura and D. affinis appear to have been fixed by positive natural selection, using three different methods of estimation; estimates between D. miranda and D. affinis are more equivocal. Sources of bias in the estimates, especially those arising from selection on synonymous mutations and from the choice of genes, are discussed and corrections for these applied. Overall, the results show that both purifying selection and positive selection on nonsynonymous mutations are pervasive.
Collapse
|
40
|
Slotte T, Foxe JP, Hazzouri KM, Wright SI. Genome-wide evidence for efficient positive and purifying selection in Capsella grandiflora, a plant species with a large effective population size. Mol Biol Evol 2010; 27:1813-21. [PMID: 20194429 DOI: 10.1093/molbev/msq062] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Recent studies comparing genome-wide polymorphism and divergence in Drosophila have found evidence for a surprisingly high proportion of adaptive amino acid fixations, but results for other taxa are mixed. In particular, few studies have found convincing evidence for adaptive amino acid substitution in plants. To assess the generality of this finding, we have sequenced 257 loci in the outcrossing crucifer Capsella grandiflora, which has a large effective population size and low population structure. Using a new method that jointly infers selective and demographic effects, we estimate that 40% of amino acid substitutions were fixed by positive selection in this species, and we also infer a low proportion of slightly deleterious amino acid mutations. We contrast these estimates with those for a similar data set from the closely related Arabidopsis thaliana and find significantly higher rates of adaptive evolution and fewer nearly neutral mutations in C. grandiflora. In agreement with results for other taxa, genes involved in reproduction show the strongest evidence for positive selection in C. grandiflora. Taken together, these results imply that both positive and purifying selection are more effective in C. grandiflora than in A. thaliana, consistent with the contrasting demographic history and effective population sizes of these species.
Collapse
Affiliation(s)
- Tanja Slotte
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | | | |
Collapse
|
41
|
Zeng K, Charlesworth B. Studying patterns of recent evolution at synonymous sites and intronic sites in Drosophila melanogaster. J Mol Evol 2009; 70:116-28. [PMID: 20041239 DOI: 10.1007/s00239-009-9314-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2009] [Accepted: 12/07/2009] [Indexed: 10/20/2022]
Abstract
Most previous studies of the evolution of codon usage bias (CUB) and intronic GC content (iGC) in Drosophila melanogaster were based on between-species comparisons, reflecting long-term evolutionary events. However, a complete picture of the evolution of CUB and iGC cannot be drawn without knowledge of their more recent evolutionary history. Here, we used a polymorphism dataset collected from Zimbabwe to study patterns of the recent evolution of CUB and iGC. Analyzing coding and intronic data jointly with a model which can simultaneously estimate selection, mutational, and demographic parameters, we have found that: (1) natural selection is probably acting on synonymous codons; (2) a constant population size model seems to be sufficient to explain most of the observed synonymous polymorphism patterns; (3) GC is favored over AT in introns. In agreement with the long-term evolutionary patterns, ongoing selection acting on X-linked synonymous codons is stronger than that acting on autosomal codons. The selective differences between preferred and unpreferred codons tend to be greater than the differences between GC and AT in introns, suggesting that natural selection, not just biased gene conversion, may have influenced the evolution of CUB. Interestingly, evidence for non-equilibrium evolution comes exclusively from the intronic data. However, three different models, an equilibrium model with two classes of selected sites and two non-equilibrium models with changes in either population size or mutational parameters, fit the intronic data equally well. These results show that using inadequate selection (or demographic) models can result in incorrect estimates of demographic (or selection) parameters.
Collapse
Affiliation(s)
- Kai Zeng
- Ashworth Laboratories, School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK.
| | | |
Collapse
|
42
|
Takahashi A. Effect of exonic splicing regulation on synonymous codon usage in alternatively spliced exons of Dscam. BMC Evol Biol 2009; 9:214. [PMID: 19709440 PMCID: PMC2741454 DOI: 10.1186/1471-2148-9-214] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2008] [Accepted: 08/27/2009] [Indexed: 12/31/2022] Open
Abstract
Background Synonymous codon usage is typically biased towards translationally superior codons in many organisms. In Drosophila, genomic data indicates that translationally optimal codons and splice optimal codons are mostly mutually exclusive, and adaptation to translational efficiency is reduced in the intron-exon boundary regions where potential exonic splicing enhancers (ESEs) reside. In contrast to genomic scale analyses on large datasets, a refined study on a well-controlled set of samples can be effective in demonstrating the effects of particular splice-related factors. Down syndrome cell adhesion molecule (Dscam) has the largest number of alternatively spliced exons (ASEs) known to date, and the splicing frequency of each ASE is accessible from the relative abundance of the transcript. Thus, these ASEs comprise a unique model system for studying the effect of splicing regulation on synonymous codon usage. Results Codon Bias Indices (CBI) in the 3' boundary regions were reduced compared to the rest of the exonic regions among 48 and 33 ASEs of exon 6 and 9 clusters, respectively. These regional differences in CBI were affected by splicing frequency and distance from adjacent exons. Synonymous divergence levels between the 3' boundary region and the remaining exonic region of exon 6 ASEs were similar. Additionally, another sensitive comparison of paralogous exonic regions in recently retrotransposed processed genes and their parental genes revealed that, in the former, the differences in CBI between what were formerly the central regions and the boundary regions gradually became smaller over time. Conclusion Analyses of the multiple ASEs of Dscam allowed direct tests of the effect of splice-related factors on synonymous codon usage and provided clear evidence that synonymous codon usage bias is restricted by exonic splicing signals near the intron-exon boundary. A similar synonymous divergence level between the different exonic regions suggests that the intensity of splice-related selection is generally weak and comparable to that of translational selection. Finally, the leveling off of differences in codon bias over time in retrotransposed genes meets the direct prediction of the tradeoff model that invokes conflict between translational superiority and splicing regulation, and strengthens the conclusions obtained from Dscam.
Collapse
Affiliation(s)
- Aya Takahashi
- Division of Population Genetics, National Institute of Genetics, Mishima 411-8540, Japan.
| |
Collapse
|
43
|
Estimating selection intensity on synonymous codon usage in a nonequilibrium population. Genetics 2009; 183:651-62, 1SI-23SI. [PMID: 19620398 DOI: 10.1534/genetics.109.101782] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Codon usage bias is the nonrandom use of synonymous codons for the same amino acid. Most population genetic models of codon usage evolution assume that the population is at mutation-selection-drift equilibrium. Natural populations, however, frequently deviate from equilibrium, often because of recent demographic changes. Here, we construct a matrix model that includes the effects of a recent change in population size on estimates of selection on preferred vs. unpreferred codons. Our results suggest that patterns of synonymous polymorphisms affecting codon usage can be quite erratic after such a change; statistical methods that fail to take demographic effects into account can then give incorrect estimates of important parameters. We propose a new method that can accurately estimate both demographic and codon usage parameters. The method also provides a simple way of testing for the effects of covariates such as gene length and level of gene expression on the intensity of selection, which we apply to a large Drosophila melanogaster polymorphism data set. Our analyses of twofold degenerate codons reveal that (i) selection acts in favor of preferred codons, (ii) there is mutational bias in favor of unpreferred codons, (iii) shorter genes and genes with higher expression levels are under stronger selection, and (iv) there is little evidence for a recent change in population size in the Zimbabwe population of D. melanogaster.
Collapse
|
44
|
Abstract
Over the past four decades, the predominant view of molecular evolution saw little connection between natural selection and genome evolution, assuming that the functionally constrained fraction of the genome is relatively small and that adaptation is sufficiently infrequent to play little role in shaping patterns of variation within and even between species. Recent evidence from Drosophila, reviewed here, suggests that this view may be invalid. Analyses of genetic variation within and between species reveal that much of the Drosophila genome is under purifying selection, and thus of functional importance, and that a large fraction of coding and noncoding differences between species are adaptive. The findings further indicate that, in Drosophila, adaptations may be both common and strong enough that the fate of neutral mutations depends on their chance linkage to adaptive mutations as much as on the vagaries of genetic drift. The emerging evidence has implications for a wide variety of fields, from conservation genetics to bioinformatics, and presents challenges to modelers and experimentalists alike.
Collapse
|
45
|
DuMont VLB, Singh ND, Wright MH, Aquadro CF. Locus-specific decoupling of base composition evolution at synonymous sites and introns along the Drosophila melanogaster and Drosophila sechellia lineages. Genome Biol Evol 2009; 1:67-74. [PMID: 20333178 PMCID: PMC2817403 DOI: 10.1093/gbe/evp008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/15/2009] [Indexed: 12/20/2022] Open
Abstract
Selection is thought to be partially responsible for patterns of molecular evolution at synonymous sites within numerous Drosophila species. Recently, “per-site” and likelihood methods have been developed to detect loci for which positive selection is a major component of synonymous site evolution. An underlying assumption of these methods, however, is a homogeneous mutation process. To address this potential shortcoming, we perform a complementary analysis making gene-by-gene comparisons of paired synonymous site and intron substitution rates toward and away from the nucleotides G and C because preferred codons are G or C ending in Drosophila. This comparison may reduce both the false-positive rate (due to broadscale heterogeneity in mutation) and false-negative rate (due to lack of power comparing small numbers of sites) of the per-site and likelihood methods. We detect loci with patterns of evolution suggestive of synonymous site selection pressures predominately favoring unpreferred and preferred codons along the Drosophila melanogaster and Drosophila sechellia lineages, respectively. Intron selection pressures do not appear sufficient to explain all these results as the magnitude of the difference in synonymous and intron evolution is dependent on recombination environment and chromosomal location in a direction supporting the hypothesis of selectively driven synonymous fixations. This comparison identifies 101 loci with an apparent switch in codon preference between D. melanogaster and D. sechellia, a pattern previously only observed at the Notch locus.
Collapse
|
46
|
Singh ND, Arndt PF, Clark AG, Aquadro CF. Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila. Mol Biol Evol 2009; 26:1591-605. [PMID: 19351792 DOI: 10.1093/molbev/msp071] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Rates of single nucleotide substitution in Drosophila are highly variable within the genome, and several examples illustrate that evolutionary rates differ among Drosophila species as well. Here, we use a maximum likelihood method to quantify lineage-specific substitutional patterns and apply this method to 4-fold degenerate synonymous sites and introns from more than 8,000 genes aligned in the Drosophila melanogaster group. We find that within species, different classes of sequence evolve at different rates, with long introns evolving most slowly and short introns evolving most rapidly. Relative rates of individual single nucleotide substitutions vary approximately 3-fold among lineages, yielding patterns of substitution that are comparatively less GC-biased in the melanogaster species complex relative to Drosophila yakuba and Drosophila erecta. These results are consistent with a model coupling a mutational shift toward reduced GC content, or a shift in mutation-selection balance, in the D. melanogaster species complex, with variation in selective constraint among different classes of DNA sequence. Finally, base composition of coding and intronic sequences is not at equilibrium with respect to substitutional patterns, which primarily reflects the slow rate of the substitutional process. These results thus support the view that mutational and/or selective processes are labile on an evolutionary timescale and that if the process is indeed selection driven, then the distribution of selective constraint is variable across the genome.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Molecular Biology and Genetics, Cornell University.
| | | | | | | |
Collapse
|
47
|
Lu B, Wang N, Xiao J, Xu Y, Murphy RW, Huang D. Expression and evolutionary divergence of the non-conventional olfactory receptor in four species of fig wasp associated with one species of fig. BMC Evol Biol 2009; 9:43. [PMID: 19232102 PMCID: PMC2661049 DOI: 10.1186/1471-2148-9-43] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2008] [Accepted: 02/20/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The interactions of fig wasps and their host figs provide a model for investigating co-evolution. Fig wasps have specialized morphological characters and lifestyles thought to be adaptations to living in the fig's syconium. Although these aspects of natural history are well documented, the genetic mechanism(s) underlying these changes remain(s) unknown. Fig wasp olfaction is the key to host-specificity. The Or83b gene class, an unusual member of olfactory receptor family, plays a critical role in enabling the function of conventional olfactory receptors. Four Or83b orthologous genes from one pollinator (PFW) (Ceratosolen solmsi) and three non-pollinator fig wasps (NPFWs) (Apocrypta bakeri, Philotrypesis pilosa and Philotrypesis sp.) associated with one species of fig (Ficus hispida) can be used to better understand the molecular mechanism underlying the fig wasp's adaptation to its host. We made a comparison of spatial tissue-specific expression patterns and substitution rates of one orthologous gene in these fig wasps and sought evidence for selection pressures. RESULTS A newly identified Or83b orthologous gene was named Or2. Expressions of Or2 were restricted to the heads of all wingless male fig wasps, which usually live in the dark cavity of a fig throughout their life cycle. However, expressions were widely detected in the antennae, legs and abdomens of all female fig wasps that fly from one fig to another for oviposition, and secondarily pollination. Weak expression was also observed in the thorax of PFWs. Compared with NPFWs, the Or2 gene in C. solmsi had an elevated rate of substitutions and lower codon usage. Analyses using Tajima's D, Fu and Li's D* and F* tests indicated a non-neutral pattern of nucleotide variation in all fig wasps. Unlike in NPFWs, this non-neutral pattern was also observed for synonymous sites of Or2 within PFWs. CONCLUSION The sex- and species-specific expression patterns of Or2 genes detected beyond the known primary olfactory tissues indicates the location of cryptic olfactory inputs. The specialized ecological niche of these wasps explains the unique habits and adaptive evolution of Or2 genes. The Or2 gene in C. solmsi is evolving very rapidly. Negative deviation from the neutral model of evolution reflects possible selection pressures acting on Or2 sequences of fig wasp, particularly on PFWs who are more host-specific to figs.
Collapse
Affiliation(s)
- Bin Lu
- College of Plant Protection, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| | | | | | | | | | | |
Collapse
|
48
|
Petit N, Barbadilla A. Selection efficiency and effective population size in Drosophila species. J Evol Biol 2008; 22:515-26. [PMID: 19170822 DOI: 10.1111/j.1420-9101.2008.01672.x] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
A corollary of the nearly neutral theory of molecular evolution is that the efficiency of natural selection depends on effective population size. In this study, we evaluated the differences in levels of synonymous polymorphism among Drosophila species and showed that these differences can be explained by differences in effective population size. The differences can have implications for the molecular evolution of the Drosophila species, as is suggested by our results showing that the levels of codon bias and the proportion of adaptive substitutions are both higher in species with higher levels of synonymous polymorphism. Moreover, species with lower synonymous polymorphism have higher levels of nonsynonymous polymorphism and larger content of repetitive sequences in their genomes, suggesting a diminished efficiency of selection in species with smaller effective population size.
Collapse
Affiliation(s)
- N Petit
- Group of Genomics, Bioinformatics and Evolution, Departament de Genètica i Microbiologia, Facultat de Biociències, Universitat Autònoma de Barcelona, Bellaterra, Spain.
| | | |
Collapse
|
49
|
Bachtrog D. Similar rates of protein adaptation in Drosophila miranda and D. melanogaster, two species with different current effective population sizes. BMC Evol Biol 2008; 8:334. [PMID: 19091130 PMCID: PMC2633301 DOI: 10.1186/1471-2148-8-334] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2008] [Accepted: 12/18/2008] [Indexed: 01/13/2023] Open
Abstract
BACKGROUND Adaptive protein evolution is common in several Drosophila species investigated. Some studies point to very weak selection operating on amino-acid mutations, with average selection intensities on the order of Nes approximately in D. melanogaster and D. simulans. Species with lower effective population sizes should undergo less adaptation since they generate fewer mutations and selection is ineffective on a greater proportion of beneficial mutations. RESULTS Here I study patterns of polymorphism and divergence at 91 X-linked loci in D. miranda, a species with a roughly 5-fold smaller effective population size than D. melanogaster. Surprisingly, I find a similar fraction of amino-acid mutations being driven to fixation by positive selection in D. miranda and D. melanogaster. Genes with higher rates of amino-acid evolution show lower levels of neutral diversity, a pattern predicted by recurrent adaptive protein evolution. I fit a hitchhiking model to patterns of polymorphism in D. miranda and D. melanogaster and estimate an order of magnitude higher selection coefficients for beneficial mutations in D. miranda. CONCLUSION This analysis suggests that effective population size may not be a major determinant in rates of protein adaptation. Instead, adaptation may not be mutation-limited, or the distribution of fitness effects for beneficial mutations might differ vastly between different species or populations. Alternative explanation such as biases in estimating the fraction of beneficial mutations or slightly deleterious mutation models are also discussed.
Collapse
Affiliation(s)
- Doris Bachtrog
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
50
|
A multispecies approach for comparing sequence evolution of X-linked and autosomal sites inDrosophila. Genet Res (Camb) 2008; 90:421-31. [DOI: 10.1017/s0016672308009804] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
SummaryPopulation genetics models show that, under certain conditions, the X chromosome is expected to be under more efficient selection than the autosomes. This could lead to ‘faster-X evolution’, if a large proportion of mutations are fixed by positive selection, as suggested by recent studies inDrosophila. We used a multispecies approach to test this: Muller's element D, an autosomal arm, is fused to the ancestral X chromosome inDrosophila pseudoobscuraand its sister species,Drosophila affinis. We tested whether the same set of genes had higher rates of non-synonymous evolution when they were X-linked (in theD. pseudoobscura/D. affiniscomparison) than when they were autosomal (inDrosophila melanogaster/Drosophila yakuba). Although not significant, our results suggest this may be the case, but only for genes under particularly strong positive selection/weak purifying selection. They also suggest that genes that have become X-linked have higher levels of codon bias and slower synonymous site evolution, consistent with more effective selection on codon usage at X-linked sites.
Collapse
|