1
|
Zhang Z, Townsend JP. Maximum-likelihood model averaging to profile clustering of site types across discrete linear sequences. PLoS Comput Biol 2009; 5:e1000421. [PMID: 19557160 PMCID: PMC2695770 DOI: 10.1371/journal.pcbi.1000421] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2009] [Accepted: 05/21/2009] [Indexed: 11/19/2022] Open
Abstract
A major analytical challenge in computational biology is the detection and description of clusters of specified site types, such as polymorphic or substituted sites within DNA or protein sequences. Progress has been stymied by a lack of suitable methods to detect clusters and to estimate the extent of clustering in discrete linear sequences, particularly when there is no a priori specification of cluster size or cluster count. Here we derive and demonstrate a maximum likelihood method of hierarchical clustering. Our method incorporates a tripartite divide-and-conquer strategy that models sequence heterogeneity, delineates clusters, and yields a profile of the level of clustering associated with each site. The clustering model may be evaluated via model selection using the Akaike Information Criterion, the corrected Akaike Information Criterion, and the Bayesian Information Criterion. Furthermore, model averaging using weighted model likelihoods may be applied to incorporate model uncertainty into the profile of heterogeneity across sites. We evaluated our method by examining its performance on a number of simulated datasets as well as on empirical polymorphism data from diverse natural alleles of the Drosophila alcohol dehydrogenase gene. Our method yielded greater power for the detection of clustered sites across a breadth of parameter ranges, and achieved better accuracy and precision of estimation of clusters, than did the existing empirical cumulative distribution function statistics.
Collapse
Affiliation(s)
- Zhang Zhang
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
| | - Jeffrey P. Townsend
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut, United States of America
- * E-mail:
| |
Collapse
|
2
|
Díaz-Castillo C, Golic KG. Evolution of gene sequence in response to chromosomal location. Genetics 2007; 177:359-74. [PMID: 17890366 PMCID: PMC2013720 DOI: 10.1534/genetics.107.077081] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2007] [Accepted: 06/06/2007] [Indexed: 12/26/2022] Open
Abstract
Evolutionary forces acting on the repetitive DNA of heterochromatin are not constrained by the same considerations that apply to protein-coding genes. Consequently, such sequences are subject to rapid evolutionary change. By examining the Troponin C gene family of Drosophila melanogaster, which has euchromatic and heterochromatic members, we find that protein-coding genes also evolve in response to their chromosomal location. The heterochromatic members of the family show a reduced CG content and increased variation in DNA sequence. We show that the CG reduction applies broadly to the protein-coding sequences of genes located at the heterochromatin:euchromatin interface, with a very strong correlation between CG content and the distance from centric heterochromatin. We also observe a similar trend in the transition from telomeric heterochromatin to euchromatin. We propose that the methylation of DNA is one of the forces driving this sequence evolution.
Collapse
|
3
|
Keller I, Bensasson D, Nichols RA. Transition-transversion bias is not universal: a counter example from grasshopper pseudogenes. PLoS Genet 2007; 3:e22. [PMID: 17274688 PMCID: PMC1790724 DOI: 10.1371/journal.pgen.0030022] [Citation(s) in RCA: 108] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2006] [Accepted: 12/18/2006] [Indexed: 12/02/2022] Open
Abstract
Comparisons of the DNA sequences of metazoa show an excess of transitional over transversional substitutions. Part of this bias is due to the relatively high rate of mutation of methylated cytosines to thymine. Postmutation processes also introduce a bias, particularly selection for codon-usage bias in coding regions. It is generally assumed, however, that there is a universal bias in favour of transitions over transversions, possibly as a result of the underlying chemistry of mutation. Surprisingly, this underlying trend has been evaluated only in two types of metazoan, namely Drosophila and the Mammalia. Here, we investigate a third group, and find no such bias. We characterize the point substitution spectrum in Podisma pedestris, a grasshopper species with a very large genome. The accumulation of mutations was surveyed in two pseudogene families, nuclear mitochondrial and ribosomal DNA sequences. The cytosine-guanine (CpG) dinucleotides exhibit the high transition frequencies expected of methylated sites. The transition rate at other cytosine residues is significantly lower. After accounting for this methylation effect, there is no significant difference between transition and transversion rates. These results contrast with reports from other taxa and lead us to reject the hypothesis of a universal transition/transversion bias. Instead we suggest fundamental interspecific differences in point substitution processes. Some mutations occur more frequently than others. We need to understand these biases if we are to interpret the differences that have accumulated between species and individuals. Applications include estimating the time since evolutionary lineages diverged and detecting the signature of natural selection in DNA sequences. However, mutational biases have been obscured because, since mutations arose, natural selection has eliminated some whilst allowing others to persist to the present. We therefore study mutations that have accumulated in regions of the genome that are free from selection in a grasshopper with a gigantic genome. All other animal studies using this approach find an excess of mutations between DNA bases having similar biochemical properties (transitions rather than transversions). This bias has been widely interpreted as a consequence of the fundamental biochemical basis of mutation. However, once we exclude mutations associated with DNA methylation, we find no evidence of a transition bias, unlike the few comparable animal studies that make the same correction. We propose that this result indicates previously unanticipated differences between species in the selection on or mutation of their DNA.
Collapse
Affiliation(s)
- Irene Keller
- School of Biological and Chemical Sciences, Queen Mary, University of London, London, United Kingdom.
| | | | | |
Collapse
|
4
|
Gu Z, David L, Petrov D, Jones T, Davis RW, Steinmetz LM. Elevated evolutionary rates in the laboratory strain of Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 2005; 102:1092-7. [PMID: 15647350 PMCID: PMC545845 DOI: 10.1073/pnas.0409159102] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
By using the maximum likelihood method, we made a genome-wide comparison of the evolutionary rates in the lineages leading to the laboratory strain (S288c) and a wild strain (YJM789) of Saccharomyces cerevisiae and found that genes in the laboratory strain tend to evolve faster than in the wild strain. The pattern of elevated evolution suggests that relaxation of selection intensity is the dominant underlying reason, which is consistent with recurrent bottlenecks in the S. cerevisiae laboratory strain population. Supporting this conclusion are the following observations: (i) the increases in nonsynonymous evolutionary rate occur for genes in all functional categories; (ii) most of the synonymous evolutionary rate increases in S288c occur in genes with strong codon usage bias; (iii) genes under stronger negative selection have a larger increase in nonsynonymous evolutionary rate; and (iv) more genes with adaptive evolution were detected in the laboratory strain, but they do not account for the majority of the increased evolution. The present discoveries suggest that experimental and possible industrial manipulations of the laboratory strain of yeast could have had a strong effect on the genetic makeup of this model organism. Furthermore, they imply an evolution of laboratory model organisms away from their wild counterparts, questioning the relevancy of the models especially when extensive laboratory cultivation has occurred. In addition, these results shed light on the evolution of livestock and crop species that have been under human domestication for years.
Collapse
Affiliation(s)
- Zhenglong Gu
- Stanford Genome Technology Center, 855 California Avenue, Palo Alto, CA 94304, USA
| | | | | | | | | | | |
Collapse
|
5
|
Morton BR. Selection at the amino acid level can influence synonymous codon usage: implications for the study of codon adaptation in plastid genes. Genetics 2001; 159:347-58. [PMID: 11560910 PMCID: PMC1461792 DOI: 10.1093/genetics/159.1.347] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A previously employed method that uses the composition of noncoding DNA as the basis of a test for selection between synonymous codons in plastid genes is reevaluated. The test requires the assumption that in the absence of selective differences between synonymous codons the composition of silent sites in coding sequences will match the composition of noncoding sites. It is demonstrated here that this assumption is not necessarily true and, more generally, that using compositional properties to draw inferences about selection on silent changes in coding sequences is much more problematic than commonly assumed. This is so because selection on nonsynonymous changes can influence the composition of synonymous sites (i.e., codon usage) in a complex manner, meaning that the composition biases of different silent sites, including neutral noncoding DNA, are not comparable. These findings also draw into question the commonly utilized method of investigating how selection to increase translation accuracy influences codon usage. The work then focuses on implications for studies that assess codon adaptation, which is selection on codon usage to enhance translation rate, in plastid genes. A new test that does not require the use of noncoding DNA is proposed and applied. The results of this test suggest that far fewer plastid genes display codon adaptation than previously thought.
Collapse
Affiliation(s)
- B R Morton
- Department of Biological Sciences, Barnard College, Columbia University, New York, New York 10027, USA.
| |
Collapse
|
6
|
Musto H, Cruveiller S, D'Onofrio G, Romero H, Bernardi G. Translational selection on codon usage in Xenopus laevis. Mol Biol Evol 2001; 18:1703-7. [PMID: 11504850 DOI: 10.1093/oxfordjournals.molbev.a003958] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A correspondence analysis of codon usage in Xenopus laevis revealed that the first axis is strongly correlated with the base composition at third codon positions. The second axis discriminates between putatively highly expressed genes and the other coding sequences, with expression levels being confirmed by the analysis of Expressed sequence tag frequencies. The comparison of codon usage of the sequences displaying the extreme values on the second axis indicates that several codons are statistically more frequent among the highly expressed (mainly housekeeping) genes. Translational selection appears, therefore, to influence synonymous codon usage in Xenopus.
Collapse
Affiliation(s)
- H Musto
- Laboratorio di Evoluzione Molecolare, Stazione Zoologica Anton Dohrn, Naples, Italy
| | | | | | | | | |
Collapse
|
7
|
Takano-Shimizu T. Local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. Mol Biol Evol 2001; 18:606-19. [PMID: 11264413 DOI: 10.1093/oxfordjournals.molbev.a003841] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
I present here evidence of remarkable local changes in GC/AT substitution biases and in crossover frequencies on Drosophila chromosomes. The substitution pattern at 10 loci in the telomeric region of the X chromosome was studied for four species of the Drosophila melanogaster species subgroup. Drosophila orena and Drosophila erecta are clearly the most closely related species pair (the erecta complex) among the four species studied; however, the overall data at the 10 loci revealed a clear dichotomy in the silent substitution patterns between the AT-biased- substitution melanogaster and erecta lineages and the GC-biased-substitution yakuba and orena lineages, suggesting two or more independent changes in GC/AT substitution biases. More importantly, the results indicated a between- loci heterogeneity in GC/AT substitution bias in this small region independently in the yakuba and orena lineages. Indeed, silent substitutions in the orena lineage were significantly biased toward G and C at the consecutive yellow, lethal of scute, and asense loci, but they were significantly biased toward A and T at sta. The substitution bias toward G and C was centered in different areas in yakuba (significantly biased at EG:165H7.3, EG:171D11.2, and suppressor of sable). The similar silent substitution patterns in coding and noncoding regions, furthermore, suggested mutational biases as a cause of the substitution biases. On the other hand, previous study reveals that Drosophila yakuba has about 20-fold higher crossover frequencies in the telomeric region of the X chromosome than does D. melanogaster; this study revealed that the total genetic map length of the yakuba X chromosome was only about 1.5 times as large as that of melanogaster and that the map length of the X-telomeric y-sta region did not differ between Drosophila yakuba and D. erecta. Taken together, the data strongly suggested that an approximately 20- fold reduction in the X-telomeric crossover frequencies occurred in the ancestral population of D. melanogaster after the melanogaster-yakuba divergence but before the melanogaster-simulans divergence.
Collapse
Affiliation(s)
- T Takano-Shimizu
- Department of Population Genetics, National Institute of Genetics, Mishima, Shizuoka-ken, Japan.
| |
Collapse
|
8
|
Dunn KA, Bielawski JP, Yang Z. Substitution rates in Drosophila nuclear genes: implications for translational selection. Genetics 2001; 157:295-305. [PMID: 11139510 PMCID: PMC1461466 DOI: 10.1093/genetics/157.1.295] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The relationships between synonymous and nonsynonymous substitution rates and between synonymous rate and codon usage bias are important to our understanding of the roles of mutation and selection in the evolution of Drosophila genes. Previous studies used approximate estimation methods that ignore codon bias. In this study we reexamine those relationships using maximum-likelihood methods to estimate substitution rates, which accommodate the transition/transversion rate bias and codon usage bias. We compiled a sample of homologous DNA sequences at 83 nuclear loci from Drosophila melanogaster and at least one other species of Drosophila. Our analysis was consistent with previous studies in finding that synonymous rates were positively correlated with nonsynonymous rates. Our analysis differed from previous studies, however, in that synonymous rates were unrelated to codon bias. We therefore conducted a simulation study to investigate the differences between approaches. The results suggested that failure to properly account for multiple substitutions at the same site and for biased codon usage by approximate methods can lead to an artifactual correlation between synonymous rate and codon bias. Implications of the results for translational selection are discussed.
Collapse
Affiliation(s)
- K A Dunn
- Department of Biology, Galton Laboratory, University College, London NW1 2HE, United Kingdom.
| | | | | |
Collapse
|
9
|
Abstract
In the present work we show that in the Drosophila genome (which covers a 37-51% GC range at a DNA size of approx.50kb) a linear correlation holds between GC (or GC(3)50kb) genomic sequences embedding them. This correlation allows us to position the two compositional distributions of (a) coding sequences, and (b) of long DNA segments relative to each other and to calculate gene concentration across the compositional range of the Drosophila genome. Using this approach, we show that gene concentration increases with increasing GC of the regions embedding the genes, reaching a 7-fold higher level in the GC-richest regions compared with the GC-poorest regions. The gene distribution of the Drosophila genome is, therefore, similar to (although less striking than) that of the human genome, whereas it is very different from those of the Arabidopsis genome, which has about the same size as the Drosophila genome.
Collapse
Affiliation(s)
- K Jabbari
- Laboratoire de Génétique Moléculaire, Institut Jacques Monod, 2 Place Jussieu, 75005, Paris, France
| | | |
Collapse
|
10
|
Chen Y, Carlini DB, Baines JF, Parsch J, Braverman JM, Tanda S, Stephan W. RNA secondary structure and compensatory evolution. Genes Genet Syst 1999; 74:271-86. [PMID: 10791023 DOI: 10.1266/ggs.74.271] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The classic concept of epistatic fitness interactions between genes has been extended to study interactions within gene regions, especially between nucleotides that are important in maintaining pre-mRNA/mRNA secondary structures. It is shown that the majority of linkage disequilibria found within the Drosophila Adh gene are likely to be caused by epistatic selection operating on RNA secondary structures. A recently proposed method of RNA secondary structure prediction based on DNA sequence comparisons is reviewed and applied to several types of RNAs, including tRNA, rRNA, and mRNA. The patterns of covariation in these RNAs are analyzed based on Kimura's compensatory evolution model. The results suggest that this model describes the substitution process in the pairing regions (helices) of RNA secondary structures well when the helices are evolutionarily conserved and thermodynamically stable, but fails in some other cases. Epistatic selection maintaining pre-mRNA/mRNA secondary structures is compared to weak selective forces that determine features such as base composition and synonymous codon usage. The relationships among these forces and their relative strengths are addressed. Finally, our mutagenesis experiments using the Drosophila Adh locus are reviewed. These experiments analyze long-range compensatory interactions between the 5' and 3' ends of Adh mRNA, the different constraints on secondary structures in introns and exons, and the possible role of secondary structures in RNA splicing.
Collapse
Affiliation(s)
- Y Chen
- Department of Biology, University of Rochester, NY 14627, USA
| | | | | | | | | | | | | |
Collapse
|
11
|
Abstract
I studied the cause of the significant difference in the synonymous-substitution pattern found in the achaete-scute complex genes in two Drosophila lineages, higher codon bias in Drosophila yakuba, and lower bias in D. melanogaster. Besides these genes, the functionally unrelated yellow gene showed the same substitution pattern, suggesting a region-dependent phenomenon in the X-chromosome telomere. Because the numbers of A/T --> G/C substitutions were not significantly different from those of G/C --> A/T in the yellow noncoding regions of these species, a AT/GC mutational bias could not completely account for the synonymous-substitution biases. In contrast, we did find an approximately 14-fold difference in recombination rates in the X-chromosome telomere regions between the two species, suggesting that the reduction of recombination rates in this region resulted in the reduction of the efficacy of selection in D. melanogaster. In addition, the D. orena yellow showed a 5% increase in the G + C content at silent sites in the coding and noncoding regions since the divergence from D. erecta. This pattern was significantly different from those at the orena Adh and Amy loci. These results suggest that local changes in recombination rates and mutational pressures are contributing to the irregular synonymous-substitution patterns in Drosophila.
Collapse
Affiliation(s)
- T Takano-Shimizu
- Department of Population Genetics, National Institute of Genetics, Mishima, Shizuoka-ken 411-8540, Japan.
| |
Collapse
|
12
|
Rodríguez-Trelles F, Tarrío R, Ayala FJ. Switch in codon bias and increased rates of amino acid substitution in the Drosophila saltans species group. Genetics 1999; 153:339-50. [PMID: 10471717 PMCID: PMC1460741 DOI: 10.1093/genetics/153.1.339] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We investigated the nucleotide composition of five genes, Xdh, Adh, Sod, Per, and 28SrRNA, in nine species of Drosophila (subgenus Sophophora) and one of Scaptodrosophila. The six species of the Drosophila saltans group markedly differ from the others in GC content and codon use bias. The GC content in the third codon position, and to a lesser extent in the first position and the introns, is higher in the D. melanogaster and D. obscura groups than in the D. saltans group (in Scaptodrosophila it is intermediate but closer to the melanogaster and obscura species). Differences are greater for Xdh than for Adh, Sod, Per, and 28SrRNA, which are functionally more constrained. We infer that rapid evolution of GC content in the saltans lineage is largely due to a shift in mutation pressure, which may have been associated with diminished natural selection due to smaller effective population numbers rather than reduced recombination rates. The rate of GC content evolution impacts the rate of protein evolution and may distort phylogenetic inferences. Previous observations suggesting that GC content evolution is very limited in Drosophila may have been distorted due to the restricted number of genes and species (mostly D. melanogaster) investigated.
Collapse
Affiliation(s)
- F Rodríguez-Trelles
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-2525, USA.
| | | | | |
Collapse
|
13
|
Abstract
An interplay among experimental studies of protein synthesis, evolutionary theory, and comparisons of DNA sequence data has shed light on the roles of natural selection and genetic drift in 'silent' DNA evolution.
Collapse
Affiliation(s)
- H Akashi
- Department of Ecology and Evolutionary Biology, University of Kansas, Lawrence, Kansas 66045-2106, USA.
| | | |
Collapse
|
14
|
Mutation pressure, natural selection, and the evolution of base composition in Drosophila. ACTA ACUST UNITED AC 1998. [DOI: 10.1007/978-94-011-5210-5_5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2023]
|
15
|
Abstract
Although non-random patterns of synonymous codon usage are a prominent feature in the genomes of many organisms, the relatives roles of mutational biases and natural selection in maintaining codon bias remain a contentious issue. In some species, patterns of codon bias and empirical findings on the biology of translation suggest 'major codon preference', a balance among mutation pressure, genetic drift, and weak selection in favor of translationally superior codons. Population genetics theory makes testable predictions to distinguish such a model from a strictly mutational model of codon bias. Major codon preference predicts two fitness classes of synonymous DNA changes: 'preferred' mutations from non-major to major codons and 'unpreferred' changes in the opposite direction. An extension of current statistical methods is employed to reveal differences in the within and between species dynamics of preferred and unpreferred silent mutations in Drosophila simulans. In this lineage, codon bias appears to be maintained under roughly equal magnitudes of natural selection and genetic drift. In the sibling species, D. melanogaster, however, a reduction in N(e)s, the product of effective population size and selection coefficient, appears to have allowed a genome-wide reduction in codon bias.
Collapse
Affiliation(s)
- H Akashi
- Section of Evolution and Ecology, University of California, Davis 95616, USA.
| |
Collapse
|
16
|
Akashi H, Schaeffer SW. Natural selection and the frequency distributions of "silent" DNA polymorphism in Drosophila. Genetics 1997; 146:295-307. [PMID: 9136019 PMCID: PMC1207945 DOI: 10.1093/genetics/146.1.295] [Citation(s) in RCA: 139] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
In Escherichia coli, Saccharomyces cerevisiae, and Drosophila melanogaster, codon bias may be maintained by a balance among mutation pressure, genetic drift, and natural selection favoring translationally superior codons. Under such an evolutionary model, silent mutations fall into two fitness categories: preferred mutations that increase codon bias and unpreferred changes in the opposite direction. This prediction can be tested by comparing the frequency spectra of synonymous changes segregating within populations; natural selection will elevate the frequencies of advantageous mutations relative to that of deleterious changes. The frequency distributions of preferred and unpreferred mutations differ in the predicted direction among 99 alleles of two D. pseudoobscura genes and five alleles of eight D. simulans genes. This result confirms the existence of fitness classes of silent mutations. Maximum likelihood estimates suggest that selection intensity at silent sites is, on average, very weak in both D. pseudoobscura and D. simulans (magnitude of NS approximately 1). Inference of evolutionary processes from within-species sequence variation is often hindered by the assumption of a stationary frequency distribution. This assumption can be avoided when identifying the action of selection and tested when estimating selection intensity.
Collapse
Affiliation(s)
- H Akashi
- Section of Evolution and Ecology, University of California at Davis 95616, USA.
| | | |
Collapse
|
17
|
Meghlaoui GK, Veuille M. Selection and methionine accumulation in the fat body protein 2 gene (FBP2), a duplicate of the Drosophila alcohol dehydrogenase (ADH) gene. J Mol Evol 1997; 44:23-32. [PMID: 9010133 DOI: 10.1007/pl00006118] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
The Drosophila fat body protein 2 gene (Fbp2) is an ancient duplication of the alcohol dehydrogenase gene (Adh) which encodes a protein that differs substantially from ADH in its methionine content. In D. melanogaster, there is one methionine in ADH, while there are 51 (20% of all amino acids) in FBP2. Methionine is involved in 46% of amino acid replacements when Fbp2 DNA sequences are compared between D. melanogaster and D. pseudoobscura. Methionine accumulation does not affect conserved residues of the ADH-ADHr-FBP2 multigene family. The multigene family has evolved by replacement of mildly hydrophobic amino acids by methionine with no apparent reversion. Its short-term evolution was compared between two Drosophila species, while its long-term evolution was compared between two genera belonging respectively to acalyptrate and calyptrate Diptera, Drosophila and Sarcophaga. The pattern of nucleotide substitution was consistent with an independent accumulation of methionines at the Fbp2 locus in each lineage. Under a steady-state model, the rate of methionine accumulation was constant in the lineage leading to Drosophila, and was twice as fast as that in the calyptrate lineage. Substitution rates were consistent with a slight positive selective advantage for each methionine change in about one-half of amino acid sites in Drosophila. This shows that selection can potentially account for a large proportion of amino acid replacements in the molecular evolution of proteins.
Collapse
Affiliation(s)
- G K Meghlaoui
- Institut d'Ecologie, Université Pierre et Marie Curie, 4 Place Jussieu, 75252 Paris Cedex 05, France
| | | |
Collapse
|
18
|
Akashi H. Molecular evolution between Drosophila melanogaster and D. simulans: reduced codon bias, faster rates of amino acid substitution, and larger proteins in D. melanogaster. Genetics 1996; 144:1297-307. [PMID: 8913769 PMCID: PMC1207620 DOI: 10.1093/genetics/144.3.1297] [Citation(s) in RCA: 168] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Both natural selection and mutational biases contribute to variation in codon usage bias within Drosophila species. This study addresses the cause of codon bias differences between the sibling species, Drosophila melanogaster and D. simulans. Under a model of mutation-selection-drift, variation in mutational processes between species predicts greater base composition differences in neutrally evolving regions than in highly biased genes. Variation in selection intensity, however, predicts larger base composition differences in highly biased loci. Greater differences in the G+C content of 34 coding regions than 46 intron sequences between D. melanogaster and D. simulans suggest that D. melanogaster has undergone a reduction in selection intensity for codon bias. Computer simulations suggest at least a fivefold reduction in Nes at silent sites in this lineage. Other classes of molecular change show lineage effects between these species. Rates of amino acid substitution are higher in the D. melanogaster lineage than in D. simulans in 14 genes for which outgroup sequences are available. Surprisingly, protein sizes are larger in D. melanogaster than in D. simulans in the 34 genes compared between the two species. A substantial fraction of silent, replacement, and insertion/deletion mutations in coding regions may be weakly selected in Drosophila.
Collapse
Affiliation(s)
- H Akashi
- Section of Evolution and Ecology, University of California, Davis 95616, USA.
| |
Collapse
|
19
|
Comeron JM, Aguadé M. Synonymous substitutions in the Xdh gene of Drosophila: heterogeneous distribution along the coding region. Genetics 1996; 144:1053-62. [PMID: 8913749 PMCID: PMC1207601 DOI: 10.1093/genetics/144.3.1053] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
The Xdh (rosy) region of Drosophila subobscura has been sequenced and compared to the homologous region of D. pseudoobscura and D. melanogaster. Estimates of the numbers of synonymous substitutions per site (Ks) confirm that Xdh has a high synonymous substitution rate. The distributions of both nonsynonymous and synonymous substitutions along the coding region were found to be heterogeneous. Also, no relationship has been detected between Ks estimates and codon usage bias along the gene, in contrast with the generally observed relationship among genes. This heterogeneous distribution of synonymous substitutions along the Xdh gene, which is expression-level independent, could be explained by a differential selection pressure on synonymous sites along the coding region acting on mRNA secondary structure. The synonymous rate in the Xdh coding region is lower in the D. subobscura than in the D. pseudoobscura lineage, whereas the reverse is true for the Adh gene.
Collapse
Affiliation(s)
- J M Comeron
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Spain
| | | |
Collapse
|
20
|
Akashi H. Inferring weak selection from patterns of polymorphism and divergence at "silent" sites in Drosophila DNA. Genetics 1995; 139:1067-76. [PMID: 7713409 PMCID: PMC1206357 DOI: 10.1093/genetics/139.2.1067] [Citation(s) in RCA: 363] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Patterns of codon usage and "silent" DNA divergence suggest that natural selection discriminates among synonymous codons in Drosophila. "Preferred" codons are consistently found in higher frequencies within their synonymous families in Drosophila melanogaster genes. This suggests a simple model of silent DNA evolution where natural selection favors mutations from unpreferred to preferred codons (preferred changes). Changes in the opposite direction, from preferred to unpreferred synonymous codons (unpreferred changes), are selected against. Here, selection on synonymous DNA mutations is investigated by comparing the evolutionary dynamics of these two categories of silent DNA changes. Sequences from outgroups are used to determine the direction of synonymous DNA changes within and between D. melanogaster and Drosophila simulans for five genes. Population genetics theory shows that differences in the fitness effect of mutations can be inferred from the comparison of ratios of polymorphism to divergence. Unpreferred changes show a significantly higher ratio of polymorphism to divergence than preferred changes in the D. simulans lineage, confirming the action of selection at silent sites. An excess of unpreferred fixations in 28 genes suggests a relaxation of selection on synonymous mutations in D. melanogaster. Estimates of selection coefficients for synonymous mutations (3.6 < magnitude of Nes < 1.3) in D. simulans are consistent with the reduced efficacy of natural selection (magnitude of Nes < 1) in the three- to sixfold smaller effective population size of D. melanogaster. Synonymous DNA changes appear to be a prevalent class of weakly selected mutations in Drosophila.
Collapse
Affiliation(s)
- H Akashi
- Department of Ecology and Evolution, University of Chicago, Illinois 60637, USA
| |
Collapse
|
21
|
Abstract
The rates and patterns of evolution at silent sites in codons reveal much about the basic features of molecular evolution. Recent increases in the amount of sequence data available for various species and more precise knowledge of the chromosomal locations of those sequences, coming in particular from genome projects, reveal that some features of molecular evolution vary around the genome.
Collapse
Affiliation(s)
- P M Sharp
- Department of Genetics, University of Nottingham, Queens Medical Centre, UK
| | | |
Collapse
|
22
|
Kliman RM, Hey J. The effects of mutation and natural selection on codon bias in the genes of Drosophila. Genetics 1994; 137:1049-56. [PMID: 7982559 PMCID: PMC1206052 DOI: 10.1093/genetics/137.4.1049] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Codon bias varies widely among the loci of Drosophila melanogaster, and some of this diversity has been explained by variation in the strength of natural selection. A study of correlations between intron and coding region base composition shows that variation in mutation pattern also contributes to codon bias variation. This finding is corroborated by an analysis of variance (ANOVA), which shows a tendency for introns from the same gene to be similar in base composition. The strength of base composition correlations between introns and codon third positions is greater for genes with low codon bias than for genes with high codon bias. This pattern can be explained by an overwhelming effect of natural selection, relative to mutation, in highly biased loci. In particular, this correlation is absent when examining fourfold degenerate sites of highly biased genes. In general, it appears that selection acts more strongly in choosing among fourfold degenerate codons than among twofold degenerate codons. Although the results indicate regional variation in mutational bias, no evidence is found for large scale regions of compositional homogeneity.
Collapse
Affiliation(s)
- R M Kliman
- Department of Biological Sciences, Rutgers University, Piscataway, New Jersey 08855-1059
| | | |
Collapse
|
23
|
Akashi H. Synonymous codon usage in Drosophila melanogaster: natural selection and translational accuracy. Genetics 1994; 136:927-35. [PMID: 8005445 PMCID: PMC1205897 DOI: 10.1093/genetics/136.3.927] [Citation(s) in RCA: 487] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
I present evidence that natural selection biases synonymous codon usage to enhance the accuracy of protein synthesis in Drosophila melanogaster. Since the fitness cost of a translational misincorporation will depend on how the amino acid substitution affects protein function, selection for translational accuracy predicts an association between codon usage in DNA and functional constraint at the protein level. The frequency of preferred codons is significantly higher at codons conserved for amino acids than at nonconserved codons in 38 genes compared between D. melanogaster and Drosophila virilis or Drosophila pseudoobscura (Z = 5.93, P < 10(-6)). Preferred codon usage is also significantly higher in putative zinc-finger and homeodomain regions than in the rest of 28 D. melanogaster transcription factor encoding genes (Z = 8.38, P < 10(-6)). Mutational alternatives (within-gene differences in mutation rates, amino acid changes altering codon preference states, and doublet mutations at adjacent bases) do not appear to explain this association between synonymous codon usage and amino acid constraint.
Collapse
Affiliation(s)
- H Akashi
- Department of Ecology and Evolution, University of Chicago, Illinois 60637
| |
Collapse
|
24
|
Abstract
Intermediate between DNA sequences and broad patterns of karyotypic change there is a major gap in understanding genome structure and evolution. The gap is at the megabase level between genes and chromosomes. New methods for analyzing large DNA fragments cloned in yeast or bacterial vectors provide experimental access to genome evolution at the megabase level by enabling the assembly of megabase-size contiguous regions. Genome evolution at the megabase level can also be studied using high-resolution genetic maps. Rates and patterns of genome evolution in mammals (mouse versus humans) and Drosophila (D. virilis versus D. melanogaster) are compared and contrasted. Opportunities for research in genome evolution using the new technologies are enumerated and discussed.
Collapse
Affiliation(s)
- D L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138
| | | |
Collapse
|