51
|
Cheung SCK, Sun SSM, Chan JCN, Tong PCY. Expression and subcellular targeting of human insulin-like growth factor binding protein-3 in transgenic tobacco plants. Transgenic Res 2009; 18:943-51. [PMID: 19504171 DOI: 10.1007/s11248-009-9286-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2008] [Accepted: 05/16/2009] [Indexed: 12/17/2022]
Abstract
Human insulin-like growth factor binding protein-3 (hIGFBP-3) is a multifunctional protein which has high affinity for insulin-like growth factor-I (IGF-I). It combines with IGF-I to form a tertiary complex in circulation, thus regulating the activity of IGF-I. Furthermore, recombinant hIGFBP-3 (rhIGFBP-3) has been found to negatively regulate cell proliferation and induce apoptosis. In this study, we have established an efficient plant bioreactor platform for mass production of rhIGFBP-3. Different expression constructs, driven by the seed-specific phaseolin promoter, were designed and transformed into tobacco plant via Agrobacterium. To enhance protein expression level, the signal peptide (SP) and the C-terminal tetrapeptide AFVY of phaseolin were used to direct rhIGFBP-3 to protein storage vacuole (PSV) in tobacco seed for stable accumulation. Western blot analysis showed that rhIGFBP-3 was successfully synthesized in transgenic tobacco seeds, with the highest protein expression of 800 mug/g dry weight. The localization of rhIGFBP-3 in PSV was also evident by confocal immunofluorescence microscopy. Our results indicated that protein sorting sequences could benefit the expression level of rhIGFBP-3 and it is feasible to use plant as "bio-factory" to produce therapeutic recombinant proteins in large quantity.
Collapse
Affiliation(s)
- Stanley C K Cheung
- Department of Medicine and Therapeutics, Prince of Wales Hospital, The Chinese University of Hong Kong, Shatin, Hong Kong SAR, China
| | | | | | | |
Collapse
|
52
|
Singh ND, Arndt PF, Clark AG, Aquadro CF. Strong evidence for lineage and sequence specificity of substitution rates and patterns in Drosophila. Mol Biol Evol 2009; 26:1591-605. [PMID: 19351792 DOI: 10.1093/molbev/msp071] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Rates of single nucleotide substitution in Drosophila are highly variable within the genome, and several examples illustrate that evolutionary rates differ among Drosophila species as well. Here, we use a maximum likelihood method to quantify lineage-specific substitutional patterns and apply this method to 4-fold degenerate synonymous sites and introns from more than 8,000 genes aligned in the Drosophila melanogaster group. We find that within species, different classes of sequence evolve at different rates, with long introns evolving most slowly and short introns evolving most rapidly. Relative rates of individual single nucleotide substitutions vary approximately 3-fold among lineages, yielding patterns of substitution that are comparatively less GC-biased in the melanogaster species complex relative to Drosophila yakuba and Drosophila erecta. These results are consistent with a model coupling a mutational shift toward reduced GC content, or a shift in mutation-selection balance, in the D. melanogaster species complex, with variation in selective constraint among different classes of DNA sequence. Finally, base composition of coding and intronic sequences is not at equilibrium with respect to substitutional patterns, which primarily reflects the slow rate of the substitutional process. These results thus support the view that mutational and/or selective processes are labile on an evolutionary timescale and that if the process is indeed selection driven, then the distribution of selective constraint is variable across the genome.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Molecular Biology and Genetics, Cornell University.
| | | | | | | |
Collapse
|
53
|
Cutter AD, Dey A, Murray RL. Evolution of the Caenorhabditis elegans genome. Mol Biol Evol 2009; 26:1199-234. [PMID: 19289596 DOI: 10.1093/molbev/msp048] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
A fundamental problem in genome biology is to elucidate the evolutionary forces responsible for generating nonrandom patterns of genome organization. As the first metazoan to benefit from full-genome sequencing, Caenorhabditis elegans has been at the forefront of research in this area. Studies of genomic patterns, and their evolutionary underpinnings, continue to be augmented by the recent push to obtain additional full-genome sequences of related Caenorhabditis taxa. In the near future, we expect to see major advances with the onset of whole-genome resequencing of multiple wild individuals of the same species. In this review, we synthesize many of the important insights to date in our understanding of genome organization and function that derive from the evolutionary principles made explicit by theoretical population genetics and molecular evolution and highlight fertile areas for future research on unanswered questions in C. elegans genome evolution. We call attention to the need for C. elegans researchers to generate and critically assess nonadaptive hypotheses for genomic and developmental patterns, in addition to adaptive scenarios. We also emphasize the potential importance of evolution in the gonochoristic (female and male) ancestors of the androdioecious (hermaphrodite and male) C. elegans as the source for many of its genomic and developmental patterns.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology & Evolutionary Biology and the Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada.
| | | | | |
Collapse
|
54
|
Evidence of a major role of GP64 in group I alphabaculovirus evolution. Virus Res 2009; 142:85-91. [PMID: 19428740 DOI: 10.1016/j.virusres.2009.01.015] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2008] [Revised: 01/16/2009] [Accepted: 01/19/2009] [Indexed: 11/22/2022]
Abstract
Previous investigations suggest that the divergence of the group I alphabaculoviruses was later than that of the group II alphabaculoviruses, however, there is no quantitative data to support this hypothesis. To examine this theory, the evolutionary rates of the 30 core genes that are conserved among all baculoviruses and the 11 unique genes among group I alphabaculoviruses were estimated in this report. For core genes, the synonymous and nonsynonymous substitution rates (Ks and Ka) were found to be significantly different among different groups, with the rates being granulovirus>group II>group I. Among the 11 unique genes, gp64 was found to have the highest amino acid identity and the lowest omega (Ka/Ks) and Ka values. The significant difference in the selection pressure was found in the F-like protein. These analyses suggests the following interpretation: (i) group I evolved from an ancestral group II alphabaculovirus that had 11 genes not present in other members of this group; (ii) the acquisition of the gp64 gene may have stimulated or initiated the formation of the group I as a major lineage distinct from group II; and (iii) after being functionally displaced by gp64, the F-like gene of group I evolved under a relaxed selection pressure that lead to the partial lost of its function.
Collapse
|
55
|
Zhao Y, Miyashita K, Ando T, Kakeji Y, Yamanaka T, Taguchi K, Ushijima T, Oda S, Maehara Y. Exclusive KRAS mutation in microsatellite-unstable human colorectal carcinomas with sequence alterations in the DNA mismatch repair gene, MLH1. Gene 2008; 423:188-93. [PMID: 18692554 DOI: 10.1016/j.gene.2008.07.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2008] [Revised: 07/10/2008] [Accepted: 07/15/2008] [Indexed: 11/18/2022]
Abstract
Microsatellite instability (MSI) is regarded as reflecting defective DNA mismatch repair (MMR). MMR defects lead to an increase in point mutations, as well as repeat instability, on the genome. However, despite the highly unstable microsatellites, base substitutions in representative oncogenes or tumor suppressors are extremely infrequent in MSI-positive tumors. Recently, the heterogeneity in MSI-positive colorectal tumors is pointed out, and the 'hereditary' and 'sporadic settings' are proposed. Particularly in the former, base substitution mutations in KRAS are regarded as relatively frequent. We sequenced the KRAS gene in a panel of 76 human colorectal carcinomas in which the MSI status has been determined. KRAS mutations were detected in 22 tumors (28.9%). Intriguingly, all of the KRAS-mutant MSI-H (high) tumors harbored sequence alterations in an essential MMR gene, MLH1, which implies that KRAS mutation more frequently and almost exclusively occurs in MMR gene-mutant MSI-H tumors. Furthermore, in contrast with the prevailing viewpoint, some of these tumors are derived from sporadic colorectal cancer patients. The tight connection between MMR gene mutation and KRAS mutation may suggest previously unrecognized complexities in the relationship between MSI and the mutator phenotype derived from defective MMR.
Collapse
Affiliation(s)
- Yan Zhao
- Institute for Clinical Research, National Kyushu Cancer Center, 3-1-1, Notame, Fukuoka 811-1395, Japan
| | | | | | | | | | | | | | | | | |
Collapse
|
56
|
Zhou T, Drummond DA, Wilke CO. Contact density affects protein evolutionary rate from bacteria to animals. J Mol Evol 2008; 66:395-404. [PMID: 18379715 DOI: 10.1007/s00239-008-9094-4] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2007] [Revised: 02/16/2008] [Accepted: 02/25/2008] [Indexed: 12/29/2022]
Abstract
The density of contacts or the fraction of buried sites in a protein structure is thought to be related to a protein's designability, and genes encoding more designable proteins should evolve faster than other genes. Several recent studies have tested this hypothesis but have found conflicting results. Here, we investigate how a gene's evolutionary rate is affected by its protein's contact density, considering the four species Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, and Homo sapiens. We find for all four species that contact density correlates positively with evolutionary rate, and that these correlations do not seem to be confounded by gene expression level. The strength of this signal, however, varies widely among species. We also study the effect of contact density on domain evolution in multidomain proteins and find that a domain's contact density influences the domain's evolutionary rate. Within the same protein, a domain with higher contact density tends to evolve faster than a domain with lower contact density. Our study provides evidence that contact density can increase evolutionary rates, and that it acts similarly on the level of entire proteins and of individual protein domains.
Collapse
Affiliation(s)
- Tong Zhou
- Center for Computational Biology and Bioinformatics, Section of Integrative Biology, University of Texas at Austin, Austin, TX 78731, USA
| | | | | |
Collapse
|
57
|
Warden CD, Kim SH, Yi SV. Predicted functional RNAs within coding regions constrain evolutionary rates of yeast proteins. PLoS One 2008; 3:e1559. [PMID: 18270559 PMCID: PMC2216430 DOI: 10.1371/journal.pone.0001559] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2007] [Accepted: 12/30/2007] [Indexed: 11/25/2022] Open
Abstract
Functional RNAs (fRNAs) are being recognized as an important regulatory component in biological processes. Interestingly, recent computational studies suggest that the number and biological significance of functional RNAs within coding regions (coding fRNAs) may have been underestimated. We hypothesized that such coding fRNAs will impose additional constraint on sequence evolution because the DNA primary sequence has to simultaneously code for functional RNA secondary structures on the messenger RNA in addition to the amino acid codons for the protein sequence. To test this prediction, we first utilized computational methods to predict conserved fRNA secondary structures within multiple species alignments of Saccharomyces sensu strico genomes. We predict that as much as 5% of the genes in the yeast genome contain at least one functional RNA secondary structure within their protein-coding region. We then analyzed the impact of coding fRNAs on the evolutionary rate of protein-coding genes because a decrease in evolutionary rate implies constraint due to biological functionality. We found that our predicted coding fRNAs have a significant influence on evolutionary rates (especially at synonymous sites), independent of other functional measures. Thus, coding fRNA may play a role on sequence evolution. Given that coding regions of humans and flies contain many more predicted coding fRNAs than yeast, the impact of coding fRNAs on sequence evolution may be substantial in genomes of higher eukaryotes.
Collapse
Affiliation(s)
- Charles D. Warden
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Seong-Ho Kim
- Division of Biostatistics, Indiana University School of Medicine, Indianapolis, Indiana, United States of America
| | - Soojin V. Yi
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- *E-mail:
| |
Collapse
|
58
|
Fuglsang A. Impact of bias discrepancy and amino acid usage on estimates of the effective number of codons used in a gene, and a test for selection on codon usage. Gene 2007; 410:82-8. [PMID: 18248919 DOI: 10.1016/j.gene.2007.12.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2007] [Revised: 10/22/2007] [Accepted: 12/03/2007] [Indexed: 11/26/2022]
Abstract
The effective number of codons (Nc) used in a gene is one of the most commonly used measures of synonymous codon usage bias, owing much of its popularity to the fact that it is species independent and that simulation studies have shown that it is less dependent of gene length than other measures. In this paper I provide a clear and practically meaningful definition of bias discrepancy (BD; when the degree of codon bias varies within a degeneracy class). Moreover I evaluate the impact of BD and amino acid usage on estimates of Nc. It is shown that both factors have a significant effect on accuracy and precision. Both amino acid usage and BD influence accuracy considerably, especially in short genes. Finally, I demonstrate how the definition of bias discrepancy can be applied to investigate if codon usage is influenced by selection and I discuss this test in relation to the incongruous literature that exists for Buchnera sp. APS and Borrelia burgdorferi.
Collapse
Affiliation(s)
- Anders Fuglsang
- University of Copenhagen, Faculty of Pharmaceutical Sciences, 2 Universitetsparken, Copenhagen O, Denmark.
| |
Collapse
|
59
|
Chen R, Yan H, Zhao KN, Martinac B, Liu GB. Comprehensive analysis of prokaryotic mechanosensation genes: their characteristics in codon usage. ACTA ACUST UNITED AC 2007; 18:269-78. [PMID: 17541832 DOI: 10.1080/10425170601136564] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
In the present study, we examined GC nucleotide composition, relative synonymous codon usage (RSCU), effective number of codons (ENC), codon adaptation index (CAI) and gene length for 308 prokaryotic mechanosensitive ion channel (MSC) genes from six evolutionary groups: Euryarchaeota, Actinobacteria, Alphaproteobacteria, Betaproteobacteria, Firmicutes, and Gammaproteobacteria. Results showed that: (1) a wide variation of overrepresentation of nucleotides exists in the MSC genes; (2) codon usage bias varies considerably among the MSC genes; (3) both nucleotide constraint and gene length play an important role in shaping codon usage of the bacterial MSC genes; and (4) synonymous codon usage of prokaryotic MSC genes is phylogenetically conserved. Knowledge of codon usage in prokaryotic MSC genes may benefit from the study of the MSC genes in eukaryotes in which few MSC genes have been identified and functionally analysed.
Collapse
Affiliation(s)
- Rong Chen
- School of Medicine, Xi'an Jiaotong University, Xi'an, People's Republic of China
| | | | | | | | | |
Collapse
|
60
|
Zhong J, Li Y, Zhao S, Liu S, Zhang Z. Mutation pressure shapes codon usage in the GC-Rich genome of foot-and-mouth disease virus. Virus Genes 2007; 35:767-76. [PMID: 17768673 PMCID: PMC7089325 DOI: 10.1007/s11262-007-0159-z] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2007] [Accepted: 08/09/2007] [Indexed: 11/25/2022]
Abstract
Foot-and-mouth disease (FMD) is economically the most important viral-induced livestock disease worldwide. In this study, we report the results of a survey of codon usage bias of FMD virus (FMDV) representing all seven serotypes (A, O, C, Asia 1, SAT 1, SAT 2, and SAT 3). Correspondence analysis, a commonly used multivariate statistical approach, was carried out to analyze synonymous codon usage bias. The analysis showed that the overall extent of codon usage bias in FMDV is low. Furthermore, the good correlation between the frequency of G + C at the synonymous third position of sense codons (GC3S) content at silent sites of each sequence and codon usage bias suggested that mutation pressure rather than natural (translational) selection is the most important determinant of the codon bias observed. In addition, other factors, such as the lengths of open reading frame (ORF) and the hydrophobicity of genes also influence the codon usage variation among the genomes of FMDV in a minor way. The result of phylogenetic analyses based on the relative synonymous codon usage (RSCU) values indicated a few obvious phylogenetic incongruities, which suggest that more FMDV genome diversity may exist in nature than is currently indicated. Our work might give some clues to the features of FMDV genome and some evolutionary information of this virus.
Collapse
Affiliation(s)
- Jincheng Zhong
- University of Electronic Science and Technology of China, Chengdu, Sichuan 610054 P.R. China
| | - Yanmin Li
- Institute for Animal Health, Pirbright, Woking, Surrey GU24 0NF UK
| | - Sheng Zhao
- Jingmen Technical College, Jingmen, Hubei 448000 P.R. China
| | - Shenggang Liu
- University of Electronic Science and Technology of China, Chengdu, Sichuan 610054 P.R. China
| | - Zhidong Zhang
- Institute for Animal Health, Pirbright, Woking, Surrey GU24 0NF UK
| |
Collapse
|
61
|
Sauvage C, Bierne N, Lapègue S, Boudry P. Single Nucleotide polymorphisms and their relationship to codon usage bias in the Pacific oyster Crassostrea gigas. Gene 2007; 406:13-22. [PMID: 17616269 DOI: 10.1016/j.gene.2007.05.011] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2007] [Revised: 05/14/2007] [Accepted: 05/18/2007] [Indexed: 10/23/2022]
Abstract
DNA sequence polymorphism and codon usage bias were investigated in a set of 41 nuclear loci in the Pacific oyster Crassostrea gigas. Our results revealed a very high level of DNA polymorphism in oysters, in the order of magnitude of the highest levels reported in animals to date. A total of 290 single nucleotide polymorphisms (SNPs) were detected, 76 of which being localised in exons and 214 in non-coding regions. Average density of SNPs was estimated to be one SNP every 60 bp in coding regions and one every 40 bp in non-coding regions. Non-synonymous substitutions contributed substantially to the polymorphism observed in coding regions. The non-synonymous to silent diversity ratio was 0.16 on average, which is fairly higher to the ratio reported in other invertebrate species recognised to display large population sizes. Therefore, purifying selection does not appear to be as strong as it could have been expected for a species with a large effective population size. The level of non-synonymous diversity varied greatly from one gene to another, in accordance with varying selective constraints. We examined codon usage bias and its relationship with DNA polymorphism. The table of optimal codons was deduced from the analysis of an EST dataset, using EST counts as a rough assessment of gene expression. As recently observed in some other taxa, we found a strong and significant negative relationship between codon bias and non-synonymous diversity suggesting correlated selective constraints on synonymous and non-synonymous substitutions. Codon bias as measured by the frequency of optimal codons for expression might therefore provide a useful indicator of the level of constraint upon proteins in the oyster genome.
Collapse
Affiliation(s)
- C Sauvage
- Laboratoire de Génétique et Pathologie - IFREMER - La Tremblade, France
| | | | | | | |
Collapse
|
62
|
Phenotypic mutation rates and the abundance of abnormal proteins in yeast. PLoS Comput Biol 2007; 3:e203. [PMID: 18039025 PMCID: PMC2082502 DOI: 10.1371/journal.pcbi.0030203] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2007] [Accepted: 09/05/2007] [Indexed: 11/23/2022] Open
Abstract
Phenotypic mutations are errors that occur during protein synthesis. These errors lead to amino acid substitutions that give rise to abnormal proteins. Experiments suggest that such errors are quite common. We present a model to study the effect of phenotypic mutation rates on the amount of abnormal proteins in a cell. In our model, genes are regulated to synthesize a certain number of functional proteins. During this process, depending on the phenotypic mutation rate, abnormal proteins are generated. We use data on protein length and abundance in Saccharomyces cerevisiae to parametrize our model. We calculate that for small phenotypic mutation rates most abnormal proteins originate from highly expressed genes that are on average nearly twice as large as the average yeast protein. For phenotypic mutation rates much above 5 × 10−4, the error-free synthesis of large proteins is nearly impossible and lowly expressed, very large proteins contribute more and more to the amount of abnormal proteins in a cell. This fact leads to a steep increase of the amount of abnormal proteins for phenotypic mutation rates above 5 × 10−4. Simulations show that this property leads to an upper limit for the phenotypic mutation rate of approximately 2 × 10−3 even if the costs for abnormal proteins are extremely low. We also consider the adaptation of individual proteins. Individual genes/proteins can decrease their phenotypic mutation rate by using preferred codons or by increasing their robustness against amino acid substitutions. We discuss the similarities and differences between the two mechanisms and show that they can only slow down but not prevent the rapid increase of the amount of abnormal proteins. Our work allows us to estimate the phenotypic mutation rate based on data on the fraction of abnormal proteins. For S. cerevisiae, we predict that the value for the phenotypic mutation rate is between 2 × 10−4 and 6 × 10−4. A functional protein machinery, built from genetic information, is central to every living organism. Surprisingly, the decoding of genes into amino acid sequences is fairly inaccurate. Errors in this process (phenotypic mutations) are several orders of magnitude more frequent than errors during DNA replication (genotypic mutations). Many researchers have explored the evolution of genotypic mutation rates, but there are as yet few investigations into the evolutionary dynamics of phenotypic mutation rates. Here we present a mathematical model that describes the effect of phenotypic mutation on the amount of abnormal proteins in cells. We parameterize our model using data from yeast (Saccharomyces cerevisiae). We show that for phenotypic mutation rates above 5 × 10−4 per amino acid, the error-free synthesis of large proteins becomes nearly impossible. We estimate the phenotypic mutation rate of S. cerevisiae to be between 2 × 10−4 and 6 × 10−4 per amino acid.
Collapse
|
63
|
Singh ND, Arndt PF, Petrov DA. Minor shift in background substitutional patterns in the Drosophila saltans and willistoni lineages is insufficient to explain GC content of coding sequences. BMC Biol 2006; 4:37. [PMID: 17049096 PMCID: PMC1626080 DOI: 10.1186/1741-7007-4-37] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2006] [Accepted: 10/18/2006] [Indexed: 11/10/2022] Open
Abstract
Background Several lines of evidence suggest that codon usage in the Drosophila saltans and D. willistoni lineages has shifted towards a less frequent use of GC-ending codons. Introns in these lineages show a parallel shift toward a lower GC content. These patterns have been alternatively ascribed to either a shift in mutational patterns or changes in the definition of preferred and unpreferred codons in these lineages. Results and discussion To gain additional insight into this question, we quantified background substitutional patterns in the saltans/willistoni group using inactive copies of a novel, Q-like retrotransposable element. We demonstrate that the pattern of background substitutions in the saltans/willistoni lineage has shifted to a significant degree, primarily due to changes in mutational biases. These differences predict a lower equilibrium GC content in the genomes of the saltans/willistoni species compared with that in the D. melanogaster species group. The magnitude of the difference can readily account for changes in intronic GC content, but it appears insufficient to explain changes in codon usage within the saltans/willistoni lineage. Conclusion We suggest that the observed changes in codon usage in the saltans/willistoni clade reflects either lineage-specific changes in the definitions of preferred and unpreferred codons, or a weaker selective pressure on codon bias in this lineage.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| | - Peter F Arndt
- Max Planck for Molecular Genetics, 14195 Berlin, Germany
| | - Dmitri A Petrov
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, CA 94305, USA
| |
Collapse
|
64
|
Bierne N, Eyre-Walker A. Variation in synonymous codon use and DNA polymorphism within the Drosophila genome. J Evol Biol 2006; 19:1-11. [PMID: 16405571 DOI: 10.1111/j.1420-9101.2005.00996.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A strong negative correlation between the rate of amino-acid substitution and codon usage bias in Drosophila has been attributed to interference between positive selection at nonsynonymous sites and weak selection on codon usage. To further explore this possibility we have investigated polymorphism and divergence at three kinds of sites: synonymous, nonsynonymous and intronic in relation to codon bias in D. melanogaster and D. simulans. We confirmed that protein evolution is one of the main explicative parameters for interlocus codon bias variation (r(2) approximately 40%). However, intron or synonymous diversities, which could have been expected to be good indicators of local interference [here defined as the additional increase of drift due to selection on tightly linked sites, also called 'genetic draft' by Gillespie (2000)] did not covary significantly with codon bias or with protein evolution. Concurrently, levels of polymorphism were reduced in regions of low recombination rates whereas codon bias was not. Finally, while nonsynonymous diversities were very well correlated between species, neither synonymous nor intron diversities observed in D. melanogaster were correlated with those observed in D. simulans. All together, our results suggest that the selective constraint on the protein is a stable component of gene evolution while local interference is not. The pattern of variation in genetic draft along the genome therefore seems to be instable through evolutionary times and should therefore be considered as a minor determinant of codon bias variance. We argue that selective constraints for optimal codon usage are likely to be correlated with selective constraints on the protein, both between codons within a gene, as previously suggested, and also between genes within a genome.
Collapse
Affiliation(s)
- N Bierne
- Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton, UK.
| | | |
Collapse
|
65
|
Cutter AD, Baird SE, Charlesworth D. High nucleotide polymorphism and rapid decay of linkage disequilibrium in wild populations of Caenorhabditis remanei. Genetics 2006; 174:901-13. [PMID: 16951062 PMCID: PMC1602088 DOI: 10.1534/genetics.106.061879] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The common ancestor of the self-fertilizing nematodes Caenorhabditis elegans and C. briggsae must have reproduced by obligate outcrossing, like most species in this genus. However, we have only a limited understanding about how genetic variation is patterned in such male-female (gonochoristic) Caenorhabditis species. Here, we report results from surveying nucleotide variation of six nuclear loci in a broad geographic sample of wild isolates of the gonochoristic C. remanei. We find high levels of diversity in this species, with silent-site diversity averaging 4.7%, implying an effective population size close to 1 million. Additionally, the pattern of polymorphisms reveals little evidence for population structure or deviation from neutral expectations, suggesting that the sampled C. remanei populations approximate panmixis and demographic equilibrium. Combined with the observation that linkage disequilibrium between pairs of polymorphic sites decays rapidly with distance, this suggests that C. remanei will provide an excellent system for identifying the genetic targets of natural selection from deviant patterns of polymorphism and linkage disequilibrium. The patterns revealed in this obligately outcrossing species may provide a useful model of the evolutionary circumstances in C. elegans' gonochoristic progenitor. This will be especially important if self-fertilization evolved recently in C. elegans history, because most of the evolutionary time separating C. elegans from its known relatives would have occurred in a state of obligate outcrossing.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom.
| | | | | |
Collapse
|
66
|
Cutter AD, Wasmuth JD, Blaxter ML. The evolution of biased codon and amino acid usage in nematode genomes. Mol Biol Evol 2006; 23:2303-15. [PMID: 16936139 DOI: 10.1093/molbev/msl097] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Despite the degeneracy of the genetic code, whereby different codons encode the same amino acid, alternative codons and amino acids are utilized nonrandomly within and between genomes. Such biases in codon and amino acid usage have been demonstrated extensively in prokaryote genomes and likely reflect a balance between the action of mutation, selection, and genetic drift. Here, we quantify the effects of selection and mutation drift as causes of codon and amino acid-usage bias in a large collection of nematode partial genomes from 37 species spanning approximately 700 Myr of evolution, as inferred from expressed sequence tag (EST) measures of gene expression and from base composition variation. Average G + C content at silent sites among these taxa ranges from 10% to 63%, and EST counts range more than 100-fold, underlying marked differences between the identities of major codons and optimal codons for a given species as well as influencing patterns of amino acid abundance among taxa. Few species in our sample demonstrate a dominant role of selection in shaping intragenomic codon-usage biases, and these are principally free living rather than parasitic nematodes. This suggests that deviations in effective population size among species, with small effective sizes among parasites, are partly responsible for species differences in the extent to which selection shapes patterns of codon usage. Nevertheless, a consensus set of optimal codons emerges that is common to most taxa, indicating that, with some notable exceptions, selection for translational efficiency and accuracy favors similar sets of codons regardless of the major codon-usage trends defined by base compositional properties of individual nematode genomes.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | |
Collapse
|
67
|
Warringer J, Blomberg A. Evolutionary constraints on yeast protein size. BMC Evol Biol 2006; 6:61. [PMID: 16911784 PMCID: PMC1560397 DOI: 10.1186/1471-2148-6-61] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2006] [Accepted: 08/15/2006] [Indexed: 11/22/2022] Open
Abstract
Background Despite a strong evolutionary pressure to reduce genome size, proteins vary in length over a surprisingly wide range also in very compact genomes. Here we investigated the evolutionary forces that act on protein size in the yeast Saccharomyces cerevisiae utilizing a system-wide bioinformatics approach. Data on yeast protein size was compared to global experimental data on protein expression, phenotypic pleiotropy, protein-protein interactions, protein evolutionary rate and biochemical classification. Results Comparing the experimentally determined abundance of individual proteins, highly expressed proteins were found to be consistently smaller than lowly expressed proteins, in accordance with the biosynthetic cost minimization hypothesis. Yeast proteins able to maintain a high expression level despite a large size tended to belong to a very distinct set of protein families, notably nuclear transport and translation initiation/elongation. Large proteins have significantly more protein-protein interactions than small proteins, suggesting that a requirement for multiple interaction domains may constitute a positive selective pressure for large protein size in yeast. The higher frequency of protein-protein interactions in large proteins was not accompanied by a higher phenotypic pleiotropy. Hence, the increase in interactions may not reflect an increase in function differentiation. Proteins of different sizes also evolved at similar rates. Finally, whereas the biological process involved was found to have little influence on protein size the biochemical activity exerted by the protein represented a dominant factor. More than one third of all biochemical activity classes were enriched in one or more size intervals. Conclusion In yeast, there is an inverse relationship between protein size and protein expression such that highly expressed proteins tend to be of smaller size. Also, protein size is moderately affected by protein connectivity and strongly affected by biochemical activity. Phenotypic pleiotropy does not seem to affect protein size.
Collapse
Affiliation(s)
- Jonas Warringer
- Department of Cell and Molecular Biology, Lundberg Laboratory, Göteborg University Medicinaregatan 9c, 41390 Göteborg, Sweden
| | - Anders Blomberg
- Department of Cell and Molecular Biology, Lundberg Laboratory, Göteborg University Medicinaregatan 9c, 41390 Göteborg, Sweden
| |
Collapse
|
68
|
Wang L, Roossinck MJ. Comparative analysis of expressed sequences reveals a conserved pattern of optimal codon usage in plants. PLANT MOLECULAR BIOLOGY 2006; 61:699-710. [PMID: 16897485 DOI: 10.1007/s11103-006-0041-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/08/2005] [Accepted: 03/09/2006] [Indexed: 05/11/2023]
Abstract
Codon usage bias is a ubiquitous phenomenon, which may be caused by mutational bias, selection, or both. The patterns of codon usage in plants are not well understood. Datasets of expressed sequence tags (ESTs) available for many plant species provide the resources for large-scale comparative analysis of codon usage patterns. We developed a computational approach to translate EST or assembled contig sequences, and then used the coding information for comparative analysis of codon usage in 12 plant species, including 6 eudicots, 5 monocots and the green alga Chlamydomonas reinhardtii. While codon nucleotide composition is highly conserved within eudicots or monocots, there is a significant difference between these two major taxonomic groups of higher plants. The third nucleotide position of codons is AU-rich in the eudicot genomes (35-42% of G+C content), but GC-rich in the monocot genomes (59-61% of G+C content). To identify optimal codons in these species, we used EST counts to estimate gene transcript levels. It was demonstrated that codon usage bias is correlated positively with gene transcript levels. Interestingly, the use of optimal codons appears to be well conserved between eudicots and monocots, and to a lesser degree between the higher plants and C. reinhardtii. Most of the optimal codons end with a C or G base, regardless of the different nucleotide composition in these genomes. The results suggest that plant codon usage is affected by translational selection, and the selective pressure appears to be conserved in the plant kingdom.
Collapse
Affiliation(s)
- Liangjiang Wang
- Bioinformatics Center, Division of Biology, Kansas State University, Manhattan, KS 66506, USA.
| | | |
Collapse
|
69
|
Spiridonova LN, Chelomina GN, Tsuda K, Yonekawa H, Starikov VP. Genetic evidence of extensive introgression of short-tailed ground squirrel genes in a hybridization zone of Spermophilus major and S. erythrogenys, inferred from sequencing of the mtDNA cytochrome b gene. RUSS J GENET+ 2006. [DOI: 10.1134/s1022795406070167] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
70
|
Cutter AD, Félix MA, Barrière A, Charlesworth D. Patterns of nucleotide polymorphism distinguish temperate and tropical wild isolates of Caenorhabditis briggsae. Genetics 2006; 173:2021-31. [PMID: 16783011 PMCID: PMC1569728 DOI: 10.1534/genetics.106.058651] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Caenorhabditis briggsae provides a natural comparison species for the model nematode C. elegans, given their similar morphology, life history, and hermaphroditic mode of reproduction. Despite C. briggsae boasting a published genome sequence and establishing Caenorhabditis as a model genus for genetics and development, little is known about genetic variation across the geographic range of this species. In this study, we greatly expand the collection of natural isolates and characterize patterns of nucleotide variation for six loci in 63 strains from three continents. The pattern of polymorphisms reveals differentiation between C. briggsae strains found in temperate localities in the northern hemisphere from those sampled near the Tropic of Cancer, with diversity within the tropical region comparable to what is found for C. elegans in Europe. As in C. elegans, linkage disequilibrium is pervasive, although recombination is evident among some variant sites, indicating that outcrossing has occurred at a low rate in the history of the sample. In contrast to C. elegans, temperate regions harbor extremely little variation, perhaps reflecting colonization and recent expansion of C. briggsae into northern latitudes. We discuss these findings in relation to their implications for selection, demographic history, and the persistence of self-fertilization.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, UK.
| | | | | | | |
Collapse
|
71
|
Bloom JD, Drummond DA, Arnold FH, Wilke CO. Structural determinants of the rate of protein evolution in yeast. Mol Biol Evol 2006; 23:1751-61. [PMID: 16782762 DOI: 10.1093/molbev/msl040] [Citation(s) in RCA: 148] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
We investigate how a protein's structure influences the rate at which its sequence evolves. Our basic hypothesis is that proteins with highly designable structures (structures that are encoded by many sequences) will evolve more rapidly. Recent theoretical advances argue that structures with a higher density of interresidue contacts are more designable, and we show that high contact density is correlated with an increased rate of sequence evolution in yeast. In addition, we investigate the correlations between the rate of sequence evolution and several other structural descriptors, carefully controlling for the strong effect of expression level on evolutionary rate. Overall, we find that the structural descriptors that we consider appear to explain roughly 10% of the variation in rates of protein evolution in yeast. We also show that despite the well-known trend for buried residues to be more conserved, proteins with a higher fraction of buried residues, nonetheless, tend to evolve their sequences more rapidly. We suggest that this effect is due to the increased designability of structures with more buried residues. Our results provide evidence that protein structure plays an important role in shaping the rate of sequence evolution and provide evidence to support recent theoretical advances linking structural designability to contact density.
Collapse
Affiliation(s)
- Jesse D Bloom
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, USA
| | | | | | | |
Collapse
|
72
|
McInerney JO. The causes of protein evolutionary rate variation. Trends Ecol Evol 2006; 21:230-2. [PMID: 16697908 DOI: 10.1016/j.tree.2006.03.008] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2005] [Revised: 03/02/2006] [Accepted: 03/13/2006] [Indexed: 11/22/2022]
Abstract
The rate of protein evolution varies more than 1000-fold and, for the past 30 years, it was thought that the rate was determined by protein function. Drummond and co-workers have now shown that a single factor underlying mRNA expression, protein abundance and synonymous codon usage is the chief causal agent of protein evolutionary rate in yeast. It will be interesting to see whether this is shown to be a universal rule for all biological systems.
Collapse
Affiliation(s)
- James O McInerney
- Department of Biology, National University of Ireland, Maynooth, County Kildare, Ireland.
| |
Collapse
|
73
|
Kotlar D, Lavner Y. The action of selection on codon bias in the human genome is related to frequency, complexity, and chronology of amino acids. BMC Genomics 2006; 7:67. [PMID: 16584540 PMCID: PMC1456966 DOI: 10.1186/1471-2164-7-67] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2005] [Accepted: 04/03/2006] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND The question of whether synonymous codon choice is affected by cellular tRNA abundance has been positively answered in many organisms. In some recent works, concerning the human genome, this relation has been studied, but no conclusive answers have been found. In the human genome, the variation in base composition and the absence of cellular tRNA count data makes the study of the question more complicated. In this work we study the relation between codon choice and tRNA abundance in the human genome by correcting relative codon usage for background base composition and using a measure based on tRNA-gene copy numbers as a rough estimate of tRNA abundance. RESULTS We term major codons to be those codons with a relatively large tRNA-gene copy number for their corresponding amino acid. We use two measures of expression: breadth of expression (the number of tissues in which a gene was expressed) and maximum expression level among tissues (the highest value of expression of a gene among tissues). We show that for half the amino acids in the study (8 of 16) the relative major codon usage rises with breadth of expression. We show that these amino acids are significantly more frequent, are smaller and simpler, and are more ancient than the rest of the amino acids. Similar, although weaker, results were obtained for maximum expression level. CONCLUSION There is evidence that codon bias in the human genome is related to selection, although the selection forces acting on codon bias may not be straightforward and may be different for different amino acids. We suggest that, in the first group of amino acids, selection acts to enhance translation efficiency in highly expressed genes by preferring major codons, and acts to reduce translation rate in lowly expressed genes by preferring non-major ones. In the second group of amino acids other selection forces, such as reducing misincorporation rate of expensive amino acids, in terms of their size/complexity, may be in action. The fact that codon usage is more strongly related to breadth of expression than to maximum expression level supports the notion, presented in a recent study, that codon choice may be related to the tRNA abundance in the tissue in which a gene is expressed.
Collapse
Affiliation(s)
- Daniel Kotlar
- Department of Computer Science, Tel-Hai Academic College, Upper Galilee, 12210, Israel
| | - Yizhar Lavner
- Department of Computer Science, Tel-Hai Academic College, Upper Galilee, 12210, Israel
| |
Collapse
|
74
|
Abstract
Recent work has shown that expression level is the main predictor of a gene's evolutionary rate and that more highly expressed genes evolve slower. A possible explanation for this observation is selection for proteins that fold properly despite mistranslation, in short selection for translational robustness. Translational robustness leads to the somewhat paradoxical prediction that highly expressed genes are extremely tolerant to missense substitutions but nevertheless evolve very slowly. Here, we study a simple theoretical model of translational robustness that allows us to gain analytic insight into how this paradoxical behavior arises.
Collapse
Affiliation(s)
- Claus O Wilke
- Section of Integrative Biology and Center for Computational Biology and Bioinformatics, University of Texas, Austin 78712, USA.
| | | |
Collapse
|
75
|
Liu Q. Analysis of codon usage pattern in the radioresistant bacterium Deinococcus radiodurans. Biosystems 2006; 85:99-106. [PMID: 16431014 DOI: 10.1016/j.biosystems.2005.12.003] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2005] [Revised: 12/06/2005] [Accepted: 12/12/2005] [Indexed: 11/23/2022]
Abstract
The main factors shaping codon usage bias in the Deinococcus radiodurans genome were reported. Correspondence analysis (COA) was carried out to analyze synonymous codon usage bias. The results showed that the main trend was strongly correlated with gene expression level assessed by the "Codon Adaptation Index" (CAI) values, a result that was confirmed by the distribution of genes along the first axis. The results of correlation analysis, variance analysis and neutrality plot indicated that gene nucleotide composition was clearly contributed to codon bias. CDS length was also key factor in dictating codon usage variation. A general tendency of more biased codon usage of genes with longer CDS length to higher expression level was found. Further, the hydrophobicity of each protein also played a role in shaping codon usage in this organism, which could be confirmed by the significant correlation between the positions of genes placed on the first axis and the hydrophobicity values (r=-0.100, P<0.01). In summary, gene expression level played a crucial role, nucleotide mutational bias, CDS length and the hydrophobicity of each protein just in a minor way in shaping the codon usage pattern of D. radiodurans. Notably, 19 codons firstly defined as "optimal codons" may provide useful clues for molecular genetic engineering and evolutionary studying.
Collapse
Affiliation(s)
- Qingpo Liu
- Department of Agronomy, College of Agriculture and Biotechnology, Zhejiang University, Hangzhou 310029, China.
| |
Collapse
|
76
|
Mitreva M, Wendl MC, Martin J, Wylie T, Yin Y, Larson A, Parkinson J, Waterston RH, McCarter JP. Codon usage patterns in Nematoda: analysis based on over 25 million codons in thirty-two species. Genome Biol 2006; 7:R75. [PMID: 26271136 PMCID: PMC1779591 DOI: 10.1186/gb-2006-7-8-r75] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2006] [Revised: 06/30/2006] [Accepted: 08/14/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage has direct utility in molecular characterization of species and is also a arker for molecular evolution. To understand codon usage within the diverse phylum Nematoda,we analyzed a total of 265,494 expressed sequence tags (ESTs) from 30 nematode species. The full genomes of Caenorhabditis elegans and C. briggsae were also examined. A total of 25,871,325 codons ere analyzed and a comprehensive codon usage table for all species was generated. This is the first codon usage table available for 24 of these organisms. RESULTS Codon usage similarity in Nematoda usually persists over the breadth of a genus but thenrapidly diminishes even within each clade. Globodera, Meloidogyne, Pristionchus, and Strongyloides have the most highly derived patterns of codon usage. The major factor affecting differences in codon usage between species is the coding sequence GC content, which varies in nematodes from 32%to 51%. Coding GC content (measured as GC3) also explains much of the observed variation in the effective number of codons (R = 0.70), which is a measure of codon bias, and it even accounts for differences in amino acid frequency. Codon usage is also affected by neighboring nucleotides(N1 context). Coding GC content correlates strongly with estimated noncoding genomic GC content (R = 0.92). On examining abundant clusters in five species, candidate optimal codons were identified that may be preferred in highly expressed transcripts. CONCLUSION Evolutionary models indicate that total genomic GC content, probably the product of directional mutation pressure, drives codon usage rather than the converse, a conclusion that is supported by examination of nematode genomes.
Collapse
Affiliation(s)
- Makedonka Mitreva
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
| | - Michael C Wendl
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
| | - John Martin
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
| | - Todd Wylie
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
| | - Yong Yin
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
| | - Allan Larson
- Department of Biology, Washington University, St. Louis, Missouri 63130, USA
| | - John Parkinson
- Hospital for Sick Children, Toronto, and Departments of Biochemistry/Medical Genetics and Microbiology, University of Toronto, M5G 1X8, Canada
| | - Robert H Waterston
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - James P McCarter
- Genome Sequencing Center, Washington University School of Medicine, St Louis, Missouri 63108, USA
- Divergence Inc., St Louis, Missouri 63141, USA
| |
Collapse
|
77
|
Zhou T, Sun X, Lu Z. Synonymous codon usage in environmental chlamydia UWE25 reflects an evolutional divergence from pathogenic chlamydiae. Gene 2005; 368:117-25. [PMID: 16380221 DOI: 10.1016/j.gene.2005.10.035] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2005] [Revised: 09/22/2005] [Accepted: 10/27/2005] [Indexed: 10/25/2022]
Abstract
Publication of the complete genome sequence for the Acanthamoeba sp. endosymbiont UWE25 has illuminated the evolution history of chlamydiae. In this study, the codon usage bias in UWE25 and five other species of pathogenic chlamydiae was calculated. It was found that genomic composition constraints are the major source of codon usage variation in UWE25. This result is different from the former observation in pathogenic chlamydiae, whose genomic base composition is more unbiased. Four other factors, such as strand-specific mutational bias, natural selection acting at the level of translation, hydropathy level of each protein and the conservation level of amino acids also have influence in shaping the codon usage in these six species to some extent. Further analysis suggests that the high stability of the UWE25 genome partially account for the difference in codon usage pattern between environmental and pathogenic chlamydiae. Moreover, our results imply that the replicational selection pressure in pathogenic chlamydiae is stronger than that in UWE25. Analyzing the codon usage pattern in the environmental chlamydia and comparing it with that of the pathogenic chlamydiae may provide clues how the chlamydiae have evolved from their common ancestor.
Collapse
Affiliation(s)
- Tong Zhou
- State Key Laboratory of Bioelectronics, Southeast University, 210096 China.
| | | | | |
Collapse
|
78
|
Rayment JH, Forsdyke DR. Amino acids as placeholders: base-composition pressures on protein length in malaria parasites and prokaryotes. ACTA ACUST UNITED AC 2005; 4:117-30. [PMID: 16128613 DOI: 10.2165/00822942-200504020-00005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
BACKGROUND The composition and sequence of amino acids in a protein may serve the underlying needs of the nucleic acids that encode the protein (the genome phenotype). In extreme form, amino acids become mere placeholders inserted between functional segments or domains, and--apart from increasing protein length--playing no role in the specific function or structure of a protein (the conventional phenotype). METHODS We studied the genomes of two malarial parasites and 521 prokaryotes (144 complete) that differ widely in GC% and optimum growth temperature, comparing the base compositions of the protein coding regions and corresponding lengths (kilobases). RESULTS Malarial parasites show distinctive responses to base-compositional pressures that increase as protein lengths increase. A low-GC% species (Plasmodium falciparum) is likely to have more placeholder amino acids than an intermediate-GC% species (P. vivax), so that homologous proteins are longer. In prokaryotes, GC% is generally greater and AG% is generally less in open reading frames (ORFs) encoding long proteins. The increased GC% in long ORFs increases as species' GC% increases, and decreases as species' AG% increases. In low- and intermediate-GC% prokaryotic species, increases in ORF GC% as encoded proteins increase in length are largely accounted for by the base compositions of first and second (amino acid-determining) codon positions. In high-GC% prokaryotic species, first and third (non-amino acid-determining) codon positions play this role. CONCLUSION In low- and intermediate-GC% prokaryotes, placeholder amino acids are likely to be well defined, corresponding to codons enriched in G and/or C at first and second positions. In high-GC% prokaryotes, placeholder amino acids are likely to be less well defined. Increases in ORF GC% as encoded proteins increase in length are greater in mesophiles than in thermophiles, which are constrained from increasing protein lengths in response to base-composition pressures.
Collapse
Affiliation(s)
- Jonathan H Rayment
- Department of Biochemistry, Queen's University, Kingston, Ontario, Canada
| | | |
Collapse
|
79
|
Cutter AD. Nucleotide polymorphism and linkage disequilibrium in wild populations of the partial selfer Caenorhabditis elegans. Genetics 2005; 172:171-84. [PMID: 16272415 PMCID: PMC1456145 DOI: 10.1534/genetics.105.048207] [Citation(s) in RCA: 129] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An understanding of the relative contributions of different evolutionary forces on an organism's genome requires an accurate description of the patterns of genetic variation within and between natural populations. To this end, I report a survey of nucleotide polymorphism in six loci from 118 strains of the nematode Caenorhabditis elegans. These strains derive from wild populations of several regions within France, Germany, and new localities in Scotland, in addition to stock center isolates. Overall levels of silent-site diversity are low within and between populations of this self-fertile species, averaging 0.2% in European samples and 0.3% worldwide. Population structure is present despite a lack of association of sequences with geography, and migration appears to occur at all geographic scales. Linkage disequilibrium is extensive in the C. elegans genome, extending even between chromosomes. Nevertheless, recombination is clearly present in the pattern of polymorphisms, indicating that outcrossing is an infrequent, but important, feature in this species ancestry. The range of outcrossing rates consistent with the data is inferred from linkage disequilibrium, using "scattered" samples representing the collecting phase of the coalescent process in a subdivided population. I propose that genetic variation in this species is shaped largely by population subdivision due to self-fertilization coupled with long- and short-range migration between subpopulations.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
| |
Collapse
|
80
|
Drummond DA, Raval A, Wilke CO. A single determinant dominates the rate of yeast protein evolution. Mol Biol Evol 2005; 23:327-37. [PMID: 16237209 DOI: 10.1093/molbev/msj038] [Citation(s) in RCA: 303] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.
Collapse
Affiliation(s)
- D Allan Drummond
- Program in Computation and Neural Systems, California Institute of Technology, Pasadena, USA
| | | | | |
Collapse
|
81
|
Singh ND, Davis JC, Petrov DA. Codon Bias and Noncoding GC Content Correlate Negatively with Recombination Rate on the Drosophila X Chromosome. J Mol Evol 2005; 61:315-24. [PMID: 16044248 DOI: 10.1007/s00239-004-0287-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2004] [Accepted: 03/10/2005] [Indexed: 11/28/2022]
Abstract
The patterns and processes of molecular evolution may differ between the X chromosome and the autosomes in Drosophila melanogaster. This may in part be due to differences in the effective population size between the two chromosome sets and in part to the hemizygosity of the X chromosome in Drosophila males. These and other factors may lead to differences both in the gene complements of the X and the autosomes and in the properties of the genes residing on those chromosomes. Here we show that codon bias and recombination rate are correlated strongly and negatively on the X chromosome, and that this correlation cannot be explained by indirect relationships with other known determinants of codon bias. This is in dramatic contrast to the weak positive correlation found on the autosomes. We explored possible explanations for these patterns, which required a comprehensive analysis of the relationships among multiple genetic properties such as protein length and expression level. This analysis highlights conserved features of coding sequence evolution on the X and the autosomes and illuminates interesting differences between these two chromosome sets.
Collapse
Affiliation(s)
- Nadia D Singh
- Department of Biological Sciences, Stanford University, 371 Serra Mall, Stanford, California, 90305-5020, USA.
| | | | | |
Collapse
|
82
|
Stenøien HK, Stephan W. Global mRNA stability is not associated with levels of gene expression in Drosophila melanogaster but shows a negative correlation with codon bias. J Mol Evol 2005; 61:306-14. [PMID: 16044249 DOI: 10.1007/s00239-004-0271-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2004] [Accepted: 03/16/2005] [Indexed: 11/26/2022]
Abstract
A multitude of factors contribute to the regulation of gene expression in living cells. The relationship between codon usage bias and gene expression has been extensively studied, and it has been shown that codon bias may have adaptive significance in many unicellular and multicellular organisms. Given the central role of mRNA in post-transcriptional regulation, we hypothesize that mRNA stability is another important factor associated either with positive or negative regulation of gene expression. We have conducted genome-wide studies of the association between gene expression (measured as transcript abundance in public EST databases), mRNA stability, codon bias, GC content, and gene length in Drosophila melanogaster. To remove potential bias of gene length inherently present in EST libraries, gene expression is measured as normalized transcript abundance. It is demonstrated that codon bias and GC content in second codon position are positively associated with transcript abundance. Gene length is negatively associated with transcript abundance. The stability of thermodynamically predicted mRNA secondary structures is not associated with transcript abundance, but there is a negative correlation between mRNA stability and codon bias. This finding does not support the hypothesis that codon bias has evolved as an indirect consequence of selection favoring thermodynamically stable mRNA molecules.
Collapse
Affiliation(s)
- Hans K Stenøien
- Plant Ecology/Department of Ecology and Evolution, Evolutionary Biology Centre, Uppsala University, SE-752 36, Uppsala, Sweden
| | | |
Collapse
|
83
|
Zhou T, Gu W, Ma J, Sun X, Lu Z. Analysis of synonymous codon usage in H5N1 virus and other influenza A viruses. Biosystems 2005; 81:77-86. [PMID: 15917130 DOI: 10.1016/j.biosystems.2005.03.002] [Citation(s) in RCA: 103] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2004] [Revised: 03/05/2005] [Accepted: 03/07/2005] [Indexed: 10/25/2022]
Abstract
In this study, we calculated the codon usage bias in H5N1 virus and performed a comparative analysis of synonymous codon usage patterns in H5N1 virus, five other evolutionary related influenza A viruses and a influenza B virus. Codon usage bias in H5N1 genome is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that the codon usage pattern of H5N1 virus is similar with other influenza A viruses, but not influenza B virus, and the synonymous codon usage in influenza A virus genes is phylogenetically conservative, but not strain-specific. Synonymous codon usage in genes encoded by different influenza A viruses is genus conservative. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.
Collapse
Affiliation(s)
- Tong Zhou
- Key Laboratory of Molecular and Biomolecular Electronics of the Ministry of Education, Southeast University, Nanjing, Jiangsu 210096, China
| | | | | | | | | |
Collapse
|
84
|
Singh ND, Davis JC, Petrov DA. X-linked genes evolve higher codon bias in Drosophila and Caenorhabditis. Genetics 2005; 171:145-55. [PMID: 15965246 PMCID: PMC1456507 DOI: 10.1534/genetics.105.043497] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Comparing patterns of molecular evolution between autosomes and sex chromosomes (such as X and W chromosomes) can provide insight into the forces underlying genome evolution. Here we investigate patterns of codon bias evolution on the X chromosome and autosomes in Drosophila and Caenorhabditis. We demonstrate that X-linked genes have significantly higher codon bias compared to autosomal genes in both Drosophila and Caenorhabditis. Furthermore, genes that become X-linked evolve higher codon bias gradually, over tens of millions of years. We provide several lines of evidence that this elevation in codon bias is due exclusively to their chromosomal location and not to any other property of X-linked genes. We present two possible explanations for these observations. One possibility is that natural selection is more efficient on the X chromosome due to effective haploidy of the X chromosomes in males and persistently low effective numbers of reproducing males compared to that of females. Alternatively, X-linked genes might experience stronger natural selection for higher codon bias as a result of maladaptive reduction of their dosage engendered by the loss of the Y-linked homologs.
Collapse
Affiliation(s)
- Nadia D Singh
- Stanford University, Stanford, California 94305, USA.
| | | | | |
Collapse
|
85
|
Subramanian S, Kumar S. Gene expression intensity shapes evolutionary rates of the proteins encoded by the vertebrate genome. Genetics 2005; 168:373-81. [PMID: 15454550 PMCID: PMC1448110 DOI: 10.1534/genetics.104.028944] [Citation(s) in RCA: 196] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Natural selection leaves its footprints on protein-coding sequences by modulating their silent and replacement evolutionary rates. In highly expressed genes in invertebrates, these footprints are seen in the higher codon usage bias and lower synonymous divergence. In mammals, the highly expressed genes have a shorter gene length in the genome and the breadth of expression is known to constrain the rate of protein evolution. Here we have examined how the rates of evolution of proteins encoded by the vertebrate genomes are modulated by the amount (intensity) of gene expression. To understand how natural selection operates on proteins that appear to have arisen in earlier and later phases of animal evolution, we have contrasted patterns of mouse proteins that have homologs in invertebrate and protist genomes (Precambrian genes) with those that do not have such detectable homologs (vertebrate-specific genes). We find that the intensity of gene expression relates inversely to the rate of protein sequence evolution on a genomic scale. The most highly expressed genes actually show the lowest total number of substitutions per polypeptide, consistent with cumulative effects of purifying selection on individual amino acid replacements. Precambrian genes exhibit a more pronounced difference in protein evolutionary rates (up to three times) between the genes with high and low expression levels as compared to the vertebrate-specific genes, which appears to be due to the narrower breadth of expression of the vertebrate-specific genes. These results provide insights into the differential relationship and effect of the increasing complexity of animal body form on evolutionary rates of proteins.
Collapse
Affiliation(s)
- Sankar Subramanian
- Center for Evolutionary Functional Genomics, The Biodesign Institute, Arizona State University, Tempe 85287-4501, USA
| | | |
Collapse
|
86
|
Marais G, Domazet-Loso T, Tautz D, Charlesworth B. Correlated evolution of synonymous and nonsynonymous sites in Drosophila. J Mol Evol 2005; 59:771-9. [PMID: 15599509 DOI: 10.1007/s00239-004-2671-2] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2004] [Accepted: 06/30/2004] [Indexed: 11/28/2022]
Abstract
Recent work has shown that Drosophila melanogaster genes with fast-evolving nonsynonymous sites have lower codon usage bias. This pattern has been attributed to interference between positive selection at nonsynonymous sites and weak selection on codon usage. Here we have looked for this correlation in a much larger and less biased dataset, comprising 630 gene pairs from D. melanogaster and D. yakuba. We confirmed that there is a negative correlation between the rate of nonsynonymous substitutions (d(N)) and codon bias in D. melanogaster. We then tested the interference hypothesis and other alternative explanations, including one involving gene expression. We found that d(N) indeed correlates with the level of gene expression. Given that gene expression is a strong determinant of codon bias, the relationship between d(N) and codon bias might be a by-product of gene expression. However, our tests show that none of the hypotheses we consider seem to explain the data fully.
Collapse
Affiliation(s)
- Gabriel Marais
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh, EH9 3JT, Scotland, UK
| | | | | | | |
Collapse
|
87
|
Hudler P, Vouk K, Voulk K, Liovic M, Repse S, Juvan R, Komel R. Mutations in the hMLH1 gene in Slovenian patients with gastric carcinoma. Clin Genet 2004; 65:405-11. [PMID: 15099349 DOI: 10.1111/j.0009-9163.2004.0234.x] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Alterations of multiple oncogenes and tumor suppressor genes, together with genetic instability, are responsible for carcinogenesis in gastric cancer. The microsatellite mutator phenotype is the cause of many somatic frameshift and point mutations in non-coding repetitive sequences and in coding regions associated with cell proliferation and apoptosis. Genetic mutations in hMLH1 and transcriptional silencing of its promoter by hypermethylation lead to the inactivation of the mismatch repair system. In our study, we screened for mutations the hMLH1 gene in patients expressing the microsatellite instability genotype by using single-strand conformational polymorphism analysis and direct sequencing. Seven changes were identified; of these, three (A92P, E433Q, and K618A) were germline mutations and the other four (IVS5 453 + 79 A > G, I219V, 1039 - 7 del (T)(n), and IVS15 1668 - 19 A > G) germline polymorphisms. A92P and E433Q are novel, previously unidentified mutations. In addition, we found a rather complex distribution of mutations and polymorphisms in individual patients and in two cases also a methylated hMLH1 promoter.
Collapse
Affiliation(s)
- P Hudler
- Medical Center for Molecular Biology, Institute of Biochemistry, Ljubljana, Slovenia
| | | | | | | | | | | | | |
Collapse
|
88
|
Bartolomé C, Maside X, Yi S, Grant AL, Charlesworth B. Patterns of selection on synonymous and nonsynonymous variants in Drosophila miranda. Genetics 2004; 169:1495-507. [PMID: 15545653 PMCID: PMC1449532 DOI: 10.1534/genetics.104.033068] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have investigated patterns of within-species polymorphism and between-species divergence for synonymous and nonsynonymous variants at a set of autosomal and X-linked loci of Drosophila miranda. D. pseudoobscura and D. affinis were used for the between-species comparisons. The results suggest the action of purifying selection on nonsynonymous, polymorphic variants. Among synonymous polymorphisms, there is a significant excess of synonymous mutations from preferred to unpreferred codons and of GC to AT mutations. There was no excess of GC to AT mutations among polymorphisms at noncoding sites. This suggests that selection is acting to maintain the use of preferred codons. Indirect evidence suggests that biased gene conversion in favor of GC base pairs may also be operating. The joint intensity of selection and biased gene conversion, in terms of the product of effective population size and the sum of the selection and conversion coefficients, was estimated to be approximately 0.65.
Collapse
Affiliation(s)
- Carolina Bartolomé
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, UK.
| | | | | | | | | |
Collapse
|
89
|
Chou T, Lakatos G. Clustered bottlenecks in mRNA translation and protein synthesis. PHYSICAL REVIEW LETTERS 2004; 93:198101. [PMID: 15600884 DOI: 10.1103/physrevlett.93.198101] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/23/2003] [Indexed: 05/24/2023]
Abstract
Using a model based on the totally asymmetric exclusion process, we investigate the effects of slow codons along messenger RNA. Ribosome density profiles near neighboring clusters of slow codons interact, enhancing suppression of ribosome throughput when such bottlenecks are closely spaced. Increasing the slow codon cluster size beyond approximately 3-4 codons does not significantly reduce the ribosome current. Our results are verified by both extensive Monte Carlo simulations and numerical calculation, and provide a biologically motivated explanation for the experimentally observed clustering of low-usage codons.
Collapse
Affiliation(s)
- Tom Chou
- Department of Biomathematics, UCLA, Los Angeles, California 90095, USA
| | | |
Collapse
|
90
|
Abstract
Patterns of codon usage bias were studied in the moss model species Physcomitrella patens. A total of 92 nuclear, protein coding genes were employed, and estimated levels of gene expression were tested for association with two measures of codon usage bias and other variables hypothesized to be associated with gene expression. Codon bias was found to be positively associated both with estimated levels of gene expression and GC content in the coding parts of studied genes. However, GC content in noncoding parts, that is, introns and 5' and 3' untranslated regions (UTRs), was not associated with estimated levels of gene expression. It is argued that codon bias is not shaped by mutational bias, but rather by weak natural selection for translational efficiency in P. patens. The possible role of life history characteristics in shaping patterns of codon usage in this species is discussed.
Collapse
Affiliation(s)
- H K Stenøien
- Plant Ecology/Department of Ecology and Evolution, Evolutionary Biology Centre, Uppsala University, Villav. 14, Uppsala SE-752 36, Sweden.
| |
Collapse
|
91
|
Desai D, Zhang K, Barik S, Srivastava A, Bolander MEME, Sarkar G. Intragenic codon bias in a set of mouse and human genes. J Theor Biol 2004; 230:215-25. [PMID: 15302553 DOI: 10.1016/j.jtbi.2004.05.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2004] [Revised: 05/06/2004] [Accepted: 05/06/2004] [Indexed: 11/20/2022]
Abstract
To better conceptualize the mechanism underlying the evolution of synonymous codons, we have analysed intragenic codon usage in chosen "regions" of some mouse and human genes. We divided a given gene into two regions: one consisting of a trinucleotide repeat (TNR) and the other consisting of the "rest of the coding region" (RCR). Usually, a TNR is composed of a repetitive single codon, which may reflect its frequency in a gene. In contrast, a non-random frequency of a codon in the RCR versus TNR (or vice versa) of a gene should indicate a bias for that codon within the TNR. We examined this scenario by comparing codon frequency between the RCR and the cognate TNR(s) for a set of human and mouse genes. A TNR length of six amino acids or more was used to identify genes from the Genbank database. Twenty nine human and twenty one mouse genes containing TNRs coding for nine different amino acid runs were identified. The ratio of codon frequency in a TNR versus the corresponding RCR was expressed as "fold change" which was also regarded as a measure of codon bias (defined as preferential use either in TNR or in RCR). Chi-square values were then determined from the distribution of codon frequency in a TNR vs. the cognate RCR. At p<0.001, 22% and 27%, respectively, of human and mouse TNRs showed codon bias. Greater than 40% of the TNRs (29 out of 69 in human, and 18 of 42 in mouse) showed codon bias at p<0.05. In addition, we identify eight single-codon TNRs in mouse and ten in human genes. Thus, our results show intragenic codon bias in both mouse and human genes expressed in diverse tissue types. Since our results are independent of the Codon Adaptation Index (CAI) and starvation CAI, and since the tRNA repertoire in a cell or in a tissue is constant, our data suggest that other constraints besides tRNA abundance played a role in creating intragenic codon bias in these genes.
Collapse
Affiliation(s)
- Dinakar Desai
- Department of Orthopedics, Mayo Clinic and Foundation, Medical Science Building 3-69, 200 1st Street, SW, Rochester, MN 55905, USA
| | | | | | | | | | | |
Collapse
|
92
|
Gu W, Zhou T, Ma J, Sun X, Lu Z. Analysis of synonymous codon usage in SARS Coronavirus and other viruses in the Nidovirales. Virus Res 2004; 101:155-61. [PMID: 15041183 PMCID: PMC7127446 DOI: 10.1016/j.virusres.2004.01.006] [Citation(s) in RCA: 129] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2003] [Revised: 01/09/2004] [Accepted: 01/09/2004] [Indexed: 12/04/2022]
Abstract
In this study, we calculated the codon usage bias in severe acute respiratory syndrome Coronavirus (SARSCoV) and performed a comparative analysis of synonymous codon usage patterns in SARSCoV and 10 other evolutionary related viruses in the Nidovirales. Although there is a significant variation in codon usage bias among different SARSCoV genes, codon usage bias in SARSCoV is a little slight, which is mainly determined by the base compositions on the third codon position. By comparing synonymous codon usage patterns in different viruses, we observed that synonymous codon usage pattern in these virus genes was virus specific and phylogenetically conserved, but it was not host specific. Phylogenetic analysis based on codon usage pattern suggested that SARSCoV was diverged far from all three known groups of Coronavirus. Compositional constraints could explain most of the variation of synonymous codon usage among these virus genes, while gene function is also correlated to synonymous codon usages to a certain extent. However, translational selection and gene length have no effect on the variations of synonymous codon usage in these virus genes.
Collapse
Affiliation(s)
| | | | | | | | - Zuhong Lu
- Corresponding author. Tel.: +86-25-83619983; fax: +86-25-83619983.
| |
Collapse
|
93
|
Cutter AD, Payseur BA, Salcedo T, Estes AM, Good JM, Wood E, Hartl T, Maughan H, Strempel J, Wang B, Bryan AC, Dellos M. Molecular correlates of genes exhibiting RNAi phenotypes in Caenorhabditis elegans. Genome Res 2004; 13:2651-7. [PMID: 14656969 PMCID: PMC403806 DOI: 10.1101/gr.1659203] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Understanding genome-wide links between genotype and phenotype has generally been difficult due to both the complexity of phenotypes, and until recently, inaccessibility to large numbers of genes that might underlie a trait. To address this issue, we establish the association between particular RNAi phenotypes in Caenorhabditis elegans and sequence characteristics of the corresponding proteins and DNA. We find that genes showing RNAi phenotypes are long and highly expressed with little noncoding DNA and high rates of synonymous site substitution (KS). In addition, genes conferring RNAi phenotypes have significantly lower rates of nonsynonymous site substitution (KA). Collectively, these sequence features explain nearly 20% of the difference between the sets of loci that display or lack a RNAi-mediated effect, and reflect aspects both of the RNAi mechanism and the biological function of the genes. For example, the particularly low rate of evolution of genes in the sterility RNAi phenotype class suggests a role of C. elegans life history in shaping these patterns of sequence and expression characteristics on phenotypes. This approach also allows prediction of a set of heretofore-uncharacterized loci for which we expect future RNAi studies to reveal phenotypic effects (i.e., false negatives in present screens).
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
94
|
Abstract
A variety of models propose that the accumulation of deleterious mutations plays an important role in the evolution of breeding systems. These models make predictions regarding the relative rates of protein evolution and deleterious mutation in taxa with contrasting modes of reproduction. Here we compare available coding sequences from one obligately outcrossing and two primarily selfing species of Caenorhabditis to explore the potential for mutational models to explain the evolution of breeding system in this clade. If deleterious mutations interact synergistically, the mutational deterministic hypothesis predicts that a high genomic deleterious mutation rate (U) will offset the reproductive disadvantage of outcrossing relative to asexual or selfing reproduction. Therefore, C. elegans and C. briggsae (both largely selfing) should both exhibit lower rates of deleterious mutation than the obligately outcrossing relative C. remanei. Using a comparative approach, we estimate U to be equivalent (and < 1) among all three related species. Stochastic mutational models, Muller's ratchet and Hill-Robertson interference, are expected to cause reductions in the effective population size in species that rarely outcross, thereby allowing deleterious mutations to accumulate at an elevated rate. We find only limited support for more rapid molecular evolution in selfing lineages. Overall, our analyses indicate that the evolution of breeding system in this group is unlikely to be explained solely by available mutational models.
Collapse
Affiliation(s)
- A D Cutter
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
95
|
Philibert RA, Nelson JJ, Sandhu HK, Crowe RR, Coryell WH. Association of an exonic LDHA polymorphism with altered respiratory response in probands at high risk for panic disorder. Am J Med Genet B Neuropsychiatr Genet 2003; 117B:11-7. [PMID: 12555229 DOI: 10.1002/ajmg.b.10015] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Panic disorder (PD) is a clinical syndrome characterized by recurrent discrete episodes of fear accompanied by a variety of physiological and psychological symptoms, often with prominent respiratory components. A series of clinical observations has led some investigators to hypothesize that subtle alterations in ventilatory regulation are integral to at least a subtype of PD. In order to identify genetic factors that might predispose individuals to these alterations in ventilatory response, we conducted single stranded conformation polymorphism analysis across the exons of the lactate dehydrogenase A and B genes (LDHA and LDHB) using DNA prepared from 86 subjects previously characterized by respiratory response to a CO(2) challenge with a variable genetic loading for PD. Remarkably, a single conserved LDHA exon 5 haplotype conferred increased risk for a paradoxical ventilatory response pattern to CO(2) inhalation which robustly separated well subjects at high risk for PD from low-risk control subjects. But, comparison of LDHA exon 5 genotypes in PD probands (n = 25) to that of random newborn controls (n = 182) did not demonstrate any significant differences. Given the pivotal role of LDH in the metabolism of lactate, a known inducer of panic attacks, and the dependence of LDH activity on cell pH, we suggest that LDHA polymorphisms may contribute to the variability to CO(2) respiratory challenge.
Collapse
Affiliation(s)
- Robert A Philibert
- Department of Psychiatry, The University of Iowa, Iowa City, Iowa 52242-1000, USA.
| | | | | | | | | |
Collapse
|
96
|
Betancourt AJ, Presgraves DC. Linkage limits the power of natural selection in Drosophila. Proc Natl Acad Sci U S A 2002; 99:13616-20. [PMID: 12370444 PMCID: PMC129723 DOI: 10.1073/pnas.212277199] [Citation(s) in RCA: 161] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2002] [Indexed: 11/18/2022] Open
Abstract
Population genetic theory shows that the efficacy of natural selection is limited by linkage-selection at one site interferes with selection at linked sites. Such interference slows adaptation in asexual genomes and may explain the evolutionary advantage of sex. Here, we test for two signatures of constraint caused by linkage in a sexual genome, by using sequence data from 255 Drosophila melanogaster and Drosophila simulans loci. We find that (i) the rate of protein adaptation is reduced in regions of low recombination, and (ii) evolution at strongly selected amino acid sites interferes with optimal codon usage at weakly selected, tightly linked synonymous sites. Together these findings suggest that linkage limits the rate and degree of adaptation even in recombining genomes.
Collapse
|
97
|
Marais G, Piganeau G. Hill-Robertson interference is a minor determinant of variations in codon bias across Drosophila melanogaster and Caenorhabditis elegans genomes. Mol Biol Evol 2002; 19:1399-406. [PMID: 12200468 DOI: 10.1093/oxfordjournals.molbev.a004203] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
According to population genetics models, genomic regions with lower crossing-over rates are expected to experience less effective selection because of Hill-Robertson interference (HRi). The effect of genetic linkage is thought to be particularly important for a selection of weak intensity such as selection affecting codon usage. Consistent with this model, codon bias correlates positively with recombination rate in Drosophila melanogaster and Caenorhabditis elegans. However, in these species, the G+C content of both noncoding DNA and synonymous sites correlates positively with recombination, which suggests that mutation patterns and recombination are associated. To remove this effect of mutation patterns on codon bias, we used the synonymous sites of lowly expressed genes that are expected to be effectively neutral sites. We measured the differences between codon biases of highly expressed genes and their lowly expressed neighbors. In D. melanogaster we find that HRi weakly reduces selection on codon usage of genes located in regions of very low recombination; but these genes only comprise 4% of the total. In C. elegans we do not find any evidence for the effect of recombination on selection for codon bias. Computer simulations indicate that HRi poorly enhances codon bias if the local recombination rate is greater than the mutation rate. This prediction of the model is consistent with our data and with the current estimate of the mutation rate in D. melanogaster. The case of C. elegans, which is highly self-fertilizing, is discussed. Our results suggest that HRi is a minor determinant of variations in codon bias across the genome.
Collapse
Affiliation(s)
- Gabriel Marais
- Laboratoire Biométrie et biologie évolutive, UMR CNRS 5558, Université Claude Bernard Lyon 1, Villeurbanne, France.
| | | |
Collapse
|
98
|
Abstract
The combination of complete genome sequence information and estimates of mRNA abundances have begun to reveal causes of both silent and protein sequence evolution. Translational selection appears to explain patterns of synonymous codon usage in many prokaryotes as well as a number of eukaryotic model organisms (with the notable exception of vertebrates). Relationships between gene length and codon usage bias, however, remain unexplained. Intriguing correlations between expression patterns and protein divergence suggest some general mechanisms underlying protein evolution.
Collapse
Affiliation(s)
- H Akashi
- Institute of Molecular Evolutionary Genetics and Department of Biology, 208 Mueller Laboratory, Pennsylvania State University, University Park, Pennsylvania 06138, USA.
| |
Collapse
|