1
|
Gutiérrez G, Casadesús J, Oliver JL, Marín A. Compositional heterogeneity of the Escherichia coli genome: a role for VSP repair? J Mol Evol 1994; 39:340-6. [PMID: 7966364 DOI: 10.1007/bf00160266] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
E. coli genes that contain a high frequency of the tetranucleotide CTAG are also rich in the tetramers CTTG, CCTA, CCAA, TTGG, TAGG, and CAAG (group-I tetramers). Conversely, E. coli genes lacking CTAG are rich in the tetranucleotides CCTG, CCAG, CTGG, and CAGG (group-II tetramers). These two gene samples differ also in codon usage, amino acid composition, frequency of Dcm sites, and contrast vocabularies. Group-I tetramers have in common that they are depleted by very-short-patch repair (VSP), while group-II tetramers are favored by VSP activity. The VSP system repairs G:T mismatches to G:C, thereby increasing the overall G+C content of the genome; for this reason the CTAG-rich sample has a lower G+C content than the CTAG-poor sample. This compositional heterogeneity can be tentatively explained by a low level of VSP activity on the CTAG-rich sample. A negative correlation is found between the frequency of group-I tetramers and the level of gene expression, as measured by the Codon Adaptation Index (CAI). A possible link between the rate of VSP activity and the level of gene expression is considered.
Collapse
Affiliation(s)
- G Gutiérrez
- Departamento de Genética, Universidad de Sevilla, Spain
| | | | | | | |
Collapse
|
2
|
Abstract
The codon usage (CU) of 34 genes from the closely related species, Brevibacterium lactofermentum and Corynebacterium glutamicum (BLCG), was analysed and compared with that of 23 genes from other Brevibacterium and Corynebacterium species. The G+C content of the BLCG genes ranged from 50 to 62%. A wider range was found in other corynebacterial genes (25-71%). The G+C contents of non-coding regions in glutamic acid bacteria are lower than those of the coding regions and both values are lower than the G+C content of ribosomal RNA (rRNA) sequences, suggesting an unusual biased mutation pressure. The CU and synonymous codon usage (SCU) analysis showed several common characteristics among the sequenced corynebacterial genes, consistent with the close relatedness of B. lactofermentum and C. glutamicum. A subset of 25 preferred codons were deduced from the presumably highly expressed genes and they encode most of the amino acid (aa) residues of the BLCG group. An analysis of the effective number of codons (Nc) was carried out in order to check the GC3s (G+C content at the silent third position of sense codons) dependence of the CU in corynebacteria. Nc values showed differences between the BLCG group and other corynebacterial sequences. A comparison of the most used codons for each aa showed a stronger similarity to Streptomyces than to Escherichia coli. The CU/SCU tables of corynebacteria are useful for identification of protein-coding regions, including start codons when they are uncertain, and for designing oligodeoxyribonucleotide probes from an aa sequence.
Collapse
Affiliation(s)
- M Malumbres
- Department of Ecology, Genetics and Microbiology, Faculty of Biology, University of León, Spain
| | | | | |
Collapse
|
3
|
Abstract
The DNA sequences of the recA gene from 25 strains of bacteria are known. The evolution of these recA gene sequences, and of the derived RecA protein sequences, is examined, with special reference to the effect of variations in genomic G + C content. From the aligned RecA protein sequences, phylogenetic trees have been drawn using both distance matrix and maximum parsimony methods. There is a broad concordance between these trees and those derived from other data (largely 16S ribosomal RNA sequences). There is a fair degree of certainty in the relationships among the "Purple" or Proteobacteria, but the branching pattern between higher taxa within the eubacteria cannot be reliably resolved with these data.
Collapse
Affiliation(s)
- A T Lloyd
- Department of Genetics, Trinity College, Dublin, Ireland
| | | |
Collapse
|
4
|
Vaughan LM, Smith PR, Foster TJ. An aromatic-dependent mutant of the fish pathogen Aeromonas salmonicida is attenuated in fish and is effective as a live vaccine against the salmonid disease furunculosis. Infect Immun 1993; 61:2172-81. [PMID: 8478107 PMCID: PMC280818 DOI: 10.1128/iai.61.5.2172-2181.1993] [Citation(s) in RCA: 69] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Aeromonas salmonicida is the etiological agent of furunculosis in salmonid fish. The disease is responsible for severe economic losses in intensively cultured salmon and trout. Bacterin vaccines provide inadequate protection against infection. We have constructed an aromatic-dependent mutant of A. salmonicida in order to investigate the possibility of an effective live-attenuated vaccine. The aroA gene of A. salmonicida was cloned in Escherichia coli, and the nucleotide sequence was determined. The codon usage pattern of aroA was found to be quite distinct from that of the vapA gene coding for the surface array protein layer (A layer). The aroA gene was inactivated by inserting a fragment expressing kanamycin resistance within the coding sequence. The aroA::Kar mutation was introduced into the chromosome of virulent A. salmonicida 644Rb and 640V2 by allele replacement by using a suicide plasmid delivery system. The aroA mutation did not revert at a detectable frequency (< 10(-11). The mutation resulted in attenuation when bacteria were injected intramuscularly into Atlantic salmon (Salmo salar L.). Introduction of the wild-type aroA gene into the A. salmonicida mutants on a broad-host-range plasmid restored virulence. A. salmonicida mutant 644Rb aroA::Kar persisted in the kidney of brown trout (Salmo trutta L.) for 12 days at 10 degrees C. Vaccination of brown trout with 10(7) CFU of A. salmonicida 644Rb aroA by intraperitoneal injection resulted in a 253-fold increase in the 50% lethal dose (LD50) compared with unvaccinated controls challenged with a virulent clinical isolate 9 weeks later. A second vaccination after 6 weeks increased the LD50 by a further 16-fold.
Collapse
Affiliation(s)
- L M Vaughan
- Department of Microbiology, Moyne Institute, Ireland
| | | | | |
Collapse
|
5
|
Sharp PM, Lloyd AT. Regional base composition variation along yeast chromosome III: evolution of chromosome primary structure. Nucleic Acids Res 1993; 21:179-83. [PMID: 8441625 PMCID: PMC309089 DOI: 10.1093/nar/21.2.179] [Citation(s) in RCA: 96] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The recent determination of the complete sequence of chromosome III from the yeast Saccharomyces cerevisiae allows, for the first time, the investigation of the long range primary structure of a eukaryotic chromosome. We have found that, against a background G+C level of about 35%, there are two regions (one in each chromosome arm) in which G+C values rise to over 50%. This effect is seen in silent sites within genes, but not in noncoding intergenic sequences. The variation in G+C content is not related to differential selection of synonymous codons, and probably reflects mutational biases. That the intergenic regions do not exhibit the same phenomenon is particularly interesting, and suggests that they are under substantial constraint. The yeast chromosome may be a model of the structure of the human genome, since there is evidence that it is also a mosaic of long regions of different base compositions, reflected in wide variation of G+C content at silent sites among genes. Two possible causes of this regional effect, replication timing, and recombination frequency, are discussed.
Collapse
Affiliation(s)
- P M Sharp
- Department of Genetics, Trinity College, Dublin, Ireland
| | | |
Collapse
|
6
|
Abstract
The codon usage (CU) patterns of 64 genes from the Gram+ prokaryotic genus Streptomyces were analysed. Despite the extremely high overall G+C content of the Streptomyces genome (estimated at 0.74), individual genes varied in G+C content from 0.610 to 0.797, and had third codon position G+C contents (GC3s) that varied from 0.764 to 0.983. The variation in GC3s explains a significant proportion of the variation in CU patterns. This is consistent with an evolutionary model of the Streptomyces genome where biased mutation pressure has led to a high average G+C content with random variation about the mean, although the variation observed is greater than that expected from a simple binomial model. The only gene in the sample that can be confidently predicted to be highly expressed, EF-Tu of Streptomyces coelicolor A3(2) (GC3s = 0.927), shows a preference for a third position C in several of the four codon families, and for CGY and GGY for Arg and Gly codons, respectively (Y = pyrimidine); similar CU patterns are found in highly expressed genes of the G+C-rich Micrococcus luteus genome. It thus appears that codon usage in Streptomyces is determined predominantly by mutation bias, with weak translational selection operating only in highly expressed genes. We discuss the possible consequences of the extreme codon bias of Streptomyces and consider how it may have evolved. A set of CU tables is provided for use with computer programs that locate protein-coding regions.
Collapse
Affiliation(s)
- F Wright
- Scottish Agricultural Statistics Service, University of Edinburgh, U.K
| | | |
Collapse
|
7
|
Abstract
We have investigated the compositional distributions of third codon positions of genes from the 16 prokaryotes and seven eukaryotes for which the largest numbers of coding sequences are available in data banks. In prokaryotes, both narrow and broad distributions were found. In eukaryotes, distributions were very broad (except for Saccharomyces cerevisiae) and remarkably different for different genomes. In low-GC genomes, third codon positions were lower in GC than first + second codon positions and trailed towards high GC; the opposite situation was found for high-GC genomes. In all genomes, first codon positions were higher in GC than second codon positions. We then investigated the compositional correlations between third and first + second codon positions in prokaryotic genomes (the 16 mentioned above plus 87 additional ones) and in genome compartments of eukaryotes. A general, common relationship was found, which also holds within the same (heterogeneous) genomes. This universal correlation is due to the fact that the relative effects of compositional constraints on different codon positions are the same, on the average, whatever the genome under consideration.
Collapse
Affiliation(s)
- G D'Onofrio
- Laboratoire de Génétique Moleculaire, Institut Jacques Monod, Paris, France
| | | |
Collapse
|
8
|
Abstract
A new statistical test has been developed to detect selection on silent sites. This test compares the codon usage within a gene and thus does not require knowledge of which genes are under the greatest selection, that there exist common trends in codon usage across genes, or that genes have the same mutation pattern. It also controls for mutational biases that might be introduced by the adjacent bases. The test was applied to 62 mammalian sequences, and significant codon usage biases were detected in all three species examined (humans, rats, and mice). However, these biases appear not to be the consequence of selection, but of the first base pair in the codon influencing the mutation pattern at the third position.
Collapse
Affiliation(s)
- A C Eyre-Walker
- Institute of Cell, Animal and Population Biology, University of Edinburgh, United Kingdom
| |
Collapse
|
9
|
Abstract
Synonymous and nonsynonymous substitution rates at the loci encoding glyceraldehyde-3-phosphate dehydrogenase (gap) and outer membrane protein 3A (ompA) were examined in 12 species of enteric bacteria. By examining homologous sequences in species of varying degrees of relatedness and of known phylogenetic relationships, we analyzed the patterns of synonymous and nonsynonymous substitutions within and among these genes. Although both loci accumulate synonymous substitutions at reduced rates due to codon usage bias, portions of the gap and ompA reading frames show significant deviation in synonymous substitution rates not attributable to local codon bias. A paucity of synonymous substitutions in portions of the ompA gene may reflect selection for a novel mRNA secondary structure. In addition, these studies allow comparisons of homologous protein-coding sequences (gap) in plants, animals, and bacteria, revealing differences in evolutionary constraints on this glycolytic enzyme in these lineages.
Collapse
Affiliation(s)
- J G Lawrence
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110
| | | | | |
Collapse
|
10
|
Sharp PM. Determinants of DNA sequence divergence between Escherichia coli and Salmonella typhimurium: codon usage, map position, and concerted evolution. J Mol Evol 1991; 33:23-33. [PMID: 1909371 DOI: 10.1007/bf02100192] [Citation(s) in RCA: 171] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
The nature and extent of DNA sequence divergence between homologous protein-coding genes from Escherichia coli and Salmonella typhimurium have been examined. The degree of divergence varies greatly among genes at both synonymous (silent) and nonsynonymous sites. Much of the variation in silent substitution rates can be explained by natural selection on synonymous codon usage, varying in intensity with gene expression level. Silent substitution rates also vary significantly with chromosomal location, with genes near oriC having lower divergence. Certain genes have been examined in more detail. In particular, the duplicate genes encoding elongation factor Tu, tufA and tufB, from S. typhimurium have been compared to their E. coli homologues. As expected these very highly expressed genes have high codon usage bias and have diverged very little between the two species. Interestingly, these genes, which are widely spaced on the bacterial chromosome, also appear to be undergoing concerted evolution, i.e., there has been exchange between the loci subsequent to the divergence of the two species.
Collapse
Affiliation(s)
- P M Sharp
- Department of Genetics, Trinity College, Dublin, Ireland
| |
Collapse
|
11
|
Abstract
Recent progress in studies on the bacterial chromosome is summarized. Although the greatest amount of information comes from studies on Escherichia coli, reports on studies of many other bacteria are also included. A compilation of the sizes of chromosomal DNAs as determined by pulsed-field electrophoresis is given, as well as a discussion of factors that affect gene dosage, including redundancy of chromosomes on the one hand and inactivation of chromosomes on the other hand. The distinction between a large plasmid and a second chromosome is discussed. Recent information on repeated sequences and chromosomal rearrangements is presented. The growing understanding of limitations on the rearrangements that can be tolerated by bacteria and those that cannot is summarized, and the sensitive region flanking the terminator loci is described. Sources and types of genetic variation in bacteria are listed, from simple single nucleotide mutations to intragenic and intergenic recombinations. A model depicting the dynamics of the evolution and genetic activity of the bacterial chromosome is described which entails acquisition by recombination of clonal segments within the chromosome. The model is consistent with the existence of only a few genetic types of E. coli worldwide. Finally, there is a summary of recent reports on lateral genetic exchange across great taxonomic distances, yet another source of genetic variation and innovation.
Collapse
Affiliation(s)
- S Krawiec
- Department of Biology, Lehigh University, Bethlehem, Pennsylvania 18015
| | | |
Collapse
|
12
|
Han KS, Archer JA, Sinskey AJ. The molecular structure of the Corynebacterium glutamicum threonine synthase gene. Mol Microbiol 1990; 4:1693-702. [PMID: 2127631 DOI: 10.1111/j.1365-2958.1990.tb00546.x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
The minimal region encoding the Corynebacterium glutamicum threonine synthase structural gene and its promoter was mapped by deletion analysis and complementation of the C. glutamicum thrC allele to a 1.6 kb region of the recombinant plasmid pFS80. The nucleotide sequence of this and flanking DNA was determined. The transcription and translation start points were identified by S1 mapping analysis and amino-terminal protein sequencing, respectively. The thrC gene encodes a 54481-Dalton polypeptide product. Translation of the thrC mRNA initiates only six nucleotides downstream from transcription. The length of the mRNA transcript is consistent with a single gene transcription unit. The C. glutamicum thrC gene is expressed independently of the other threonine-specific genes hom and thrB.
Collapse
Affiliation(s)
- K S Han
- Seoul Miwon Co. Ltd, Dobong-ku, Korea
| | | | | |
Collapse
|
13
|
Shields DC. Switches in species-specific codon preferences: the influence of mutation biases. J Mol Evol 1990; 31:71-80. [PMID: 2120452 DOI: 10.1007/bf02109476] [Citation(s) in RCA: 48] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
A model of synonymous codon usage is developed in which the most frequent codons are selectively advantageous because of their coadaptation with tRNA abundances. Random drift opposes the progress of this coevolution by pushing codon frequencies in the direction of the frequency that would result from mutation in the absence of selection. It is predicted that, within a certain range, an increased mutation bias away from an advantageous codon has little influence on its usage in highly expressed genes. However, a subsequent small increase in mutation bias over a critical range leads to a large reduction in the frequency of the codon. The switch in preference from one synonym to another is a sharp transition, with no stable intermediate state in which neither codon is advantageous. Codon usage patterns were compared among three related bacterial species of differing genomic G & C contents, Escherichia coli, Serratia marcescens, and Proteus vulgaris. It was found that although changes in mutation biases do not always result in switches in codon preferences, some switches have occurred in the direction of species-specific mutation biases. Fluctuating mutation biases may therefore be the main cause of differences between species in their codon preferences.
Collapse
Affiliation(s)
- D C Shields
- Department of Genetics, Trinity College, Dublin, Ireland
| |
Collapse
|