51
|
Helm RA, Porwollik S, Stanley AE, Maloy S, McClelland M, Rabsch W, Eisenstark A. Pigeon-associated strains of Salmonella enterica serovar Typhimurium phage type DT2 have genomic rearrangements at rRNA operons. Infect Immun 2004; 72:7338-41. [PMID: 15557664 PMCID: PMC529114 DOI: 10.1128/iai.72.12.7338-7341.2004] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Strains from a subgroup of Salmonella enterica serovar Typhimurium frequently associated with pigeon infections were tested for genomic anomalies and virulence in mice. Some strains have a genomic inversion between rrn operons. Two prophages found in the common laboratory strain LT2 were absent. Pigeon-associated strains are still virulent in mice.
Collapse
Affiliation(s)
- R Allen Helm
- Department of Microbiology-Immunology, Northwestern University School of Medicine, 303 E. Chicago Ave., Chicago, IL 60611, USA.
| | | | | | | | | | | | | |
Collapse
|
52
|
Abstract
The genomes from three mammals (human, mouse, and rat), two worms, and several yeasts have been sequenced, and more genomes will be completed in the near future for comparison with those of the major model organisms. Scientists have used various methods to align and compare the sequenced genomes to address critical issues in genome function and evolution. This review covers some of the major new insights about gene content, gene regulation, and the fraction of mammalian genomes that are under purifying selection and presumed functional. We review the evolutionary processes that shape genomes, with particular attention to variation in rates within genomes and along different lineages. Internet resources for accessing and analyzing the treasure trove of sequence alignments and annotations are reviewed, and we discuss critical problems to address in new bioinformatic developments in comparative genomics.
Collapse
Affiliation(s)
- Webb Miller
- The Center for Comparative Genomics and Bioinformatics, The Huck Institutes of Life Sciences, Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA.
| | | | | | | |
Collapse
|
53
|
Washietl S, Hofacker IL. Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics. J Mol Biol 2004; 342:19-30. [PMID: 15313604 DOI: 10.1016/j.jmb.2004.07.018] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2004] [Revised: 07/05/2004] [Accepted: 07/09/2004] [Indexed: 01/22/2023]
Abstract
Facing the ever-growing list of newly discovered classes of functional RNAs, it can be expected that further types of functional RNAs are still hidden in recently completed genomes. The computational identification of such RNA genes is, therefore, of major importance. While most known functional RNAs have characteristic secondary structures, their free energies are generally not statistically significant enough to distinguish RNA genes from the genomic background. Additional information is required. Considering the wide availability of new genomic data of closely related species, comparative studies seem to be the most promising approach. Here, we show that prediction of consensus structures of aligned sequences can be a significant measure to detect functional RNAs. We report a new method to test multiple sequence alignments for the existence of an unusually structured and conserved fold. We show for alignments of six types of well-known functional RNA that an energy score consisting of free energy and a covariation term significantly improves sensitivity compared to single sequence predictions. We further test our method on a number of non-coding RNAs from Caenorhabditis elegans/Caenorhabditis briggsae and seven Saccharomyces species. Most RNAs can be detected with high significance. We provide a Perl implementation that can be used readily to score single alignments and discuss how the methods described here can be extended to allow for efficient genome-wide screens.
Collapse
Affiliation(s)
- Stefan Washietl
- Institut für Theoretische Chemie und Molekulare Strukturbiologie, Universität Wien, Währingerstrasse 17, A-1090, Austria
| | | |
Collapse
|
54
|
Thomson N, Baker S, Pickard D, Fookes M, Anjum M, Hamlin N, Wain J, House D, Bhutta Z, Chan K, Falkow S, Parkhill J, Woodward M, Ivens A, Dougan G. The role of prophage-like elements in the diversity of Salmonella enterica serovars. J Mol Biol 2004; 339:279-300. [PMID: 15136033 DOI: 10.1016/j.jmb.2004.03.058] [Citation(s) in RCA: 82] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2003] [Revised: 03/19/2004] [Accepted: 03/22/2004] [Indexed: 10/26/2022]
Abstract
The Salmonella enterica serovar Typhi CT18 (S.Typhi) chromosome harbours seven distinct prophage-like elements, some of which may encode functional bacteriophages. In silico analyses were used to investigate these regions in S.Typhi CT18, and ultimately compare these integrated bacteriophages against 40 other Salmonella isolates using DNA microarray technology. S.Typhi CT18 contains prophages that show similarity to the lambda, Mu, P2 and P4 bacteriophage families. When compared to other S.Typhi isolates, these elements were generally conserved, supporting a clonal origin of this serovar. However, distinct variation was detected within a broad range of Salmonella serovars; many of the prophage regions are predicted to be specific to S.Typhi. Some of the P2 family prophage analysed have the potential to carry non-essential "cargo" genes within the hyper-variable tail region, an observation that suggests that these bacteriophage may confer a level of specialisation on their host. Lysogenic bacteriophages therefore play a crucial role in the generation of genetic diversity within S.enterica.
Collapse
Affiliation(s)
- Nicholas Thomson
- The Pathogen Sequencing Unit, The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, United Kingdom.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
55
|
Shakeri-Garakani A, Brinkkötter A, Schmid K, Turgut S, Lengeler JW. The genes and enzymes for the catabolism of galactitol, D-tagatose, and related carbohydrates in Klebsiella oxytoca M5a1 and other enteric bacteria display convergent evolution. Mol Genet Genomics 2004; 271:717-28. [PMID: 15257457 DOI: 10.1007/s00438-004-1022-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2004] [Accepted: 04/30/2004] [Indexed: 01/03/2023]
Abstract
Enteric bacteria (Enteriobacteriaceae) carry on their single chromosome about 4000 genes that all strains have in common (referred to here as "obligatory genes"), and up to 1300 "facultative" genes that vary from strain to strain and from species to species. In closely related species, obligatory and facultative genes are orthologous genes that are found at similar loci. We have analyzed a set of facultative genes involved in the degradation of the carbohydrates galactitol, D-tagatose, D-galactosamine and N-acetyl-galactosamine in various pathogenic and non-pathogenic strains of these bacteria. The four carbohydrates are transported into the cell by phosphotransferase (PTS) uptake systems, and are metabolized by closely related or even identical catabolic enzymes via pathways that share several intermediates. In about 60% of Escherichia coli strains the genes for galactitol degradation map to a gat operon at 46.8 min. In strains of Salmonella enterica, Klebsiella pneumoniae and K. oxytoca, the corresponding gat genes, although orthologous to their E. coli counterparts, are found at 70.7 min, clustered in a regulon together with three tag genes for the degradation of D-tagatose, an isomer of D-fructose. In contrast, in all the E. coli strains tested, this chromosomal site was found to be occupied by an aga/kba gene cluster for the degradation of D-galactosamine and N-acetyl-galactosamine. The aga/kba and the tag genes were paralogous either to the gat cluster or to the fru genes for degradation of D-fructose. Finally, in more then 90% of strains of both Klebsiella species, and in about 5% of the E. coli strains, two operons were found at 46.8 min that comprise paralogous genes for catabolism of the isomers D-arabinitol (genes atl or dal) and ribitol (genes rtl or rbt). In these strains gat genes were invariably absent from this location, and they were totally absent in S. enterica. These results strongly indicate that these various gene clusters and metabolic pathways have been subject to convergent evolution among the Enterobacteriaceae. This apparently involved recent horizontal gene transfer and recombination events, as indicated by major chromosomal rearrangements found in their immediate vicinity.
Collapse
Affiliation(s)
- A Shakeri-Garakani
- Fachbereich Biologie/Chemie, Universität Osnabrück, Barbarastr.11, 49069, Osnabrück, Germany
| | | | | | | | | |
Collapse
|
56
|
Porwollik S, Wong RMY, Helm RA, Edwards KK, Calcutt M, Eisenstark A, McClelland M. DNA amplification and rearrangements in archival Salmonella enterica serovar Typhimurium LT2 cultures. J Bacteriol 2004; 186:1678-82. [PMID: 14996798 PMCID: PMC355959 DOI: 10.1128/jb.186.6.1678-1682.2004] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Variations in genome size and gene order were observed in archival Salmonella enterica serovar Typhimurium cultures stored for over 40 years. In one strain, microarray analysis revealed a large, stable amplification. PCR analysis of the same strain revealed a genomic duplication that underwent a translocation. Other strains had smaller duplications and deletions. These results demonstrate that storage in stabs over time at room temperature not only allows for further bacterial growth but also may produce an environment that selects for a variety of mutations, including genomic rearrangements.
Collapse
|
57
|
Fernandez FJ, Vega MC, Lehmann F, Sandmeier E, Gehring H, Christen P, Wilmanns M. Structural studies of the catalytic reaction pathway of a hyperthermophilic histidinol-phosphate aminotransferase. J Biol Chem 2004; 279:21478-88. [PMID: 15007066 DOI: 10.1074/jbc.m400291200] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
In histidine biosynthesis, histidinol-phosphate aminotransferase catalyzes the transfer of the amino group from glutamate to imidazole acetol-phosphate producing 2-oxoglutarate and histidinol phosphate. In some organisms such as the hyperthermophile Thermotoga maritima, specific tyrosine and aromatic amino acid transaminases have not been identified to date, suggesting an additional role for histidinol-phosphate aminotransferase in other transamination reactions generating aromatic amino acids. To gain insight into the specific function of this transaminase, we have determined its crystal structure in the absence of any ligand except phosphate, in the presence of covalently bound pyridoxal 5'-phosphate, of the coenzyme histidinol phosphate adduct, and of pyridoxamine 5'-phosphate. The enzyme accepts histidinol phosphate, tyrosine, tryptophan, and phenylalanine, but not histidine, as substrates. The structures provide a model of how these different substrates could be accommodated by histidinol-phosphate aminotransferase. Some of the structural features of the enzyme are more preserved between the T. maritima enzyme and a related threonine-phosphate decarboxylase from S. typhimurium than with histidinol-phosphate aminotransferases from different organisms.
Collapse
Affiliation(s)
- Francisco J Fernandez
- EMBL c/o Deutsches Elektronen-Synchrotron, Notkestrasse 85, Building 25A, D-22603 Hamburg, Germany
| | | | | | | | | | | | | |
Collapse
|
58
|
Sibley MH, Raleigh EA. Cassette-like variation of restriction enzyme genes in Escherichia coli C and relatives. Nucleic Acids Res 2004; 32:522-34. [PMID: 14744977 PMCID: PMC373321 DOI: 10.1093/nar/gkh194] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2003] [Revised: 12/08/2003] [Accepted: 12/08/2003] [Indexed: 11/13/2022] Open
Abstract
A surprising result of comparative bacterial genomics has been the large amount of DNA found to be present in one strain but not in another of the same species. We examine in detail one location where gene content varies extensively, the restriction cluster in Escherichia coli. This region is designated the Immigration Control Region (ICR) for the density and variability of restriction functions found there. To better define the boundaries of this variable locus, we determined the sequence of the region from a restrictionless strain, E.coli C. Here we compare the 13.7 kb E.coli C sequence spanning the site of the ICR with corresponding sequences from five E.coli strains and Salmonella typhimurium LT2. To discuss this variation, we adopt the term 'framework' to refer to genes that are stable components of genomes within related lineages, while 'migratory' genes are transient inhabitants of the genome. Strikingly, seven different migratory DNA segments, encoding different sets of genes and gene fragments, alternatively occupy a single well-defined location in the seven strains examined. The flanking framework genes, yjiS and yjiA, display approximately normal patterns of conservation. The patterns observed are consistent with the action of a site-specific recombinase. Since no nearby gene codes for a likely recombinase of known families, such a recombinase must be of a new family or unlinked.
Collapse
|
59
|
Tchufistova LS, Komarova AV, Boni IV. A key role for the mRNA leader structure in translational control of ribosomal protein S1 synthesis in gamma-proteobacteria. Nucleic Acids Res 2003; 31:6996-7002. [PMID: 14627832 PMCID: PMC290253 DOI: 10.1093/nar/gkg883] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2003] [Revised: 09/16/2003] [Accepted: 10/07/2003] [Indexed: 11/13/2022] Open
Abstract
The translation initiation region (TIR) of the Escherichia coli rpsA mRNA coding for ribosomal protein S1 is characterized by a remarkable efficiency in driving protein synthesis despite the absence of the canonical Shine-Dalgarno element, and by a strong and specific autogenous repression in the presence of free S1 in trans. The efficient and autoregulated E.coli rpsA TIR comprises not less than 90 nt upstream of the translation start and can be unambiguously folded into three irregular hairpins (HI, HII and HIII) separated by A/U-rich single-stranded regions (ss1 and ss2). Phylogenetic comparison revealed that this specific fold is highly conserved in the gamma-subdivision of proteobacteria (but not in other subdivisions), except for the Pseudomonas group. To test phylogenetic predictions experimentally, we have generated rpsA'-'lacZ translational fusions by inserting the rpsA TIRs from various gamma-proteobacteria in-frame with the E.coli chromosomal lacZ gene. Measurements of their translation efficiency and negative regulation by excess protein S1 in trans have shown that only those rpsA TIRs which share the structural features with that of E.coli can govern efficient and regulated translation. We conclude that the E.coli-like mechanism for controlling the efficiency of protein S1 synthesis evolved after divergence of Pseudomona.
Collapse
Affiliation(s)
- Ludmila S Tchufistova
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, 117997 Moscow, Russia
| | | | | |
Collapse
|
60
|
Abstract
Comparing the genomes of two different species allow the exploration of a host of intriguing evolutionary and genetic questions
Collapse
|
61
|
Abstract
Bacterial genome nucleotide sequences are being completed at a rapid and increasing rate. Integrated virus genomes (prophages) are common in such genomes. Fifty-one of the 82 such genomes published to date carry prophages, and these contain 230 recognizable putative prophages. Prophages can constitute as much as 10-20% of a bacterium's genome and are major contributors to differences between individuals within species. Many of these prophages appear to be defective and are in a state of mutational decay. Prophages, including defective ones, can contribute important biological properties to their bacterial hosts. Therefore, if we are to comprehend bacterial genomes fully, it is essential that we are able to recognize accurately and understand their prophages from nucleotide sequence analysis. Analysis of the evolution of prophages can shed light on the evolution of both bacteriophages and their hosts. Comparison of the Rac prophages in the sequenced genomes of three Escherichia coli strains and the Pnm prophages in two Neisseria meningitidis strains suggests that some prophages can lie in residence for very long times, perhaps millions of years, and that recombination events have occurred between related prophages that reside at different locations in a bacterium's genome. In addition, many genes in defective prophages remain functional, so a significant portion of the temperate bacteriophage gene pool resides in prophages.
Collapse
Affiliation(s)
- Sherwood Casjens
- Department of Pathology, University of Utah Medical School, Salt Lake City, UT 84132-2501, USA.
| |
Collapse
|
62
|
Kellis M, Patterson N, Endrizzi M, Birren B, Lander ES. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 2003; 423:241-54. [PMID: 12748633 DOI: 10.1038/nature01644] [Citation(s) in RCA: 1305] [Impact Index Per Article: 62.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2003] [Accepted: 04/01/2003] [Indexed: 11/09/2022]
Abstract
Identifying the functional elements encoded in a genome is one of the principal challenges in modern biology. Comparative genomics should offer a powerful, general approach. Here, we present a comparative analysis of the yeast Saccharomyces cerevisiae based on high-quality draft sequences of three related species (S. paradoxus, S. mikatae and S. bayanus). We first aligned the genomes and characterized their evolution, defining the regions and mechanisms of change. We then developed methods for direct identification of genes and regulatory motifs. The gene analysis yielded a major revision to the yeast gene catalogue, affecting approximately 15% of all genes and reducing the total count by about 500 genes. The motif analysis automatically identified 72 genome-wide elements, including most known regulatory motifs and numerous new motifs. We inferred a putative function for most of these motifs, and provided insights into their combinatorial interactions. The results have implications for genome analysis of diverse organisms, including the human.
Collapse
Affiliation(s)
- Manolis Kellis
- Whitehead/MIT Center for Genome Research, Nine Cambridge Center, Cambridge, Massachusetts 02142, USA.
| | | | | | | | | |
Collapse
|
63
|
Ureta-Vidal A, Ettwiller L, Birney E. Comparative genomics: genome-wide analysis in metazoan eukaryotes. Nat Rev Genet 2003; 4:251-62. [PMID: 12671656 DOI: 10.1038/nrg1043] [Citation(s) in RCA: 156] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The increasing number of complete and nearly complete metazoan genome sequences provides a significant amount of material for large-scale comparative genomic analysis. Finding new effective methods to analyse such enormous datasets has been the object of intense research. Three main areas in comparative genomics have recently shown important developments: whole-genome alignment, gene prediction and regulatory-region prediction. Each of these areas improves the methods of deciphering long genomic sequences and uncovering what lies hidden in them.
Collapse
Affiliation(s)
- Abel Ureta-Vidal
- EnsEMBL Project, Room A2-06, EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | | | | |
Collapse
|
64
|
Seputiene V, Motiejūnas D, Suziedelis K, Tomenius H, Normark S, Melefors O, Suziedeliene E. Molecular characterization of the acid-inducible asr gene of Escherichia coli and its role in acid stress response. J Bacteriol 2003; 185:2475-84. [PMID: 12670971 PMCID: PMC152617 DOI: 10.1128/jb.185.8.2475-2484.2003] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2002] [Accepted: 01/29/2003] [Indexed: 11/20/2022] Open
Abstract
Enterobacteria have developed numerous constitutive and inducible strategies to sense and adapt to an external acidity. These molecular responses require dozens of specific acid shock proteins (ASPs), as shown by genomic and proteomic analysis. Most of the ASPs remain poorly characterized, and their role in the acid response and survival is unknown. We recently identified an Escherichia coli gene, asr (acid shock RNA), encoding a protein of unknown function, which is strongly induced by high environmental acidity (pH < 5.0). We show here that Asr is required for growth at moderate acidity (pH 4.5) as well as for the induction of acid tolerance at moderate acidity, as shown by its ability to survive subsequent transfer to extreme acidity (pH 2.0). Sodium dodecyl sulfate-polyacrylamide gel electrophoresis and Western analysis of acid-shocked E. coli cells harboring a plasmid-borne asr gene demonstrated that the Asr protein is synthesized as a precursor with an apparent molecular mass of 18 kDa. Mutational studies of the asr gene also demonstrated the Asr preprotein contains 102 amino acids. This protein is subjected to an N-terminal cleavage of the signal peptide and a second processing event, yielding 15- and 8-kDa products, respectively. Only the 8-kDa polypeptide was detected in acid-shocked cells containing only the chromosomal copy of the asr gene. N-terminal sequencing and site-directed mutagenesis revealed the two processing sites in the Asr protein precursor. Deletion of amino acids encompassing the processing site required for release of the 8-kDa protein resulted in an acid-sensitive phenotype similar to that observed for the asr null mutant, suggesting that the 8-kDa product plays an important role in the adaptation to acid shock. Analysis of Asr:PhoA fusions demonstrated a periplasmic location for the Asr protein after removal of the signal peptide. Homologues of the asr gene from other Enterobacteriaceae were cloned and shown to be induced in E. coli under acid shock conditions.
Collapse
Affiliation(s)
- Vaida Seputiene
- Department of Biochemistry and Biophysics, Vilnius University, Vilnius LT-2009, Lithuania
| | | | | | | | | | | | | |
Collapse
|
65
|
Frazer KA, Elnitski L, Church DM, Dubchak I, Hardison RC. Cross-species sequence comparisons: a review of methods and available resources. Genome Res 2003; 13:1-12. [PMID: 12529301 PMCID: PMC430969 DOI: 10.1101/gr.222003] [Citation(s) in RCA: 155] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
With the availability of whole-genome sequences for an increasing number of species, we are now faced with the challenge of decoding the information contained within these DNA sequences. Comparative analysis of DNA sequences from multiple species at varying evolutionary distances is a powerful approach for identifying coding and functional noncoding sequences, as well as sequences that are unique for a given organism. In this review, we outline the strategy for choosing DNA sequences from different species for comparative analyses and describe the methods used and the resources publicly available for these studies.
Collapse
Affiliation(s)
- Kelly A Frazer
- Perlegen Sciences, Mountain View, California 94043, USA.
| | | | | | | | | |
Collapse
|
66
|
Jahreis K, Bentler L, Bockmann J, Hans S, Meyer A, Siepelmeyer J, Lengeler JW. Adaptation of sucrose metabolism in the Escherichia coli wild-type strain EC3132. J Bacteriol 2002; 184:5307-16. [PMID: 12218016 PMCID: PMC135337 DOI: 10.1128/jb.184.19.5307-5316.2002] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Although Escherichia coli strain EC3132 possesses a chromosomally encoded sucrose metabolic pathway, its growth on low sucrose concentrations (5 mM) is unusually slow, with a doubling time of 20 h. In this report we describe the subcloning and further characterization of the corresponding csc genes and adjacent genes. The csc regulon comprises three genes for a sucrose permease, a fructokinase, and a sucrose hydrolase (genes cscB, cscK, and cscA, respectively). The genes are arranged in two operons and are negatively controlled at the transcriptional level by the repressor CscR. Furthermore, csc gene expression was found to be cyclic AMP-CrpA dependent. A comparison of the genomic sequences of the E. coli strains EC3132, K-12, and O157:H7 in addition to Salmonella enterica serovar Typhimurium LT2 revealed that the csc genes are located in a hot spot region for chromosomal rearrangements in enteric bacteria. The comparison further indicated that the csc genes might have been transferred relatively recently to the E. coli wild-type EC3132 at around the time when the different strains of the enteric bacteria diverged. We found evidence that a mobile genetic element, which used the gene argW for site-specific integration into the chromosome, was probably involved in this horizontal gene transfer and that the csc genes are still in the process of optimal adaptation to the new host. Selection for such adaptational mutants growing faster on low sucrose concentrations gave three different classes of mutants. One class comprised cscR(Con) mutations that expressed all csc genes constitutively. The second class constituted a cscKo operator mutation, which became inducible for csc gene expression at low sucrose concentrations. The third class was found to be a mutation in the sucrose permease that caused an increase in transport activity.
Collapse
Affiliation(s)
- Knut Jahreis
- Arbeitsgruppe Genetik, Fachbereich Biologie/Chemie, Universität Osnabrück, D-49069 Osnabrück, Germany.
| | | | | | | | | | | | | |
Collapse
|
67
|
Porwollik S, Wong RMY, McClelland M. Evolutionary genomics of Salmonella: gene acquisitions revealed by microarray analysis. Proc Natl Acad Sci U S A 2002; 99:8956-61. [PMID: 12072558 PMCID: PMC124405 DOI: 10.1073/pnas.122153699] [Citation(s) in RCA: 208] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The presence of homologues of Salmonella enterica sv. Typhimurium LT2 genes was assessed in 22 other Salmonella including members of all seven subspecies and Salmonella bongori. Genomes were hybridized to a microarray of over 97% of the 4,596 annotated ORFs in the LT2 genome. A phylogenetic tree based on homologue content, relative to LT2, was largely concordant with previous studies using sequence information from several loci. Based on the topology of this tree, homologues of genes in LT2 acquired by various clades were predicted including 513 homologues acquired by the ancestor of all Salmonella, 111 acquired by S. enterica, 105 by diphasic Salmonella, and 216 by subspecies 1, most of which are of unknown function. Because this subspecies is responsible for almost all Salmonella infections of mammals and birds, these genes will be of particular interest for further mechanistic studies. Overall, a high level of gene gain, loss, or rapid divergence was predicted along all lineages. For example, at least 425 close homologues of LT2 genes may have been laterally transferred into Salmonella and then between Salmonella lineages.
Collapse
Affiliation(s)
- Steffen Porwollik
- Sidney Kimmel Cancer Center, 10835 Altman Row, San Diego, CA 92121, USA
| | | | | |
Collapse
|
68
|
Kolisnychenko V, Plunkett G, Herring CD, Fehér T, Pósfai J, Blattner FR, Pósfai G. Engineering a reduced Escherichia coli genome. Genome Res 2002; 12:640-7. [PMID: 11932248 PMCID: PMC187512 DOI: 10.1101/gr.217202] [Citation(s) in RCA: 202] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Our goal is to construct an improved Escherichia coli to serve both as a better model organism and as a more useful technological tool for genome science. We developed techniques for precise genomic surgery and applied them to deleting the largest K-islands of E. coli, identified by comparative genomics as recent horizontal acquisitions to the genome. They are loaded with cryptic prophages, transposons, damaged genes, and genes of unknown function. Our method leaves no scars or markers behind and can be applied sequentially. Twelve K-islands were successfully deleted, resulting in an 8.1% reduced genome size, a 9.3% reduction of gene count, and elimination of 24 of the 44 transposable elements of E. coli. These are particularly detrimental because they can mutagenize the genome or transpose into clones being propagated for sequencing, as happened in 18 places of the draft human genome sequence. We found no change in the growth rate on minimal medium, confirming the nonessential nature of these islands. This demonstration of feasibility opens the way for constructing a maximally reduced strain, which will provide a clean background for functional genomics studies, a more efficient background for use in biotechnology applications, and a unique tool for studies of genome stability and evolution.
Collapse
|
69
|
Abstract
Information technologies for chemical structure prediction, heterogeneous database access, pattern discovery, and systems and molecular modeling have evolved to become core components of the modern drug discovery process. As this evolution continues, the balance between in silico modeling and 'wet' chemistry will continue to shift and it might eventually be possible to step through the discovery pipeline without the aid of traditional laboratory techniques. Rapid advances in the industrialization of gene sequencing combined with databases of protein sequence and structure have created a target-rich but lead-poor environment. During the next decade, newer information technologies that facilitate the molecular modeling of drug-target interactions are likely to shift this balance towards molecular-based personalized medicine -- the ultimate goal of the drug discovery process.
Collapse
Affiliation(s)
- Jeffrey Augen
- Strategy, IBM Life Sciences, Route 100, Somers, NY 10589, USA.
| |
Collapse
|
70
|
Abstract
As the number of completed genome sequences increases, there is increasing emphasis on comparative genomic analysis of closely related organisms. Comparison of the similarities and differences between the five publicly available Salmonella genome sequences reveals extensive sequence conservation among the Salmonella serovars. However, horizontal gene transfer has provided each genome with between 10% and 12% of unique DNA. Genome comparisons of the closely related salmonellae emphasize the insights that can be gleaned from sequencing genomes of a single species.
Collapse
Affiliation(s)
- Robert A Edwards
- University of Tennessee Health Sciences Center, MSB 101 858 Madison Ave, Memphis, TN 38163, USA.
| | | | | |
Collapse
|
71
|
Abstract
Initial human genome sequence analysis has revealed large segments of nearly identical sequence in particular chromosomal regions. The recent origin of these segments and their abundance (approximately 5%) has challenged investigators to elucidate their underlying mechanism and role in primate genome evolution. Although the precise fraction is unknown, some of these duplicated segments have recently been shown to be associated with rapid gene innovation and chromosomal rearrangement in the genomes of man and the great apes.
Collapse
Affiliation(s)
- Rhea Vallente Samonte
- Department of Genetics and Center for Human Genetics, School of Medicine and University Hospitals of Cleveland, Case Western Reserve University, Cleveland, Ohio 44106, USA.
| | | |
Collapse
|
72
|
Kingsley RA, Bäumler AJ. Pathogenicity Islands and Host Adaptation of Salmonella Serovars. Curr Top Microbiol Immunol 2002. [DOI: 10.1007/978-3-642-56031-6_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
73
|
Díaz E, Ferrández A, Prieto MA, García JL. Biodegradation of aromatic compounds by Escherichia coli. Microbiol Mol Biol Rev 2001; 65:523-69, table of contents. [PMID: 11729263 PMCID: PMC99040 DOI: 10.1128/mmbr.65.4.523-569.2001] [Citation(s) in RCA: 288] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Although Escherichia coli has long been recognized as the best-understood living organism, little was known about its abilities to use aromatic compounds as sole carbon and energy sources. This review gives an extensive overview of the current knowledge of the catabolism of aromatic compounds by E. coli. After giving a general overview of the aromatic compounds that E. coli strains encounter and mineralize in the different habitats that they colonize, we provide an up-to-date status report on the genes and proteins involved in the catabolism of such compounds, namely, several aromatic acids (phenylacetic acid, 3- and 4-hydroxyphenylacetic acid, phenylpropionic acid, 3-hydroxyphenylpropionic acid, and 3-hydroxycinnamic acid) and amines (phenylethylamine, tyramine, and dopamine). Other enzymatic activities acting on aromatic compounds in E. coli are also reviewed and evaluated. The review also reflects the present impact of genomic research and how the analysis of the whole E. coli genome reveals novel aromatic catabolic functions. Moreover, evolutionary considerations derived from sequence comparisons between the aromatic catabolic clusters of E. coli and homologous clusters from an increasing number of bacteria are also discussed. The recent progress in the understanding of the fundamentals that govern the degradation of aromatic compounds in E. coli makes this bacterium a very useful model system to decipher biochemical, genetic, evolutionary, and ecological aspects of the catabolism of such compounds. In the last part of the review, we discuss strategies and concepts to metabolically engineer E. coli to suit specific needs for biodegradation and biotransformation of aromatics and we provide several examples based on selected studies. Finally, conclusions derived from this review may serve as a lead for future research and applications.
Collapse
Affiliation(s)
- E Díaz
- Department of Molecular Microbiology, Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas, 28006 Madrid, Spain.
| | | | | | | |
Collapse
|
74
|
Moran NA, Mira A. The process of genome shrinkage in the obligate symbiont Buchnera aphidicola. Genome Biol 2001; 2:RESEARCH0054. [PMID: 11790257 PMCID: PMC64839 DOI: 10.1186/gb-2001-2-12-research0054] [Citation(s) in RCA: 175] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2001] [Revised: 10/12/2001] [Accepted: 10/15/2001] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Very small genomes have evolved repeatedly in eubacterial lineages that have adopted obligate associations with eukaryotic hosts. Complete genome sequences have revealed that small genomes retain very different gene sets, raising the question of how final genome content is determined. To examine the process of genome reduction, the tiny genome of the endosymbiont Buchnera aphidicola was compared to the larger ancestral genome, reconstructed on the basis of the phylogenetic distribution of gene orthologs among fully sequenced relatives of Escherichia coli and Buchnera. RESULTS The reconstructed ancestral genome contained 2,425 open reading frames (ORFs). The Buchnera genome, containing 564 ORFs, consists of 153 fragments of 1-34 genes that are syntenic with reconstructed ancestral regions. On the basis of this reconstruction, 503 genes were eliminated within syntenic fragments, and 1,403 genes were lost from the gaps between syntenic fragments, probably in connection with genome rearrangements. Lost regions are sometimes large, and often span functionally unrelated genes. In addition, individual genes and regulatory regions have been lost or eroded. For the categories of DNA repair genes and rRNA genes, most lost loci fall in regions between syntenic fragments. This history of gene loss is reflected in the sequences of intergenic spacers at positions where genes were once present. CONCLUSIONS The most plausible interpretation of this reconstruction is that Buchnera lost many genes through the fixation of large deletions soon after the acquisition of an obligate endosymbiotic lifestyle. An implication is that final genome composition may be partly the chance outcome of initial deletions and that neighboring genes influence the likelihood of loss of particular genes and pathways.
Collapse
Affiliation(s)
- N A Moran
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
75
|
Goss TJ, Perez-Matos A, Bender RA. Roles of glutamate synthase, gltBD, and gltF in nitrogen metabolism of Escherichia coli and Klebsiella aerogenes. J Bacteriol 2001; 183:6607-19. [PMID: 11673431 PMCID: PMC95492 DOI: 10.1128/jb.183.22.6607-6619.2001] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Mutants of Escherichia coli and Klebsiella aerogenes that are deficient in glutamate synthase (glutamate-oxoglutarate amidotransferase [GOGAT]) activity have difficulty growing with nitrogen sources other than ammonia. Two models have been proposed to account for this inability to grow. One model postulated an imbalance between glutamine synthesis and glutamine degradation that led to a repression of the Ntr system and the subsequent failure to activate transcription of genes required for the use of alternative nitrogen sources. The other model postulated that mutations in gltB or gltD (which encode the subunits of GOGAT) were polar on a downstream gene, gltF, which is necessary for proper activation of gene expression by the Ntr system. The data reported here show that the gltF model is incorrect for three reasons: first, a nonpolar gltB and a polar gltD mutation of K. aerogenes both show the same phenotype; second, K. aerogenes and several other enteric bacteria lack a gene homologous to gltF; and third, mutants of E. coli whose gltF gene has been deleted show no defect in nitrogen metabolism. The argument that accumulated glutamine represses the Ntr system in gltB or gltD mutants is also incorrect, because these mutants can derepress the Ntr system normally so long as sufficient glutamate is supplied. Thus, we conclude that gltB or gltD mutants grow slowly on many poor nitrogen sources because they are starved for glutamate. Much of the glutamate formed by catabolism of alternative nitrogen sources is converted to glutamine, which cannot be efficiently converted to glutamate in the absence of GOGAT activity. Finally, GOGAT-deficient E. coli cells growing with glutamine as the sole nitrogen source increase their synthesis of the other glutamate-forming enzyme, glutamate dehydrogenase, severalfold, but this is still insufficient to allow rapid growth under these conditions.
Collapse
Affiliation(s)
- T J Goss
- Department of Biology, The University of Michigan, Ann Arbor 48109-1048, USA
| | | | | |
Collapse
|
76
|
McClelland M, Sanderson KE, Spieth J, Clifton SW, Latreille P, Courtney L, Porwollik S, Ali J, Dante M, Du F, Hou S, Layman D, Leonard S, Nguyen C, Scott K, Holmes A, Grewal N, Mulvaney E, Ryan E, Sun H, Florea L, Miller W, Stoneking T, Nhan M, Waterston R, Wilson RK. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature 2001; 413:852-6. [PMID: 11677609 DOI: 10.1038/35101614] [Citation(s) in RCA: 1414] [Impact Index Per Article: 61.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Salmonella enterica subspecies I, serovar Typhimurium (S. typhimurium), is a leading cause of human gastroenteritis, and is used as a mouse model of human typhoid fever. The incidence of non-typhoid salmonellosis is increasing worldwide, causing millions of infections and many deaths in the human population each year. Here we sequenced the 4,857-kilobase (kb) chromosome and 94-kb virulence plasmid of S. typhimurium strain LT2. The distribution of close homologues of S. typhimurium LT2 genes in eight related enterobacteria was determined using previously completed genomes of three related bacteria, sample sequencing of both S. enterica serovar Paratyphi A (S. paratyphi A) and Klebsiella pneumoniae, and hybridization of three unsequenced genomes to a microarray of S. typhimurium LT2 genes. Lateral transfer of genes is frequent, with 11% of the S. typhimurium LT2 genes missing from S. enterica serovar Typhi (S. typhi), and 29% missing from Escherichia coli K12. The 352 gene homologues of S. typhimurium LT2 confined to subspecies I of S. enterica-containing most mammalian and bird pathogens-are useful for studies of epidemiology, host specificity and pathogenesis. Most of these homologues were previously unknown, and 50 may be exported to the periplasm or outer membrane, rendering them accessible as therapeutic or vaccine targets.
Collapse
Affiliation(s)
- M McClelland
- Sidney Kimmel Cancer Center, 10835 Altman Row, San Diego, California 92121, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
77
|
Fitch W, Brisse S, Stevens J, Tibayrenc M. Infectious diseases and the golden age of phylogenetics: an E-debate. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2001; 1:69-74. [PMID: 12798052 DOI: 10.1016/s1567-1348(01)00010-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- W Fitch
- Department of Ecology and Evolutionary Biology, UC Irvine, Irvine, CA 92717, USA
| | | | | | | |
Collapse
|
78
|
Abstract
We have now complete genome sequences of several pairs of closely related prokaryotes (conspecific strains or congeneric species). Surprisingly, even strains of the same species can differ by as much as 20% in gene content. Conceptual and methodological approaches for dealing with such diversity are now being developed, and should transform microbial genomics.
Collapse
Affiliation(s)
- Y Boucher
- Canadian Institute for Advanced Research Program in Evolutionary Biology, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia BSH 4H7, Canada
| | | | | |
Collapse
|
79
|
Abstract
Traditionally, evolutionary biologists have viewed mutations within individual genes as the major source of phenotypic variation leading to adaptation through natural selection, and ultimately generating diversity among species. Although such processes must contribute to the initial development of gene functions and their subsequent fine-tuning, changes in genome repertoire, occurring through gene acquisition and deletion, are the major events underlying the emergence and evolution of bacterial pathogens and symbionts. Furthermore, pathogens and symbionts depend on similar mechanisms for interacting with hosts and show parallel trends in genome evolution.
Collapse
Affiliation(s)
- H Ochman
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85718, USA.
| | | |
Collapse
|
80
|
Current Awareness on Comparative and Functional Genomics. Comp Funct Genomics 2001. [PMCID: PMC2447213 DOI: 10.1002/cfg.58] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
|