251
|
Roberts F, Jennison AV, Verma NK. The Shigella flexneri serotype Y vaccine candidate SFL124 originated from a serotype 2a background. ACTA ACUST UNITED AC 2005; 45:285-9. [PMID: 15963704 DOI: 10.1016/j.femsim.2005.05.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2005] [Accepted: 05/04/2005] [Indexed: 11/16/2022]
Abstract
Shigella flexneri is endemic in most developing countries and responsible for the highest mortality rate among the Shigella species. The attenuated serotype Y S. flexneri strain SFL124 has been used as the parental strain for the development of recombinant vaccines expressing multiple O-antigen structures. During the development of one such multivalent vaccine, a region of gtrII homology was found in SFL124. Sequencing and analysis of this region revealed the presence of an insertion element interrupted serotype 2a serotype-conversion locus in the serotype Y vaccine strain SFL124. The data presented suggests that SFL124 has derived from a serotype 2a background.
Collapse
Affiliation(s)
- Fleur Roberts
- Faculty of Science, School of Biochemistry and Molecular Biology, The Australian National University, Canberra, ACT 0200, Australia
| | | | | |
Collapse
|
252
|
Nielsen P, Krogh A. Large-scale prokaryotic gene prediction and comparison to genome annotation. Bioinformatics 2005; 21:4322-9. [PMID: 16249266 DOI: 10.1093/bioinformatics/bti701] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Prokaryotic genomes are sequenced and annotated at an increasing rate. The methods of annotation vary between sequencing groups. It makes genome comparison difficult and may lead to propagation of errors when questionable assignments are adapted from one genome to another. Genome comparison either on a large or small scale would be facilitated by using a single standard for annotation, which incorporates a transparency of why an open reading frame (ORF) is considered to be a gene. RESULTS A total of 143 prokaryotic genomes were scored with an updated version of the prokaryotic genefinder EasyGene. Comparison of the GenBank and RefSeq annotations with the EasyGene predictions reveals that in some genomes up to approximately 60% of the genes may have been annotated with a wrong start codon, especially in the GC-rich genomes. The fractional difference between annotated and predicted confirms that too many short genes are annotated in numerous organisms. Furthermore, genes might be missing in the annotation of some of the genomes. We predict 41 of 143 genomes to be over-annotated by >5%, meaning that too many ORFs are annotated as genes. We also predict that 12 of 143 genomes are under-annotated. These results are based on the difference between the number of annotated genes not found by EasyGene and the number of predicted genes that are not annotated in GenBank. We argue that the average performance of our standardized and fully automated method is slightly better than the annotation.
Collapse
Affiliation(s)
- Pernille Nielsen
- Bioinformatics Centre, Institute of Molecular Biology and Physiology, University of Copenhagen Universitetsparken 15, 2100 Copenhagen, Denmark.
| | | |
Collapse
|
253
|
Dagan T, Blekhman R, Graur D. The “Domino Theory” of Gene Death: Gradual and Mass Gene Extinction Events in Three Lineages of Obligate Symbiotic Bacterial Pathogens. Mol Biol Evol 2005; 23:310-6. [PMID: 16237210 DOI: 10.1093/molbev/msj036] [Citation(s) in RCA: 57] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
During the adaptation of an organism to a parasitic lifestyle, various gene functions may be rendered superfluous due to the fact that the host may supply these needs. As a consequence, obligate symbiotic bacterial pathogens tend to undergo reductive genomic evolution through gene death (nonfunctionalization or pseudogenization) and deletion. Here, we examine the evolutionary sequence of gene-death events during the process of genome miniaturization in three bacterial species that have experienced extensive genome reduction: Mycobacterium leprae, Shigella flexneri, and Salmonella typhi. We infer that in all three lineages, the distribution of functional categories is similar in pseudogenes and genes but different from that of absent genes. Based on an analysis of evolutionary distances, we propose a two-step "domino effect" model for reductive genome evolution. The process starts with a gradual gene-by-gene-death sequence of events. Eventually, a crucial gene within a complex pathway or network is rendered nonfunctional triggering a "mass gene extinction" of the dependent genes. In contrast to published reports according to which genes belonging to certain functional categories are prone to nonfunctionalization more frequently and earlier than genes belonging to other functional categories, we could discern no characteristic regularity in the temporal order of function loss.
Collapse
Affiliation(s)
- Tal Dagan
- Institüt für Botanik III, Heinrich-Heine Universität Düsseldorf, Düsseldorf, Germany
| | | | | |
Collapse
|
254
|
Reva ON, Tümmler B. Differentiation of regions with atypical oligonucleotide composition in bacterial genomes. BMC Bioinformatics 2005; 6:251. [PMID: 16225667 PMCID: PMC1274298 DOI: 10.1186/1471-2105-6-251] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2005] [Accepted: 10/14/2005] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Complete sequencing of bacterial genomes has become a common technique of present day microbiology. Thereafter, data mining in the complete sequence is an essential step. New in silico methods are needed that rapidly identify the major features of genome organization and facilitate the prediction of the functional class of ORFs. We tested the usefulness of local oligonucleotide usage (OU) patterns to recognize and differentiate types of atypical oligonucleotide composition in DNA sequences of bacterial genomes. RESULTS A total of 163 bacterial genomes of eubacteria and archaea published in the NCBI database were analyzed. Local OU patterns exhibit substantial intrachromosomal variation in bacteria. Loci with alternative OU patterns were parts of horizontally acquired gene islands or ancient regions such as genes for ribosomal proteins and RNAs. OU statistical parameters, such as local pattern deviation (D), pattern skew (PS) and OU variance (OUV) enabled the detection and visualization of gene islands of different functional classes. CONCLUSION A set of approaches has been designed for the statistical analysis of nucleotide sequences of bacterial genomes. These methods are useful for the visualization and differentiation of regions with atypical oligonucleotide composition prior to or accompanying gene annotation.
Collapse
Affiliation(s)
- Oleg N Reva
- Klinische Forschergruppe, OE6711, Medizinische Hochschule Hannover, Carl-Neuberg-Strasse 1, D-30625 Hannover, Germany
- Danylo Zabolotny Institute of Microbiology and Virology of the National Academy of Science of Ukraine, Dep. of Antibiotics, 154 Zabolotnogo Str., D03680, Kyiv GSP, Ukraine
| | - Burkhard Tümmler
- Klinische Forschergruppe, OE6711, Medizinische Hochschule Hannover, Carl-Neuberg-Strasse 1, D-30625 Hannover, Germany
| |
Collapse
|
255
|
Li B, Brown EW, D'Agostino C, LeClerc JE, Cebula TA. Structure and distribution of the phosphoprotein phosphatase genes, prpA and prpB, among Shigella subgroups. Microbiology (Reading) 2005; 151:2671-2683. [PMID: 16079345 DOI: 10.1099/mic.0.27990-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Phosphoprotein phosphatases encoded by theprpAandprpBgenes function in signal transduction pathways for degradation of misfolded proteins in the extracytoplasmic compartments ofEscherichia coli. In order to trace the evolution ofprpgenes and assess their roles in other enteric pathogens, the structure and distribution of these genes among closely related Shigella subgroups were studied. PCR amplification, probe hybridization studies and DNA sequencing were used to determine theprpgenotypes of 58 strains from the four Shigella subgroups, Dysenteriae, Boydii, Sonnei and Flexneri. It was found that theprpalleles among Shigella subgroups were extremely susceptible to gene inactivation and that the mutations involved inprpallele inactivation were varied. They included IS insertions, gene replacement by an IS element, a small deletion within the gene or large deletion engulfing the entire gene region, and base substitutions that generated premature termination codons. As a result, of 58 strains studied, only eight (14 %) possessed intactprpAandprpBgenes. Of the Shigella strains examined, 76 % (44/58) showed at least one of theprpalleles inactivated by one or more IS elements, including IS1, IS4, IS600and IS629. Phylogenetic analysis revealed that IS elements have been independently acquired in multiple lineages of Shigella, suggesting that loss of functional alleles has been advantageous during Shigella strain evolution.
Collapse
Affiliation(s)
- Baoguang Li
- Division of Molecular Biology, Center for Food Safety and Applied Nutrition, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA
| | - Eric W Brown
- Division of Molecular Biology, Center for Food Safety and Applied Nutrition, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA
| | - Christine D'Agostino
- Division of Molecular Biology, Center for Food Safety and Applied Nutrition, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA
| | - J Eugene LeClerc
- Division of Molecular Biology, Center for Food Safety and Applied Nutrition, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA
| | - Thomas A Cebula
- Division of Molecular Biology, Center for Food Safety and Applied Nutrition, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA
| |
Collapse
|
256
|
Abstract
Pseudogenes are nonfunctional DNA sequences that can accumulate in the genomes of some bacterial species, especially those undergoing processes like niche change, host specialization, or weak selection strength. They may last for long evolutionary periods, opening the question of how the genome prevents expression of these degenerated or disrupted genes that would presumably give rise to malfunctioning proteins. We have investigated ribosomal binding strength at Shine-Dalgarno sequences and the prevalence of sigma70 promoter regions in pseudogenes across bacteria. It is reported that the RNA polymerase-binding sites and more strongly the ribosome-binding regions of pseudogenes are highly degraded, suggesting that transcription and translation are impaired in nonfunctional open reading frames. This would reduce the metabolic investment on faulty proteins because although pseudogenes can persist for long time periods, they would be effectively silenced. It is unclear whether mutation accumulation on regulatory regions is neutral or whether it is accelerated by selection.
Collapse
|
257
|
Chiapello H, Bourgait I, Sourivong F, Heuclin G, Gendrault-Jacquemard A, Petit MA, El Karoui M. Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops. BMC Bioinformatics 2005; 6:171. [PMID: 16011797 PMCID: PMC1187871 DOI: 10.1186/1471-2105-6-171] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2005] [Accepted: 07/12/2005] [Indexed: 11/23/2022] Open
Abstract
Background Public databases now contain multitude of complete bacterial genomes, including several genomes of the same species. The available data offers new opportunities to address questions about bacterial genome evolution, a task that requires reliable fine comparison data of closely related genomes. Recent analyses have shown, using pairwise whole genome alignments, that it is possible to segment bacterial genomes into a common conserved backbone and strain-specific sequences called loops. Results Here, we generalize this approach and propose a strategy that allows systematic and non-biased genome segmentation based on multiple genome alignments. Segmentation analyses, as applied to 13 different bacterial species, confirmed the feasibility of our approach to discern the 'mosaic' organization of bacterial genomes. Segmentation results are available through a Web interface permitting functional analysis, extraction and visualization of the backbone/loops structure of documented genomes. To illustrate the potential of this approach, we performed a precise analysis of the mosaic organization of three E. coli strains and functional characterization of the loops. Conclusion The segmentation results including the backbone/loops structure of 13 bacterial species genomes are new and available for use by the scientific community at the URL: .
Collapse
Affiliation(s)
- H Chiapello
- Mathématique, Informatique & Génome, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - I Bourgait
- Mathématique, Informatique & Génome, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - F Sourivong
- Mathématique, Informatique & Génome, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - G Heuclin
- Mathématique, Informatique & Génome, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - A Gendrault-Jacquemard
- Mathématique, Informatique & Génome, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - M-A Petit
- Unité de Recherches Laitières et Génétique Appliquée, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| | - M El Karoui
- Unité de Recherches Laitières et Génétique Appliquée, INRA Domaine de Vilvert, 78352 Jouy-en-Josas cedex, France
| |
Collapse
|
258
|
Tominaga A, Lan R, Reeves PR. Evolutionary changes of the flhDC flagellar master operon in Shigella strains. J Bacteriol 2005; 187:4295-302. [PMID: 15937193 PMCID: PMC1151726 DOI: 10.1128/jb.187.12.4295-4302.2005] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Shigella strains are nonmotile. The master operon of flagellar synthesis, flhDC, was analyzed for genetic damage in 46 Shigella strains representing all known serotypes. In 11 strains (B1, B3, B6, B8, B10, B18, D5, F1B, D10, F3A, and F3C) the flhDC operon was completely deleted. PCR and sequence analysis of the flhDC region of the remaining 35 strains revealed many insertions or deletions associated with insertion sequences, and the majority of the strains were found to be defective in their flhDC genes. As these genes also play a role in regulation of non-flagellar genes, the loss may have other consequences or be driven by selection pressures other than those against flagellar motility. It has been suggested that Shigella strains fall mostly into three clusters within Escherichia coli, with five outlier strains, four of which are also within E. coli (G. M. Pupo, R. Lan, and P. R. Reeves, Proc. Natl. Acad. Sci. USA 97:10567-10572, 2000). The distribution of genetic changes in the flhDC region correlated very well with the three clusters and outlier strains found using housekeeping gene DNA sequences, enabling us to follow the sequence of mutational change in the flhDC locus. Two cluster 2 strains were found to have unique flhDC sequences, which are most probably due to recombination during the exchange of the adjacent O-antigen gene clusters.
Collapse
Affiliation(s)
- Akira Tominaga
- School of Molecular and Microbial Biosciences (GO8), The University of Sydney, Sydney, NSW 2006, Australia
| | | | | |
Collapse
|
259
|
Kothapalli S, Nair S, Alokam S, Pang T, Khakhria R, Woodward D, Johnson W, Stocker BAD, Sanderson KE, Liu SL. Diversity of genome structure in Salmonella enterica serovar Typhi populations. J Bacteriol 2005; 187:2638-50. [PMID: 15805510 PMCID: PMC1070368 DOI: 10.1128/jb.187.8.2638-2650.2005] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The genomes of most strains of Salmonella and Escherichia coli are highly conserved. In contrast, all 136 wild-type strains of Salmonella enterica serovar Typhi analyzed by partial digestion with I-CeuI (an endonuclease which cuts within the rrn operons) and pulsed-field gel electrophoresis and by PCR have rearrangements due to homologous recombination between the rrn operons leading to inversions and translocations. Recombination between rrn operons in culture is known to be equally frequent in S. enterica serovar Typhi and S. enterica serovar Typhimurium; thus, the recombinants in S. enterica serovar Typhi, but not those in S. enterica serovar Typhimurium, are able to survive in nature. However, even in S. enterica serovar Typhi the need for genome balance and the need for gene dosage impose limits on rearrangements. Of 100 strains of genome types 1 to 6, 72 were only 25.5 kb off genome balance (the relative lengths of the replichores during bidirectional replication from oriC to the termination of replication [Ter]), while 28 strains were less balanced (41 kb off balance), indicating that the survival of the best-balanced strains was greater. In addition, the need for appropriate gene dosage apparently selected against rearrangements which moved genes from their accustomed distance from oriC. Although rearrangements involving the seven rrn operons are very common in S. enterica serovar Typhi, other duplicated regions, such as the 25 IS200 elements, are very rarely involved in rearrangements. Large deletions and insertions in the genome are uncommon, except for deletions of Salmonella pathogenicity island 7 (usually 134 kb) from fragment I-CeuI-G and 40-kb insertions, possibly a prophage, in fragment I-CeuI-E. The phage types were determined, and the origins of the phage types appeared to be independent of the origins of the genome types.
Collapse
Affiliation(s)
- Sushma Kothapalli
- Department of Biological Sciences, University of Calgary, Calgary T2N 1N4, Canada
| | | | | | | | | | | | | | | | | | | |
Collapse
|
260
|
Kuo MMC, Haynes WJ, Loukin SH, Kung C, Saimi Y. Prokaryotic K(+) channels: from crystal structures to diversity. FEMS Microbiol Rev 2005; 29:961-85. [PMID: 16026885 DOI: 10.1016/j.femsre.2005.03.003] [Citation(s) in RCA: 88] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2004] [Revised: 03/21/2005] [Accepted: 03/23/2005] [Indexed: 10/25/2022] Open
Abstract
The deep roots and wide branches of the K(+)-channel family are evident from genome surveys and laboratory experimentation. K(+)-channel genes are widespread and found in nearly all the free-living bacteria, archaea and eukarya. The conservation of basic structures and mechanisms such as the K(+) filter, the gate, and some of the gate's regulatory domains have allowed general insights on animal K(+) channels to be gained from crystal structures of prokaryotic channels. Since microbes are the great majority of life's diversity, it is not surprising that microbial genomes reveal structural motifs beyond those found in animals. There are open-reading frames that encode K(+)-channel subunits with unconventional filter sequences, or regulatory domains of different sizes and numbers not previously known. Parasitic or symbiotic bacteria tend not to have K(+) channels, while those showing lifestyle versatility often have more than one K(+)-channel gene. It is speculated that prokaryotic K(+) channels function to allow adaptation to environmental and metabolic changes, although the actual roles of these channels in prokaryotes are not yet known. Unlike enzymes in basic metabolism, K(+) channel, though evolved early, appear to play more diverse roles than revealed by animal research. Finding and sorting out these roles will be the goal and challenge of the near future.
Collapse
Affiliation(s)
- Mario M-C Kuo
- Laboratory of Molecular Biology, University of Wisconsin, 1525 Linden Drive, Madison, WI 53706, USA
| | | | | | | | | |
Collapse
|
261
|
Gall TL, Mavris M, Martino MC, Bernardini ML, Denamur E, Parsot C. Analysis of virulence plasmid gene expression defines three classes of effectors in the type III secretion system of Shigella flexneri. MICROBIOLOGY-SGM 2005; 151:951-962. [PMID: 15758240 DOI: 10.1099/mic.0.27639-0] [Citation(s) in RCA: 102] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Proteins directly involved in entry and dissemination of Shigella flexneri into epithelial cells are encoded by a virulence plasmid of 200 kb. A 30-kb region (designated the entry region) of this plasmid encodes components of a type III secretion (TTS) apparatus, substrates of this apparatus and their dedicated chaperones. During growth of bacteria in broth, expression of these genes is induced at 37 degrees C and the TTS apparatus is assembled in the bacterial envelope but is not active. Secretion is activated upon contact of bacteria with host cells and is deregulated in an ipaB mutant. The plasmid encodes four transcriptional regulators, VirF, VirB, MxiE and Orf81. VirF controls transcription of virB, whose product is required for transcription of entry region genes. MxiE, with the chaperone IpgC acting as a co-activator, controls expression of several effectors that are induced under conditions of secretion. Genes under the control of Orf81 are not known. The aim of this study was to define further the repertoires of virulence plasmid genes that are under the control of (i) the growth temperature, (ii) each of the known virulence plasmid-encoded transcriptional regulators (VirF, VirB, MxiE and Orf81) and (iii) the activity of the TTS apparatus. Using a macroarray analysis, the expression profiles of 71 plasmid genes were compared in the wild-type strain grown at 37 and 30 degrees C and in virF, virB, mxiE, ipaB, ipaB mxiE and orf81 mutants grown at 37 degrees C. Many genes were found to be under the control of VirB and indirectly of VirF. No alteration of expression of any gene was detected in the orf81 mutant. Expression of 13 genes was increased in the secretion-deregulated ipaB mutant in an MxiE-dependent manner. On the basis of their expression profile, substrates of the TTS apparatus can be classified into three categories: (i) those that are controlled by VirB, (ii) those that are controlled by MxiE and (iii) those that are controlled by both VirB and MxiE. The differential regulation of expression of TTS effectors in response to the TTS apparatus activity suggests that different effectors might be required at different times following contact of bacteria with host cells.
Collapse
Affiliation(s)
- Tony Le Gall
- Laboratoire de Bactériologie, Faculté de Médecine de Brest, F-29285 Brest Cedex, France
- INSERM E0339, Faculté de Médecine Xavier Bichat, 16 rue Henri Huchard, F-75018 Paris, France
| | - Maria Mavris
- Unité de Pathogénie Microbienne Moléculaire, INSERM U389, Institut Pasteur, 25 rue du Dr Roux, F-75724 Paris Cedex 15, France
| | - Maria Celeste Martino
- University Roma La Sapienza, Sez Sci Microbiol, Dipartimento Biol Cellulare & Sviluppo, Via Sardi 70, Rome, I-00185 Italy
| | - Maria Lina Bernardini
- University Roma La Sapienza, Sez Sci Microbiol, Dipartimento Biol Cellulare & Sviluppo, Via Sardi 70, Rome, I-00185 Italy
| | - Erick Denamur
- INSERM E0339, Faculté de Médecine Xavier Bichat, 16 rue Henri Huchard, F-75018 Paris, France
| | - Claude Parsot
- Unité de Pathogénie Microbienne Moléculaire, INSERM U389, Institut Pasteur, 25 rue du Dr Roux, F-75724 Paris Cedex 15, France
| |
Collapse
|
262
|
Abstract
Pseudogenes are now known to be a regular feature of bacterial genomes and are found in particularly high numbers within the genomes of recently emerged bacterial pathogens. As most pseudogenes are recognized by sequence alignments, we use newly available genomic sequences to identify the pseudogenes in 11 genomes from 4 bacterial genera, each of which contains at least 1 human pathogen. The numbers of pseudogenes range from 27 in Staphylococcus aureus MW2 to 337 in Yersinia pestis CO92 (e.g. 1–8% of the annotated genes in the genome). Most pseudogenes are formed by small frameshifting indels, but because stop codons are A + T-rich, the two low-G + C Gram-positive taxa (Streptococcus and Staphylococcus) have relatively high fractions of pseudogenes generated by nonsense mutations when compared with more G + C-rich genomes. Over half of the pseudogenes are produced from genes whose original functions were annotated as ‘hypothetical’ or ‘unknown’; however, several broadly distributed genes involved in nucleotide processing, repair or replication have become pseudogenes in one of the sequenced Vibrio vulnificus genomes. Although many of our comparisons involved closely related strains with broadly overlapping gene inventories, each genome contains a largely unique set of pseudogenes, suggesting that pseudogenes are formed and eliminated relatively rapidly from most bacterial genomes.
Collapse
Affiliation(s)
| | - Howard Ochman
- Department of Biochemistry and Molecular Biophysics, University of ArizonaTucson, AZ 87521, USA
- To whom correspondence should be addressed. Tel: +1 520 626 8355; Fax: +1 520 621 3709;
| |
Collapse
|
263
|
Le Gall T, Darlu P, Escobar-Páramo P, Picard B, Denamur E. Selection-driven transcriptome polymorphism in Escherichia coli/Shigella species. Genome Res 2005; 15:260-8. [PMID: 15687289 PMCID: PMC546527 DOI: 10.1101/gr.2405905] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
To explore the role of transcriptome polymorphism in adaptation of organisms to their environment, we evaluated this parameter for the Escherichia coli/Shigella bacterial species, which is composed of well-characterized phylogenetic groups that exhibit characteristic life styles ranging from commensalism to intracellular pathogenicity. Both the genomic content and the transcriptome of 10 strains representative of the major E. coli/Shigella phylogenetic groups were evaluated using macroarrays displaying the 4290 K12-MG1655 open reading frames (ORFs). Although Shigella and enteroinvasive E. coli (EIEC) are not monophyletic, phylogenetic analysis of the binary coded (presence/absence) gene content data showed that these organisms group together due to similar patterns of undetectable K12-MG1655 genes. The variation in transcript abundance was then analyzed using a core genome of 2880 genes present in all strains, after adjusting RNA hybridization signals for DNA hybridization signals. Nonrandom changes in gene expression during the evolution of the E. coli/Shigella species were evidenced. Phylogenetic analysis of transcriptome data again showed that Shigella and EIEC strains group together in terms of gene expression, and this convergence involved groups of genes displaying biologically coherent patterns of functional divergence. Unlike the other E. coli strains evaluated, Shigella and EIEC are intracellular pathogens, and therefore face similar selective pressures. Thus, within the E. coli/Shigella species, strains exhibiting a particular life style have converged toward a specific gene expression pattern in a subset of genes common to the species, revealing the role of selection in shaping transcriptome polymorphism.
Collapse
Affiliation(s)
- Tony Le Gall
- Institut National de la Santé et de la Recherche Médicale (INSERM) E0339, Faculté de Médecine Xavier Bichat, 75018, Paris, France
| | | | | | | | | |
Collapse
|
264
|
Chiu CM, Thomas CM. Evidence for past integration of IncP-1 plasmids into bacterial chromosomes. FEMS Microbiol Lett 2005; 241:163-9. [PMID: 15598528 DOI: 10.1016/j.femsle.2004.10.016] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2004] [Revised: 10/05/2004] [Accepted: 10/07/2004] [Indexed: 10/26/2022] Open
Abstract
Plasmids of the IncP-1 incompatibility group are self-transmissible between and stably maintained in a very broad range of Gram-negative bacteria. A characteristic feature of IncP-1 genomes is the existence of multiple binding sites (OB) for the KorB protein which plays a dual role in active partitioning of plasmid and coordinate regulation of expression of genes for replication, maintenance and transfer. A search of the available bacterial genome sequences revealed a significant number (70 out of 322) with one or more putative KorB binding sites. Binding of KorB to such a site was demonstrated by chromatin immunoprecipitation (ChIP) for Pseudomonas putida KT2440. While such a site may arise by chance, this is unlikely for Pseudomonas aeruginosa UCBPP-PA14 whose genome sequence contains four clustered OB sites and several regions have more than 80% nucleotide identity to traJ, trbJ and trbL of IncP-1 plasmids. A number of other bacterial genomes also contain integrated partial IncP-1 genomes or their remnants. These data provide evidence for multiple past integration events of IncP-1 plasmids into bacterial chromosomes and provide new evidence for IncP-1 plasmids being important elements in gene mobility.
Collapse
Affiliation(s)
- Chung-Min Chiu
- School of Bioscience, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK
| | | |
Collapse
|
265
|
Nhieu GTV, Enninga J, Sansonetti P, Grompone G. Tyrosine kinase signaling and type III effectors orchestrating Shigella invasion. Curr Opin Microbiol 2005; 8:16-20. [PMID: 15694852 DOI: 10.1016/j.mib.2004.12.006] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Upon epithelial cell contact, Shigella type III effectors activate complex signaling pathways that induce localized membrane ruffling, resulting in Shigella invasion. Bacterial induced membrane ruffles require a timely coordination of cytoskeletal processes, including actin polymerization, filament reorganization and depolymerization, orchestrated by Rho GTPases and tyrosine kinases. An emerging concept is that multiple Shigella effectors act in synergy to promote actin polymerization in membrane extensions at the site of bacterial entry. Recent advances point to the role of Abl/Arg and Src tyrosine kinases as key regulators of bacterial induced cytoskeletal dynamics.
Collapse
Affiliation(s)
- Guy Tran Van Nhieu
- Unité de Pathogénie Microbienne Moléculaire. Inserm U389. Institut Pasteur. 28, rue du Dr. Roux, 75724 Paris Cedex 15, France.
| | | | | | | |
Collapse
|
266
|
Korbel JO, Doerks T, Jensen LJ, Perez-Iratxeta C, Kaczanowski S, Hooper SD, Andrade MA, Bork P. Systematic association of genes to phenotypes by genome and literature mining. PLoS Biol 2005; 3:e134. [PMID: 15799710 PMCID: PMC1073694 DOI: 10.1371/journal.pbio.0030134] [Citation(s) in RCA: 105] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2004] [Accepted: 02/02/2005] [Indexed: 11/23/2022] Open
Abstract
One of the major challenges of functional genomics is to unravel the connection between genotype and phenotype. So far no global analysis has attempted to explore those connections in the light of the large phenotypic variability seen in nature. Here, we use an unsupervised, systematic approach for associating genes and phenotypic characteristics that combines literature mining with comparative genome analysis. We first mine the MEDLINE literature database for terms that reflect phenotypic similarities of species. Subsequently we predict the likely genomic determinants: genes specifically present in the respective genomes. In a global analysis involving 92 prokaryotic genomes we retrieve 323 clusters containing a total of 2,700 significant gene–phenotype associations. Some clusters contain mostly known relationships, such as genes involved in motility or plant degradation, often with additional hypothetical proteins associated with those phenotypes. Other clusters comprise unexpected associations; for example, a group of terms related to food and spoilage is linked to genes predicted to be involved in bacterial food poisoning. Among the clusters, we observe an enrichment of pathogenicity-related associations, suggesting that the approach reveals many novel genes likely to play a role in infectious diseases. The combination of text mining and comparative genomics is shown to be a powerful approach to predicting phenotypes that are associated with particular genes in bacterial genomes
Collapse
Affiliation(s)
- Jan O Korbel
- 1European Molecular Biology LaboratoryHeidelbergGermany
| | - Tobias Doerks
- 1European Molecular Biology LaboratoryHeidelbergGermany
| | - Lars J Jensen
- 1European Molecular Biology LaboratoryHeidelbergGermany
- 2Max Delbrück Center for Molecular MedicineBerlin-BuchGermany
| | | | - Szymon Kaczanowski
- 4Institute of Biochemistry and Biophysics, Polish Academy of SciencesWarsawPoland
| | - Sean D Hooper
- 1European Molecular Biology LaboratoryHeidelbergGermany
| | - Miguel A Andrade
- 3Ontario Genomics Innovation Centre, Ottawa Health Research InstituteOttawaCanada
| | - Peer Bork
- 1European Molecular Biology LaboratoryHeidelbergGermany
- 2Max Delbrück Center for Molecular MedicineBerlin-BuchGermany
| |
Collapse
|
267
|
Runyen-Janecky LJ, Boyle AM, Kizzee A, Liefer L, Payne SM. Role of the Pst system in plaque formation by the intracellular pathogen Shigella flexneri. Infect Immun 2005; 73:1404-10. [PMID: 15731038 PMCID: PMC1064976 DOI: 10.1128/iai.73.3.1404-1410.2005] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In response to the host cell environment, the intracellular pathogen Shigella flexneri induces the expression of numerous genes, including those in the pst operon which is predicted to encode a high-affinity phosphate acquisition system that is expressed under reduced phosphate conditions. An S. flexneri pst mutant forms smaller plaques in Henle cell monolayers than does the parental strain. This mutant exhibited normal production and localization of the S. flexneri IcsA protein. The pst mutant had the same growth rate as the parental strain in both phosphate-reduced and phosphate-replete media in vitro and during the first 3 h of growth in Henle cells in vivo. During growth in phosphate-replete media, the PhoB regulon was constitutively expressed in the pst mutant but not the parental strain. This suggested that the inability of the S. flexneri pst mutant to form wild-type plaques in Henle cell monolayers may be due to aberrant expression of the PhoB regulon. A mutation in phoB was constructed in the S. flexneri pst mutant, and the phoB mutation suppressed the small plaque phenotype of the pst mutant. Additionally, a specific mutation (R220Q) was constructed in the pstA gene of the pst operon that was predicted to eliminate Pst-mediated phosphate transport but allow normal PhoB-regulated gene expression, based on the phenotype of an Escherichia coli strain harboring the same mutation. Addition of this pstA(R220Q) mutation to a S. flexneri pst mutant, as part of the pst operon, restored normal plaque formation and regulation of phoA expression.
Collapse
|
268
|
Szczepanowski R, Braun S, Riedel V, Schneiker S, Krahn I, Pühler A, Schlüter A. The 120 592 bp IncF plasmid pRSB107 isolated from a sewage-treatment plant encodes nine different antibiotic-resistance determinants, two iron-acquisition systems and other putative virulence-associated functions. Microbiology (Reading) 2005; 151:1095-1111. [PMID: 15817778 DOI: 10.1099/mic.0.27773-0] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The antibiotic-multiresistance IncF plasmid pRSB107 was isolated by a transformation-based approach from activated-sludge bacteria of a wastewater-treatment plant. It confers resistance to ampicillin, penicillin G, chloramphenicol, erythromycin, kanamycin, neomycin, streptomycin, sulfonamides, tetracycline and trimethoprim and against mercuric ions. Complete sequencing of this plasmid revealed that it is 120 592 bp in size and has a G+C content of 53.1 mol%. The plasmid backbone is composed of three replicons, RepFIA, RepFIB and RepFII, which are almost identical to corresponding regions located on the F-plasmid and on R100. The three replicons encode replication initiation (rep) and replication control, multimer resolution (mrs), post-segregational killing of plasmid-free cells (psk) and active plasmid partitioning (sopABC locus). Part of the F-leading region and remnants of the F-homologous DNA-transfer (tra) module complete the pRSB107 backbone. Plasmid pRSB107 contains a complex, highly mosaic 35 991 bp antibiotic-resistance region consisting of a Tn21- and a Tn10-derivative and a chloramphenicol-resistance module. The Tn21 derivative is composed of a mercury-resistance region (mer), a Tn4352B-like kanamycin/neomycin-resistance transposon, a streptomycin/sulfonamide-resistance module, remnants of the beta-lactam-resistance transposon Tn1, a macrolide-resistance module flanked by copies of IS26 and IS6100, remnants of Tn402 integrating a class 1 integron and the Tn21-specific transposition module. A truncated version of the tetracycline-resistance transposon Tn10 and the chloramphenicol acetyltransferase gene catA complete the pRSB107 resistance region. In addition to antibiotic resistance, pRSB107 encodes the following putative virulence-associated functions: (i) an aerobactin iron-acquisition siderophore system (iuc/iut); (ii) a putative high-affinity Fe(2+) uptake system which was previously identified on a pathogenicity island of Yersinia pestis and in the genome of the phytopathogen Erwinia carotovora subsp. atroseptica SCRI1043; (iii) an sn-glycerol-3-phosphate transport system (ugp); and (iv) the virulence-associated genes vagCD having a possible function in stable plasmid inheritance. All the accessory modules are framed by insertion sequences, indicating that pRSB107 was gradually assembled by integration of different horizontally acquired DNA segments via transposition or homologous recombination.
Collapse
Affiliation(s)
- R Szczepanowski
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - S Braun
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - V Riedel
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - S Schneiker
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - I Krahn
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - A Pühler
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| | - A Schlüter
- Fakultät für Biologie, Lehrstuhl für Genetik, Universität Bielefeld, Postfach 100131, D-33501 Bielefeld, Germany
| |
Collapse
|
269
|
Affiliation(s)
- J S Hill Gaston
- University of Cambridge, Addenbrooke's Hospital, Cambridge CB2 2QQ.
| |
Collapse
|
270
|
Abstract
Recent advances in DNA-sequencing technologies have made available an enormous resource of data for the study of bacterial genomes. The broad sample of complete genomes currently available allows us to look at variation in the gross features and characteristics of genomes while the detail of the sequences reveal some of the mechanisms by which these genomes evolve. This review aims to describe bacterial genome structures according to current knowledge and proposed hypotheses. We also describe examples where mechanisms of genome evolution have acted in the adaptation of bacterial species to particular niches.
Collapse
Affiliation(s)
- Stephen D Bentley
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire CB10 1SA, UK.
| | | |
Collapse
|
271
|
Belda E, Moya A, Silva FJ. Genome rearrangement distances and gene order phylogeny in gamma-Proteobacteria. Mol Biol Evol 2005; 22:1456-67. [PMID: 15772379 DOI: 10.1093/molbev/msi134] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Genome rearrangements have been studied in 30 gamma-proteobacterial complete genomes by comparing the order of a reduced set of genes on the chromosome. This set included those genes fulfilling several characteristics, the main ones being that an ortholog was present in every genome and that none of them had been acquired by horizontal gene transfer. Genome rearrangement distances were estimated based on either the number of breakpoints or the minimal number of inversions separating two genomes. Breakpoint and inversion distances were highly correlated, indicating that inversions were the main type of rearrangement event in gamma-Proteobacteria. In general, the progressive increase in sequence-based distances between genome pairs was associated with the increase in their rearrangement-based distances but with several groups of distances not following this pattern. Compared with free-living enteric bacteria, the lineages of Pasteurellaceae were evolving, on average, to relatively higher rates of between 2.02 and 1.64, while the endosymbiotic bacterial lineages of Buchnera aphidicola and Wigglesworthia glossinidia were evolving at moderately higher rates of 1.38 and 1.35, respectively. Because we know that the rearrangement rate in the Bu. aphidicola lineage was close to zero during the last 100-150 Myr of evolution, we deduced that a much higher rate took place in the first period of lineage evolution after the divergence of the Escherichia coli lineage. On the other hand, the lineage of the endosymbiont Blochmannia floridanus did present an almost identical rate to free-living enteric bacteria, indicating that the increase in the genome rearrangement rate is not a general change associated with bacterial endosymbiosis. Phylogenetic reconstruction based on rearrangement distances showed a different topology from the one inferred by sequence information. This topology broke the proposed monophyly of the three endosymbiotic lineages and placed Bl. floridanus as a closer relative to E. coli than Yersinia pestis. These results indicate that the phylogeny of these insect endosymbionts is still an open question that will require the development of specific phylogenetic methods to confirm whether the sisterhood of the three endosymbiotic lineages is real or a consequence of a long-branch attraction phenomenon.
Collapse
Affiliation(s)
- Eugeni Belda
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva and Departament de Genètica, Universitat de València, Valencia, Spain
| | | | | |
Collapse
|
272
|
Ren CP, Beatson SA, Parkhill J, Pallen MJ. The Flag-2 locus, an ancestral gene cluster, is potentially associated with a novel flagellar system from Escherichia coli. J Bacteriol 2005; 187:1430-40. [PMID: 15687208 PMCID: PMC545627 DOI: 10.1128/jb.187.4.1430-1440.2005] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli K-12 possesses two adjacent, divergent, promoterless flagellar genes, fhiA-mbhA, that are absent from Salmonella enterica. Through bioinformatics analysis, we found that these genes are remnants of an ancestral 44-gene cluster and are capable of encoding a novel flagellar system, Flag-2. In enteroaggregative E. coli strain 042, there is a frameshift in lfgC that is likely to have inactivated the system in this strain. Tiling path PCR studies showed that the Flag-2 cluster is present in 15 of 72 of the well-characterized ECOR strains. The Flag-2 system resembles the lateral flagellar systems of Aeromonas and Vibrio, particularly in its apparent dependence on RpoN. Unlike the conventional Flag-1 flagellin, the Flag-2 flagellin shows a remarkable lack of sequence polymorphism. The Flag-2 gene cluster encodes a flagellar type III secretion system (including a dedicated flagellar sigma-antisigma combination), thus raising the number of distinct type III secretion systems in Escherichia/Shigella to five. The presence of the Flag-2 cluster at identical sites in E. coli and its close relative Citrobacter rodentium, combined with its absence from S. enterica, suggests that it was acquired by horizontal gene transfer after the former two species diverged from Salmonella. The presence of Flag-2-like gene clusters in Yersinia pestis, Yersinia pseudotuberculosis, and Chromobacterium violaceum suggests that coexistence of two flagellar systems within the same species is more common than previously suspected. The fact that the Flag-2 gene cluster was not discovered in the first 10 Escherichia/Shigella genome sequences studied emphasizes the importance of maintaining an energetic program of genome sequencing for this important taxonomic group.
Collapse
Affiliation(s)
- Chuan-Peng Ren
- Bacterial Pathogenesis and Genomics Unit, Division of Immunity and Infection, Medical School, University of Birmingham, Birmingham, England B15 2TT
| | | | | | | |
Collapse
|
273
|
Winfield MD, Latifi T, Groisman EA. Transcriptional regulation of the 4-amino-4-deoxy-L-arabinose biosynthetic genes in Yersinia pestis. J Biol Chem 2005; 280:14765-72. [PMID: 15710615 DOI: 10.1074/jbc.m413900200] [Citation(s) in RCA: 78] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Inducible membrane remodeling is an adaptive mechanism that enables Gram-negative bacteria to resist killing by cationic antimicrobial peptides and to avoid eliciting an immune response. Addition of 4-amino-4-deoxy-l -arabinose (4-aminoarabinose) moieties to the phosphate residues of the lipid A portion of the lipopolysaccharide decreases the net negative charge of the bacterial membrane resulting in protection from the cationic antimicrobial peptide polymyxin B. In Salmonella enterica serovar Typhimurium, the PmrA/PmrB two-component regulatory system governs resistance to polymyxin B by controlling transcription of the 4-aminoarabinose biosynthetic genes. Transcription of PmrA-activated genes is induced by Fe(3+), which is sensed by PmrA cognate sensor PmrB, and by low Mg(2+), in a mechanism that requires not only the PmrA and PmrB proteins but also the Mg(2+)-responding PhoP/PhoQ system and the PhoP-activated PmrD protein, a post-translational activator of the PmrA protein. Surprisingly, Yersinia pestis can promote PhoP-dependent modification of its lipid A with 4-aminoarabinose despite lacking a PmrD protein. Here we report that Yersinia uses different promoters to transcribe the 4-aminoarabinose biosynthetic genes pbgP and ugd depending on the inducing signal. This is accomplished by the presence of distinct binding sites for the PmrA and PhoP proteins in the promoters of the pbgP and ugd genes. Our results demonstrate that closely related bacterial species may use disparate regulatory pathways to control genes encoding conserved proteins.
Collapse
Affiliation(s)
- Mollie D Winfield
- Department of Molecular Microbiology, Howard Hughes Medical Institute, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | | | | |
Collapse
|
274
|
Dufraigne C, Fertil B, Lespinats S, Giron A, Deschavanne P. Detection and characterization of horizontal transfers in prokaryotes using genomic signature. Nucleic Acids Res 2005; 33:e6. [PMID: 15653627 PMCID: PMC546175 DOI: 10.1093/nar/gni004] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Horizontal DNA transfer is an important factor of evolution and participates in biological diversity. Unfortunately, the location and length of horizontal transfers (HTs) are known for very few species. The usage of short oligonucleotides in a sequence (the so-called genomic signature) has been shown to be species-specific even in DNA fragments as short as 1 kb. The genomic signature is therefore proposed as a tool to detect HTs. Since DNA transfers originate from species with a signature different from those of the recipient species, the analysis of local variations of signature along recipient genome may allow for detecting exogenous DNA. The strategy consists in (i) scanning the genome with a sliding window, and calculating the corresponding local signature (ii) evaluating its deviation from the signature of the whole genome and (iii) looking for similar signatures in a database of genomic signatures. A total of 22 prokaryote genomes are analyzed in this way. It has been observed that atypical regions make up ∼6% of each genome on the average. Most of the claimed HTs as well as new ones are detected. The origin of putative DNA transfers is looked for among ∼12 000 species. Donor species are proposed and sometimes strongly suggested, considering similarity of signatures. Among the species studied, Bacillus subtilis, Haemophilus Influenzae and Escherichia coli are investigated by many authors and give the opportunity to perform a thorough comparison of most of the bioinformatics methods used to detect HTs.
Collapse
Affiliation(s)
| | | | | | | | - Patrick Deschavanne
- To whom correspondence should be addressed. Tel: 33 1 44 27 77 12; Fax: +33 1 43 26 38 30;
| |
Collapse
|
275
|
Bergthorsson U, Roth JR. Natural isolates of Salmonella enterica serovar Dublin carry a single nadA missense mutation. J Bacteriol 2005; 187:400-3. [PMID: 15601727 PMCID: PMC538839 DOI: 10.1128/jb.187.1.400-403.2005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Nicotinic acid is required by most isolates of Salmonella enterica (serovar Dublin), a pathogen of cattle. A single nadA missense mutation causes the nutritional requirement of all serovar Dublin isolates tested. Models for persistence of this allele are tested and discussed.
Collapse
|
276
|
Ou HY, Smith R, Lucchini S, Hinton J, Chaudhuri RR, Pallen M, Barer MR, Rajakumar K. ArrayOme: a program for estimating the sizes of microarray-visualized bacterial genomes. Nucleic Acids Res 2005; 33:e3. [PMID: 15640440 PMCID: PMC546176 DOI: 10.1093/nar/gni005] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
ArrayOme is a new program that calculates the size of genomes represented by microarray-based probes and facilitates recognition of key bacterial strains carrying large numbers of novel genes. Protein-coding sequences (CDS) that are contiguous on annotated reference templates and classified as ‘Present’ in the test strain by hybridization to microarrays are merged into ICs (ICs). These ICs are then extended to account for flanking intergenic sequences. Finally, the lengths of all extended ICs are summated to yield the ‘microarray-visualized genome (MVG)’ size. We tested and validated ArrayOme using both experimental and in silico-generated genomic hybridization data. MVG sizing of five sequenced Escherichia coli and Shigella strains resulted in an accuracy of 97–99%, as compared to true genome sizes, when the comprehensive ShE.coli meta-array gene sequences (6239 CDS) were used for in silico hybridization analysis. However, the E.coli CFT073 genome size was underestimated by 14% as this meta-array lacked probes for many CFT073 CDS. ArrayOme permits rapid recognition of discordances between PFGE-measured genome and MVG sizes, thereby enabling high-throughput identification of strains rich in novel genes. Gene discovery studies focused on these strains will greatly facilitate characterization of the global gene pool accessible to individual bacterial species.
Collapse
Affiliation(s)
- Hong-Yu Ou
- Department of Infection, Immunity and Inflammation, Leicester Medical School, University of LeicesterLeicester LE1 9HN, UK
| | - Rebecca Smith
- Department of Infection, Immunity and Inflammation, Leicester Medical School, University of LeicesterLeicester LE1 9HN, UK
| | - Sacha Lucchini
- Molecular Microbiology Group, Institute of Food ResearchNorwich Research Park, Norwich NR4 7UA, UK
| | - Jay Hinton
- Molecular Microbiology Group, Institute of Food ResearchNorwich Research Park, Norwich NR4 7UA, UK
| | - Roy R. Chaudhuri
- Bacterial Pathogenesis and Genomics Unit, Division of Immunity and Infection, Medical School, University of BirminghamBirmingham B15 2TT, UK
| | - Mark Pallen
- Bacterial Pathogenesis and Genomics Unit, Division of Immunity and Infection, Medical School, University of BirminghamBirmingham B15 2TT, UK
| | - Michael R. Barer
- Department of Infection, Immunity and Inflammation, Leicester Medical School, University of LeicesterLeicester LE1 9HN, UK
- Department of Clinical Microbiology, University Hospitals of Leicester NHS TrustLeicester LE1 5WW, UK
| | - Kumar Rajakumar
- Department of Infection, Immunity and Inflammation, Leicester Medical School, University of LeicesterLeicester LE1 9HN, UK
- Department of Clinical Microbiology, University Hospitals of Leicester NHS TrustLeicester LE1 5WW, UK
- To whom correspondence should be addressed at Department of Infection, Immunity and Inflammation, Leicester Medical School, University of Leicester, Maurice Shock Building, University Road, PO Box 138, Leicester LE1 9HN, UK. Tel: +44 0 116 2231498; Fax: +44 0 116 2525030;
| |
Collapse
|
277
|
Zhou S, Kile A, Bechner M, Place M, Kvikstad E, Deng W, Wei J, Severin J, Runnheim R, Churas C, Forrest D, Dimalanta ET, Lamers C, Burland V, Blattner FR, Schwartz DC. Single-molecule approach to bacterial genomic comparisons via optical mapping. J Bacteriol 2004; 186:7773-82. [PMID: 15516592 PMCID: PMC524920 DOI: 10.1128/jb.186.22.7773-7782.2004] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Modern comparative genomics has been established, in part, by the sequencing and annotation of a broad range of microbial species. To gain further insights, new sequencing efforts are now dealing with the variety of strains or isolates that gives a species definition and range; however, this number vastly outstrips our ability to sequence them. Given the availability of a large number of microbial species, new whole genome approaches must be developed to fully leverage this information at the level of strain diversity that maximize discovery. Here, we describe how optical mapping, a single-molecule system, was used to identify and annotate chromosomal alterations between bacterial strains represented by several species. Since whole-genome optical maps are ordered restriction maps, sequenced strains of Shigella flexneri serotype 2a (2457T and 301), Yersinia pestis (CO 92 and KIM), and Escherichia coli were aligned as maps to identify regions of homology and to further characterize them as possible insertions, deletions, inversions, or translocations. Importantly, an unsequenced Shigella flexneri strain (serotype Y strain AMC[328Y]) was optically mapped and aligned with two sequenced ones to reveal one novel locus implicated in serotype conversion and several other loci containing insertion sequence elements or phage-related gene insertions. Our results suggest that genomic rearrangements and chromosomal breakpoints are readily identified and annotated against a prototypic sequenced strain by using the tools of optical mapping.
Collapse
Affiliation(s)
- Shiguo Zhou
- Laboratory for Molecular and Computation Genomics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
278
|
Winfield MD, Groisman EA. Phenotypic differences between Salmonella and Escherichia coli resulting from the disparate regulation of homologous genes. Proc Natl Acad Sci U S A 2004; 101:17162-7. [PMID: 15569938 PMCID: PMC534605 DOI: 10.1073/pnas.0406038101] [Citation(s) in RCA: 118] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2004] [Indexed: 01/01/2023] Open
Abstract
Phenotypic differences among closely related bacteria have been largely ascribed to species-specific genes, such as those residing in pathogenicity islands. However, we now report that the differential regulation of homologous genes is the mechanism responsible for the divergence of the enteric bacteria Salmonella enterica and Escherichia coli in their ability to make LPS modifications mediating resistance to the antibiotic polymyxin B. In S. enterica serovar Typhimurium, the PmrA/PmrB two-component system governing polymyxin B resistance is induced in low Mg(2+) in a process that requires the PmrD protein and by Fe(3+) in a PmrD-independent fashion. We establish that E. coli K-12 induces PmrA-activated gene transcription and polymyxin B resistance in response to Fe(3+), but that it is blind to the low Mg(2+) signal. The highly divergent PmrD protein is responsible for this phenotype as replacement of the E. coli pmrD gene by its Salmonella counterpart resulted in an E. coli strain that transcribed PmrA-activated genes and displayed polymyxin B resistance under the same conditions as Salmonella. Molecular analysis of natural isolates of E. coli and Salmonella revealed that the PmrD proteins are conserved within each genus and that selection might have driven the divergence between the Salmonella and E. coli PmrD proteins. Investigation of PmrD function demonstrated statistically different distributions for the Salmonella and E. coli isolates in PmrD-dependent transcription occurring in low Mg(2+). Our results suggest that the differential regulation of conserved genes may have ecological consequences, determining the range of niches a microorganism can occupy.
Collapse
Affiliation(s)
- Mollie D Winfield
- Department of Molecular Microbiology, Howard Hughes Medical Institute, Washington University School of Medicine, Campus Box 8230, 660 South Euclid Avenue, St. Louis, MO 63110, USA
| | | |
Collapse
|
279
|
Abstract
Many genomic sequences have been recently published for bacteria that can replicate only within eukaryotic hosts. Comparisons of genomic features with those of closely related bacteria retaining free-living stages indicate that rapid evolutionary change often occurs immediately after host restriction. Typical changes include a large increase in the frequency of mobile elements in the genome, chromosomal rearrangements mediated by recombination among these elements, pseudogene formation, and deletions of varying size. In anciently host-restricted lineages, the frequency of insertion sequence elements decreases as genomes become extremely small and strictly clonal. These changes represent a general syndrome of genome evolution, which is observed repeatedly in host-restricted lineages from numerous phylogenetic groups. Considerable variation also exists, however, in part reflecting unstudied aspects of the population structure and ecology of host-restricted bacterial lineages.
Collapse
Affiliation(s)
- Nancy A Moran
- Department of Ecology and Evolutionary Biology & Center for Insect Science, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
280
|
Lacour S, Landini P. SigmaS-dependent gene expression at the onset of stationary phase in Escherichia coli: function of sigmaS-dependent genes and identification of their promoter sequences. J Bacteriol 2004; 186:7186-95. [PMID: 15489429 PMCID: PMC523212 DOI: 10.1128/jb.186.21.7186-7195.2004] [Citation(s) in RCA: 225] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The sigma(S) subunit of RNA polymerase, the product of the rpoS gene, controls the expression of genes responding to starvation and cellular stresses. Using gene array technology, we investigated rpoS-dependent expression at the onset of stationary phase in Escherichia coli grown in rich medium. Forty-one genes were expressed at significantly lower levels in an rpoS mutant derived from the MG1655 strain; for 10 of these, we also confirmed rpoS and stationary-phase dependence by reverse transcription-PCR. Only seven genes (dps, osmE, osmY, sodC, rpsV, wrbA, and yahO) had previously been recognized as rpoS dependent. Several newly identified rpoS-dependent genes are involved in the uptake and metabolism of amino acids, sugars, and iron. Indeed, the rpoS mutant strain shows severely impaired growth on some sugars such as fructose and N-acetylglucosamine. The rpoS gene controls the production of indole, which acts as a signal molecule in stationary-phase cells, via regulation of the tnaA-encoded tryptophanase enzyme. Genes involved in protein biosynthesis, encoding the ribosome-associated protein RpsV (sra) and the initiation factor IF-1 (infA), were also induced in an rpoS-dependent fashion. Using primer extension, we determined the promoter sequences of a selection of rpoS-regulated genes representative of different functional classes. Significant fractions of these promoters carry sequence features specific for Esigma(S) recognition of the -10 region, such as cytosines at positions -13 (70%) and -12 (30%) as well as a TG motif located upstream of the -10 region (50%), thus supporting the TGN(0-2)C(C/T)ATA(C/A)T consensus sequence recently proposed for sigma(S).
Collapse
Affiliation(s)
- Stephan Lacour
- Swiss Federal Institute of Environmental Technology (EAWAG), Dübendorf, Switzerland
| | | |
Collapse
|
281
|
Rezwan F, Lan R, Reeves PR. Molecular basis of the indole-negative reaction in Shigella strains: extensive damages to the tna operon by insertion sequences. J Bacteriol 2004; 186:7460-5. [PMID: 15489459 PMCID: PMC523188 DOI: 10.1128/jb.186.21.7460-7465.2004] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The molecular basis of the loss of tryptophan utilization (indole-negative phenotype) of Shigella strains, in effect clones of Escherichia coli, was investigated. Analysis of the tna operon of 23 Shigella strains representing each of the indole-negative serotypes revealed that insertion sequence-mediated insertion and/or deletions damaged the tna operon, leading to inability to convert tryptophan to indole. These events differ for cluster 1, cluster 3, and the outlier Shigella strains, confirming our previous observation of independent origins of these lineages from within E. coli. Parallel loss of the trait and prevalence of indole-negative strains suggest that the trait is deleterious in Shigella strains and advantages those without it.
Collapse
Affiliation(s)
- Ferdousi Rezwan
- School of Molecular and Microbial Biosciences, University of Sydney, NSW 2006, Australia
| | | | | |
Collapse
|
282
|
McClelland M, Sanderson KE, Clifton SW, Latreille P, Porwollik S, Sabo A, Meyer R, Bieri T, Ozersky P, McLellan M, Harkins CR, Wang C, Nguyen C, Berghoff A, Elliott G, Kohlberg S, Strong C, Du F, Carter J, Kremizki C, Layman D, Leonard S, Sun H, Fulton L, Nash W, Miner T, Minx P, Delehaunty K, Fronick C, Magrini V, Nhan M, Warren W, Florea L, Spieth J, Wilson RK. Comparison of genome degradation in Paratyphi A and Typhi, human-restricted serovars of Salmonella enterica that cause typhoid. Nat Genet 2004; 36:1268-74. [PMID: 15531882 DOI: 10.1038/ng1470] [Citation(s) in RCA: 304] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2004] [Accepted: 10/13/2004] [Indexed: 11/09/2022]
Abstract
Salmonella enterica serovars often have a broad host range, and some cause both gastrointestinal and systemic disease. But the serovars Paratyphi A and Typhi are restricted to humans and cause only systemic disease. It has been estimated that Typhi arose in the last few thousand years. The sequence and microarray analysis of the Paratyphi A genome indicates that it is similar to the Typhi genome but suggests that it has a more recent evolutionary origin. Both genomes have independently accumulated many pseudogenes among their approximately 4,400 protein coding sequences: 173 in Paratyphi A and approximately 210 in Typhi. The recent convergence of these two similar genomes on a similar phenotype is subtly reflected in their genotypes: only 30 genes are degraded in both serovars. Nevertheless, these 30 genes include three known to be important in gastroenteritis, which does not occur in these serovars, and four for Salmonella-translocated effectors, which are normally secreted into host cells to subvert host functions. Loss of function also occurs by mutation in different genes in the same pathway (e.g., in chemotaxis and in the production of fimbriae).
Collapse
Affiliation(s)
- Michael McClelland
- Sidney Kimmel Cancer Center, 10835 Altman Row, San Diego, California 92121, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
283
|
Gilmour MW, Thomson NR, Sanders M, Parkhill J, Taylor DE. The complete nucleotide sequence of the resistance plasmid R478: defining the backbone components of incompatibility group H conjugative plasmids through comparative genomics. Plasmid 2004; 52:182-202. [PMID: 15518875 DOI: 10.1016/j.plasmid.2004.06.006] [Citation(s) in RCA: 138] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Indexed: 11/25/2022]
Abstract
Horizontal transfer of resistance determinants amongst bacteria can be achieved by conjugative plasmid DNA elements. We have determined the complete 274,762 bp sequence of the incompatibility group H (IncH) plasmid R478, originally isolated from the Gram negative opportunistic pathogen Serratia marcescens. This self-transferable extrachromosomal genetic element contains 295 predicted genes, of which 144 are highly similar to coding sequences of IncH plasmids R27 and pHCM1. The regions of similarity among these three IncH plasmids principally encode core plasmid determinants (i.e., replication, partitioning and stability, and conjugative transfer) and we conducted a comparative analysis to define the minimal IncHI plasmid backbone determinants. No resistance determinants are included in the backbone and most of the sequences unique to R478 were contained in a large contiguous region between the two transfer regions. These findings indicate that plasmid evolution occurs through gene acquisition/loss predominantly in regions outside of the core determinants. Furthermore, a modular evolution for R478 was signified by the presence of gene neighbors or operons that were highly related to sequences from a wide range of chromosomal, transposon, and plasmid elements. The conjugative transfer regions are most similar to sequences encoded on SXT, Rts1, pCAR1, R391, and pRS241d. The dual partitioning modules encoded on R478 resemble numerous sequences; including pMT1, pCTX-M3, pCP301, P1, P7, and pB171. R478 also codes for resistance to tetracycline (Tn10), chloramphenicol (cat), kanamycin (aphA), mercury (similar to Tn21), silver (similar to pMG101), copper (similar to pRJ1004), arsenic (similar to pYV), and tellurite (two separate regions similar to IncHI2 ter determinants and IncP kla determinants). Other R478-encoded sequences are related to Tn7, IS26, tus, mucAB, and hok, where the latter is surrounded by insLKJ, and could potentially be involved in post-segregation killing. The similarity to a diverse set of bacterial sequences highlights the ability of horizontally transferable DNA elements to acquire and disseminate genetic traits through the bacterial gene pool.
Collapse
Affiliation(s)
- Matthew W Gilmour
- Department of Medical Microbiology and Immunology, University of Alberta, Edmonton, Alta., Canada T6G 2R3
| | | | | | | | | |
Collapse
|
284
|
Hanoulle X, Rollet E, Clantin B, Landrieu I, Odberg-Ferragut C, Lippens G, Bohin JP, Villeret V. Structural analysis of Escherichia coli OpgG, a protein required for the biosynthesis of osmoregulated periplasmic glucans. J Mol Biol 2004; 342:195-205. [PMID: 15313617 DOI: 10.1016/j.jmb.2004.07.004] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2004] [Revised: 07/06/2004] [Accepted: 07/06/2004] [Indexed: 11/17/2022]
Abstract
Osmoregulated periplasmic glucans (OPGs) G protein (OpgG) is required for OPGs biosynthesis. OPGs from Escherichia coli are branched glucans, with a backbone of beta-1,2 glucose units and with branches attached by beta-1,6 linkages. In Proteobacteria, OPGs are involved in osmoprotection, biofilm formation, virulence and resistance to antibiotics. Despite their important biological implications, enzymes synthesizing OPGs are poorly characterized. Here, we report the 2.5 A crystal structure of OpgG from E.coli. The structure was solved using a selenemethionine derivative of OpgG and the multiple anomalous diffraction method (MAD). The protein is composed of two beta-sandwich domains connected by one turn of 3(10) helix. The N-terminal domain (residues 22-388) displays a 25-stranded beta-sandwich fold found in several carbohydrate-related proteins. It exhibits a large cleft comprising many aromatic and acidic residues. This putative binding site shares some similarities with enzymes such as galactose mutarotase and glucodextranase, suggesting a potential catalytic role for this domain in OPG synthesis. On the other hand, the C-terminal domain (residues 401-512) has a seven-stranded immunoglobulin-like beta-sandwich fold, found in many proteins where it is mainly implicated in interactions with other molecules. The structural data suggest that OpgG is an OPG branching enzyme in which the catalytic activity is located in the large N-terminal domain and controlled via the smaller C-terminal domain.
Collapse
Affiliation(s)
- Xavier Hanoulle
- UMR 8525 CNRS, Institut de Biologie de Lille, Université de Lille II, 1 rue du Professeur Calmette, BP447, 59021, France.
| | | | | | | | | | | | | | | |
Collapse
|
285
|
Abstract
Because bacterial chromosomes are tightly packed with genes and were traditionally viewed as being optimized for size and replication speed, it was not surprising that the early annotations of sequenced bacterial genomes reported few, if any, pseudogenes. But because pseudogenes are generally recognized by comparisons with their functional counterparts, as more genome sequences accumulated, many bacterial pathogens were found to harbor large numbers of truncated, inactivated, and degraded genes. Because the mutational events that inactivate genes occur continuously in all genomes, we investigated whether the rarity of pseudogenes in some bacteria was attributable to properties inherent to the organism or to the failure to recognize pseudogenes. By developing a program suite (called Psi-Phi, for Psi-gene Finder) that applies a comparative method to identify pseudogenes (attributable both to misannotation and to nonrecognition), we analyzed the pseudogene inventories in the sequenced members of the Escherichia coli/Shigella clade. This approach recovered hundreds of previously unrecognized pseudogenes and showed that pseudogenes are a regular feature of bacterial genomes, even in those whose original annotations registered no truncated or otherwise inactivated genes. In Shigella flexneri 2a, large proportions of pseudogenes are generated by nonsense mutations and IS element insertions, events that seldom produce the pseudogenes present in the other genomes examined. Almost all (>95%) pseudogenes are restricted to only one of the genomes and are of relatively recent origin, suggesting that these bacteria possess active mechanisms to eliminate nonfunctional genes.
Collapse
Affiliation(s)
- Emmanuelle Lerat
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 87521, USA
| | | |
Collapse
|
286
|
Lan R, Alles MC, Donohoe K, Martinez MB, Reeves PR. Molecular evolutionary relationships of enteroinvasive Escherichia coli and Shigella spp. Infect Immun 2004; 72:5080-8. [PMID: 15322001 PMCID: PMC517479 DOI: 10.1128/iai.72.9.5080-5088.2004] [Citation(s) in RCA: 125] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Enteroinvasive Escherichia coli (EIEC), a distinctive pathogenic form of E. coli causing dysentery, is similar in many properties to bacteria placed in the four species of Shigella. Shigella has been separated as a genus but in fact comprises several clones of E. coli. The evolutionary relationships of 32 EIEC strains of 12 serotypes have been determined by sequencing of four housekeeping genes and two plasmid genes which were used previously to determine the relationships of Shigella strains. The EIEC strains were grouped in four clusters with one outlier strain, indicating independent derivation of EIEC several times. Three of the four clusters contain more than one O antigen type. One EIEC strain (an O112ac:H- strain) was found in Shigella cluster 3 but is not identical to the Shigella cluster 3 D2 and B15 strains with the same O antigen. Two forms of the virulence plasmid pINV have been identified in Shigella strains by using the sequences of ipgD and mxiA genes, and all but two of our EIEC strains have pINV A. The EIEC strains were grouped in two subclusters with a very low level of variation, generally not intermingled with Shigella pINV A strains. The EIEC clusters based on housekeeping genes were reflected in the plasmid gene sequences, with some exceptions. Two strains were found in the pINV B form by using the ipgD sequence, with one strain having an mxiA sequence similar to the divergent sequence of D1. Clearly, EIEC and Shigella spp. form a pathovar of E. coli.
Collapse
Affiliation(s)
- Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Australia.
| | | | | | | | | |
Collapse
|
287
|
Darling ACE, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 2004; 14:1394-403. [PMID: 15231754 PMCID: PMC442156 DOI: 10.1101/gr.2289704] [Citation(s) in RCA: 3508] [Impact Index Per Article: 167.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic of unique lineage-specific segments, regions shared with a subset of other genomes and segments conserved among all the genomes under consideration. Furthermore, the linear order of these segments may be shuffled among genomes. We present methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer. Our methods have been implemented in a software package called Mauve. Mauve has been applied to align nine enterobacterial genomes and to determine global rearrangement structure in three mammalian genomes. We have evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.
Collapse
Affiliation(s)
- Aaron C E Darling
- Department of Computer Science, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | | | |
Collapse
|
288
|
Aguilar D, Aviles FX, Querol E, Sternberg MJE. Analysis of phenetic trees based on metabolic capabilites across the three domains of life. J Mol Biol 2004; 340:491-512. [PMID: 15210350 DOI: 10.1016/j.jmb.2004.04.059] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2003] [Revised: 04/26/2004] [Accepted: 04/29/2004] [Indexed: 11/28/2022]
Abstract
Here, we used data of complete genomes to study comparatively the metabolism of different species. We built phenetic trees based on the enzymatic functions present in different parts of metabolism. Seven broad metabolic classes, comprising a total of 69 metabolic pathways, were comparatively analyzed for 27 fully sequenced organisms of the domains Eukarya, Bacteria and Archaea. Phylogenetic profiles based on the presence/absence of enzymatic functions for each metabolic class were determined and distance matrices for all the organisms were then derived from the profiles. Unrooted phenetic trees based upon the matrices revealed the distribution of the organisms according to their metabolic capabilities, reflecting the ecological pressures and adaptations that those species underwent during their evolution. We found that organisms that are closely related in phylogenetic terms could be distantly related metabolically and that the opposite is also true. For example, obligate bacterial pathogens were usually grouped together in our metabolic trees, demonstrating that obligate pathogens share common metabolic features regardless of their diverse phylogenetic origins. The branching order of proteobacteria often did not match their classical phylogenetic classification and Gram-positive bacteria showed diverse metabolic affinities. Archaea were found to be metabolically as distant from free-living bacteria as from eukaryotes, and sometimes were placed close to the metabolically highly specialized group of obligate bacterial pathogens. Metabolic trees represent an integrative approach for the comparison of the evolution of the metabolism and its correlation with the evolution of the genome, helping to find new relationships in the tree of life.
Collapse
Affiliation(s)
- Daniel Aguilar
- Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain
| | | | | | | |
Collapse
|
289
|
Williams KM, Bigley EC. Identification of an I-Ed-restricted T-cell epitope of Escherichia coli outer membrane protein F. Infect Immun 2004; 72:3907-13. [PMID: 15213134 PMCID: PMC427395 DOI: 10.1128/iai.72.7.3907-3913.2004] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
A predominant T-cell epitope of Escherichia coli outer membrane protein F (OmpF) that encompasses amino acids 295 to 314 was identified in H-2(d) mice. BALB/c-derived T-cell hybridomas generated against this region were CD3(+), CD4(+), CD8(-), and T-cell receptor alphabeta(+) and secreted TH-1-associated cytokines (interleukin-2 [IL-2] and gamma interferon), but not a TH-2-associated cytokine (IL-4), when restimulated with peptide 295-314. Class II(+) mouse lymphoma (A20) cells, but not class II(-) mouse mastocytoma (P815) cells, supported IL-2 secretion of hybridomas when substituted for syngeneic splenocytes as antigen-presenting cells (APCs). Antibodies specific for I-E(d) blocked IL-2 secretion by hybridomas, but I-A(d)-specific antiserum did not. When transfected L cells expressing I-A(d) (AalphaAbeta(d)), I-E(d) (EalphaEbeta(d)), or the hybrid molecule I-EalphaAbeta(d) were used as APCs, hybridomas recognized peptide only when presented by the I-E(d)-transfected cells. When peptide 295-314 truncated at either the C or the N terminus of the sequence was used, the minimal epitope was determined. Critical residues were determined by using alanine-substituted peptide analogues. T-cell hybridomas were only stimulated by peptides that encompassed amino acids 295 to 303 (9-mer), and the core sequence required a minimum of three additional amino acids at either the amino or the carboxy terminus to induce IL-2 secretion. Critical residues were determined to be phenylalanine at position 295, threonine at position 300, and tyrosines at positions 301 and 302. This study is the first to identify a minimal T-cell epitope and major histocompatibility complex restriction element of the OmpF protein and confirms previous observations that there is considerable degeneracy in the length of peptides that can bind I-E(d) and variability in the amino acid composition of the C and N termini of these peptides.
Collapse
Affiliation(s)
- Kristina M Williams
- Center for Food Safety and Applied Nutrition, Immunobiology Branch, Food and Drug Administration, 8301 Muirkirk Road, Laurel, MD 20708, USA.
| | | |
Collapse
|
290
|
Hacker J, Hochhut B, Middendorf B, Schneider G, Buchrieser C, Gottschalk G, Dobrindt U. Pathogenomics of mobile genetic elements of toxigenic bacteria. Int J Med Microbiol 2004; 293:453-61. [PMID: 15149018 DOI: 10.1078/1438-4221-00290] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The growing knowledge of genetic diversity and whole genome organization in bacteria shows that pathogenicity islands (PAIs) represent a subtype of a more general genetic element, termed genomic island (GEI), which is widespread among pathogenic and non-pathogenic microbes. These findings mirror the importance of horizontal gene transfer, genome reduction and recombination events as fundamental mechanisms involved in evolution of bacterial variants. GEIs are part of the flexible gene pool and carry selfish genes, but also determinants which may be beneficial under certain conditions thus increasing bacterial fitness and consequently their survival or transmission. In this review, we focus on the role of mobile genetic elements that may also contain toxin-encoding genes for genome variability and evolution of bacteria.
Collapse
Affiliation(s)
- Jörg Hacker
- University of Würzburg, Institute for Molecular Biology of Infectious Diseases, Würzburg, Germany.
| | | | | | | | | | | | | |
Collapse
|
291
|
Abstract
Differences in gene repertoire among bacterial genomes are usually ascribed to gene loss or to lateral gene transfer from unrelated cellular organisms. However, most bacteria contain large numbers of ORFans, that is, annotated genes that are restricted to a particular genome and that possess no known homologs. The uniqueness of ORFans within a genome has precluded the use of a comparative approach to examine their function and evolution. However, by identifying sequences unique to monophyletic groups at increasing phylogenetic depths, we can make direct comparisons of the characteristics of ORFans of different ages in the Escherichia coli genome, and establish their functional status and evolutionary rates. Relative to the genes ancestral to gamma-Proteobacteria and to those genes distributed sporadically in other prokaryotic species, ORFans in the E. coli lineage are short, A+T rich, and evolve quickly. Moreover, most encode functional proteins. Based on these features, ORFans are not attributable to errors in gene annotation, limitations of current databases, or to failure of methods for detecting homology. Rather, ORFans in the genomes of free-living microorganisms apparently derive from bacteriophage and occasionally become established by assuming roles in key cellular functions.
Collapse
Affiliation(s)
- Vincent Daubin
- Department of Biochemistry & Molecular Biophysics, University of Arizona, Tucson, Arizona 85721, USA.
| | | |
Collapse
|
292
|
Casjens S, Winn-Stapley DA, Gilcrease EB, Morona R, Kühlewein C, Chua JEH, Manning PA, Inwood W, Clark AJ. The chromosome of Shigella flexneri bacteriophage Sf6: complete nucleotide sequence, genetic mosaicism, and DNA packaging. J Mol Biol 2004; 339:379-94. [PMID: 15136040 DOI: 10.1016/j.jmb.2004.03.068] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2004] [Accepted: 03/22/2004] [Indexed: 11/28/2022]
Abstract
Shigella flexneri temperate bacteriophage Sf6 is of interest in part because its prophage expresses the oac gene that alters the antigenic properties of the surface O-antigen polysaccharide of its host bacterium. We have determined the complete sequence of its 39,044 bp genome. The sequence shows that Sf6 is a member of the canonical lambdoid phage group, and like other phages of this type has a highly mosaic genome. It has chromosomal regions that encode proteins >80% identical with at least 15 different previously characterized lambdoid phages and prophages, but 43% of the genome, including the virion assembly genes, is homologous to the genome of one phage, HK620. An analysis of the nucleotide differences between Sf6 and HK620 indicates that even these similar regions are highly mosaic. This mosaicism suggests ways in which the virion structural proteins might interact with each other. The Sf6 early operons are arranged like a typical lambdoid phage, with "boundary sequences" often found between functional modules in the "metabolic" genome domain. By virtue of high degree of similarity in the encoding genes and their DNA target sites, we predict that the integrase, early transcription anti-terminator, CI and Cro repressors, and CII protein of Sf6 have DNA binding specificities very similar to the homologous proteins encoded by phages HK620, lambda, 434 and P22, respectively. The late operon contains two tRNA genes. The Sf6 terminase genes are unusual. Analysis of in vivo initiation of the DNA packaging series showed that the Sf6 apparatus that recognizes DNA for packaging appears to cleave DNA for initiation of packaging series at many sites within a large region of about 1800 bp that includes a possible pac site. This is unlike previously characterized phage packaging mechanisms.
Collapse
Affiliation(s)
- Sherwood Casjens
- Department of Pathology, University of Utah Medical School, Salt Lake City, UT 84132, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
293
|
Fukiya S, Mizoguchi H, Tobe T, Mori H. Extensive genomic diversity in pathogenic Escherichia coli and Shigella Strains revealed by comparative genomic hybridization microarray. J Bacteriol 2004; 186:3911-21. [PMID: 15175305 PMCID: PMC419953 DOI: 10.1128/jb.186.12.3911-3921.2004] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Escherichia coli, including the closely related genus Shigella, is a highly diverse species in terms of genome structure. Comparative genomic hybridization (CGH) microarray analysis was used to compare the gene content of E. coli K-12 with the gene contents of pathogenic strains. Missing genes in a pathogen were detected on a microarray slide spotted with 4,071 open reading frames (ORFs) of W3110, a commonly used wild-type K-12 strain. For 22 strains subjected to the CGH microarray analyses 1,424 ORFs were found to be absent in at least one strain. The common backbone of the E. coli genome was estimated to contain about 2,800 ORFs. The mosaic distribution of absent regions indicated that the genomes of pathogenic strains were highly diversified because of insertions and deletions. Prophages, cell envelope genes, transporter genes, and regulator genes in the K-12 genome often were not present in pathogens. The gene contents of the strains tested were recognized as a matrix for a neighbor-joining analysis. The phylogenic tree obtained was consistent with the results of previous studies. However, unique relationships between enteroinvasive strains and Shigella, uropathogenic, and some enteropathogenic strains were suggested by the results of this study. The data demonstrated that the CGH microarray technique is useful not only for genomic comparisons but also for phylogenic analysis of E. coli at the strain level.
Collapse
Affiliation(s)
- Satoru Fukiya
- Kyowa Hakko Kogyo, Tokyo Research Laboratories, 3-6-6 Asahimachi, Machidashi, Tokyo 194-8533, Japan
| | | | | | | |
Collapse
|
294
|
Soni RK, Mehra P, Choudhury NR, Mukhopadhyay G, Dhar SK. Functional characterization of Helicobacter pylori DnaB helicase. Nucleic Acids Res 2004; 31:6828-40. [PMID: 14627816 PMCID: PMC290263 DOI: 10.1093/nar/gkg895] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Helicobacter pylori causes gastric ulcer diseases and gastric adenocarcinoma in humans. Not much is known regarding DNA replication in H.pylori that is important for cell survival. Here we report the cloning, expression and characterization of H.pylori DnaB (HpDnaB) helicase both in vitro and in vivo. Among the DnaB homologs, only Escherichia coli DnaB has been studied extensively. HpDnaB showed strong 5' to 3' helicase and ATPase activity. Interestingly, H.pylori does not have an obvious DnaC homolog which is essential for DnaB loading on the E.coli chromosomal DNA replication origin (oriC). However, HpDnaB can functionally complement the E.coli DnaB temperature-sensitive mutant at the non-permissive temperature, confirming that HpDnaB is a true replicative helicase. Escherichia coli DnaC co-eluted in the same fraction with HpDnaB following gel filtration analysis suggesting that these proteins might physically interact with each other. It is possible that a functional DnaC homolog is present in H.pylori. The complete characterization of H.pylori DnaB helicase will also help the comparative analysis of DnaB helicases among bacteria.
Collapse
Affiliation(s)
- Rajesh K Soni
- Special Centre For Molecular Medicine, Jawaharlal Nehru University, New Delhi 110067, India
| | | | | | | | | |
Collapse
|
295
|
Ren CP, Chaudhuri RR, Fivian A, Bailey CM, Antonio M, Barnes WM, Pallen MJ. The ETT2 gene cluster, encoding a second type III secretion system from Escherichia coli, is present in the majority of strains but has undergone widespread mutational attrition. J Bacteriol 2004; 186:3547-60. [PMID: 15150243 PMCID: PMC415751 DOI: 10.1128/jb.186.11.3547-3560.2004] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2003] [Accepted: 02/12/2004] [Indexed: 11/20/2022] Open
Abstract
ETT2 is a second cryptic type III secretion system in Escherichia coli which was first discovered through the analysis of genome sequences of enterohemorrhagic E. coli O157:H7. Comparative analyses of Escherichia and Shigella genome sequences revealed that the ETT2 gene cluster is larger than was previously thought, encompassing homologues of genes from the Spi-1, Spi-2, and Spi-3 Salmonella pathogenicity islands. ETT2-associated genes, including regulators and chaperones, were found at the same chromosomal location in the majority of genome-sequenced strains, including the laboratory strain K-12. Using a PCR-based approach, we constructed a complete tiling path through the ETT2 gene cluster for 79 strains, including the well-characterized E. coli reference collection supplemented with additional pathotypes. The ETT2 gene cluster was found to be present in whole or in part in the majority of E. coli strains, whether pathogenic or commensal, with patterns of distribution and deletion mirroring the known phylogenetic structure of the species. In almost all strains, including enterohemorrhagic E. coli O157:H7, ETT2 has been subjected to varying degrees of mutational attrition that render it unable to encode a functioning secretion system. A second type III secretion system-associated locus that likely encodes the ETT2 translocation apparatus was found in some E. coli strains. Intact versions of both ETT2-related clusters are apparently present in enteroaggregative E. coli strain O42.
Collapse
Affiliation(s)
- Chuan-Peng Ren
- Bacterial Pathogenesis and Genomics Unit, Division of Immunity and Infection, Medical School, University of Birmingham, Birmingham B15 2TT, United Kingdom
| | | | | | | | | | | | | |
Collapse
|
296
|
Kaniuk NA, Vinogradov E, Li J, Monteiro MA, Whitfield C. Chromosomal and plasmid-encoded enzymes are required for assembly of the R3-type core oligosaccharide in the lipopolysaccharide of Escherichia coli O157:H7. J Biol Chem 2004; 279:31237-50. [PMID: 15155763 DOI: 10.1074/jbc.m401879200] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The type R3 core oligosaccharide predominates in the lipopolysaccharides from enterohemorrhagic Escherichia coli isolates including O157:H7. The R3 core biosynthesis (waa) genetic locus contains two genes, waaD and waaJ, that are predicted to encode glycosyltransferases involved in completion of the outer core. Through determination of the structures of the lipopolysaccharide core in precise mutants and biochemical analyses of enzyme activities, WaaJ was shown to be a UDP-glucose:(galactosyl) lipopolysaccharide alpha-1,2-glucosyltransferase, and WaaD was shown to be a UDP-glucose:(glucosyl)lipopolysaccharide alpha-1,2-glucosyltransferase. The residue added by WaaJ was identified as the ligation site for O polysaccharide, and this was confirmed by determination of the structure of the linkage region in serotype O157 lipopolysaccharide. The initial O157 repeat unit begins with an N-acetylgalactosamine residue in a beta-anomeric configuration, whereas the biological repeat unit for O157 contains alpha-linked N-acetylgalactosamine residues. With the characterization of WaaJ and WaaD, the activities of all of the enzymes encoded by the R3 waa locus are either known or predicted from homology data with a high level of confidence. However, when core oligosaccharide structure is considered, the origin of an additional alpha-1,3-linked N-acetylglucosamine residue in the outer core is unknown. The gene responsible for a nonstoichiometric alpha-1,7-linked N-acetylglucosamine substituent in the heptose (inner core) region was identified on the large virulence plasmids of E. coli O157 and Shigella flexneri serotype 2a. This is the first plasmid-encoded core oligosaccharide biosynthesis enzyme reported in E. coli.
Collapse
Affiliation(s)
- Natalia A Kaniuk
- Department of Microbiology, University of Guelph, Guelph, Ontario N1G 2W1, Canada
| | | | | | | | | |
Collapse
|
297
|
Yap MN, Barak JD, Charkowski AO. Genomic diversity of Erwinia carotovora subsp. carotovora and its correlation with virulence. Appl Environ Microbiol 2004; 70:3013-23. [PMID: 15128563 PMCID: PMC404413 DOI: 10.1128/aem.70.5.3013-3023.2004] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2003] [Accepted: 01/06/2004] [Indexed: 11/20/2022] Open
Abstract
We used genetic and biochemical methods to examine the genomic diversity of the enterobacterial plant pathogen Erwinia carotovora subsp. carotovora. The results obtained with each method showed that E. carotovora subsp. carotovora strains isolated from one ecological niche, potato plants, are surprisingly diverse compared to related pathogens. A comparison of 23 partial mdh sequences revealed a maximum pairwise difference of 10.49% and an average pairwise difference of 2.13%, values which are much greater than the maximum variation (1.81%) and average variation (0.75%) previously reported for Escherichia coli. Pulsed-field gel electrophoresis analysis of I-CeuI-digested genomic DNA revealed seven rrn operons in all E. carotovora subsp. carotovora strains examined except strain WPP17, which had only six copies. We identified 26 I-CeuI restriction fragment length polymorphism patterns and observed significant polymorphism in fragment sizes ranging from 100 to 450 kb for all strains. We detected large plasmids in two strains, including the model strain E. carotovora subsp. carotovora 71. The two least virulent strains had an unusual chromosomal structure, suggesting that a particular pulsotype is correlated with virulence. To compare chromosomal organization of multiple enterobacterial genomes, several genes were mapped onto I-CeuI fragments. We identified portions of the genome that appear to be conserved across enterobacteria and portions that have undergone genome rearrangements. We found that the least virulent strain, WPP17, failed to oxidize cellobiose and was missing several hrp and hrc genes. The unexpected variability among isolates obtained from clonal hosts in one region and in one season suggests that factors other than the host plant, potato, drive the evolution of this common environmental bacterium and key plant pathogen.
Collapse
Affiliation(s)
- Mee-Ngan Yap
- Department of Plant Pathology, Russell Laboratories, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | | |
Collapse
|
298
|
Jennison AV, Verma NK. Shigella flexneri infection: pathogenesis and vaccine development. FEMS Microbiol Rev 2004; 28:43-58. [PMID: 14975529 DOI: 10.1016/j.femsre.2003.07.002] [Citation(s) in RCA: 179] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2003] [Revised: 07/25/2003] [Accepted: 07/30/2003] [Indexed: 02/08/2023] Open
Abstract
Shigella flexneri is a gram-negative bacterium which causes the most communicable of bacterial dysenteries, shigellosis. Shigellosis causes 1.1 million deaths and over 164 million cases each year, with the majority of cases occurring in the children of developing nations. The pathogenesis of S. flexneri is based on the bacteria's ability to invade and replicate within the colonic epithelium, which results in severe inflammation and epithelial destruction. The molecular mechanisms used by S. flexneri to cross the epithelial barrier, evade the host's immune response and enter epithelial cells have been studied extensively in both in vitro and in vivo models. Consequently, numerous virulence factors essential to bacterial invasion, intercellular spread and the induction of inflammation have been identified in S. flexneri. The inflammation produced by the host has been implicated in both the destruction of the colonic epithelium and in controlling and containing the Shigella infection. The host's humoral response to S. flexneri also appears to be important in protecting the host, whilst the role of the cellular immune response remains unclear. The host's immune response to shigellosis is serotype-specific and protective against reinfection by the same serotype, making vaccination a possibility. Since the 1940s vaccines for S. flexneri have been developed with little success, however, the growing understanding of S. flexneri's pathogenesis and the host's immune response is assisting in the generation of more refined vaccine strategies. Current research encompasses a variety of vaccine types, which despite disparity in their efficacy and safety in humans represent promising progress in S. flexneri vaccine development.
Collapse
Affiliation(s)
- Amy V Jennison
- Faculty of Science, School of Biochemistry and Molecular Biology, The Australian National University, Canberra, ACT 0200, Australia
| | | |
Collapse
|
299
|
Abstract
Few microorganisms are as versatile as Escherichia coli. An important member of the normal intestinal microflora of humans and other mammals, E. coli has also been widely exploited as a cloning host in recombinant DNA technology. But E. coli is more than just a laboratory workhorse or harmless intestinal inhabitant; it can also be a highly versatile, and frequently deadly, pathogen. Several different E. coli strains cause diverse intestinal and extraintestinal diseases by means of virulence factors that affect a wide range of cellular processes.
Collapse
Affiliation(s)
- James B Kaper
- Center for Vaccine Development, Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, Maryland 21201, USA.
| | | | | |
Collapse
|
300
|
Pushker R, Mira A, Rodríguez-Valera F. Comparative genomics of gene-family size in closely related bacteria. Genome Biol 2004; 5:R27. [PMID: 15059260 PMCID: PMC395786 DOI: 10.1186/gb-2004-5-4-r27] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2003] [Revised: 01/23/2004] [Accepted: 02/06/2004] [Indexed: 11/25/2022] Open
Abstract
The size of a given gene family is remarkably similar in strains of the same species and in closely related species, suggesting that homologous gene families are vertically transmitted and depend little on horizontal gene transfer. Background The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear. Results The relative content of paralogous genes in bacterial genomes increases with genome size, largely due to the expansion of gene family size in large genomes. Bacteria undergoing genome reduction display a parallel process of redundancy elimination, by which gene families are reduced to one or a few members. Gene family size is also influenced by sequence divergence and physiological function. Large gene families show wider sequence divergence, suggesting they are probably older, and certain functions (such as metabolite transport mechanisms) are overrepresented in large families. The size of a given gene family is remarkably similar in strains of the same species and in closely related species, suggesting that homologous gene families are vertically transmitted and depend little on horizontal gene transfer (HGT). Conclusions The remarkable preservation of copy numbers in widely different ecotypes indicates a functional role for the different copies rather than simply a back-up role. When different genera are compared, the increase in phylogenetic distance and/or ecological specialization disrupts this preservation, albeit in a gradual manner and maintaining an overall similarity, which also supports this view. HGT can have an important role, however, in nonhomologous gene families, as exemplified by a comparison between saprophytic and enterohemorrhagic strains of Escherichia coli.
Collapse
Affiliation(s)
- Ravindra Pushker
- Evolutionary Genomics Group, Universidad Miguel Hernández, Campus de San Juan, Apartado 18, 03550 San Juan de Alicante, Alicante, Spain
| | - Alex Mira
- Evolutionary Genomics Group, Universidad Miguel Hernández, Campus de San Juan, Apartado 18, 03550 San Juan de Alicante, Alicante, Spain
| | - Francisco Rodríguez-Valera
- Evolutionary Genomics Group, Universidad Miguel Hernández, Campus de San Juan, Apartado 18, 03550 San Juan de Alicante, Alicante, Spain
| |
Collapse
|