51
|
Grass S, Zidorn C, Blattner FR, Stuppner H. Comparative molecular and phytochemical investigation of Leontodon autumnalis (Asteraceae, Lactuceae) populations from Central Europe. PHYTOCHEMISTRY 2006; 67:122-31. [PMID: 16325215 DOI: 10.1016/j.phytochem.2005.10.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2005] [Revised: 09/28/2005] [Accepted: 10/11/2005] [Indexed: 05/05/2023]
Abstract
Previous analyses of Leontodon autumnalis L. revealed the existence of two chemotypes. In the current study molecular and phytochemical methods were combined to investigate 24 Central European populations of L. autumnalis. The focus of this study was the correlation of molecular and phytochemical characters at the intraspecific level. DNA fingerprint profiles of 183 individuals were obtained by random amplified polymorphic DNA (RAPD) providing 77 molecular markers. Contents of phenolics and sesquiterpenoids of flowering heads and sub-aerial parts were quantified by HPLC-DAD analyses. HPLC results were evaluated by principal component analysis. Geographic distribution of the two detected chemotypes partially overlapped. Phylogenetic groupings displayed in an unrooted neighbor-joining tree calculated from the RAPD data matrix were correlated with the geographical origin of the plant material. However, genetic profiles neither correlated with the two chemotypes nor with the morphologically based subspecies of L. autumnalis recognized by some authors. The presented data imply that the morphotypes are of multiple origins or due to different ecological growing conditions rather than genetically determined and that phytochemical races are induced by a limited number of genetical differences, which might have occurred independently in different lineages of the L. autumnalis group.
Collapse
|
52
|
Blattner FR. Multiple intercontinental dispersals shaped the distribution area of Hordeum (Poaceae). THE NEW PHYTOLOGIST 2006; 169:603-14. [PMID: 16411962 DOI: 10.1111/j.1469-8137.2005.01610.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The grass genus Hordeum (Poaceae, Triticeae), comprising 31 species distributed in temperate and dry regions of the world, was analysed to determine the relative contributions of vicariance and long-distance dispersal to the extant distribution pattern of the genus. Sequences from three nuclear regions (DMC1, EF-G and ITS) were combined and analysed phylogenetically for all diploid (20 species) and two tetraploid Hordeum species and the outgroup Psathyrostachys. Ages of clades within Hordeum were estimated using a penalized likelihood analysis of sequence divergence. The sequence data resulted in an almost fully resolved phylogenetic tree that allowed the reconstruction of intrageneric migration routes. Hordeum evolved c. 12 million years ago in South-west Asia and spread into Europe and Central Asia. The colonization of the New World and South Africa involved at least six intercontinental exchanges during the last 4 million years (twice Eurasia-North America, North America-South America, twice South America-North America and Europe-South Africa). Repeated long-distance dispersal between the northern and southern hemisphere were important colonization mechanisms in Hordeum.
Collapse
|
53
|
Blattner FR. Phylogenetic analysis of Hordeum (Poaceae) as inferred by nuclear rDNA ITS sequences. Mol Phylogenet Evol 2005; 33:289-99. [PMID: 15336664 DOI: 10.1016/j.ympev.2004.05.012] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2003] [Revised: 05/12/2004] [Indexed: 11/17/2022]
Abstract
Hordeum (Poaceae, Triticeae) occurs with 31 species worldwide in temperate regions, with the exception of Australasia. About 50% of the species are polyploids (4x, 6x) or occur as di- and polyploid forms. To analyze the phylogenetic relationships among diploid and polyploid taxa of the genus the nuclear rDNA internal transcribed spacer region (ITS) was analyzed for 91 accessions, representing all Hordeum species, together with 10 outgroup species. PCR products were either directly sequenced (outgroups) or cloned and eight clones per individual were analyzed. Phylogenetic analysis revealed four major clades that concur with the four genome groups in Hordeum (H, I, Xa, and Xu). Allopolyploids, putative autopolyploids, and species groups within the closely related H-genome clade could be identified. The ITS data indicate times of independent evolution of paralogous rDNA clusters on different chromosomes intermitted by sweeps of homogenization among these clusters and bi-directional homogenization of the clusters in diploids. Penalized likelihood analysis revealed an age of about 12 million years (my) for the genus and indicated the start of a rapid radiation in the H-genome group about 2.5 my ago in South America and Asia.
Collapse
|
54
|
Demidov D, Van Damme D, Geelen D, Blattner FR, Houben A. Identification and dynamics of two classes of aurora-like kinases in Arabidopsis and other plants. THE PLANT CELL 2005; 17:836-48. [PMID: 15722465 PMCID: PMC1069702 DOI: 10.1105/tpc.104.029710] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Aurora-like kinases play key roles in chromosome segregation and cytokinesis in yeast, plant, and animal systems. Here, we characterize three Arabidopsis thaliana protein kinases, designated AtAurora1, AtAurora2, and AtAurora3, which share high amino acid identities with the Ser/Thr kinase domain of yeast Ipl1 and animal Auroras. Structure and expression of AtAurora1 and AtAurora2 suggest that these genes arose by a recent gene duplication, whereas the diversification of plant alpha and beta Aurora kinases predates the origin of land plants. The transcripts and proteins of all three kinases are most abundant in tissues containing dividing cells. Intracellular localization of green fluorescent protein-tagged AtAuroras revealed an AtAurora-type specific association mainly with dynamic mitotic structures, such as microtubule spindles and centromeres, and with the emerging cell plate of dividing tobacco (Nicotiana tabacum) BY-2 cells. Immunolabeling using AtAurora antibodies yielded specific signals at the centromeres that are coincident with histone H3 that is phosphorylated at Ser position10 during mitosis. An in vitro kinase assay demonstrated that AtAurora1 preferentially phosphorylates histone H3 at Ser 10 but not at Ser 28 or Thr 3, 11, and 32. The phylogenetic analysis of available Aurora sequences from different eukaryotic origins suggests that, although a plant Aurora gene has been duplicated early in the evolution of plants, the paralogs nevertheless maintained a role in cell cycle-related signal transduction pathways.
Collapse
|
55
|
Jakob SS, Meister A, Blattner FR. The considerable genome size variation of Hordeum species (poaceae) is linked to phylogeny, life form, ecology, and speciation rates. Mol Biol Evol 2004; 21:860-9. [PMID: 15014163 DOI: 10.1093/molbev/msh092] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Genome size variation in plants is thought to be correlated with cytological, physiological, or ecological characters. However, conclusions drawn in several studies were often contradictory. To analyze nuclear genome size evolution in a phylogenetic framework, DNA contents of 134 accessions, representing all but one species of the barley genus Hordeum L., were measured by flow cytometry. The 2C DNA contents were in a range from 6.85 to 10.67 pg in diploids (2n = 14) and reached up to 29.85 pg in hexaploid species (2n = 42). The smallest genomes were found in taxa from the New World, which became secondarily annual, whereas the largest diploid genomes occur in Eurasian annuals. Genome sizes of polyploid taxa equaled mostly the added sizes of their proposed progenitors or were slightly (1% to 5%) smaller. The analysis of ancestral genome sizes on the base of the phylogeny of the genus revealed lineages with decreasing and with increasing genome sizes. Correlations of intraspecific genome size variation with the length of vegetation period were found in H. marinum populations from Western Europe but were not significant within two species from South America. On a higher taxonomical level (i.e., for species groups or the entire genus), environmental correlations were absent. This could mostly be attributed to the superimposition of life-form changes and phylogenetic constraints, which conceal ecogeographical correlations.
Collapse
|
56
|
Stehlik I, Blattner FR. Sex-specific SCAR markers in the dioecious plant Rumex nivalis (Polygonaceae) and implications for the evolution of sex chromosomes. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:238-242. [PMID: 13679980 DOI: 10.1007/s00122-003-1425-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2003] [Accepted: 07/04/2003] [Indexed: 05/24/2023]
Abstract
We developed SCAR primers based on isolated and sequenced male-specific fragments as identified in an AFLP analysis of the dioecious plant Rumex nivalis. PCR amplification using these primers on females and males resulted in fragments exclusively present in males. Co-amplification of the nuclear rDNA internal transcribed spacer 2 together with the male-specific fragment was applied as an internal control for successful PCR reactions to avoid false-negative sex scoring. With a length of about 164 bp, the AFLP fragment was of a similar size as the tandemly arranged, repetitive sequences of 180 bp located on the Y chromosomes of Rumex acetosa. The genetic distances between the Y-chromosomal sequences of R. nivalis and R. acetosa, both members of the section Acetosa, were substantial. We found intra-individual divergence among cloned sequences of the male-specific fragment in R. nivalis. The patterns of interspecific and intra-individual sequence variation found are in accordance with proposed modes of the evolution of sex chromosomes. Y chromosomes possibly arose only once in the genus Rumex and consist mainly of heterochromatic DNA. Due to the almost complete absence of selection on them, Y chromosomes are likely to accumulate large numbers of mutations.
Collapse
|
57
|
Shutov AD, Bäumlein H, Blattner FR, Müntz K. Storage and mobilization as antagonistic functional constraints on seed storage globulin evolution. JOURNAL OF EXPERIMENTAL BOTANY 2003; 54:1645-54. [PMID: 12754262 DOI: 10.1093/jxb/erg165] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
When seeds germinate nearly all the proteins are degraded in senescing storage tissue cells. All these proteins act as amino acid reserves which are mobilized to nourish the seedling. Nevertheless, the major amount of the seeds' protein reserve consists of a few enzymatically inactive, abundant, genuine storage proteins. In their metabolism the conflicting processes of biosynthesis, protein turnover and breakdown, are temporally separated. No degradation of correctly formed storage proteins was observed at the time of synthesis and accumulation during seed maturation. Breakdown takes place after a (long) period of rest when seeds germinate and seedlings start growing. At that time genuine storage proteins are no longer synthesized. Genuine storage proteins have evolved structural features permitting controlled temporal patterns of protection and proteolysis. The acquisition of inserted sequence stretches as sites accessible to limited proteolysis played a key role in the evolution of this control system and happened in coevolution of genuine storage proteins with specific proteinases. This can be deduced from the results of current research on the mechanisms of limited and unlimited proteolysis of storage globulins and on storage globulin evolution. The evolved system of controlled structure-function interplay between storage globulins and proteinases is part of a syndrome that, in addition, comprises differential compartmentation and gene expression of storage proteins and proteinases for controlling the total spatial and temporal patterns of globulin storage and mobilization in maturing and germinating seeds.
Collapse
|
58
|
Wei J, Goldberg MB, Burland V, Venkatesan MM, Deng W, Fournier G, Mayhew GF, Plunkett G, Rose DJ, Darling A, Mau B, Perna NT, Payne SM, Runyen-Janecky LJ, Zhou S, Schwartz DC, Blattner FR. Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T. Infect Immun 2003; 71:2775-86. [PMID: 12704152 PMCID: PMC153260 DOI: 10.1128/iai.71.5.2775-2786.2003] [Citation(s) in RCA: 303] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We determined the complete genome sequence of Shigella flexneri serotype 2a strain 2457T (4,599,354 bp). Shigella species cause >1 million deaths per year from dysentery and diarrhea and have a lifestyle that is markedly different from those of closely related bacteria, including Escherichia coli. The genome exhibits the backbone and island mosaic structure of E. coli pathogens, albeit with much less horizontally transferred DNA and lacking 357 genes present in E. coli. The strain is distinctive in its large complement of insertion sequences, with several genomic rearrangements mediated by insertion sequences, 12 cryptic prophages, 372 pseudogenes, and 195 S. flexneri-specific genes. The 2457T genome was also compared with that of a recently sequenced S. flexneri 2a strain, 301. Our data are consistent with Shigella being phylogenetically indistinguishable from E. coli. The S. flexneri-specific regions contain many genes that could encode proteins with roles in virulence. Analysis of these will reveal the genetic basis for aspects of this pathogenic organism's distinctive lifestyle that have yet to be explained.
Collapse
|
59
|
Welch RA, Burland V, Plunkett G, Redford P, Roesch P, Rasko D, Buckles EL, Liou SR, Boutin A, Hackett J, Stroud D, Mayhew GF, Rose DJ, Zhou S, Schwartz DC, Perna NT, Mobley HLT, Donnenberg MS, Blattner FR. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc Natl Acad Sci U S A 2002; 99:17020-4. [PMID: 12471157 PMCID: PMC139262 DOI: 10.1073/pnas.252529799] [Citation(s) in RCA: 1026] [Impact Index Per Article: 46.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present the complete genome sequence of uropathogenic Escherichia coli, strain CFT073. A three-way genome comparison of the CFT073, enterohemorrhagic E. coli EDL933, and laboratory strain MG1655 reveals that, amazingly, only 39.2% of their combined (nonredundant) set of proteins actually are common to all three strains. The pathogen genomes are as different from each other as each pathogen is from the benign strain. The difference in disease potential between O157:H7 and CFT073 is reflected in the absence of genes for type III secretion system or phage- and plasmid-encoded toxins found in some classes of diarrheagenic E. coli. The CFT073 genome is particularly rich in genes that encode potential fimbrial adhesins, autotransporters, iron-sequestration systems, and phase-switch recombinases. Striking differences exist between the large pathogenicity islands of CFT073 and two other well-studied uropathogenic E. coli strains, J96 and 536. Comparisons indicate that extraintestinal pathogenic E. coli arose independently from multiple clonal lineages. The different E. coli pathotypes have maintained a remarkable synteny of common, vertically evolved genes, whereas many islands interrupting this common backbone have been acquired by different horizontal transfer events in each strain.
Collapse
|
60
|
Hartung F, Blattner FR, Puchta H. Intron gain and loss in the evolution of the conserved eukaryotic recombination machinery. Nucleic Acids Res 2002; 30:5175-81. [PMID: 12466542 PMCID: PMC137963 DOI: 10.1093/nar/gkf649] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Intron conservation, intron gain or loss and putative intron sliding events were determined for a set of three genes (SPO11, MRE11 and DMC1) involved in basic aspects of recombination in eukaryotes. These are ancient genes and present in nearly all of the major kingdoms. MRE11 is of bacterial origin and can be found in all kingdoms. DMC1 is a specialized homolog of the bacterial RecA protein, whereas the SPO11 gene is of archaebacterial origin. Only unique homologs of SPO11 are found in animals and fungi whereas three distantly related SPO11 copies are present in plant genomes. A comparison of the respective intron positions and phases of all genes was performed, demonstrating that a quarter of the intron positions were perfectly conserved over more than 1 000 000 000 years. Regarding the remaining three quarters of the introns we found insertions to be about three times more frequent than deletions. Aligning the introns of the three different SPO11 homologs of Arabidopsis thaliana we propose a conclusive model of their evolution. We postulate that at least one duplication event occurred shortly after the divergence of plants from animals and fungi and that a respective homolog has been retained in a protist group, the apicomplexa.
Collapse
|
61
|
Stehlik I, Blattner FR, Holderegger R, Bachmann K. Nunatak survival of the high Alpine plant Eritrichium nanum (L.) Gaudin in the central Alps during the ice ages. Mol Ecol 2002; 11:2027-36. [PMID: 12296946 DOI: 10.1046/j.1365-294x.2002.01595.x] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Polymerase chain reaction-restriction fragment length polymorphisms (PCR-RFLPs) and sequence analysis of noncoding regions of chloroplast DNA were used to investigate 37 populations of Eritrichium nanum covering its total distribution area, the European Alps. There was no haplotypic variation within the populations, and most haplotypes were restricted to single sites or to neighbouring populations, suggesting low levels of long distance gene flow via seeds. The present geographical distribution of haplotypes probably reflects an ancient geographical pattern within two regions in the intensely glaciated western and eastern central Alps identified as genetic hotspot areas. These two regions contained seven of the total of 11 haplotypes, including many of the most derived ones. The divergent haplotypes formed closely related groups, which supported a separate evolution of these haplotypes in these two regions and, more importantly, gave strong evidence for the in situ survival of these populations on nunataks within the western and eastern central Alps during Pleistocene glaciation. This result is in concordance with a previous study on E. nanum using nuclear markers. Only one haplotype was common and widespread throughout the distributional range of E. nanum. At the same time, it was the evolutionarily basal-most and all other haplotypes were best described as its descendants. This haplotype is hypothesized to be genetically identical to a Tertiary Alpine colonizing ancestor, whose distribution was secondarily fragmented and infiltrated by derived haplotypes originating through local mutations.
Collapse
|
62
|
Lim A, Dimalanta ET, Potamousis KD, Yen G, Apodoca J, Tao C, Lin J, Qi R, Skiadas J, Ramanathan A, Perna NT, Plunkett G, Burland V, Mau B, Hackett J, Blattner FR, Anantharaman TS, Mishra B, Schwartz DC. Shotgun optical maps of the whole Escherichia coli O157:H7 genome. Genome Res 2001; 11:1584-93. [PMID: 11544203 PMCID: PMC311123 DOI: 10.1101/gr.172101] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 06/04/2001] [Indexed: 11/24/2022]
Abstract
We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies.
Collapse
|
63
|
Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW. On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data. J Comput Biol 2001; 8:37-52. [PMID: 11339905 DOI: 10.1089/106652701300099074] [Citation(s) in RCA: 479] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We consider the problem of inferring fold changes in gene expression from cDNA microarray data. Standard procedures focus on the ratio of measured fluorescent intensities at each spot on the microarray, but to do so is to ignore the fact that the variation of such ratios is not constant. Estimates of gene expression changes are derived within a simple hierarchical model that accounts for measurement error and fluctuations in absolute gene expression levels. Significant gene expression changes are identified by deriving the posterior odds of change within a similar model. The methods are tested via simulation and are applied to a panel of Escherichia coli microarrays.
Collapse
|
64
|
Blattner FR, Weising K, Bänfer G, Maschwitz U, Fiala B. Molecular analysis of phylogenetic relationships among Myrmecophytic macaranga species (Euphorbiaceae). Mol Phylogenet Evol 2001; 19:331-44. [PMID: 11399144 DOI: 10.1006/mpev.2001.0941] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Many species of the paleotropical pioneer tree genus Macaranga Thou. (Euphorbiaceae) live in association with ants. Various types of mutualistic interactions exist, ranging from the attraction of unspecific ant visitors to obligate myrmecophytism. In the latter, nesting space and food bodies are exchanged for protection by highly specific ant partners (mainly species of the myrmicine genus Crematogaster). As a first step toward elucidating the coevolution of ant-plant interactions in the Macaranga-Crematogaster system, we have initiated a molecular investigation of the plant partners' phylogeny. Nuclear ribosomal DNA internal transcribed spacer (ITS) sequences were analyzed for 73 accessions from 47 Macaranga species, representing 17 sections or informally described species groups. Three accessions from the putative sister taxon Mallotus Lour, were included as outgroups. Cladograms of the ITS data revealed Macaranga to be nested within Mallotus. ITS sequences are highly similar within section Pachystemon s.str., suggesting a relatively recent and rapid radiation of obligate myrmecophytes within this section. Forty-three accessions, mainly of ant-inhabited species, were additionally investigated by random amplified polymorphic DNA (RAPD) and microsatellite-primed PCR (MP-PCR) techniques. Phenetic analysis of RAPD and MP-PCR banding profiles generally confirmed the ITS results. Best resolutions for individual clades were obtained when ITS and RAPD/MP-PCR data were combined into a single matrix and analyzed phenetically. The combined analysis suggests multiple (four) rather than a single evolutionary origin of myrmecophytism, at least one reversal from obligate myrmecophytism to nonmyrmecophytism, and one loss of mutualistic specifity.
Collapse
|
65
|
Venkatesan MM, Goldberg MB, Rose DJ, Grotbeck EJ, Burland V, Blattner FR. Complete DNA sequence and analysis of the large virulence plasmid of Shigella flexneri. Infect Immun 2001; 69:3271-85. [PMID: 11292750 PMCID: PMC98286 DOI: 10.1128/iai.69.5.3271-3285.2001] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The complete sequence analysis of the 210-kb Shigella flexneri 5a virulence plasmid was determined. Shigella spp. cause dysentery and diarrhea by invasion and spread through the colonic mucosa. Most of the known Shigella virulence determinants are encoded on a large plasmid that is unique to virulent strains of Shigella and enteroinvasive Escherichia coli; these known genes account for approximately 30 to 35% of the virulence plasmid. In the complete sequence of the virulence plasmid, 286 open reading frames (ORFs) were identified. An astonishing 153 (53%) of these were related to known and putative insertion sequence (IS) elements; no known bacterial plasmid has previously been described with such a high proportion of IS elements. Four new IS elements were identified. Fifty putative proteins show no significant homology to proteins of known function; of these, 18 have a G+C content of less than 40%, typical of known virulence genes on the plasmid. These 18 constitute potentially unknown virulence genes. Two alleles of shet2 and five alleles of ipaH were also identified on the plasmid. Thus, the plasmid sequence suggests a remarkable history of IS-mediated acquisition of DNA across bacterial species. The complete sequence will permit targeted characterization of potential new Shigella virulence determinants.
Collapse
|
66
|
Dong Y, Glasner JD, Blattner FR, Triplett EW. Genomic interspecies microarray hybridization: rapid discovery of three thousand genes in the maize endophyte, Klebsiella pneumoniae 342, by microarray hybridization with Escherichia coli K-12 open reading frames. Appl Environ Microbiol 2001; 67:1911-21. [PMID: 11282649 PMCID: PMC92813 DOI: 10.1128/aem.67.4.1911-1921.2001] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2000] [Accepted: 01/11/2001] [Indexed: 11/20/2022] Open
Abstract
In an effort to efficiently discover genes in the diazotrophic endophyte of maize, Klebsiella pneumoniae 342, DNA from strain 342 was hybridized to a microarray containing 96% (n = 4,098) of the annotated open reading frames from Escherichia coli K-12. Using a criterion of 55% identity or greater, 3,000 (70%) of the E. coli K-12 open reading frames were also found to be present in strain 342. Approximately 24% (n = 1,030) of the E. coli K-12 open reading frames are absent in strain 342. For 1.6% (n = 68) of the open reading frames, the signal was too low to make a determination regarding the presence or absence of the gene. Genes with high identity between the two organisms are those involved in energy metabolism, amino acid metabolism, fatty acid metabolism, cofactor synthesis, cell division, DNA replication, transcription, translation, transport, and regulatory proteins. Functions that were less highly conserved included carbon compound metabolism, membrane proteins, structural proteins, putative transport proteins, cell processes such as adaptation and protection, and central intermediary metabolism. Open reading frames of E. coli K-12 with little or no identity in strain 342 included putative regulatory proteins, putative chaperones, surface structure proteins, mobility proteins, putative enzymes, hypothetical proteins, and proteins of unknown function, as well as genes presumed to have been acquired by lateral transfer from sources such as phage, plasmids, or transposons. The results were in agreement with the physiological properties of the two strains. Whole genome comparisons by genomic interspecies microarray hybridization are shown to rapidly identify thousands of genes in a previously uncharacterized bacterial genome provided that the genome of a close relative has been fully sequenced. This approach will become increasingly more useful as more full genome sequences become available.
Collapse
|
67
|
Perna NT, Plunkett G, Burland V, Mau B, Glasner JD, Rose DJ, Mayhew GF, Evans PS, Gregor J, Kirkpatrick HA, Pósfai G, Hackett J, Klink S, Boutin A, Shao Y, Miller L, Grotbeck EJ, Davis NW, Lim A, Dimalanta ET, Potamousis KD, Apodaca J, Anantharaman TS, Lin J, Yen G, Schwartz DC, Welch RA, Blattner FR. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature 2001; 409:529-33. [PMID: 11206551 DOI: 10.1038/35054089] [Citation(s) in RCA: 1470] [Impact Index Per Article: 63.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The bacterium Escherichia coli O157:H7 is a worldwide threat to public health and has been implicated in many outbreaks of haemorrhagic colitis, some of which included fatalities caused by haemolytic uraemic syndrome. Close to 75,000 cases of O157:H7 infection are now estimated to occur annually in the United States. The severity of disease, the lack of effective treatment and the potential for large-scale outbreaks from contaminated food supplies have propelled intensive research on the pathogenesis and detection of E. coli O157:H7 (ref. 4). Here we have sequenced the genome of E. coli O157:H7 to identify candidate genes responsible for pathogenesis, to develop better methods of strain detection and to advance our understanding of the evolution of E. coli, through comparison with the genome of the non-pathogenic laboratory strain E. coli K-12 (ref. 5). We find that lateral gene transfer is far more extensive than previously anticipated. In fact, 1,387 new genes encoded in strain-specific clusters of diverse sizes were found in O157:H7. These include candidate virulence factors, alternative metabolic capacities, several prophages and other new functions--all of which could be targets for surveillance.
Collapse
|
68
|
Wei Y, Lee JM, Richmond C, Blattner FR, Rafalski JA, LaRossa RA. High-density microarray-mediated gene expression profiling of Escherichia coli. J Bacteriol 2001; 183:545-56. [PMID: 11133948 PMCID: PMC94910 DOI: 10.1128/jb.183.2.545-556.2001] [Citation(s) in RCA: 181] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2000] [Accepted: 10/25/2000] [Indexed: 11/20/2022] Open
Abstract
A nearly complete collection of 4,290 Escherichia coli open reading frames was amplified and arrayed in high density on glass slides. To exploit this reagent, conditions for RNA isolation from E. coli cells, cDNA production with attendant fluorescent dye incorporation, DNA-DNA hybridization, and hybrid quantitation have been established. A brief isopropyl-beta-D-thiogalactopyranoside (IPTG) treatment elevated lacZ, lacY, and lacA transcript content about 30-fold; in contrast, most other transcript titers remained unchanged. Distinct RNA expression patterns between E. coli cultures in the exponential and transitional phases of growth were catalogued, as were differences associated with culturing in minimal and rich media. The relative abundance of each transcript was estimated by using hybridization of a genomic DNA-derived, fluorescently labeled probe as a correction factor. This inventory provided a quantitative view of the steady-state level of each mRNA species. Genes the expression of which was detected by this method were enumerated, and results were compared with the current understanding of E. coli physiology.
Collapse
|
69
|
Selinger DW, Cheung KJ, Mei R, Johansson EM, Richmond CS, Blattner FR, Lockhart DJ, Church GM. RNA expression analysis using a 30 base pair resolution Escherichia coli genome array. Nat Biotechnol 2000; 18:1262-8. [PMID: 11101804 DOI: 10.1038/82367] [Citation(s) in RCA: 286] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We have developed a high-resolution "genome array" for the study of gene expression and regulation in Escherichia coli. This array contains on average one 25-mer oligonucleotide probe per 30 base pairs over the entire genome, with one every 6 bases for the intergenic regions and every 60 bases for the 4,290 open reading frames (ORFs). Twofold concentration differences can be detected at levels as low as 0.2 messenger RNA (mRNA) copies per cell, and differences can be seen over a dynamic range of three orders of magnitude. In rich medium we detected transcripts for 97% and 87% of the ORFs in stationary and log phases, respectively. We found that 1, 529 transcripts were differentially expressed under these conditions. As expected, genes involved in translation were expressed at higher levels in log phase, whereas many genes known to be involved in the starvation response were expressed at higher levels in stationary phase. Many previously unrecognized growth phase-regulated genes were identified, such as a putative receptor (b0836) and a 30S ribosomal protein subunit (S22), both of which are highly upregulated in stationary phase. Transcription of between 3,000 and 4,000 predicted ORFs was observed from the antisense strand, indicating that most of the genome is transcribed at a detectable level. Examples are also presented for high-resolution array analysis of transcript start and stop sites and RNA secondary structure.
Collapse
|
70
|
Friesen N, Fritsch RM, Pollner S, Blattner FR. Molecular and morphological evidence for an origin of the aberrant genus Milula within himalayan species of Allium (Alliacae). Mol Phylogenet Evol 2000; 17:209-18. [PMID: 11083935 DOI: 10.1006/mpev.2000.0844] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Phylogenetic relationships between Allium and the monotypic Himalayan genus Milula were analyzed using sequences of the nuclear ribosomal DNA internal transcribed spacer (ITS) region and of the intergenic spacers from the chloroplast trnD(GUC)-trnT(GGU) region. Both marker systems unambiguously placed Milula spicata within Allium subgenus Rhizirideum, close to A. cyathophorum. Morphologically, the main difference between Allium and Milula is the conspicuous spicate inflorescence of Milula vs the mostly capitate or umbellate inflorescences in Allium. Anatomical investigations of leaf characters support a close relationship of Milula with A. cyathophorum and A. mairei, whereas root characters are distinctive from other species of section Cyathophora. To maintain Allium as monophyletic, Milula has been included as A. spicatum in Allium subgenus Rhizirideum.
Collapse
MESH Headings
- Allium/classification
- Allium/genetics
- DNA, Chloroplast/chemistry
- DNA, Chloroplast/genetics
- DNA, Plant/chemistry
- DNA, Plant/genetics
- DNA, Ribosomal Spacer/genetics
- Evolution, Molecular
- Liliaceae/anatomy & histology
- Liliaceae/classification
- Liliaceae/genetics
- Molecular Sequence Data
- Phylogeny
- Plant Leaves/anatomy & histology
- Plant Roots/anatomy & histology
- RNA, Ribosomal, 5.8S/genetics
- RNA, Transfer/genetics
- Sequence Analysis, DNA
Collapse
|
71
|
Ffrench-Constant RH, Waterfield N, Burland V, Perna NT, Daborn PJ, Bowen D, Blattner FR. A genomic sample sequence of the entomopathogenic bacterium Photorhabdus luminescens W14: potential implications for virulence. Appl Environ Microbiol 2000; 66:3310-29. [PMID: 10919786 PMCID: PMC92150 DOI: 10.1128/aem.66.8.3310-3329.2000] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Photorhabdus luminescens is a pathogenic bacterium that lives in the guts of insect-pathogenic nematodes. After invasion of an insect host by a nematode, bacteria are released from the nematode gut and help kill the insect, in which both the bacteria and the nematodes subsequently replicate. However, the bacterial virulence factors associated with this "symbiosis of pathogens" remain largely obscure. In order to identify genes encoding potential virulence factors, we performed approximately 2,000 random sequencing reads from a P. luminescens W14 genomic library. We then compared the sequences obtained to sequences in existing gene databases and to the Escherichia coli K-12 genome sequence. Here we describe the different classes of potential virulence factors found. These factors include genes that putatively encode Tc insecticidal toxin complexes, Rtx-like toxins, proteases and lipases, colicin and pyocins, and various antibiotics. They also include a diverse array of secretion (e.g., type III), iron uptake, and lipopolysaccharide production systems. We speculate on the potential functions of each of these gene classes in insect infection and also examine the extent to which the invertebrate pathogen P. luminescens shares potential antivertebrate virulence factors. The implications for understanding both the biology of this insect pathogen and links between the evolution of vertebrate virulence factors and the evolution of invertebrate virulence factors are discussed.
Collapse
|
72
|
Sherburne CK, Lawley TD, Gilmour MW, Blattner FR, Burland V, Grotbeck E, Rose DJ, Taylor DE. The complete DNA sequence and analysis of R27, a large IncHI plasmid from Salmonella typhi that is temperature sensitive for transfer. Nucleic Acids Res 2000; 28:2177-86. [PMID: 10773089 PMCID: PMC105367 DOI: 10.1093/nar/28.10.2177] [Citation(s) in RCA: 126] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/1999] [Revised: 03/20/2000] [Accepted: 03/20/2000] [Indexed: 11/14/2022] Open
Abstract
Salmonella typhi, the causative agent of typhoid fever, annually infects 16 million people and kills 600 000 world wide. Plasmid-encoded multiple drug resistance in S. typhi is always encoded by plasmids of incompatibility group H (IncH). The complete DNA sequence of the large temperature-sensitive conjugative plasmid R27, the prototype for the IncHI1 family of plasmids, has been compiled and analyzed. This 180 kb plasmid contains 210 open reading frames (ORFs), of which 14 have been previously identified and 56 exhibit similarity to other plasmid and prokaryotic ORFs. A number of insertion elements were found, including the full Tn 10 transposon, which carries tetracycline resistance genes. Two transfer regions, Tra1 and Tra2, are present, which are separated by a minimum of 64 kb. Homologs of the DNA-binding proteins TlpA and H-NS that act as temperature-regulated repressors in other systems have been located in R27. Sequence analysis of transfer and replication regions supports a mosaic-like structure for R27. The genes responsible for conjugation and plasmid maintenance have been identified and mechanisms responsible for thermosensitive transfer are discussed.
Collapse
|
73
|
Salgado H, Santos-Zavaleta A, Gama-Castro S, Millán-Zárate D, Blattner FR, Collado-Vides J. RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res 2000; 28:65-7. [PMID: 10592182 PMCID: PMC102478 DOI: 10.1093/nar/28.1.65] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
RegulonDB is a database on transcription regulation and operon organization in Escherichia coli. The current version describes regulatory signals of transcription initiation, promoters, regulatory binding sites of specific regulators, ribosome binding sites and terminators, as well as information on genes clustered in operons. These specific annotations have been gathered from a constant search in the literature, as well as based on computational sequence predictions. The genomic coordinates of all these objects in the E.coli K-12 chromosome are clearly indicated. Every known object has a link to at least one MEDLINE reference. We have also added direct links to recent expression data of E.coli K-12. The version presented here has important modifications both in the structure of the database, as well as in the amount and type of information encoded in the database. RegulonDB can be accessed on the web at URL: http://www.cifn.unam. mx/Computational_Biology/regulondb/
Collapse
|
74
|
Blattner FR. Direct amplification of the entire ITS region from poorly preserved plant material using recombinant PCR. Biotechniques 1999; 27:1180-6. [PMID: 10631497 DOI: 10.2144/99276st04] [Citation(s) in RCA: 214] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Sequences of the internal transcribed spacers (ITS) of the nuclear ribosomal DNA are important molecular markers in phylogenetic analyses. To obtain sequences from herbarium material in which DNA often is severely degraded, the ITS region has to be amplified in two steps. Two methods that reduce bench time and reagents used are described. (i) Separately amplified preparations of subunits ITS-1 and ITS-2 are combined before purification. The presence of two fragments in the sequencing reaction does not impair the quality of sequences. (ii) Newly designed internal primers amplify partly overlapping regions of the two subunits. A combination of these internal primers with the external primers in one PCR allows the amplification of the entire ITS region even when degraded DNAs are used. This recombinant PCR approach, taking into account the +A bases added by several Taq DNA polymerases, will also be useful with other marker regions used in molecular phylogenetics.
Collapse
|
75
|
Pósfai G, Kolisnychenko V, Bereczki Z, Blattner FR. Markerless gene replacement in Escherichia coli stimulated by a double-strand break in the chromosome. Nucleic Acids Res 1999; 27:4409-15. [PMID: 10536150 PMCID: PMC148724 DOI: 10.1093/nar/27.22.4409] [Citation(s) in RCA: 210] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A simple and efficient gene replacement method, based on the recombination and repair activities of the cell, was developed. The method permits the targeted construction of markerless deletions, insertions and point mutations in the Escherichia coli chromosome. A suicide plasmid, carrying the mutant allele and the recognition site of meganuclease I- Sce I, is inserted into the genome by homologous recombination between the mutant and the wild-type (wt) alleles. Resolution of this cointegrate by intramolecular recombination of the allele pair results in either a mutant or a wt chromosome which can be distinguished by allele-specific PCR screening. The resolution process is stimulated by introducing a unique double-strand break (DSB) into the chromosome at the I- Sce I site. Cleavage by the nuclease not only enhances the frequency of resolution by two to three orders of magnitude, but also selects for the resolved products. The DSB-stimulated gene replacement method can be used in recombination-proficient E.coli cells, does not require specific growth conditions, and is potentially applicable in other microorganisms. Use of the method was demonstrated by constructing a 17-bp and a 62-kb deletion in the MG1655 chromosome. Cleavage of the chromosome induces the SOS response but does not lead to an increased mutation rate.
Collapse
|
76
|
Tao H, Bausch C, Richmond C, Blattner FR, Conway T. Functional genomics: expression analysis of Escherichia coli growing on minimal and rich media. J Bacteriol 1999; 181:6425-40. [PMID: 10515934 PMCID: PMC103779 DOI: 10.1128/jb.181.20.6425-6440.1999] [Citation(s) in RCA: 455] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
DNA arrays of the entire set of Escherichia coli genes were used to measure the genomic expression patterns of cells growing in late logarithmic phase on minimal glucose medium and on Luria broth containing glucose. Ratios of the transcript levels for all 4,290 E. coli protein-encoding genes (cds) were obtained, and analysis of the expression ratio data indicated that the physiological state of the cells under the two growth conditions could be ascertained. The cells in the rich medium grew faster, and expression of the majority of the translation apparatus genes was significantly elevated under this growth condition, consistent with known patterns of growth rate-dependent regulation and increased rate of protein synthesis in rapidly growing cells. The cells grown on minimal medium showed significantly elevated expression of many genes involved in biosynthesis of building blocks, most notably the amino acid biosynthetic pathways. Nearly half of the known RpoS-dependent genes were expressed at significantly higher levels in minimal medium than in rich medium, and rpoS expression was similarly elevated. The role of RpoS regulation in these logarithmic phase cells was suggested by the functions of the RpoS dependent genes that were induced. The hallmark features of E. coli cells growing on glucose minimal medium appeared to be the formation and excretion of acetate, metabolism of the acetate, and protection of the cells from acid stress. A hypothesis invoking RpoS and UspA (universal stress protein, also significantly elevated in minimal glucose medium) as playing a role in coordinating these various aspects and consequences of glucose and acetate metabolism was generated. This experiment demonstrates that genomic expression assays can be applied in a meaningful way to the study of whole-bacterial-cell physiology for the generation of hypotheses and as a guide for more detailed studies of particular genes of interest.
Collapse
|
77
|
Richmond CS, Glasner JD, Mau R, Jin H, Blattner FR. Genome-wide expression profiling in Escherichia coli K-12. Nucleic Acids Res 1999; 27:3821-35. [PMID: 10481021 PMCID: PMC148645 DOI: 10.1093/nar/27.19.3821] [Citation(s) in RCA: 410] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We have established high resolution methods for global monitoring of gene expression in Escherichia coli. Hybridization of radiolabeled cDNA to spot blots on nylon membranes was compared to hybridization of fluorescently-labeled cDNA to glass microarrays for efficiency and reproducibility. A complete set of PCR primers was created for all 4290 annotated open reading frames (ORFs) from the complete genome sequence of E.coli K-12 (MG1655). Glass- and nylon-based arrays of PCR products were prepared and used to assess global changes in gene expression. Full-length coding sequences for array printing were generated by two-step PCR amplification. In this study we measured changes in RNA levels after exposure to heat shock and following treatment with isopropyl-beta-D-thiogalactopyranoside (IPTG). Both radioactive and fluorescence-based methods showed comparable results. Treatment with IPTG resulted in high level induction of the lacZYA and melAB operons. Following heat shock treatment 119 genes were shown to have significantly altered expression levels, including 35 previously uncharacterized ORFs and most genes of the heat shock stimulon. Analysis of spot intensities from hybridization to replicate arrays identified sets of genes with signals consistently above background suggesting that at least 25% of genes were expressed at detectable levels during growth in rich media.
Collapse
|
78
|
Shutov AD, Blattner FR, Bäumlein H. Evolution of a conserved protein module from Archaea to plants. Trends Genet 1999; 15:348-9. [PMID: 10461202 DOI: 10.1016/s0168-9525(99)01813-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
79
|
Allex CF, Shavlik JW, Blattner FR. Neural network input representations that produce accurate consensus sequences from DNA fragment assemblies. Bioinformatics 1999; 15:723-8. [PMID: 10498772 DOI: 10.1093/bioinformatics/15.9.723] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Given inputs extracted from an aligned column of DNA bases and the underlying Perkin Elmer Applied Biosystems (ABI) fluorescent traces, our goal is to train a neural network to determine correctly the consensus base for the column. Choosing an appropriate network input representation is critical to success in this task. We empirically compare five representations; one uses only base calls and the others include trace information. RESULTS We attained the most accurate results from networks that incorporate trace information into their input representations. Based on estimates derived from using 10-fold cross-validation, the best network topology produces consensus accuracies ranging from 99.26% to >99.98% for coverages from two to six aligned sequences. With a coverage of six, it makes only three errors in 20 000 consensus calls. In contrast, the network that only uses base calls in its input representation has over double that error rate: eight errors in 20 000 consensus calls. CONTACT allex@cs.wisc.edu
Collapse
|
80
|
Chuang SE, Blattner FR. Identification of phosphate-regulated genes by differential expression in the UV-irradiated host system. J Microbiol Methods 1999; 37:93-6. [PMID: 10395468 DOI: 10.1016/s0167-7012(99)00050-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The UV-irradiated host system has been used for identifying protein products of genes cloned in a phage vector. By starving the host cells for phosphate immediately before UV-irradiation, we demonstrate that phosphate-regulated genes can be easily identified. By employing this new technique, we also provide evidence showing that the gpsA gene might be a new member of the phosphate starvation-inducible (psi) genes of E. coli.
Collapse
|
81
|
Friesen N, Pollner S, Bachmann K, Blattner FR. RAPDs and noncoding chloroplast DNA reveal a single origin of the cultivated Allium fistulosum from A. altaicum (Alliaceae). AMERICAN JOURNAL OF BOTANY 1999; 86:554-562. [PMID: 10205076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The origin of the crop species Allium fistulosum (bunching onion) and its relation to its wild relative A. altaicum were surveyed with a restriction fragment length polymorphism (RFLP) analysis of five noncoding cpDNA regions and with a random amplified polymorhic DNA (RAPD) analysis of nuclear DNA. Sixteen accessions of A. altaicum, 14 accessions of A. fistulosum, representing the morphological variability of the species, and five additional outgroup species from Allium section Cepa were included in this study. The RFLP analysis detected 14 phylogenetically informative character transformations, whereas RAPD revealed 126 polymorphic fragments. Generalized parsimony, neighbor-joining analysis of genetic distances, and a principal co-ordinate analysis were able to distinguish the two species, but only RAPD data allowed clarification of the interrelationship of the two taxa. The main results of this investigation were: (1) A. fistulosum is of monophyletic origin, and (2) A. fistulosum originated from an A. altaicum progenitor, making A. altaicum a paraphyletic species. Compared with A. altaicum the cultivated accessions of the bunching onion show less genetic variability, a phenomenon that often occurs in crop species due to the severe genetic bottleneck of domestication. Allium altaicum and A. fistulosum easily hybridize when grown together, and most garden-grown material is of recent hybrid origin.
Collapse
|
82
|
Plunkett G, Rose DJ, Durfee TJ, Blattner FR. Sequence of Shiga toxin 2 phage 933W from Escherichia coli O157:H7: Shiga toxin as a phage late-gene product. J Bacteriol 1999; 181:1767-78. [PMID: 10074068 PMCID: PMC93574 DOI: 10.1128/jb.181.6.1767-1778.1999] [Citation(s) in RCA: 305] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/1998] [Accepted: 01/06/1999] [Indexed: 11/20/2022] Open
Abstract
Lysogenic bacteriophages are major vehicles for the transfer of genetic information between bacteria, including pathogenicity and/or virulence determinants. In the enteric pathogen Escherichia coli O157:H7, which causes hemorrhagic colitis and hemolytic-uremic syndrome, Shiga toxins 1 and 2 (Stx1 and Stx2) are phage encoded. The sequence and analysis of the Stx2 phage 933W is presented here. We find evidence that the toxin genes are part of a late-phage transcript, suggesting that toxin production may be coupled with, if not dependent upon, phage release during lytic growth. Another phage gene, stk, encodes a product resembling eukaryotic serine/threonine protein kinases. Based on its position in the sequence, Stk may be produced by the prophage in the lysogenic state, and, like the YpkA protein of Yersinia species, it may interfere with the signal transduction pathway of the mammalian host. Three novel tRNA genes present in the phage genome may serve to increase the availability of rare tRNA species associated with efficient expression of pathogenicity determinants: both the Shiga toxin and serine/threonine kinase genes contain rare isoleucine and arginine codons. 933W also has homology to lom, encoding a member of a family of outer membrane proteins associated with virulence by conferring the ability to survive in macrophages, and bor, implicated in serum resistance.
Collapse
MESH Headings
- Attachment Sites, Microbiological/genetics
- Bacterial Toxins/genetics
- Base Sequence
- Coliphages/genetics
- Coliphages/ultrastructure
- DNA, Viral/genetics
- Escherichia coli O157/genetics
- Escherichia coli O157/pathogenicity
- Escherichia coli O157/virology
- Genes, Bacterial
- Genes, Viral
- Humans
- Microscopy, Electron
- Molecular Sequence Data
- Nucleic Acid Conformation
- Open Reading Frames
- Operator Regions, Genetic
- Promoter Regions, Genetic
- RNA, Transfer/chemistry
- RNA, Transfer/genetics
- RNA, Viral/chemistry
- RNA, Viral/genetics
- Shiga Toxins
- Terminator Regions, Genetic
- Virulence/genetics
Collapse
|
83
|
Lindler LE, Plano GV, Burland V, Mayhew GF, Blattner FR. Complete DNA sequence and detailed analysis of the Yersinia pestis KIM5 plasmid encoding murine toxin and capsular antigen. Infect Immun 1998; 66:5731-42. [PMID: 9826348 PMCID: PMC108724 DOI: 10.1128/iai.66.12.5731-5742.1998] [Citation(s) in RCA: 115] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Yersinia pestis, the causative agent of plague, harbors at least three plasmids necessary for full virulence of the organism, two of which are species specific. One of the Y. pestis-specific plasmids, pMT1, is thought to promote deep tissue invasion, resulting in more acute onset of symptoms and death. We determined the entire nucleotide sequence of Y. pestis KIM5 pMT1 and identified potential open reading frames (ORFs) encoded by the 100,990-bp molecule. Based on codon usage for known yersinial genes, homology with known proteins in the databases, and potential ribosome binding sites, we determined that 115 of the potential ORFs which we considered could encode polypeptides in Y. pestis. Five of these ORFs were genes previously identified as being necessary for production of the classic virulence factors, murine toxin (MT), and the fraction 1 (F1) capsule antigen. The regions of pMT1 encoding MT and F1 were surrounded by remnants of multiple transposition events and bacteriophage, respectively, suggesting horizontal gene transfer of these virulence factors. We identified seven new potential virulence factors that might interact with the mammalian host or flea vector. Forty-three of the remaining 115 putative ORFs did not display any significant homology with proteins in the current databases. Furthermore, DNA sequence analysis allowed the determination of the putative replication and partitioning regions of pMT1. We identified a single 2,450-bp region within pMT1 that could function as the origin of replication, including a RepA-like protein similar to RepFIB, RepHI1B, and P1 and P7 replicons. Plasmid partitioning function was located ca. 36 kb from the putative origin of replication and was most similar to the parABS bacteriophage P1 and P7 system. Y. pestis pMT1 encoded potential genes with a high degree of similarity to a wide variety of organisms, plasmids, and bacteriophage. Accordingly, our analysis of the pMT1 DNA sequence emphasized the mosaic nature of this large bacterial virulence plasmid and provided implications as to its evolution.
Collapse
|
84
|
Mahillon J, Kirkpatrick HA, Kijenski HL, Bloch CA, Rode CK, Mayhew GF, Rose DJ, Plunkett G, Burland V, Blattner FR. Subdivision of the Escherichia coli K-12 genome for sequencing: manipulation and DNA sequence of transposable elements introducing unique restriction sites. Gene 1998; 223:47-54. [PMID: 9858680 DOI: 10.1016/s0378-1119(98)00365-5] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
A transposon-based method of introducing unique restriction sites was used for subdivision of the Escherichia coli genome into a contiguous series of large non-overlapping segments spanning 2.5Mb. The segments, sizes ranging from 150 to 250kb, were isolated from the chromosome using the inserted restriction sites and shotgun cloned into an M13 vector for DNA sequencing. These shotgun sizes proved easily manageable, allowing the genomic sequence of E. coli to be completed more efficiently and rapidly than was possible by previously available methods. The 9bp duplication generated during transposition was used as a tag for accurate splicing of the segments; no further sequence redundancy at the junction sites was needed. The system is applicable to larger genomes even if they are not already well-characterized. We present the technology for segment sequencing, results of applying this method to E. coli, and the sequences of the transposon cassettes.
Collapse
|
85
|
Gehring AM, DeMoll E, Fetherston JD, Mori I, Mayhew GF, Blattner FR, Walsh CT, Perry RD. Iron acquisition in plague: modular logic in enzymatic biogenesis of yersiniabactin by Yersinia pestis. CHEMISTRY & BIOLOGY 1998; 5:573-86. [PMID: 9818149 DOI: 10.1016/s1074-5521(98)90115-6] [Citation(s) in RCA: 186] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
BACKGROUND Virulence in the pathogenic bacterium Yersinia pestis, causative agent of bubonic plague, has been correlated with the biosynthesis and transport of an iron-chelating siderophore, yersiniabactin, which is induced under iron-starvation conditions. Initial DNA sequencing suggested that this system is highly conserved among the pathogenic Yersinia. Yersiniabactin contains a phenolic group and three five-membered thiazole heterocycles that serve as iron ligands. RESULTS The entire Y. pestis yersiniabactin region has been sequenced. Sequence analysis of yersiniabactin biosynthetic regions (irp2-ybtE and ybtS) reveals a strategy for siderophore production using a mixed polyketide synthase/nonribosomal peptide synthetase complex formed between HMWP1 and HMWP2 (encoded by irp1 and irp2). The complex contains 16 domains, five of them variants of phosphopantetheine-modified peptidyl carrier protein or acyl carrier protein domains. HMWP1 and HMWP2 also contain methyltransferase and heterocyclization domains. Mutating ybtS revealed that this gene encodes a protein essential for yersiniabactin synthesis. CONCLUSIONS The HMWP1 and HMWP2 domain organization suggests that the yersiniabactin siderophore is assembled in a modular fashion, in which a series of covalent intermediates are passed from the amino terminus of HMWP2 to the carboxyl terminus of HMWP1. Biosynthetic labeling studies indicate that the three yersiniabactin methyl moieties are donated by S-adenosylmethionine and that the linker between the thiazoline and thiazolidine rings is derived from malonyl-CoA. The salicylate moiety is probably synthesized using the aromatic amino-acid biosynthetic pathway, the final step of which converts chorismate to salicylate. YbtS might be necessary for converting chorismate to salicylate.
Collapse
|
86
|
Perry RD, Straley SC, Fetherston JD, Rose DJ, Gregor J, Blattner FR. DNA sequencing and analysis of the low-Ca2+-response plasmid pCD1 of Yersinia pestis KIM5. Infect Immun 1998; 66:4611-23. [PMID: 9746557 PMCID: PMC108568 DOI: 10.1128/iai.66.10.4611-4623.1998] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/1998] [Accepted: 07/10/1998] [Indexed: 01/03/2023] Open
Abstract
The low-Ca2+-response (LCR) plasmid pCD1 of the plague agent Yersinia pestis KIM5 was sequenced and analyzed for its genetic structure. pCD1 (70,509 bp) has an IncFIIA-like replicon and a SopABC-like partition region. We have assigned 60 apparently intact open reading frames (ORFs) that are not contained within transposable elements. Of these, 47 are proven or possible members of the LCR, a major virulence property of human-pathogenic Yersinia spp., that had been identified previously in one or more of Y. pestis or the enteropathogenic yersiniae Yersinia enterocolitica and Yersinia pseudotuberculosis. Of these 47 LCR-related ORFs, 35 constitute a continuous LCR cluster. The other LCR-related ORFs are interspersed among three intact insertion sequence (IS) elements (IS100 and two new IS elements, IS1616 and IS1617) and numerous defective or partial transposable elements. Regional variations in percent GC content and among ORFs encoding effector proteins of the LCR are additional evidence of a complex history for this plasmid. Our analysis suggested the possible addition of a new Syc- and Yop-encoding operon to the LCR-related pCD1 genes and gave no support for the existence of YopL. YadA likely is not expressed, as was the case for Y. pestis EV76, and the gene for the lipoprotein YlpA found in Y. enterocolitica likely is a pseudogene in Y. pestis. The yopM gene is longer than previously thought (by a sequence encoding two leucine-rich repeats), the ORF upstream of ypkA-yopJ is discussed as a potential Syc gene, and a previously undescribed ORF downstream of yopE was identified as being potentially significant. Eight other ORFs not associated with IS elements were identified and deserve future investigation into their functions.
Collapse
|
87
|
Burland V, Shao Y, Perna NT, Plunkett G, Sofia HJ, Blattner FR. The complete DNA sequence and analysis of the large virulence plasmid of Escherichia coli O157:H7. Nucleic Acids Res 1998; 26:4196-204. [PMID: 9722640 PMCID: PMC147824 DOI: 10.1093/nar/26.18.4196] [Citation(s) in RCA: 235] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The complete DNA sequence of pO157, the large virulence plasmid of EHEC strain O157:H7 EDL 933, is presented. The 92 kb F-like plasmid is composed of segments of putative virulence genes in a framework of replication and maintenance regions, with seven insertion sequence elements, located mostly at the boundaries of the virulence segments. One hundred open reading frames (ORFs) were identified, of which 19 were previously sequenced potential virulence genes. Forty-two ORFs were sufficiently similar to known proteins for suggested functions to be assigned, and 22 had no convincing similarity with any known proteins. Of the newly identified genes, an unusually large ORF of 3169 amino acids has a putative cytotoxin active site shared with the large clostridial toxin (LCT) family and proteins such as ToxA and B of Clostridium difficile . A conserved motif was detected that links the large ORF and the LCT proteins with the OCH1 family of glycosyltransferases. In the complete sequence, the mosaic form can be observed at the levels of base composition, codon usage and gene organization. Insights were obtained from patterns of DNA composition as well as the pathogenic and 'housekeeping' gene segments. Evolutionary trees built from shared plasmid maintenance genes show that even these genes have heterogeneous origins.
Collapse
|
88
|
Perna NT, Mayhew GF, Pósfai G, Elliott S, Donnenberg MS, Kaper JB, Blattner FR. Molecular evolution of a pathogenicity island from enterohemorrhagic Escherichia coli O157:H7. Infect Immun 1998; 66:3810-7. [PMID: 9673266 PMCID: PMC108423 DOI: 10.1128/iai.66.8.3810-3817.1998] [Citation(s) in RCA: 307] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/1997] [Accepted: 05/27/1998] [Indexed: 02/08/2023] Open
Abstract
We report the complete 43,359-bp sequence of the locus of enterocyte effacement (LEE) from EDL933, an enterohemorrhagic Escherichia coli O157:H7 serovar originally isolated from contaminated hamburger implicated in an outbreak of hemorrhagic colitis. The locus was isolated from the EDL933 chromosome with a homologous-recombination-driven targeting vector. Recent completion of the LEE sequence from enteropathogenic E. coli (EPEC) E2348/69 afforded the opportunity for a comparative analysis of the entire pathogenicity island. We have identified a total of 54 open reading frames in the EDL933 LEE. Of these, 13 fall within a putative P4 family prophage designated 933L. The prophage is not present in E2348/69 but is found in a closely related EPEC O55:H7 serovar and other O157:H7 isolates. The remaining 41 genes are shared by the two complete LEEs, and we describe the nature and extent of variation among the two strains for each gene. The rate of divergence is heterogeneous along the locus. Most genes show greater than 95% identity between the two strains, but other genes vary more than expected for clonal divergence among E. coli strains. Several of these highly divergent genes encode proteins that are known to be involved in interactions with the host cell. This pattern suggests recombinational divergence coupled with natural selection and has implications for our understanding of the interaction of both pathogens with their host, for the emergence of O157:H7, and for the evolutionary history of pathogens in general.
Collapse
|
89
|
Allex CF, Baldwin SF, Shavlik JW, Blattner FR. Increasing consensus accuracy in DNA fragment assemblies by incorporating fluorescent trace representations. PROCEEDINGS. INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS FOR MOLECULAR BIOLOGY 1997; 5:3-14. [PMID: 9322009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
We present a new method for determining the consensus sequence in DNA fragment assemblies. The new method, Trace-Evidence, directly incorporates aligned ABI trace information into consensus calculations via our previously described representation, Trace-Data Classifications. The new method extracts and sums evidence indicated by the representation to determine consensus calls. Using the Trace-Evidence method results in automatically produced consensus sequences that are more accurate and less ambiguous than those produced with standard majority-voting methods. Additionally, these improvements are achieved with less coverage than required by the standard methods-using Trace-Evidence and a coverage of only three, error rates are as low as those with a coverage of over ten sequences.
Collapse
|
90
|
Blattner FR, Plunkett G, Bloch CA, Perna NT, Burland V, Riley M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, Gregor J, Davis NW, Kirkpatrick HA, Goeden MA, Rose DJ, Mau B, Shao Y. The complete genome sequence of Escherichia coli K-12. Science 1997; 277:1453-62. [PMID: 9278503 DOI: 10.1126/science.277.5331.1453] [Citation(s) in RCA: 5277] [Impact Index Per Article: 195.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.
Collapse
MESH Headings
- Bacterial Proteins/chemistry
- Bacterial Proteins/genetics
- Bacterial Proteins/metabolism
- Bacteriophage lambda/genetics
- Base Composition
- Binding Sites
- Chromosome Mapping
- DNA Replication
- DNA Transposable Elements
- DNA, Bacterial/genetics
- Escherichia coli/genetics
- Genes, Bacterial
- Genome, Bacterial
- Molecular Sequence Data
- Mutation
- Operon
- RNA, Bacterial/genetics
- RNA, Transfer/genetics
- Recombination, Genetic
- Regulatory Sequences, Nucleic Acid
- Repetitive Sequences, Nucleic Acid
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
Collapse
|
91
|
Pósfai G, Koob MD, Kirkpatrick HA, Blattner FR. Versatile insertion plasmids for targeted genome manipulations in bacteria: isolation, deletion, and rescue of the pathogenicity island LEE of the Escherichia coli O157:H7 genome. J Bacteriol 1997; 179:4426-8. [PMID: 9209066 PMCID: PMC179272 DOI: 10.1128/jb.179.13.4426-4428.1997] [Citation(s) in RCA: 169] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
A system of versatile insertion plasmids was constructed that permits efficient delivery of the target sites of an ultra-rare-cutting endonuclease and the recombinase FLP into preselected sites of the bacterial genome. With the help of this system, the pathogenicity island LEE of the Escherichia coli O157:H7 genome was excised and isolated in vitro, deleted in vivo, rescued as a plasmid, and transferred into another strain.
Collapse
|
92
|
Koop BF, Richards JE, Durfee TD, Bansberg J, Wells J, Gilliam AC, Chen HL, Clausell A, Tucker PW, Blattner FR. Analysis and comparison of the mouse and human immunoglobulin heavy chain JH-Cmu-Cdelta locus. Mol Phylogenet Evol 1996; 5:33-49. [PMID: 8673297 DOI: 10.1006/mpev.1996.0005] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
We report here 23,686 bases of contiguous DNA sequences from the mouse germline immunoglobulin heavy chain (H) constant (C) mu delta region. The sequence spans the joining (JH) regions, the mu constant region (C mu), the delta constant region (C delta) coding regions, a domain relic, the mu switch region (S mu), seven blocks of simple sequence repeats, a large unique sequence inverted repeat, a large unique sequence forward repeat, and all of the intervening material. A comparison of this 23.7-kb region with the corresponding human C mu/C delta region reveals clear homology in the coding and introns of C mu but not in the 5' flanking J gene segments nor in the intergenic and C delta regions. This mixed pattern of similarity between the human and the mouse sequences contrasts with high levels of similarity found in the T-cell receptor C alpha/C delta region and alpha and beta myosin genes and the very low levels found in the gamma-crystallin, XRCC1, and beta-globin gene clusters. The human and mouse comparison further suggests the incorporation of novel sequences into expressed genes of IgD.
Collapse
|
93
|
Burland V, Plunkett G, Sofia HJ, Daniels DL, Blattner FR. Analysis of the Escherichia coli genome VI: DNA sequence of the region from 92.8 through 100 minutes. Nucleic Acids Res 1995; 23:2105-19. [PMID: 7610040 PMCID: PMC306997 DOI: 10.1093/nar/23.12.2105] [Citation(s) in RCA: 117] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The 338.5 kb of the Escherichia coli genome described here together with previously described segments bring the total of contiguous finished sequence of this genome to > 1 Mb. Of 319 open reading frames (ORFs) found in this 338.5 kb segment, 147 (46%) are potential new genes. The positions of several genes which had been previously located here by mapping or partial sequencing have been confirmed. Several ORFs have functions suggested by similarities to other characterised genes but cannot be assigned with certainty. Fifteen of the ORFs of unknown function had been previously sequenced. Eight transfer RNAs are encoded in the region and there are two grey holes in which no features were found. The attachment site for phage P4 and three insertion sequences were located. The region was also analysed for chi sites, bend sites, REP elements and other repeats. A computer search identified potential promoters and tentative transcription units were assigned. The occurrence of the rare tetramer CTAG was analysed in 1.6 Mb of contiguous E.coli sequence. Hypotheses addressing the rarity and distribution of CTAG are discussed.
Collapse
|
94
|
Lake EM, Jiang H, Blattner FR, Adler J. Analogue of aspartate and glutamate active at synapses are attractants for Escherichia coli. Cell Mol Neurobiol 1995; 15:283-8. [PMID: 8590457 DOI: 10.1007/bf02073334] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
1. This research was carried out to compare Escherichia coli bacteria with animals in their response to L-aspartate and L-glutamate and their analogues. 2. Various analogues of aspartate and glutamate known to be neurotransmitters at synapses were shown to be attractants for E. coli. 3. The amino acid sequences of the animal receptors and the bacterial receptor, however, have no detectable relationship. Based on the amino acid sequence, evolutionarily the two systems appear not to be related.
Collapse
|
95
|
Chuang SE, Blattner FR. Ultrafast DNA recovery from agarose by centrifugation through a paper slurry. Biotechniques 1994; 17:634, 636. [PMID: 7833013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
|
96
|
Sofia HJ, Burland V, Daniels DL, Plunkett G, Blattner FR. Analysis of the Escherichia coli genome. V. DNA sequence of the region from 76.0 to 81.5 minutes. Nucleic Acids Res 1994; 22:2576-86. [PMID: 8041620 PMCID: PMC308212 DOI: 10.1093/nar/22.13.2576] [Citation(s) in RCA: 184] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The DNA sequence of a 225.4 kilobase segment of the Escherichia coli K-12 genome is described here, from 76.0 to 81.5 minutes on the genetic map. This brings the total of contiguous sequence from the E.coli genome project to 725.1 kb (76.0 to 92.8 minutes). We found 191 putative coding genes (ORFs) of which 72 genes were previously known, and 110 of which remain unidentified despite literature and similarity searches. Seven new genes--arsE, arsF, arsG, treF, xylR, xylG, and xylH--were identified as well as the previously mapped pit and dctA genes. The arrangement of proposed genes relative to possible promoters and terminators suggests 90 potential transcription units. Other features include 19 REP elements, 95 computer-predicted bends, 50 Chi sites, and one grey hole. Thirty-one putative signal peptides were found, including those of thirteen known membrane or periplasmic proteins. One tRNA gene (proK) and two insertion sequences (IS5 and IS150) are located in this segment. The genes in this region are organized with equal numbers oriented with or against replication.
Collapse
|
97
|
Sukharev SI, Blount P, Martinac B, Blattner FR, Kung C. A large-conductance mechanosensitive channel in E. coli encoded by mscL alone. Nature 1994; 368:265-8. [PMID: 7511799 DOI: 10.1038/368265a0] [Citation(s) in RCA: 535] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
All cellular organisms respond to vibration, touch, gravity or changes in osmolarity, although the molecules on which such mechanosensations depend are unknown. Candidates include certain channels that gate in response to membrane stretch. Patch-clamp experiments with Escherichia coli envelope have revealed a mechanosensitive channel with very large conductance (MscL) and one with a smaller conductance (MscS) which may be important in osmoregulation. Here we have solubilized and fractionated the envelope, reconstituted the MscL activity in vitro, and traced it to a small protein, whose gene, mscL, we then cloned. Insertional disruption of mscL removes the channel activity, whereas re-expression of mscL borne on an expression plasmid restores it. MscL-channel activities were observed in material from a cell-free expression system with mscL as the only template. The mscL nucleotide sequence predicts a unique protein of only 136 amino acids, with a highly hydrophobic core and very different from porins or other known proteins.
Collapse
|
98
|
Chuang SE, Burland V, Plunkett G, Daniels DL, Blattner FR. Sequence analysis of four new heat-shock genes constituting the hslTS/ibpAB and hslVU operons in Escherichia coli. Gene X 1993; 134:1-6. [PMID: 8244018 DOI: 10.1016/0378-1119(93)90167-2] [Citation(s) in RCA: 115] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Sequences of four new heat-shock (HS) genes of Escherichia coli organized into two operons were determined. The operon at 83 min specifies two proteins of 15.8 kDa (HslT) and 16.1 kDa (HslS), which are identical to IbpA and IbpB, respectively. Expression of mRNA from a sigma 32-dependent promoter of the hslTS/ibpAB operon is stimulated 30-75-fold upon temperature upshift. The transcription start point (tsp) is located at a G, 96 bp upstream from the AUG start codon of hslT/ibpA. The deduced amino acid sequences of HslT/IbpA and HslS/IbpB are 48% identical to each other and were found to be remotely related to the chloroplast low-molecular-weight HS protein, which is highly conserved among plants. The second hs operon is much less actively stimulated by temperature upshift, although it has a hs promoter that perfectly matches the consensus of promoters recognized by sigma 32. Located at 88.9 min, the hslVU operon specifies proteins of 19.1 kDa (HslV) and 49.6 kDa (HslU). Multiple tsp were found in this operon. HslV is remotely related to the eukaryotic proteasome proteins, and HslU is very similar to a Pasteurella haemolytica protein of unknown function. Both HslU and the P. haemolytica protein share a ATP/GTP-binding motif near their N-termini. The two operons described here are transcribed counterclockwise on the standard genetic map.
Collapse
|
99
|
Blattner FR, Burland V, Plunkett G, Sofia HJ, Daniels DL. Analysis of the Escherichia coli genome. IV. DNA sequence of the region from 89.2 to 92.8 minutes. Nucleic Acids Res 1993; 21:5408-17. [PMID: 8265357 PMCID: PMC310579 DOI: 10.1093/nar/21.23.5408] [Citation(s) in RCA: 133] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
We present the sequence of 176 kilobases of the Escherichia coli K-12 genome, from katG at 89.2 to an open reading frame (ORF) of unknown function at 92.8 minutes on the genetic map. This brings the total of contiguous sequence from the E. coli genome project to 500 kb (81.5 to 92.8 minutes). This segment contains 134 putative coding genes (ORFs) of which 66 genes were previously identified. Eight new genes--acs, pepE, and nrfB-G--were identified as well as the previously mapped gldA and alr genes. Still, 58 ORFs remain unidentified despite literature and similarity searches. The arrangement of proposed genes relative to possible promoters and terminators suggests 55 potential transcription units. Other features include 13 REP elements, one IRU (ERIC) repeat, 59 computer-predicted bends, 42 Chi sites and one new grey hole. Sixteen signal peptides were found, including those of lamB, btuB, and malE. Two ribosomal RNA loci, rrnB and rrnE, are located in this segment, so we have now sequenced four of the seven E. coli rRNA loci. Comparison of the rRNA loci reveals some differences in the ribosomal structural RNAs which are generally compatible with the proposed secondary structures.
Collapse
|
100
|
Abstract
Most organisms respond to heat by substantial alteration of the pattern of gene expression. This has been particularly well studied with Escherichia coli although the response has by no means been completely characterized. Here we report the characterization of 26 new heat shock genes of E. coli, termed hsl, discovered by global transcription analysis with an overlapping lambda clone bank. We have measured the molecular weights of the corresponding heat shock proteins and mapped each of them to within a few kilobases on the E. coli genome. In vitro, 16 of them can be activated by the E sigma 32 RNA polymerase, which specifically transcribes heat shock genes. In vivo expression kinetics of seven of eight examined new proteins were found to be similar to those of the four most studied heat shock proteins, DnaK, DnaJ, GroEL (MopA), and GroES (MopB). In the course of this work, we confirmed that the catalytic subunit of the ATP-dependent Clp protease (also known as Ti protease), ClpP, is derived from a larger precursor protein. Possible assignments of some of the hsl genes to known proteins are discussed.
Collapse
|