51
|
Grass S, Zidorn C, Blattner FR, Stuppner H. Comparative molecular and phytochemical investigation of Leontodon autumnalis (Asteraceae, Lactuceae) populations from Central Europe. PHYTOCHEMISTRY 2006; 67:122-31. [PMID: 16325215 DOI: 10.1016/j.phytochem.2005.10.019] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2005] [Revised: 09/28/2005] [Accepted: 10/11/2005] [Indexed: 05/05/2023]
Abstract
Previous analyses of Leontodon autumnalis L. revealed the existence of two chemotypes. In the current study molecular and phytochemical methods were combined to investigate 24 Central European populations of L. autumnalis. The focus of this study was the correlation of molecular and phytochemical characters at the intraspecific level. DNA fingerprint profiles of 183 individuals were obtained by random amplified polymorphic DNA (RAPD) providing 77 molecular markers. Contents of phenolics and sesquiterpenoids of flowering heads and sub-aerial parts were quantified by HPLC-DAD analyses. HPLC results were evaluated by principal component analysis. Geographic distribution of the two detected chemotypes partially overlapped. Phylogenetic groupings displayed in an unrooted neighbor-joining tree calculated from the RAPD data matrix were correlated with the geographical origin of the plant material. However, genetic profiles neither correlated with the two chemotypes nor with the morphologically based subspecies of L. autumnalis recognized by some authors. The presented data imply that the morphotypes are of multiple origins or due to different ecological growing conditions rather than genetically determined and that phytochemical races are induced by a limited number of genetical differences, which might have occurred independently in different lineages of the L. autumnalis group.
Collapse
|
52
|
Blattner FR. Multiple intercontinental dispersals shaped the distribution area of Hordeum (Poaceae). THE NEW PHYTOLOGIST 2006; 169:603-14. [PMID: 16411962 DOI: 10.1111/j.1469-8137.2005.01610.x] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The grass genus Hordeum (Poaceae, Triticeae), comprising 31 species distributed in temperate and dry regions of the world, was analysed to determine the relative contributions of vicariance and long-distance dispersal to the extant distribution pattern of the genus. Sequences from three nuclear regions (DMC1, EF-G and ITS) were combined and analysed phylogenetically for all diploid (20 species) and two tetraploid Hordeum species and the outgroup Psathyrostachys. Ages of clades within Hordeum were estimated using a penalized likelihood analysis of sequence divergence. The sequence data resulted in an almost fully resolved phylogenetic tree that allowed the reconstruction of intrageneric migration routes. Hordeum evolved c. 12 million years ago in South-west Asia and spread into Europe and Central Asia. The colonization of the New World and South Africa involved at least six intercontinental exchanges during the last 4 million years (twice Eurasia-North America, North America-South America, twice South America-North America and Europe-South Africa). Repeated long-distance dispersal between the northern and southern hemisphere were important colonization mechanisms in Hordeum.
Collapse
|
53
|
Blattner FR. Phylogenetic analysis of Hordeum (Poaceae) as inferred by nuclear rDNA ITS sequences. Mol Phylogenet Evol 2005; 33:289-99. [PMID: 15336664 DOI: 10.1016/j.ympev.2004.05.012] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2003] [Revised: 05/12/2004] [Indexed: 11/17/2022]
Abstract
Hordeum (Poaceae, Triticeae) occurs with 31 species worldwide in temperate regions, with the exception of Australasia. About 50% of the species are polyploids (4x, 6x) or occur as di- and polyploid forms. To analyze the phylogenetic relationships among diploid and polyploid taxa of the genus the nuclear rDNA internal transcribed spacer region (ITS) was analyzed for 91 accessions, representing all Hordeum species, together with 10 outgroup species. PCR products were either directly sequenced (outgroups) or cloned and eight clones per individual were analyzed. Phylogenetic analysis revealed four major clades that concur with the four genome groups in Hordeum (H, I, Xa, and Xu). Allopolyploids, putative autopolyploids, and species groups within the closely related H-genome clade could be identified. The ITS data indicate times of independent evolution of paralogous rDNA clusters on different chromosomes intermitted by sweeps of homogenization among these clusters and bi-directional homogenization of the clusters in diploids. Penalized likelihood analysis revealed an age of about 12 million years (my) for the genus and indicated the start of a rapid radiation in the H-genome group about 2.5 my ago in South America and Asia.
Collapse
|
54
|
Demidov D, Van Damme D, Geelen D, Blattner FR, Houben A. Identification and dynamics of two classes of aurora-like kinases in Arabidopsis and other plants. THE PLANT CELL 2005; 17:836-48. [PMID: 15722465 PMCID: PMC1069702 DOI: 10.1105/tpc.104.029710] [Citation(s) in RCA: 110] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Aurora-like kinases play key roles in chromosome segregation and cytokinesis in yeast, plant, and animal systems. Here, we characterize three Arabidopsis thaliana protein kinases, designated AtAurora1, AtAurora2, and AtAurora3, which share high amino acid identities with the Ser/Thr kinase domain of yeast Ipl1 and animal Auroras. Structure and expression of AtAurora1 and AtAurora2 suggest that these genes arose by a recent gene duplication, whereas the diversification of plant alpha and beta Aurora kinases predates the origin of land plants. The transcripts and proteins of all three kinases are most abundant in tissues containing dividing cells. Intracellular localization of green fluorescent protein-tagged AtAuroras revealed an AtAurora-type specific association mainly with dynamic mitotic structures, such as microtubule spindles and centromeres, and with the emerging cell plate of dividing tobacco (Nicotiana tabacum) BY-2 cells. Immunolabeling using AtAurora antibodies yielded specific signals at the centromeres that are coincident with histone H3 that is phosphorylated at Ser position10 during mitosis. An in vitro kinase assay demonstrated that AtAurora1 preferentially phosphorylates histone H3 at Ser 10 but not at Ser 28 or Thr 3, 11, and 32. The phylogenetic analysis of available Aurora sequences from different eukaryotic origins suggests that, although a plant Aurora gene has been duplicated early in the evolution of plants, the paralogs nevertheless maintained a role in cell cycle-related signal transduction pathways.
Collapse
|
55
|
Jakob SS, Meister A, Blattner FR. The considerable genome size variation of Hordeum species (poaceae) is linked to phylogeny, life form, ecology, and speciation rates. Mol Biol Evol 2004; 21:860-9. [PMID: 15014163 DOI: 10.1093/molbev/msh092] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Genome size variation in plants is thought to be correlated with cytological, physiological, or ecological characters. However, conclusions drawn in several studies were often contradictory. To analyze nuclear genome size evolution in a phylogenetic framework, DNA contents of 134 accessions, representing all but one species of the barley genus Hordeum L., were measured by flow cytometry. The 2C DNA contents were in a range from 6.85 to 10.67 pg in diploids (2n = 14) and reached up to 29.85 pg in hexaploid species (2n = 42). The smallest genomes were found in taxa from the New World, which became secondarily annual, whereas the largest diploid genomes occur in Eurasian annuals. Genome sizes of polyploid taxa equaled mostly the added sizes of their proposed progenitors or were slightly (1% to 5%) smaller. The analysis of ancestral genome sizes on the base of the phylogeny of the genus revealed lineages with decreasing and with increasing genome sizes. Correlations of intraspecific genome size variation with the length of vegetation period were found in H. marinum populations from Western Europe but were not significant within two species from South America. On a higher taxonomical level (i.e., for species groups or the entire genus), environmental correlations were absent. This could mostly be attributed to the superimposition of life-form changes and phylogenetic constraints, which conceal ecogeographical correlations.
Collapse
|
56
|
Stehlik I, Blattner FR. Sex-specific SCAR markers in the dioecious plant Rumex nivalis (Polygonaceae) and implications for the evolution of sex chromosomes. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2004; 108:238-242. [PMID: 13679980 DOI: 10.1007/s00122-003-1425-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/03/2003] [Accepted: 07/04/2003] [Indexed: 05/24/2023]
Abstract
We developed SCAR primers based on isolated and sequenced male-specific fragments as identified in an AFLP analysis of the dioecious plant Rumex nivalis. PCR amplification using these primers on females and males resulted in fragments exclusively present in males. Co-amplification of the nuclear rDNA internal transcribed spacer 2 together with the male-specific fragment was applied as an internal control for successful PCR reactions to avoid false-negative sex scoring. With a length of about 164 bp, the AFLP fragment was of a similar size as the tandemly arranged, repetitive sequences of 180 bp located on the Y chromosomes of Rumex acetosa. The genetic distances between the Y-chromosomal sequences of R. nivalis and R. acetosa, both members of the section Acetosa, were substantial. We found intra-individual divergence among cloned sequences of the male-specific fragment in R. nivalis. The patterns of interspecific and intra-individual sequence variation found are in accordance with proposed modes of the evolution of sex chromosomes. Y chromosomes possibly arose only once in the genus Rumex and consist mainly of heterochromatic DNA. Due to the almost complete absence of selection on them, Y chromosomes are likely to accumulate large numbers of mutations.
Collapse
|
57
|
Shutov AD, Bäumlein H, Blattner FR, Müntz K. Storage and mobilization as antagonistic functional constraints on seed storage globulin evolution. JOURNAL OF EXPERIMENTAL BOTANY 2003; 54:1645-54. [PMID: 12754262 DOI: 10.1093/jxb/erg165] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
When seeds germinate nearly all the proteins are degraded in senescing storage tissue cells. All these proteins act as amino acid reserves which are mobilized to nourish the seedling. Nevertheless, the major amount of the seeds' protein reserve consists of a few enzymatically inactive, abundant, genuine storage proteins. In their metabolism the conflicting processes of biosynthesis, protein turnover and breakdown, are temporally separated. No degradation of correctly formed storage proteins was observed at the time of synthesis and accumulation during seed maturation. Breakdown takes place after a (long) period of rest when seeds germinate and seedlings start growing. At that time genuine storage proteins are no longer synthesized. Genuine storage proteins have evolved structural features permitting controlled temporal patterns of protection and proteolysis. The acquisition of inserted sequence stretches as sites accessible to limited proteolysis played a key role in the evolution of this control system and happened in coevolution of genuine storage proteins with specific proteinases. This can be deduced from the results of current research on the mechanisms of limited and unlimited proteolysis of storage globulins and on storage globulin evolution. The evolved system of controlled structure-function interplay between storage globulins and proteinases is part of a syndrome that, in addition, comprises differential compartmentation and gene expression of storage proteins and proteinases for controlling the total spatial and temporal patterns of globulin storage and mobilization in maturing and germinating seeds.
Collapse
|
58
|
Wei J, Goldberg MB, Burland V, Venkatesan MM, Deng W, Fournier G, Mayhew GF, Plunkett G, Rose DJ, Darling A, Mau B, Perna NT, Payne SM, Runyen-Janecky LJ, Zhou S, Schwartz DC, Blattner FR. Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T. Infect Immun 2003; 71:2775-86. [PMID: 12704152 PMCID: PMC153260 DOI: 10.1128/iai.71.5.2775-2786.2003] [Citation(s) in RCA: 303] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We determined the complete genome sequence of Shigella flexneri serotype 2a strain 2457T (4,599,354 bp). Shigella species cause >1 million deaths per year from dysentery and diarrhea and have a lifestyle that is markedly different from those of closely related bacteria, including Escherichia coli. The genome exhibits the backbone and island mosaic structure of E. coli pathogens, albeit with much less horizontally transferred DNA and lacking 357 genes present in E. coli. The strain is distinctive in its large complement of insertion sequences, with several genomic rearrangements mediated by insertion sequences, 12 cryptic prophages, 372 pseudogenes, and 195 S. flexneri-specific genes. The 2457T genome was also compared with that of a recently sequenced S. flexneri 2a strain, 301. Our data are consistent with Shigella being phylogenetically indistinguishable from E. coli. The S. flexneri-specific regions contain many genes that could encode proteins with roles in virulence. Analysis of these will reveal the genetic basis for aspects of this pathogenic organism's distinctive lifestyle that have yet to be explained.
Collapse
|
59
|
Welch RA, Burland V, Plunkett G, Redford P, Roesch P, Rasko D, Buckles EL, Liou SR, Boutin A, Hackett J, Stroud D, Mayhew GF, Rose DJ, Zhou S, Schwartz DC, Perna NT, Mobley HLT, Donnenberg MS, Blattner FR. Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli. Proc Natl Acad Sci U S A 2002; 99:17020-4. [PMID: 12471157 PMCID: PMC139262 DOI: 10.1073/pnas.252529799] [Citation(s) in RCA: 1026] [Impact Index Per Article: 46.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present the complete genome sequence of uropathogenic Escherichia coli, strain CFT073. A three-way genome comparison of the CFT073, enterohemorrhagic E. coli EDL933, and laboratory strain MG1655 reveals that, amazingly, only 39.2% of their combined (nonredundant) set of proteins actually are common to all three strains. The pathogen genomes are as different from each other as each pathogen is from the benign strain. The difference in disease potential between O157:H7 and CFT073 is reflected in the absence of genes for type III secretion system or phage- and plasmid-encoded toxins found in some classes of diarrheagenic E. coli. The CFT073 genome is particularly rich in genes that encode potential fimbrial adhesins, autotransporters, iron-sequestration systems, and phase-switch recombinases. Striking differences exist between the large pathogenicity islands of CFT073 and two other well-studied uropathogenic E. coli strains, J96 and 536. Comparisons indicate that extraintestinal pathogenic E. coli arose independently from multiple clonal lineages. The different E. coli pathotypes have maintained a remarkable synteny of common, vertically evolved genes, whereas many islands interrupting this common backbone have been acquired by different horizontal transfer events in each strain.
Collapse
|
60
|
Hartung F, Blattner FR, Puchta H. Intron gain and loss in the evolution of the conserved eukaryotic recombination machinery. Nucleic Acids Res 2002; 30:5175-81. [PMID: 12466542 PMCID: PMC137963 DOI: 10.1093/nar/gkf649] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Intron conservation, intron gain or loss and putative intron sliding events were determined for a set of three genes (SPO11, MRE11 and DMC1) involved in basic aspects of recombination in eukaryotes. These are ancient genes and present in nearly all of the major kingdoms. MRE11 is of bacterial origin and can be found in all kingdoms. DMC1 is a specialized homolog of the bacterial RecA protein, whereas the SPO11 gene is of archaebacterial origin. Only unique homologs of SPO11 are found in animals and fungi whereas three distantly related SPO11 copies are present in plant genomes. A comparison of the respective intron positions and phases of all genes was performed, demonstrating that a quarter of the intron positions were perfectly conserved over more than 1 000 000 000 years. Regarding the remaining three quarters of the introns we found insertions to be about three times more frequent than deletions. Aligning the introns of the three different SPO11 homologs of Arabidopsis thaliana we propose a conclusive model of their evolution. We postulate that at least one duplication event occurred shortly after the divergence of plants from animals and fungi and that a respective homolog has been retained in a protist group, the apicomplexa.
Collapse
|
61
|
Stehlik I, Blattner FR, Holderegger R, Bachmann K. Nunatak survival of the high Alpine plant Eritrichium nanum (L.) Gaudin in the central Alps during the ice ages. Mol Ecol 2002; 11:2027-36. [PMID: 12296946 DOI: 10.1046/j.1365-294x.2002.01595.x] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Polymerase chain reaction-restriction fragment length polymorphisms (PCR-RFLPs) and sequence analysis of noncoding regions of chloroplast DNA were used to investigate 37 populations of Eritrichium nanum covering its total distribution area, the European Alps. There was no haplotypic variation within the populations, and most haplotypes were restricted to single sites or to neighbouring populations, suggesting low levels of long distance gene flow via seeds. The present geographical distribution of haplotypes probably reflects an ancient geographical pattern within two regions in the intensely glaciated western and eastern central Alps identified as genetic hotspot areas. These two regions contained seven of the total of 11 haplotypes, including many of the most derived ones. The divergent haplotypes formed closely related groups, which supported a separate evolution of these haplotypes in these two regions and, more importantly, gave strong evidence for the in situ survival of these populations on nunataks within the western and eastern central Alps during Pleistocene glaciation. This result is in concordance with a previous study on E. nanum using nuclear markers. Only one haplotype was common and widespread throughout the distributional range of E. nanum. At the same time, it was the evolutionarily basal-most and all other haplotypes were best described as its descendants. This haplotype is hypothesized to be genetically identical to a Tertiary Alpine colonizing ancestor, whose distribution was secondarily fragmented and infiltrated by derived haplotypes originating through local mutations.
Collapse
|
62
|
Lim A, Dimalanta ET, Potamousis KD, Yen G, Apodoca J, Tao C, Lin J, Qi R, Skiadas J, Ramanathan A, Perna NT, Plunkett G, Burland V, Mau B, Hackett J, Blattner FR, Anantharaman TS, Mishra B, Schwartz DC. Shotgun optical maps of the whole Escherichia coli O157:H7 genome. Genome Res 2001; 11:1584-93. [PMID: 11544203 PMCID: PMC311123 DOI: 10.1101/gr.172101] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2000] [Accepted: 06/04/2001] [Indexed: 11/24/2022]
Abstract
We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies.
Collapse
|
63
|
Newton MA, Kendziorski CM, Richmond CS, Blattner FR, Tsui KW. On differential variability of expression ratios: improving statistical inference about gene expression changes from microarray data. J Comput Biol 2001; 8:37-52. [PMID: 11339905 DOI: 10.1089/106652701300099074] [Citation(s) in RCA: 479] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We consider the problem of inferring fold changes in gene expression from cDNA microarray data. Standard procedures focus on the ratio of measured fluorescent intensities at each spot on the microarray, but to do so is to ignore the fact that the variation of such ratios is not constant. Estimates of gene expression changes are derived within a simple hierarchical model that accounts for measurement error and fluctuations in absolute gene expression levels. Significant gene expression changes are identified by deriving the posterior odds of change within a similar model. The methods are tested via simulation and are applied to a panel of Escherichia coli microarrays.
Collapse
|
64
|
Blattner FR, Weising K, Bänfer G, Maschwitz U, Fiala B. Molecular analysis of phylogenetic relationships among Myrmecophytic macaranga species (Euphorbiaceae). Mol Phylogenet Evol 2001; 19:331-44. [PMID: 11399144 DOI: 10.1006/mpev.2001.0941] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Many species of the paleotropical pioneer tree genus Macaranga Thou. (Euphorbiaceae) live in association with ants. Various types of mutualistic interactions exist, ranging from the attraction of unspecific ant visitors to obligate myrmecophytism. In the latter, nesting space and food bodies are exchanged for protection by highly specific ant partners (mainly species of the myrmicine genus Crematogaster). As a first step toward elucidating the coevolution of ant-plant interactions in the Macaranga-Crematogaster system, we have initiated a molecular investigation of the plant partners' phylogeny. Nuclear ribosomal DNA internal transcribed spacer (ITS) sequences were analyzed for 73 accessions from 47 Macaranga species, representing 17 sections or informally described species groups. Three accessions from the putative sister taxon Mallotus Lour, were included as outgroups. Cladograms of the ITS data revealed Macaranga to be nested within Mallotus. ITS sequences are highly similar within section Pachystemon s.str., suggesting a relatively recent and rapid radiation of obligate myrmecophytes within this section. Forty-three accessions, mainly of ant-inhabited species, were additionally investigated by random amplified polymorphic DNA (RAPD) and microsatellite-primed PCR (MP-PCR) techniques. Phenetic analysis of RAPD and MP-PCR banding profiles generally confirmed the ITS results. Best resolutions for individual clades were obtained when ITS and RAPD/MP-PCR data were combined into a single matrix and analyzed phenetically. The combined analysis suggests multiple (four) rather than a single evolutionary origin of myrmecophytism, at least one reversal from obligate myrmecophytism to nonmyrmecophytism, and one loss of mutualistic specifity.
Collapse
|
65
|
Venkatesan MM, Goldberg MB, Rose DJ, Grotbeck EJ, Burland V, Blattner FR. Complete DNA sequence and analysis of the large virulence plasmid of Shigella flexneri. Infect Immun 2001; 69:3271-85. [PMID: 11292750 PMCID: PMC98286 DOI: 10.1128/iai.69.5.3271-3285.2001] [Citation(s) in RCA: 145] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The complete sequence analysis of the 210-kb Shigella flexneri 5a virulence plasmid was determined. Shigella spp. cause dysentery and diarrhea by invasion and spread through the colonic mucosa. Most of the known Shigella virulence determinants are encoded on a large plasmid that is unique to virulent strains of Shigella and enteroinvasive Escherichia coli; these known genes account for approximately 30 to 35% of the virulence plasmid. In the complete sequence of the virulence plasmid, 286 open reading frames (ORFs) were identified. An astonishing 153 (53%) of these were related to known and putative insertion sequence (IS) elements; no known bacterial plasmid has previously been described with such a high proportion of IS elements. Four new IS elements were identified. Fifty putative proteins show no significant homology to proteins of known function; of these, 18 have a G+C content of less than 40%, typical of known virulence genes on the plasmid. These 18 constitute potentially unknown virulence genes. Two alleles of shet2 and five alleles of ipaH were also identified on the plasmid. Thus, the plasmid sequence suggests a remarkable history of IS-mediated acquisition of DNA across bacterial species. The complete sequence will permit targeted characterization of potential new Shigella virulence determinants.
Collapse
|
66
|
Dong Y, Glasner JD, Blattner FR, Triplett EW. Genomic interspecies microarray hybridization: rapid discovery of three thousand genes in the maize endophyte, Klebsiella pneumoniae 342, by microarray hybridization with Escherichia coli K-12 open reading frames. Appl Environ Microbiol 2001; 67:1911-21. [PMID: 11282649 PMCID: PMC92813 DOI: 10.1128/aem.67.4.1911-1921.2001] [Citation(s) in RCA: 69] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2000] [Accepted: 01/11/2001] [Indexed: 11/20/2022] Open
Abstract
In an effort to efficiently discover genes in the diazotrophic endophyte of maize, Klebsiella pneumoniae 342, DNA from strain 342 was hybridized to a microarray containing 96% (n = 4,098) of the annotated open reading frames from Escherichia coli K-12. Using a criterion of 55% identity or greater, 3,000 (70%) of the E. coli K-12 open reading frames were also found to be present in strain 342. Approximately 24% (n = 1,030) of the E. coli K-12 open reading frames are absent in strain 342. For 1.6% (n = 68) of the open reading frames, the signal was too low to make a determination regarding the presence or absence of the gene. Genes with high identity between the two organisms are those involved in energy metabolism, amino acid metabolism, fatty acid metabolism, cofactor synthesis, cell division, DNA replication, transcription, translation, transport, and regulatory proteins. Functions that were less highly conserved included carbon compound metabolism, membrane proteins, structural proteins, putative transport proteins, cell processes such as adaptation and protection, and central intermediary metabolism. Open reading frames of E. coli K-12 with little or no identity in strain 342 included putative regulatory proteins, putative chaperones, surface structure proteins, mobility proteins, putative enzymes, hypothetical proteins, and proteins of unknown function, as well as genes presumed to have been acquired by lateral transfer from sources such as phage, plasmids, or transposons. The results were in agreement with the physiological properties of the two strains. Whole genome comparisons by genomic interspecies microarray hybridization are shown to rapidly identify thousands of genes in a previously uncharacterized bacterial genome provided that the genome of a close relative has been fully sequenced. This approach will become increasingly more useful as more full genome sequences become available.
Collapse
|
67
|
Perna NT, Plunkett G, Burland V, Mau B, Glasner JD, Rose DJ, Mayhew GF, Evans PS, Gregor J, Kirkpatrick HA, Pósfai G, Hackett J, Klink S, Boutin A, Shao Y, Miller L, Grotbeck EJ, Davis NW, Lim A, Dimalanta ET, Potamousis KD, Apodaca J, Anantharaman TS, Lin J, Yen G, Schwartz DC, Welch RA, Blattner FR. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature 2001; 409:529-33. [PMID: 11206551 DOI: 10.1038/35054089] [Citation(s) in RCA: 1470] [Impact Index Per Article: 63.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The bacterium Escherichia coli O157:H7 is a worldwide threat to public health and has been implicated in many outbreaks of haemorrhagic colitis, some of which included fatalities caused by haemolytic uraemic syndrome. Close to 75,000 cases of O157:H7 infection are now estimated to occur annually in the United States. The severity of disease, the lack of effective treatment and the potential for large-scale outbreaks from contaminated food supplies have propelled intensive research on the pathogenesis and detection of E. coli O157:H7 (ref. 4). Here we have sequenced the genome of E. coli O157:H7 to identify candidate genes responsible for pathogenesis, to develop better methods of strain detection and to advance our understanding of the evolution of E. coli, through comparison with the genome of the non-pathogenic laboratory strain E. coli K-12 (ref. 5). We find that lateral gene transfer is far more extensive than previously anticipated. In fact, 1,387 new genes encoded in strain-specific clusters of diverse sizes were found in O157:H7. These include candidate virulence factors, alternative metabolic capacities, several prophages and other new functions--all of which could be targets for surveillance.
Collapse
|
68
|
Wei Y, Lee JM, Richmond C, Blattner FR, Rafalski JA, LaRossa RA. High-density microarray-mediated gene expression profiling of Escherichia coli. J Bacteriol 2001; 183:545-56. [PMID: 11133948 PMCID: PMC94910 DOI: 10.1128/jb.183.2.545-556.2001] [Citation(s) in RCA: 181] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2000] [Accepted: 10/25/2000] [Indexed: 11/20/2022] Open
Abstract
A nearly complete collection of 4,290 Escherichia coli open reading frames was amplified and arrayed in high density on glass slides. To exploit this reagent, conditions for RNA isolation from E. coli cells, cDNA production with attendant fluorescent dye incorporation, DNA-DNA hybridization, and hybrid quantitation have been established. A brief isopropyl-beta-D-thiogalactopyranoside (IPTG) treatment elevated lacZ, lacY, and lacA transcript content about 30-fold; in contrast, most other transcript titers remained unchanged. Distinct RNA expression patterns between E. coli cultures in the exponential and transitional phases of growth were catalogued, as were differences associated with culturing in minimal and rich media. The relative abundance of each transcript was estimated by using hybridization of a genomic DNA-derived, fluorescently labeled probe as a correction factor. This inventory provided a quantitative view of the steady-state level of each mRNA species. Genes the expression of which was detected by this method were enumerated, and results were compared with the current understanding of E. coli physiology.
Collapse
|
69
|
Selinger DW, Cheung KJ, Mei R, Johansson EM, Richmond CS, Blattner FR, Lockhart DJ, Church GM. RNA expression analysis using a 30 base pair resolution Escherichia coli genome array. Nat Biotechnol 2000; 18:1262-8. [PMID: 11101804 DOI: 10.1038/82367] [Citation(s) in RCA: 286] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
We have developed a high-resolution "genome array" for the study of gene expression and regulation in Escherichia coli. This array contains on average one 25-mer oligonucleotide probe per 30 base pairs over the entire genome, with one every 6 bases for the intergenic regions and every 60 bases for the 4,290 open reading frames (ORFs). Twofold concentration differences can be detected at levels as low as 0.2 messenger RNA (mRNA) copies per cell, and differences can be seen over a dynamic range of three orders of magnitude. In rich medium we detected transcripts for 97% and 87% of the ORFs in stationary and log phases, respectively. We found that 1, 529 transcripts were differentially expressed under these conditions. As expected, genes involved in translation were expressed at higher levels in log phase, whereas many genes known to be involved in the starvation response were expressed at higher levels in stationary phase. Many previously unrecognized growth phase-regulated genes were identified, such as a putative receptor (b0836) and a 30S ribosomal protein subunit (S22), both of which are highly upregulated in stationary phase. Transcription of between 3,000 and 4,000 predicted ORFs was observed from the antisense strand, indicating that most of the genome is transcribed at a detectable level. Examples are also presented for high-resolution array analysis of transcript start and stop sites and RNA secondary structure.
Collapse
|
70
|
Friesen N, Fritsch RM, Pollner S, Blattner FR. Molecular and morphological evidence for an origin of the aberrant genus Milula within himalayan species of Allium (Alliacae). Mol Phylogenet Evol 2000; 17:209-18. [PMID: 11083935 DOI: 10.1006/mpev.2000.0844] [Citation(s) in RCA: 46] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Phylogenetic relationships between Allium and the monotypic Himalayan genus Milula were analyzed using sequences of the nuclear ribosomal DNA internal transcribed spacer (ITS) region and of the intergenic spacers from the chloroplast trnD(GUC)-trnT(GGU) region. Both marker systems unambiguously placed Milula spicata within Allium subgenus Rhizirideum, close to A. cyathophorum. Morphologically, the main difference between Allium and Milula is the conspicuous spicate inflorescence of Milula vs the mostly capitate or umbellate inflorescences in Allium. Anatomical investigations of leaf characters support a close relationship of Milula with A. cyathophorum and A. mairei, whereas root characters are distinctive from other species of section Cyathophora. To maintain Allium as monophyletic, Milula has been included as A. spicatum in Allium subgenus Rhizirideum.
Collapse
MESH Headings
- Allium/classification
- Allium/genetics
- DNA, Chloroplast/chemistry
- DNA, Chloroplast/genetics
- DNA, Plant/chemistry
- DNA, Plant/genetics
- DNA, Ribosomal Spacer/genetics
- Evolution, Molecular
- Liliaceae/anatomy & histology
- Liliaceae/classification
- Liliaceae/genetics
- Molecular Sequence Data
- Phylogeny
- Plant Leaves/anatomy & histology
- Plant Roots/anatomy & histology
- RNA, Ribosomal, 5.8S/genetics
- RNA, Transfer/genetics
- Sequence Analysis, DNA
Collapse
|
71
|
Ffrench-Constant RH, Waterfield N, Burland V, Perna NT, Daborn PJ, Bowen D, Blattner FR. A genomic sample sequence of the entomopathogenic bacterium Photorhabdus luminescens W14: potential implications for virulence. Appl Environ Microbiol 2000; 66:3310-29. [PMID: 10919786 PMCID: PMC92150 DOI: 10.1128/aem.66.8.3310-3329.2000] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Photorhabdus luminescens is a pathogenic bacterium that lives in the guts of insect-pathogenic nematodes. After invasion of an insect host by a nematode, bacteria are released from the nematode gut and help kill the insect, in which both the bacteria and the nematodes subsequently replicate. However, the bacterial virulence factors associated with this "symbiosis of pathogens" remain largely obscure. In order to identify genes encoding potential virulence factors, we performed approximately 2,000 random sequencing reads from a P. luminescens W14 genomic library. We then compared the sequences obtained to sequences in existing gene databases and to the Escherichia coli K-12 genome sequence. Here we describe the different classes of potential virulence factors found. These factors include genes that putatively encode Tc insecticidal toxin complexes, Rtx-like toxins, proteases and lipases, colicin and pyocins, and various antibiotics. They also include a diverse array of secretion (e.g., type III), iron uptake, and lipopolysaccharide production systems. We speculate on the potential functions of each of these gene classes in insect infection and also examine the extent to which the invertebrate pathogen P. luminescens shares potential antivertebrate virulence factors. The implications for understanding both the biology of this insect pathogen and links between the evolution of vertebrate virulence factors and the evolution of invertebrate virulence factors are discussed.
Collapse
|
72
|
Sherburne CK, Lawley TD, Gilmour MW, Blattner FR, Burland V, Grotbeck E, Rose DJ, Taylor DE. The complete DNA sequence and analysis of R27, a large IncHI plasmid from Salmonella typhi that is temperature sensitive for transfer. Nucleic Acids Res 2000; 28:2177-86. [PMID: 10773089 PMCID: PMC105367 DOI: 10.1093/nar/28.10.2177] [Citation(s) in RCA: 126] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/1999] [Revised: 03/20/2000] [Accepted: 03/20/2000] [Indexed: 11/14/2022] Open
Abstract
Salmonella typhi, the causative agent of typhoid fever, annually infects 16 million people and kills 600 000 world wide. Plasmid-encoded multiple drug resistance in S. typhi is always encoded by plasmids of incompatibility group H (IncH). The complete DNA sequence of the large temperature-sensitive conjugative plasmid R27, the prototype for the IncHI1 family of plasmids, has been compiled and analyzed. This 180 kb plasmid contains 210 open reading frames (ORFs), of which 14 have been previously identified and 56 exhibit similarity to other plasmid and prokaryotic ORFs. A number of insertion elements were found, including the full Tn 10 transposon, which carries tetracycline resistance genes. Two transfer regions, Tra1 and Tra2, are present, which are separated by a minimum of 64 kb. Homologs of the DNA-binding proteins TlpA and H-NS that act as temperature-regulated repressors in other systems have been located in R27. Sequence analysis of transfer and replication regions supports a mosaic-like structure for R27. The genes responsible for conjugation and plasmid maintenance have been identified and mechanisms responsible for thermosensitive transfer are discussed.
Collapse
|
73
|
Salgado H, Santos-Zavaleta A, Gama-Castro S, Millán-Zárate D, Blattner FR, Collado-Vides J. RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res 2000; 28:65-7. [PMID: 10592182 PMCID: PMC102478 DOI: 10.1093/nar/28.1.65] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
RegulonDB is a database on transcription regulation and operon organization in Escherichia coli. The current version describes regulatory signals of transcription initiation, promoters, regulatory binding sites of specific regulators, ribosome binding sites and terminators, as well as information on genes clustered in operons. These specific annotations have been gathered from a constant search in the literature, as well as based on computational sequence predictions. The genomic coordinates of all these objects in the E.coli K-12 chromosome are clearly indicated. Every known object has a link to at least one MEDLINE reference. We have also added direct links to recent expression data of E.coli K-12. The version presented here has important modifications both in the structure of the database, as well as in the amount and type of information encoded in the database. RegulonDB can be accessed on the web at URL: http://www.cifn.unam. mx/Computational_Biology/regulondb/
Collapse
|
74
|
Blattner FR. Direct amplification of the entire ITS region from poorly preserved plant material using recombinant PCR. Biotechniques 1999; 27:1180-6. [PMID: 10631497 DOI: 10.2144/99276st04] [Citation(s) in RCA: 214] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Sequences of the internal transcribed spacers (ITS) of the nuclear ribosomal DNA are important molecular markers in phylogenetic analyses. To obtain sequences from herbarium material in which DNA often is severely degraded, the ITS region has to be amplified in two steps. Two methods that reduce bench time and reagents used are described. (i) Separately amplified preparations of subunits ITS-1 and ITS-2 are combined before purification. The presence of two fragments in the sequencing reaction does not impair the quality of sequences. (ii) Newly designed internal primers amplify partly overlapping regions of the two subunits. A combination of these internal primers with the external primers in one PCR allows the amplification of the entire ITS region even when degraded DNAs are used. This recombinant PCR approach, taking into account the +A bases added by several Taq DNA polymerases, will also be useful with other marker regions used in molecular phylogenetics.
Collapse
|
75
|
Pósfai G, Kolisnychenko V, Bereczki Z, Blattner FR. Markerless gene replacement in Escherichia coli stimulated by a double-strand break in the chromosome. Nucleic Acids Res 1999; 27:4409-15. [PMID: 10536150 PMCID: PMC148724 DOI: 10.1093/nar/27.22.4409] [Citation(s) in RCA: 210] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A simple and efficient gene replacement method, based on the recombination and repair activities of the cell, was developed. The method permits the targeted construction of markerless deletions, insertions and point mutations in the Escherichia coli chromosome. A suicide plasmid, carrying the mutant allele and the recognition site of meganuclease I- Sce I, is inserted into the genome by homologous recombination between the mutant and the wild-type (wt) alleles. Resolution of this cointegrate by intramolecular recombination of the allele pair results in either a mutant or a wt chromosome which can be distinguished by allele-specific PCR screening. The resolution process is stimulated by introducing a unique double-strand break (DSB) into the chromosome at the I- Sce I site. Cleavage by the nuclease not only enhances the frequency of resolution by two to three orders of magnitude, but also selects for the resolved products. The DSB-stimulated gene replacement method can be used in recombination-proficient E.coli cells, does not require specific growth conditions, and is potentially applicable in other microorganisms. Use of the method was demonstrated by constructing a 17-bp and a 62-kb deletion in the MG1655 chromosome. Cleavage of the chromosome induces the SOS response but does not lead to an increased mutation rate.
Collapse
|