1
|
Kröger M, Wahl R. Compilation of DNA sequences of Escherichia coli K12: description of the interactive databases ECD and ECDC. Nucleic Acids Res 1998; 26:46-9. [PMID: 9399797 PMCID: PMC147217 DOI: 10.1093/nar/26.1.46] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
We have compiled the DNA sequence data for Escherichia coli K12 available from the GenBank and EMBL data libraries and independently from the literature. We provide the most definitive version of the ECD Escherichia coli database now exclusively via the World Wide Web System (http://susi.bio.uni-giessen.de/ecdc.html ). Our database encloses the completed genome sequence recently published by two competing groups and an assembled set of all elder sequences. The organisation of the database allows precise physical location of each individual gene or regulatory region, even taking into consideration discrepancies in nomenclature. The WWW program allows to the user to branch into the original EMBL and SWISS-PROT datafiles. A number of links to other WWW servers dealing with E. coli is provided. A FASTA and BLAST search may be performed online. Besides the WWW format a flat file version may be obtained via ftp. A number of discrepancies between the two systematic sequence determinations and/or the literature have not yet been resolved. However, our database may serve as a reference source for resolution and/or the assignment of strain difference.
Collapse
Affiliation(s)
- M Kröger
- Institut für Mikrobiologie und Molekularbiologie, Fachbereich Biologie, Justus-Liebig-Universität Giessen, Frankfurter Strasse 107, D-35392 Giessen, Germany.
| | | |
Collapse
|
2
|
Kröger M, Wahl R. Compilation of DNA sequences of Escherichia coli K12: description of the interactive databases ECD and ECDC (update 1996). Nucleic Acids Res 1997; 25:39-42. [PMID: 9016501 PMCID: PMC146385 DOI: 10.1093/nar/25.1.39] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
We have compiled the DNA sequence data forEscherichia coliavailable from the GenBank and EMBL data libraries and independently from the literature. We provide the most definitive version of the ECDEscherichia colidatabase now exclusively via the World Wide Web System: http://susi.bio.uni-giessen.de/usr/local/www/ html/ecdc.html . Our database encloses an assembled set of contiguous sequences. Each of these contigs compiles all available sequence information, including those derived from a variety of elder sequences. The organisation of the database allows precise physical location of each individual gene or regulatory region, even taking into consideration discrepancies in nomenclature. The WWW program allows to branch into the original EMBL and SWISSPROT datafiles. A number of links to other WWW servers is provided. A FASTA and BLAST search may be performed online. Besides the WWW format a flat file version may be obtained via ftp. The ftp version may also be obtained from the EMBL data library as part of the CD-ROM issue of the EMBL sequence database, which is released and updated every 3 months. After deletion of all detected overlaps a total of 3 588 706 individual bp has been determined up to the end of September 1996. This corresponds to a total of 77.09% of the entire E.coli chromosome consisting of approximately 4655 kb. About 479 kb (10.3%) are additionally available from Kyoto (Japan). Another 94 kb (2%) are available, but mapping has not been confirmed. Thus the total may have reached 89.4%.
Collapse
Affiliation(s)
- M Kröger
- Institut für Mikrobiologie und Molekularbiologie, Fachbereich Biologie, Justus-Liebig-Universität Giessen, Frankfurter Strasse 107, D-35392 Giessen, Germany.
| | | |
Collapse
|
3
|
Malumbres M, Martín JF. Molecular control mechanisms of lysine and threonine biosynthesis in amino acid-producing corynebacteria: redirecting carbon flow. FEMS Microbiol Lett 1996; 143:103-14. [PMID: 8837462 DOI: 10.1111/j.1574-6968.1996.tb08468.x] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Threonine and lysine are two of the economically most important essential amino acids. They are produced industrially by species of the genera Corynebacterium and Brevibacterium. The branched biosynthetic pathway of these amino acids in corynebacteria is unusual in gene organization and in the control of key enzymatic steps with respect to other microorganisms. This article reviews the molecular control mechanisms of the biosynthetic pathways leading to threonine and lysine in corynebacteria, and their implications in the production of these amino acids. Carbon flux can be redirected at branch points by gene disruption of the competing pathways for lysine or threonine. Removal of bottlenecks has been achieved by amplification of genes which encode feedback resistant aspartokinase and homoserine dehydrogenase (obtained by in vitro directed mutagenesis).
Collapse
Affiliation(s)
- M Malumbres
- Faculty of Biology, University of León, Spain
| | | |
Collapse
|
4
|
Abstract
The scope of the EBI is focused on providing better services to the scientific community. Technological advancements in the hardware area provide EBI with means of producing data much faster than before, and with greater accuracy since there is now a better technical ability to produce more exhaustive searches through larger indices. Hand in hand with the technological developments, research and development work is continuing on better indexing systems and more efficient ways of establishing and maintaining the future databases. The existing links of communication between EBI and the user community are exploited to study the needs of the scientific community, to provide better services, and to enhance the quality of databases by interpreting user feedback and updates. A very important goal is to enhance the awareness of the scientific (and, maybe even more, the nonscientific) public of the importance of the modern field of bioinformatics and to introduce special meetings and courses, in which more specific subjects will be studied in depth. Another aspect of this goal is to help in constructing special bioinformatics programs in university faculties. In such programs, in contrast to the existing layout, students will pursue studies in a combined environment that provides basic training in biology and in computation. Currently, one of the main problems in the field is that scientists are either biologists, who are self-educated in the field of computers and programming, or computer scientists without sufficient knowledge of biology. It is hoped that a combined program will provide a high level of education in both fields of interest at the appropriate ratios. Building an efficient and friendly interface between the EBI and the user community is the basis for any future development. This aim is achieved by using the most modern server systems while continuously researching newer and better systems and interfaces. This task can never be complete without involvement of the user community by providing feedback to any of EBI's services. A better bioinformatics community is a necessity for any future development of the biological research aiming at a better society.
Collapse
Affiliation(s)
- B Shomer
- Europeun Molecular Biology Laboratory Outstation, European Bioinformatics Institute, Hinxton, Combridge, United Kingdom
| | | | | |
Collapse
|
5
|
Kröger M, Wahl R. Compilation of DNA sequences of Escherichia coli K12 (ECD and ECDC; update 1995). Nucleic Acids Res 1996; 24:29-31. [PMID: 8594594 PMCID: PMC145621 DOI: 10.1093/nar/24.1.29] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
We have compiled the DNA sequence data for Escherichia coli available from the GenBank and EMBL data libraries and independently from the literature. Unlike the previous updates of our E.coli databases, we provide the most recent version preferentially via the World Wide Web System (use URL: http://susi.bio.unigiessen.de/usr/local/www++ +/html/ecdc.html). Our database includes an assembled set of contiguous sequences. Each of these contigs compiles all available sequence information, including those derived from a variety of elder sequences. The organization of the database allows one to find the exact physical location of each individual gene or regulatory region, even regarding discrepancies in nomenclature. The WWW program allows access into the original EMBL and SWISSPROT datafiles. A FASTA and BLAST search may be performed online. Besides the WWW format a flat file version may be obtained via ftp. The complete compilation, including a full set of genetic map data and the E.coli protein index, can be obtained in machine readable form from the EMBL data library as a part of the CD-ROM issue of the EMBL sequence database, released and updated every three months. After deletion of all detected overlaps a total of 3 333 878 individual bp was determined by the end of September 1995. This corresponds to a total of 71.71% of the entire E.coli chromosome consisting of about 4720 kbp. About 94 kbp (2%) are available additionally, but have not yet been definitely mapped.
Collapse
Affiliation(s)
- M Kröger
- Institut für Mikrobiologie und Molekularbiologie, Fachbereich Biologie, Justus-Liebig-Universitat Gieben, Germany
| | | |
Collapse
|
6
|
Andersson SG, Kurland CG. Genomic evolution drives the evolution of the translation system. Biochem Cell Biol 1995; 73:775-87. [PMID: 8721994 DOI: 10.1139/o95-086] [Citation(s) in RCA: 68] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Our thesis is that the characteristics of the translational machinery and its organization are selected in part by evolutionary pressure on genomic traits have nothing to do with translation per se. These genomic traits include size, composition, and architecture. To illustrate this point, we draw parallels between the structure of different genomes that have adapted to intracellular niches independently of each other. Our starting point is the general observation that the evolutionary history of organellar and parasitic bacteria have favored bantam genomes. Furthermore, we suggest that the constraints of the reductive mode of genomic evolution account for the divergence of the genetic code in mitochondria and the genetic organization of the translational system observed in parasitic bacteria. In particular, we associate codon reassignments in animal mitochondria with greatly simplified tRNA populations. Likewise, we relate the organization of translational genes in the obligate intracellular parasite Rickettsia prowazekii to the processes supporting the reductive mode of genomic evolution. Such findings provide strong support for the hypothesis that genomes of organelles and of parasitic bacteria have arisen from the much larger genomes of ancestral bacteria that have been reduced by intrachromosomal recombination and deletion events. A consequence of the reductive mode of genomic evolution is that the resulting translation systems may deviate markedly from conventional systems.
Collapse
Affiliation(s)
- S G Andersson
- Department of Molecular Biology, Uppsala University, Sweden
| | | |
Collapse
|
7
|
Wahl R, Kröger M. ECDC--a totally integrated and interactively usable genetic map of Escherichia coli K12. Microbiol Res 1995; 150:7-61. [PMID: 7735721 DOI: 10.1016/s0944-5013(11)80034-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
A printed version of the interactively usable genetic map of Escherichia coli K12 is provided together with some statistical information about the actual status of the respective genome sequencing project. A total of 3,179,967 bp corresponding to 68.38% of the genome is available through the ECDC database. Contigs as well as individual DNA sequences for each gene or open reading frame are provided. Access to a number of other databases is possible using World Wide Web or local programs.
Collapse
Affiliation(s)
- R Wahl
- Institut für Mikrobiologie und Molekularbiologie, Justus-Liebig-Universität Giessen, Germany
| | | |
Collapse
|
8
|
Sugita M, Sugita C, Sugiura M. Structure and expression of the gene encoding ribosomal protein S1 from the cyanobacterium Synechococcus sp. strain PCC 6301: striking sequence similarity to the chloroplast ribosomal protein CS1. MOLECULAR & GENERAL GENETICS : MGG 1995; 246:142-7. [PMID: 7862084 DOI: 10.1007/bf00294676] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
We isolated a 38 kDa ssDNA-binding protein from the unicellular cyanobacterium Synechococcus sp. strain PCC 6301 and determined its N-terminal amino acid sequence. A genomic clone encoding the 38 kDa protein was isolated by using a degenerate oligonucleotide probe based on the amino acid sequence. The nucleotide sequence and predicted amino acid sequence revealed that the 38 kDa protein is 306 amino acids long and homologous to the nuclear-encoded 370 amino acid chloroplast ribosomal protein CS1 of spinach (48% identity), therefore identifying it as ribosomal protein (r-protein) S1. Cyanobacterial and chloroplast S1 proteins differ in size from Escherichia coli r-protein S1 (557 amino acids). This provides an additional evidence that cyanobacteria are closely related to chloroplasts. The Synechococcus gene rps1 encoding S1 is located 1.1 kb downstream from psbB, which encodes the photosystem II P680 chlorophyll a apoprotein. An open reading frame encoding a potential protein of 168 amino acids is present between psbB and rps1 and its deduced amino acid sequence is similar to that of E. coli hypothetical 17.2 kDa protein. Northern blot analysis showed that rps1 is transcribed as a monocistronic mRNA.
Collapse
Affiliation(s)
- M Sugita
- Center for Gene Research, Nagoya University, Japan
| | | | | |
Collapse
|
9
|
Wahl R, Rice P, Rice CM, Kröger M. ECD--a totally integrated database of Escherichia coli K12. Nucleic Acids Res 1994; 22:3450-5. [PMID: 7937044 PMCID: PMC308300 DOI: 10.1093/nar/22.17.3450] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
We have compiled the DNA sequence data for E. coli available from the GENBANK and EMBL data libraries and independently from the literature. Starting with this update of our Escherichia coli database (ECD release 20) we provide major changes compared to previous issues. This update not only represents another substantial increase in sequence information, it also allows now to find the exact physical location of each individual gene or regulatory region, even regarding discrepancies in nomenclature. In order to save space this printed version does not contain the database itself anymore, but we provide several examples. The complete database is publically available in electronic form together with a self explaining application program or as a flat file. The complete compilation including a full set of genetic map data and the E. coli protein index can be obtained in machine readable form from the EMBL data library as a part of the CD-ROM issue of the EMBL sequence database, released and updated every three months. After deletion of all detected overlaps a total of 2,878,364 individual bp is found to be determined till the end of June 1994. This corresponds to a total of 60.98% of the entire E. coli chromosome consisting of about 4,720 kbp. This number may actually be higher by 9161 bp derived from other strains of E. coli.
Collapse
Affiliation(s)
- R Wahl
- Institut für Mikrobiologie und Molekularbiologie, Fachbereich Biologie, Justus-Liebig-Universtät Giessen, Germany
| | | | | | | |
Collapse
|
10
|
Gockel G, Hachtel W, Baier S, Fliss C, Henke M. Genes for components of the chloroplast translational apparatus are conserved in the reduced 73-kb plastid DNA of the nonphotosynthetic euglenoid flagellate Astasia longa. Curr Genet 1994; 26:256-62. [PMID: 7859309 DOI: 10.1007/bf00309557] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The colourless, nonphotosynthetic protist Astasia longa is phylogenetically related to Euglena gracilis. The 73-kb plastid DNA (ptDNA) of A. longa is about half the size of most chloroplast DNAs (cpDNAs). More than 38 kb of the Astasia ptDNA sequence has been determined. No genes for photosynthetic function have been found except for rbcL. Identified genes include rpoB, tufA, and genes coding for three rRNAs, 17 tRNAs, and 13 ribosomal proteins. Not only is the nucleotide sequence of these genes highly conserved between A. longa and E. gracilis, but a number of these genes are clustered in a similar fashion and have introns in the same positions in both species. The results further support the idea that photosynthetic genes normally encoded in cpDNA have been preferentially lost in Astasia, but that the chloroplast genes coding for components of the plastid translational apparatus have been maintained. This apparatus might be needed for the expression of rbcL and also for that of still unidentified nonphotosynthetic genes of Astasia ptDNA.
Collapse
Affiliation(s)
- G Gockel
- Botanisches Institut, Universität Bonn, Germany
| | | | | | | | | |
Collapse
|
11
|
Abstract
The initiation of DNA replication is under differential control in Caulobacter crescentus. Following cell division, only the chromosome in the progeny stalked cell is able to initiate DNA replication, while the chromosome in the progeny swarmer cell does not replicate until later in the cell cycle. We have isolated the dnaA gene in order to determine whether this essential and ubiquitous replication initiation protein also contributes to differential replication control in C. crescentus. Analysis of the cloned C. crescentus dnaA gene has shown that the deduced amino acid sequence can encode a 486-amino-acid protein that is 37% identical to the DnaA protein of Escherichia coli. The gene is located 2 kb from the origin of replication. Primer extension analysis revealed a single transcript originating from a sigma 70-type promoter. Immunoprecipitation of a DnaA'-beta-lactamase fusion protein showed that although expression occurs throughout the cell cycle, there is a doubling in the rate of expression just prior to the initiation of replication.
Collapse
Affiliation(s)
- G Zweiger
- Department of Genetics, Stanford University School of Medicine, California 94305
| | | |
Collapse
|
12
|
Cartwright P, Timms M, Lithgow T, Høj P, Hoogenraad N. An Escherichia coli gene showing a potential ancestral relationship to the genes for the mitochondrial import site proteins ISP42 and MOM38. BIOCHIMICA ET BIOPHYSICA ACTA 1993; 1153:345-7. [PMID: 8274505 DOI: 10.1016/0005-2736(93)90425-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
An ORF (OrfT) of 1911 base pairs, upstream of the hip operon in Escherichia coli at map position 33.82 has been identified. The protein encoded by this sequence is predicted to have a molecular mass of 68,249 Da and the carboxyterminal 276 residues shows 26.8% and 25.4% identity with the import site proteins ISP42 and MOM38 from the mitochondrial outer membrane of Saccharomyces cerevisiae and Neurospora crassa, respectively. These mitochondrial membrane proteins have been shown to be essential components of the protein translocation apparatus in yeast. These similarities raise the possibility that OrfT might represent the bacterial gene from which these eukaryotic genes evolved.
Collapse
Affiliation(s)
- P Cartwright
- Department of Biochemistry, La Trobe University, Bundoora, Victoria, Australia
| | | | | | | | | |
Collapse
|
13
|
Yoshida T, Ueguchi C, Mizuno T. Physical map location of a set of Escherichia coli genes (hde) whose expression is affected by the nucleoid protein H-NS. J Bacteriol 1993; 175:7747-8. [PMID: 8244952 PMCID: PMC206942 DOI: 10.1128/jb.175.23.7747-7748.1993] [Citation(s) in RCA: 22] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Affiliation(s)
- T Yoshida
- Laboratory of Molecular Microbiology, School of Agriculture, Nagoya University, Japan
| | | | | |
Collapse
|
14
|
Oguiza JA, Malumbres M, Eriani G, Pisabarro A, Mateos LM, Martin F, Martín JF. A gene encoding arginyl-tRNA synthetase is located in the upstream region of the lysA gene in Brevibacterium lactofermentum: regulation of argS-lysA cluster expression by arginine. J Bacteriol 1993; 175:7356-62. [PMID: 8226683 PMCID: PMC206880 DOI: 10.1128/jb.175.22.7356-7362.1993] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The Brevibacterium lactofermentum argS gene, which encodes an arginyl-tRNA synthetase, was identified in the upstream region of the lysA gene. The cloned gene was sequenced; it encodes a 550-amino-acid protein with an M(r) of 59,797. The deduced amino acid sequence showed 28% identical and 49% similar residues when compared with the sequence of the Escherichia coli arginyl-tRNA synthetase. The B. lactofermentum enzyme showed the highly conserved motifs of class I aminoacyl-tRNA synthetases. Expression of the argS gene in B. lactofermentum and E. coli resulted in an increase in aminoacyl-tRNA synthetase activity, correlated with the presence in sodium dodecyl sulfate-polyacrylamide gels of a clear protein band that corresponds to this enzyme. One single transcript of about 3,000 nucleotides and corresponding to the B. lactofermentum argS-lysA operon was identified. The transcription of these genes is repressed by lysine and induced by arginine, showing an interesting pattern of biosynthetic interlock between the pathways of both amino acids in corynebacteria.
Collapse
Affiliation(s)
- J A Oguiza
- Area of Microbiology, Faculty of Biology, University of León, Spain
| | | | | | | | | | | | | |
Collapse
|
15
|
Affiliation(s)
- E Seeberg
- Division for Environmental Toxicology, Norwegian Defence Research Establishment, Kjeller
| |
Collapse
|
16
|
Médigue C, Viari A, Hénaut A, Danchin A. Colibri: a functional data base for the Escherichia coli genome. Microbiol Rev 1993; 57:623-54. [PMID: 8246843 PMCID: PMC372929 DOI: 10.1128/mr.57.3.623-654.1993] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Several data libraries have been created to organize all the data obtained worldwide about the Escherichia coli genome. Because the known data now amount to more than 40% of the whole genome sequence, it has become necessary to organize the data in such a way that appropriate procedures can associate knowledge produced by experiments about each gene to its position on the chromosome and its relation to other relevant genes, for example. In addition, global properties of genes, affected by the introduction of new entries, should be present as appropriate description fields. A data base, implemented on Macintosh by using the data base management system 4th Dimension, is described. It is constructed around a core constituted by known contigs of E. coli sequences and links data collected in general libraries (unmodified) to data associated with evolving knowledge (with modifiable fields). Biologically significant results obtained through the coupling of appropriate procedures (learning or statistical data analysis) are presented. The data base is available through a 4th Dimension runtime and through FTP on Internet. It has been regularly updated and will be systematically linked to other E. coli data bases (M. Kroger, R. Wahl, G. Schachtel, and P. Rice, Nucleic Acids Res. 20(Suppl.):2119-2144, 1992; K. E. Rudd, W. Miller, C. Werner, J. Ostell, C. Tolstoshev, and S. G. Satterfield, Nucleic Acids Res. 19:637-647, 1991) in the near future.
Collapse
|
17
|
Cardon LR, Burge C, Schachtel GA, Blaisdell BE, Karlin S. Comparative DNA sequence features in two long Escherichia coli contigs. Nucleic Acids Res 1993; 21:3875-84. [PMID: 8367304 PMCID: PMC309913 DOI: 10.1093/nar/21.16.3875] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
The recent sequencing of two relatively long (approximately 100 kb) contigs of E.coli presents unique opportunities for investigating heterogeneity and genomic organization of the E.coli chromosome. We have evaluated a number of common and contrasting sequence features in the two new contigs with comparisons to all available E.coli sequences (> 1.6 Mb). Our analyses include assessments of: (i) counts and distributions of restriction sites, special oligonucleotides (e.g., Chi sites, Dam and Dcm methylase targets), and other marker arrays; (ii) significant distant and close direct and inverted repeat sequences; (iii) sequence similarities between the long contigs and other E.coli sequences; (iv) characterization and identification of rare and frequent oligonucleotides; (v) compositional biases in short oligonucleotides; and (vi) position-dependent fluctuations in sequence composition. The two contigs reveal a number of distinctive features, including: a cluster of five repeat/dyad elements with very regular spacings resembling a transcription attenuator in one of the contigs; REP elements, ERICs, and other long repeats; distinction of the Chi sequence as the most frequent oligonucleotide; regions of clustering, overdispersion, and regularity of certain restriction sites and short palindromes; and comparative domains of inhomogeneities in the two long contigs. These and other features are discussed in relation to the organization of the E.coli chromosome.
Collapse
Affiliation(s)
- L R Cardon
- Department of Mathematics, Stanford University, CA 94035
| | | | | | | | | |
Collapse
|
18
|
Lilley PE, Stamford NP, Vasudevan SG, Dixon NE. The 92-min region of the Escherichia coli chromosome: location and cloning of the ubiA and alr genes. Gene X 1993; 129:9-16. [PMID: 8335265 DOI: 10.1016/0378-1119(93)90690-5] [Citation(s) in RCA: 28] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
A cosmid (pND320) bearing 42.5 kb of Escherichia coli chromosomal DNA, including the genes between xylE and ssb near minute 92 on the linkage map, was isolated by selection for complementation of a dnaB mutation. Known nucleotide (nt) sequences were used to align restriction maps in this region to the physical map of the chromosome (coordinates 4319.5 to 4362 kb), and to locate precisely and define the orientations of 19 genes. Predicted physical linkage of sequenced genes across unsequenced gaps of defined length was confirmed by the nt sequence analysis of fragments subcloned from pND320. Mutant complementation by plasmids showed that ubiA is located between malM and plsB. A previously sequenced long open reading frame that encodes the C-terminal portion of the E. coli ubiA product (4-hydroxybenzoate polyprenyltransferase, HPTase) shows a high degree of sequence identity with the corresponding segment of yeast HPTase (the COQ2 gene product). Comparison of homologous regions from E. coli and Salmonella typhimurium was used to locate precisely the gene alr that encodes alanine racemase (ARase) between dnaB and tyrB. Subcloning of alr downstream from tandem bacteriophage lambda promoters produced a plasmid that directed high-level overproduction of a soluble approx. 40-kDa protein with ARase activity.
Collapse
Affiliation(s)
- P E Lilley
- Centre for Molecular Structure and Function, Research School of Chemistry, Australian National University, Canberra, ACT
| | | | | | | |
Collapse
|
19
|
Affiliation(s)
- C M Rice
- European Molecular Biology Laboratory, Heidelberg, Germany
| | | | | | | | | |
Collapse
|
20
|
Kröger M, Wahl R, Rice P. Compilation of DNA sequences of Escherichia coli (update 1993). Nucleic Acids Res 1993; 21:2973-3000. [PMID: 8332520 PMCID: PMC309723 DOI: 10.1093/nar/21.13.2973] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
We have compiled the DNA sequence data for E. coli available from the GENBANK and EMBL data libraries and over a period of several years independently from the literature. This is the fifth listing replacing and increasing the former listings substantially. However, in order to save space this printed version contains DNA sequence information only, if they are publically available in electronic form. The complete compilation including a full set of genetic map data and the E. coli protein index can be obtained in machine readable form from the EMBL data library (ECD release 15) as a part of the CD-ROM issue of the EMBL sequence database, released and updated every three months. After deletion of all detected overlaps a total of 2,353,635 individual bp is found to be determined till the end of April 1993. This corresponds to a total of 49.87% of the entire E. coli chromosome consisting of about 4,720 kbp. This number may actually be higher by 9161 bp derived from other strains of E. coli.
Collapse
Affiliation(s)
- M Kröger
- Institut für Mikrobiologie und Molekularbiologie, Justus-Liebig-Universität Giessen, Germany
| | | | | |
Collapse
|
21
|
Kasai H, Isono S, Kitakawa M, Mineno J, Akiyama H, Kurnit DM, Berg DE, Isono K. Efficient large-scale sequencing of the Escherichia coli genome: implementation of a transposon- and PCR-based strategy for the analysis of ordered lambda phage clones. Nucleic Acids Res 1992; 20:6509-15. [PMID: 1336178 PMCID: PMC334565 DOI: 10.1093/nar/20.24.6509] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
We have developed a strategy for efficient sequence analysis of the genome of E. coli K-12 using insertions of a Tn5-derived mini-transposon into overlapping ordered lambda phage clones to provide universal primer-binding sites, and PCR amplification of DNA segments adjacent to the insertions. Transposon-containing clones were selected by blue plaque formation on a dnaBamber lacZamber E. coli strain. Insertion points every 0.5-1 kb were identified by 'analytical PCR' and segments between the transposon inserts and phage arms were amplified by 'preparative PCR' using one biotinylated and one non-biotinylated primer. Single strands of amplified DNA fragments were coupled to Streptoavidin-coated paramagnetic beads (Dynabeads M280) through their biotin tails, purified magnetically, and used as templates for fluorescence-based automatic nucleotide sequencing.
Collapse
Affiliation(s)
- H Kasai
- Postgraduate School, Faculty of Science, Kobe University, Japan
| | | | | | | | | | | | | | | |
Collapse
|