1
|
Abstract
Genetic coding in bacteria largely operates via the "one gene-one protein" paradigm. However, the peculiarities of the mRNA structure, the versatility of the genetic code, and the dynamic nature of translation sometimes allow organisms to deviate from the standard rules of protein encoding. Bacteria can use several unorthodox modes of translation to express more than one protein from a single mRNA cistron. One such alternative path is the use of additional translation initiation sites within the gene. Proteins whose translation is initiated at different start sites within the same reading frame will differ in their N termini but will have identical C-terminal segments. On the other hand, alternative initiation of translation in a register different from the frame dictated by the primary start codon will yield a protein whose sequence is entirely different from the one encoded in the main frame. The use of internal mRNA codons as translation start sites is controlled by the nucleotide sequence and the mRNA folding. The proteins of the alternative proteome generated via the "genes-within-genes" strategy may carry important functions. In this review, we summarize the currently known examples of bacterial genes encoding more than one protein due to the utilization of additional translation start sites and discuss the known or proposed functions of the alternative polypeptides in relation to the main protein product of the gene. We also discuss recent proteome- and genome-wide approaches that will allow the discovery of novel translation initiation sites in a systematic fashion.
Collapse
|
2
|
Tirumalai MR, Rastogi R, Zamani N, O’Bryant Williams E, Allen S, Diouf F, Kwende S, Weinstock GM, Venkateswaran KJ, Fox GE. Candidate genes that may be responsible for the unusual resistances exhibited by Bacillus pumilus SAFR-032 spores. PLoS One 2013; 8:e66012. [PMID: 23799069 PMCID: PMC3682946 DOI: 10.1371/journal.pone.0066012] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Accepted: 04/30/2013] [Indexed: 11/18/2022] Open
Abstract
The spores of several Bacillus species, including Bacillus pumilus SAFR-032 and B. safensis FO-36b, which were isolated from the spacecraft assembly facility at NASA's Jet Propulsion Laboratory, are unusually resistant to UV radiation and hydrogen peroxide. In order to identify candidate genes that might be associated with these resistances, the whole genome of B. pumilus SAFR-032, and the draft genome of B. safensis FO-36b were compared in detail with the very closely related type strain B. pumilus ATCC7061(T). 170 genes are considered characteristic of SAFR-032, because they are absent from both FO-36b and ATCC7061(T). Forty of these SAFR-032 characteristic genes are entirely unique open reading frames. In addition, four genes are unique to the genomes of the resistant SAFR-032 and FO-36b. Fifty three genes involved in spore coat formation, regulation and germination, DNA repair, and peroxide resistance, are missing from all three genomes. The vast majority of these are cleanly deleted from their usual genomic context without any obvious replacement. Several DNA repair and peroxide resistance genes earlier reported to be unique to SAFR-032 are in fact shared with ATCC7061(T) and no longer considered to be promising candidates for association with the elevated resistances. Instead, several SAFR-032 characteristic genes were identified, which along with one or more of the unique SAFR-032 genes may be responsible for the elevated resistances. These new candidates include five genes associated with DNA repair, namely, BPUM_0608 a helicase, BPUM_0652 an ATP binding protein, BPUM_0653 an endonuclease, BPUM_0656 a DNA cytosine-5- methyltransferase, and BPUM_3674 a DNA helicase. Three of these candidate genes are in immediate proximity of two conserved hypothetical proteins, BPUM_0654 and BPUM_0655 that are also absent from both FO-36b and ATCC7061(T). This cluster of five genes is considered to be an especially promising target for future experimental work.
Collapse
Affiliation(s)
- Madhan R. Tirumalai
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Rajat Rastogi
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Nader Zamani
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Elisha O’Bryant Williams
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| | - Shamail Allen
- Department of Biology, Texas Southern University, Houston, Texas, United States of America
| | - Fatma Diouf
- Department of Biology, Texas Southern University, Houston, Texas, United States of America
| | - Sharon Kwende
- Department of Biology, Texas Southern University, Houston, Texas, United States of America
| | - George M. Weinstock
- The Genome Institute, Washington University School of Medicine, St. Louis, Missouri, United States of America
| | - Kasthuri J. Venkateswaran
- Biotechnology & Planetary Protection Group, NASA Jet Propulsion Laboratories, California Institute of Technology, Pasadena, California, United States of America
| | - George E. Fox
- Department of Biology and Biochemistry, University of Houston, Houston, Texas, United States of America
| |
Collapse
|
3
|
Chapter 1 A Phylogenetic View of Bacterial Ribonucleases. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2009; 85:1-41. [DOI: 10.1016/s0079-6603(08)00801-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
4
|
Maillet I, Berndt P, Malo C, Rodriguez S, Brunisholz RA, Pragai Z, Arnold S, Langen H, Wyss M. From the genome sequence to the proteome and back: evaluation of E. coli genome annotation with a 2-D gel-based proteomics approach. Proteomics 2007; 7:1097-106. [PMID: 17366475 DOI: 10.1002/pmic.200600599] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The ambition of systems biology to understand complex biological systems at the molecular level implies that we need to have a concrete and correct understanding of each molecular entity and its function. However, even for the best-studied organism, Escherichia coli, a large number of proteins have never been identified and characterised from wild-type cells, and/or await unravelling of their biological role. Instead, the ORF models for these proteins have been predicted by suitable algorithms and/or through comparison with known, homologous proteins from other organisms, approaches which may be prone to error. In the present study, we used a combination of 2-DE, MALDI-TOF-MS and PMF to identify 1151 different proteins in E. coli K12 JM109. Comparison of the experimental with the theoretical Mr and pI values (4000 experimental values each) allowed the identification of numerous proteins with incorrect or incomplete ORF annotations in the current E. coli genome databases. Several inconsistencies in genome annotation were verified experimentally, and up to 55 candidates await further investigation. Our findings demonstrate how an up-to-date 2-D gel-based proteomics approach can be used for improving the annotation of prokaryotic genomes. They also highlight the need for harmonization among the different E. coli genome databases.
Collapse
|
5
|
Larsabal E, Danchin A. Genomes are covered with ubiquitous 11 bp periodic patterns, the "class A flexible patterns". BMC Bioinformatics 2005; 6:206. [PMID: 16120222 PMCID: PMC1242344 DOI: 10.1186/1471-2105-6-206] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2005] [Accepted: 08/24/2005] [Indexed: 11/17/2022] Open
Abstract
Background The genomes of prokaryotes and lower eukaryotes display a very strong 11 bp periodic bias in the distribution of their nucleotides. This bias is present throughout a given genome, both in coding and non-coding sequences. Until now this bias remained of unknown origin. Results Using a technique for analysis of auto-correlations based on linear projection, we identified the sequences responsible for the bias. Prokaryotic and lower eukaryotic genomes are covered with ubiquitous patterns that we termed "class A flexible patterns". Each pattern is composed of up to ten conserved nucleotides or dinucleotides distributed into a discontinuous motif. Each occurrence spans a region up to 50 bp in length. They belong to what we named the "flexible pattern" type, in that there is some limited fluctuation in the distances between the nucleotides composing each occurrence of a given pattern. When taken together, these patterns cover up to half of the genome in the majority of prokaryotes. They generate the previously recognized 11 bp periodic bias. Conclusion Judging from the structure of the patterns, we suggest that they may define a dense network of protein interaction sites in chromosomes.
Collapse
Affiliation(s)
- Etienne Larsabal
- Unité de Génétique des Génomes Bactériens, Institut Pasteur, URA CNRS 2171, 28, rue du Docteur Roux, 75724 Paris Cedex 15, France
| | - Antoine Danchin
- Unité de Génétique des Génomes Bactériens, Institut Pasteur, URA CNRS 2171, 28, rue du Docteur Roux, 75724 Paris Cedex 15, France
| |
Collapse
|
6
|
Gallant J, Bonthuis P, Lindsley D. Evidence that the bypassing ribosome travels through the coding gap. Proc Natl Acad Sci U S A 2003; 100:13430-5. [PMID: 14576279 PMCID: PMC263831 DOI: 10.1073/pnas.2233745100] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In translational bypassing, a peptidyl-tRNA::ribosome complex skips over a number of nucleotides in a messenger sequence and resumes protein chain elongation after a "landing site" downstream of the bypassed region. The present experiments demonstrate that the complex "scans" processively through the bypassed region. This conclusion rests on three observations. (i) When two potential "landing sites" are present, the protein sequence of the product shows that virtually all ribosomes land at the first and virtually none at the second. (ii) In such a sequence with two landing sites, the presence of a terminator triplet in phase in the coding region immediately after the first landing site drastically reduces the efficiency of bypassing. (iii) Internally complementary sequences that can form a stable stemloop in the bypassed region significantly reduce the efficiency of bypassing. We analyze bypassing from a given "takeoff" site to "landing sites" at different distances downstream so as to derive estimates of the frequency of ribosome takeoff and of the stability of the bypassing complex.
Collapse
Affiliation(s)
- Jonathan Gallant
- Department of Genome Sciences, University of Washington, P.O. Box 357730, Seattle, WA 98105, USA.
| | | | | |
Collapse
|
7
|
|
8
|
Li YC, Korol AB, Fahima T, Beiles A, Nevo E. Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol Ecol 2002; 11:2453-65. [PMID: 12453231 DOI: 10.1046/j.1365-294x.2002.01643.x] [Citation(s) in RCA: 601] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Microsatellites, or tandem simple sequence repeats (SSR), are abundant across genomes and show high levels of polymorphism. SSR genetic and evolutionary mechanisms remain controversial. Here we attempt to summarize the available data related to SSR distribution in coding and noncoding regions of genomes and SSR functional importance. Numerous lines of evidence demonstrate that SSR genomic distribution is nonrandom. Random expansions or contractions appear to be selected against for at least part of SSR loci, presumably because of their effect on chromatin organization, regulation of gene activity, recombination, DNA replication, cell cycle, mismatch repair system, etc. This review also discusses the role of two putative mutational mechanisms, replication slippage and recombination, and their interaction in SSR variation.
Collapse
Affiliation(s)
- You-Chun Li
- Institute of Evolution, University of Haifa, Haifa 31905, Israel
| | | | | | | | | |
Collapse
|
9
|
Danchin A, Guerdoux-Jamet P, Moszer I, Nitschké P. Mapping the bacterial cell architecture into the chromosome. Philos Trans R Soc Lond B Biol Sci 2000; 355:179-90. [PMID: 10724454 PMCID: PMC1692725 DOI: 10.1098/rstb.2000.0557] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
A genome is not a simple collection of genes. We propose here that it can be viewed as being organized as a 'celluloculus' similar to the homunculus of preformists, but pertaining to the category of programmes (or algorithms) rather than to that of architectures or structures: a significant correlation exists between the distribution of genes along the chromosome and the physical architecture of the cell. We review here data supporting this observation, stressing physical constraints operating on the cell's architecture and dynamics, and their consequences in terms of gene and genome structure. If such a correlation exists, it derives from some selection pressure: simple and general physical principles acting at the level of the cell structure are discussed. As a first case in point we see the piling up of planar modules as a stable, entropy-driven, architectural principle that could be at the root of the coupling between the architecture of the cell and the location of genes at specific places in the chromosome. We propose that the specific organization of certain genes whose products have a general tendency to form easily planar modules is a general motor for architectural organization in the bacterial cell. A second mechanism, operating at the transcription level, is described that could account for the efficient building up of complex structures. As an organizing principle we suggest that exploration by biological polymers of the vast space of possible conformation states is constrained by anchoring points. In particular, we suggest that transcription does not always allow the 5'-end of the transcript to go free and explore the many conformations available, but that, in many cases, it remains linked to the transcribing RNA polymerase complex in such a way that loops of RNA, rather than threads with a free end, explore the surrounding medium. In bacteria, extension of the loops throughout the cytoplasm would therefore be mediated by the de novo synthesis of ribosomes in growing cells. Termination of transcription and mRNA turnover would accordingly be expected to be controlled by sequence features at both the 3'- and 5'-ends of the molecule. These concepts are discussed taking into account in vitro analysis of genome sequences and experimental data about cell compartmentalization, mRNA folding and turnover, as well as known structural features of protein and membrane complexes.
Collapse
Affiliation(s)
- A Danchin
- Regulation de l'Expression Génétique, Institut Pasteur, Paris, France.
| | | | | | | |
Collapse
|
10
|
Moszer I. The complete genome of Bacillus subtilis: from sequence annotation to data management and analysis. FEBS Lett 1998; 430:28-36. [PMID: 9678589 DOI: 10.1016/s0014-5793(98)00620-6] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The completion of the entire 4.2-Mb genome sequence of the gram-positive bacterium Bacillus subtilis has been a milestone for biological studies on this model organism. This paper describes bioinformatics work related to this joint European and Japanese project: methods and strategies for gene annotation and detection of sequencing errors, using an integrated cooperative computer environment (Imagene); construction of a specialized database for data management and a WWW server for data retrieval (SubtiList); DNA sequence analysis, yielding striking results on oligonucleotide bias, repeated sequences, and codon usage, all landmarks of evolutionary events shaping the B. subtilis genome.
Collapse
Affiliation(s)
- I Moszer
- Unité de Régulation de l'Expression Génétique, Institut Pasteur, Paris, France.
| |
Collapse
|