101
|
Hirabayashi K, Yuda E, Tanaka N, Katayama S, Iwasaki K, Matsumoto T, Kurisu G, Outten FW, Fukuyama K, Takahashi Y, Wada K. Functional Dynamics Revealed by the Structure of the SufBCD Complex, a Novel ATP-binding Cassette (ABC) Protein That Serves as a Scaffold for Iron-Sulfur Cluster Biogenesis. J Biol Chem 2015; 290:29717-31. [PMID: 26472926 DOI: 10.1074/jbc.m115.680934] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Indexed: 12/19/2022] Open
Abstract
ATP-binding cassette (ABC)-type ATPases are chemomechanical engines involved in diverse biological pathways. Recent genomic information reveals that ABC ATPase domains/subunits act not only in ABC transporters and structural maintenance of chromosome proteins, but also in iron-sulfur (Fe-S) cluster biogenesis. A novel type of ABC protein, the SufBCD complex, functions in the biosynthesis of nascent Fe-S clusters in almost all Eubacteria and Archaea, as well as eukaryotic chloroplasts. In this study, we determined the first crystal structure of the Escherichia coli SufBCD complex, which exhibits the common architecture of ABC proteins: two ABC ATPase components (SufC) with function-specific components (SufB-SufD protomers). Biochemical and physiological analyses based on this structure provided critical insights into Fe-S cluster assembly and revealed a dynamic conformational change driven by ABC ATPase activity. We propose a molecular mechanism for the biogenesis of the Fe-S cluster in the SufBCD complex.
Collapse
Affiliation(s)
- Kei Hirabayashi
- From the Department of Biological Sciences, Graduate School of Science, Osaka University, Osaka 560-0043, Japan, the Organization for Promotion of Tenure Track, University of Miyazaki, Miyazaki 889-1692, Japan
| | - Eiki Yuda
- the Division of Life Science, Graduate School of Science and Engineering, Saitama University, Saitama 338-8570, Japan
| | - Naoyuki Tanaka
- the Division of Life Science, Graduate School of Science and Engineering, Saitama University, Saitama 338-8570, Japan
| | - Sumie Katayama
- the Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - Kenji Iwasaki
- the Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | | | - Genji Kurisu
- the Institute for Protein Research, Osaka University, Osaka 565-0871, Japan
| | - F Wayne Outten
- the Department of Chemistry and Biochemistry, University of South Carolina, Columbia, South Carolina 29208, and
| | - Keiichi Fukuyama
- the Division of Applied Chemistry, Graduate School of Engineering Osaka University, Osaka 565-0871, Japan
| | - Yasuhiro Takahashi
- the Division of Life Science, Graduate School of Science and Engineering, Saitama University, Saitama 338-8570, Japan,
| | - Kei Wada
- the Organization for Promotion of Tenure Track, University of Miyazaki, Miyazaki 889-1692, Japan,
| |
Collapse
|
102
|
Thiaville PC, El Yacoubi B, Köhrer C, Thiaville JJ, Deutsch C, Iwata-Reuyl D, Bacusmo JM, Armengaud J, Bessho Y, Wetzel C, Cao X, Limbach PA, RajBhandary UL, de Crécy-Lagard V. Essentiality of threonylcarbamoyladenosine (t(6)A), a universal tRNA modification, in bacteria. Mol Microbiol 2015; 98:1199-221. [PMID: 26337258 DOI: 10.1111/mmi.13209] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/28/2015] [Indexed: 02/06/2023]
Abstract
Threonylcarbamoyladenosine (t(6)A) is a modified nucleoside universally conserved in tRNAs in all three kingdoms of life. The recently discovered genes for t(6)A synthesis, including tsaC and tsaD, are essential in model prokaryotes but not essential in yeast. These genes had been identified as antibacterial targets even before their functions were known. However, the molecular basis for this prokaryotic-specific essentiality has remained a mystery. Here, we show that t(6)A is a strong positive determinant for aminoacylation of tRNA by bacterial-type but not by eukaryotic-type isoleucyl-tRNA synthetases and might also be a determinant for the essential enzyme tRNA(Ile)-lysidine synthetase. We confirm that t(6)A is essential in Escherichia coli and a survey of genome-wide essentiality studies shows that genes for t(6)A synthesis are essential in most prokaryotes. This essentiality phenotype is not universal in Bacteria as t(6)A is dispensable in Deinococcus radiodurans, Thermus thermophilus, Synechocystis PCC6803 and Streptococcus mutans. Proteomic analysis of t(6)A(-) D. radiodurans strains revealed an induction of the proteotoxic stress response and identified genes whose translation is most affected by the absence of t(6)A in tRNAs. Thus, although t(6)A is universally conserved in tRNAs, its role in translation might vary greatly between organisms.
Collapse
Affiliation(s)
- Patrick C Thiaville
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32611, USA.,Genetics and Genomics Graduate Program, University of Florida, Gainesville, FL, 32610, USA.,University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32610, USA.,Institut de Génétique et Microbiologie, Université of Paris-Sud, Orsay, France
| | - Basma El Yacoubi
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32611, USA
| | - Caroline Köhrer
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Jennifer J Thiaville
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32611, USA
| | - Chris Deutsch
- Department of Chemistry, Portland State University, Portland, OR, 97297, USA
| | - Dirk Iwata-Reuyl
- Department of Chemistry, Portland State University, Portland, OR, 97297, USA
| | - Jo Marie Bacusmo
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32611, USA
| | - Jean Armengaud
- CEA, DSV, IBiTec-S, SPI, Li2D, Laboratory 'Innovative technologies for Detection and Diagnostics', Bagnols-sur-Cèze, F-30200, France
| | - Yoshitaka Bessho
- RIKEN SPring-8 Center, 1-1-1 Kouto, Sayo, Hyogo, 679-5148, Japan.,Institute of Physics, Academia Sinica, 128 Sec. 2, Academia Rd., Nankang, Taipei, 11529, Taiwan
| | - Collin Wetzel
- Rieveschl Laboratories for Mass Spectrometry, Department of Chemistry, University of Cincinnati, Cincinnati, Ohio, 45221, USA
| | - Xiaoyu Cao
- Rieveschl Laboratories for Mass Spectrometry, Department of Chemistry, University of Cincinnati, Cincinnati, Ohio, 45221, USA
| | - Patrick A Limbach
- Rieveschl Laboratories for Mass Spectrometry, Department of Chemistry, University of Cincinnati, Cincinnati, Ohio, 45221, USA
| | - Uttam L RajBhandary
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, FL, 32611, USA.,University of Florida Genetics Institute, University of Florida, Gainesville, FL, 32610, USA
| |
Collapse
|
103
|
Kulp AJ, Sun B, Ai T, Manning AJ, Orench-Rivera N, Schmid AK, Kuehn MJ. Genome-Wide Assessment of Outer Membrane Vesicle Production in Escherichia coli. PLoS One 2015; 10:e0139200. [PMID: 26406465 PMCID: PMC4583269 DOI: 10.1371/journal.pone.0139200] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Accepted: 09/10/2015] [Indexed: 12/31/2022] Open
Abstract
The production of outer membrane vesicles by Gram-negative bacteria has been well documented; however, the mechanism behind the biogenesis of these vesicles remains unclear. Here a high-throughput experimental method and systems-scale analysis was conducted to determine vesiculation values for the whole genome knockout library of Escherichia coli mutant strains (Keio collection). The resultant dataset quantitatively recapitulates previously observed phenotypes and implicates nearly 150 new genes in the process of vesiculation. Gene functional and biochemical pathway analyses suggest that mutations that truncate outer membrane structures such as lipopolysaccharide and enterobacterial common antigen lead to hypervesiculation, whereas mutants in oxidative stress response pathways result in lower levels. This study expands and refines the current knowledge regarding the cellular pathways required for outer membrane vesiculation in E. coli.
Collapse
Affiliation(s)
- Adam J. Kulp
- Dept. of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, United States of America
| | - Bo Sun
- Dept. of Biology, Duke University, Durham, NC, 27708, United States of America
| | - Teresa Ai
- Dept. of Biology, Duke University, Durham, NC, 27708, United States of America
| | - Andrew J. Manning
- Dept. of Biochemistry, Duke University Medical Center, Durham, NC, 27710, United States of America
| | - Nichole Orench-Rivera
- Dept. of Biochemistry, Duke University Medical Center, Durham, NC, 27710, United States of America
| | - Amy K. Schmid
- Dept. of Biology, Duke University, Durham, NC, 27708, United States of America
- Duke Center for Systems Biology, Duke University, Durham, NC, 27708, United States of America
- * E-mail:
| | - Meta J. Kuehn
- Dept. of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, United States of America
- Dept. of Biochemistry, Duke University Medical Center, Durham, NC, 27710, United States of America
| |
Collapse
|
104
|
Abstract
The identification of translation initiation sites (TISs) constitutes an important aspect of sequence-based genome analysis. An erroneous TIS annotation can impair the identification of regulatory elements and N-terminal signal peptides, and also may flaw the determination of descent, for any particular gene. We have formulated a reference-free method to score the TIS annotation quality. The method is based on a comparison of the observed and expected distribution of all TISs in a particular genome given prior gene-calling. We have assessed the TIS annotations for all available NCBI RefSeq microbial genomes and found that approximately 87% is of appropriate quality, whereas 13% needs substantial improvement. We have analyzed a number of factors that could affect TIS annotation quality such as GC-content, taxonomy, the fraction of genes with a Shine-Dalgarno sequence and the year of publication. The analysis showed that only the first factor has a clear effect. We have then formulated a straightforward Principle Component Analysis-based TIS identification strategy to self-organize and score potential TISs. The strategy is independent of reference data and a priori calculations. A representative set of 277 genomes was subjected to the analysis and we found a clear increase in TIS annotation quality for the genomes with a low quality score. The PCA-based annotation was also compared with annotation with the current tool of reference, Prodigal. The comparison for the model genome of Escherichia coli K12 showed that both methods supplement each other and that prediction agreement can be used as an indicator of a correct TIS annotation. Importantly, the data suggest that the addition of a PCA-based strategy to a Prodigal prediction can be used to ‘flag’ TIS annotations for re-evaluation and in addition can be used to evaluate a given annotation in case a Prodigal annotation is lacking.
Collapse
|
105
|
Kingston AW, Roussel-Rossin C, Dupont C, Raleigh EA. Novel recA-Independent Horizontal Gene Transfer in Escherichia coli K-12. PLoS One 2015; 10:e0130813. [PMID: 26162088 PMCID: PMC4498929 DOI: 10.1371/journal.pone.0130813] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2015] [Accepted: 05/27/2015] [Indexed: 01/19/2023] Open
Abstract
In bacteria, mechanisms that incorporate DNA into a genome without strand-transfer proteins such as RecA play a major role in generating novelty by horizontal gene transfer. We describe a new illegitimate recombination event in Escherichia coli K-12: RecA-independent homologous replacements, with very large (megabase-length) donor patches replacing recipient DNA. A previously uncharacterized gene (yjiP) increases the frequency of RecA-independent replacement recombination. To show this, we used conjugal DNA transfer, combining a classical conjugation donor, HfrH, with modern genome engineering methods and whole genome sequencing analysis to enable interrogation of genetic dependence of integration mechanisms and characterization of recombination products. As in classical experiments, genomic DNA transfer begins at a unique position in the donor, entering the recipient via conjugation; antibiotic resistance markers are then used to select recombinant progeny. Different configurations of this system were used to compare known mechanisms for stable DNA incorporation, including homologous recombination, F'-plasmid formation, and genome duplication. A genome island of interest known as the immigration control region was specifically replaced in a minority of recombinants, at a frequency of 3 X 10(-12) CFU/recipient per hour.
Collapse
Affiliation(s)
- Anthony W. Kingston
- New England Biolabs, Ipswich, Massachusetts, 01938, United States of America
| | | | - Claire Dupont
- New England Biolabs, Ipswich, Massachusetts, 01938, United States of America
| | - Elisabeth A. Raleigh
- New England Biolabs, Ipswich, Massachusetts, 01938, United States of America
- * E-mail:
| |
Collapse
|
106
|
Stone-dwelling actinobacteria Blastococcus saxobsidens, Modestobacter marinus and Geodermatophilus obscurus proteogenomes. ISME JOURNAL 2015; 10:21-9. [PMID: 26125681 DOI: 10.1038/ismej.2015.108] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2015] [Revised: 05/04/2015] [Accepted: 05/09/2015] [Indexed: 01/27/2023]
Abstract
The Geodermatophilaceae are unique model systems to study the ability to thrive on or within stones and their proteogenomes (referring to the whole protein arsenal encoded by the genome) could provide important insight into their adaptation mechanisms. Here we report the detailed comparative genome analysis of Blastococcus saxobsidens (Bs), Modestobacter marinus (Mm) and Geodermatophilus obscurus (Go) isolated respectively from the interior and the surface of calcarenite stones and from desert sandy soils. The genome-scale analysis of Bs, Mm and Go illustrates how adaptation to these niches can be achieved through various strategies including 'molecular tinkering/opportunism' as shown by the high proportion of lost, duplicated or horizontally transferred genes and ORFans. Using high-throughput discovery proteomics, the three proteomes under unstressed conditions were analyzed, highlighting the most abundant biomarkers and the main protein factors. Proteomic data corroborated previously demonstrated stone-related ecological distribution. For instance, these data showed starvation-inducible, biofilm-related and DNA-protection proteins as signatures of the microbes associated with the interior, surface and outside of stones, respectively.
Collapse
|
107
|
Myers KS, Park DM, Beauchene NA, Kiley PJ. Defining bacterial regulons using ChIP-seq. Methods 2015; 86:80-8. [PMID: 26032817 DOI: 10.1016/j.ymeth.2015.05.022] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Revised: 05/22/2015] [Accepted: 05/23/2015] [Indexed: 11/28/2022] Open
Abstract
Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) is a powerful method that identifies protein-DNA binding sites in vivo. Recent studies have illustrated the value of ChIP-seq in studying transcription factor binding in various bacterial species under a variety of growth conditions. These results show that in addition to identifying binding sites, correlation of ChIP-seq data with expression data can reveal important information about bacterial regulons and regulatory networks. In this chapter, we provide an overview of the current state of knowledge about ChIP-seq methodology in bacteria, from sample preparation to raw data analysis. We also describe visualization and various bioinformatic analyses of processed ChIP-seq data.
Collapse
Affiliation(s)
- Kevin S Myers
- Laboratory of Genetics, University of Wisconsin - Madison, Madison, WI 53706, USA; Great Lakes Bioenergy Research Center, University of Wisconsin - Madison, Madison, WI 53706, USA
| | - Dan M Park
- Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
| | - Nicole A Beauchene
- Department of Biomolecular Chemistry, University of Wisconsin - Madison, Madison, WI 53706, USA
| | - Patricia J Kiley
- Department of Biomolecular Chemistry, University of Wisconsin - Madison, Madison, WI 53706, USA; Great Lakes Bioenergy Research Center, University of Wisconsin - Madison, Madison, WI 53706, USA.
| |
Collapse
|
108
|
Abstract
Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available, and it becomes increasingly difficult to keep up with their development. The purpose of this minireview is to provide a brief survey of current databases that are of interest to microbiologists.
Collapse
|
109
|
Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12. PLoS One 2015; 10:e0127446. [PMID: 26010885 PMCID: PMC4444293 DOI: 10.1371/journal.pone.0127446] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Accepted: 04/14/2015] [Indexed: 11/19/2022] Open
Abstract
We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.
Collapse
|
110
|
Highly reproductive Escherichia coli cells with no specific assignment to the UAG codon. Sci Rep 2015; 5:9699. [PMID: 25982672 PMCID: PMC4434889 DOI: 10.1038/srep09699] [Citation(s) in RCA: 128] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Accepted: 03/17/2015] [Indexed: 12/28/2022] Open
Abstract
Escherichia coli is a widely used host organism for recombinant technology, and the bacterial incorporation of non-natural amino acids promises the efficient synthesis of proteins with novel structures and properties. In the present study, we developed E. coli strains in which the UAG codon was reserved for non-natural amino acids, without compromising the reproductive strength of the host cells. Ninety-five of the 273 UAG stop codons were replaced synonymously in the genome of E. coli BL21(DE3), by exploiting the oligonucleotide-mediated base-mismatch-repair mechanism. This genomic modification allowed the safe elimination of the UAG-recognizing cellular component (RF-1), thus leaving the remaining 178 UAG codons with no specific molecule recognizing them. The resulting strain B-95.ΔA grew as vigorously as BL21(DE3) in rich medium at 25-42°C, and its derivative B-95.ΔAΔfabR was better adapted to low temperatures and minimal media than B-95.ΔA. UAG was reassigned to synthetic amino acids by expressing the specific pairs of UAG-reading tRNA and aminoacyl-tRNA synthetase. Due to the preserved growth vigor, the B-95.ΔA strains showed superior productivities for hirudin molecules sulfonated on a particular tyrosine residue, and the Fab fragments of Herceptin containing multiple azido groups.
Collapse
|
111
|
Kyuma T, Kimura S, Hanada Y, Suzuki T, Sekimizu K, Kaito C. Ribosomal RNA methyltransferases contribute toStaphylococcus aureusvirulence. FEBS J 2015; 282:2570-84. [DOI: 10.1111/febs.13302] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Revised: 04/03/2015] [Accepted: 04/16/2015] [Indexed: 12/13/2022]
Affiliation(s)
- Tatsuhiko Kyuma
- Laboratory of Microbiology; Graduate School of Pharmaceutical Sciences; The University of Tokyo; Japan
| | - Satoshi Kimura
- Department of Chemistry and Biotechnology; Graduate School of Engineering; The University of Tokyo; Japan
| | - Yuichi Hanada
- Laboratory of Microbiology; Graduate School of Pharmaceutical Sciences; The University of Tokyo; Japan
| | - Tsutomu Suzuki
- Department of Chemistry and Biotechnology; Graduate School of Engineering; The University of Tokyo; Japan
| | - Kazuhisa Sekimizu
- Laboratory of Microbiology; Graduate School of Pharmaceutical Sciences; The University of Tokyo; Japan
| | - Chikara Kaito
- Laboratory of Microbiology; Graduate School of Pharmaceutical Sciences; The University of Tokyo; Japan
| |
Collapse
|
112
|
Liang W, Rudd KE, Deutscher MP. A role for REP sequences in regulating translation. Mol Cell 2015; 58:431-9. [PMID: 25891074 DOI: 10.1016/j.molcel.2015.03.019] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2014] [Revised: 01/21/2015] [Accepted: 03/12/2015] [Indexed: 01/07/2023]
Abstract
Repetitive extragenic palindromic (REP) sequences are highly structured elements found downstream of ∼500 genes in Escherichia coli that result in extensive stem-loop structures in their mRNAs. However, their physiological role has remained elusive. Here, we show that REP sequences can downregulate translation, but only if they are within 15 nt of a termination codon; a spacing of 16 nt has no effect, suggesting that the REP element acts to stall ribosome movement. Ribosome stalling leads to cleavage of the mRNA and induction of the trans-translation process. Using nrdAB as a model, we find that its regulation can be partially reversed by overexpression of RNA helicases and can be fully overcome upon UV stress, emphasizing the importance of this regulatory process. Since 50% of REP-associated genes have these elements within the critical 15 nt, these findings identify a regulatory mechanism with the potential to affect translation from a large number of genes.
Collapse
Affiliation(s)
- Wenxing Liang
- Department of Biochemistry and Molecular Biology, Miller School of Medicine, University of Miami, Miami, FL 33101, USA; The Key Laboratory of Integrated Crop Pest Management of Shandong Province, College of Agronomy and Plant Protection, Qingdao Agricultural University, Qingdao 266109, China
| | - Kenneth E Rudd
- Department of Biochemistry and Molecular Biology, Miller School of Medicine, University of Miami, Miami, FL 33101, USA
| | - Murray P Deutscher
- Department of Biochemistry and Molecular Biology, Miller School of Medicine, University of Miami, Miami, FL 33101, USA.
| |
Collapse
|
113
|
General and condition-specific essential functions of Pseudomonas aeruginosa. Proc Natl Acad Sci U S A 2015; 112:5189-94. [PMID: 25848053 DOI: 10.1073/pnas.1422186112] [Citation(s) in RCA: 133] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
The essential functions of a bacterial pathogen reflect the most basic processes required for its viability and growth, and represent potential therapeutic targets. Most screens for essential genes have assayed a single condition--growth in a rich undefined medium--and thus have not distinguished genes that are generally essential from those that are specific to this particular condition. To help define these classes for Pseudomonas aeruginosa, we identified genes required for growth on six different media, including a medium made from cystic fibrosis patient sputum. The analysis used the Tn-seq circle method to achieve high genome coverage and analyzed more than 1,000,000 unique insertion positions (an average of one insertion every 6.0 bp). We identified 352 general and 199 condition-specific essential genes. A subset of assignments was verified in individual strains with regulated expression alleles. The profile of essential genes revealed that, compared with Escherichia coli, P. aeruginosa is highly vulnerable to mutations disrupting central carbon-energy metabolism and reactive oxygen defenses. These vulnerabilities may arise from the stripped-down architecture of the organism's carbohydrate utilization pathways and its reliance on respiration for energy generation. The essential function profile thus provides fundamental insights into P. aeruginosa physiology as well as identifying candidate targets for new antibacterial agents.
Collapse
|
114
|
Abbas MM, Mohie-Eldin MM, EL-Manzalawy Y. Assessing the effects of data selection and representation on the development of reliable E. coli sigma 70 promoter region predictors. PLoS One 2015; 10:e0119721. [PMID: 25803493 PMCID: PMC4372424 DOI: 10.1371/journal.pone.0119721] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Accepted: 01/26/2015] [Indexed: 11/27/2022] Open
Abstract
As the number of sequenced bacterial genomes increases, the need for rapid and reliable tools for the annotation of functional elements (e.g., transcriptional regulatory elements) becomes more desirable. Promoters are the key regulatory elements, which recruit the transcriptional machinery through binding to a variety of regulatory proteins (known as sigma factors). The identification of the promoter regions is very challenging because these regions do not adhere to specific sequence patterns or motifs and are difficult to determine experimentally. Machine learning represents a promising and cost-effective approach for computational identification of prokaryotic promoter regions. However, the quality of the predictors depends on several factors including: i) training data; ii) data representation; iii) classification algorithms; iv) evaluation procedures. In this work, we create several variants of E. coli promoter data sets and utilize them to experimentally examine the effect of these factors on the predictive performance of E. coli σ70 promoter models. Our results suggest that under some combinations of the first three criteria, a prediction model might perform very well on cross-validation experiments while its performance on independent test data is drastically very poor. This emphasizes the importance of evaluating promoter region predictors using independent test data, which corrects for the over-optimistic performance that might be estimated using the cross-validation procedure. Our analysis of the tested models shows that good prediction models often perform well despite how the non-promoter data was obtained. On the other hand, poor prediction models seems to be more sensitive to the choice of non-promoter sequences. Interestingly, the best performing sequence-based classifiers outperform the best performing structure-based classifiers on both cross-validation and independent test performance evaluation experiments. Finally, we propose a meta-predictor method combining two top performing sequence-based and structure-based classifiers and compare its performance with some of the state-of-the-art E. coli σ70 promoter prediction methods.
Collapse
Affiliation(s)
- Mostafa M. Abbas
- KINDI Center for Computing Research, College of Engineering, Qatar University, Doha, Qatar
| | | | - Yasser EL-Manzalawy
- Systems and Computer Engineering, Al-Azhar University, Cairo, Egypt
- College of Information Sciences, Penn State University, University Park, United States of America
| |
Collapse
|
115
|
Early gene expression in Pseudomonas fluorescens exposed to a polymetallic solution. Cell Biol Toxicol 2015; 31:39-81. [DOI: 10.1007/s10565-015-9294-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2014] [Accepted: 02/23/2015] [Indexed: 11/27/2022]
|
116
|
Chou HH, Marx CJ, Sauer U. Transhydrogenase promotes the robustness and evolvability of E. coli deficient in NADPH production. PLoS Genet 2015; 11:e1005007. [PMID: 25715029 PMCID: PMC4340650 DOI: 10.1371/journal.pgen.1005007] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Accepted: 01/14/2015] [Indexed: 11/18/2022] Open
Abstract
Metabolic networks revolve around few metabolites recognized by diverse enzymes and involved in myriad reactions. Though hub metabolites are considered as stepping stones to facilitate the evolutionary expansion of biochemical pathways, changes in their production or consumption often impair cellular physiology through their system-wide connections. How does metabolism endure perturbations brought immediately by pathway modification and restore hub homeostasis in the long run? To address this question we studied laboratory evolution of pathway-engineered Escherichia coli that underproduces the redox cofactor NADPH on glucose. Literature suggests multiple possibilities to restore NADPH homeostasis. Surprisingly, genetic dissection of isolates from our twelve evolved populations revealed merely two solutions: (1) modulating the expression of membrane-bound transhydrogenase (mTH) in every population; (2) simultaneously consuming glucose with acetate, an unfavored byproduct normally excreted during glucose catabolism, in two subpopulations. Notably, mTH displays broad phylogenetic distribution and has also played a predominant role in laboratory evolution of Methylobacterium extorquens deficient in NADPH production. Convergent evolution of two phylogenetically and metabolically distinct species suggests mTH as a conserved buffering mechanism that promotes the robustness and evolvability of metabolism. Moreover, adaptive diversification via evolving dual substrate consumption highlights the flexibility of physiological systems to exploit ecological opportunities.
Collapse
Affiliation(s)
- Hsin-Hung Chou
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| | - Christopher J. Marx
- Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America
| | - Uwe Sauer
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| |
Collapse
|
117
|
Kenyon LJ, Meulia T, Sabree ZL. Habitat visualization and genomic analysis of "Candidatus Pantoea carbekii," the primary symbiont of the brown marmorated stink bug. Genome Biol Evol 2015; 7:620-35. [PMID: 25587021 PMCID: PMC4350177 DOI: 10.1093/gbe/evv006] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Phytophagous pentatomid insects can negatively impact agricultural productivity and the brown marmorated stink bug (Halyomorpha halys) is an emerging invasive pest responsible for damage to many fruit crops and ornamental plants in North America. Many phytophagous stink bugs, including H. halys, harbor gammaproteobacterial symbionts that likely contribute to host development, and characterization of symbiont transmission/acquisition and their contribution to host fitness may offer alternative strategies for managing pest species. “Candidatus Pantoea carbekii” is the primary occupant of gastric ceca lumina flanking the distal midgut of H. halys insects and it is acquired each generation when nymphs feed on maternal extrachorion secretions following hatching. Insects prevented from symbiont uptake exhibit developmental delays and aberrant behaviors. To infer contributions of Ca. P. carbekii to H. halys, the complete genome was sequenced and annotated from a North American H. halys population. Overall, the Ca. P. carbekii genome is nearly one-fourth (1.2 Mb) that of free-living congenerics, and retains genes encoding many functions that are potentially host-supportive. Gene content reflects patterns of gene loss/retention typical of intracellular mutualists of plant-feeding insects. Electron and fluorescence in situ microscopic imaging of H. halys egg surfaces revealed that maternal extrachorion secretions were populated with Ca. P. carbekii cells. The reported findings detail a transgenerational mode of symbiont transmission distinct from that observed for intracellular insect mutualists and illustrate the potential additive functions contributed by the bacterial symbiont to this important agricultural pest.
Collapse
Affiliation(s)
- Laura J Kenyon
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University
| | - Tea Meulia
- Molecular and Cellular Imaging Center, Ohio Agricultural Research and Development Center and the Department of Plant Pathology, The Ohio State University
| | - Zakee L Sabree
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University
| |
Collapse
|
118
|
Tran KT, Maeda T, Sanchez-Torres V, Wood TK. Beneficial knockouts in Escherichia coli for producing hydrogen from glycerol. Appl Microbiol Biotechnol 2015; 99:2573-81. [DOI: 10.1007/s00253-014-6338-7] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2014] [Revised: 12/12/2014] [Accepted: 12/14/2014] [Indexed: 12/28/2022]
|
119
|
De Biase D, Lund PA. The Escherichia coli Acid Stress Response and Its Significance for Pathogenesis. ADVANCES IN APPLIED MICROBIOLOGY 2015; 92:49-88. [PMID: 26003933 DOI: 10.1016/bs.aambs.2015.03.002] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Escherichia coli has a remarkable ability to survive low pH and possesses a number of different genetic systems that enable it to do this. These may be expressed constitutively, typically in stationary phase, or induced by growth under a variety of conditions. The activities of these systems have been implicated in the ability of E. coli to pass the acidic barrier of the stomach and to become established in the gastrointestinal tract, something causing serious infections. However, much of the work characterizing these systems has been done on standard laboratory strains of E. coli and under conditions which do not closely resemble those found in the human gut. Here we review what is known about acid resistance in E. coli as a model laboratory organism and in the context of its lifestyle as an inhabitant-sometimes an unwelcome one-of the human gut.
Collapse
|
120
|
Price DRG, Wilson ACC. A substrate ambiguous enzyme facilitates genome reduction in an intracellular symbiont. BMC Biol 2014; 12:110. [PMID: 25527092 PMCID: PMC4306246 DOI: 10.1186/s12915-014-0110-4] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Accepted: 12/11/2014] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Genome evolution in intracellular microbial symbionts is characterized by gene loss, generating some of the smallest and most gene-poor genomes known. As a result of gene loss these genomes commonly contain metabolic pathways that are fragmented relative to their free-living relatives. The evolutionary retention of fragmented metabolic pathways in the gene-poor genomes of endosymbionts suggests that they are functional. However, it is not always clear how they maintain functionality. To date, the fragmented metabolic pathways of endosymbionts have been shown to maintain functionality through complementation by host genes, complementation by genes of another endosymbiont and complementation by genes in host genomes that have been horizontally acquired from a microbial source that is not the endosymbiont. Here, we demonstrate a fourth mechanism. RESULTS We investigate the evolutionary retention of a fragmented pathway for the essential nutrient pantothenate (vitamin B5) in the pea aphid, Acyrthosiphon pisum endosymbiosis with Buchnera aphidicola. Using quantitative analysis of gene expression we present evidence for complementation of the Buchnera pantothenate biosynthesis pathway by host genes. Further, using complementation assays in an Escherichia coli mutant we demonstrate functional replacement of a pantothenate biosynthesis enzyme, 2-dehydropantoate 2-reductase (E.C. 1.1.1.169), by an endosymbiont gene, ilvC, encoding a substrate ambiguous enzyme. CONCLUSIONS Earlier studies have speculated that missing enzyme steps in fragmented endosymbiont metabolic pathways are completed by adaptable endosymbiont enzymes from other pathways. Here, we experimentally demonstrate completion of a fragmented endosymbiont vitamin biosynthesis pathway by recruitment of a substrate ambiguous enzyme from another pathway. In addition, this work extends host/symbiont metabolic collaboration in the aphid/Buchnera symbiosis from amino acid metabolism to include vitamin biosynthesis.
Collapse
Affiliation(s)
- Daniel R G Price
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA.
| | - Alex C C Wilson
- Department of Biology, University of Miami, Coral Gables, FL, 33146, USA.
| |
Collapse
|
121
|
Galperin MY, Makarova KS, Wolf YI, Koonin EV. Expanded microbial genome coverage and improved protein family annotation in the COG database. Nucleic Acids Res 2014; 43:D261-9. [PMID: 25428365 DOI: 10.1093/nar/gku1223] [Citation(s) in RCA: 1045] [Impact Index Per Article: 104.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Microbial genome sequencing projects produce numerous sequences of deduced proteins, only a small fraction of which have been or will ever be studied experimentally. This leaves sequence analysis as the only feasible way to annotate these proteins and assign to them tentative functions. The Clusters of Orthologous Groups of proteins (COGs) database (http://www.ncbi.nlm.nih.gov/COG/), first created in 1997, has been a popular tool for functional annotation. Its success was largely based on (i) its reliance on complete microbial genomes, which allowed reliable assignment of orthologs and paralogs for most genes; (ii) orthology-based approach, which used the function(s) of the characterized member(s) of the protein family (COG) to assign function(s) to the entire set of carefully identified orthologs and describe the range of potential functions when there were more than one; and (iii) careful manual curation of the annotation of the COGs, aimed at detailed prediction of the biological function(s) for each COG while avoiding annotation errors and overprediction. Here we present an update of the COGs, the first since 2003, and a comprehensive revision of the COG annotations and expansion of the genome coverage to include representative complete genomes from all bacterial and archaeal lineages down to the genus level. This re-analysis of the COGs shows that the original COG assignments had an error rate below 0.5% and allows an assessment of the progress in functional genomics in the past 12 years. During this time, functions of many previously uncharacterized COGs have been elucidated and tentative functional assignments of many COGs have been validated, either by targeted experiments or through the use of high-throughput methods. A particularly important development is the assignment of functions to several widespread, conserved proteins many of which turned out to participate in translation, in particular rRNA maturation and tRNA modification. The new version of the COGs is expected to become an important tool for microbial genomics.
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 2094, USA
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 2094, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 2094, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 2094, USA
| |
Collapse
|
122
|
Kriventseva EV, Tegenfeldt F, Petty TJ, Waterhouse RM, Simão FA, Pozdnyakov IA, Ioannidis P, Zdobnov EM. OrthoDB v8: update of the hierarchical catalog of orthologs and the underlying free software. Nucleic Acids Res 2014; 43:D250-6. [PMID: 25428351 PMCID: PMC4383991 DOI: 10.1093/nar/gku1220] [Citation(s) in RCA: 241] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Orthology, refining the concept of homology, is the cornerstone of evolutionary comparative studies. With the ever-increasing availability of genomic data, inference of orthology has become instrumental for generating hypotheses about gene functions crucial to many studies. This update of the OrthoDB hierarchical catalog of orthologs (http://www.orthodb.org) covers 3027 complete genomes, including the most comprehensive set of 87 arthropods, 61 vertebrates, 227 fungi and 2627 bacteria (sampling the most complete and representative genomes from over 11,000 available). In addition to the most extensive integration of functional annotations from UniProt, InterPro, GO, OMIM, model organism phenotypes and COG functional categories, OrthoDB uniquely provides evolutionary annotations including rates of ortholog sequence divergence, copy-number profiles, sibling groups and gene architectures. We re-designed the entirety of the OrthoDB website from the underlying technology to the user interface, enabling the user to specify species of interest and to select the relevant orthology level by the NCBI taxonomy. The text searches allow use of complex logic with various identifiers of genes, proteins, domains, ontologies or annotation keywords and phrases. Gene copy-number profiles can also be queried. This release comes with the freely available underlying ortholog clustering pipeline (http://www.orthodb.org/software).
Collapse
Affiliation(s)
- Evgenia V Kriventseva
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Fredrik Tegenfeldt
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Tom J Petty
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Robert M Waterhouse
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Felipe A Simão
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Igor A Pozdnyakov
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Panagiotis Ioannidis
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| | - Evgeny M Zdobnov
- Department of Genetic Medicine and Development, University of Geneva Medical School, rue Michel-Servet 1, 1211 Geneva, Switzerland Swiss Institute of Bioinformatics, rue Michel-Servet 1, 1211 Geneva, Switzerland
| |
Collapse
|
123
|
Abstract
Tunable control of protein degradation in bacteria would provide a powerful research tool. We use components of the Mesoplasma florum tmRNA system to create a synthetic degradation system that provides both independent control of the steady-state protein level and inducible degradation of targeted proteins in Escherichia coli. We demonstrate application of this system in synthetic circuit development and control of core bacterial processes and antibacterial targets, and transfer the system to Lactococcus lactis to establish its broad functionality in bacteria. We create a 238-member library of tagged essential proteins in E. coli that can serve as both a research tool to study essential gene function and an applied system for antibiotic discovery. Our synthetic protein degradation system is modular, does not require disruption of host systems, and can be transferred to diverse bacteria with minimal modification.
Collapse
|
124
|
Otsuka Y, Muto A, Takeuchi R, Okada C, Ishikawa M, Nakamura K, Yamamoto N, Dose H, Nakahigashi K, Tanishima S, Suharnan S, Nomura W, Nakayashiki T, Aref WG, Bochner BR, Conway T, Gribskov M, Kihara D, Rudd KE, Tohsato Y, Wanner BL, Mori H. GenoBase: comprehensive resource database of Escherichia coli K-12. Nucleic Acids Res 2014; 43:D606-17. [PMID: 25399415 PMCID: PMC4383962 DOI: 10.1093/nar/gku1164] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Comprehensive experimental resources, such as ORFeome clone libraries and deletion mutant collections, are fundamental tools for elucidation of gene function. Data sets by omics analysis using these resources provide key information for functional analysis, modeling and simulation both in individual and systematic approaches. With the long-term goal of complete understanding of a cell, we have over the past decade created a variety of clone and mutant sets for functional genomics studies of Escherichia coli K-12. We have made these experimental resources freely available to the academic community worldwide. Accordingly, these resources have now been used in numerous investigations of a multitude of cell processes. Quality control is extremely important for evaluating results generated by these resources. Because the annotation has been changed since 2005, which we originally used for the construction, we have updated these genomic resources accordingly. Here, we describe GenoBase (http://ecoli.naist.jp/GB/), which contains key information about comprehensive experimental resources of E. coli K-12, their quality control and several omics data sets generated using these resources.
Collapse
Affiliation(s)
- Yuta Otsuka
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Ai Muto
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Rikiya Takeuchi
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Chihiro Okada
- Mitsubishi Space Software Co., LTD., 5-4-36 Tsukaguchihonnmachi, Amagasaki, Hyougo 661-0001, Japan
| | - Motokazu Ishikawa
- Mitsubishi Space Software Co., LTD., 5-4-36 Tsukaguchihonnmachi, Amagasaki, Hyougo 661-0001, Japan
| | - Koichiro Nakamura
- Mitsubishi Space Software Co., LTD., 5-4-36 Tsukaguchihonnmachi, Amagasaki, Hyougo 661-0001, Japan
| | - Natsuko Yamamoto
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Hitomi Dose
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Kenji Nakahigashi
- Institute of Advanced Biosciences, Keio University, Tsuruoka 997-0017, Japan
| | - Shigeki Tanishima
- Mitsubishi Space Software Co., LTD., 5-4-36 Tsukaguchihonnmachi, Amagasaki, Hyougo 661-0001, Japan
| | - Sivasundaram Suharnan
- Axiohelix, Okinawa Sangyo Shien Center, 502,1831-1, Oroku, Naha-shi, Okinawa 901-0152, Japan
| | - Wataru Nomura
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Toru Nakayashiki
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| | - Walid G Aref
- Department of Computer Science, Purdue University, 305 N. University Street, West Lafayette, IN 47907-2107, USA
| | | | - Tyrrell Conway
- Department of Microbiology and Plant Biology, University of Oklahoma, Norman, OK 73019-0245, USA
| | - Michael Gribskov
- Department of Biological Sciences, Purdue University, 915 W. State Street, West Lafayette, IN 47907-2054, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, 305 N. University Street, West Lafayette, IN 47907-2107, USA Department of Biological Sciences, Purdue University, 915 W. State Street, West Lafayette, IN 47907-2054, USA
| | - Kenneth E Rudd
- Department Biochemistry and Molecular Biology, University of Miami, P.O. Box 016129, Miami, FL 33101-6129, USA
| | - Yukako Tohsato
- Department of Bioinformatics, Ritsumeikan University, 1-1-1 Nojihigashi, Kusatsu, Shiga 525-8577, Japan
| | - Barry L Wanner
- Department of Microbiology and Immunobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Hirotada Mori
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara 630-0101, Japan
| |
Collapse
|
125
|
Abstract
Engineering microbial hosts for the production of fungible fuels requires mitigation of limitations posed on the production capacity. One such limitation arises from the inherent toxicity of solvent-like biofuel compounds to production strains, such as Escherichia coli. Here we show the importance of host engineering for the production of short-chain alcohols by studying the overexpression of genes upregulated in response to exogenous isopentenol. Using systems biology data, we selected 40 genes that were upregulated following isopentenol exposure and subsequently overexpressed them in E. coli. Overexpression of several of these candidates improved tolerance to exogenously added isopentenol. Genes conferring isopentenol tolerance phenotypes belonged to diverse functional groups, such as oxidative stress response (soxS, fpr, and nrdH), general stress response (metR, yqhD, and gidB), heat shock-related response (ibpA), and transport (mdlB). To determine if these genes could also improve isopentenol production, we coexpressed the tolerance-enhancing genes individually with an isopentenol production pathway. Our data show that expression of 6 of the 8 candidates improved the production of isopentenol in E. coli, with the methionine biosynthesis regulator MetR improving the titer for isopentenol production by 55%. Additionally, expression of MdlB, an ABC transporter, facilitated a 12% improvement in isopentenol production. To our knowledge, MdlB is the first example of a transporter that can be used to improve production of a short-chain alcohol and provides a valuable new avenue for host engineering in biogasoline production. The use of microbial host platforms for the production of bulk commodities, such as chemicals and fuels, is now a focus of many biotechnology efforts. Many of these compounds are inherently toxic to the host microbe, which in turn places a limit on production despite efforts to optimize the bioconversion pathways. In order to achieve economically viable production levels, it is also necessary to engineer production strains with improved tolerance to these compounds. We demonstrate that microbial tolerance engineering using transcriptomics data can also identify targets that improve production. Our results include an exporter and a methionine biosynthesis regulator that improve isopentenol production, providing a starting point to further engineer the host for biogasoline production.
Collapse
|
126
|
Mandal A, Mandal S, Park MH. Genome-wide analyses and functional classification of proline repeat-rich proteins: potential role of eIF5A in eukaryotic evolution. PLoS One 2014; 9:e111800. [PMID: 25364902 PMCID: PMC4218817 DOI: 10.1371/journal.pone.0111800] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 10/06/2014] [Indexed: 12/16/2022] Open
Abstract
The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P) does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro) and/or PPG (Pro-Pro-Gly)-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids), only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.
Collapse
Affiliation(s)
- Ajeet Mandal
- Oral and Pharyngeal Cancer Branch, NIDCR, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Swati Mandal
- Oral and Pharyngeal Cancer Branch, NIDCR, National Institutes of Health, Bethesda, Maryland, United States of America
| | - Myung Hee Park
- Oral and Pharyngeal Cancer Branch, NIDCR, National Institutes of Health, Bethesda, Maryland, United States of America
| |
Collapse
|
127
|
Smith KP, Fields JG, Voogt RD, Deng B, Lam YW, Mintz KP. The cell envelope proteome of Aggregatibacter actinomycetemcomitans. Mol Oral Microbiol 2014; 30:97-110. [PMID: 25055881 DOI: 10.1111/omi.12074] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/18/2014] [Indexed: 12/18/2022]
Abstract
The cell envelope of gram-negative bacteria serves a critical role in maintenance of cellular homeostasis, resistance to external stress, and host-pathogen interactions. Envelope protein composition is influenced by the physiological and environmental demands placed on the bacterium. In this study, we report a comprehensive compilation of cell envelope proteins from the periodontal and systemic pathogen Aggregatibacter actinomycetemcomitans VT1169, an afimbriated serotype b strain. The urea-extracted membrane proteins were identified by mass spectrometry-based shotgun proteomics. The membrane proteome, isolated from actively growing bacteria under normal laboratory conditions, included 648 proteins representing 27% of the predicted open reading frames in the genome. Bioinformatic analyses were used to annotate and predict the cellular location and function of the proteins. Surface adhesins, porins, lipoproteins, numerous influx and efflux pumps, multiple sugar, amino acid and iron transporters, and components of the type I, II and V secretion systems were identified. Periplasmic space and cytoplasmic proteins with chaperone function were also identified. A total of 107 proteins with unknown function were associated with the cell envelope. Orthologs of a subset of these uncharacterized proteins are present in other bacterial genomes, whereas others are found exclusively in A. actinomycetemcomitans. This knowledge will contribute to elucidating the role of cell envelope proteins in bacterial growth and survival in the oral cavity.
Collapse
Affiliation(s)
- K P Smith
- Department of Microbiology and Molecular Genetics, University of Vermont, Burlington, VT, USA
| | | | | | | | | | | |
Collapse
|
128
|
Notari DL, Molin A, Davanzo V, Picolotto D, Ribeiro HG, Silva SDAE. IntergenicDB: a database for intergenic sequences. Bioinformation 2014; 10:381-3. [PMID: 25097383 PMCID: PMC4110431 DOI: 10.6026/97320630010381] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2014] [Accepted: 05/24/2014] [Indexed: 12/01/2022] Open
Abstract
A whole genome contains not only coding regions, but also non-coding regions. These are located between the end of a given
coding region and the beginning of the following coding region. For this reason, the information about gene regulation process
underlies in intergenic regions. There is no easy way to obtain intergenic regions from current available databases. IntergenicDB
was developed to integrate data of intergenic regions and their gene related information from NCBI databases. The main goal of
INTERGENICDB is to offer friendly database for intergenic sequences of bacterial genomes.
Collapse
Affiliation(s)
- Daniel Luis Notari
- Centro de Computação e Tecnologia da Informação, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil ; Instituto de Biotecnologia, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| | - Aurione Molin
- Centro de Computação e Tecnologia da Informação, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| | - Vanessa Davanzo
- Centro de Computação e Tecnologia da Informação, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| | - Douglas Picolotto
- Centro de Computação e Tecnologia da Informação, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| | - Helena Graziottin Ribeiro
- Centro de Computação e Tecnologia da Informação, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| | - Scheila de Avila E Silva
- Instituto de Biotecnologia, Universidade de Caxias do Sul, Rua Francisco Getúlio Vargas, 1130 - CEP 95070-560 - Caxias do Sul, Rio Grande do Sul, Brasil
| |
Collapse
|
129
|
Peña-Soler E, Fernandez FJ, López-Estepa M, Garces F, Richardson AJ, Quintana JF, Rudd KE, Coll M, Vega MC. Structural analysis and mutant growth properties reveal distinctive enzymatic and cellular roles for the three major L-alanine transaminases of Escherichia coli. PLoS One 2014; 9:e102139. [PMID: 25014014 PMCID: PMC4094517 DOI: 10.1371/journal.pone.0102139] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2014] [Accepted: 06/13/2014] [Indexed: 12/17/2022] Open
Abstract
In order to maintain proper cellular function, the metabolism of the bacterial microbiota presents several mechanisms oriented to keep a correctly balanced amino acid pool. Central components of these mechanisms are enzymes with alanine transaminase activity, pyridoxal 5′-phosphate-dependent enzymes that interconvert alanine and pyruvate, thereby allowing the precise control of alanine and glutamate concentrations, two of the most abundant amino acids in the cellular amino acid pool. Here we report the 2.11-Å crystal structure of full-length AlaA from the model organism Escherichia coli, a major bacterial alanine aminotransferase, and compare its overall structure and active site composition with detailed atomic models of two other bacterial enzymes capable of catalyzing this reaction in vivo, AlaC and valine-pyruvate aminotransferase (AvtA). Apart from a narrow entry channel to the active site, a feature of this new crystal structure is the role of an active site loop that closes in upon binding of substrate-mimicking molecules, and which has only been previously reported in a plant enzyme. Comparison of the available structures indicates that beyond superficial differences, alanine aminotransferases of diverse phylogenetic origins share a universal reaction mechanism that depends on an array of highly conserved amino acid residues and is similarly regulated by various unrelated motifs. Despite this unifying mechanism and regulation, growth competition experiments demonstrate that AlaA, AlaC and AvtA are not freely exchangeable in vivo, suggesting that their functional repertoire is not completely redundant thus providing an explanation for their independent evolutionary conservation.
Collapse
Affiliation(s)
- Esther Peña-Soler
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (Spanish National Research Council, CSIC), Madrid, Spain
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain
| | - Francisco J. Fernandez
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (Spanish National Research Council, CSIC), Madrid, Spain
| | - Miguel López-Estepa
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (Spanish National Research Council, CSIC), Madrid, Spain
| | - Fernando Garces
- The Scripps Research Institute, La Jolla, California, United States of America
| | - Andrew J. Richardson
- University of Miami Miller School of Medicine, Miami, Florida, United States of America
| | - Juan F. Quintana
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (Spanish National Research Council, CSIC), Madrid, Spain
| | - Kenneth E. Rudd
- University of Miami Miller School of Medicine, Miami, Florida, United States of America
| | - Miquel Coll
- Institute for Research in Biomedicine (IRB Barcelona), Barcelona, Spain
- Institut de Biologia Molecular de Barcelona (IBMB-CSIC), Barcelona, Spain
| | - M. Cristina Vega
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas (Spanish National Research Council, CSIC), Madrid, Spain
- * E-mail:
| |
Collapse
|
130
|
Gómez-Sagasti MT, Becerril JM, Martín I, Epelde L, Garbisu C. cDNA microarray assessment of early gene expression profiles in Escherichia coli cells exposed to a mixture of heavy metals. Cell Biol Toxicol 2014; 30:207-32. [DOI: 10.1007/s10565-014-9281-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2014] [Accepted: 06/12/2014] [Indexed: 12/30/2022]
|
131
|
Krishnakumar R, Grose C, Haft DH, Zaveri J, Alperovich N, Gibson DG, Merryman C, Glass JI. Simultaneous non-contiguous deletions using large synthetic DNA and site-specific recombinases. Nucleic Acids Res 2014; 42:e111. [PMID: 24914053 PMCID: PMC4132700 DOI: 10.1093/nar/gku509] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Toward achieving rapid and large scale genome modification directly in a target organism, we have developed a new genome engineering strategy that uses a combination of bioinformatics aided design, large synthetic DNA and site-specific recombinases. Using Cre recombinase we swapped a target 126-kb segment of the Escherichia coli genome with a 72-kb synthetic DNA cassette, thereby effectively eliminating over 54 kb of genomic DNA from three non-contiguous regions in a single recombination event. We observed complete replacement of the native sequence with the modified synthetic sequence through the action of the Cre recombinase and no competition from homologous recombination. Because of the versatility and high-efficiency of the Cre-lox system, this method can be used in any organism where this system is functional as well as adapted to use with other highly precise genome engineering systems. Compared to present-day iterative approaches in genome engineering, we anticipate this method will greatly speed up the creation of reduced, modularized and optimized genomes through the integration of deletion analyses data, transcriptomics, synthetic biology and site-specific recombination.
Collapse
Affiliation(s)
- Radha Krishnakumar
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| | - Carissa Grose
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| | - Daniel H Haft
- Bioinformatics, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| | - Jayshree Zaveri
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| | - Nina Alperovich
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| | - Daniel G Gibson
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 4120 Torrey Pines Road, La Jolla, CA 92037, USA Synthetic Genomics Inc., 11149 N. Torrey Pines Road, La Jolla, CA 92037, USA
| | - Chuck Merryman
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 4120 Torrey Pines Road, La Jolla, CA 92037, USA
| | - John I Glass
- Synthetic Biology and Bioenergy, J. Craig Venter Institute, 9704 Medical Center Dr., Rockville, MD 20850, USA
| |
Collapse
|
132
|
Chen DH, Huang Y, Liu C, Ruan Y, Shen WH. Functional conservation and divergence of J-domain-containing ZUO1/ZRF orthologs throughout evolution. PLANTA 2014; 239:1159-1173. [PMID: 24659052 DOI: 10.1007/s00425-014-2058-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2013] [Accepted: 03/05/2014] [Indexed: 06/03/2023]
Abstract
Heat shock protein 40s (Hsp40s), also known as J-proteins, are conserved in prokaryotes and eukaryotes. The Zuotin/Zuotin-related factor (ZUO1/ZRF) family belongs to a novel Hsp40 clade exclusively found in eukaryotes. Zuotin/Zuotin-related factor proteins are characterized by a large N terminal ZUO1 domain originally identified in the yeast ZUO1 protein. The ZUO1 domain is characterized by a highly conserved J-domain, together with an atypical UBD domain first identified in the human ZRF1 protein. Furthermore, ZUO1/ZRF protein families in animals and plants harbor a pair of C terminal SANT domains, suggesting the divergence of their functions with those in fungi. Zuotin/Zuotin-related factor proteins retain the ancestral function as an Hsp70co-chaperone implicated in protein folding and renaturation after stress; these proteins also perform diverse neofunctions in the cytoplasm and transcriptional and/or epigenetic regulatory functions in the nucleus. Therefore, these proteins are involved in translational fidelity control, ribosomal biogenesis, asymmetric cell division, cell cycle, apoptosis, differentiation, and tumorigenesis. The results of sequence and domain organization analysis of proteins from diverse organisms provided valuable insights into the evolutionary conservation and diversity of ZUO1/ZRF protein family. Further, phylogenetic analysis provides a platform for future functional investigation on the ZUO1/ZRF protein family, particularly in higher plants.
Collapse
|
133
|
Sharma V, Prère MF, Canal I, Firth AE, Atkins JF, Baranov PV, Fayet O. Analysis of tetra- and hepta-nucleotides motifs promoting -1 ribosomal frameshifting in Escherichia coli. Nucleic Acids Res 2014; 42:7210-25. [PMID: 24875478 PMCID: PMC4066793 DOI: 10.1093/nar/gku386] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Programmed ribosomal -1 frameshifting is a non-standard decoding process occurring when ribosomes encounter a signal embedded in the mRNA of certain eukaryotic and prokaryotic genes. This signal has a mandatory component, the frameshift motif: it is either a Z_ZZN tetramer or a X_XXZ_ZZN heptamer (where ZZZ and XXX are three identical nucleotides) allowing cognate or near-cognate repairing to the -1 frame of the A site or A and P sites tRNAs. Depending on the signal, the frameshifting frequency can vary over a wide range, from less than 1% to more than 50%. The present study combines experimental and bioinformatics approaches to carry out (i) a systematic analysis of the frameshift propensity of all possible motifs (16 Z_ZZN tetramers and 64 X_XXZ_ZZN heptamers) in Escherichia coli and (ii) the identification of genes potentially using this mode of expression amongst 36 Enterobacteriaceae genomes. While motif efficiency varies widely, a major distinctive rule of bacterial -1 frameshifting is that the most efficient motifs are those allowing cognate re-pairing of the A site tRNA from ZZN to ZZZ. The outcome of the genomic search is a set of 69 gene clusters, 59 of which constitute new candidates for functional utilization of -1 frameshifting.
Collapse
Affiliation(s)
- Virag Sharma
- School of Biochemistry and Cell biology, University College Cork, Cork, Ireland
| | - Marie-Françoise Prère
- Laboratoire de Microbiologie et Génétique moléculaire, UMR5100, Centre National de la Recherche Scientifique, Université Paul Sabatier-Toulouse III, 118 route de Narbonne, Toulouse 31062-cedex, France
| | - Isabelle Canal
- Laboratoire de Microbiologie et Génétique moléculaire, UMR5100, Centre National de la Recherche Scientifique, Université Paul Sabatier-Toulouse III, 118 route de Narbonne, Toulouse 31062-cedex, France
| | - Andrew E Firth
- Department of Pathology, University of Cambridge, Cambridge CB2 1QP, UK
| | - John F Atkins
- School of Biochemistry and Cell biology, University College Cork, Cork, Ireland Department of Human Genetics, University of Utah, 15N 2030E, Rm7410, Salt Lake City, UT 84112-5330, USA
| | - Pavel V Baranov
- School of Biochemistry and Cell biology, University College Cork, Cork, Ireland
| | - Olivier Fayet
- Laboratoire de Microbiologie et Génétique moléculaire, UMR5100, Centre National de la Recherche Scientifique, Université Paul Sabatier-Toulouse III, 118 route de Narbonne, Toulouse 31062-cedex, France
| |
Collapse
|
134
|
Xiu Z, Liu Y, Mathieu J, Wang J, Zhu D, Alvarez PJJ. Elucidating the genetic basis for Escherichia coli defense against silver toxicity using mutant arrays. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2014; 33:993-997. [PMID: 24408659 DOI: 10.1002/etc.2514] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 12/12/2013] [Accepted: 12/21/2013] [Indexed: 06/03/2023]
Abstract
Bacterial adaptation and defense mechanisms against silver are poorly understood at the genetic level. A library of Escherichia coli gene-deletion mutants was used to show that clones lacking sodB (coding for oxidative stress protection), lon (protein damage repair), or cusR (metal efflux pump) are quite sensitive to silver (with 7.3 ± 9.1%, 5.3 ± 1.8%, and 0.4 ± 0.1% of cells surviving, respectively, compared with 90.1 ± 5.4% survival for wild-type E. coli, after 6-h exposure to 8 mg/L AgNO(3)), suggesting the importance of the coded functions as defense mechanisms. Mutants lacking pgaB or wcaD, which code for production of extracellular polymeric substances (EPS), also showed significant (p < 0.05) sensitivity to silver exposure (23.4 ± 16.2% and 23.1 ± 32.6% survival, respectively). Transmission electron microscopy (TEM) with scanning TEM/energy-dispersive X-ray spectroscopy analysis showed accumulation of silver nanoparticles within EPS, suggesting that EPS serve as a protective barrier that also immobilizes dissolved silver as silver nanoparticles.
Collapse
Affiliation(s)
- Zongming Xiu
- Department of Civil and Environmental Engineering, Rice University, Houston, Texas, USA
| | | | | | | | | | | |
Collapse
|
135
|
Ovchinnikov S, Kamisetty H, Baker D. Robust and accurate prediction of residue-residue interactions across protein interfaces using evolutionary information. eLife 2014; 3:e02030. [PMID: 24842992 PMCID: PMC4034769 DOI: 10.7554/elife.02030] [Citation(s) in RCA: 446] [Impact Index Per Article: 44.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Do the amino acid sequence identities of residues that make contact across protein interfaces covary during evolution? If so, such covariance could be used to predict contacts across interfaces and assemble models of biological complexes. We find that residue pairs identified using a pseudo-likelihood-based method to covary across protein–protein interfaces in the 50S ribosomal unit and 28 additional bacterial protein complexes with known structure are almost always in contact in the complex, provided that the number of aligned sequences is greater than the average length of the two proteins. We use this method to make subunit contact predictions for an additional 36 protein complexes with unknown structures, and present models based on these predictions for the tripartite ATP-independent periplasmic (TRAP) transporter, the tripartite efflux system, the pyruvate formate lyase-activating enzyme complex, and the methionine ABC transporter. DOI:http://dx.doi.org/10.7554/eLife.02030.001 Proteins are considered the ‘workhorse molecules’ of life and they are involved in virtually everything that cells do. Proteins are strings of amino acids that have folded into a specific three-dimensional shape. Proteins must have the correct shape to function properly, as they often work by binding to other proteins or molecules—much like a key fitting into a lock. Working out the structure of a protein can, therefore, provide major insights into how the protein does its job. Two or more proteins can bind together and form a complex to perform various tasks; and solving the structures of these complexes can be challenging, even if the structures of the protein subunits are known. Now, Ovchinnikov, Kamisetty, and Baker have developed a method for predicting which parts of the proteins make contact with each other in a two-protein complex. Different species can have copies of the same proteins; but a copy from one species might have different amino acids at certain positions when compared to a related copy from another species. As such, when pairs of interacting proteins from different species are compared, there will be many positions in the two proteins that vary. However, if the amino acid at a position in one protein (let's call it ‘X’) varies, and the amino acid at, say, position ‘Y’ in the other protein also varies such that for any given amino acid at position Y there is often a specific amino acid at position X; positions X and Y are said to ‘co-vary’. Ovchinnikov et al. noticed that when a pair of amino acids (one from each protein in a two-protein complex) co-varied, these two amino acids tended to make contact with each other at the protein–protein interface. Ovchinnikov et al. used the new method to make predictions about the protein–protein interfaces in 28 protein complexes found in bacteria, and also to make a prediction about the interface between protein subunits in the bacterial ribosome. When these predictions were checked against the actual structures, which were all known beforehand, they were found to be accurate if the number of copies of each protein being compared is greater than the average length of the two proteins. Ovchinnikov et al. went on to predict the amino acids on the protein–protein interfaces for another 36 bacterial protein complexes with unknown structures, and to present models for four larger complexes. The next challenge is to extend the method to protein complexes that are found only in eukaryotes (i.e., not in bacteria). Since the number of related copies for eukaryotic proteins tends to be smaller, there are fewer proteins to compare and it is therefore harder to detect ‘covariation’ when it occurs. DOI:http://dx.doi.org/10.7554/eLife.02030.002
Collapse
Affiliation(s)
- Sergey Ovchinnikov
- Department of Biochemistry, Howard Hughes Medical Institute, University of Washington, Seattle, United States Molecular and Cellular Biology Program, University of Washington, Seattle, United States
| | - Hetunandan Kamisetty
- Department of Biochemistry, Howard Hughes Medical Institute, University of Washington, Seattle, United States Facebook Inc., Seattle, United States
| | - David Baker
- Department of Biochemistry, Howard Hughes Medical Institute, University of Washington, Seattle, United States
| |
Collapse
|
136
|
Park MK, Lee SH, Yang KS, Jung SC, Lee JH, Kim SC. Enhancing recombinant protein production with an Escherichia coli host strain lacking insertion sequences. Appl Microbiol Biotechnol 2014; 98:6701-13. [DOI: 10.1007/s00253-014-5739-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2014] [Revised: 03/27/2014] [Accepted: 03/29/2014] [Indexed: 02/07/2023]
|
137
|
SearchDOGS bacteria, software that provides automated identification of potentially missed genes in annotated bacterial genomes. J Bacteriol 2014; 196:2030-42. [PMID: 24659774 DOI: 10.1128/jb.01368-13] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
We report the development of SearchDOGS Bacteria, software to automatically detect missing genes in annotated bacterial genomes by combining BLAST searches with comparative genomics. Having successfully applied the approach to yeast genomes, we redeveloped SearchDOGS to function as a standalone, downloadable package, requiring only a set of GenBank annotation files as input. The software automatically generates a homology structure using reciprocal BLAST and a synteny-based method; this is followed by a scan of the entire genome of each species for unannotated genes. Results are provided in a HTML interface, providing coordinates, BLAST results, syntenic location, omega values (Ka/Ks, where Ks is the number of synonymous substitutions per synonymous site and Ka is the number of nonsynonymous substitutions per nonsynonymous site) for protein conservation estimates, and other information for each candidate gene. Using SearchDOGS Bacteria, we identified 155 gene candidates in the Shigella boydii sb227 genome, including 56 candidates of length < 60 codons. SearchDOGS Bacteria has two major advantages over currently available annotation software. First, it outperforms current methods in terms of sensitivity and is highly effective at identifying small or highly diverged genes. Second, as a freely downloadable package, it can be used with unpublished or confidential data.
Collapse
|
138
|
Wozniak M, Wong L, Tiuryn J. eCAMBer: efficient support for large-scale comparative analysis of multiple bacterial strains. BMC Bioinformatics 2014; 15:65. [PMID: 24597904 PMCID: PMC4023553 DOI: 10.1186/1471-2105-15-65] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2013] [Accepted: 02/24/2014] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Inconsistencies are often observed in the genome annotations of bacterial strains. Moreover, these inconsistencies are often not reflected by sequence discrepancies, but are caused by wrongly annotated gene starts as well as mis-identified gene presence. Thus, tools are needed for improving annotation consistency and accuracy among sets of bacterial strain genomes. RESULTS We have developed eCAMBer, a tool for efficiently supporting comparative analysis of multiple bacterial strains within the same species. eCAMBer is a highly optimized revision of our earlier tool, CAMBer, scaling it up for significantly larger datasets comprising hundreds of bacterial strains. eCAMBer works in two phases. First, it transfers gene annotations among all considered bacterial strains. In this phase, it also identifies homologous gene families and annotation inconsistencies. Second, eCAMBer, tries to improve the quality of annotations by resolving the gene start inconsistencies and filtering out gene families arising from annotation errors propagated in the previous phase. CONCLUSIONS [corrected] eCAMBer efficiently identifies and resolves annotation inconsistencies among closely related bacterial genomes. It outperforms other competing tools both in terms of running time and accuracy of produced annotations. Software, user manual, and case study results are available at the project website: http://bioputer.mimuw.edu.pl/ecamber.
Collapse
Affiliation(s)
- Michal Wozniak
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland.
| | | | | |
Collapse
|
139
|
Abstract
DNAs encoding polypeptides often contain design errors that cause experiments to prematurely fail. One class of design errors is incorrect or missing elements in the DNA, here termed syntax errors. We have identified three major causes of syntax errors: point mutations from sequencing or manual data entry, gene structure misannotation, and unintended open reading frames (ORFs). The Engineered DNA Sequence Syntax Inspector (EDSSI) is an online bioinformatics pipeline that checks for syntax errors through three steps. First, ORF prediction in input DNA sequences is done by GeneMark; next, homologous sequences are retrieved by BLAST, and finally, syntax errors in the protein sequence are predicted by using the SIFT algorithm. We show that the EDSSI is able to identify previously published examples of syntactical errors and also show that our indel addition to the SIFT program is 97% accurate on a test set of Escherichia coli proteins. The EDSSI is available at http://andersonlab.qb3.berkeley.edu/Software/EDSSI/ .
Collapse
Affiliation(s)
| | - J. Christopher Anderson
- Bioengineering Department, University of California, Berkeley, California 94720, United States
| |
Collapse
|
140
|
Brambilla L, Morán-Barrio J, Viale AM. Expression of the Escherichia coli ompW colicin S4 receptor gene is regulated by temperature and modulated by the H-NS and StpA nucleoid-associated proteins. FEMS Microbiol Lett 2014; 352:238-44. [PMID: 24444297 DOI: 10.1111/1574-6968.12385] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2013] [Revised: 01/09/2014] [Accepted: 01/13/2014] [Indexed: 12/15/2022] Open
Abstract
The OmpW family consists of a ubiquitous group of small outer membrane (OM) β-barrel proteins of Gram-negative bacteria with proposed roles in environmental adaptation but poorly understood mechanisms of expression. We report here that Escherichia coli K-12 OmpW contents are drastically modified by temperature changes compatible with the leap from the environment to warm-blooded hosts and/or vice versa. Thus, while OmpW is present in the OM of bacteria grown at 37 °C, it sharply disappears at 23 °C with the concomitant acquisition of colicin S4 resistance by the cells. ompW::lacZY fusions indicated that temperature regulation operates at the level of transcription, being ompW expression almost abolished at 23 °C as compared to 37 °C. Moreover, E. coli Δhns mutants lacking H-NS showed reductions in ompW transcription and OmpW contents at 37 °C, indicating positive modulatory roles for this nucleoid-structuring protein in ompW expression. Also, ΔhnsΔstpA double mutants simultaneously lacking H-NS and its paralog StpA showed more severe reductions in ompW expression at 37 °C, resulting in the complete loss of OmpW. The overall results indicate that OmpW contents in E. coli are regulated by both temperature and H-NS and reinforce OmpW functions in bacterial adaptation to warm-blooded hosts.
Collapse
Affiliation(s)
- Luciano Brambilla
- Instituto de Biología Molecular y Celular de Rosario (IBR, CONICET) and Departamento de Microbiología, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario (UNR), Rosario, Argentina
| | | | | |
Collapse
|
141
|
Galperin MY, Koonin EV. Comparative Genomics Approaches to Identifying Functionally Related Genes. ALGORITHMS FOR COMPUTATIONAL BIOLOGY 2014. [DOI: 10.1007/978-3-319-07953-0_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
142
|
Paetzel M. Structure and mechanism of Escherichia coli type I signal peptidase. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2013; 1843:1497-508. [PMID: 24333859 DOI: 10.1016/j.bbamcr.2013.12.003] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/25/2013] [Revised: 11/26/2013] [Accepted: 12/04/2013] [Indexed: 12/16/2022]
Abstract
Type I signal peptidase is the enzyme responsible for cleaving off the amino-terminal signal peptide from proteins that are secreted across the bacterial cytoplasmic membrane. It is an essential membrane bound enzyme whose serine/lysine catalytic dyad resides on the exo-cytoplasmic surface of the bacterial membrane. This review discusses the progress that has been made in the structural and mechanistic characterization of Escherichia coli type I signal peptidase (SPase I) as well as efforts to develop a novel class of antibiotics based on SPase I inhibition. This article is part of a Special Issue entitled: Protein trafficking and secretion in bacteria. Guest Editors: Anastassios Economou and Ross Dalbey.
Collapse
Affiliation(s)
- Mark Paetzel
- Department of Molecular Biology and Biochemistry, Simon Fraser University, 8888 University Drive, Burnaby, British Columbia V5A 1S6, Canada.
| |
Collapse
|
143
|
Hu JC, Sherlock G, Siegele DA, Aleksander SA, Ball CA, Demeter J, Gouni S, Holland TA, Karp PD, Lewis JE, Liles NM, McIntosh BK, Mi H, Muruganujan A, Wymore F, Thomas PD, Altman T. PortEco: a resource for exploring bacterial biology through high-throughput data and analysis tools. Nucleic Acids Res 2013; 42:D677-84. [PMID: 24285306 PMCID: PMC3965092 DOI: 10.1093/nar/gkt1203] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
PortEco (http://porteco.org) aims to collect, curate and provide data and analysis tools to support basic biological research in Escherichia coli (and eventually other bacterial systems). PortEco is implemented as a ‘virtual’ model organism database that provides a single unified interface to the user, while integrating information from a variety of sources. The main focus of PortEco is to enable broad use of the growing number of high-throughput experiments available for E. coli, and to leverage community annotation through the EcoliWiki and GONUTS systems. Currently, PortEco includes curated data from hundreds of genome-wide RNA expression studies, from high-throughput phenotyping of single-gene knockouts under hundreds of annotated conditions, from chromatin immunoprecipitation experiments for tens of different DNA-binding factors and from ribosome profiling experiments that yield insights into protein expression. Conditions have been annotated with a consistent vocabulary, and data have been consistently normalized to enable users to find, compare and interpret relevant experiments. PortEco includes tools for data analysis, including clustering, enrichment analysis and exploration via genome browsers. PortEco search and data analysis tools are extensively linked to the curated gene, metabolic pathway and regulation content at its sister site, EcoCyc.
Collapse
Affiliation(s)
- James C Hu
- Department of Biochemistry and Biophysics, Texas A&M University, College Station, TX 77843, USA, Department of Genetics, Stanford University, Stanford, CA 94305, USA, Department of Biology, Texas A&M University, College Station, TX, 77843, USA, Artificial Intelligence Center, SRI International, Menlo Park, CA 94025, USA and Deptartment of Preventive Medicine, University of Southern California, Los Angeles, CA 90089, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
144
|
Abstract
The Transporter Classification Database (TCDB; http://www.tcdb.org) serves as a common reference point for transport protein research. The database contains more than 10 000 non-redundant proteins that represent all currently recognized families of transmembrane molecular transport systems. Proteins in TCDB are organized in a five level hierarchical system, where the first two levels are the class and subclass, the second two are the family and subfamily, and the last one is the transport system. Superfamilies that contain multiple families are included as hyperlinks to the five tier TC hierarchy. TCDB includes proteins from all types of living organisms and is the only transporter classification system that is both universal and recognized by the International Union of Biochemistry and Molecular Biology. It has been expanded by manual curation, contains extensive text descriptions providing structural, functional, mechanistic and evolutionary information, is supported by unique software and is interconnected to many other relevant databases. TCDB is of increasing usefulness to the international scientific community and can serve as a model for the expansion of database technologies. This manuscript describes an update of the database descriptions previously featured in NAR database issues.
Collapse
Affiliation(s)
- Milton H Saier
- Department of Molecular Biology, University of California at San Diego, La Jolla, CA 92093-0116, USA
| | | | | | | |
Collapse
|
145
|
Papakostas K, Botou M, Frillingos S. Functional identification of the hypoxanthine/guanine transporters YjcD and YgfQ and the adenine transporters PurP and YicO of Escherichia coli K-12. J Biol Chem 2013; 288:36827-40. [PMID: 24214977 DOI: 10.1074/jbc.m113.523340] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The evolutionarily broad family nucleobase-cation symporter-2 (NCS2) encompasses transporters that are conserved in binding site architecture but diverse in substrate selectivity. Putative purine transporters of this family fall into one of two homology clusters: COG2233, represented by well studied xanthine and/or uric acid permeases, and COG2252, consisting of transporters for adenine, guanine, and/or hypoxanthine that remain unknown with respect to structure-function relationships. We analyzed the COG2252 genes of Escherichia coli K-12 with homology modeling, functional overexpression, and mutagenesis and showed that they encode high affinity permeases for the uptake of adenine (PurP and YicO) or guanine and hypoxanthine (YjcD and YgfQ). The two pairs of paralogs differ clearly in their substrate and ligand preferences. Of 25 putative inhibitors tested, PurP and YicO recognize with low micromolar affinity N(6)-benzoyladenine, 2,6-diaminopurine, and purine, whereas YjcD and YgfQ recognize 1-methylguanine, 8-azaguanine, 6-thioguanine, and 6-mercaptopurine and do not recognize any of the PurP ligands. Furthermore, the permeases PurP and YjcD were subjected to site-directed mutagenesis at highly conserved sites of transmembrane segments 1, 3, 8, 9, and 10, which have been studied also in COG2233 homologs. Residues irreplaceable for uptake activity or crucial for substrate selectivity were found at positions occupied by similar role amino acids in the Escherichia coli xanthine- and uric acid-transporting homologs (XanQ and UacT, respectively) and predicted to be at or around the binding site. Our results support the contention that the distantly related transporters of COG2233 and COG2252 use topologically similar side chain determinants to dictate their function and the distinct purine selectivity profiles.
Collapse
Affiliation(s)
- Konstantinos Papakostas
- From the Laboratory of Biological Chemistry, University of Ioannina Medical School, 45110 Ioannina, Greece
| | | | | |
Collapse
|
146
|
Szmidt-Middleton HL, Ouellet M, Adams PD, Keasling JD, Mukhopadhyay A. Utilizing a highly responsive gene, yhjX, in E. coli based production of 1,4-butanediol. Chem Eng Sci 2013. [DOI: 10.1016/j.ces.2013.06.044] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
|
147
|
Four products from Escherichia coli pseudogenes increase hydrogen production. Biochem Biophys Res Commun 2013; 439:576-9. [DOI: 10.1016/j.bbrc.2013.09.016] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2013] [Accepted: 09/03/2013] [Indexed: 11/22/2022]
|
148
|
Zhou J, Richardson AJ, Rudd KE. EcoGene-RefSeq: EcoGene tools applied to the RefSeq prokaryotic genomes. Bioinformatics 2013; 29:1917-8. [PMID: 23736533 DOI: 10.1093/bioinformatics/btt302] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
SUMMARY EcoGene.org is a genome database and website dedicated to Escherichia coli K-12 substrain MG1655 that is revised daily using information derived from the biomedical literature and in-house analysis. EcoGene is a major source of annotation updates for the MG1655 Genbank record, one of only a few Genbank genome records that are updated by a community effort. The Reference Sequence (RefSeq) database, built by The National Center for Biotechnology Information, comprises a set of duplicate Genbank genome records that can be modified by the NCBI staff annotators. EcoGene-RefSeq is being developed as a stand-alone internet resource to facilitate the usage of EcoGene-based tools on any of the >2400 completed prokaryotic genome records that are currently available at the RefSeq database. AVAILABILITY The web interface of EcoGene-RefSeq is available at http://www.ecogene.org/refseq. CONTACT krudd@med.miami.edu or j.zhou1@miami.edu.
Collapse
Affiliation(s)
- Jindan Zhou
- Department of Biochemistry and Molecular Biology, The Miller School of Medicine at the University of Miami, Miami, FL 33136, USA.
| | | | | |
Collapse
|
149
|
Lindemann C, Lupilova N, Müller A, Warscheid B, Meyer HE, Kuhlmann K, Eisenacher M, Leichert LI. Redox proteomics uncovers peroxynitrite-sensitive proteins that help Escherichia coli to overcome nitrosative stress. J Biol Chem 2013; 288:19698-714. [PMID: 23696645 DOI: 10.1074/jbc.m113.457556] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
Peroxynitrite is a highly reactive chemical species with antibacterial properties that are synthesized in immune cells. In a proteomic approach, we identified specific target proteins of peroxynitrite-induced modifications in Escherichia coli. Although peroxynitrite caused a fairly indiscriminate nitration of tyrosine residues, reversible modifications of protein thiols were highly specific. We used a quantitative redox proteomic method based on isotope-coded affinity tag chemistry and identified four proteins consistently thiol-modified in cells treated with peroxynitrite as follows: AsnB, FrmA, MaeB, and RidA. All four were required for peroxynitrite stress tolerance in vivo. Three of the identified proteins were modified at highly conserved cysteines, and MaeB and FrmA are known to be directly involved in the oxidative and nitrosative stress response in E. coli. In in vitro studies, we could show that the activity of RidA, a recently discovered enamine/imine deaminase, is regulated in a specific manner by the modification of its single conserved cysteine. Mutation of this cysteine 107 to serine generated a constitutively active protein that was not susceptible to peroxynitrite.
Collapse
Affiliation(s)
- Claudia Lindemann
- Medical Proteome Center, Ruhr University Bochum, Universitätsstrasse 150, 44780 Bochum, Germany
| | | | | | | | | | | | | | | |
Collapse
|
150
|
Ivankov DN, Payne SH, Galperin MY, Bonissone S, Pevzner PA, Frishman D. How many signal peptides are there in bacteria? Environ Microbiol 2013; 15:983-90. [PMID: 23556536 PMCID: PMC3621014 DOI: 10.1111/1462-2920.12105] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Over the last 5 years proteogenomics (using mass spectroscopy to identify proteins predicted from genomic sequences) has emerged as a promising approach to the high-throughput identification of protein N-termini, which remains a problem in genome annotation. Comparison of the experimentally determined N-termini with those predicted by sequence analysis tools allows identification of the signal peptides and therefore conclusions on the cytoplasmic or extracytoplasmic (periplasmic or extracellular) localization of the respective proteins. We present here the results of a proteogenomic study of the signal peptides in Escherichia coli K-12 and compare its results with the available experimental data and predictions by such software tools as SignalP and Phobius. A single proteogenomics experiment recovered more than a third of all signal peptides that had been experimentally determined during the past three decades and confirmed at least 31 additional signal peptides, mostly in the known exported proteins, which had been previously predicted but not validated. The filtering of putative signal peptides for the peptide length and the presence of an eight-residue hydrophobic patch and a typical signal peptidase cleavage site proved sufficient to eliminate the false-positive hits. Surprisingly, the results of this proteogenomics study, as well as a re-analysis of the E. coli genome with the latest version of SignalP program, show that the fraction of proteins containing signal peptides is only about 10%, or half of previous estimates.
Collapse
Affiliation(s)
- Dmitry N. Ivankov
- Technische Universität München, Department of Genome-Oriented Bioinformatics, 85354 Freising, Germany
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Michael Y. Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | - Dmitrij Frishman
- Technische Universität München, Department of Genome-Oriented Bioinformatics, 85354 Freising, Germany
- Helmholtz Zentrum Munich, National Research Center for Environment and Health, Institute for Bioinformatics, 85764 Neuherberg, Germany
| |
Collapse
|