1
|
Usher B, Birkholz N, Beck IN, Fagerlund RD, Jackson SA, Fineran PC, Blower TR. Crystal structure of the anti-CRISPR repressor Aca2. J Struct Biol 2021; 213:107752. [PMID: 34116143 PMCID: PMC8434428 DOI: 10.1016/j.jsb.2021.107752] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2021] [Revised: 05/26/2021] [Accepted: 06/06/2021] [Indexed: 11/16/2022]
Abstract
The crystal structure of the anti-CRISPR repressor Aca2 has been solved to 1.34 Å. Aca2 contains a new dimerization domain for HTH transcriptional regulators. Aca2-like regulators are found encoded in diverse biological contexts.
Bacteria use adaptive CRISPR-Cas immune mechanisms to protect from invasion by bacteriophages and other mobile genetic elements. In response, bacteriophages and mobile genetic elements have co-evolved anti-CRISPR proteins to inhibit the bacterial defense. We and others have previously shown that anti-CRISPR associated (Aca) proteins can regulate this anti-CRISPR counter-attack. Here, we report the first structure of an Aca protein, the Aca2 DNA-binding transcriptional autorepressor from Pectobacterium carotovorum bacteriophage ZF40, determined to 1.34 Å. Aca2 presents a conserved N-terminal helix-turn-helix DNA-binding domain and a previously uncharacterized C-terminal dimerization domain. Dimerization positions the Aca2 recognition helices for insertion into the major grooves of target DNA, supporting its role in regulating anti-CRISPRs. Furthermore, database comparisons identified uncharacterized Aca2 structural homologs in pathogenic bacteria, suggesting that Aca2 represents the first characterized member of a more widespread family of transcriptional regulators.
Collapse
Affiliation(s)
- Ben Usher
- Department of Biosciences, Durham University, Stockton Road, Durham DH1 3LE, UK
| | - Nils Birkholz
- Department of Microbiology and Immunology, University of Otago, PO Box 56, Dunedin 9054, New Zealand
| | - Izaak N Beck
- Department of Biosciences, Durham University, Stockton Road, Durham DH1 3LE, UK
| | - Robert D Fagerlund
- Department of Microbiology and Immunology, University of Otago, PO Box 56, Dunedin 9054, New Zealand; Genetics Otago, University of Otago, PO Box 56, Dunedin 9054, New Zealand
| | - Simon A Jackson
- Department of Microbiology and Immunology, University of Otago, PO Box 56, Dunedin 9054, New Zealand; Genetics Otago, University of Otago, PO Box 56, Dunedin 9054, New Zealand
| | - Peter C Fineran
- Department of Microbiology and Immunology, University of Otago, PO Box 56, Dunedin 9054, New Zealand; Genetics Otago, University of Otago, PO Box 56, Dunedin 9054, New Zealand; Bio-Protection Research Centre, University of Otago, PO Box 56, Dunedin 9054, New Zealand
| | - Tim R Blower
- Department of Biosciences, Durham University, Stockton Road, Durham DH1 3LE, UK.
| |
Collapse
|
2
|
Flavodoxins as Novel Therapeutic Targets against Helicobacter pylori and Other Gastric Pathogens. Int J Mol Sci 2020; 21:ijms21051881. [PMID: 32164177 PMCID: PMC7084853 DOI: 10.3390/ijms21051881] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 03/04/2020] [Accepted: 03/06/2020] [Indexed: 02/06/2023] Open
Abstract
Flavodoxins are small soluble electron transfer proteins widely present in bacteria and absent in vertebrates. Flavodoxins participate in different metabolic pathways and, in some bacteria, they have been shown to be essential proteins representing promising therapeutic targets to fight bacterial infections. Using purified flavodoxin and chemical libraries, leads can be identified that block flavodoxin function and act as bactericidal molecules, as it has been demonstrated for Helicobacter pylori (Hp), the most prevalent human gastric pathogen. Increasing antimicrobial resistance by this bacterium has led current therapies to lose effectiveness, so alternative treatments are urgently required. Here, we summarize, with a focus on flavodoxin, opportunities for pharmacological intervention offered by the potential protein targets described for this bacterium and provide information on other gastrointestinal pathogens and also on bacteria from the gut microbiota that contain flavodoxin. The process of discovery and development of novel antimicrobials specific for Hp flavodoxin that is being carried out in our group is explained, as it can be extrapolated to the discovery of inhibitors specific for other gastric pathogens. The high specificity for Hp of the antimicrobials developed may be of help to reduce damage to the gut microbiota and to slow down the development of resistant Hp mutants.
Collapse
|
3
|
ORCAE-AOCC: A Centralized Portal for the Annotation of African Orphan Crop Genomes. Genes (Basel) 2019; 10:genes10120950. [PMID: 31757073 PMCID: PMC6969924 DOI: 10.3390/genes10120950] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 11/15/2019] [Accepted: 11/18/2019] [Indexed: 12/19/2022] Open
Abstract
ORCAE (Online Resource for Community Annotation of Eukaryotes) is a public genome annotation curation resource. ORCAE-AOCC is a branch that is dedicated to the genomes published as part of the African Orphan Crops Consortium (AOCC). The motivation behind the development of the ORCAE platform was to create a knowledge-based website where the research-community can make contributions to improve genome annotations. All changes to any given gene-model or gene description are stored, and the entire annotation history can be retrieved. Genomes can either be set to “public” or “restricted” mode; anonymous users can browse public genomes but cannot make any changes. Aside from providing a user- friendly interface to view genome annotations, the platform also includes tools and information (such as gene expression evidence) that enables authorized users to edit and validate genome annotations. The ORCAE-AOCC platform will enable various stakeholders from around the world to coordinate their efforts to annotate and study underutilized crops.
Collapse
|
4
|
de Laia ML, Moreira LM, Gonçalves JF, Ferro MIT, Rodrigues ACP, dos Santos JN, Felestrino ÉB, Ferro JA. Gene expression analysis identifies hypothetical genes that may be critical during the infection process of Xanthomonas citri subsp. citri. ELECTRON J BIOTECHN 2019. [DOI: 10.1016/j.ejbt.2019.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022] Open
|
5
|
Kim MY, Kim C, Ainala SK, Bae H, Jeon BH, Park S, Kim JR. Metabolic shift of Klebsiella pneumoniae L17 by electrode-based electron transfer using glycerol in a microbial fuel cell. Bioelectrochemistry 2019; 125:1-7. [DOI: 10.1016/j.bioelechem.2018.08.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 08/06/2018] [Accepted: 08/06/2018] [Indexed: 10/28/2022]
|
6
|
Reshma S, Sathyanarayanan N, Nagendra H. Characterization of a hypothetical protein YVRE from Bacillus subtilis indicates its key role as glucono-lactonase in pentose phosphate pathway and glucose metabolism. Bioinformation 2017; 13:430-438. [PMID: 29379263 PMCID: PMC5767921 DOI: 10.6026/97320630013430] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2017] [Revised: 12/05/2017] [Accepted: 12/05/2017] [Indexed: 11/23/2022] Open
Abstract
Hypothetical proteins are functionally uncharacterized proteins with assigned function using sequence annotation tools. Almost half of the coding regions of several genomes are hypothetical proteins. Therefore, it is of our interest to characterize a hypothetical protein YVRE from the model system Bacillus subtilis using known data. YVRE is assigned the function as a glucono-lactonase using prediction and phylogenetic analysis. A molecular dynamics simulated homology model of YVRE (with calcium) using human senescence marker protein 30 /SMP30 (PDB ID: 3G4E) as template is reported for functional inference. It is observed that the protein possesses bivalent metal binding domain. Molecular docking studies with the substrate glucono-δ-lactone show YVRE binding with the substrate. This data was further validated using cloning and sub-cloning in pUC57 and pET22b+ respectively, followed by expression and purification using nickel affinity chromatography. The activity of YVRE using the substrate glucono-δ-lactone was calculated. The results show the function of YVRE as a gluconolactonase, with higher preference to zinc than calcium or magnesium. Thus, YVRE is shown to play key role in three metabolic pathways namely, pentose phosphate pathway, ascorbate and aldarate metabolism, and caprolactam degradation.
Collapse
Affiliation(s)
- S.V. Reshma
- Department of Biotechnology, PES University, Bangalore
| | - Nitish Sathyanarayanan
- Department of Biotechnology, Sir M Visvesvaraya Institute of Technology, Hunasemaranahalli, Bangalore 562157
- Present Address: National Centre for Biological Sciences, Tata Institute for Fundamental Research, GKVK Campus, Bellary Road, Bangalore 65
| | - H.G. Nagendra
- Department of Biotechnology, Sir M Visvesvaraya Institute of Technology, Hunasemaranahalli, Bangalore 562157
| |
Collapse
|
7
|
Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015; 16:226-40. [PMID: 26773550 DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]
Abstract
Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; few phyla contain the majority, the rest only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein level experimental observations to annotate protein coding genes on a genome wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This not only allows detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. Chances of encountering such genes are more in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies, their taxonomic distribution, and also urge for targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes.
Collapse
Affiliation(s)
- Dhirendra Kumar
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Anupam Kumar Mondal
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Rintu Kutum
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| | - Debasis Dash
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
| |
Collapse
|
8
|
Zhang H, Zheng B, Gao R, Feng Y. Binding of Shewanella FadR to the fabA fatty acid biosynthetic gene: implications for contraction of the fad regulon. Protein Cell 2015; 6:667-679. [PMID: 26050090 PMCID: PMC4537474 DOI: 10.1007/s13238-015-0172-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2015] [Accepted: 05/06/2015] [Indexed: 11/25/2022] Open
Abstract
The Escherichia coli fadR protein product, a paradigm/prototypical FadR regulator, positively regulates fabA and fabB, the two critical genes for unsaturated fatty acid (UFA) biosynthesis. However the scenario in the other Ɣ-proteobacteria, such as Shewanella with the marine origin, is unusual in that Rodionov and coworkers predicted that only fabA (not fabB) has a binding site for FadR protein. It raised the possibility of fad regulon contraction. Here we report that this is the case. Sequence alignment of the FadR homologs revealed that the N-terminal DNA-binding domain exhibited remarkable similarity, whereas the ligand-accepting motif at C-terminus is relatively-less conserved. The FadR homologue of S. oneidensis (referred to FadR_she) was over-expressed and purified to homogeneity. Integrative evidence obtained by FPLC (fast protein liquid chromatography) and chemical cross-linking analyses elucidated that FadR_she protein can dimerize in solution, whose identity was determined by MALDI-TOF-MS. In vitro data from electrophoretic mobility shift assays suggested that FadR_she is almost functionally-exchangeable/equivalent to E. coli FadR (FadR_ec) in the ability of binding the E. coli fabA (and fabB) promoters. In an agreement with that of E. coli fabA, S. oneidensis fabA promoter bound both FadR_she and FadR_ec, and was disassociated specifically with the FadR regulatory protein upon the addition of long-chain acyl-CoA thioesters. To monitor in vivo effect exerted by FadR on Shewanella fabA expression, the native promoter of S. oneidensis fabA was fused to a LacZ reporter gene to engineer a chromosome fabA-lacZ transcriptional fusion in E. coli. As anticipated, the removal of fadR gene gave about 2-fold decrement of Shewanella fabA expression by β-gal activity, which is almost identical to the inhibitory level by the addition of oleate. Therefore, we concluded that fabA is contracted to be the only one member of fad regulon in the context of fatty acid synthesis in the marine bacteria Shewanella genus.
Collapse
Affiliation(s)
- Huimin Zhang
- Department of Medical Microbiology & Parasitology, Zhejiang University School of Medicine, Hangzhou, 310058 China
| | - Beiwen Zheng
- Department of Medical Microbiology & Parasitology, Zhejiang University School of Medicine, Hangzhou, 310058 China.,State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang University School of Medicine, Hangzhou, 310058 China
| | - Rongsui Gao
- Department of Medical Microbiology & Parasitology, Zhejiang University School of Medicine, Hangzhou, 310058 China
| | - Youjun Feng
- Department of Medical Microbiology & Parasitology, Zhejiang University School of Medicine, Hangzhou, 310058 China.,State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, Zhejiang University School of Medicine, Hangzhou, 310058 China
| |
Collapse
|
9
|
Xu X, Liu T, Ren X, Liu B, Yang J, Chen L, Wei C, Zheng J, Dong J, Sun L, Zhu Y, Jin Q. Proteogenomic Analysis of Trichophyton rubrum Aided by RNA Sequencing. J Proteome Res 2015; 14:2207-18. [PMID: 25868943 DOI: 10.1021/acs.jproteome.5b00009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Infections caused by dermatophytes, Trichophyton rubrum in particular, are among the most common diseases in humans. In this study, we present a proteogenomic analysis of T. rubrum based on whole-genome proteomics and RNA-Seq studies. We confirmed 4291 expressed proteins in T. rubrum and validated their annotated gene structures based on 35 874 supporting peptides. In addition, we identified 323 novel peptides (not present in the current annotated protein database of T. rubrum) that can be used to enhance current T. rubrum annotations. A total of 104 predicted genes supported by novel peptides were identified, and 127 gene models suggested by the novel peptides that conflicted with existing annotations were manually assigned based on transcriptomic evidence. RNA-Seq confirmed the validity of 95% of the total peptides. Our study provides evidence that confirms and improves the genome annotation of T. rubrum and represents the first survey of T. rubrum genome annotations based on experimental evidence. Additionally, our integrated proteomics and multisourced transcriptomics approach provides stronger evidence for annotation refinement than proteomic data alone, which helps to address the dilemma of one-hit wonders (uncertainties supported by only one peptide).
Collapse
|
10
|
Grobbler C, Virdis B, Nouwens A, Harnisch F, Rabaey K, Bond PL. Use of SWATH mass spectrometry for quantitative proteomic investigation of Shewanella oneidensis MR-1 biofilms grown on graphite cloth electrodes. Syst Appl Microbiol 2015; 38:135-9. [DOI: 10.1016/j.syapm.2014.11.007] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2014] [Revised: 11/17/2014] [Accepted: 11/19/2014] [Indexed: 11/27/2022]
|
11
|
Zhang Y, Ng CK, Cohen Y, Cao B. Cell growth and protein expression of Shewanella oneidensis in biofilms and hydrogel-entrapped cultures. MOLECULAR BIOSYSTEMS 2014; 10:1035-42. [PMID: 24626808 DOI: 10.1039/c3mb70520j] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
The performance of biofilm-based bioprocesses is difficult to predict and control because of the intrinsic heterogeneous and dynamic properties of microbial biofilms. Biofilm mimics, such as microbial cells entrapped in polymeric scaffolds that are permeable for nutrients, have been proposed to replace real biofilms to achieve long-term robust performance in engineering applications. However, the physiological differences between cells that are physically entrapped in a synthetic polymeric matrix and biofilm cells that are encased in a self-produced polymeric matrix remain unknown. In this study, using Shewanella oneidensis as a model organism and alginate hydrogel as a model synthetic matrix, we compared the cell growth and protein expression in entrapped cultures and biofilms. The hydrogel-entrapped cultures were found to exhibit a growth rate comparable with biofilms. There was no substantial difference in cell viability, surface charge, as well as hydrophobicity between the cells grown in alginate hydrogel and those grown in biofilms. However, the gel-entrapped cultures were found to be physiologically different from biofilms. The gel-entrapped cultures had a higher demand for metabolic energy. The siderophore-mediated iron uptake was repressed in the gel-entrapped cells. The presence of the hydrogel matrix decreased the expression of proteins involved in biofilm formation, while inducing the production of extracellular DNA (eDNA) in the gel-entrapped cultures. These results advance the fundamental understanding of the physiology of hydrogel-entrapped cells, which can lead to more efficient biofilm mimic-based applications.
Collapse
Affiliation(s)
- Yingdan Zhang
- Singapore Centre for Environmental Life Sciences Engineering, Nanyang Technological University, Singapore.
| | | | | | | |
Collapse
|
12
|
Catazaro J, Caprez A, Guru A, Swanson D, Powers R. Functional evolution of PLP-dependent enzymes based on active-site structural similarities. Proteins 2014; 82:2597-608. [PMID: 24920327 DOI: 10.1002/prot.24624] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2014] [Revised: 05/30/2014] [Accepted: 06/05/2014] [Indexed: 12/29/2022]
Abstract
Families of distantly related proteins typically have very low sequence identity, which hinders evolutionary analysis and functional annotation. Slowly evolving features of proteins, such as an active site, are therefore valuable for annotating putative and distantly related proteins. To date, a complete evolutionary analysis of the functional relationship of an entire enzyme family based on active-site structural similarities has not yet been undertaken. Pyridoxal-5'-phosphate (PLP) dependent enzymes are primordial enzymes that diversified in the last universal ancestor. Using the comparison of protein active site structures (CPASS) software and database, we show that the active site structures of PLP-dependent enzymes can be used to infer evolutionary relationships based on functional similarity. The enzymes successfully clustered together based on substrate specificity, function, and three-dimensional-fold. This study demonstrates the value of using active site structures for functional evolutionary analysis and the effectiveness of CPASS.
Collapse
Affiliation(s)
- Jonathan Catazaro
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska, 68588-0304
| | | | | | | | | |
Collapse
|
13
|
Islam MA, Waller AS, Hug LA, Provart NJ, Edwards EA, Mahadevan R. New insights into Dehalococcoides mccartyi metabolism from a reconstructed metabolic network-based systems-level analysis of D. mccartyi transcriptomes. PLoS One 2014; 9:e94808. [PMID: 24733489 PMCID: PMC3986231 DOI: 10.1371/journal.pone.0094808] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 03/19/2014] [Indexed: 12/16/2022] Open
Abstract
Organohalide respiration, mediated by Dehalococcoides mccartyi, is a useful bioremediation process that transforms ground water pollutants and known human carcinogens such as trichloroethene and vinyl chloride into benign ethenes. Successful application of this process depends on the fundamental understanding of the respiration and metabolism of D. mccartyi. Reductive dehalogenases, encoded by rdhA genes of these anaerobic bacteria, exclusively catalyze organohalide respiration and drive metabolism. To better elucidate D. mccartyi metabolism and physiology, we analyzed available transcriptomic data for a pure isolate (Dehalococcoides mccartyi strain 195) and a mixed microbial consortium (KB-1) using the previously developed pan-genome-scale reconstructed metabolic network of D. mccartyi. The transcriptomic data, together with available proteomic data helped confirm transcription and expression of the majority genes in D. mccartyi genomes. A composite genome of two highly similar D. mccartyi strains (KB-1 Dhc) from the KB-1 metagenome sequence was constructed, and operon prediction was conducted for this composite genome and other single genomes. This operon analysis, together with the quality threshold clustering analysis of transcriptomic data helped generate experimentally testable hypotheses regarding the function of a number of hypothetical proteins and the poorly understood mechanism of energy conservation in D. mccartyi. We also identified functionally enriched important clusters (13 for strain 195 and 11 for KB-1 Dhc) of co-expressed metabolic genes using information from the reconstructed metabolic network. This analysis highlighted some metabolic genes and processes, including lipid metabolism, energy metabolism, and transport that potentially play important roles in organohalide respiration. Overall, this study shows the importance of an organism's metabolic reconstruction in analyzing various "omics" data to obtain improved understanding of the metabolism and physiology of the organism.
Collapse
Affiliation(s)
- M. Ahsanul Islam
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
| | - Alison S. Waller
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Laura A. Hug
- Department of Earth and Planetary Science, University of California, Berkeley, California, United States of America
| | - Nicholas J. Provart
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Elizabeth A. Edwards
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomaterials and Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
14
|
Rachkevych N, Sybirna K, Boyko S, Boretsky Y, Sibirny A. Improving the efficiency of plasmid transformation in Shewanella oneidensis MR-1 by removing ClaI restriction site. J Microbiol Methods 2014; 99:35-7. [PMID: 24462975 DOI: 10.1016/j.mimet.2014.01.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2013] [Revised: 01/10/2014] [Accepted: 01/14/2014] [Indexed: 10/25/2022]
Abstract
Here we demonstrate that elimination of ClaI restriction site from the sequence of a plasmid DNA increases the efficiency of transformation of Shewanella oneidensis MR-1 significantly. To achieve reliable transformation of S. oneidensis MR-1 plasmids either lacking ClaI site or isolated from primary transformants of S. oneidensis should be used.
Collapse
Affiliation(s)
- Nazarii Rachkevych
- Department of Molecular Genetics and Biotechnology, Institute of Cell Biology, NAS of Ukraine, Drahomanov Street 14/16, 79005 Lviv, Ukraine
| | - Kateryna Sybirna
- IBiTec-S, SB2SM, LMB (UMR CNRS 8221), DSV, CEA, 91191 Gif-sur-Yvette, France
| | - Solomiya Boyko
- Department of Molecular Genetics and Biotechnology, Institute of Cell Biology, NAS of Ukraine, Drahomanov Street 14/16, 79005 Lviv, Ukraine
| | - Yuriy Boretsky
- Department of Molecular Genetics and Biotechnology, Institute of Cell Biology, NAS of Ukraine, Drahomanov Street 14/16, 79005 Lviv, Ukraine
| | - Andriy Sibirny
- Department of Molecular Genetics and Biotechnology, Institute of Cell Biology, NAS of Ukraine, Drahomanov Street 14/16, 79005 Lviv, Ukraine; Department of Biotechnology and Microbiology, University of Rzeszow, Zelwerowicza 4, 35-601 Rzeszow, Poland.
| |
Collapse
|
15
|
Kreuzer HW, Hill EA, Moran JJ, Bartholomew RA, Yang H, Hegg EL. Contributions of the [NiFe]- and [FeFe]-hydrogenase to H2production inShewanella oneidensisMR-1 as revealed by isotope ratio analysis of evolved H2. FEMS Microbiol Lett 2014; 352:18-24. [DOI: 10.1111/1574-6968.12361] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2013] [Revised: 11/06/2013] [Accepted: 12/14/2013] [Indexed: 11/29/2022] Open
Affiliation(s)
| | - Eric. A. Hill
- Pacific Northwest National Laboratory; Richland WA USA
| | | | | | - Hui Yang
- Michigan State University; East Lansing MI USA
| | | |
Collapse
|
16
|
Galperin MY, Koonin EV. Comparative Genomics Approaches to Identifying Functionally Related Genes. ALGORITHMS FOR COMPUTATIONAL BIOLOGY 2014. [DOI: 10.1007/978-3-319-07953-0_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
17
|
Wilkins MJ, Kennedy DW, Castelle CJ, Field EK, Stepanauskas R, Fredrickson JK, Konopka AE. Single-cell genomics reveals metabolic strategies for microbial growth and survival in an oligotrophic aquifer. MICROBIOLOGY-SGM 2013; 160:362-372. [PMID: 24324032 DOI: 10.1099/mic.0.073965-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Bacteria from the genus Pedobacter are a major component of microbial assemblages at Hanford Site (a largely decommissioned nuclear production complex) in eastern Washington state, USA, and have been shown to change significantly in abundance in response to the subsurface intrusion of Columbia River water. Here we employed single-cell genomics techniques to shed light on the physiological niche of these micro-organisms. Analysis of four Pedobacter single amplified genomes (SAGs) from Hanford Site sediments revealed a chemoheterotrophic lifestyle, with the potential to exist under both aerobic and microaerophilic conditions via expression of both aa3-type and cbb3-type cytochrome c oxidases. These SAGs encoded a wide range of both intra- and extracellular carbohydrate-active enzymes, potentially enabling the degradation of recalcitrant substrates such as xylan and chitin, and the utilization of more labile sugars such as mannose and fucose. Coupled to these enzymes, a diversity of transporters and sugar-binding molecules were involved in the uptake of carbon from the extracellular local environment. The SAGs were enriched in TonB-dependent receptors, which play a key role in uptake of substrates resulting from degradation of recalcitrant carbon. Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-Cas mechanisms for resisting viral infections were identified in all SAGs. These data demonstrate the potential mechanisms utilized for persistence by heterotrophic micro-organisms in a carbon-limited aquifer, and hint at potential linkages between observed Pedobacter abundance shifts within the 300 Area (in the south-eastern corner of the site) subsurface and biogeochemical shifts associated with Columbia River water intrusion.
Collapse
|
18
|
Exploring the roles of DNA methylation in the metal-reducing bacterium Shewanella oneidensis MR-1. J Bacteriol 2013; 195:4966-74. [PMID: 23995632 DOI: 10.1128/jb.00935-13] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
We performed whole-genome analyses of DNA methylation in Shewanella oneidensis MR-1 to examine its possible role in regulating gene expression and other cellular processes. Single-molecule real-time (SMRT) sequencing revealed extensive methylation of adenine (N6mA) throughout the genome. These methylated bases were located in five sequence motifs, including three novel targets for type I restriction/modification enzymes. The sequence motifs targeted by putative methyltranferases were determined via SMRT sequencing of gene knockout mutants. In addition, we found that S. oneidensis MR-1 cultures grown under various culture conditions displayed different DNA methylation patterns. However, the small number of differentially methylated sites could not be directly linked to the much larger number of differentially expressed genes under these conditions, suggesting that DNA methylation is not a major regulator of gene expression in S. oneidensis MR-1. The enrichment of methylated GATC motifs in the origin of replication indicates that DNA methylation may regulate genome replication in a manner similar to that seen in Escherichia coli. Furthermore, comparative analyses suggest that many Gammaproteobacteria, including all members of the Shewanellaceae family, may also utilize DNA methylation to regulate genome replication.
Collapse
|
19
|
Impact of Solar Radiation on Gene Expression in Bacteria. Proteomes 2013; 1:70-86. [PMID: 28250399 PMCID: PMC5302746 DOI: 10.3390/proteomes1020070] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Revised: 06/21/2013] [Accepted: 07/02/2013] [Indexed: 11/17/2022] Open
Abstract
Microorganisms often regulate their gene expression at the level of transcription and/or translation in response to solar radiation. In this review, we present the use of both transcriptomics and proteomics to advance knowledge in the field of bacterial response to damaging radiation. Those studies pertain to diverse application areas such as fundamental microbiology, water treatment, microbial ecology and astrobiology. Even though it has been demonstrated that mRNA abundance is not always consistent with the protein regulation, we present here an exhaustive review on how bacteria regulate their gene expression at both transcription and translation levels to enable biomarkers identification and comparison of gene regulation from one bacterial species to another.
Collapse
|
20
|
Higdon R, Haynes W, Stanberry L, Stewart E, Yandl G, Howard C, Broomall W, Kolker N, Kolker E. Unraveling the Complexities of Life Sciences Data. BIG DATA 2013; 1:42-50. [PMID: 27447037 DOI: 10.1089/big.2012.1505] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
The life sciences have entered into the realm of big data and data-enabled science, where data can either empower or overwhelm. These data bring the challenges of the 5 Vs of big data: volume, veracity, velocity, variety, and value. Both independently and through our involvement with DELSA Global (Data-Enabled Life Sciences Alliance, DELSAglobal.org), the Kolker Lab ( kolkerlab.org ) is creating partnerships that identify data challenges and solve community needs. We specialize in solutions to complex biological data challenges, as exemplified by the community resource of MOPED (Model Organism Protein Expression Database, MOPED.proteinspire.org ) and the analysis pipeline of SPIRE (Systematic Protein Investigative Research Environment, PROTEINSPIRE.org ). Our collaborative work extends into the computationally intensive tasks of analysis and visualization of millions of protein sequences through innovative implementations of sequence alignment algorithms and creation of the Protein Sequence Universe tool (PSU). Pushing into the future together with our collaborators, our lab is pursuing integration of multi-omics data and exploration of biological pathways, as well as assigning function to proteins and porting solutions to the cloud. Big data have come to the life sciences; discovering the knowledge in the data will bring breakthroughs and benefits.
Collapse
Affiliation(s)
- Roger Higdon
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Winston Haynes
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Larissa Stanberry
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Elizabeth Stewart
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Gregory Yandl
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Chris Howard
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 5 Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
| | - William Broomall
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Natali Kolker
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
| | - Eugene Kolker
- 1 Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute , Seattle, Washington
- 2 High-throughput Analysis Core, Center for Developmental Therapeutics, Seattle Children's Research Institute , Seattle, Washington
- 3 Predictive Analytics, Seattle Children's , Seattle, Washington
- 4 Data-Enabled Life Sciences Alliance (DELSA Global) , Seattle, Washington
- 6 Departments of Biomedical Informatics & Medical Education and Pediatrics, University of Washington , Seattle, Washington
| |
Collapse
|
21
|
Wong CCL, Cociorva D, Miller CA, Schmidt A, Monell C, Aebersold R, Yates JR. Proteomics of Pyrococcus furiosus (Pfu): Identification of Extracted Proteins by Three Independent Methods. J Proteome Res 2013; 12:763-70. [PMID: 23298259 DOI: 10.1021/pr300840j] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Pyrococcus furiosus (Pfu) is an excellent organism to generate reference samples for proteomics laboratories because of its moderately sized genome and very little sequence duplication within the genome. We demonstrated a stable and consistent method to prepare proteins in bulk that eliminates growth and preparation as a source of uncertainty in the standard. We performed several proteomic studies in different laboratories using each laboratory's specific workflow as well as separate and integrated data analysis. This study demonstrated that a Pfu whole cell lysate provides suitable protein sample complexity to not only validate proteomic methods, work flows, and benchmark new instruments but also to facilitate comparison of experimental data generated over time and across instruments or laboratories.
Collapse
Affiliation(s)
- Catherine C L Wong
- Department of Chemical Physiology, The Scripps Research Institute, 10550 North Torrey Pines Road, SR-11, La Jolla, California 92037, USA
| | | | | | | | | | | | | |
Collapse
|
22
|
Systematic characterization of hypothetical proteins in Synechocystis sp. PCC 6803 reveals proteins functionally relevant to stress responses. Gene 2012; 512:6-15. [PMID: 23063937 DOI: 10.1016/j.gene.2012.10.004] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2012] [Revised: 10/03/2012] [Accepted: 10/05/2012] [Indexed: 11/22/2022]
Abstract
We described here a global detection and functional inference of hypothetical proteins involved in stress response in Synechocystis sp. PCC 6803. In the study, we first applied an iTRAQ-LC-MS/MS based quantitative proteomics to the Synechocystis cells grown under five stress conditions. The analysis detected a total of 807 hypothetical proteins with high confidence. Among them, 480 were differentially regulated. We then applied a Weighted Gene Co-expression Network Analysis approach to construct transcriptional networks for Synechocystis under nutrient limitation and osmotic stress conditions using transcriptome datasets. The analysis showed that 305 and 467 coding genes of hypothetical proteins were functionally relevant to nutrient limitation and osmotic stress, respectively. A comparison of responsive hypothetical proteins to all stress conditions allowed identification of 22 hypothetical proteins commonly responsive to all stresses, suggesting they may be part of the core stress responses in Synechocystis. Finally, functional inference of these core stress responsive proteins using both sequence similarity and non-similarity approaches was conducted. The study provided new insights into the stress response networks in Synechocystis, and also demonstrated that a combination of experimental "OMICS" and bioinformatics methodologies could improve functional annotation for hypothetical proteins.
Collapse
|
23
|
Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012; 7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open
Abstract
Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.
Collapse
Affiliation(s)
| | - Marcus B. Jones
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Sadhana Chauhan
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Samuel O. Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - James A. Sanford
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matthew E. Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Heather M. Brewer
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Charles Ansong
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Bryan C. Frank
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Scott N. Peterson
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Vladimir L. Motin
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Joshua N. Adkins
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- * E-mail:
| |
Collapse
|
24
|
Correlation of macroscopic aggregation behavior and microscopic adhesion properties of bacteria strains using a dimensionless Tabor's parameter. J Colloid Interface Sci 2012; 374:70-6. [PMID: 22360985 DOI: 10.1016/j.jcis.2012.01.032] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2011] [Revised: 01/17/2012] [Accepted: 01/19/2012] [Indexed: 11/21/2022]
Abstract
Macroscopic adhesion-aggregation, floc formation, and subsequent transportation of microorganisms in porous media are closely related to the microscopic behavior and properties of individual cells. The classical Tabor's parameter in colloidal science is modified to correlate the macroscopic aggregation and microscopic adhesion properties of microorganisms. Seven bacterial strains relevant to wastewater treatment and bioremediation were characterized in terms of their macroscopic aggregation index (AI) using an optical method, and their microscopic coupled adhesion and deformation properties using atomic force microscopy (AFM). Single cells were indented to measure the range and magnitude of the repulsive-attractive intersurface forces, elastic modulus, thickness and density of the cellular surface substances (CSS). The strong correlation suggests that cost and time effective microscopic AFM characterization is capable of making reliable prediction of macroscopic behavior.
Collapse
|
25
|
Parker DL, Borer P, Bernier-Latmani R. The Response of Shewanella oneidensis MR-1 to Cr(III) Toxicity Differs from that to Cr(VI). Front Microbiol 2011; 2:223. [PMID: 22125549 PMCID: PMC3221395 DOI: 10.3389/fmicb.2011.00223] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Accepted: 10/20/2011] [Indexed: 11/18/2022] Open
Abstract
Chromium is a contaminant of concern that is found in drinking water in its soluble, hexavalent form [Cr(VI)] and that is known to be toxic to eukaryotes and prokaryotes. Trivalent chromium [Cr(III)] is thought to be largely harmless due to its low solubility and inability to enter cells. Previous work has suggested that Cr(III) may also be toxic to microorganisms but the mechanism remained elusive. In this work, we probe the toxicity of Cr(III) to Shewanella oneidensis MR-1, a bacterium able to reduce Cr(VI) to Cr(III) and compare it to Cr(VI) toxicity. We found evidence for Cr(III) toxicity both under Cr(VI) reducing conditions, during which Cr(III) was generated by the reduction process, and under non-reducing conditions, when Cr(III) was amended exogenously. Interestingly, cells exposed to Cr(III) (200 μM) experienced rapid viability loss as measured by colony forming units on Luria–Bertani (LB) agar plates. In contrast, they maintained some enzymatic activity and cellular integrity. Cr(VI)-exposed cells exhibited loss of enzymatic activity and cell lysis. The loss of viability of Cr(III)-exposed cells was not due to membrane damage or to enzymatic inhibition but rather appeared to be associated with an abnormal morphology that consisted of chains of membrane-enclosed units of irregular size. Exposure of abnormal cells to growth conditions resulted in membrane damage and cell death, which is consistent with the observed viability loss on LB plates. While Cr(VI) was taken up intracellularly and caused cell lysis, the toxic effect of Cr(III) appeared to be associated with extracellular interactions leading to an ultimately lethal cell morphology.
Collapse
Affiliation(s)
- Dorothy L Parker
- Environmental Microbiology Laboratory, Ecole Polytechnique Fédérale de Lausanne Lausanne, Switzerland
| | | | | |
Collapse
|
26
|
Okamoto A, Yamada K. Proteome driven re-evaluation and functional annotation of the Streptococcus pyogenes SF370 genome. BMC Microbiol 2011; 11:249. [PMID: 22070424 PMCID: PMC3224786 DOI: 10.1186/1471-2180-11-249] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2011] [Accepted: 11/10/2011] [Indexed: 12/02/2022] Open
Abstract
Background The genome data of Streptococcus pyogenes SF370 has been widely used by many researchers and provides a vast array of interesting findings. Nevertheless, approximately 40% of genes remain classified as hypothetical proteins, and several coding sequences (CDSs) have been unrecognized. In this study, we attempted a shotgun proteomic analysis with a six-frame database that was independent of genome annotation. Results Nine proteins encoded by novel ORFs were found by shotgun proteomic analysis, and their specific mRNAs were verified by reverse transcriptional PCR (RT-PCR). We also provided functional annotations for hypothetical genes using proteomic analysis from three different culture conditions that were separated into three fractions: supernatant, soluble, and insoluble. Consequently, we identified 567 proteins on re-evaluation of the proteomic data using an in-house database comprising 1,697 annotated and nine non-annotated CDSs. We provided functional annotations for 126 hypothetical proteins (18.9% out of the 668 hypothetical proteins) based on their cellular fractions and expression profiles under different culture conditions. Conclusions The list of amino acid sequences that were annotated by genome analysis contains outdated information and unrecognized protein-coding sequences. We suggest that the six-frame database derived from actual DNA sequences be used for reliable proteomic analysis. In addition, the experimental evidence from functional proteomic analysis is useful for the re-evaluation of previously sequenced genomes.
Collapse
Affiliation(s)
- Akira Okamoto
- Department of Molecular Bacteriology, Nagoya University Graduate School of Medicine, 65 Tsurumai-cho, Showa-ku, Nagoya, Aichi 466-8550, Japan.
| | | |
Collapse
|
27
|
Cyclic voltammetric analysis of the electron transfer of Shewanella oneidensis MR-1 and nanofilament and cytochrome knock-out mutants. Bioelectrochemistry 2011; 81:74-80. [DOI: 10.1016/j.bioelechem.2011.02.006] [Citation(s) in RCA: 139] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2010] [Revised: 02/04/2011] [Accepted: 02/15/2011] [Indexed: 11/18/2022]
|
28
|
Shewanella oneidensis MR-1 sensory box protein involved in aerobic and anoxic growth. Appl Environ Microbiol 2011; 77:4647-56. [PMID: 21602393 DOI: 10.1128/aem.03003-10] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Although little is known of potential function for conserved signaling proteins, it is hypothesized that such proteins play important roles to coordinate cellular responses to environmental stimuli. In order to elucidate the function of a putative sensory box protein (PAS domains) in Shewanella oneidensis MR-1, the physiological role of SO3389 was characterized. The predicted open reading frame (ORF) encodes a putative sensory box protein that has PAS, GGDEF, and EAL domains, and an in-frame deletion mutant was constructed (ΔSO3389) with approximately 95% of the ORF deleted. Under aerated conditions, wild-type and mutant cultures had similar growth rates, but the mutant culture had a lower growth rate under static, aerobic conditions. Oxygen consumption rates were lower for mutant cultures (1.5-fold), and wild-type cultures also maintained lower dissolved oxygen concentrations under aerated growth conditions. When transferred to anoxic conditions, the mutant did not grow with fumarate, iron(III), or dimethyl sulfoxide (DMSO) as electron acceptors. Biochemical assays demonstrated the expression of different c-type cytochromes as well as decreased fumarate reductase activity in the mutant transferred to anoxic growth conditions. Transcriptomic studies showed the inability of the mutant to up-express and down-express genes, including c-type cytochromes (e.g., SO4047/SO4048, SO3285/SO3286), reductases (e.g., SO0768, SO1427), and potential regulators (e.g., SO1329). The complemented strain was able to grow when transferred from aerobic to anoxic growth conditions with the tested electron acceptors. The modeled structure for the SO3389 PAS domains was highly similar to the crystal structures of FAD-binding PAS domains that are known O2/redox sensors. Based on physiological, genomic, and bioinformatic results, we suggest that the sensory box protein, SO3389, is an O2/redox sensor that is involved in optimization of aerobic growth and transitions to anoxia in S. oneidensis MR-1.
Collapse
|
29
|
Mueller RS, Dill BD, Pan C, Belnap CP, Thomas BC, VerBerkmoes NC, Hettich RL, Banfield JF. Proteome changes in the initial bacterial colonist during ecological succession in an acid mine drainage biofilm community. Environ Microbiol 2011; 13:2279-92. [PMID: 21518216 DOI: 10.1111/j.1462-2920.2011.02486.x] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Proteomes of acid mine drainage biofilms at different stages of ecological succession were examined to understand microbial responses to changing community membership. We evaluated the degree of reproducibility of the community proteomes between samples of the same growth stage and found stable and predictable protein abundance patterns across time and sampling space, allowing for a set of 50 classifier proteins to be identified for use in predicting growth stages of undefined communities. Additionally, physiological changes in the dominant species, Leptospirillum Group II, were analysed as biofilms mature. During early growth stages, this population responds to abiotic stresses related to growth on the acid mine drainage solution. Enzymes involved in protein synthesis, cell division and utilization of 1- and 2-carbon compounds were more abundant in early growth stages, suggesting rapid growth and a reorganization of metabolism during biofilm initiation. As biofilms thicken and diversify, external stresses arise from competition for dwindling resources, which may inhibit cell division of Leptospirillum Group II through the SOS response. This population also represses translation and synthesizes more complex carbohydrates and amino acids in mature biofilms. These findings provide unprecedented insight into the physiological changes that may result from competitive interactions within communities in natural environments.
Collapse
Affiliation(s)
- Ryan S Mueller
- University of California, Berkeley, California 94720, USA.
| | | | | | | | | | | | | | | |
Collapse
|
30
|
Li W, Ji L, Goya J, Tan G, Wysocki VH. SQID: an intensity-incorporated protein identification algorithm for tandem mass spectrometry. J Proteome Res 2011; 10:1593-602. [PMID: 21204564 DOI: 10.1021/pr100959y] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
To interpret LC-MS/MS data in proteomics, most popular protein identification algorithms primarily use predicted fragment m/z values to assign peptide sequences to fragmentation spectra. The intensity information is often undervalued, because it is not as easy to predict and incorporate into algorithms. Nevertheless, the use of intensity to assist peptide identification is an attractive prospect and can potentially improve the confidence of matches and generate more identifications. On the basis of our previously reported study of fragmentation intensity patterns, we developed a protein identification algorithm, SeQuence IDentfication (SQID), that makes use of the coarse intensity from a statistical analysis. The scoring scheme was validated by comparing with Sequest and X!Tandem using three data sets, and the results indicate an improvement in the number of identified peptides, including unique peptides that are not identified by Sequest or X!Tandem. The software and source code are available under the GNU GPL license at http://quiz2.chem.arizona.edu/wysocki/bioinformatics.htm.
Collapse
Affiliation(s)
- Wenzhou Li
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona 85721, United States
| | | | | | | | | |
Collapse
|
31
|
Hather G, Higdon R, Bauman A, von Haller PD, Kolker E. Estimating false discovery rates for peptide and protein identification using randomized databases. Proteomics 2010; 10:2369-76. [PMID: 20391536 DOI: 10.1002/pmic.200900619] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
MS-based proteomics characterizes protein contents of biological samples. The most common approach is to first match observed MS/MS peptide spectra against theoretical spectra from a protein sequence database and then to score these matches. The false discovery rate (FDR) can be estimated as a function of the score by searching together the protein sequence database and its randomized version and comparing the score distributions of the randomized versus nonrandomized matches. This work introduces a straightforward isotonic regression-based method to estimate the cumulative FDRs and local FDRs (LFDRs) of peptide identification. Our isotonic method not only performed as well as other methods used for comparison, but also has the advantages of being: (i) monotonic in the score, (ii) computationally simple, and (iii) not dependent on assumptions about score distributions. We demonstrate the flexibility of our approach by using it to estimate FDRs and LFDRs for protein identification using summaries of the peptide spectra scores. We reconfirmed that several of these methods were superior to a two-peptide rule. Finally, by estimating both the FDRs and LFDRs, we showed for both peptide and protein identification, moderate FDR values (5%) corresponded to large LFDR values (53 and 60%).
Collapse
Affiliation(s)
- Gregory Hather
- Bioinformatics & High-throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, WA 98101, USA
| | | | | | | | | |
Collapse
|
32
|
Galperin MY, Koonin EV. From complete genome sequence to 'complete' understanding? Trends Biotechnol 2010; 28:398-406. [PMID: 20647113 PMCID: PMC3065831 DOI: 10.1016/j.tibtech.2010.05.006] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Revised: 05/18/2010] [Accepted: 05/28/2010] [Indexed: 12/29/2022]
Abstract
The rapidly accumulating genome sequence data allow researchers to address fundamental biological questions that were not even asked just a few years ago. A major problem in genomics is the widening gap between the rapid progress in genome sequencing and the comparatively slow progress in the functional characterization of sequenced genomes. Here we discuss two key questions of genome biology: whether we need more genomes, and how deep is our understanding of biology based on genomic analysis. We argue that overly specific annotations of gene functions are often less useful than the more generic, but also more robust, functional assignments based on protein family classification. We also discuss problems in understanding the functions of the remaining 'conserved hypothetical' genes.
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
33
|
Karpinets TV, Romine MF, Schmoyer DD, Kora GH, Syed MH, Leuze MR, Serres MH, Park BH, Samatova NF, Uberbacher EC. Shewanella knowledgebase: integration of the experimental data and computational predictions suggests a biological role for transcription of intergenic regions. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2010; 2010:baq012. [PMID: 20627862 PMCID: PMC2911847 DOI: 10.1093/database/baq012] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Shewanellae are facultative γ-proteobacteria whose remarkable respiratory versatility has resulted in interest in their utility for bioremediation of heavy metals and radionuclides and for energy generation in microbial fuel cells. Extensive experimental efforts over the last several years and the availability of 21 sequenced Shewanella genomes made it possible to collect and integrate a wealth of information on the genus into one public resource providing new avenues for making biological discoveries and for developing a system level understanding of the cellular processes. The Shewanella knowledgebase was established in 2005 to provide a framework for integrated genome-based studies on Shewanella ecophysiology. The present version of the knowledgebase provides access to a diverse set of experimental and genomic data along with tools for curation of genome annotations and visualization and integration of genomic data with experimental data. As a demonstration of the utility of this resource, we examined a single microarray data set from Shewanella oneidensis MR-1 for new insights into regulatory processes. The integrated analysis of the data predicted a new type of bacterial transcriptional regulation involving co-transcription of the intergenic region with the downstream gene and suggested a biological role for co-transcription that likely prevents the binding of a regulator of the upstream gene to the regulator binding site located in the intergenic region. Database URL:http://shewanella-knowledgebase.org:8080/Shewanella/ or http://spruce.ornl.gov:8080/Shewanella/
Collapse
|
34
|
Finney L, Chishti Y, Khare T, Giometti C, Levina A, Lay PA, Vogt S. Imaging metals in proteins by combining electrophoresis with rapid x-ray fluorescence mapping. ACS Chem Biol 2010; 5:577-87. [PMID: 20392082 DOI: 10.1021/cb1000263] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Growing evidence points toward a very dynamic role for metals in biology. This suggests that physiological circumstance may mandate metal ion redistribution among ligands. This work addresses a critical need for technology that detects, identifies, and measures the metal-containing components of complex biological matrixes. We describe a direct, user-friendly approach for identifying and quantifying metal-protein adducts in complex samples using native- or SDS-PAGE, blotting, and rapid synchrotron X-ray fluorescence mapping with micro-XANES (X-ray absorption near-edge structure) of entire blots. The identification and quantification of each metal bound to a protein spot has been demonstrated, and the technique has been applied in two exemplary cases. In the first, the speciation of the in vitro binding of exogenous chromium to blood serum proteins was influenced markedly by both the oxidation state of chromium exposed to the serum proteins and the treatment conditions, which is of relevance to the biochemistry of Cr dietary supplements. In the second case, in vivo changes in endogenous metal speciation were examined to probe the influence of oxygen depletion on iron speciation in Shewanella oneidensis.
Collapse
Affiliation(s)
- Lydia Finney
- Biosciences Division and
- X-ray Sciences Division, Argonne National Laboratory, Argonne, Illinois
| | | | | | | | - Aviva Levina
- School of Chemistry, The University of Sydney, NSW 2006, Australia
| | - Peter A Lay
- School of Chemistry, The University of Sydney, NSW 2006, Australia
| | - Stefan Vogt
- X-ray Sciences Division, Argonne National Laboratory, Argonne, Illinois
| |
Collapse
|
35
|
Complete genome sequence of the electricity-producing "Thermincola potens" strain JR. J Bacteriol 2010; 192:4078-9. [PMID: 20525829 DOI: 10.1128/jb.00044-10] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
"Thermincola potens" strain JR is one of the first Gram-positive dissimilatory metal-reducing bacteria (DMRB) for which there is a complete genome sequence. Consistent with the physiology of this organism, preliminary annotation revealed an abundance of multiheme c-type cytochromes that are putatively associated with the periplasm and cell surface in a Gram-positive bacterium. Here we report the complete genome sequence of strain JR.
Collapse
|
36
|
Aono E, Baba T, Ara T, Nishi T, Nakamichi T, Inamoto E, Toyonaga H, Hasegawa M, Takai Y, Okumura Y, Baba M, Tomita M, Kato C, Oshima T, Nakasone K, Mori H. Complete genome sequence and comparative analysis of Shewanella violacea, a psychrophilic and piezophilic bacterium from deep sea floor sediments. MOLECULAR BIOSYSTEMS 2010; 6:1216-26. [PMID: 20458400 DOI: 10.1039/c000396d] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Remineralization of organic matter in deep-sea sediments is important in oceanic biogeochemical cycles, and bacteria play a major role in this process. Shewanella violacea DSS12 is a psychrophilic and piezophilic gamma-proteobacterium that was isolated from the surface layer of deep sea sediment at a depth of 5110 m. Here, we report the complete genome sequence of S. violacea and comparative analysis with the genome of S. oneidensis MR-1, isolated from sediments of a freshwater lake. Unlike S. oneidensis, this deep-sea Shewanella possesses very few terminal reductases for anaerobic respiration and no c-type cytochromes or outer membrane proteins involved in respiratory Fe(iii) reduction, which is characteristic of most Shewanella species. Instead, the S. violacea genome contains more terminal oxidases for aerobic respiration and a much greater number of putative secreted proteases and polysaccharases, in particular, for hydrolysis of collagen, cellulose and chitin, than are encoded in S. oneidensis. Transporters and assimilatory reductases for nitrate and nitrite, and nitric oxide-detoxifying mechanisms (flavohemoglobin and flavorubredoxin) are found in S. violacea. Comparative analysis of the S. violacea genome revealed the respiratory adaptation of this bacterium to aerobiosis, leading to predominantly aerobic oxidation of organic matter in surface sediments, as well as its ability to efficiently use diverse organic matter and to assimilate inorganic nitrogen as a survival strategy in the nutrient-poor deep-sea floor.
Collapse
Affiliation(s)
- Eiji Aono
- Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Nara, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Louie B, Bergen S, Higdon R, Kolker E. Quantifying protein function specificity in the gene ontology. Stand Genomic Sci 2010; 2:238-44. [PMID: 21304708 PMCID: PMC3035283 DOI: 10.4056/sigs.561626] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Quantitative or numerical metrics of protein function specificity made possible by the Gene Ontology are useful in that they enable development of distance or similarity measures between protein functions. Here we describe how to calculate four measures of function specificity for GO terms: 1) number of ancestor terms; 2) number of offspring terms; 3) proportion of terms; and 4) Information Content (IC). We discuss the relationship between the metrics and the strengths and weaknesses of each.
Collapse
Affiliation(s)
- Brenton Louie
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children’s Research Institute, Seattle, WA, USA
- Predictive Analytics, Seattle Children’s Hospital
| | - Silas Bergen
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children’s Research Institute, Seattle, WA, USA
- Department of Biostatistics, University of Washington School of Public Health
| | - Roger Higdon
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children’s Research Institute, Seattle, WA, USA
- Predictive Analytics, Seattle Children’s Hospital
| | - Eugene Kolker
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children’s Research Institute, Seattle, WA, USA
- Predictive Analytics, Seattle Children’s Hospital
- Biomedical and Health Informatics Division, Department of Medical Education and Biomedical Informatics, University of Washington School of Medicine, Seattle, WA, USA
- Corresponding author:
| |
Collapse
|
38
|
Comparative analysis of membranous proteomics of Shewanella decolorationis S12 grown with azo compound or Fe (III) citrate as sole terminal electron acceptor. Appl Microbiol Biotechnol 2010; 86:1513-23. [DOI: 10.1007/s00253-010-2475-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2009] [Revised: 01/23/2010] [Accepted: 01/25/2010] [Indexed: 02/06/2023]
|
39
|
Bretschger O, Cheung A, Mansfeld F, Nealson K. Comparative Microbial Fuel Cell Evaluations of Shewanella spp. ELECTROANAL 2010. [DOI: 10.1002/elan.200800016] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
40
|
Galperin MY, Higdon R, Kolker E. Interplay of heritage and habitat in the distribution of bacterial signal transduction systems. MOLECULAR BIOSYSTEMS 2010; 6:721-8. [PMID: 20237650 DOI: 10.1039/b908047c] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Comparative analysis of the complete genome sequences from a variety of poorly studied organisms aims at predicting ecological and behavioral properties of these organisms and helping in characterizing their habitats. This task requires finding appropriate descriptors that could be correlated with the core traits of each system and would allow meaningful comparisons. Using the relatively simple bacterial models, first attempts have been made to introduce suitable metrics to describe the complexity of organism's signaling machinery, which included introducing the "bacterial IQ" score. Here, we use an updated census of prokaryotic signal transduction systems to improve this parameter and evaluate its consistency within selected bacterial phyla. We also introduce a more elaborate descriptor, a set of profiles of relative abundance of members of each family of signal transduction proteins encoded in each genome. We show that these family profiles are well conserved within each genus and are often consistent within families of bacteria. Thus, they reflect evolutionary relationships between organisms as well as individual adaptations of each organism to its specific ecological niche.
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, Maryland 20894, USA.
| | | | | |
Collapse
|
41
|
Armengaud J. Proteogenomics and systems biology: quest for the ultimate missing parts. Expert Rev Proteomics 2010; 7:65-77. [DOI: 10.1586/epr.09.104] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]
|
42
|
Higdon R, Louie B, Kolker E. Modeling sequence and function similarity between proteins for protein functional annotation. PROCEEDINGS OF THE ... INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING 2010; 2010:499-502. [PMID: 25101328 DOI: 10.1145/1851476.1851548] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
A common task in biological research is to predict function for proteins by comparing sequences between proteins of known and unknown function. This is often done using pair-wise sequence alignment algorithms (e.g. BLAST). A problem with this approach is the assumption of a simple equivalence between a minimum sequence similarity threshold and the function similarity between proteins. This assumption is based on the binary concept of homology in that proteins are or not homologous. The relationship between sequence and function however is more complex as well as pertinent for predicting protein function, e.g. evaluating BLAST alignments or developing training sets for profile models based on functional rather than homologous groupings. Our motivation for this study was to model sequence and function similarity between proteins to gain insights into the "sequence-function similarity relationship between proteins for predicting function. Using our model we found that function similarity generally increases with sequence similarity but with a high degree of variability. This result has implications for pair-wise approaches in that it appears sequence similarity must be very high to ensure high function similarity. Profile models which enable higher sensitivity are a potential solution. However, multiple sequences alignments (a necessary prerequisite) are a problem in that current algorithms have difficulty aligning sequences with very low sequence similarity, which is common in our data set, or are intractable for high numbers of sequences. Given the importance of predicting protein function and the need for multiple sequence alignments, algorithms for accomplishing this task should be further refined and developed.
Collapse
Affiliation(s)
- Roger Higdon
- Seattle Children's Research Institute, 1900 Ninth Avenue, Seattle, WA 98101,
| | - Brenton Louie
- Seattle Children's Research Institute, 1900 Ninth Avenue, Seattle, WA 98101,
| | - Eugene Kolker
- Seattle Children's Research Institute, 1900 Ninth Avenue, Seattle, WA 98101,
| |
Collapse
|
43
|
Cristóbal HA, Schmidt A, Kothe E, Breccia J, Abate CM. Characterization of inducible cold-active β-glucosidases from the psychrotolerant bacterium Shewanella sp. G5 isolated from a sub-Antarctic ecosystem. Enzyme Microb Technol 2009. [DOI: 10.1016/j.enzmictec.2009.06.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
44
|
Louie B, Higdon R, Kolker E. A statistical model of protein sequence similarity and function similarity reveals overly-specific function predictions. PLoS One 2009; 4:e7546. [PMID: 19844580 PMCID: PMC2760442 DOI: 10.1371/journal.pone.0007546] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Accepted: 09/13/2009] [Indexed: 12/02/2022] Open
Abstract
Background Predicting protein function from primary sequence is an important open problem in modern biology. Not only are there many thousands of proteins of unknown function, current approaches for predicting function must be improved upon. One problem in particular is overly-specific function predictions which we address here with a new statistical model of the relationship between protein sequence similarity and protein function similarity. Methodology Our statistical model is based on sets of proteins with experimentally validated functions and numeric measures of function specificity and function similarity derived from the Gene Ontology. The model predicts the similarity of function between two proteins given their amino acid sequence similarity measured by statistics from the BLAST sequence alignment algorithm. A novel aspect of our model is that it predicts the degree of function similarity shared between two proteins over a continuous range of sequence similarity, facilitating prediction of function with an appropriate level of specificity. Significance Our model shows nearly exact function similarity for proteins with high sequence similarity (bit score >244.7, e-value >1e−62, non-redundant NCBI protein database (NRDB)) and only small likelihood of specific function match for proteins with low sequence similarity (bit score <54.6, e-value <1e−05, NRDB). For sequence similarity ranges in between our annotation model shows an increasing relationship between function similarity and sequence similarity, but with considerable variability. We applied the model to a large set of proteins of unknown function, and predicted functions for thousands of these proteins ranging from general to very specific. We also applied the model to a data set of proteins with previously assigned, specific functions that were electronically based. We show that, on average, these prior function predictions are more specific (quite possibly overly-specific) compared to predictions from our model that is based on proteins with experimentally determined function.
Collapse
Affiliation(s)
- Brenton Louie
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington, United States of America
- Predictive Analytics, Seattle Children's Hospital, University of Washington School of Medicine, Seattle, Washington, United States of America
| | - Roger Higdon
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington, United States of America
- Predictive Analytics, Seattle Children's Hospital, University of Washington School of Medicine, Seattle, Washington, United States of America
| | - Eugene Kolker
- Bioinformatics and High-throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, Washington, United States of America
- Predictive Analytics, Seattle Children's Hospital, University of Washington School of Medicine, Seattle, Washington, United States of America
- Biomedical and Health Informatics Division, Department of Medical Education and Biomedical Informatics, University of Washington School of Medicine, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
45
|
Unell M, Abraham PE, Shah M, Zhang B, Rückert C, VerBerkmoes NC, Jansson JK. Impact of phenolic substrate and growth temperature on the Arthrobacter chlorophenolicus proteome. J Proteome Res 2009; 8:1953-64. [PMID: 19714879 DOI: 10.1021/pr800897c] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We compared the Arthrobacter chlorophenolicus proteome during growth on 4-chlorophenol, 4-nitrophenol, or phenol at 5 and 28 degrees C, both for the wild-type and a mutant strain with mass spectrometry based proteomics. A label-free workflow employing spectral counting identified 3749 proteins across all growth conditions, representing over 70% of the predicted genome and 739 of these proteins form the core proteome. Statistically significant differences were found in the proteomes of cells grown under different conditions including differentiation of hundreds of unknown proteins. The 4-chlorophenol-degradation pathway was confirmed, but not that for phenol.
Collapse
Affiliation(s)
- Maria Unell
- Department of Microbiology, Swedish University of Agricultural Sciences, Box 7025, 750 07 Uppsala, Sweden
| | | | | | | | | | | | | |
Collapse
|
46
|
Proteogenomic monitoring of Geobacter physiology during stimulated uranium bioremediation. Appl Environ Microbiol 2009; 75:6591-9. [PMID: 19717633 DOI: 10.1128/aem.01064-09] [Citation(s) in RCA: 116] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Implementation of uranium bioremediation requires methods for monitoring the membership and activities of the subsurface microbial communities that are responsible for reduction of soluble U(VI) to insoluble U(IV). Here, we report a proteomics-based approach for simultaneously documenting the strain membership and microbial physiology of the dominant Geobacter community members during in situ acetate amendment of the U-contaminated Rifle, CO, aquifer. Three planktonic Geobacter-dominated samples were obtained from two wells down-gradient of acetate addition. Over 2,500 proteins from each of these samples were identified by matching liquid chromatography-tandem mass spectrometry spectra to peptides predicted from seven isolate Geobacter genomes. Genome-specific peptides indicate early proliferation of multiple M21 and Geobacter bemidjiensis-like strains and later possible emergence of M21 and G. bemidjiensis-like strains more closely related to Geobacter lovleyi. Throughout biostimulation, the proteome is dominated by enzymes that convert acetate to acetyl-coenzyme A and pyruvate for central metabolism, while abundant peptides matching tricarboxylic acid cycle proteins and ATP synthase subunits were also detected, indicating the importance of energy generation during the period of rapid growth following the start of biostimulation. Evolving Geobacter strain composition may be linked to changes in protein abundance over the course of biostimulation and may reflect changes in metabolic functioning. Thus, metagenomics-independent community proteogenomics can be used to diagnose the status of the subsurface consortia upon which remediation biotechnology relies.
Collapse
|
47
|
Jiang SY, Christoffels A, Ramamoorthy R, Ramachandran S. Expansion mechanisms and functional annotations of hypothetical genes in the rice genome. PLANT PHYSIOLOGY 2009; 150:1997-2008. [PMID: 19535473 PMCID: PMC2719134 DOI: 10.1104/pp.109.139402] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2009] [Accepted: 06/15/2009] [Indexed: 05/18/2023]
Abstract
In each completely sequenced genome, 30% to 50% of genes are annotated as uncharacterized hypothetical genes. In the rice (Oryza sativa) genome, 10,918 hypothetical genes were annotated in the latest version (release 6) of the Michigan State University rice genome annotation. We have implemented an integrative approach to analyze their duplication/expansion and function. The analyses show that tandem/segmental duplication and transposition/retrotransposition have significantly contributed to the expansion of hypothetical genes despite their different contribution rates. A total of 3,769 hypothetical genes have been detected from retrogene, tandem, segmental, Pack-MULE, or long terminated direct repeat-related duplication/expansion. The nonsynonymous substitutions per site and synonymous substitutions per site analyses showed that 21.65% of them were still functional, accounting for 7.47% of total hypothetical genes. Global expression analyses have identified 1,672 expressed hypothetical genes. Among them, 415 genes might function in a developmental stage-specific manner. Antisense strand expression and small RNA analyses have demonstrated that a high percentage of these hypothetical genes might play important roles in negatively regulating gene expression. Homologous searches against Arabidopsis (Arabidopsis thaliana), maize (Zea mays), sorghum (Sorghum bicolor), and indica rice genomes suggest that most of the hypothetical genes could be annotated from recently evolved genomic sequences. These data advance the understanding of rice hypothetical genes as being involved in lineage-specific expansion and that they function in a specific developmental stage. Our analyses also provide a valuable means to facilitate the characterization and functional annotation of hypothetical genes in other organisms.
Collapse
Affiliation(s)
- Shu-Ye Jiang
- Rice Functional Genomics Group, Temasek Life Sciences Laboratory, National University of Singapore, Singapore 117604
| | | | | | | |
Collapse
|
48
|
Alliance of proteomics and genomics to unravel the specificities of Sahara bacterium Deinococcus deserti. PLoS Genet 2009; 5:e1000434. [PMID: 19370165 PMCID: PMC2669436 DOI: 10.1371/journal.pgen.1000434] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2008] [Accepted: 02/23/2009] [Indexed: 12/22/2022] Open
Abstract
To better understand adaptation to harsh conditions encountered in hot arid deserts, we report the first complete genome sequence and proteome analysis of a bacterium, Deinococcus deserti VCD115, isolated from Sahara surface sand. Its genome consists of a 2.8-Mb chromosome and three large plasmids of 324 kb, 314 kb, and 396 kb. Accurate primary genome annotation of its 3,455 genes was guided by extensive proteome shotgun analysis. From the large corpus of MS/MS spectra recorded, 1,348 proteins were uncovered and semiquantified by spectral counting. Among the highly detected proteins are several orphans and Deinococcus-specific proteins of unknown function. The alliance of proteomics and genomics high-throughput techniques allowed identification of 15 unpredicted genes and, surprisingly, reversal of incorrectly predicted orientation of 11 genes. Reversal of orientation of two Deinococcus-specific radiation-induced genes, ddrC and ddrH, and identification in D. deserti of supplementary genes involved in manganese import extend our knowledge of the radiotolerance toolbox of Deinococcaceae. Additional genes involved in nutrient import and in DNA repair (i.e., two extra recA, three translesion DNA polymerases, a photolyase) were also identified and found to be expressed under standard growth conditions, and, for these DNA repair genes, after exposure of the cells to UV. The supplementary nutrient import and DNA repair genes are likely important for survival and adaptation of D. deserti to its nutrient-poor, dry, and UV-exposed extreme environment. D. deserti belongs to the Deinococcaceae, a family of bacteria characterized by an exceptional ability to withstand the lethal effects of DNA-damaging agents, including ionizing radiation, UV light, and desiccation. It was isolated from Sahara surface sands, an extreme and nutrient-poor environment, regularly exposed to intense UV radiation, cycles of extreme temperatures, and desiccation. The evolution of organisms that are able to survive acute irradiation doses of 15,000 Gy is difficult to explain given the apparent absence of highly radioactive habitats on Earth over geologic time. Thus, it seems more likely that the natural selection pressure for the evolution of radiation-resistant bacteria was chronic exposure to nonradioactive forms of DNA damage, in particular those promoted by desiccation. Here, we report the first complete genome sequence of a bacterium, D. deserti VCD115, isolated from hot, arid desert surface sand. Accurate genome annotation of its 3,455 genes was guided by extensive proteome analysis in which 1,348 proteins were uncovered after growth in standard conditions. Supplementary genes involved in manganese import, in nutrient import, and in DNA repair were identified and are likely important for survival and adaptation of D. deserti to its hostile environment.
Collapse
|
49
|
Elias DA, Mukhopadhyay A, Joachimiak MP, Drury EC, Redding AM, Yen HCB, Fields MW, Hazen TC, Arkin AP, Keasling JD, Wall JD. Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation. Nucleic Acids Res 2009; 37:2926-39. [PMID: 19293273 PMCID: PMC2685097 DOI: 10.1093/nar/gkp164] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Hypothetical (HyP) and conserved HyP genes account for >30% of sequenced bacterial genomes. For the sulfate-reducing bacterium Desulfovibrio vulgaris Hildenborough, 347 of the 3634 genes were annotated as conserved HyP (9.5%) along with 887 HyP genes (24.4%). Given the large fraction of the genome, it is plausible that some of these genes serve critical cellular roles. The study goals were to determine which genes were expressed and provide a more functionally based annotation. To accomplish this, expression profiles of 1234 HyP and conserved genes were used from transcriptomic datasets of 11 environmental stresses, complemented with shotgun LC–MS/MS and AMT tag proteomic data. Genes were divided into putatively polycistronic operons and those predicted to be monocistronic, then classified by basal expression levels and grouped according to changes in expression for one or multiple stresses. One thousand two hundred and twelve of these genes were transcribed with 786 producing detectable proteins. There was no evidence for expression of 17 predicted genes. Except for the latter, monocistronic gene annotation was expanded using the above criteria along with matching Clusters of Orthologous Groups. Polycistronic genes were annotated in the same manner with inferences from their proximity to more confidently annotated genes. Two targeted deletion mutants were used as test cases to determine the relevance of the inferred functional annotations.
Collapse
Affiliation(s)
- Dwayne A Elias
- Department of Biochemistry, Virtual Institute for Microbial Stress and Survival, University of Missouri-Columbia, Columbia, MO 65211, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Lacerda CMR, Reardon KF. Environmental proteomics: applications of proteome profiling in environmental microbiology and biotechnology. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:75-87. [PMID: 19279070 DOI: 10.1093/bfgp/elp005] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
In this review, we present the use of proteomics to advance knowledge in the field of environmental biotechnology, including studies of bacterial physiology, metabolism and ecology. Bacteria are widely applied in environmental biotechnology for their ability to catalyze dehalogenation, methanogenesis, denitrification and sulfate reduction, among others. Their tolerance to radiation and toxic compounds is also of importance. Proteomics has an important role in helping uncover the pathways behind these cellular processes. Environmental samples are often highly complex, which makes proteome studies in this field especially challenging. Some of these challenges are the lack of genome sequences for the vast majority of environmental bacteria, difficulties in isolating bacteria and proteins from certain environments, and the presence of complex microbial communities. Despite these challenges, proteomics offers a unique dynamic view into cellular function. We present examples of environmental proteomics of model organisms, and then discuss metaproteomics (microbial community proteomics), which has the potential to provide insights into the function of a community without isolating organisms. Finally, the environmental proteomics literature is summarized as it pertains to the specific application areas of wastewater treatment, metabolic engineering, microbial ecology and environmental stress responses.
Collapse
Affiliation(s)
- Carla M R Lacerda
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO 80523-1370, USA
| | | |
Collapse
|