101
|
Trichler SA, Bulla SC, Mahajan N, Lunsford KV, Pendarvis K, Nanduri B, McCarthy FM, Bulla C. Identification of canine platelet proteins separated by differential detergent fractionation for nonelectrophoretic proteomics analyzed by Gene Ontology and pathways analysis. VETERINARY MEDICINE-RESEARCH AND REPORTS 2014; 5:1-9. [PMID: 32670841 PMCID: PMC7337207 DOI: 10.2147/vmrr.s47127] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Accepted: 04/23/2014] [Indexed: 01/20/2023]
Abstract
During platelet development, proteins necessary for the many functional roles of the platelet are stored within cytoplasmic granules. Platelets have also been shown to take up and store many plasma proteins into granules. This makes the platelet a potential novel source of biomarkers for many disease states. Approaches to sample preparation for proteomic studies for biomarkers search vary. Compared with traditional two-dimensional polyacrylamide gel electrophoresis systems, nonelectrophoretic proteomics methods that employ offline protein fractionation methods such as the differential detergent fractionation method have clear advantages. Here we report a proteomic survey of the canine platelet proteome using differential detergent fractionation coupled with mass spectrometry and functional modeling of the canine platelet proteins identified. A total of 5,974 unique proteins were identified from platelets, of which only 298 (5%) had previous experimental evidence of in vivo expression. The use of offline prefractionation of canine proteins by differential detergent fractionation resulted in greater proteome coverage as compared with previous reports. This initial study contributes to a broader understanding of canine platelet biology and aids functional research, identification of potential treatment targets and biomarkers, and sets a new standard for the resting platelet proteome.
Collapse
Affiliation(s)
| | | | | | - Kari V Lunsford
- Department of Clinical Sciences and Animal Health Center, College of Veterinary Medicine, Mississippi State University, Mississippi State, MS
| | - Ken Pendarvis
- Department of Veterinary Science and Microbiology, University of Arizona, Tucson, AZ
| | - Bindu Nanduri
- Department of Biological Sciences, College of Veterinary Medicine, Mississippi State University, Mississippi State, MS.,Institute for Genomics, Biocomputing and Biotechnology, Starkville, MS, USA
| | - Fiona M McCarthy
- Department of Veterinary Science and Microbiology, University of Arizona, Tucson, AZ
| | - Camilo Bulla
- Department of Pathobiology and Population Medicine
| |
Collapse
|
102
|
de Crécy-Lagard V. Variations in metabolic pathways create challenges for automated metabolic reconstructions: Examples from the tetrahydrofolate synthesis pathway. Comput Struct Biotechnol J 2014; 10:41-50. [PMID: 25210598 PMCID: PMC4151868 DOI: 10.1016/j.csbj.2014.05.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
The availability of thousands of sequenced genomes has revealed the diversity of biochemical solutions to similar chemical problems. Even for molecules at the heart of metabolism, such as cofactors, the pathway enzymes first discovered in model organisms like Escherichia coli or Saccharomyces cerevisiae are often not universally conserved. Tetrahydrofolate (THF) (or its close relative tetrahydromethanopterin) is a universal and essential C1-carrier that most microbes and plants synthesize de novo. The THF biosynthesis pathway and enzymes are, however, not universal and alternate solutions are found for most steps, making this pathway a challenge to annotate automatically in many genomes. Comparing THF pathway reconstructions and functional annotations of a chosen set of folate synthesis genes in specific prokaryotes revealed the strengths and weaknesses of different microbial annotation platforms. This analysis revealed that most current platforms fail in metabolic reconstruction of variant pathways. However, all the pieces are in place to quickly correct these deficiencies if the different databases were built on each other's strengths.
Collapse
Affiliation(s)
- Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science and Genetics Institute, University of Florida, Gainesville, FL, United States
| |
Collapse
|
103
|
Sorokina M, Stam M, Médigue C, Lespinet O, Vallenet D. Profiling the orphan enzymes. Biol Direct 2014; 9:10. [PMID: 24906382 PMCID: PMC4084501 DOI: 10.1186/1745-6150-9-10] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Accepted: 05/29/2014] [Indexed: 11/10/2022] Open
Abstract
The emergence of Next Generation Sequencing generates an incredible amount of sequence and great potential for new enzyme discovery. Despite this huge amount of data and the profusion of bioinformatic methods for function prediction, a large part of known enzyme activities is still lacking an associated protein sequence. These particular activities are called "orphan enzymes". The present review proposes an update of previous surveys on orphan enzymes by mining the current content of public databases. While the percentage of orphan enzyme activities has decreased from 38% to 22% in ten years, there are still more than 1,000 orphans among the 5,000 entries of the Enzyme Commission (EC) classification. Taking into account all the reactions present in metabolic databases, this proportion dramatically increases to reach nearly 50% of orphans and many of them are not associated to a known pathway. We extended our survey to "local orphan enzymes" that are activities which have no representative sequence in a given clade, but have at least one in organisms belonging to other clades. We observe an important bias in Archaea and find that in general more than 30% of the EC activities have incomplete sequence information in at least one superkingdom. To estimate if candidate proteins for local orphans could be retrieved by homology search, we applied a simple strategy based on the PRIAM software and noticed that candidates may be proposed for an important fraction of local orphan enzymes. Finally, by studying relation between protein domains and catalyzed activities, it appears that newly discovered enzymes are mostly associated with already known enzyme domains. Thus, the exploration of the promiscuity and the multifunctional aspect of known enzyme families may solve part of the orphan enzyme issue. We conclude this review with a presentation of recent initiatives in finding proteins for orphan enzymes and in extending the enzyme world by the discovery of new activities.
Collapse
Affiliation(s)
- Maria Sorokina
- Direction des Sciences du Vivant, Commissariat à l'Energie Atomique (CEA), Institut de Génomique, Genoscope, Laboratoire d'Analyses Bioinformatiques pour la Génomique et le Métabolisme, 2 rue Gaston Crémieux, 91057 Evry, France.
| | | | | | | | | |
Collapse
|
104
|
Islam MA, Waller AS, Hug LA, Provart NJ, Edwards EA, Mahadevan R. New insights into Dehalococcoides mccartyi metabolism from a reconstructed metabolic network-based systems-level analysis of D. mccartyi transcriptomes. PLoS One 2014; 9:e94808. [PMID: 24733489 PMCID: PMC3986231 DOI: 10.1371/journal.pone.0094808] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Accepted: 03/19/2014] [Indexed: 12/16/2022] Open
Abstract
Organohalide respiration, mediated by Dehalococcoides mccartyi, is a useful bioremediation process that transforms ground water pollutants and known human carcinogens such as trichloroethene and vinyl chloride into benign ethenes. Successful application of this process depends on the fundamental understanding of the respiration and metabolism of D. mccartyi. Reductive dehalogenases, encoded by rdhA genes of these anaerobic bacteria, exclusively catalyze organohalide respiration and drive metabolism. To better elucidate D. mccartyi metabolism and physiology, we analyzed available transcriptomic data for a pure isolate (Dehalococcoides mccartyi strain 195) and a mixed microbial consortium (KB-1) using the previously developed pan-genome-scale reconstructed metabolic network of D. mccartyi. The transcriptomic data, together with available proteomic data helped confirm transcription and expression of the majority genes in D. mccartyi genomes. A composite genome of two highly similar D. mccartyi strains (KB-1 Dhc) from the KB-1 metagenome sequence was constructed, and operon prediction was conducted for this composite genome and other single genomes. This operon analysis, together with the quality threshold clustering analysis of transcriptomic data helped generate experimentally testable hypotheses regarding the function of a number of hypothetical proteins and the poorly understood mechanism of energy conservation in D. mccartyi. We also identified functionally enriched important clusters (13 for strain 195 and 11 for KB-1 Dhc) of co-expressed metabolic genes using information from the reconstructed metabolic network. This analysis highlighted some metabolic genes and processes, including lipid metabolism, energy metabolism, and transport that potentially play important roles in organohalide respiration. Overall, this study shows the importance of an organism's metabolic reconstruction in analyzing various "omics" data to obtain improved understanding of the metabolism and physiology of the organism.
Collapse
Affiliation(s)
- M. Ahsanul Islam
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
| | - Alison S. Waller
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Laura A. Hug
- Department of Earth and Planetary Science, University of California, Berkeley, California, United States of America
| | - Nicholas J. Provart
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Elizabeth A. Edwards
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- Department of Cell and Systems Biology, University of Toronto, Toronto, Ontario, Canada
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- Institute of Biomaterials and Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
105
|
Siller M, Goyal S, Yoshimoto FK, Xiao Y, Wei S, Guengerich FP. Oxidation of endogenous N-arachidonoylserotonin by human cytochrome P450 2U1. J Biol Chem 2014; 289:10476-10487. [PMID: 24563460 DOI: 10.1074/jbc.m114.550004] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Cytochrome P450 (P450) 2U1 has been shown to be expressed, at the mRNA level, in human thymus, brain, and several other tissues. Recombinant P450 2U1 was purified and used as a reagent in a metabolomic search for substrates in bovine brain. In addition to fatty acid oxidation reactions, an oxidation of endogenous N-arachidonoylserotonin was characterized. Subsequent NMR and mass spectrometry and chemical synthesis showed that the main product was the result of C-2 oxidation of the indole ring, in contrast to other human P450s that generated different products. N-Arachidonoylserotonin, first synthesized chemically and described as an inhibitor of fatty acid amide hydrolase, had previously been found in porcine and mouse intestine; we demonstrated its presence in bovine and human brain samples. The product (2-oxo) was 4-fold less active than N-arachidonoylserotonin in inhibiting fatty acid amide hydrolase. The rate of oxidation of N-arachidonoylserotonin was similar to that of arachidonic acid, one of the previously identified fatty acid substrates of P450 2U1. The demonstration of the oxidation of N-arachidonoylserotonin by P450 2U1 suggests a possible role in human brain and possibly other sites.
Collapse
Affiliation(s)
- Michal Siller
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146; Pharmacology and Toxicology Division, Institute of Molecular and Translational Medicine, Palacky University Olomouc, 775 15 Olomouc, Czech Republic
| | - Sandeep Goyal
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146
| | - Francis K Yoshimoto
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146
| | - Yi Xiao
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146
| | - Shouzou Wei
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146
| | - F Peter Guengerich
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, Tennessee 37232-0146.
| |
Collapse
|
106
|
de la Tour CB, Passot FM, Toueille M, Mirabella B, Guérin P, Blanchard L, Servant P, de Groot A, Sommer S, Armengaud J. Comparative proteomics reveals key proteins recruited at the nucleoid of Deinococcus after irradiation-induced DNA damage. Proteomics 2013; 13:3457-69. [PMID: 24307635 DOI: 10.1002/pmic.201300249] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2013] [Revised: 10/19/2013] [Accepted: 10/23/2013] [Indexed: 11/09/2022]
Abstract
The nucleoids of radiation-resistant Deinococcus species show a high degree of compaction maintained after ionizing irradiation. We identified proteins recruited after irradiation in nucleoids of Deinococcus radiodurans and Deinococcus deserti by means of comparative proteomics. Proteins in nucleoid-enriched fractions from unirradiated and irradiated Deinococcus were identified and semiquantified by shotgun proteomics. The ssDNA-binding protein SSB, DNA gyrase subunits GyrA and GyrB, DNA topoisomerase I, RecA recombinase, UvrA excinuclease, RecQ helicase, DdrA, DdrB, and DdrD proteins were found in significantly higher amounts in irradiated nucleoids of both Deinococcus species. We observed, by immunofluorescence microscopy, the subcellular localization of these proteins in D. radiodurans, showing for the first time the recruitment of the DdrD protein into the D. radiodurans nucleoid. We specifically followed the kinetics of recruitment of RecA, DdrA, and DdrD to the nucleoid after irradiation. Remarkably, RecA proteins formed irregular filament-like structures 1 h after irradiation, before being redistributed throughout the cells by 3 h post-irradiation. Comparable dynamics of DdrD localization were observed, suggesting a possible functional interaction between RecA and DdrD. Several proteins involved in nucleotide synthesis were also seen in higher quantities in the nucleoids of irradiated cells, indicative of the existence of a mechanism for orchestrating the presence of proteins involved in DNA metabolism in nucleoids in response to massive DNA damage. All MS data have been deposited in the ProteomeXchange with identifier PXD00196 (http://proteomecentral.proteomexchange.org/dataset/PXD000196).
Collapse
|
107
|
Hwang WC, Bakolitsa C, Punta M, Coggill PC, Bateman A, Axelrod HL, Rawlings ND, Sedova M, Peterson SN, Eberhardt RY, Aravind L, Pascual J, Godzik A. LUD, a new protein domain associated with lactate utilization. BMC Bioinformatics 2013; 14:341. [PMID: 24274019 PMCID: PMC3924224 DOI: 10.1186/1471-2105-14-341] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 11/19/2013] [Indexed: 11/24/2022] Open
Abstract
Background A novel highly conserved protein domain, DUF162 [Pfam: PF02589], can be mapped to two proteins: LutB and LutC. Both proteins are encoded by a highly conserved LutABC operon, which has been implicated in lactate utilization in bacteria. Based on our analysis of its sequence, structure, and recent experimental evidence reported by other groups, we hereby redefine DUF162 as the LUD domain family. Results JCSG solved the first crystal structure [PDB:2G40] from the LUD domain family: LutC protein, encoded by ORF DR_1909, of Deinococcus radiodurans. LutC shares features with domains in the functionally diverse ISOCOT superfamily. We have observed that the LUD domain has an increased abundance in the human gut microbiome. Conclusions We propose a model for the substrate and cofactor binding and regulation in LUD domain. The significance of LUD-containing proteins in the human gut microbiome, and the implication of lactate metabolism in the radiation-resistance of Deinococcus radiodurans are discussed.
Collapse
Affiliation(s)
- William C Hwang
- Joint Center for Structural Genomics, La Jolla, CA 92037, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
108
|
Revealing the hidden functional diversity of an enzyme family. Nat Chem Biol 2013; 10:42-9. [DOI: 10.1038/nchembio.1387] [Citation(s) in RCA: 81] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Accepted: 10/02/2013] [Indexed: 11/08/2022]
|
109
|
Tchigvintsev A, Tchigvintsev D, Flick R, Popovic A, Dong A, Xu X, Brown G, Lu W, Wu H, Cui H, Dombrowski L, Joo JC, Beloglazova N, Min J, Savchenko A, Caudy AA, Rabinowitz JD, Murzin AG, Yakunin AF. Biochemical and structural studies of conserved Maf proteins revealed nucleotide pyrophosphatases with a preference for modified nucleotides. ACTA ACUST UNITED AC 2013; 20:1386-98. [PMID: 24210219 PMCID: PMC3899018 DOI: 10.1016/j.chembiol.2013.09.011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Revised: 09/06/2013] [Accepted: 09/13/2013] [Indexed: 11/17/2022]
Abstract
Maf (for multicopy associated filamentation) proteins represent a large family of conserved proteins implicated in cell division arrest but whose biochemical activity remains unknown. Here, we show that the prokaryotic and eukaryotic Maf proteins exhibit nucleotide pyrophosphatase activity against 5-methyl-UTP, pseudo-UTP, 5-methyl-CTP, and 7-methyl-GTP, which represent the most abundant modified bases in all organisms, as well as against canonical nucleotides dTTP, UTP, and CTP. Overexpression of the Maf protein YhdE in E. coli cells increased intracellular levels of dTMP and UMP, confirming that dTTP and UTP are the in vivo substrates of this protein. Crystal structures and site-directed mutagenesis of Maf proteins revealed the determinants of their activity and substrate specificity. Thus, pyrophosphatase activity of Maf proteins toward canonical and modified nucleotides might provide the molecular mechanism for a dual role of these proteins in cell division arrest and house cleaning. Maf proteins represent a family of nucleoside triphosphate pyrophosphatases Maf proteins hydrolyze the canonical nucleotides dTTP, UTP, and CTP Maf proteins are also active against m5UTP, m5CTP, pseudo-UTP, and m7GTP Maf structures reveal the molecular mechanisms of their substrate selectivity
Collapse
Affiliation(s)
- Anatoli Tchigvintsev
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON M5S 3E5, Canada
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
110
|
Baran R, Ivanova NN, Jose N, Garcia-Pichel F, Kyrpides NC, Gugger M, Northen TR. Functional genomics of novel secondary metabolites from diverse cyanobacteria using untargeted metabolomics. Mar Drugs 2013; 11:3617-31. [PMID: 24084783 PMCID: PMC3826126 DOI: 10.3390/md11103617] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2013] [Revised: 08/21/2013] [Accepted: 09/09/2013] [Indexed: 12/22/2022] Open
Abstract
Mass spectrometry-based metabolomics has become a powerful tool for the detection of metabolites in complex biological systems and for the identification of novel metabolites. We previously identified a number of unexpected metabolites in the cyanobacterium Synechococcus sp. PCC 7002, such as histidine betaine, its derivatives and several unusual oligosaccharides. To test for the presence of these compounds and to assess the diversity of small polar metabolites in other cyanobacteria, we profiled cell extracts of nine strains representing much of the morphological and evolutionary diversification of this phylum. Spectral features in raw metabolite profiles obtained by normal phase liquid chromatography coupled to mass spectrometry (MS) were manually curated so that chemical formulae of metabolites could be assigned. For putative identification, retention times and MS/MS spectra were cross-referenced with those of standards or available sprectral library records. Overall, we detected 264 distinct metabolites. These included indeed different betaines, oligosaccharides as well as additional unidentified metabolites with chemical formulae not present in databases of metabolism. Some of these metabolites were detected only in a single strain, but some were present in more than one. Genomic interrogation of the strains revealed that generally, presence of a given metabolite corresponded well with the presence of its biosynthetic genes, if known. Our results show the potential of combining metabolite profiling and genomics for the identification of novel biosynthetic genes.
Collapse
Affiliation(s)
- Richard Baran
- Life Sciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, MS977R0181A, Berkeley, CA 94720, USA; E-Mails: (R.B.); (N.J.)
| | - Natalia N. Ivanova
- DOE Joint Genome Institute, Walnut Creek, CA 94598, USA; E-Mails: (N.N.I.); (N.C.K.)
| | - Nick Jose
- Life Sciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, MS977R0181A, Berkeley, CA 94720, USA; E-Mails: (R.B.); (N.J.)
| | - Ferran Garcia-Pichel
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA; E-Mail:
| | - Nikos C. Kyrpides
- DOE Joint Genome Institute, Walnut Creek, CA 94598, USA; E-Mails: (N.N.I.); (N.C.K.)
| | - Muriel Gugger
- Institute Pasteur, Collection of Cyanobacteria, Paris Cedex 15 75724, France; E-Mail:
| | - Trent R. Northen
- Life Sciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron Rd, MS977R0181A, Berkeley, CA 94720, USA; E-Mails: (R.B.); (N.J.)
- Author to whom correspondence should be addressed; E-Mail: ; Tel.: +1-510-486-5240; Fax: +1-510-486-4545
| |
Collapse
|
111
|
Semi-automated curation of metabolic models via flux balance analysis: a case study with Mycoplasma gallisepticum. PLoS Comput Biol 2013; 9:e1003208. [PMID: 24039564 PMCID: PMC3764002 DOI: 10.1371/journal.pcbi.1003208] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2012] [Accepted: 07/19/2013] [Indexed: 11/19/2022] Open
Abstract
Primarily used for metabolic engineering and synthetic biology, genome-scale metabolic modeling shows tremendous potential as a tool for fundamental research and curation of metabolism. Through a novel integration of flux balance analysis and genetic algorithms, a strategy to curate metabolic networks and facilitate identification of metabolic pathways that may not be directly inferable solely from genome annotation was developed. Specifically, metabolites involved in unknown reactions can be determined, and potentially erroneous pathways can be identified. The procedure developed allows for new fundamental insight into metabolism, as well as acting as a semi-automated curation methodology for genome-scale metabolic modeling. To validate the methodology, a genome-scale metabolic model for the bacterium Mycoplasma gallisepticum was created. Several reactions not predicted by the genome annotation were postulated and validated via the literature. The model predicted an average growth rate of 0.358±0.12, closely matching the experimentally determined growth rate of M. gallisepticum of 0.244±0.03. This work presents a powerful algorithm for facilitating the identification and curation of previously known and new metabolic pathways, as well as presenting the first genome-scale reconstruction of M. gallisepticum. Flux balance analysis (FBA) is a powerful approach for genome-scale metabolic modeling. It provides metabolic engineers with a tool for manipulating, predicting, and optimizing metabolism for biotechnological and biomedical purposes. However, we posit that it can also be used as tool for fundamental research in understanding and curating metabolic networks. Specifically, by using a genetic algorithm integrated with FBA, we developed a curation approach to identify missing reactions, incomplete reactions, and erroneous reactions. Additionally, it was possible to take advantage of the ensemble information from the genetic algorithm to identify the most critical reactions for curation. We tested our strategy using Mycoplasma gallisepticum as our model organism. Using the genome annotation as the basis, the preliminary genome-scale metabolic model consisted of 446 metabolites involved in 380 reactions. Carrying out our analysis, we found over 80 incorrect reactions and 16 missing reactions. Based upon the guidance of the algorithm, we were able to curate and resolve all discrepancies. The model predicted an average bacterial growth rate of 0.358±0.12 h−1 compared to the experimentally observed 0.244±0.03 h−1. Thus, our approach facilitated the curation of a genome-scale metabolic network and generated a high quality metabolic model.
Collapse
|
112
|
A novel predicted calcium-regulated kinase family implicated in neurological disorders. PLoS One 2013; 8:e66427. [PMID: 23840464 PMCID: PMC3696010 DOI: 10.1371/journal.pone.0066427] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Accepted: 05/08/2013] [Indexed: 12/03/2022] Open
Abstract
The catalogues of protein kinases, the essential effectors of cellular signaling, have been charted in Metazoan genomes for a decade now. Yet, surprisingly, using bioinformatics tools, we predicted protein kinase structure for proteins coded by five related human genes and their Metazoan homologues, the FAM69 family. Analysis of three-dimensional structure models and conservation of the classic catalytic motifs of protein kinases present in four out of five human FAM69 proteins suggests they might have retained catalytic phosphotransferase activity. An EF-hand Ca2+-binding domain in FAM69A and FAM69B proteins, inserted within the structure of the kinase domain, suggests they may function as Ca2+-dependent kinases. The FAM69 genes, FAM69A, FAM69B, FAM69C, C3ORF58 (DIA1) and CXORF36 (DIA1R), are by large uncharacterised molecularly, yet linked to several neurological disorders in genetics studies. The C3ORF58 gene is found deleted in autism, and resides in the Golgi. Unusually high cysteine content and presence of signal peptides in some of the family members suggest that FAM69 proteins may be involved in phosphorylation of proteins in the secretory pathway and/or of extracellular proteins.
Collapse
|
113
|
Larhlimi A, Basler G, Grimbs S, Selbig J, Nikoloski Z. Stoichiometric capacitance reveals the theoretical capabilities of metabolic networks. ACTA ACUST UNITED AC 2013; 28:i502-i508. [PMID: 22962473 PMCID: PMC3436808 DOI: 10.1093/bioinformatics/bts381] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Motivation: Metabolic engineering aims at modulating the capabilities of metabolic networks by changing the activity of biochemical reactions. The existing constraint-based approaches for metabolic engineering have proven useful, but are limited only to reactions catalogued in various pathway databases. Results: We consider the alternative of designing synthetic strategies which can be used not only to characterize the maximum theoretically possible product yield but also to engineer networks with optimal conversion capability by using a suitable biochemically feasible reaction called ‘stoichiometric capacitance’. In addition, we provide a theoretical solution for decomposing a given stoichiometric capacitance over a set of known enzymatic reactions. We determine the stoichiometric capacitance for genome-scale metabolic networks of 10 organisms from different kingdoms of life and examine its implications for the alterations in flux variability patterns. Our empirical findings suggest that the theoretical capacity of metabolic networks comes at a cost of dramatic system's changes. Contact:larhlimi@mpimp-golm.mpg.de, or nikoloski@mpimp-golm.mpg.de Supplementary Information:Supplementary tables are available at Bioinformatics online.
Collapse
Affiliation(s)
- Abdelhalim Larhlimi
- Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, D-14476 Potsdam, Germany.
| | | | | | | | | |
Collapse
|
114
|
Van Schaftingen E, Rzem R, Marbaix A, Collard F, Veiga-da-Cunha M, Linster CL. Metabolite proofreading, a neglected aspect of intermediary metabolism. J Inherit Metab Dis 2013; 36:427-34. [PMID: 23296366 DOI: 10.1007/s10545-012-9571-1] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/06/2012] [Revised: 11/26/2012] [Accepted: 11/29/2012] [Indexed: 10/27/2022]
Abstract
Enzymes of intermediary metabolism are less specific than what is usually assumed: they often act on metabolites that are not their 'true' substrate, making abnormal metabolites that may be deleterious if they accumulate. Some of these abnormal metabolites are reconverted to normal metabolites by repair enzymes, which play therefore a role akin to the proofreading activities of DNA polymerases and aminoacyl-tRNA synthetases. An illustrative example of such repair enzymes is L-2-hydroxyglutarate dehydrogenase, which eliminates a metabolite abnormally made by a Krebs cycle enzyme. Mutations in L-2-hydroxyglutarate dehydrogenase lead to L-2-hydroxyglutaric aciduria, a leukoencephalopathy. Other examples are the epimerase and the ATP-dependent dehydratase that repair hydrated forms of NADH and NADPH; ethylmalonyl-CoA decarboxylase, which eliminates an abnormal metabolite formed by acetyl-CoA carboxylase, an enzyme of fatty acid synthesis; L-pipecolate oxidase, which repairs a metabolite formed by a side activity of an enzyme of L-proline biosynthesis. Metabolite proofreading enzymes are likely quite common, but most of them are still unidentified. A defect in these enzymes may account for new metabolic disorders.
Collapse
|
115
|
Comparative genomics approaches to understanding and manipulating plant metabolism. Curr Opin Biotechnol 2013; 24:278-84. [DOI: 10.1016/j.copbio.2012.07.005] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Revised: 07/29/2012] [Accepted: 07/30/2012] [Indexed: 12/11/2022]
|
116
|
Inferring the metabolism of human orphan metabolites from their metabolic network context affirms human gluconokinase activity. Biochem J 2013; 449:427-35. [PMID: 23067238 DOI: 10.1042/bj20120980] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Metabolic network reconstructions define metabolic information within a target organism and can therefore be used to address incomplete metabolic information. In the present study we used a computational approach to identify human metabolites whose metabolism is incomplete on the basis of their detection in humans but exclusion from the human metabolic network reconstruction RECON 1. Candidate solutions, composed of metabolic reactions capable of explaining the metabolism of these compounds, were then identified computationally from a global biochemical reaction database. Solutions were characterized with respect to how metabolites were incorporated into RECON 1 and their biological relevance. Through detailed case studies we show that biologically plausible non-intuitive hypotheses regarding the metabolism of these compounds can be proposed in a semi-automated manner, in an approach that is similar to de novo network reconstruction. We subsequently experimentally validated one of the proposed hypotheses and report that C9orf103, previously identified as a candidate tumour suppressor gene, encodes a functional human gluconokinase. The results of the present study demonstrate how semi-automatic gap filling can be used to refine and extend metabolic reconstructions, thereby increasing their biological scope. Furthermore, we illustrate how incomplete human metabolic knowledge can be coupled with gene annotation in order to prioritize and confirm gene functions.
Collapse
|
117
|
Baran R, Bowen BP, Price MN, Arkin AP, Deutschbauer AM, Northen TR. Metabolic footprinting of mutant libraries to map metabolite utilization to genotype. ACS Chem Biol 2013; 8:189-99. [PMID: 23082955 DOI: 10.1021/cb300477w] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
The discrepancy between the pace of sequencing and functional characterization of genomes is a major challenge in understanding complex microbial metabolic processes and metabolic interactions in the environment. Here, we identified and validated genes related to the utilization of specific metabolites in bacteria by profiling metabolite utilization in libraries of mutant strains. Untargeted mass spectrometry based metabolomics was used to identify metabolites utilized by Escherichia coli and Shewanella oneidensis MR-1. Targeted high-throughput metabolite profiling of spent media of 8042 individual mutant strains was performed to link utilization to specific genes. Using this approach we identified genes of known function as well as novel transport proteins and enzymes required for the utilization of tested metabolites. Specific examples include two subunits of a predicted ABC transporter encoded by the genes SO1043 and SO1044 required for the utilization of citrulline and a predicted histidase encoded by the gene SO3057 required for the utilization of ergothioneine by S. oneidensis. In vitro assays with purified proteins showed substrate specificity of SO3057 toward ergothioneine and histidine betaine in contrast to substrate specificity of a paralogous histidase SO0098 toward histidine. This generally applicable, high-throughput workflow has the potential both to discover novel metabolic capabilities of microorganisms and to identify the corresponding genes.
Collapse
Affiliation(s)
- Richard Baran
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| | - Benjamin P. Bowen
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| | - Morgan N. Price
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| | - Adam P. Arkin
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| | - Adam M. Deutschbauer
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| | - Trent R. Northen
- Life
Sciences Division and ‡Physical Biosciences Division, Lawrence Berkeley National Laboratory, 1 Cyclotron
Rd., Berkeley, California 94720, United States
| |
Collapse
|
118
|
Blais EM, Chavali AK, Papin JA. Linking genome-scale metabolic modeling and genome annotation. Methods Mol Biol 2013; 985:61-83. [PMID: 23417799 DOI: 10.1007/978-1-62703-299-5_4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Genome-scale metabolic network reconstructions, assembled from annotated genomes, serve as a platform for integrating data from heterogeneous sources and generating hypotheses for further experimental validation. Implementing constraint-based modeling techniques such as flux balance analysis (FBA) on network reconstructions allows for interrogating metabolism at a systems level, which aids in identifying and rectifying gaps in knowledge. With genome sequences for various organisms from prokaryotes to eukaryotes becoming increasingly available, a significant bottleneck lies in the structural and functional annotation of these sequences. Using topologically based and biologically inspired metabolic network refinement, we can better characterize enzymatic functions present in an organism and link annotation of these functions to candidate transcripts; both steps can be experimentally validated.
Collapse
Affiliation(s)
- Edik M Blais
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | | | | |
Collapse
|
119
|
Chen TW, Gan RCR, Wu TH, Huang PJ, Lee CY, Chen YYM, Chen CC, Tang P. FastAnnotator--an efficient transcript annotation web tool. BMC Genomics 2012; 13 Suppl 7:S9. [PMID: 23281853 PMCID: PMC3521244 DOI: 10.1186/1471-2164-13-s7-s9] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Recent developments in high-throughput sequencing (HTS) technologies have made it feasible to sequence the complete transcriptomes of non-model organisms or metatranscriptomes from environmental samples. The challenge after generating hundreds of millions of sequences is to annotate these transcripts and classify the transcripts based on their putative functions. Because many biological scientists lack the knowledge to install Linux-based software packages or maintain databases used for transcript annotation, we developed an automatic annotation tool with an easy-to-use interface. METHODS To elucidate the potential functions of gene transcripts, we integrated well-established annotation tools: Blast2GO, PRIAM and RPS BLAST in a web-based service, FastAnnotator, which can assign Gene Ontology (GO) terms, Enzyme Commission numbers (EC numbers) and functional domains to query sequences. RESULTS Using six transcriptome sequence datasets as examples, we demonstrated the ability of FastAnnotator to assign functional annotations. FastAnnotator annotated 88.1% and 81.3% of the transcripts from the well-studied organisms Caenorhabditis elegans and Streptococcus parasanguinis, respectively. Furthermore, FastAnnotator annotated 62.9%, 20.4%, 53.1% and 42.0% of the sequences from the transcriptomes of sweet potato, clam, amoeba, and Trichomonas vaginalis, respectively, which lack reference genomes. We demonstrated that FastAnnotator can complete the annotation process in a reasonable amount of time and is suitable for the annotation of transcriptomes from model organisms or organisms for which annotated reference genomes are not avaiable. CONCLUSIONS The sequencing process no longer represents the bottleneck in the study of genomics, and automatic annotation tools have become invaluable as the annotation procedure has become the limiting step. We present FastAnnotator, which was an automated annotation web tool designed to efficiently annotate sequences with their gene functions, enzyme functions or domains. FastAnnotator is useful in transcriptome studies and especially for those focusing on non-model organisms or metatranscriptomes. FastAnnotator does not require local installation and is freely available at http://fastannotator.cgu.edu.tw.
Collapse
Affiliation(s)
- Ting-Wen Chen
- Molecular Medicine Research Center, Chang Gung University, Taoyuan, Taiwan
| | | | | | | | | | | | | | | |
Collapse
|
120
|
Crécy-Lagard VD, Phillips G, Grochowski LL, Yacoubi BE, Jenney F, Adams MWW, Murzin AG, White RH. Comparative genomics guided discovery of two missing archaeal enzyme families involved in the biosynthesis of the pterin moiety of tetrahydromethanopterin and tetrahydrofolate. ACS Chem Biol 2012; 7:1807-16. [PMID: 22931285 PMCID: PMC3500442 DOI: 10.1021/cb300342u] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
![]()
C-1 carriers are essential cofactors in all domains of
life, and
in Archaea, these can be derivatives of tetrahydromethanopterin (H4-MPT) or tetrahydrofolate (H4-folate). Their synthesis
requires 6-hydroxymethyl-7,8-dihydropterin diphosphate (6-HMDP) as
the precursor, but the nature of pathways that lead to its formation
were unknown until the recent discovery of the GTP cyclohydrolase
IB/MptA family that catalyzes the first step, the conversion of GTP
to dihydroneopterin 2′,3′-cyclic phosphate or 7,8-dihydroneopterin
triphosphate [El Yacoubi, B.; et al. (2006) J. Biol. Chem., 281, 37586–37593
and Grochowski, L. L.; et al. (2007) Biochemistry46, 6658–6667]. Using a combination of comparative
genomics analyses, heterologous complementation tests, and in vitro assays, we show that the archaeal protein families
COG2098 and COG1634 specify two of the missing 6-HMDP synthesis enzymes.
Members of the COG2098 family catalyze the formation of 6-hydroxymethyl-7,8-dihydropterin
from 7,8-dihydroneopterin, while members of the COG1634 family catalyze
the formation of 6-HMDP from 6-hydroxymethyl-7,8-dihydropterin. The
discovery of these missing genes solves a long-standing mystery and
provides novel examples of convergent evolutions where proteins of
dissimilar architectures perform the same biochemical function.
Collapse
Affiliation(s)
- Valérie de Crécy-Lagard
- Department of Microbiology and
Department of Microbiology and Cell Science, University of Florida, P.O. Box 110700, Gainesville, Florida 32611-0700,
United States
| | - Gabriela Phillips
- Department of Microbiology and
Department of Microbiology and Cell Science, University of Florida, P.O. Box 110700, Gainesville, Florida 32611-0700,
United States
| | - Laura L. Grochowski
- Department
of Biochemistry (0308), Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United
States
| | - Basma El Yacoubi
- Department of Microbiology and
Department of Microbiology and Cell Science, University of Florida, P.O. Box 110700, Gainesville, Florida 32611-0700,
United States
| | - Francis Jenney
- Department of Basic
Sciences,
Georgia Campus, Philadelphia College of Osteopathic Medicine, Suwanee, Georgia 30024, United States
| | - Michael W. W. Adams
- Department of Biochemistry and
Molecular Biology, University of Georgia, Athens, Georgia 30602, United States
| | - Alexey G. Murzin
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 0QH,
U.K
| | - Robert H. White
- Department
of Biochemistry (0308), Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, United
States
| |
Collapse
|
121
|
Structural analysis of hypothetical proteins from Helicobacter pylori: an approach to estimate functions of unknown or hypothetical proteins. Int J Mol Sci 2012; 13:7109-7137. [PMID: 22837682 PMCID: PMC3397514 DOI: 10.3390/ijms13067109] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2012] [Revised: 05/29/2012] [Accepted: 06/01/2012] [Indexed: 12/12/2022] Open
Abstract
Helicobacter pylori (H. pylori) have a unique ability to survive in extreme acidic environments and to colonize the gastric mucosa. It can cause diverse gastric diseases such as peptic ulcers, chronic gastritis, mucosa-associated lymphoid tissue (MALT) lymphoma, gastric cancer, etc. Based on genomic research of H. pylori, over 1600 genes have been functionally identified so far. However, H. pylori possess some genes that are uncharacterized since: (i) the gene sequences are quite new; (ii) the function of genes have not been characterized in any other bacterial systems; and (iii) sometimes, the protein that is classified into a known protein based on the sequence homology shows some functional ambiguity, which raises questions about the function of the protein produced in H. pylori. Thus, there are still a lot of genes to be biologically or biochemically characterized to understand the whole picture of gene functions in the bacteria. In this regard, knowledge on the 3D structure of a protein, especially unknown or hypothetical protein, is frequently useful to elucidate the structure-function relationship of the uncharacterized gene product. That is, a structural comparison with known proteins provides valuable information to help predict the cellular functions of hypothetical proteins. Here, we show the 3D structures of some hypothetical proteins determined by NMR spectroscopy and X-ray crystallography as a part of the structural genomics of H. pylori. In addition, we show some successful approaches of elucidating the function of unknown proteins based on their structural information.
Collapse
|
122
|
Prediction and identification of sequences coding for orphan enzymes using genomic and metagenomic neighbours. Mol Syst Biol 2012; 8:581. [PMID: 22569339 PMCID: PMC3377989 DOI: 10.1038/msb.2012.13] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2011] [Accepted: 03/24/2012] [Indexed: 11/09/2022] Open
Abstract
Many characterized metabolic enzymes currently lack associated gene and protein sequences. Here, pathway and genomic neighbour data are used to assign genes to these ‘orphan enzymes,' and the predictions are validated with experimental assays and genome-scale metabolic modelling. ![]()
A computational method is developed for assigning candidate sequences to orphan enzymes. The method uses metabolic pathway, genomic neighbourhood, genomic co-occurrence, and protein domain information to predict genes that are likely to perform a particular enzymatic function. Benchmarking of the scoring scheme based on the 4 features above revealed that some combinations of parameters yielded greater than 70% accuracy, and that high-confidence predictions could be generated for 131 orphan enzymes. Enzyme assay experiments confirmed the predicted enzymatic activity for two of the high-confidence candidate sequences. Predicted functions can improve the annotation of genomic and metagenomic data, and can reveal putative genes for enzymes with potential biotechnological applications. Incorporating the predicted enzymatic reactions into genome-scale metabolic models changed the flux connectivity and improved their ability to correctly predict gene essentiality, supporting the biological relevance of these predictions.
Despite the current wealth of sequencing data, one-third of all biochemically characterized metabolic enzymes lack a corresponding gene or protein sequence, and as such can be considered orphan enzymes. They represent a major gap between our molecular and biochemical knowledge, and consequently are not amenable to modern systemic analyses. As 555 of these orphan enzymes have metabolic pathway neighbours, we developed a global framework that utilizes the pathway and (meta)genomic neighbour information to assign candidate sequences to orphan enzymes. For 131 orphan enzymes (37% of those for which (meta)genomic neighbours are available), we associate sequences to them using scoring parameters with an estimated accuracy of 70%, implying functional annotation of 16 345 gene sequences in numerous (meta)genomes. As a case in point, two of these candidate sequences were experimentally validated to encode the predicted activity. In addition, we augmented the currently available genome-scale metabolic models with these new sequence–function associations and were able to expand the models by on average 8%, with a considerable change in the flux connectivity patterns and improved essentiality prediction.
Collapse
|
123
|
Basler G, Grimbs S, Nikoloski Z. Optimizing metabolic pathways by screening for feasible synthetic reactions. Biosystems 2012; 109:186-91. [PMID: 22575307 DOI: 10.1016/j.biosystems.2012.04.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Revised: 03/23/2012] [Accepted: 04/23/2012] [Indexed: 11/18/2022]
Abstract
BACKGROUND Reconstruction of genome-scale metabolic networks has resulted in models capable of reproducing experimentally observed biomass yield/growth rates and predicting the effect of alterations in metabolism for biotechnological applications. The existing studies rely on modifying the metabolic network of an investigated organism by removing or inserting reactions taken either from evolutionary similar organisms or from databases of biochemical reactions (e.g., KEGG). A potential disadvantage of these knowledge-driven approaches is that the result is biased towards known reactions, as such approaches do not account for the possibility of including novel enzymes, together with the reactions they catalyze. RESULTS Here, we explore the alternative of increasing biomass yield in three model organisms, namely Bacillus subtilis, Escherichia coli, and Hordeum vulgare, by applying small, chemically feasible network modifications. We use the predicted and experimentally confirmed growth rates of the wild-type networks as reference values and determine the effect of inserting mass-balanced, thermodynamically feasible reactions on predictions of growth rate by using flux balance analysis. CONCLUSIONS While many replacements of existing reactions naturally lead to a decrease or complete loss of biomass production ability, in all three investigated organisms we find feasible modifications which facilitate a significant increase in this biological function. We focus on modifications with feasible chemical properties and a significant increase in biomass yield. The results demonstrate that small modifications are sufficient to substantially alter biomass yield in the three organisms. The method can be used to predict the effect of targeted modifications on the yield of any set of metabolites (e.g., ethanol), thus providing a computational framework for synthetic metabolic engineering.
Collapse
Affiliation(s)
- Georg Basler
- Max-Planck-Institute for Molecular Plant Physiology, 14476 Potsdam, Germany.
| | | | | |
Collapse
|
124
|
Delavat F, Phalip V, Forster A, Plewniak F, Lett MC, Lièvremont D. Amylases without known homologues discovered in an acid mine drainage: significance and impact. Sci Rep 2012; 2:354. [PMID: 22482035 PMCID: PMC3319935 DOI: 10.1038/srep00354] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2011] [Accepted: 03/08/2012] [Indexed: 12/25/2022] Open
Abstract
Acid Mine Drainages (AMDs) are extreme environments characterized by acidic and oligotrophic conditions and by metal contaminations. A function-based screening of an AMD-derived metagenomic library led to the discovery and partial characterization of two non-homologous endo-acting amylases sharing no sequence similarity with any known amylase nor glycosidase. None carried known amylolytic domains, nor could be assigned to any GH-family. One amylase displayed no similarity with any known protein, whereas the second one was similar to TraC proteins involved in the bacterial type IV secretion system. According to the scarce similarities with known proteins, 3D-structure modelling using I-TASSER was unsuccessful. This study underlined the utility of a function-driven metagenomic approach to obtain a clearer image of the bacterial community enzymatic landscape. More generally, this work points out that screening for microorganisms or biomolecules in a priori incongruous environments could provide unconventional and new exciting ways for bioprospecting.
Collapse
|
125
|
Seaver SMD, Henry CS, Hanson AD. Frontiers in metabolic reconstruction and modeling of plant genomes. JOURNAL OF EXPERIMENTAL BOTANY 2012; 63:2247-58. [PMID: 22238452 DOI: 10.1093/jxb/err371] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
A major goal of post-genomic biology is to reconstruct and model in silico the metabolic networks of entire organisms. Work on bacteria is well advanced, and is now under way for plants and other eukaryotes. Genome-scale modelling in plants is much more challenging than in bacteria. The challenges come from features characteristic of higher organisms (subcellular compartmentation, tissue differentiation) and also from the particular severity in plants of a general problem: genome content whose functions remain undiscovered. This problem results in thousands of genes for which no function is known ('undiscovered genome content') and hundreds of enzymatic and transport functions for which no gene is yet identified. The severity of the undiscovered genome content problem in plants reflects their genome size and complexity. To bring the challenges of plant genome-scale modelling into focus, we first summarize the current status of plant genome-scale models. We then highlight the challenges - and ways to address them - in three areas: identifying genes for missing processes, modelling tissues as opposed to single cells, and finding metabolic functions encoded by undiscovered genome content. We also discuss the emerging view that a significant fraction of undiscovered genome content encodes functions that counter damage to metabolites inflicted by spontaneous chemical reactions or enzymatic mistakes.
Collapse
Affiliation(s)
- Samuel M D Seaver
- Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
| | | | | |
Collapse
|
126
|
Fernie AR, Stitt M. On the discordance of metabolomics with proteomics and transcriptomics: coping with increasing complexity in logic, chemistry, and network interactions scientific correspondence. PLANT PHYSIOLOGY 2012; 158:1139-45. [PMID: 22253257 PMCID: PMC3291261 DOI: 10.1104/pp.112.193235] [Citation(s) in RCA: 136] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Affiliation(s)
- Alisdair R Fernie
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, 14476 Potsdam-Golm, Germany.
| | | |
Collapse
|
127
|
Belin P, Moutiez M, Lautru S, Seguin J, Pernodet JL, Gondry M. The nonribosomal synthesis of diketopiperazines in tRNA-dependent cyclodipeptide synthase pathways. Nat Prod Rep 2012; 29:961-79. [DOI: 10.1039/c2np20010d] [Citation(s) in RCA: 113] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
128
|
Galeazzi L, Bocci P, Amici A, Brunetti L, Ruggieri S, Romine M, Reed S, Osterman AL, Rodionov DA, Sorci L, Raffaelli N. Identification of nicotinamide mononucleotide deamidase of the bacterial pyridine nucleotide cycle reveals a novel broadly conserved amidohydrolase family. J Biol Chem 2011; 286:40365-75. [PMID: 21953451 PMCID: PMC3220592 DOI: 10.1074/jbc.m111.275818] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2011] [Revised: 08/29/2011] [Indexed: 11/06/2022] Open
Abstract
The pyridine nucleotide cycle is a network of salvage and recycling routes maintaining homeostasis of NAD(P) cofactor pool in the cell. Nicotinamide mononucleotide (NMN) deamidase (EC 3.5.1.42), one of the key enzymes of the bacterial pyridine nucleotide cycle, was originally described in Enterobacteria, but the corresponding gene eluded identification for over 30 years. A genomics-based reconstruction of NAD metabolism across hundreds of bacterial species suggested that NMN deamidase reaction is the only possible way of nicotinamide salvage in the marine bacterium Shewanella oneidensis. This prediction was verified via purification of native NMN deamidase from S. oneidensis followed by the identification of the respective gene, termed pncC. Enzymatic characterization of the PncC protein, as well as phenotype analysis of deletion mutants, confirmed its proposed biochemical and physiological function in S. oneidensis. Of the three PncC homologs present in Escherichia coli, NMN deamidase activity was confirmed only for the recombinant purified product of the ygaD gene. A comparative analysis at the level of sequence and three-dimensional structure, which is available for one of the PncC family member, shows no homology with any previously described amidohydrolases. Multiple alignment analysis of functional and nonfunctional PncC homologs, together with NMN docking experiments, allowed us to tentatively identify the active site area and conserved residues therein. An observed broad phylogenomic distribution of predicted functional PncCs in the bacterial kingdom is consistent with a possible role in detoxification of NMN, resulting from NAD utilization by DNA ligase.
Collapse
Affiliation(s)
- Luca Galeazzi
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| | - Paola Bocci
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| | - Adolfo Amici
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| | - Lucia Brunetti
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| | - Silverio Ruggieri
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| | - Margaret Romine
- the Pacific Northwest National Laboratory, Richland, Washington 99352, and
| | - Samantha Reed
- the Pacific Northwest National Laboratory, Richland, Washington 99352, and
| | - Andrei L. Osterman
- the Sanford-Burnham Medical Research Institute, La Jolla, California 92037
| | - Dmitry A. Rodionov
- the Sanford-Burnham Medical Research Institute, La Jolla, California 92037
| | - Leonardo Sorci
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
- the Sanford-Burnham Medical Research Institute, La Jolla, California 92037
| | - Nadia Raffaelli
- From the Department of Molecular Pathology and Innovative Therapies, Section of Biochemistry, Università Politecnica delle Marche, Ancona 60131, Italy
| |
Collapse
|
129
|
Abstract
Comparative analysis of the sequences of enzymes encoded in a variety of prokaryotic and eukaryotic genomes reveals convergence and divergence at several levels. Functional convergence can be inferred when structurally distinct and hence non-homologous enzymes show the ability to catalyze the same biochemical reaction. In contrast, as a result of functional diversification, many structurally similar enzyme molecules act on substantially distinct substrates and catalyze diverse biochemical reactions. Here, we present updates on the ATP-grasp, alkaline phosphatase, cupin, HD hydrolase, and N-terminal nucleophile (Ntn) hydrolase enzyme superfamilies and discuss the patterns of sequence and structural conservation and diversity within these superfamilies. Typically, enzymes within a superfamily possess common sequence motifs and key active site residues, as well as (predicted) reaction mechanisms. These observations suggest that the strained conformation (the entatic state) of the active site, which is responsible for the substrate binding and formation of the transition complex, tends to be conserved within enzyme superfamilies. The subsequent fate of the transition complex is not necessarily conserved and depends on the details of the structures of the enzyme and the substrate. This variability of reaction outcomes limits the ability of sequence analysis to predict the exact enzymatic activities of newly sequenced gene products. Nevertheless, sequence-based (super)family assignments and generic functional predictions, even if imprecise, provide valuable leads for experimental studies and remain the best approach to the functional annotation of uncharacterized proteins from new genomes.
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894.
| |
Collapse
|
130
|
von Saint Paul V, Zhang W, Kanawati B, Geist B, Faus-Keßler T, Schmitt-Kopplin P, Schäffner AR. The Arabidopsis glucosyltransferase UGT76B1 conjugates isoleucic acid and modulates plant defense and senescence. THE PLANT CELL 2011; 23:4124-45. [PMID: 22080599 PMCID: PMC3246326 DOI: 10.1105/tpc.111.088443] [Citation(s) in RCA: 130] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Revised: 09/30/2011] [Accepted: 10/24/2011] [Indexed: 05/18/2023]
Abstract
Plants coordinate and tightly regulate pathogen defense by the mostly antagonistic salicylate (SA)- and jasmonate (JA)-mediated signaling pathways. Here, we show that the previously uncharacterized glucosyltransferase UGT76B1 is a novel player in this SA-JA signaling crosstalk. UGT76B1 was selected as the top stress-induced isoform among all 122 members of the Arabidopsis thaliana UGT family. Loss of UGT76B1 function leads to enhanced resistance to the biotrophic pathogen Pseudomonas syringae and accelerated senescence but increased susceptibility toward necrotrophic Alternaria brassicicola. This is accompanied by constitutively elevated SA levels and SA-related marker gene expression, whereas JA-dependent markers are repressed. Conversely, UGT76B1 overexpression has the opposite effect. Thus, UGT76B1 attenuates SA-dependent plant defense in the absence of infection, promotes the JA response, and delays senescence. The ugt76b1 phenotypes were SA dependent, whereas UGT76B1 overexpression indicated that this gene possibly also has a direct effect on the JA pathway. Nontargeted metabolomic analysis of UGT76B1 knockout and overexpression lines using ultra-high-resolution mass spectrometry and activity assays with the recombinant enzyme led to the ab initio identification of isoleucic acid (2-hydroxy-3-methyl-pentanoic acid) as a substrate of UGT76B1. Exogenously applied isoleucic acid increased resistance against P. syringae infection. These findings indicate a novel link between amino acid-related molecules and plant defense that is mediated by small-molecule glucosylation.
Collapse
Affiliation(s)
- Veronica von Saint Paul
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | - Wei Zhang
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | - Basem Kanawati
- Institute of Ecological Chemistry, Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | - Birgit Geist
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | - Theresa Faus-Keßler
- Institute of Developmental Genetics, Helmholtz Zentrum München, 85764 Neuherberg, Germany
| | | | - Anton R. Schäffner
- Institute of Biochemical Plant Pathology, Helmholtz Zentrum München, 85764 Neuherberg, Germany
- Address correspondence to
| |
Collapse
|
131
|
Rolfsson O, Palsson BØ, Thiele I. The human metabolic reconstruction Recon 1 directs hypotheses of novel human metabolic functions. BMC SYSTEMS BIOLOGY 2011; 5:155. [PMID: 21962087 PMCID: PMC3224382 DOI: 10.1186/1752-0509-5-155] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 10/01/2011] [Indexed: 11/29/2022]
Abstract
Background Metabolic network reconstructions formalize our knowledge of metabolism. Gaps in these networks pinpoint regions of metabolism where biological components and functions are "missing." At the same time, a major challenge in the post genomic era involves characterisation of missing biological components to complete genome annotation. Results We used the human metabolic network reconstruction RECON 1 and established constraint-based modelling tools to uncover novel functions associated with human metabolism. Flux variability analysis identified 175 gaps in RECON 1 in the form of blocked reactions. These gaps were unevenly distributed within metabolic pathways but primarily found in the cytosol and often caused by compounds whose metabolic fate, rather than production, is unknown. Using a published algorithm, we computed gap-filling solutions comprised of non-organism specific metabolic reactions capable of bridging the identified gaps. These candidate solutions were found to be dependent upon the reaction environment of the blocked reaction. Importantly, we showed that automatically generated solutions could produce biologically realistic hypotheses of novel human metabolic reactions such as of the fate of iduronic acid following glycan degradation and of N-acetylglutamate in amino acid metabolism. Conclusions The results demonstrate how metabolic models can be utilised to direct hypotheses of novel metabolic functions in human metabolism; a process that we find is heavily reliant upon manual curation and biochemical insight. The effectiveness of a systems approach for novel biochemical pathway discovery in mammals is demonstrated and steps required to tailor future gap filling algorithms to mammalian metabolic networks are proposed.
Collapse
Affiliation(s)
- Ottar Rolfsson
- Center for Systems Biology, University of Iceland, Sturlugata 8, 101 Reykjavik, Iceland
| | | | | |
Collapse
|
132
|
Pribat A, Blaby IK, Lara-Núñez A, Jeanguenin L, Fouquet R, Frelin O, Gregory JF, Philmus B, Begley TP, de Crécy-Lagard V, Hanson AD. A 5-formyltetrahydrofolate cycloligase paralog from all domains of life: comparative genomic and experimental evidence for a cryptic role in thiamin metabolism. Funct Integr Genomics 2011; 11:467-78. [PMID: 21538139 PMCID: PMC6078417 DOI: 10.1007/s10142-011-0224-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2011] [Revised: 03/19/2011] [Accepted: 04/03/2011] [Indexed: 12/18/2022]
Abstract
A paralog (here termed COG0212) of the ATP-dependent folate salvage enzyme 5-formyltetrahydrofolate cycloligase (5-FCL) occurs in all domains of life and, although typically annotated as 5-FCL in pro- and eukaryotic genomes, is of unknown function. COG0212 is similar in overall structure to 5-FCL, particularly in the substrate binding region, and has distant similarity to other kinases. The Arabidopsis thaliana COG0212 protein was shown to be targeted to chloroplasts and to be required for embryo viability. Comparative genomic analysis revealed that a high proportion (19%) of archaeal and bacterial COG0212 genes are clustered on the chromosome with various genes implicated in thiamin metabolism or transport but showed no such association between COG0212 and folate metabolism. Consistent with the bioinformatic evidence for a role in thiamin metabolism, ablating COG0212 in the archaeon Haloferax volcanii caused accumulation of thiamin monophosphate. Biochemical and functional complementation tests of several known and hypothetical thiamin-related activities (involving thiamin, its breakdown products, and their phosphates) were, however, negative. Also consistent with the bioinformatic evidence, the COG0212 proteins from A. thaliana and prokaryote sources lacked 5-FCL activity in vitro and did not complement the growth defect or the characteristic 5-formyltetrahydrofolate accumulation of a 5-FCL-deficient (ΔygfA) Escherichia coli strain. We therefore propose (a) that COG0212 has an unrecognized yet sometimes crucial role in thiamin metabolism, most probably in salvage or detoxification, and (b) that is not a 5-FCL and should no longer be so annotated.
Collapse
Affiliation(s)
- Anne Pribat
- Horticultural Sciences Department, University of Florida, Gainesville, 32611, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
133
|
Guengerich FP, Cheng Q. Orphans in the human cytochrome P450 superfamily: approaches to discovering functions and relevance in pharmacology. Pharmacol Rev 2011; 63:684-99. [PMID: 21737533 DOI: 10.1124/pr.110.003525] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
As a result of technical advances in recombinant DNA technology and nucleotide sequencing, entire genome sequences have become available in the past decade and offer potential in understanding diseases. However, a central problem in the biochemical sciences is that the functions of only a fraction of the genes/proteins are known, and this is also an issue in pharmacology. This review is focused on issues related to the functions of cytochrome P450 (P450) enzymes. P450 functions can be categorized in several groups: 1) Some P450s have critical roles in the metabolism of endogenous substrates (e.g., sterols and fat-soluble vitamins). 2) Some P450s are not generally critical to normal physiology but function in relatively nonselective protection from the many xenobiotic chemicals to which mammals (including humans) are exposed in their diets [as well as more anthropomorphic chemicals (e.g., drugs, pesticides)]. 3) Some P450s have not been extensively studied and are termed "orphans" here. With regard to elucidation of any physiological functions of the orphan P450s, the major subject of this review, it is clear that simple trial-and-error approaches with individual substrate candidates will not be very productive in addressing questions about function. A series of liquid chromatography/mass spectrometry/informatics approaches are discussed, along with some successes with both human and bacterial P450s. Current information on what are still considered "orphan" P450s is presented. The potential for application of some of these approaches to other enzyme systems is also discussed.
Collapse
Affiliation(s)
- F Peter Guengerich
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, 638 Robinson Research Building, 2200 Pierce Avenue, Nashville, Tennessee 37232-0146, USA.
| | | |
Collapse
|
134
|
Boynton TO, Gerdes S, Craven SH, Neidle EL, Phillips JD, Dailey HA. Discovery of a gene involved in a third bacterial protoporphyrinogen oxidase activity through comparative genomic analysis and functional complementation. Appl Environ Microbiol 2011; 77:4795-801. [PMID: 21642412 PMCID: PMC3147383 DOI: 10.1128/aem.00171-11] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2011] [Accepted: 05/20/2011] [Indexed: 11/20/2022] Open
Abstract
Tetrapyrroles are ubiquitous molecules in nearly all living organisms. Heme, an iron-containing tetrapyrrole, is widely distributed in nature, including most characterized aerobic and facultative bacteria. A large majority of bacteria that contain heme possess the ability to synthesize it. Despite this capability and the fact that the biosynthetic pathway has been well studied, enzymes catalyzing at least three steps have remained "missing" in many bacteria. In the current work, we have employed comparative genomics via the SEED genomic platform, coupled with experimental verification utilizing Acinetobacter baylyi ADP1, to identify one of the missing enzymes, a new protoporphyrinogen oxidase, the penultimate enzyme in heme biosynthesis. COG1981 was identified by genomic analysis as a candidate protein family for the missing enzyme in bacteria that lacked HemG or HemY, two known protoporphyrinogen oxidases. The predicted amino acid sequence of COG1981 is unlike those of the known enzymes HemG and HemY, but in some genomes, the gene encoding it is found neighboring other heme biosynthetic genes. When the COG1981 gene was deleted from the genome of A. baylyi, a bacterium that lacks both hemG and hemY, the organism became auxotrophic for heme. Cultures accumulated porphyrin intermediates, and crude cell extracts lacked protoporphyrinogen oxidase activity. The heme auxotrophy was rescued by the presence of a plasmid-borne protoporphyrinogen oxidase gene from a number of different organisms, such as hemG from Escherichia coli, hemY from Myxococcus xanthus, or the human gene for protoporphyrinogen oxidase.
Collapse
Affiliation(s)
- Tye O. Boynton
- Biomedical and Health Sciences Institute, University of Georgia, Athens, Georgia 30602
- Department of Microbiology, University of Georgia, Athens, Georgia 30602
| | - Svetlana Gerdes
- Fellowship for Interpretation of Genomes, Burr Ridge, Illinois 60527
| | - Sarah H. Craven
- Department of Microbiology, University of Georgia, Athens, Georgia 30602
| | - Ellen L. Neidle
- Biomedical and Health Sciences Institute, University of Georgia, Athens, Georgia 30602
- Department of Microbiology, University of Georgia, Athens, Georgia 30602
| | - John D. Phillips
- Division of Hematology, University of Utah School of Medicine, Salt Lake City, Utah 84132
| | - Harry A. Dailey
- Biomedical and Health Sciences Institute, University of Georgia, Athens, Georgia 30602
- Department of Microbiology, University of Georgia, Athens, Georgia 30602
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602
| |
Collapse
|
135
|
Gerdes S, El Yacoubi B, Bailly M, Blaby IK, Blaby-Haas CE, Jeanguenin L, Lara-Núñez A, Pribat A, Waller JC, Wilke A, Overbeek R, Hanson AD, de Crécy-Lagard V. Synergistic use of plant-prokaryote comparative genomics for functional annotations. BMC Genomics 2011; 12 Suppl 1:S2. [PMID: 21810204 PMCID: PMC3223725 DOI: 10.1186/1471-2164-12-s1-s2] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Identifying functions for all gene products in all sequenced organisms is a central challenge of the post-genomic era. However, at least 30-50% of the proteins encoded by any given genome are of unknown or vaguely known function, and a large number are wrongly annotated. Many of these 'unknown' proteins are common to prokaryotes and plants. We set out to predict and experimentally test the functions of such proteins. Our approach to functional prediction integrates comparative genomics based mainly on microbial genomes with functional genomic data from model microorganisms and post-genomic data from plants. This approach bridges the gap between automated homology-based annotations and the classical gene discovery efforts of experimentalists, and is more powerful than purely computational approaches to identifying gene-function associations. RESULTS Among Arabidopsis genes, we focused on those (2,325 in total) that (i) are unique or belong to families with no more than three members, (ii) occur in prokaryotes, and (iii) have unknown or poorly known functions. Computer-assisted selection of promising targets for deeper analysis was based on homology-independent characteristics associated in the SEED database with the prokaryotic members of each family. In-depth comparative genomic analysis was performed for 360 top candidate families. From this pool, 78 families were connected to general areas of metabolism and, of these families, specific functional predictions were made for 41. Twenty-one predicted functions have been experimentally tested or are currently under investigation by our group in at least one prokaryotic organism (nine of them have been validated, four invalidated, and eight are in progress). Ten additional predictions have been independently validated by other groups. Discovering the function of very widespread but hitherto enigmatic proteins such as the YrdC or YgfZ families illustrates the power of our approach. CONCLUSIONS Our approach correctly predicted functions for 19 uncharacterized protein families from plants and prokaryotes; none of these functions had previously been correctly predicted by computational methods. The resulting annotations could be propagated with confidence to over six thousand homologous proteins encoded in over 900 bacterial, archaeal, and eukaryotic genomes currently available in public databases.
Collapse
Affiliation(s)
- Svetlana Gerdes
- Fellowship for Interpretation of Genomes, Burr Ridge, IL, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
136
|
Salamanca-Pinzón SG, Guengerich FP. A tricistronic human adrenodoxin reductase-adrenodoxin-cytochrome P450 27A1 vector system for substrate hydroxylation in Escherichia coli. Protein Expr Purif 2011; 79:231-6. [PMID: 21621619 DOI: 10.1016/j.pep.2011.05.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2011] [Revised: 05/11/2011] [Accepted: 05/12/2011] [Indexed: 01/01/2023]
Abstract
Cytochrome P450 (P450) 27A1 catalyzes 27-hydroxylation of cholesterol and 25-hydroxylation of vitamin D(3), serving as an important component for the maintenance of lipid homeostasis. In eukaryotic cells P450 27A1 is a membrane-bound protein located on the inner mitochondrial membrane and requires two auxiliary reduction partners, adrenodoxin (Adx) and NADPH-adrenodoxin reductase (Adr), for catalysis in the bile acid biosynthesis pathway. A strategy was developed for the functional coexpression of P450 27A1 with Adr and Adx in a tricistronic fashion (single RNA, three proteins) in Escherichia coli, mimicking the mitochondrial P450 system. Intact bacterial cells coexpressing the P450 vector (pTC27A1) efficiently hydroxylated cholesterol at the 27 position as well as vitamin D(3) at the 25 position when supplemented with glycerol as a carbon source. Thus, E. coli containing pTC27A1 is able to hydroxylate cholesterol in a self-sufficient fashion and is suitable for further applications of protein interaction, drug discovery, and inhibitor evaluation and for the study of other mitochondrial P450s and oxysterol production in microorganisms without a need for membrane reconstitution, membrane simulation by detergents, or purification of the components.
Collapse
Affiliation(s)
- S Giovanna Salamanca-Pinzón
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, TN 37232-0146, USA
| | | |
Collapse
|
137
|
Hanson AD, Gregory JF. Folate biosynthesis, turnover, and transport in plants. ANNUAL REVIEW OF PLANT BIOLOGY 2011; 62:105-25. [PMID: 21275646 DOI: 10.1146/annurev-arplant-042110-103819] [Citation(s) in RCA: 145] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Folates are essential cofactors for one-carbon transfer reactions and are needed in the diets of humans and animals. Because plants are major sources of dietary folate, plant folate biochemistry has long been of interest but progressed slowly until the genome era. Since then, genome-enabled approaches have brought rapid advances: We now know (a) all the plant folate synthesis genes and some genes of folate turnover and transport, (b) certain mechanisms governing folate synthesis, and (c) the subcellular locations of folate synthesis enzymes and of folates themselves. Some of this knowledge has been applied, simply and successfully, to engineer folate-enriched food crops (i.e., biofortification). Much remains to be discovered about folates, however, particularly in relation to homeostasis, catabolism, membrane transport, and vacuolar storage. Understanding these processes, which will require both biochemical and -omics research, should lead to improved biofortification strategies based on transgenic or conventional approaches.
Collapse
Affiliation(s)
- Andrew D Hanson
- Horticultural Sciences Department, University of Florida, Gainesville, Florida 32611, USA
| | | |
Collapse
|
138
|
Jeanguenin L, Lara-Núñez A, Pribat A, Mageroy MH, Gregory JF, Rice KC, de Crécy-Lagard V, Hanson AD. Moonlighting glutamate formiminotransferases can functionally replace 5-formyltetrahydrofolate cycloligase. J Biol Chem 2010; 285:41557-66. [PMID: 20952389 DOI: 10.1074/jbc.m110.190504] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
5-Formyltetrahydrofolate (5-CHO-THF) is formed by a side reaction of serine hydroxymethyltransferase. Unlike other folates, it is not a one-carbon donor but a potent inhibitor of folate enzymes and must therefore be metabolized. Only 5-CHO-THF cycloligase (5-FCL) is generally considered to do this. However, comparative genomic analysis indicated (i) that certain prokaryotes lack 5-FCL, implying that they have an alternative 5-CHO-THF-metabolizing enzyme, and (ii) that the histidine breakdown enzyme glutamate formiminotransferase (FT) might moonlight in this role. A functional complementation assay for 5-CHO-THF metabolism was developed in Escherichia coli, based on deleting the gene encoding 5-FCL (ygfA). The deletion mutant accumulated 5-CHO-THF and, with glycine as sole nitrogen source, showed a growth defect; both phenotypes were complemented by bacterial or archaeal genes encoding FT. Furthermore, utilization of supplied 5-CHO-THF by Streptococcus pyogenes was shown to require expression of the native FT. Recombinant bacterial and archaeal FTs catalyzed formyl transfer from 5-CHO-THF to glutamate, with k(cat) values of 0.1-1.2 min(-1) and K(m) values for 5-CHO-THF and glutamate of 0.4-5 μM and 0.03-1 mM, respectively. Although the formyltransferase activities of these proteins were far lower than their formiminotransferase activities, the K(m) values for both substrates relative to their intracellular levels in prokaryotes are consistent with significant in vivo flux through the formyltransferase reaction. Collectively, these data indicate that FTs functionally replace 5-FCL in certain prokaryotes.
Collapse
Affiliation(s)
- Linda Jeanguenin
- Department of Horticultural Sciences, University of Florida, Gainesville, Florida 32611, USA
| | | | | | | | | | | | | | | |
Collapse
|
139
|
Galperin MY, Koonin EV. From complete genome sequence to 'complete' understanding? Trends Biotechnol 2010; 28:398-406. [PMID: 20647113 PMCID: PMC3065831 DOI: 10.1016/j.tibtech.2010.05.006] [Citation(s) in RCA: 119] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2010] [Revised: 05/18/2010] [Accepted: 05/28/2010] [Indexed: 12/29/2022]
Abstract
The rapidly accumulating genome sequence data allow researchers to address fundamental biological questions that were not even asked just a few years ago. A major problem in genomics is the widening gap between the rapid progress in genome sequencing and the comparatively slow progress in the functional characterization of sequenced genomes. Here we discuss two key questions of genome biology: whether we need more genomes, and how deep is our understanding of biology based on genomic analysis. We argue that overly specific annotations of gene functions are often less useful than the more generic, but also more robust, functional assignments based on protein family classification. We also discuss problems in understanding the functions of the remaining 'conserved hypothetical' genes.
Collapse
Affiliation(s)
- Michael Y Galperin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
140
|
Guengerich FP, Tang Z, Salamanca-Pinzón SG, Cheng Q. Characterizing proteins of unknown function: orphan cytochrome p450 enzymes as a paradigm. Mol Interv 2010; 10:153-63. [PMID: 20539034 PMCID: PMC2895278 DOI: 10.1124/mi.10.3.6] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
With the rapid completion of genomic sequences of organisms today, we have far more gene products than functions we can ascribe. A number of experimental strategies have been developed and applied, both in vitro and in vivo, to put functions to these orphan proteins. The "deorphanization" of human and Streptomyces cytochrome P450 enzymes is considered quite important for pharmacology, with ramifications for the use of clinical therapeutics. The myriad of possibilities is too enormous to screen one reaction at a time, thus metabolomic or proteomic screens with complex biological samples are promising current strategies.
Collapse
Affiliation(s)
- F Peter Guengerich
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, TN 37232-0146, USA.
| | | | | | | |
Collapse
|
141
|
Guengerich FP, Tang Z, Cheng Q, Salamanca-Pinzón SG. Approaches to deorphanization of human and microbial cytochrome P450 enzymes. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2010; 1814:139-45. [PMID: 20493973 DOI: 10.1016/j.bbapap.2010.05.005] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2010] [Revised: 04/30/2010] [Accepted: 05/09/2010] [Indexed: 12/30/2022]
Abstract
One of the general problems in biology today is that we are characterizing genomic sequences much faster than identifying the functions of the gene products, and the same problem exists with cytochromes P450 (P450). One fourth of the human P450s are not well-characterized and therefore considered "orphans." A number of approaches to deorphanization are discussed generally. Several liquid chromatography-mass spectrometry approaches have been applied to some of the human and Streptomyces coelicolor P450s. One current limitation is that too many fatty acid oxidations have been identified and we are probably missing more relevant substrates, possibly due to limits of sensitivity.
Collapse
Affiliation(s)
- F Peter Guengerich
- Department of Biochemistry and Center in Molecular Toxicology, Vanderbilt University School of Medicine, Nashville, TN 37232-0146, USA.
| | | | | | | |
Collapse
|