Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Maltsev N, Glass E, Sulakhe D, Rodriguez A, Syed MH, Bompada T, Zhang Y, D'Souza M. PUMA2--grid-based high-throughput analysis of genomes and metabolic pathways. Nucleic Acids Res 2006;34:D369-72. [PMID: 16381888 PMCID: PMC1347457 DOI: 10.1093/nar/gkj095] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

For:	Maltsev N, Glass E, Sulakhe D, Rodriguez A, Syed MH, Bompada T, Zhang Y, D'Souza M. PUMA2--grid-based high-throughput analysis of genomes and metabolic pathways. Nucleic Acids Res 2006;34:D369-72. [PMID: 16381888 PMCID: PMC1347457 DOI: 10.1093/nar/gkj095] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Number

Cited by Other Article(s)

A Survey of Data Mining and Deep Learning in Bioinformatics. J Med Syst 2018;42:139. [DOI: 10.1007/s10916-018-1003-9] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 06/21/2018] [Indexed: 12/13/2022]

Role of N-acetylserotonin O-methyltransferase in bipolar disorders and its dynamics. J Mol Liq 2013. [DOI: 10.1016/j.molliq.2013.03.008] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Zhou T. Computational reconstruction of metabolic networks from KEGG. Methods Mol Biol 2013;930:235-249. [PMID: 23086844 DOI: 10.1007/978-1-62703-059-5_10] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]

Jung SK, McDonald K. Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization. BMC Bioinformatics 2011;12:340. [PMID: 21846353 PMCID: PMC3215308 DOI: 10.1186/1471-2105-12-340] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2011] [Accepted: 08/16/2011] [Indexed: 08/26/2023] Open

ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities. Adv Bioinformatics 2011;2011:743782. [PMID: 21541071 PMCID: PMC3085309 DOI: 10.1155/2011/743782] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2010] [Revised: 12/07/2010] [Accepted: 01/27/2011] [Indexed: 01/07/2023] Open

Wu XL, Beissinger TM, Bauck S, Woodward B, Rosa GJM, Weigel KA, Gatti NDL, Gianola D. A primer on high-throughput computing for genomic selection. Front Genet 2011;2:4. [PMID: 22303303 PMCID: PMC3268564 DOI: 10.3389/fgene.2011.00004] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2010] [Accepted: 02/07/2011] [Indexed: 12/30/2022] Open

Abstract

High-throughput computing (HTC) uses computer clusters to solve advanced computational problems, with the goal of accomplishing high-throughput over relatively long periods of time. In genomic selection, for example, a set of markers covering the entire genome is used to train a model based on known data, and the resulting model is used to predict the genetic merit of selection candidates. Sophisticated models are very computationally demanding and, with several traits to be evaluated sequentially, computing time is long, and output is low. In this paper, we present scenarios and basic principles of how HTC can be used in genomic selection, implemented using various techniques from simple batch processing to pipelining in distributed computer clusters. Various scripting languages, such as shell scripting, Perl, and R, are also very useful to devise pipelines. By pipelining, we can reduce total computing time and consequently increase throughput. In comparison to the traditional data processing pipeline residing on the central processors, performing general-purpose computation on a graphics processing unit provide a new-generation approach to massive parallel computing in genomic selection. While the concept of HTC may still be new to many researchers in animal breeding, plant breeding, and genetics, HTC infrastructures have already been built in many institutions, such as the University of Wisconsin–Madison, which can be leveraged for genomic selection, in terms of central processing unit capacity, network connectivity, storage availability, and middleware connectivity. Exploring existing HTC infrastructures as well as general-purpose computing environments will further expand our capability to meet increasing computing demands posed by unprecedented genomic data that we have today. We anticipate that HTC will impact genomic selection via better statistical models, faster solutions, and more competitive products (e.g., from design of marker panels to realized genetic gain). Eventually, HTC may change our view of data analysis as well as decision-making in the post-genomic era of selection programs in animals and plants, or in the study of complex diseases in humans.

Collapse

Terzer M, Maynard ND, Covert MW, Stelling J. Genome-scale metabolic networks. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2011;1:285-297. [PMID: 20835998 DOI: 10.1002/wsbm.37] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Aho T, Almusa H, Matilainen J, Larjo A, Ruusuvuori P, Aho KL, Wilhelm T, Lähdesmäki H, Beyer A, Harju M, Chowdhury S, Leinonen K, Roos C, Yli-Harja O. Reconstruction and validation of RefRec: a global model for the yeast molecular interaction network. PLoS One 2010;5:e10662. [PMID: 20498836 PMCID: PMC2871048 DOI: 10.1371/journal.pone.0010662] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Accepted: 04/15/2010] [Indexed: 11/26/2022] Open

Abstract

Molecular interaction networks establish all cell biological processes. The networks are under intensive research that is facilitated by new high-throughput measurement techniques for the detection, quantification, and characterization of molecules and their physical interactions. For the common model organism yeast Saccharomyces cerevisiae, public databases store a significant part of the accumulated information and, on the way to better understanding of the cellular processes, there is a need to integrate this information into a consistent reconstruction of the molecular interaction network. This work presents and validates RefRec, the most comprehensive molecular interaction network reconstruction currently available for yeast. The reconstruction integrates protein synthesis pathways, a metabolic network, and a protein-protein interaction network from major biological databases. The core of the reconstruction is based on a reference object approach in which genes, transcripts, and proteins are identified using their primary sequences. This enables their unambiguous identification and non-redundant integration. The obtained total number of different molecular species and their connecting interactions is approximately 67,000. In order to demonstrate the capacity of RefRec for functional predictions, it was used for simulating the gene knockout damage propagation in the molecular interaction network in approximately 590,000 experimentally validated mutant strains. Based on the simulation results, a statistical classifier was subsequently able to correctly predict the viability of most of the strains. The results also showed that the usage of different types of molecular species in the reconstruction is important for accurate phenotype prediction. In general, the findings demonstrate the benefits of global reconstructions of molecular interaction networks. With all the molecular species and their physical interactions explicitly modeled, our reconstruction is able to serve as a valuable resource in additional analyses involving objects from multiple molecular -omes. For that purpose, RefRec is freely available in the Systems Biology Markup Language format.

Collapse

Iyer LM, Abhiman S, de Souza RF, Aravind L. Origin and evolution of peptide-modifying dioxygenases and identification of the wybutosine hydroxylase/hydroperoxidase. Nucleic Acids Res 2010;38:5261-79. [PMID: 20423905 PMCID: PMC2938197 DOI: 10.1093/nar/gkq265] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open

Abstract

Unlike classical 2-oxoglutarate and iron-dependent dioxygenases, which include several nucleic acid modifiers, the structurally similar jumonji-related dioxygenase superfamily was only known to catalyze peptide modifications. Using comparative genomics methods, we predict that a family of jumonji-related enzymes catalyzes wybutosine hydroxylation/peroxidation at position 37 of eukaryotic tRNAPhe. Identification of this enzyme raised questions regarding the emergence of protein- and nucleic acid-modifying activities among jumonji-related domains. We addressed these with a natural classification of DSBH domains and reconstructed the precursor of the dioxygenases as a sugar-binding domain. This precursor gave rise to sugar epimerases and metal-binding sugar isomerases. The sugar isomerase active site was exapted for catalysis of oxygenation, with a radiation of these enzymes in bacteria, probably due to impetus from the primary oxygenation event in Earth’s history. 2-Oxoglutarate-dependent versions appear to have further expanded with rise of the tricarboxylic acid cycle. We identify previously under-appreciated aspects of their active site and multiple independent innovations of 2-oxoacid-binding basic residues among these superfamilies. We show that double-stranded β-helix dioxygenases diversified extensively in biosynthesis and modification of halogenated siderophores, antibiotics, peptide secondary metabolites and glycine-rich collagen-like proteins in bacteria. Jumonji-related domains diversified into three distinct lineages in bacterial secondary metabolism systems and these were precursors of the three major clades of eukaryotic enzymes. The specificity of wybutosine hydroxylase/peroxidase probably relates to the structural similarity of the modified moiety to the ancestral amino acid substrate of this superfamily.

Collapse

Zhao J, Geng C, Tao L, Zhang D, Jiang Y, Tang K, Zhu R, Yu H, Zhang W, He F, Li Y, Cao Z. Reconstruction and Analysis of Human Liver-Specific Metabolic Network Based on CNHLPP Data. J Proteome Res 2010;9:1648-58. [DOI: 10.1021/pr9006188] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]

Affiliation(s)

Jing Zhao Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Chao Geng Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Lin Tao Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Duanfeng Zhang Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Ying Jiang Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Kailin Tang Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Ruixin Zhu Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Hong Yu Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Weidong Zhang Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Fuchu He Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Yixue Li Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,
Zhiwei Cao Shanghai Center for Bioinformation and Technology, Shanghai, China, School of Life Sciences and Technology, Tongji University, Shanghai, China, Key laboratory of Arrthythmias, Ministry of Education, China, Department of Genomics and Proteomics, Beijing Institute of Radiation Medicine, Beijing, China, Beijing Proteome Research Center, Beijing, China, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China, Department of Natural Medicinal Chemistry, College of Pharmacy,

Collapse

Karp PD, Paley SM, Krummenacker M, Latendresse M, Dale JM, Lee TJ, Kaipa P, Gilham F, Spaulding A, Popescu L, Altman T, Paulsen I, Keseler IM, Caspi R. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology. Brief Bioinform 2009;11:40-79. [PMID: 19955237 DOI: 10.1093/bib/bbp043] [Citation(s) in RCA: 326] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Pathway projector: web-based zoomable pathway browser using KEGG atlas and Google Maps API. PLoS One 2009;4:e7710. [PMID: 19907644 PMCID: PMC2770834 DOI: 10.1371/journal.pone.0007710] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2009] [Accepted: 10/11/2009] [Indexed: 11/19/2022] Open

Abstract

Background

Biochemical pathways provide an essential context for understanding comprehensive experimental data and the systematic workings of a cell. Therefore, the availability of online pathway browsers will facilitate post-genomic research, just as genome browsers have contributed to genomics. Many pathway maps have been provided online as part of public pathway databases. Most of these maps, however, function as the gateway interface to a specific database, and the comprehensiveness of their represented entities, data mapping capabilities, and user interfaces are not always sufficient for generic usage.

Methodology/Principal Findings

We have identified five central requirements for a pathway browser: (1) availability of large integrated maps showing genes, enzymes, and metabolites; (2) comprehensive search features and data access; (3) data mapping for transcriptomic, proteomic, and metabolomic experiments, as well as the ability to edit and annotate pathway maps; (4) easy exchange of pathway data; and (5) intuitive user experience without the requirement for installation and regular maintenance. According to these requirements, we have evaluated existing pathway databases and tools and implemented a web-based pathway browser named Pathway Projector as a solution.

Conclusions/Significance

Pathway Projector provides integrated pathway maps that are based upon the KEGG Atlas, with the addition of nodes for genes and enzymes, and is implemented as a scalable, zoomable map utilizing the Google Maps API. Users can search pathway-related data using keywords, molecular weights, nucleotide sequences, and amino acid sequences, or as possible routes between compounds. In addition, experimental data from transcriptomic, proteomic, and metabolomic analyses can be readily mapped. Pathway Projector is freely available for academic users at http://www.g-language.org/PathwayProjector/.

Collapse

Davidsen T, Beck E, Ganapathy A, Montgomery R, Zafar N, Yang Q, Madupu R, Goetz P, Galinsky K, White O, Sutton G. The comprehensive microbial resource. Nucleic Acids Res 2009;38:D340-5. [PMID: 19892825 PMCID: PMC2808947 DOI: 10.1093/nar/gkp912] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Peregrín-Alvarez JM, Sanford C, Parkinson J. The conservation and evolutionary modularity of metabolism. Genome Biol 2009;10:R63. [PMID: 19523219 PMCID: PMC2718497 DOI: 10.1186/gb-2009-10-6-r63] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2009] [Revised: 05/27/2009] [Accepted: 06/12/2009] [Indexed: 01/09/2023] Open

Abstract

A novel evolutionary analysis of metabolic networks across 26 taxa reveals a highly-conserved but flexible core of metabolic enzymes.

Background

Cellular metabolism is a fundamental biological system consisting of myriads of enzymatic reactions that together fulfill the basic requirements of life. The recent availability of vast amounts of sequence data from diverse sets of organisms provides an opportunity to systematically examine metabolism from a comparative perspective. Here we supplement existing genome and protein resources with partial genome datasets derived from 193 eukaryotes to present a comprehensive survey of the conservation of metabolism across 26 taxa representing the three domains of life.

Results

In general, metabolic enzymes are highly conserved. However, organizing these enzymes within the context of functional pathways revealed a spectrum of conservation from those that are highly conserved (for example, carbohydrate, energy, amino acid and nucleotide metabolism enzymes) to those specific to individual taxa (for example, those involved in glycan metabolism and secondary metabolite pathways). Applying a novel co-conservation analysis, KEGG defined pathways did not generally display evolutionary coherence. Instead, such modularity appears restricted to smaller subsets of enzymes. Expanding analyses to a global metabolic network revealed a highly conserved, but nonetheless flexible, 'core' of enzymes largely involved in multiple reactions across different pathways. Enzymes and pathways associated with the periphery of this network were less well conserved and associated with taxon-specific innovations.

Conclusions

These findings point to an emerging picture in which a core of enzyme activities involving amino acid, energy, carbohydrate and lipid metabolism have evolved to provide the basic functions required for life. However, the precise complement of enzymes associated within this core for each species is flexible.

Collapse

Whitaker JW, McConkey GA, Westhead DR. The transferome of metabolic genes explored: analysis of the horizontal transfer of enzyme encoding genes in unicellular eukaryotes. Genome Biol 2009;10:R36. [PMID: 19368726 PMCID: PMC2688927 DOI: 10.1186/gb-2009-10-4-r36] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2008] [Revised: 04/06/2009] [Accepted: 04/15/2009] [Indexed: 12/02/2022] Open

Abstract

Metabolic network analysis in multiple eukaryotes identifies how horizontal and endosymbiotic gene transfer of metabolic enzyme-encoding genes leads to functional gene gain during evolution.

Background

Metabolic networks are responsible for many essential cellular processes, and exhibit a high level of evolutionary conservation from bacteria to eukaryotes. If genes encoding metabolic enzymes are horizontally transferred and are advantageous, they are likely to become fixed. Horizontal gene transfer (HGT) has played a key role in prokaryotic evolution and its importance in eukaryotes is increasingly evident. High levels of endosymbiotic gene transfer (EGT) accompanied the establishment of plastids and mitochondria, and more recent events have allowed further acquisition of bacterial genes. Here, we present the first comprehensive multi-species analysis of E/HGT of genes encoding metabolic enzymes from bacteria to unicellular eukaryotes.

Results

The phylogenetic trees of 2,257 metabolic enzymes were used to make E/HGT assertions in ten groups of unicellular eukaryotes, revealing the sources and metabolic processes of the transferred genes. Analyses revealed a preference for enzymes encoded by genes gained through horizontal and endosymbiotic transfers to be connected in the metabolic network. Enrichment in particular functional classes was particularly revealing: alongside plastid related processes and carbohydrate metabolism, this highlighted a number of pathways in eukaryotic parasites that are rich in enzymes encoded by transferred genes, and potentially key to pathogenicity. The plant parasites Phytophthora were discovered to have a potential pathway for lipopolysaccharide biosynthesis of E/HGT origin not seen before in eukaryotes outside the Plantae.

Conclusions

The number of enzymes encoded by genes gained through E/HGT has been established, providing insight into functional gain during the evolution of unicellular eukaryotes. In eukaryotic parasites, genes encoding enzymes that have been gained through horizontal transfer may be attractive drug targets if they are part of processes not present in the host, or are significantly diverged from equivalent host enzymes.

Collapse

Kastenmüller G, Schenk ME, Gasteiger J, Mewes HW. Uncovering metabolic pathways relevant to phenotypic traits of microbial genomes. Genome Biol 2009;10:R28. [PMID: 19284550 PMCID: PMC2690999 DOI: 10.1186/gb-2009-10-3-r28] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2008] [Revised: 02/12/2009] [Accepted: 03/10/2009] [Indexed: 01/20/2023] Open

A bioinformatician's guide to metagenomics. Microbiol Mol Biol Rev 2009;72:557-78, Table of Contents. [PMID: 19052320 DOI: 10.1128/mmbr.00009-08] [Citation(s) in RCA: 233] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

e-Science: relieving bottlenecks in large-scale genome analyses. Nat Rev Microbiol 2008;6:948-54. [PMID: 19008893 DOI: 10.1038/nrmicro2031] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Senger RS, Papoutsakis ET. Genome-scale model for Clostridium acetobutylicum: Part I. Metabolic network resolution and analysis. Biotechnol Bioeng 2008;101:1036-52. [PMID: 18767192 DOI: 10.1002/bit.22010] [Citation(s) in RCA: 141] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Whitaker JW, Letunic I, McConkey GA, Westhead DR. metaTIGER: a metabolic evolution resource. Nucleic Acids Res 2008;37:D531-8. [PMID: 18953037 PMCID: PMC2686446 DOI: 10.1093/nar/gkn826] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open

Bertin PN, Médigue C, Normand P. Advances in environmental genomics: towards an integrated view of micro-organisms and ecosystems. MICROBIOLOGY-SGM 2008;154:347-359. [PMID: 18227239 DOI: 10.1099/mic.0.2007/011791-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Comparative genome analysis in the integrated microbial genomes (IMG) system. Methods Mol Biol 2008;395:35-56. [PMID: 17993666 DOI: 10.1007/978-1-59745-514-5_3] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Sulakhe D, Rodriguez A, Wilde M, Foster I, Maltsev N. Interoperability of GADU in Using Heterogeneous Grid Resources for Bioinformatics Applications. ACTA ACUST UNITED AC 2008;12:241-6. [DOI: 10.1109/titb.2007.897783] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Yu C, Zavaljevski N, Desai V, Johnson S, Stevens FJ, Reifman J. The development of PIPA: an integrated and automated pipeline for genome-wide protein function annotation. BMC Bioinformatics 2008;9:52. [PMID: 18221520 PMCID: PMC2259298 DOI: 10.1186/1471-2105-9-52] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2007] [Accepted: 01/25/2008] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Automated protein function prediction methods are needed to keep pace with high-throughput sequencing. With the existence of many programs and databases for inferring different protein functions, a pipeline that properly integrates these resources will benefit from the advantages of each method. However, integrated systems usually do not provide mechanisms to generate customized databases to predict particular protein functions. Here, we describe a tool termed PIPA (Pipeline for Protein Annotation) that has these capabilities.

RESULTS

PIPA annotates protein functions by combining the results of multiple programs and databases, such as InterPro and the Conserved Domains Database, into common Gene Ontology (GO) terms. The major algorithms implemented in PIPA are: (1) a profile database generation algorithm, which generates customized profile databases to predict particular protein functions, (2) an automated ontology mapping generation algorithm, which maps various classification schemes into GO, and (3) a consensus algorithm to reconcile annotations from the integrated programs and databases.PIPA's profile generation algorithm is employed to construct the enzyme profile database CatFam, which predicts catalytic functions described by Enzyme Commission (EC) numbers. Validation tests show that CatFam yields average recall and precision larger than 95.0%. CatFam is integrated with PIPA. We use an association rule mining algorithm to automatically generate mappings between terms of two ontologies from annotated sample proteins. Incorporating the ontologies' hierarchical topology into the algorithm increases the number of generated mappings. In particular, it generates 40.0% additional mappings from the Clusters of Orthologous Groups (COG) to EC numbers and a six-fold increase in mappings from COG to GO terms. The mappings to EC numbers show a very high precision (99.8%) and recall (96.6%), while the mappings to GO terms show moderate precision (80.0%) and low recall (33.0%). Our consensus algorithm for GO annotation is based on the computation and propagation of likelihood scores associated with GO terms. The test results suggest that, for a given recall, the application of the consensus algorithm yields higher precision than when consensus is not used.

CONCLUSION

The algorithms implemented in PIPA provide automated genome-wide protein function annotation based on reconciled predictions from multiple resources.

Collapse

Vinatzer BA, Yan S. Mining the genomes of plant pathogenic bacteria: how not to drown in gigabases of sequence. MOLECULAR PLANT PATHOLOGY 2008;9:105-118. [PMID: 18705888 PMCID: PMC6640517 DOI: 10.1111/j.1364-3703.2007.00438.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Pinney JW, Papp B, Hyland C, Wambua L, Westhead DR, McConkey GA. Metabolic reconstruction and analysis for parasite genomes. Trends Parasitol 2007;23:548-54. [PMID: 17950669 DOI: 10.1016/j.pt.2007.08.013] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2007] [Revised: 08/20/2007] [Accepted: 08/20/2007] [Indexed: 01/29/2023]

Rodriguez AA, Bompada T, Syed M, Shah PK, Maltsev N. Evolutionary analysis of enzymes using Chisel. Bioinformatics 2007;23:2961-8. [PMID: 17855417 DOI: 10.1093/bioinformatics/btm421] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Maier TM, Casey MS, Becker RH, Dorsey CW, Glass EM, Maltsev N, Zahrt TC, Frank DW. Identification of Francisella tularensis Himar1-based transposon mutants defective for replication in macrophages. Infect Immun 2007;75:5376-89. [PMID: 17682043 PMCID: PMC2168294 DOI: 10.1128/iai.00238-07] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

Frishman D. Protein annotation at genomic scale: the current status. Chem Rev 2007;107:3448-66. [PMID: 17658902 DOI: 10.1021/cr068303k] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Sun Y, Wipat A, Pocock M, Lee PA, Flanagan K, Worthington JT. Exploring Microbial Genome Sequences to Identify Protein Families on the Grid. ACTA ACUST UNITED AC 2007;11:435-42. [PMID: 17674626 DOI: 10.1109/titb.2007.892913] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Iyer LM, Burroughs AM, Aravind L. The prokaryotic antecedents of the ubiquitin-signaling system and the early evolution of ubiquitin-like beta-grasp domains. Genome Biol 2007;7:R60. [PMID: 16859499 PMCID: PMC1779556 DOI: 10.1186/gb-2006-7-7-r60] [Citation(s) in RCA: 134] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2006] [Revised: 06/12/2006] [Accepted: 07/06/2006] [Indexed: 11/14/2022] Open

Abstract

A systematic analysis of prokaryotic ubiquitin-related beta-grasp fold proteins provides new insights into the Ubiquitin family functional history.

Background

Ubiquitin (Ub)-mediated signaling is one of the hallmarks of all eukaryotes. Prokaryotic homologs of Ub (ThiS and MoaD) and E1 ligases have been studied in relation to sulfur incorporation reactions in thiamine and molybdenum/tungsten cofactor biosynthesis. However, there is no evidence for entire protein modification systems with Ub-like proteins and deconjugation by deubiquitinating enzymes in prokaryotes. Hence, the evolutionary assembly of the eukaryotic Ub-signaling apparatus remains unclear.

Results

We systematically analyzed prokaryotic Ub-related β-grasp fold proteins using sensitive sequence profile searches and structural analysis. Consequently, we identified novel Ub-related proteins beyond the characterized ThiS, MoaD, TGS, and YukD domains. To understand their functional associations, we sought and recovered several conserved gene neighborhoods and domain architectures. These included novel associations involving diverse sulfur metabolism proteins, siderophore biosynthesis and the gene encoding the transfer mRNA binding protein SmpB, as well as domain fusions between Ub-like domains and PIN-domain related RNAses. Most strikingly, we found conserved gene neighborhoods in phylogenetically diverse bacteria combining genes for JAB domains (the primary de-ubiquitinating isopeptidases of the proteasomal complex), along with E1-like adenylating enzymes and different Ub-related proteins. Further sequence analysis of other conserved genes in these neighborhoods revealed several Ub-conjugating enzyme/E2-ligase related proteins. Genes for an Ub-like protein and a JAB domain peptidase were also found in the tail assembly gene cluster of certain caudate bacteriophages.

Conclusion

These observations imply that members of the Ub family had already formed strong functional associations with E1-like proteins, UBC/E2-related proteins, and JAB peptidases in the bacteria. Several of these Ub-like proteins and the associated protein families are likely to function together in signaling systems just as in eukaryotes.

Collapse

Markowitz VM. Microbial genome data resources. Curr Opin Biotechnol 2007;18:267-72. [PMID: 17467973 DOI: 10.1016/j.copbio.2007.04.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2007] [Revised: 03/18/2007] [Accepted: 04/18/2007] [Indexed: 11/17/2022]

D'Souza M, Glass EM, Syed MH, Zhang Y, Rodriguez A, Maltsev N, Galperin MY. Sentra: a database of signal transduction proteins for comparative genome analysis. Nucleic Acids Res 2006;35:D271-3. [PMID: 17135204 PMCID: PMC1751548 DOI: 10.1093/nar/gkl949] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open

Hyland C, Pinney JW, McConkey GA, Westhead DR. metaSHARK: a WWW platform for interactive exploration of metabolic networks. Nucleic Acids Res 2006;34:W725-8. [PMID: 16845107 PMCID: PMC1538829 DOI: 10.1093/nar/gkl196] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Stothard P, Wishart DS. Automated bacterial genome analysis and annotation. Curr Opin Microbiol 2006;9:505-10. [PMID: 16931121 DOI: 10.1016/j.mib.2006.08.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2006] [Accepted: 08/10/2006] [Indexed: 10/24/2022]

Galperin MY. The Molecular Biology Database Collection: 2006 update. Nucleic Acids Res 2006;34:D3-5. [PMID: 16381871 PMCID: PMC1347524 DOI: 10.1093/nar/gkj162] [Citation(s) in RCA: 59] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open