Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Caspi R, Altman T, Dale JM, Dreher K, Fulcher CA, Gilham F, Kaipa P, Karthikeyan AS, Kothari A, Krummenacker M, Latendresse M, Mueller LA, Paley S, Popescu L, Pujar A, Shearer AG, Zhang P, Karp PD. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res 2009;38:D473-9. [PMID: 19850718 PMCID: PMC2808959 DOI: 10.1093/nar/gkp875] [Citation(s) in RCA: 329] [Impact Index Per Article: 21.9] [Indexed: 12/20/2022] Open

Number

Cited by Other Article(s)

101

Haq IU, Graupner K, Nazir R, van Elsas JD. The genome of the fungal-interactive soil bacterium Burkholderia terrae BS001-a plethora of outstanding interactive capabilities unveiled. Genome Biol Evol 2014;6:1652-68. [PMID: 24923325 PMCID: PMC4122924 DOI: 10.1093/gbe/evu126] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Indexed: 12/16/2022] Open

Abstract

Burkholderia terrae strain BS001, obtained as an inhabitant of the mycosphere of Laccaria proxima (a close relative of Lyophyllum sp. strain Karsten), actively interacts with Lyophyllum sp. strain Karsten. Here we summarize the remarkable ecological behavior of B. terrae BS001 in the mycosphere and add key data to this. Moreover, we extensively analyze the approximately 11.5-Mb five-replicon genome of B. terrae BS001 and highlight its remarkable features. Seventy-nine regions of genomic plasticity (RGP), that is, 16.48% of the total genome size, were found. One 70.42-kb RGP, RGP76, revealed a typical conjugal element structure, including a full type 4 secretion system. Comparative analyses across 24 related Burkholderia genomes revealed that 95.66% of the total BS001 genome belongs to the variable part, whereas the remaining 4.34% constitutes the core genome. Genes for biofilm formation and several secretion systems, including a type 3 secretion system (T3SS), were found, which is consistent with the hypothesis that T3SSs play a role in the interaction with Lyophyllum sp. strain Karsten. The high number of predicted metabolic pathways and membrane transporters suggested that strain BS001 can take up and utilize a range of sugars, amino acids and organic acids. In particular, a unique glycerol uptake system was found. The BS001 genome further contains genetic systems for the degradation of complex organic compounds. Moreover, gene clusters encoding nonribosomal peptide synthetases (NRPS) and hybrid polyketide synthases/NRPS were found, highlighting the potential role of secondary metabolites in the ecology of strain BS001. The patchwork of genetic features observed in the genome is consistent with the notion that 1) horizontal gene transfer is a main driver of B. terrae BS001 adaptation and 2) the organism is very flexible in its ecological behavior in soil.

102

Montague E, Stanberry L, Higdon R, Janko I, Lee E, Anderson N, Choiniere J, Stewart E, Yandl G, Broomall W, Kolker N, Kolker E. MOPED 2.5--an integrated multi-omics resource: multi-omics profiling expression database now includes transcriptomics data. OMICS 2014;18:335-43. [PMID: 24910945 PMCID: PMC4048574 DOI: 10.1089/omi.2014.0061] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.8] [Indexed: 12/31/2022]

Abstract

Multi-omics data-driven scientific discovery crucially rests on high-throughput technologies and data sharing. Currently, data are scattered across single omics repositories, stored in varying raw and processed formats, and are often accompanied by limited or no metadata. The Multi-Omics Profiling Expression Database (MOPED, http://moped.proteinspire.org) version 2.5 is a freely accessible multi-omics expression database. Continual improvement and expansion of MOPED are driven by feedback from the Life Sciences Community. In order to meet the emergent need for an integrated multi-omics data resource, MOPED 2.5 now includes gene relative expression data in addition to protein absolute and relative expression data from over 250 large-scale experiments. To facilitate accurate integration of experiments and increase reproducibility, MOPED provides extensive metadata through the Data-Enabled Life Sciences Alliance (DELSA Global, http://delsaglobal.org) metadata checklist. MOPED 2.5 has greatly increased the number of proteomics absolute and relative expression records to over 500,000, in addition to adding more than four million transcriptomics relative expression records. MOPED has an intuitive user interface with tabs for querying different types of omics expression data and new tools for data visualization. Summary information including expression data, pathway mappings, and direct connection between proteins and genes can be viewed on Protein and Gene Details pages. These connections in MOPED provide a context for multi-omics expression data exploration. Researchers are encouraged to submit omics data which will be consistently processed into expression summaries. MOPED as a multi-omics data resource is a pivotal public database, interdisciplinary knowledge resource, and platform for multi-omics understanding.

Affiliation(s)

Elizabeth Montague: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Larissa Stanberry: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Roger Higdon: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Imre Janko: High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Elaine Lee: High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Nathaniel Anderson: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
John Choiniere: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Elizabeth Stewart: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Gregory Yandl: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
William Broomall: High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Natali Kolker: High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington
Eugene Kolker: Bioinformatics and High-Throughput Analysis Laboratory, Center for Developmental Therapeutics, Seattle Children's Research Institute, Seattle, Washington; High-throughput Analysis Core, Seattle Children's Research Institute, Seattle, Washington; Predictive Analytics, Seattle Children's, Seattle, Washington; Data-Enabled Life Sciences Alliance (DELSA Global), Seattle, Washington; Departments of Biomedical Informatics and Medical Education and Pediatrics, University of Washington, Seattle, Washington

103

Kumar S, Shah N, Garg V, Bhatia S. Large scale in-silico identification and characterization of simple sequence repeats (SSRs) from de novo assembled transcriptome of Catharanthus roseus (L.) G. Don. Plant Cell Rep 2014;33:905-918. [PMID: 24482265 DOI: 10.1007/s00299-014-1569-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Received: 09/25/2013] [Revised: 12/17/2013] [Accepted: 01/09/2014] [Indexed: 06/03/2023]

Abstract

Transcriptomic data of C. roseus offer ample sequence resources for better insight into gene diversity, including a large resource of genic SSR markers to accelerate genomic studies and breeding in Catharanthus. Next-generation sequencing is an efficient system for generating high-throughput complete transcripts/genes and developing molecular markers. We present here the transcriptome sequencing of a 26-day-old Catharanthus roseus seedling tissue using the Illumina GAIIX platform, which resulted in a total of 3.37 Gb of nucleotide sequence data comprising 29,964,104 reads that were de novo assembled into 26,581 unigenes. Based on similarity searches, 58% of the unigenes were annotated, of which 13,580 unique transcripts were assigned 5,016 gene ontology terms. Further, 7,687 of the unigenes were found to have Clusters of Orthologous Groups classifications, and 4,006 were assigned to 289 Kyoto Encyclopedia of Genes and Genomes pathways. Also, 5,221 (19.64%) of the transcripts were distributed to 81 known transcription factor (TF) families. In-silico analysis of the transcriptome resulted in identification of 11,004 SSRs in 26.62% of transcripts, from which 2,520 SSR markers were designed; these exhibited a non-random pattern of distribution. The most abundant were the trinucleotide repeats (AAG/CTT), followed by the dinucleotide repeats (AG/CT). Location-specific analysis of SSRs revealed that SSRs were preferentially associated with the 5'-UTRs, with a predicted role in regulation of gene expression. A PCR validation of a set of 48 primers revealed 97.9% successful amplification, and 76.6% of them showed polymorphism across different Catharanthus species as well as accessions of C. roseus. In summary, this study will provide insight into seedling development and resources for novel gene discovery and SSR development for utilization in marker-assisted selective breeding in C. roseus.

104

Feltes BC, de Faria Poloni J, Nunes IJG, Bonatto D. Fetal alcohol syndrome, chemo-biology and OMICS: ethanol effects on vitamin metabolism during neurodevelopment as measured by systems biology analysis. OMICS 2014;18:344-63. [PMID: 24816220 DOI: 10.1089/omi.2013.0144] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Indexed: 11/12/2022]

105

Streptococcus pyogenes polymyxin B-resistant mutants display enhanced ExPortal integrity. J Bacteriol 2014;196:2563-77. [PMID: 24794568 DOI: 10.1128/jb.01596-14] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Indexed: 01/06/2023] Open

106

Genomic features of a bumble bee symbiont reflect its host environment. Appl Environ Microbiol 2014;80:3793-803. [PMID: 24747890 DOI: 10.1128/aem.00322-14] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Indexed: 12/31/2022] Open

107

Glass K, Girvan M. Annotation enrichment analysis: an alternative method for evaluating the functional properties of gene sets. Sci Rep 2014;4:4191. [PMID: 24569707 PMCID: PMC3935204 DOI: 10.1038/srep04191] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Received: 04/16/2013] [Accepted: 01/28/2014] [Indexed: 12/18/2022] Open

108

Macklin DN, Ruggero NA, Covert MW. The future of whole-cell modeling. Curr Opin Biotechnol 2014;28:111-5. [PMID: 24556244 DOI: 10.1016/j.copbio.2014.01.012] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Received: 09/30/2013] [Revised: 01/19/2014] [Accepted: 01/20/2014] [Indexed: 12/21/2022]

109

Watson E, MacNeil LT, Ritter AD, Yilmaz LS, Rosebrock AP, Caudy AA, Walhout AJM. Interspecies systems biology uncovers metabolites affecting C. elegans gene expression and life history traits. Cell 2014;156:759-70. [PMID: 24529378 PMCID: PMC4169190 DOI: 10.1016/j.cell.2014.01.047] [Citation(s) in RCA: 143] [Impact Index Per Article: 14.3] [Received: 08/06/2013] [Revised: 10/09/2013] [Accepted: 01/09/2014] [Indexed: 01/07/2023]

110

Benedict MN, Henriksen JR, Metcalf WW, Whitaker RJ, Price ND. ITEP: an integrated toolkit for exploration of microbial pan-genomes. BMC Genomics 2014;15:8. [PMID: 24387194 PMCID: PMC3890548 DOI: 10.1186/1471-2164-15-8] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.8] [Received: 08/09/2013] [Accepted: 12/18/2013] [Indexed: 01/31/2023] Open

Abstract

Background

Comparative genomics is a powerful approach for studying variation in physiological traits as well as the evolution and ecology of microorganisms. Recent technological advances have enabled sequencing large numbers of related genomes in a single project, requiring computational tools for their integrated analysis. In particular, accurate annotations and identification of gene presence and absence are critical for understanding and modeling the cellular physiology of newly sequenced genomes. Although many tools are available to compare the gene contents of related genomes, new tools are necessary to enable close examination and curation of protein families from large numbers of closely related organisms, to integrate curation with the analysis of gain and loss, and to generate metabolic networks linking the annotations to observed phenotypes.

Results

We have developed ITEP, an Integrated Toolkit for Exploration of microbial Pan-genomes, to curate protein families, compute similarities to externally-defined domains, analyze gene gain and loss, and generate draft metabolic networks from one or more curated reference network reconstructions in groups of related microbial species among which the combination of core and variable genes constitutes their "pan-genomes". The ITEP toolkit consists of: (1) a series of modular command-line scripts for identification, comparison, curation, and analysis of protein families and their distribution across many genomes; (2) a set of Python libraries for programmatic access to the same data; and (3) pre-packaged scripts to perform common analysis workflows on a collection of genomes. ITEP’s capabilities include de novo protein family prediction, ortholog detection, analysis of functional domains, identification of core and variable genes and gene regions, sequence alignments and tree generation, annotation curation, and the integration of cross-genome analysis and metabolic networks for study of metabolic network evolution.

Conclusions

ITEP is a powerful, flexible toolkit for generation and curation of protein families. ITEP's modular design allows for straightforward extension as analysis methods and tools evolve. By integrating comparative genomics with the development of draft metabolic networks, ITEP harnesses the power of comparative genomics to build confidence in links between genotype and phenotype and helps disambiguate gene annotations when they are evaluated in both evolutionary and metabolic network contexts.

111

Dreher K. Putting The Plant Metabolic Network pathway databases to work: going offline to gain new capabilities. Methods Mol Biol 2014;1083:151-71. [PMID: 24218215 DOI: 10.1007/978-1-62703-661-0_10] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Indexed: 04/25/2023]

112

Heath BS, Marshall MJ, Laskin J. The characterization of living bacterial colonies using nanospray desorption electrospray ionization mass spectrometry. Methods Mol Biol 2014;1151:199-208. [PMID: 24838888 DOI: 10.1007/978-1-4939-0554-6_14] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 06/03/2023]

113

Somerville GA, Powers R. Growth and preparation of Staphylococcus epidermidis for NMR metabolomic analysis. Methods Mol Biol 2014;1106:71-91. [PMID: 24222456 DOI: 10.1007/978-1-62703-736-5_6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Indexed: 01/28/2023]

114

Oakeson KF, Gil R, Clayton AL, Dunn DM, von Niederhausern AC, Hamil C, Aoyagi A, Duval B, Baca A, Silva FJ, Vallier A, Jackson DG, Latorre A, Weiss RB, Heddi A, Moya A, Dale C. Genome degeneration and adaptation in a nascent stage of symbiosis. Genome Biol Evol 2014;6:76-93. [PMID: 24407854 PMCID: PMC3914690 DOI: 10.1093/gbe/evt210] [Citation(s) in RCA: 134] [Impact Index Per Article: 13.4] [Indexed: 02/07/2023] Open

Abstract

Symbiotic associations between animals and microbes are ubiquitous in nature, with an estimated 15% of all insect species harboring intracellular bacterial symbionts. Most bacterial symbionts share many genomic features including small genomes, nucleotide composition bias, high coding density, and a paucity of mobile DNA, consistent with long-term host association. In this study, we focus on the early stages of genome degeneration in a recently derived insect-bacterial mutualistic intracellular association. We present the complete genome sequence and annotation of Sitophilus oryzae primary endosymbiont (SOPE). We also present the finished genome sequence and annotation of strain HS, a close free-living relative of SOPE and other insect symbionts of the Sodalis-allied clade, whose gene inventory is expected to closely resemble the putative ancestor of this group. Structural, functional, and evolutionary analyses indicate that SOPE has undergone extensive adaptation toward an insect-associated lifestyle in a very short time period. The genome of SOPE is large in size when compared with many ancient bacterial symbionts; however, almost half of the protein-coding genes in SOPE are pseudogenes. There is also evidence for relaxed selection on the remaining intact protein-coding genes. Comparative analyses of the whole-genome sequence of strain HS and SOPE highlight numerous genomic rearrangements, duplications, and deletions facilitated by a recent expansion of insertion sequence elements, some of which appear to have catalyzed adaptive changes. Functional metabolic predictions suggest that SOPE has lost the ability to synthesize several essential amino acids and vitamins. Analyses of the bacterial cell envelope and genes encoding secretion systems suggest that these structures and elements have become simplified in the transition to a mutualistic association.

115

Ngounou Wetie AG, Sokolowska I, Woods AG, Roy U, Deinhardt K, Darie CC. Protein-protein interactions: switch from classical methods to proteomics and bioinformatics-based approaches. Cell Mol Life Sci 2014;71:205-28. [PMID: 23579629 PMCID: PMC11113707 DOI: 10.1007/s00018-013-1333-1] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.9] [Received: 12/08/2012] [Revised: 03/25/2013] [Accepted: 03/26/2013] [Indexed: 11/28/2022]

116

Medina S, Domínguez-Perles R, Ferreres F, Tomás-Barberán FA, Gil-Izquierdo Á. The effects of the intake of plant foods on the human metabolome. Trends Analyt Chem 2013. [DOI: 10.1016/j.trac.2013.08.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 10/26/2022]

117

Kuhn M, Szklarczyk D, Pletscher-Frankild S, Blicher TH, von Mering C, Jensen LJ, Bork P. STITCH 4: integration of protein-chemical interactions with user data. Nucleic Acids Res 2013;42:D401-7. [PMID: 24293645 PMCID: PMC3964996 DOI: 10.1093/nar/gkt1207] [Citation(s) in RCA: 309] [Impact Index Per Article: 28.1] [Indexed: 12/27/2022] Open

118

Hamilton JJ, Reed JL. Software platforms to facilitate reconstructing genome-scale metabolic networks. Environ Microbiol 2013;16:49-59. [PMID: 24148076 DOI: 10.1111/1462-2920.12312] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.6] [Received: 08/17/2013] [Accepted: 10/12/2013] [Indexed: 12/24/2022]

119

Rodrigues A, Formas-Oliveira A, Bandeira V, Alves P, Hu W, Coroadinha A. Metabolic pathways recruited in the production of a recombinant enveloped virus: Mining targets for process and cell engineering. Metab Eng 2013;20:131-45. [DOI: 10.1016/j.ymben.2013.10.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 04/26/2013] [Revised: 07/22/2013] [Accepted: 10/03/2013] [Indexed: 11/27/2022]

120

Crook N, Alper HS. Model-based design of synthetic, biological systems. Chem Eng Sci 2013. [DOI: 10.1016/j.ces.2012.12.022] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 12/18/2022]

121

Milreu PV, Klein CC, Cottret L, Acuña V, Birmelé E, Borassi M, Junot C, Marchetti-Spaccamela A, Marino A, Stougie L, Jourdan F, Crescenzi P, Lacroix V, Sagot MF. Telling metabolic stories to explore metabolomics data: a case study on the yeast response to cadmium exposure. Bioinformatics 2013;30:61-70. [PMID: 24167155 PMCID: PMC3866556 DOI: 10.1093/bioinformatics/btt597] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Indexed: 01/30/2023]

Abstract

Motivation: The increasing availability of metabolomics data makes it possible to better understand the metabolic processes involved in the immediate response of an organism to environmental changes and stress. The data usually come in the form of a list of metabolites whose concentrations significantly changed under some conditions, and are thus not easy to interpret without being able to precisely visualize how such metabolites are interconnected.

Results: We present a method that makes it possible to organize the data from any metabolomics experiment into metabolic stories. Each story corresponds to a possible scenario explaining the flow of matter between the metabolites of interest. These scenarios may then be ranked in different ways depending on which interpretation one wishes to emphasize for the causal link between two affected metabolites: enzyme activation, enzyme inhibition or domino effect on the concentration changes of substrates and products. Equally probable stories under any selected ranking scheme can be further grouped into a single anthology that summarizes, in a unique subnetwork, all equivalently plausible alternative stories. An anthology is simply a union of such stories. We detail an application of the method to the response of yeast to cadmium exposure. We use this system as a proof of concept for our method, and we show that we are able to find a story that reproduces very well the current knowledge about the yeast response to cadmium. We further show that this response is mostly based on enzyme activation. We also provide a framework for exploring the alternative pathways or side effects this local response is expected to have in the rest of the network. We discuss several interpretations for the changes we see, and we suggest hypotheses that could in principle be experimentally tested. Notably, our method requires simple input data and could be used in a wide variety of applications.

Availability and implementation: The code for the method presented in this article is available at http://gobbolino.gforge.inria.fr.

Contact: pvmilreu@gmail.com; vincent.lacroix@univ-lyon1.fr; marie-france.sagot@inria.fr

Supplementary information: Supplementary data are available at Bioinformatics online.

122

Percudani R, Carnevali D, Puggioni V. Ureidoglycolate hydrolase, amidohydrolase, lyase: how errors in biological databases are incorporated in scientific papers and vice versa. Database (Oxford) 2013;2013:bat071. [PMID: 24107613 PMCID: PMC3793230 DOI: 10.1093/database/bat071] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Indexed: 11/24/2022]

123

Kuperstein I, Cohen DPA, Pook S, Viara E, Calzone L, Barillot E, Zinovyev A. NaviCell: a web-based environment for navigation, curation and maintenance of large molecular interaction maps. BMC Syst Biol 2013;7:100. [PMID: 24099179 PMCID: PMC3851986 DOI: 10.1186/1752-0509-7-100] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Received: 12/26/2012] [Accepted: 09/20/2013] [Indexed: 11/24/2022]

Abstract

Background

Molecular biology knowledge can be formalized and systematically represented in a computer-readable form as a comprehensive map of molecular interactions. There exist an increasing number of maps of molecular interactions containing detailed and step-wise description of various cell mechanisms. It is difficult to explore these large maps, to organize discussion of their content and to maintain them. Several efforts were recently made to combine these capabilities together in one environment, and NaviCell is one of them.

Results

NaviCell is a web-based environment for exploiting large maps of molecular interactions, created in CellDesigner, allowing their easy exploration, curation and maintenance. It is characterized by a combination of three essential features: (1) efficient map browsing based on Google Maps; (2) semantic zooming for viewing different levels of detail or of abstraction of the map and (3) integrated web-based blog for collecting community feedback. NaviCell can be easily used by experts in the field of molecular biology for studying molecular entities of interest in the context of signaling pathways and crosstalk between pathways within a global signaling network. NaviCell allows both exploration of detailed molecular mechanisms represented on the map and a more abstract view of the map up to a top-level modular representation. NaviCell greatly facilitates curation, maintenance and updating of comprehensive maps of molecular interactions in an interactive and user-friendly fashion due to an embedded blogging system.

Conclusions

NaviCell provides user-friendly exploration of large-scale maps of molecular interactions, thanks to Google Maps and WordPress interfaces, with which many users are already familiar. Semantic zooming which is used for navigating geographical maps is adopted for molecular maps in NaviCell, making any level of visualization readable. In addition, NaviCell provides a framework for community-based curation of maps.

124

Carbonetto P, Stephens M. Integrated enrichment analysis of variants and pathways in genome-wide association studies indicates central role for IL-2 signaling genes in type 1 diabetes, and cytokine signaling genes in Crohn's disease. PLoS Genet 2013;9:e1003770. [PMID: 24098138 PMCID: PMC3789883 DOI: 10.1371/journal.pgen.1003770] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Received: 08/21/2012] [Accepted: 07/22/2013] [Indexed: 12/17/2022] Open

Abstract

Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn's disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and "Measles" pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14, and FYN) constitute novel putative T1D loci for further study.

125

Demir E, Babur Ö, Rodchenkov I, Aksoy BA, Fukuda KI, Gross B, Sümer OS, Bader GD, Sander C. Using biological pathway data with paxtools. PLoS Comput Biol 2013;9:e1003194. [PMID: 24068901 PMCID: PMC3777916 DOI: 10.1371/journal.pcbi.1003194] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Received: 01/21/2013] [Accepted: 06/25/2013] [Indexed: 11/18/2022] Open

126

Gao L, Du G, Zhou J, Chen J, Liu J. Characterization of a group of pyrroloquinoline quinone-dependent dehydrogenases that are involved in the conversion of L-sorbose to 2-Keto-L-gulonic acid in Ketogulonicigenium vulgare WSH-001. Biotechnol Prog 2013;29:1398-404. [PMID: 23970495 DOI: 10.1002/btpr.1803] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Received: 04/02/2013] [Revised: 06/02/2013] [Indexed: 11/09/2022]

127

Network-based approaches in drug discovery and early development. Clin Pharmacol Ther 2013;94:651-8. [PMID: 24025802 DOI: 10.1038/clpt.2013.176] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Received: 08/23/2013] [Accepted: 09/03/2013] [Indexed: 12/20/2022]

128

Structure-based protein-protein interaction networks and drug design. Quant Biol 2013. [DOI: 10.1007/s40484-013-0018-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 10/26/2022]

129

Koskimaki JE, Blazier AS, Clarens AF, Papin JA. Computational Models of Algae Metabolism for Industrial Applications. Ind Biotechnol (New Rochelle N Y) 2013. [DOI: 10.1089/ind.2013.0012] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 01/20/2023] Open

130

Helicobacter pylori salvages purines from extracellular host cell DNA utilizing the outer membrane-associated nuclease NucT. J Bacteriol 2013;195:4387-98. [PMID: 23893109 DOI: 10.1128/jb.00388-13] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Indexed: 01/01/2023] Open

Abstract

Helicobacter pylori is a bacterial pathogen that establishes life-long infections in humans, and its presence in the gastric epithelium is strongly associated with gastritis, peptic ulcer disease, and gastric cancer. Having evolved in this specific gastric niche for hundreds of thousands of years, this microbe has become dependent on its human host. Bioinformatic analysis reveals that H. pylori has lost several genes involved in the de novo synthesis of purine nucleotides, and without this pathway present, H. pylori must salvage purines from its environment in order to grow. While the presence and abundance of free purines in various mammalian tissues has been loosely quantified, the concentration of purines present within the gastric mucosa remains unknown. There is evidence, however, that a significant amount of extracellular DNA is present in the human gastric mucosal layer as a result of epithelial cell turnover, and this DNA has the potential to serve as an adequate purine source for gastric purine auxotrophs. In this study, we characterize the ability of H. pylori to grow utilizing only DNA as a purine source. We show that this ability is independent of the ComB DNA uptake system, and that H. pylori utilization of DNA as a purine source is largely influenced by the presence of an outer membrane-associated nuclease (NucT). A ΔnucT mutant exhibits significantly reduced extracellular nuclease activity and is deficient in growth when DNA is provided as the sole purine source in laboratory growth media. These growth defects are also evident when this nuclease mutant is grown in the presence of AGS cells or in purine-free tissue culture medium that has been conditioned by AGS cells in the absence of fetal bovine serum. Taken together, these results indicate that the salvage of purines from exogenous host cell DNA plays an important role in allowing H. pylori to meet its purine requirements for growth.

131

Poliquin PO, Chen J, Cloutier M, Trudeau LÉ, Jolicoeur M. Metabolomics and in-silico analysis reveal critical energy deregulations in animal models of Parkinson's disease. PLoS One 2013;8:e69146. [PMID: 23935941 PMCID: PMC3720533 DOI: 10.1371/journal.pone.0069146] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Received: 04/14/2013] [Accepted: 06/04/2013] [Indexed: 11/18/2022] Open

132

Chung BKS, Dick T, Lee DY. In silico analyses for the discovery of tuberculosis drug targets. J Antimicrob Chemother 2013;68:2701-9. [DOI: 10.1093/jac/dkt273] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Indexed: 11/13/2022] Open

133

Identification of drug targets by chemogenomic and metabolomic profiling in yeast. Pharmacogenet Genomics 2013;22:877-86. [PMID: 23076370 DOI: 10.1097/fpc.0b013e32835aa888] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 01/03/2023]

134

Predictions of Enzymatic Parameters: A Mini-Review with Focus on Enzymes for Biofuel. Appl Biochem Biotechnol 2013;171:590-615. [DOI: 10.1007/s12010-013-0328-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Received: 01/09/2013] [Accepted: 06/11/2013] [Indexed: 12/25/2022]

135

Quantification of endospore-forming firmicutes by quantitative PCR with the functional gene spo0A. Appl Environ Microbiol 2013;79:5302-12. [PMID: 23811505 DOI: 10.1128/aem.01376-13] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Indexed: 11/20/2022] Open

136

Sorokina SY, Kuptzov VN, Urban YN, Fokin AV, Pojarkov SV, Ivankov MY, Melnikov AI, Kulikov AM. Databases as instruments for analysis of large-scale data sets of interactions between molecular biological objects. BIOL BULL+ 2013. [DOI: 10.1134/s1062359013030096] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 11/23/2022]

137

Sawada Y, Hirai MY. Integrated LC-MS/MS system for plant metabolomics. Comput Struct Biotechnol J 2013;4:e201301011. [PMID: 24688692 PMCID: PMC3962214 DOI: 10.5936/csbj.201301011] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Received: 02/16/2013] [Revised: 04/01/2013] [Accepted: 04/05/2013] [Indexed: 12/31/2022] Open

138

Lee DH, Lim JA, Lee J, Roh E, Jung K, Choi M, Oh C, Ryu S, Yun J, Heu S. Characterization of genes required for the pathogenicity of Pectobacterium carotovorum subsp. carotovorum Pcc21 in Chinese cabbage. Microbiology 2013;159:1487-1496. [PMID: 23676432 PMCID: PMC3749726 DOI: 10.1099/mic.0.067280-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Indexed: 12/16/2022]

139

Van Moerkercke A, Fabris M, Pollier J, Baart GJE, Rombauts S, Hasnain G, Rischer H, Memelink J, Oksman-Caldentey KM, Goossens A. CathaCyc, a metabolic pathway database built from Catharanthus roseus RNA-Seq data. Plant Cell Physiol 2013;54:673-85. [PMID: 23493402 DOI: 10.1093/pcp/pct039] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Indexed: 05/08/2023]

140

Glez-Peña D, Lourenço A, López-Fernández H, Reboiro-Jato M, Fdez-Riverola F. Web scraping technologies in an API world. Brief Bioinform 2013;15:788-97. [PMID: 23632294 DOI: 10.1093/bib/bbt026] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Indexed: 11/14/2022] Open

141

Inactivation of the Pta-AckA pathway causes cell death in Staphylococcus aureus. J Bacteriol 2013;195:3035-44. [PMID: 23625849 DOI: 10.1128/jb.00042-13] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Indexed: 12/11/2022] Open

142

Steeb B, Claudi B, Burton NA, Tienz P, Schmidt A, Farhan H, Mazé A, Bumann D. Parallel exploitation of diverse host nutrients enhances Salmonella virulence. PLoS Pathog 2013;9:e1003301. [PMID: 23633950 PMCID: PMC3636032 DOI: 10.1371/journal.ppat.1003301] [Citation(s) in RCA: 128] [Impact Index Per Article: 11.6] [Received: 08/20/2012] [Accepted: 02/26/2013] [Indexed: 12/20/2022] Open

143

Vasco-Cárdenas MF, Baños S, Ramos A, Martín JF, Barreiro C. Proteome response of Corynebacterium glutamicum to high concentration of industrially relevant C₄ and C₅ dicarboxylic acids. J Proteomics 2013;85:65-88. [PMID: 23624027 DOI: 10.1016/j.jprot.2013.04.019] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Received: 11/10/2012] [Revised: 03/05/2013] [Accepted: 04/09/2013] [Indexed: 12/11/2022]

144

Kremmydas GF, Tampakaki AP, Georgakopoulos DG. Characterization of the biocontrol activity of Pseudomonas fluorescens strain X reveals novel genes regulated by glucose. PLoS One 2013;8:e61808. [PMID: 23596526 PMCID: PMC3626644 DOI: 10.1371/journal.pone.0061808] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Received: 12/17/2011] [Accepted: 03/18/2013] [Indexed: 11/18/2022] Open

Abstract

Pseudomonas fluorescens strain X, a bacterial isolate from the rhizosphere of bean seedlings, has the ability to suppress damping-off caused by the oomycete Pythium ultimum. To determine the genes controlling the biocontrol activity of strain X, transposon mutagenesis, sequencing and complementation were performed. Results indicate that the biocontrol ability of this isolate is attributed to the gcd gene encoding glucose dehydrogenase, genes encoding its co-enzyme pyrroloquinoline quinone (PQQ), and two genes (sup5 and sup6) which seem to be organized in a putative operon. This operon (named supX) consists of five genes, one of which encodes a non-ribosomal peptide synthase. A unique binding site for a GntR-type transcriptional factor is localized upstream of the supX putative operon. Synteny comparison of the genes in supX revealed that they are common in the genus Pseudomonas, but with a low degree of similarity. supX shows high similarity only to the mangotoxin operon of Ps. syringae pv. syringae UMAF0158. Quantitative real-time PCR analysis indicated that transcription of supX is strongly reduced in the gcd and PQQ-minus mutants of Ps. fluorescens strain X. In contrast, transcription of supX in the wild type is enhanced by glucose, and transcription levels appear to be higher during the stationary phase. Gcd, which uses PQQ as a cofactor, catalyses the oxidation of glucose to gluconic acid, which controls the activity of the GntR family of transcriptional factors. The genes in the supX putative operon have not been implicated before in the biocontrol of plant pathogens by pseudomonads. They are involved in the biosynthesis of an antimicrobial compound by Ps. fluorescens strain X and their transcription is controlled by glucose, possibly through the activity of a GntR-type transcriptional factor binding upstream of this putative operon.

145

Kang C, Yu H, Yi GS. Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data. BMC Med Inform Decis Mak 2013;13 Suppl 1:S3. [PMID: 23566118 PMCID: PMC3618247 DOI: 10.1186/1472-6947-13-s1-s3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 02/07/2023] Open

Abstract

Background

Due to the low statistical power of individual markers from a genome-wide association study (GWAS), detecting causal single nucleotide polymorphisms (SNPs) for complex diseases is a challenge. SNP combinations are suggested to compensate for the low statistical power of individual markers, but SNP combinations from GWAS generate high computational complexity.

Methods

We aim to detect type 2 diabetes (T2D) causal SNP combinations from a GWAS dataset with optimal filtration and to discover the biological meaning of the detected SNP combinations. Optimal filtration can enhance the statistical power of SNP combinations by comparing the error rates of SNP combinations from various Bonferroni thresholds and p-value range-based thresholds combined with linkage disequilibrium (LD) pruning. T2D causal SNP combinations are selected using random forests with variable selection from an optimal SNP dataset. T2D causal SNP combinations and genome-wide SNPs are mapped into functional modules using expanded gene set enrichment analysis (GSEA) considering pathway, transcription factor (TF)-target, miRNA-target, gene ontology, and protein complex functional modules. The prediction error rates are measured for SNP sets from functional module-based filtration, which selects SNPs within functional modules from genome-wide SNPs based on expanded GSEA.

Results

A T2D causal SNP combination containing 101 SNPs from the Wellcome Trust Case Control Consortium (WTCCC) GWAS dataset is selected using optimal filtration criteria, with an error rate of 10.25%. Matching the 101 SNPs with known T2D genes and functional modules reveals the relationships between T2D and SNP combinations. The prediction error rates of SNP sets from functional module-based filtration show no significant difference from the prediction error rates of randomly selected SNP sets and T2D causal SNP combinations from optimal filtration.

Conclusions

We propose a detection method for complex disease causal SNP combinations from an optimal SNP dataset by using random forests with variable selection. Mapping the biological meanings of detected SNP combinations can help uncover complex disease mechanisms.

146

Remli MA, Deris S. An Approach for Biological Data Integration and Knowledge Retrieval Based on Ontology, Semantic Web Services Composition, and AI Planning. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/09/2022] Open

147

Jacobsen UP, Nielsen HB, Hildebrand F, Raes J, Sicheritz-Ponten T, Kouskoumvekaki I, Panagiotou G. The chemical interactome space between the human host and the genetically defined gut metabotypes. ISME J 2013;7:730-42. [PMID: 23178670 PMCID: PMC3603391 DOI: 10.1038/ismej.2012.141] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Received: 02/14/2012] [Revised: 07/30/2012] [Accepted: 09/28/2012] [Indexed: 01/07/2023]

148

Foster A, Barnes N, Speight R, Morris PC, Keane MA. Role of amine oxidase expression to maintain putrescine homeostasis in Rhodococcus opacus. Enzyme Microb Technol 2013;52:286-95. [DOI: 10.1016/j.enzmictec.2013.01.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 10/24/2012] [Revised: 12/12/2012] [Accepted: 01/07/2013] [Indexed: 10/27/2022]

149

De Filippo C, Ramazzotti M, Fontana P, Cavalieri D. Bioinformatic approaches for functional annotation and pathway inference in metagenomics data. Brief Bioinform 2013;13:696-710. [PMID: 23175748 PMCID: PMC3505041 DOI: 10.1093/bib/bbs070] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Indexed: 02/06/2023] Open

150

Altman T, Travers M, Kothari A, Caspi R, Karp PD. A systematic comparison of the MetaCyc and KEGG pathway databases. BMC Bioinformatics 2013;14:112. [PMID: 23530693 PMCID: PMC3665663 DOI: 10.1186/1471-2105-14-112] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Received: 07/06/2012] [Accepted: 03/04/2013] [Indexed: 01/06/2023] Open

Abstract

Background

The MetaCyc and KEGG projects have developed large metabolic pathway databases that are used for a variety of applications including genome analysis and metabolic engineering. We present a comparison of the compound, reaction, and pathway content of MetaCyc version 16.0 and a KEGG version downloaded on Feb-27-2012 to increase understanding of their relative sizes, their degree of overlap, and their scope. To assess their overlap, we must know the correspondences between compounds, reactions, and pathways in MetaCyc, and those in KEGG. We devoted significant effort to computational and manual matching of these entities, and we evaluated the accuracy of the correspondences.

Results

KEGG contains 179 module pathways versus 1,846 base pathways in MetaCyc; KEGG contains 237 map pathways versus 296 super pathways in MetaCyc. KEGG pathways contain 3.3 times as many reactions on average as do MetaCyc pathways, and the databases employ different conceptualizations of metabolic pathways. KEGG contains 8,692 reactions versus 10,262 for MetaCyc. 6,174 KEGG reactions are components of KEGG pathways versus 6,348 for MetaCyc. KEGG contains 16,586 compounds versus 11,991 for MetaCyc. 6,912 KEGG compounds act as substrates in KEGG reactions versus 8,891 for MetaCyc. MetaCyc contains a broader set of database attributes than does KEGG, such as relationships from a compound to enzymes that it regulates, identification of spontaneous reactions, and the expected taxonomic range of metabolic pathways. MetaCyc contains many pathways not found in KEGG, from plants, fungi, metazoa, and actinobacteria; KEGG contains pathways not found in MetaCyc, for xenobiotic degradation, glycan metabolism, and metabolism of terpenoids and polyketides. MetaCyc contains fewer unbalanced reactions, which facilitates metabolic modeling such as using flux-balance analysis. MetaCyc includes generic reactions that may be instantiated computationally.

Conclusions

KEGG contains significantly more compounds than does MetaCyc, whereas MetaCyc contains significantly more reactions and pathways than does KEGG; in particular, KEGG modules are quite incomplete. The numbers of reactions occurring in pathways in the two databases are quite similar.
