Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	Karp PD, Paley S. Integrated access to metabolic and genomic data. J Comput Biol 1996;3:191-212. [PMID: 8697237 DOI: 10.1089/cmb.1996.3.191] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open

Number

Cited by Other Article(s)

Herson J, Krummenacker M, Spaulding A, O'Maille P, Karp PD. The Genome Explorer genome browser. mSystems 2024;9:e0026724. [PMID: 38958457 PMCID: PMC11265445 DOI: 10.1128/msystems.00267-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 05/28/2024] [Indexed: 07/04/2024] Open

Abstract

Are two adjacent genes in the same operon? What are the order and spacing between several transcription factor binding sites? Genome browsers are software data visualization and exploration tools that enable biologists to answer questions such as these. In this paper, we report on a major update to our browser, Genome Explorer, that provides nearly instantaneous scaling and traversing of a genome, enabling users to quickly and easily zoom into an area of interest. The user can rapidly move between scales that depict the entire genome, individual genes, and the sequence; Genome Explorer presents the most relevant detail and context for each scale. By downloading the data for the entire genome to the user's web browser and dynamically generating visualizations locally, we enable ﬁne control of zoom and pan functions and real-time redrawing of the visualization, resulting in smoother and more intuitive exploration of a genome than is possible with other browsers. Further, genome features are presented together, in-line, using familiar graphical depictions. In contrast, many other browsers depict genome features using data tracks, which have low information density and can visually obscure the relative positions of features. Genome Explorer diagrams have a high information density that provides larger amounts of genome context and sequence information to be presented in a given-sized monitor than for tracks-based browsers. Genome Explorer provides optional data tracks for the analysis of large-scale data sets and a unique comparative mode that aligns genomes at orthologous genes with synchronized zooming.

IMPORTANCE

Genome browsers provide graphical depictions of genome information to speed the uptake of complex genome data by scientists. They provide search operations to help scientists find information and zoom operations to enable scientists to view genome features at different resolutions. We introduce the Genome Explorer browser, which provides extremely fast zooming and panning of genome visualizations and displays with high information density.

Collapse

Khomtchouk BB, Weitz E, Karp PD, Wahlestedt C. How the strengths of Lisp-family languages facilitate building complex and flexible bioinformatics applications. Brief Bioinform 2018;19:537-543. [PMID: 28040748 PMCID: PMC5952920 DOI: 10.1093/bib/bbw130] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Revised: 11/16/2016] [Indexed: 11/14/2022] Open

Karp PD, Latendresse M, Paley SM, Krummenacker M, Ong QD, Billington R, Kothari A, Weaver D, Lee T, Subhraveti P, Spaulding A, Fulcher C, Keseler IM, Caspi R. Pathway Tools version 19.0 update: software for pathway/genome informatics and systems biology. Brief Bioinform 2015;17:877-90. [PMID: 26454094 DOI: 10.1093/bib/bbv079] [Citation(s) in RCA: 173] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Indexed: 11/15/2022] Open

Hamerly T, Tripet BP, Tigges M, Giannone RJ, Wurch L, Hettich RL, Podar M, Copié V, Bothner B. Untargeted metabolomics studies employing NMR and LC-MS reveal metabolic coupling between Nanoarcheum equitans and its archaeal host Ignicoccus hospitalis. Metabolomics 2015;11:895-907. [PMID: 26273237 PMCID: PMC4529127 DOI: 10.1007/s11306-014-0747-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Fondi M, Liò P. Genome-scale metabolic network reconstruction. Methods Mol Biol 2015;1231:233-256. [PMID: 25343869 DOI: 10.1007/978-1-4939-1720-4_15] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Shanmugasundram A, Gonzalez-Galarza FF, Wastling JM, Vasieva O, Jones AR. An integrated approach to understand apicomplexan metabolism from their genomes. BMC Bioinformatics 2014. [PMCID: PMC4071867 DOI: 10.1186/1471-2105-15-s3-a3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Shanmugasundram A, Gonzalez-Galarza FF, Wastling JM, Vasieva O, Jones AR. Library of Apicomplexan Metabolic Pathways: a manually curated database for metabolic pathways of apicomplexan parasites. Nucleic Acids Res 2012. [PMID: 23193253 PMCID: PMC3531055 DOI: 10.1093/nar/gks1139] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

MILED ZINABEN, WEBSTER YUEW, LIU YANG, LI NIANHUA. AN ONTOLOGY FOR SEMANTIC INTEGRATION OF LIFE SCIENCE WEB DATABASES. INT J COOP INF SYST 2012. [DOI: 10.1142/s0218843003000747] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Karp PD, Paley SM, Krummenacker M, Latendresse M, Dale JM, Lee TJ, Kaipa P, Gilham F, Spaulding A, Popescu L, Altman T, Paulsen I, Keseler IM, Caspi R. Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology. Brief Bioinform 2009;11:40-79. [PMID: 19955237 DOI: 10.1093/bib/bbp043] [Citation(s) in RCA: 325] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open

Luciano JS, Stevens RD. e-Science and biological pathway semantics. BMC Bioinformatics 2007;8 Suppl 3:S3. [PMID: 17493286 PMCID: PMC1892100 DOI: 10.1186/1471-2105-8-s3-s3] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Garcia Castro A, Chen YPP, Ragan MA. Information integration in molecular bioscience. ACTA ACUST UNITED AC 2006;4:157-73. [PMID: 16231958 DOI: 10.2165/00822942-200504030-00001] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]

Shannon PT, Reiss DJ, Bonneau R, Baliga NS. The Gaggle: an open-source software system for integrating bioinformatics software and data sources. BMC Bioinformatics 2006;7:176. [PMID: 16569235 PMCID: PMC1464137 DOI: 10.1186/1471-2105-7-176] [Citation(s) in RCA: 122] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2005] [Accepted: 03/28/2006] [Indexed: 01/16/2023] Open

Abstract

Background

Systems biologists work with many kinds of data, from many different sources, using a variety of software tools. Each of these tools typically excels at one type of analysis, such as of microarrays, of metabolic networks and of predicted protein structure. A crucial challenge is to combine the capabilities of these (and other forthcoming) data resources and tools to create a data exploration and analysis environment that does justice to the variety and complexity of systems biology data sets. A solution to this problem should recognize that data types, formats and software in this high throughput age of biology are constantly changing.

Results

In this paper we describe the Gaggle -a simple, open-source Java software environment that helps to solve the problem of software and database integration. Guided by the classic software engineering strategy of separation of concerns and a policy of semantic flexibility, it integrates existing popular programs and web resources into a user-friendly, easily-extended environment.

We demonstrate that four simple data types (names, matrices, networks, and associative arrays) are sufficient to bring together diverse databases and software. We highlight some capabilities of the Gaggle with an exploration of Helicobacter pylori pathogenesis genes, in which we identify a putative ricin-like protein -a discovery made possible by simultaneous data exploration using a wide range of publicly available data and a variety of popular bioinformatics software tools.

Conclusion

We have integrated diverse databases (for example, KEGG, BioCyc, String) and software (Cytoscape, DataMatrixViewer, R statistical environment, and TIGR Microarray Expression Viewer). Through this loose coupling of diverse software and databases the Gaggle enables simultaneous exploration of experimental data (mRNA and protein abundance, protein-protein and protein-DNA interactions), functional associations (operon, chromosomal proximity, phylogenetic pattern), metabolic pathways (KEGG) and Pubmed abstracts (STRING web resource), creating an exploratory environment useful to 'web browser and spreadsheet biologists', to statistically savvy computational biologists, and those in between. The Gaggle uses Java RMI and Java Web Start technologies and can be found at .

Collapse

Wiback SJ, Mahadevan R, Palsson BØ. Reconstructing metabolic flux vectors from extreme pathways: defining the alpha-spectrum. J Theor Biol 2003;224:313-24. [PMID: 12941590 DOI: 10.1016/s0022-5193(03)00168-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Abstract

The move towards genome-scale analysis of cellular functions has necessitated the development of analytical (in silico) methods to understand such large and complex biochemical reaction networks. One such method is extreme pathway analysis that uses stoichiometry and thermodynamic irreversibly to define mathematically unique, systemic metabolic pathways. These extreme pathways form the edges of a high-dimensional convex cone in the flux space that contains all the attainable steady state solutions, or flux distributions, for the metabolic network. By definition, any steady state flux distribution can be described as a nonnegative linear combination of the extreme pathways. To date, much effort has been focused on calculating, defining, and understanding these extreme pathways. However, little work has been performed to determine how these extreme pathways contribute to a given steady state flux distribution. This study represents an initial effort aimed at defining how physiological steady state solutions can be reconstructed from a network's extreme pathways. In general, there is not a unique set of nonnegative weightings on the extreme pathways that produce a given steady state flux distribution but rather a range of possible values. This range can be determined using linear optimization to maximize and minimize the weightings of a particular extreme pathway in the reconstruction, resulting in what we have termed the alpha-spectrum. The alpha-spectrum defines which extreme pathways can and cannot be included in the reconstruction of a given steady state flux distribution and to what extent they individually contribute to the reconstruction. It is shown that accounting for transcriptional regulatory constraints can considerably shrink the alpha-spectrum. The alpha-spectrum is computed and interpreted for two cases; first, optimal states of a skeleton representation of core metabolism that include transcriptional regulation, and second for human red blood cell metabolism under various physiological, non-optimal conditions.

Collapse

van Helden J, Wernisch L, Gilbert D, Wodak SJ. Graph-based analysis of metabolic networks. ERNST SCHERING RESEARCH FOUNDATION WORKSHOP 2002:245-74. [PMID: 12061005 DOI: 10.1007/978-3-662-04747-7_12] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/11/2023]

Médigue C, Bocs S, Labarre L, Mathé C, Vallenet D. L’annotationin silicodes séquences génomiques. Med Sci (Paris) 2002. [DOI: 10.1051/medsci/2002182237] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Karp PD, Riley M, Saier M, Paulsen IT, Collado-Vides J, Paley SM, Pellegrini-Toole A, Bonavides C, Gama-Castro S. The EcoCyc Database. Nucleic Acids Res 2002;30:56-8. [PMID: 11752253 PMCID: PMC99147 DOI: 10.1093/nar/30.1.56] [Citation(s) in RCA: 245] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Koike T, Rzhetsky A. A graphic editor for analyzing signal-transduction pathways. Gene 2000;259:235-44. [PMID: 11163981 DOI: 10.1016/s0378-1119(00)00458-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Tsoka S, Ouzounis CA. Recent developments and future directions in computational genomics. FEBS Lett 2000;480:42-8. [PMID: 10967327 DOI: 10.1016/s0014-5793(00)01776-2] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Häring D, Kypr J. Escherichia coli genome is composed of two distinct types of nucleotide sequences. Biochem Biophys Res Commun 2000;272:571-5. [PMID: 10833453 DOI: 10.1006/bbrc.2000.2825] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Karp PD, Riley M, Saier M, Paulsen IT, Paley SM, Pellegrini-Toole A. The EcoCyc and MetaCyc databases. Nucleic Acids Res 2000;28:56-9. [PMID: 10592180 PMCID: PMC102475 DOI: 10.1093/nar/28.1.56] [Citation(s) in RCA: 134] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/1999] [Revised: 10/15/1999] [Accepted: 10/15/1999] [Indexed: 11/13/2022] Open

Karp PD, Krummenacker M, Paley S, Wagg J. Integrated pathway-genome databases and their role in drug discovery. Trends Biotechnol 1999;17:275-81. [PMID: 10370234 DOI: 10.1016/s0167-7799(99)01316-5] [Citation(s) in RCA: 113] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Karp PD, Riley M, Paley SM, Pellegrini-Toole A, Krummenacker M. Eco Cyc: encyclopedia of Escherichia coli genes and metabolism. Nucleic Acids Res 1999;27:55-8. [PMID: 9847140 PMCID: PMC148095 DOI: 10.1093/nar/27.1.55] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Paulsen IT, Sliwinski MK, Saier MH. Microbial genome analyses: global comparisons of transport capabilities based on phylogenies, bioenergetics and substrate specificities. J Mol Biol 1998;277:573-92. [PMID: 9533881 DOI: 10.1006/jmbi.1998.1609] [Citation(s) in RCA: 210] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

We have conducted genome sequence analyses of seven prokaryotic microorganisms for which completely sequenced genomes are available (Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Bacillus subtilis, Mycoplasma genitalium, Synechocystis PCC6803 and Methanococcus jannaschii). We report the distribution of encoded known and putative polytopic cytoplasmic membrane transport proteins within these genomes. Transport systems for each organism were classified according to (1) putative membrane topology, (2) protein family, (3) bioenergetics, and (4) substrate specificities. The overall transport capabilities of each organism were thereby estimated. Probable function was assigned to greater than 90% of the putative transport proteins identified. The results show the following: (1) Numbers of transport systems in eubacteria are approximately proportional to genome size and correspond to 9.7 to 10.8% of the total encoded genes except for H. pylori (5.4%), Synechocystis (4.7%) and M. jannaschii (3.5%) which exhibit substantially lower proportions. (2) The distribution of topological types is similar in all seven organisms. (3) Transport systems belonging to 67 families were identified within the genomes of these organisms, and about half of these families are also found in eukaryotes. (4) 12% of these families are found exclusively in Gram-negative bacteria, but none is found exclusively in Gram-positive bacteria, cyanobacteria or archaea. (5) Two superfamilies, the ATP-binding cassette (ABC) and major facilitator (MF) superfamilies account for nearly 50% of all transporters in each organism, but the relative representation of these two transporter types varies over a tenfold range, depending on the organism. (6) Secondary, pmf-dependent carriers are 1.5 to threefold more prevalent than primary ATP-dependent carriers in E. coli, H. influenzae, H. pylori and B. subtilis while primary carriers are about twofold more prevalent in M. genitalium and Synechocystis. M. jannaschii exhibits a slight preference for secondary carriers. (7) Bioenergetics of transport generally correlate with the primary forms of energy generated via available metabolic pathways but ecological niche and substrate availability may also be determining factors. (8) All organisms display a similar range of transport specificities with quantitative differences presumably reflective of disparate ecological niches. (9) M. jannaschii and Synechocystis have a two to threefold increased proportion of transporters for inorganic ions with a concomitant decrease in transporters for organic compounds. (10) 6 to 18% of all transporters in these bacteria probably function as drug export systems showing that these systems are prevalent in non-pathogenic as well as pathogenic organisms. (11) All seven prokaryotes examined encode proteins homologous to known channel proteins, but none of the channel types identified occurs in all of these organisms. (12) The phosphoenolpyruvate:sugar phosphotransferase system is prevalent in the large genome organisms, E. coli and B. subtilis, and is present in the small genome organisms, H. influenzae and M. genitalium, but is totally lacking in H. pylori, Synechocystis and M. jannaschii. Details of the information summarized in this article are available on our web sites, and this information will be periodically updated and corrected as new sequence and biochemical data become available.

Collapse

Karp PD. Metabolic databases. Trends Biochem Sci 1998;23:114-6. [PMID: 9581504 DOI: 10.1016/s0968-0004(98)01184-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Karp PD, Riley M, Paley SM, Pellegrini-Toole A, Krummenacker M. EcoCyc: Encyclopedia of Escherichia coli genes and metabolism. Nucleic Acids Res 1998;26:50-3. [PMID: 9399798 PMCID: PMC147256 DOI: 10.1093/nar/26.1.50] [Citation(s) in RCA: 42] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open

Karp PD, Riley M, Paley SM, Pellegrini-Toole A, Krummenacker M. EcoCyc: Enyclopedia of Escherichia coli Genes and Metabolism. Nucleic Acids Res 1997;25:43-51. [PMID: 9016502 PMCID: PMC146379 DOI: 10.1093/nar/25.1.43] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open

Karp PD. Database links are a foundation for interoperability. Trends Biotechnol 1996;14:273-9. [PMID: 8987457 DOI: 10.1016/0167-7799(96)10044-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]