351
|
Knoop H, Zilliges Y, Lockau W, Steuer R. The metabolic network of Synechocystis sp. PCC 6803: systemic properties of autotrophic growth. PLANT PHYSIOLOGY 2010; 154:410-22. [PMID: 20616194 PMCID: PMC2938163 DOI: 10.1104/pp.110.157198] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2010] [Accepted: 07/07/2010] [Indexed: 05/17/2023]
Abstract
Unicellular cyanobacteria have attracted growing attention as potential host organisms for the production of valuable organic products and provide an ideal model to understand oxygenic photosynthesis and phototrophic metabolism. To obtain insight into the functional properties of phototrophic growth, we present a detailed reconstruction of the primary metabolic network of the autotrophic prokaryote Synechocystis sp. PCC 6803. The reconstruction is based on multiple data sources and extensive manual curation and significantly extends currently available repositories of cyanobacterial metabolism. A systematic functional analysis, utilizing the framework of flux-balance analysis, allows the prediction of essential metabolic pathways and reactions and allows the identification of inconsistencies in the current annotation. As a counterintuitive result, our computational model indicates that photorespiration is beneficial to achieve optimal growth rates. The reconstruction process highlights several obstacles currently encountered in the context of large-scale reconstructions of metabolic networks.
Collapse
|
352
|
Dunn WB, Broadhurst DI, Atherton HJ, Goodacre R, Griffin JL. Systems level studies of mammalian metabolomes: the roles of mass spectrometry and nuclear magnetic resonance spectroscopy. Chem Soc Rev 2010; 40:387-426. [PMID: 20717559 DOI: 10.1039/b906712b] [Citation(s) in RCA: 557] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The study of biological systems in a holistic manner (systems biology) is increasingly being viewed as a necessity to provide qualitative and quantitative descriptions of the emergent properties of the complete system. Systems biology performs studies focussed on the complex interactions of system components; emphasising the whole system rather than the individual parts. Many perturbations to mammalian systems (diet, disease, drugs) are multi-factorial and the study of small parts of the system is insufficient to understand the complete phenotypic changes induced. Metabolomics is one functional level tool being employed to investigate the complex interactions of metabolites with other metabolites (metabolism) but also the regulatory role metabolites provide through interaction with genes, transcripts and proteins (e.g. allosteric regulation). Technological developments are the driving force behind advances in scientific knowledge. Recent advances in the two analytical platforms of mass spectrometry (MS) and nuclear magnetic resonance (NMR) spectroscopy have driven forward the discipline of metabolomics. In this critical review, an introduction to metabolites, metabolomes, metabolomics and the role of MS and NMR spectroscopy will be provided. The applications of metabolomics in mammalian systems biology for the study of the health-disease continuum, drug efficacy and toxicity and dietary effects on mammalian health will be reviewed. The current limitations and future goals of metabolomics in systems biology will also be discussed (374 references).
Collapse
Affiliation(s)
- Warwick B Dunn
- Manchester Centre for Integrative Systems Biology, University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK.
| | | | | | | | | |
Collapse
|
353
|
Radrich K, Tsuruoka Y, Dobson P, Gevorgyan A, Swainston N, Baart G, Schwartz JM. Integration of metabolic databases for the reconstruction of genome-scale metabolic networks. BMC SYSTEMS BIOLOGY 2010; 4:114. [PMID: 20712863 PMCID: PMC2930596 DOI: 10.1186/1752-0509-4-114] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/27/2010] [Accepted: 08/16/2010] [Indexed: 01/13/2023]
Abstract
BACKGROUND Genome-scale metabolic reconstructions have been recognised as a valuable tool for a variety of applications ranging from metabolic engineering to evolutionary studies. However, the reconstruction of such networks remains an arduous process requiring a high level of human intervention. This process is further complicated by occurrences of missing or conflicting information and the absence of common annotation standards between different data sources. RESULTS In this article, we report a semi-automated methodology aimed at streamlining the process of metabolic network reconstruction by enabling the integration of different genome-wide databases of metabolic reactions. We present results obtained by applying this methodology to the metabolic network of the plant Arabidopsis thaliana. A systematic comparison of compounds and reactions between two genome-wide databases allowed us to obtain a high-quality core consensus reconstruction, which was validated for stoichiometric consistency. A lower level of consensus led to a larger reconstruction, which has a lower quality standard but provides a baseline for further manual curation. CONCLUSION This semi-automated methodology may be applied to other organisms and help to streamline the process of genome-scale network reconstruction in order to accelerate the transfer of such models to applications.
Collapse
Affiliation(s)
- Karin Radrich
- Faculty of Life Sciences, University of Manchester, Manchester M13 9PT, UK
| | | | | | | | | | | | | |
Collapse
|
354
|
Kritz MV, Trindade Dos Santos M, Urrutia S, Schwartz JM. Organising metabolic networks: Cycles in flux distributions. J Theor Biol 2010; 265:250-60. [PMID: 20435049 DOI: 10.1016/j.jtbi.2010.04.026] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2009] [Revised: 02/04/2010] [Accepted: 04/24/2010] [Indexed: 11/26/2022]
Abstract
Metabolic networks are among the most widely studied biological systems. The topology and interconnections of metabolic reactions have been well described for many species. This is, however, not sufficient to understand how their activity is regulated in living organisms. These descriptions depict a static set of possible chains of reactions, with no information about the dynamic activity of reaction fluxes. Cyclic structures are thought to play a central role in the homeostasis of biological systems and in their resilience to a changing environment. In this work, we present a methodology to help investigating dynamic fluxes associated to biochemical reactions in metabolic networks. We introduce an algorithm for partitioning fluxes between cyclic and acyclic sub-networks, adapted from an algorithm initially developed to study fluxes in trophic networks. Using this algorithm, we analyse three metabolic systems: the central metabolism of wild type and a deletion mutant of Escherichia coli, erythrocyte metabolism and the central metabolism of the bacterium Methylobacterium extorquens. This methodology unveils the role of cycles in driving and maintaining metabolic fluxes under perturbations in these examples, and may be used to further investigate and understand the organisational invariance of biological systems.
Collapse
|
355
|
Swainston N, Golebiewski M, Messiha HL, Malys N, Kania R, Kengne S, Krebs O, Mir S, Sauer-Danzwith H, Smallbone K, Weidemann A, Wittig U, Kell DB, Mendes P, Müller W, Paton NW, Rojas I. Enzyme kinetics informatics: from instrument to browser. FEBS J 2010; 277:3769-79. [PMID: 20738395 DOI: 10.1111/j.1742-4658.2010.07778.x] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A limited number of publicly available resources provide access to enzyme kinetic parameters. These have been compiled through manual data mining of published papers, not from the original, raw experimental data from which the parameters were calculated. This is largely due to the lack of software or standards to support the capture, analysis, storage and dissemination of such experimental data. Introduced here is an integrative system to manage experimental enzyme kinetics data from instrument to browser. The approach is based on two interrelated databases: the existing SABIO-RK database, containing kinetic data and corresponding metadata, and the newly introduced experimental raw data repository, MeMo-RK. Both systems are publicly available by web browser and web service interfaces and are configurable to ensure privacy of unpublished data. Users of this system are provided with the ability to view both kinetic parameters and the experimental raw data from which they are calculated, providing increased confidence in the data. A data analysis and submission tool, the kineticswizard, has been developed to allow the experimentalist to perform data collection, analysis and submission to both data resources. The system is designed to be extensible, allowing integration with other manufacturer instruments covering a range of analytical techniques.
Collapse
Affiliation(s)
- Neil Swainston
- Manchester Centre for Integrative Systems Biology, University of Manchester, Manchester, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
356
|
|
357
|
Edelman LB, Eddy JA, Price ND. In silico models of cancer. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2010; 2:438-459. [PMID: 20836040 PMCID: PMC3157287 DOI: 10.1002/wsbm.75] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Cancer is a complex disease that involves multiple types of biological interactions across diverse physical, temporal, and biological scales. This complexity presents substantial challenges for the characterization of cancer biology, and motivates the study of cancer in the context of molecular, cellular, and physiological systems. Computational models of cancer are being developed to aid both biological discovery and clinical medicine. The development of these in silico models is facilitated by rapidly advancing experimental and analytical tools that generate information-rich, high-throughput biological data. Statistical models of cancer at the genomic, transcriptomic, and pathway levels have proven effective in developing diagnostic and prognostic molecular signatures, as well as in identifying perturbed pathways. Statistically inferred network models can prove useful in settings where data overfitting can be avoided, and provide an important means for biological discovery. Mechanistically based signaling and metabolic models that apply a priori knowledge of biochemical processes derived from experiments can also be reconstructed where data are available, and can provide insight and predictive ability regarding the behavior of these systems. At longer length scales, continuum and agent-based models of the tumor microenvironment and other tissue-level interactions enable modeling of cancer cell populations and tumor progression. Even though cancer has been among the most-studied human diseases using systems approaches, significant challenges remain before the enormous potential of in silico cancer biology can be fully realized.
Collapse
Affiliation(s)
- Lucas B. Edelman
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - James A. Eddy
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - Nathan D. Price
- Department of Chemical and Biomolecular Engineering, Institute for Genomic Biology, Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign
| |
Collapse
|
358
|
Logan JA, Kelly ME, Ayers D, Shipillis N, Baier G, Day PJR. Systems biology and modeling in neuroblastoma: practicalities and perspectives. Expert Rev Mol Diagn 2010; 10:131-45. [PMID: 20214533 DOI: 10.1586/erm.10.4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Neuroblastoma (NB) is a common pediatric malignancy characterized by clinical and biological heterogeneity. A host of prognostic markers are available, contributing to accurate risk stratification and appropriate treatment allocation. Unfortunately, outcome is still poor for many patients, indicating the need for a new approach with enhanced utilization of the available biological data. Systems biology is a holistic approach in which all components of a biological system carry equal importance. Systems biology uses mathematical modeling and simulation to investigate dynamic interactions between system components, as a means of explaining overall system behavior. Systems biology can benefit the biomedical sciences by providing a more complete understanding of human disease, enhancing the development of targeted therapeutics. Systems biology is largely contiguous with current approaches in NB, which already employ an integrative and pseudo-holistic approach to disease management. Systems modeling of NB offers an optimal method for continuing progression in this field, and conferring additional benefit to current risk stratification and management. Likewise, NB provides an opportunity for systems biology to prove its utility in the context of human disease, since the biology of NB is comprehensively characterized and, therefore, suited to modeling. The purpose of this review is to outline the benefits, challenges and fundamental workings of systems modeling in human disease, using a specific example of bottom-up modeling in NB. The intention is to demonstrate practical requirements to begin bridging the gap between biological research and applied mathematical approaches for the mutual gain of both fields, and with additional benefits for clinical management.
Collapse
Affiliation(s)
- Jennifer A Logan
- Quantitative Molecular Medicine, Faculty of Medicine and Health Sciences, The Manchester Interdisciplinary Biocentre, University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK
| | | | | | | | | | | |
Collapse
|
359
|
Abstract
The advent of high throughput genome-scale bioinformatics has led to an exponential increase in available cellular system data. Systems metabolic engineering attempts to use data-driven approaches--based on the data collected with high throughput technologies--to identify gene targets and optimize phenotypical properties on a systems level. Current systems metabolic engineering tools are limited for predicting and defining complex phenotypes such as chemical tolerances and other global, multigenic traits. The most pragmatic systems-based tool for metabolic engineering to arise is the in silico genome-scale metabolic reconstruction. This tool has seen wide adoption for modeling cell growth and predicting beneficial gene knockouts, and we examine here how this approach can be expanded for novel organisms. This review will highlight advances of the systems metabolic engineering approach with a focus on de novo development and use of genome-scale metabolic reconstructions for metabolic engineering applications. We will then discuss the challenges and prospects for this emerging field to enable model-based metabolic engineering. Specifically, we argue that current state-of-the-art systems metabolic engineering techniques represent a viable first step for improving product yield that still must be followed by combinatorial techniques or random strain mutagenesis to achieve optimal cellular systems.
Collapse
Affiliation(s)
- John Blazeck
- Department of Chemical Engineering, The University of Texas at Austin, 1 University Station, Austin, TX 78712, USA
| | | |
Collapse
|
360
|
Smith GR, Shanley DP. Modelling the response of FOXO transcription factors to multiple post-translational modifications made by ageing-related signalling pathways. PLoS One 2010; 5:e11092. [PMID: 20567500 PMCID: PMC2886341 DOI: 10.1371/journal.pone.0011092] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2010] [Accepted: 05/01/2010] [Indexed: 01/10/2023] Open
Abstract
FOXO transcription factors are an important, conserved family of regulators of cellular processes including metabolism, cell-cycle progression, apoptosis and stress resistance. They are required for the efficacy of several of the genetic interventions that modulate lifespan. FOXO activity is regulated by multiple post-translational modifications (PTMs) that affect its subcellular localization, half-life, DNA binding and transcriptional activity. Here, we show how a mathematical modelling approach can be used to simulate the effects, singly and in combination, of these PTMs. Our model is implemented using the Systems Biology Markup Language (SBML), generated by an ancillary program and simulated in a stochastic framework. The use of the ancillary program to generate the SBML is necessary because the possibility that many regulatory PTMs may be added, each independently of the others, means that a large number of chemically distinct forms of the FOXO molecule must be taken into account, and the program is used to generate them. Although the model does not yet include detailed representations of events upstream and downstream of FOXO, we show how it can qualitatively, and in some cases quantitatively, reproduce the known effects of certain treatments that induce various single and multiple PTMs, and allows for a complex spatiotemporal interplay of effects due to the activation of multiple PTM-inducing treatments. Thus, it provides an important framework to integrate current knowledge about the behaviour of FOXO. The approach should be generally applicable to other proteins experiencing multiple regulations.
Collapse
Affiliation(s)
- Graham R. Smith
- Henry Wellcome Laboratory for Biogerontology, Institute for Ageing and Health, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Daryl P. Shanley
- Henry Wellcome Laboratory for Biogerontology, Institute for Ageing and Health, Newcastle University, Newcastle upon Tyne, United Kingdom
- * E-mail:
| |
Collapse
|
361
|
Ananiadou S, Pyysalo S, Tsujii J, Kell DB. Event extraction for systems biology by text mining the literature. Trends Biotechnol 2010; 28:381-90. [PMID: 20570001 DOI: 10.1016/j.tibtech.2010.04.005] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2010] [Revised: 04/20/2010] [Accepted: 04/26/2010] [Indexed: 01/08/2023]
Abstract
Systems biology recognizes in particular the importance of interactions between biological components and the consequences of these interactions. Such interactions and their downstream effects are known as events. To computationally mine the literature for such events, text mining methods that can detect, extract and annotate them are required. This review summarizes the methods that are currently available, with a specific focus on protein-protein interactions and pathway or network reconstruction. The approaches described will be of considerable value in associating particular pathways and their components with higher-order physiological properties, including disease states.
Collapse
|
362
|
Reconstruction of the yeast protein-protein interaction network involved in nutrient sensing and global metabolic regulation. BMC SYSTEMS BIOLOGY 2010; 4:68. [PMID: 20500839 PMCID: PMC2889877 DOI: 10.1186/1752-0509-4-68] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2009] [Accepted: 05/25/2010] [Indexed: 02/06/2023]
Abstract
Background Several protein-protein interaction studies have been performed for the yeast Saccharomyces cerevisiae using different high-throughput experimental techniques. All these results are collected in the BioGRID database and the SGD database provide detailed annotation of the different proteins. Despite the value of BioGRID for studying protein-protein interactions, there is a need for manual curation of these interactions in order to remove false positives. Results Here we describe an annotated reconstruction of the protein-protein interactions around four key nutrient-sensing and metabolic regulatory signal transduction pathways (STP) operating in Saccharomyces cerevisiae. The reconstructed STP network includes a full protein-protein interaction network including the key nodes Snf1, Tor1, Hog1 and Pka1. The network includes a total of 623 structural open reading frames (ORFs) and 779 protein-protein interactions. A number of proteins were identified having interactions with more than one of the protein kinases. The fully reconstructed interaction network includes all the information available in separate databases for all the proteins included in the network (nodes) and for all the interactions between them (edges). The annotated information is readily available utilizing the functionalities of network modelling tools such as Cytoscape and CellDesigner. Conclusions The reported fully annotated interaction model serves as a platform for integrated systems biology studies of nutrient sensing and regulation in S. cerevisiae. Furthermore, we propose this annotated reconstruction as a first step towards generation of an extensive annotated protein-protein interaction network of signal transduction and metabolic regulation in this yeast.
Collapse
|
363
|
Gutteridge A, Pir P, Castrillo JI, Charles PD, Lilley KS, Oliver SG. Nutrient control of eukaryote cell growth: a systems biology study in yeast. BMC Biol 2010; 8:68. [PMID: 20497545 PMCID: PMC2895586 DOI: 10.1186/1741-7007-8-68] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2010] [Accepted: 05/24/2010] [Indexed: 01/21/2023] Open
Abstract
Background To elucidate the biological processes affected by changes in growth rate and nutrient availability, we have performed a comprehensive analysis of the transcriptome, proteome and metabolome responses of chemostat cultures of the yeast, Saccharomyces cerevisiae, growing at a range of growth rates and in four different nutrient-limiting conditions. Results We find significant changes in expression for many genes in each of the four nutrient-limited conditions tested. We also observe several processes that respond differently to changes in growth rate and are specific to each nutrient-limiting condition. These include carbohydrate storage, mitochondrial function, ribosome synthesis, and phosphate transport. Integrating transcriptome data with proteome measurements allows us to identify previously unrecognized examples of post-transcriptional regulation in response to both nutrient and growth-rate signals. Conclusions Our results emphasize the unique properties of carbon metabolism and the carbon substrate, the limitation of which induces significant changes in gene regulation at the transcriptional and post-transcriptional level, as well as altering how many genes respond to growth rate. By comparison, the responses to growth limitation by other nutrients involve a smaller set of genes that participate in specific pathways. See associated commentary http://www.biomedcentral.com/1741-7007/8/62
Collapse
Affiliation(s)
- Alex Gutteridge
- Cambridge Systems Biology Centre & Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | | | | | | | | | | |
Collapse
|
364
|
Cvijovic M, Olivares-Hernández R, Agren R, Dahr N, Vongsangnak W, Nookaew I, Patil KR, Nielsen J. BioMet Toolbox: genome-wide analysis of metabolism. Nucleic Acids Res 2010; 38:W144-9. [PMID: 20483918 PMCID: PMC2896146 DOI: 10.1093/nar/gkq404] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The rapid progress of molecular biology tools for directed genetic modifications, accurate quantitative experimental approaches, high-throughput measurements, together with development of genome sequencing has made the foundation for a new area of metabolic engineering that is driven by metabolic models. Systematic analysis of biological processes by means of modelling and simulations has made the identification of metabolic networks and prediction of metabolic capabilities under different conditions possible. For facilitating such systemic analysis, we have developed the BioMet Toolbox, a web-based resource for stoichiometric analysis and for integration of transcriptome and interactome data, thereby exploiting the capabilities of genome-scale metabolic models. The BioMet Toolbox provides an effective user-friendly way to perform linear programming simulations towards maximized or minimized growth rates, substrate uptake rates and metabolic production rates by detecting relevant fluxes, simulate single and double gene deletions or detect metabolites around which major transcriptional changes are concentrated. These tools can be used for high-throughput in silico screening and allows fully standardized simulations. Model files for various model organisms (fungi and bacteria) are included. Overall, the BioMet Toolbox serves as a valuable resource for exploring the capabilities of these metabolic networks. BioMet Toolbox is freely available at www.sysbio.se/BioMet/.
Collapse
Affiliation(s)
- Marija Cvijovic
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Göteborg, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
365
|
Aho T, Almusa H, Matilainen J, Larjo A, Ruusuvuori P, Aho KL, Wilhelm T, Lähdesmäki H, Beyer A, Harju M, Chowdhury S, Leinonen K, Roos C, Yli-Harja O. Reconstruction and validation of RefRec: a global model for the yeast molecular interaction network. PLoS One 2010; 5:e10662. [PMID: 20498836 PMCID: PMC2871048 DOI: 10.1371/journal.pone.0010662] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2009] [Accepted: 04/15/2010] [Indexed: 11/26/2022] Open
Abstract
Molecular interaction networks establish all cell biological processes. The networks are under intensive research that is facilitated by new high-throughput measurement techniques for the detection, quantification, and characterization of molecules and their physical interactions. For the common model organism yeast Saccharomyces cerevisiae, public databases store a significant part of the accumulated information and, on the way to better understanding of the cellular processes, there is a need to integrate this information into a consistent reconstruction of the molecular interaction network. This work presents and validates RefRec, the most comprehensive molecular interaction network reconstruction currently available for yeast. The reconstruction integrates protein synthesis pathways, a metabolic network, and a protein-protein interaction network from major biological databases. The core of the reconstruction is based on a reference object approach in which genes, transcripts, and proteins are identified using their primary sequences. This enables their unambiguous identification and non-redundant integration. The obtained total number of different molecular species and their connecting interactions is approximately 67,000. In order to demonstrate the capacity of RefRec for functional predictions, it was used for simulating the gene knockout damage propagation in the molecular interaction network in approximately 590,000 experimentally validated mutant strains. Based on the simulation results, a statistical classifier was subsequently able to correctly predict the viability of most of the strains. The results also showed that the usage of different types of molecular species in the reconstruction is important for accurate phenotype prediction. In general, the findings demonstrate the benefits of global reconstructions of molecular interaction networks. With all the molecular species and their physical interactions explicitly modeled, our reconstruction is able to serve as a valuable resource in additional analyses involving objects from multiple molecular -omes. For that purpose, RefRec is freely available in the Systems Biology Markup Language format.
Collapse
Affiliation(s)
- Tommi Aho
- Department of Signal Processing, Tampere University of Technology, Tampere, Finland.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
366
|
Schellenberger J, Park JO, Conrad TM, Palsson BØ. BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics 2010; 11:213. [PMID: 20426874 PMCID: PMC2874806 DOI: 10.1186/1471-2105-11-213] [Citation(s) in RCA: 362] [Impact Index Per Article: 25.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2009] [Accepted: 04/29/2010] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Genome-scale metabolic reconstructions under the Constraint Based Reconstruction and Analysis (COBRA) framework are valuable tools for analyzing the metabolic capabilities of organisms and interpreting experimental data. As the number of such reconstructions and analysis methods increases, there is a greater need for data uniformity and ease of distribution and use. DESCRIPTION We describe BiGG, a knowledgebase of Biochemically, Genetically and Genomically structured genome-scale metabolic network reconstructions. BiGG integrates several published genome-scale metabolic networks into one resource with standard nomenclature which allows components to be compared across different organisms. BiGG can be used to browse model content, visualize metabolic pathway maps, and export SBML files of the models for further analysis by external software packages. Users may follow links from BiGG to several external databases to obtain additional information on genes, proteins, reactions, metabolites and citations of interest. CONCLUSIONS BiGG addresses a need in the systems biology community to have access to high quality curated metabolic models and reconstructions. It is freely available for academic use at http://bigg.ucsd.edu.
Collapse
Affiliation(s)
- Jan Schellenberger
- Bioinformatics Program, University of California San Diego, La Jolla, California, 92093-0419, USA
| | | | | | | |
Collapse
|
367
|
Milne CB, Kim PJ, Eddy JA, Price ND. Accomplishments in genome-scale in silico modeling for industrial and medical biotechnology. Biotechnol J 2010; 4:1653-70. [PMID: 19946878 DOI: 10.1002/biot.200900234] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Driven by advancements in high-throughput biological technologies and the growing number of sequenced genomes, the construction of in silico models at the genome scale has provided powerful tools to investigate a vast array of biological systems and applications. Here, we review comprehensively the uses of such models in industrial and medical biotechnology, including biofuel generation, food production, and drug development. While the use of in silico models is still in its early stages for delivering to industry, significant initial successes have been achieved. For the cases presented here, genome-scale models predict engineering strategies to enhance properties of interest in an organism or to inhibit harmful mechanisms of pathogens. Going forward, genome-scale in silico models promise to extend their application and analysis scope to become a trans-formative tool in biotechnology.
Collapse
Affiliation(s)
- Caroline B Milne
- Institute for Genomic Biology, University of Illinois, Urbana, IL, USA
| | | | | | | |
Collapse
|
368
|
|
369
|
Abstract
The chemical industry is currently undergoing a dramatic change driven by demand for developing more sustainable processes for the production of fuels, chemicals, and materials. In biotechnological processes different microorganisms can be exploited, and the large diversity of metabolic reactions represents a rich repository for the design of chemical conversion processes that lead to efficient production of desirable products. However, often microorganisms that produce a desirable product, either naturally or because they have been engineered through insertion of heterologous pathways, have low yields and productivities, and in order to establish an economically viable process it is necessary to improve the performance of the microorganism. Here metabolic engineering is the enabling technology. Through metabolic engineering the metabolic landscape of the microorganism is engineered such that there is an efficient conversion of the raw material, typically glucose, to the product of interest. This process may involve both insertion of new enzymes activities, deletion of existing enzyme activities, but often also deregulation of existing regulatory structures operating in the cell. In order to rapidly identify the optimal metabolic engineering strategy the industry is to an increasing extent looking into the use of tools from systems biology. This involves both x-ome technologies such as transcriptome, proteome, metabolome, and fluxome analysis, and advanced mathematical modeling tools such as genome-scale metabolic modeling. Here we look into the history of these different techniques and review how they find application in industrial biotechnology, which will lead to what we here define as industrial systems biology.
Collapse
Affiliation(s)
- José Manuel Otero
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Göteborg, Sweden
| | | |
Collapse
|
370
|
Dada JO, Spasić I, Paton NW, Mendes P. SBRML: a markup language for associating systems biology data with models. ACTA ACUST UNITED AC 2010; 26:932-8. [PMID: 20176582 DOI: 10.1093/bioinformatics/btq069] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Research in systems biology is carried out through a combination of experiments and models. Several data standards have been adopted for representing models (Systems Biology Markup Language) and various types of relevant experimental data (such as FuGE and those of the Proteomics Standards Initiative). However, until now, there has been no standard way to associate a model and its entities to the corresponding datasets, or vice versa. Such a standard would provide a means to represent computational simulation results as well as to frame experimental data in the context of a particular model. Target applications include model-driven data analysis, parameter estimation, and sharing and archiving model simulations. RESULTS We propose the Systems Biology Results Markup Language (SBRML), an XML-based language that associates a model with several datasets. Each dataset is represented as a series of values associated with model variables, and their corresponding parameter values. SBRML provides a flexible way of indexing the results to model parameter values, which supports both spreadsheet-like data and multidimensional data cubes. We present and discuss several examples of SBRML usage in applications such as enzyme kinetics, microarray gene expression and various types of simulation results. AVAILABILITY AND IMPLEMENTATION The XML Schema file for SBRML is available at http://www.comp-sys-bio.org/SBRML under the Academic Free License (AFL) v3.0.
Collapse
Affiliation(s)
- Joseph O Dada
- Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, The University of Manchester, 131 Princess Street, Manchester M1 7DN, UK
| | | | | | | |
Collapse
|
371
|
Kim TY, Kim HU, Lee SY. Data integration and analysis of biological networks. Curr Opin Biotechnol 2010; 21:78-84. [PMID: 20138751 DOI: 10.1016/j.copbio.2010.01.003] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2009] [Accepted: 01/14/2010] [Indexed: 12/22/2022]
Abstract
During the past decade, bottom-up and top-down approaches of network reconstruction have greatly facilitated integration and analysis of biological networks, including transcriptional, protein interaction, and metabolic networks. As increasing amounts of multidimensional high-throughput data become available, biological networks have also been upgraded, allowing more accurate understanding of whole cellular characteristics. The network size is constantly expanding as larger volume of information and omics data are further integrated into the biological networks previously built upon a single type of data. Such effort more recently led to the modeling of human metabolic network and prediction of its tissue-specific metabolism, reconstruction of consensus yeast metabolic network, and simulation of mutual interactions among multiple microorganisms. It is expected that this trend will continue, the outcomes of which will allow development of more sophisticated networks integrating diverse omics data, and enhance our understanding of biological systems.
Collapse
Affiliation(s)
- Tae Yong Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 305-701, Republic of Korea
| | | | | |
Collapse
|
372
|
Pitkänen E, Rousu J, Ukkonen E. Computational methods for metabolic reconstruction. Curr Opin Biotechnol 2010; 21:70-7. [DOI: 10.1016/j.copbio.2010.01.010] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2009] [Revised: 01/17/2010] [Accepted: 01/20/2010] [Indexed: 12/19/2022]
|
373
|
Smallbone K, Simeonidis E, Swainston N, Mendes P. Towards a genome-scale kinetic model of cellular metabolism. BMC SYSTEMS BIOLOGY 2010; 4:6. [PMID: 20109182 PMCID: PMC2829494 DOI: 10.1186/1752-0509-4-6] [Citation(s) in RCA: 120] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2009] [Accepted: 01/28/2010] [Indexed: 11/10/2022]
Abstract
Background Advances in bioinformatic techniques and analyses have led to the availability of genome-scale metabolic reconstructions. The size and complexity of such networks often means that their potential behaviour can only be analysed with constraint-based methods. Whilst requiring minimal experimental data, such methods are unable to give insight into cellular substrate concentrations. Instead, the long-term goal of systems biology is to use kinetic modelling to characterize fully the mechanics of each enzymatic reaction, and to combine such knowledge to predict system behaviour. Results We describe a method for building a parameterized genome-scale kinetic model of a metabolic network. Simplified linlog kinetics are used and the parameters are extracted from a kinetic model repository. We demonstrate our methodology by applying it to yeast metabolism. The resultant model has 956 metabolic reactions involving 820 metabolites, and, whilst approximative, has considerably broader remit than any existing models of its type. Control analysis is used to identify key steps within the system. Conclusions Our modelling framework may be considered a stepping-stone toward the long-term goal of a fully-parameterized model of yeast metabolism. The model is available in SBML format from the BioModels database (BioModels ID: MODEL1001200000) and at http://www.mcisb.org/resources/genomescale/.
Collapse
Affiliation(s)
- Kieran Smallbone
- Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, 131 Princess Street, Manchester, M1 7DN, UK
| | | | | | | |
Collapse
|
374
|
Nielsen J. Systems biology of lipid metabolism: from yeast to human. FEBS Lett 2010; 583:3905-13. [PMID: 19854183 DOI: 10.1016/j.febslet.2009.10.054] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2009] [Revised: 10/13/2009] [Accepted: 10/20/2009] [Indexed: 10/20/2022]
Abstract
Lipid metabolism is highly relevant as it plays a central role in a number of human diseases. Due to the highly interactive structure of lipid metabolism and its regulation, it is necessary to apply a holistic approach, and systems biology is therefore well suited for integrated analysis of lipid metabolism. In this paper it is demonstrated that the yeast Saccharomyces cerevisiae serves as an excellent model organism for studying the regulation of lipid metabolism in eukaryotes as most of the regulatory structures in this part of the metabolism are conserved between yeast and mammals. Hereby yeast systems biology can assist to improve our understanding of how lipid metabolism is regulated.
Collapse
Affiliation(s)
- Jens Nielsen
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.
| |
Collapse
|
375
|
Systems biology of industrial microorganisms. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2010; 120:51-99. [PMID: 20503029 DOI: 10.1007/10_2009_59] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
The field of industrial biotechnology is expanding rapidly as the chemical industry is looking towards more sustainable production of chemicals that can be used as fuels or building blocks for production of solvents and materials. In connection with the development of sustainable bioprocesses, it is a major challenge to design and develop efficient cell factories that can ensure cost efficient conversion of the raw material into the chemical of interest. This is achieved through metabolic engineering, where the metabolism of the cell factory is engineered such that there is an efficient conversion of sugars, the typical raw materials in the fermentation industry, into the desired product. However, engineering of cellular metabolism is often challenging due to the complex regulation that has evolved in connection with adaptation of the different microorganisms to their ecological niches. In order to map these regulatory structures and further de-regulate them, as well as identify ingenious metabolic engineering strategies that full-fill mass balance constraints, tools from systems biology can be applied. This involves both high-throughput analysis tools like transcriptome, proteome and metabolome analysis, as well as the use of mathematical modeling to simulate the phenotypes resulting from the different metabolic engineering strategies. It is in fact expected that systems biology may substantially improve the process of cell factory development, and we therefore propose the term Industrial Systems Biology for how systems biology will enhance the development of industrial biotechnology for sustainable chemical production.
Collapse
|
376
|
Computational analysis of phenotypic space in heterologous polyketide biosynthesis—Applications to Escherichia coli, Bacillus subtilis, and Saccharomyces cerevisiae. J Theor Biol 2010; 262:197-207. [DOI: 10.1016/j.jtbi.2009.10.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2008] [Revised: 10/05/2009] [Accepted: 10/06/2009] [Indexed: 11/21/2022]
|
377
|
Edelman LB, Eddy JA, Price ND. In silico models of cancer. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2010; 2. [PMID: 20836040 PMCID: PMC3157287 DOI: 10.1002/wsbm.75 10.1002/wsbm.75] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Cancer is a complex disease that involves multiple types of biological interactions across diverse physical, temporal, and biological scales. This complexity presents substantial challenges for the characterization of cancer biology, and motivates the study of cancer in the context of molecular, cellular, and physiological systems. Computational models of cancer are being developed to aid both biological discovery and clinical medicine. The development of these in silico models is facilitated by rapidly advancing experimental and analytical tools that generate information-rich, high-throughput biological data. Statistical models of cancer at the genomic, transcriptomic, and pathway levels have proven effective in developing diagnostic and prognostic molecular signatures, as well as in identifying perturbed pathways. Statistically inferred network models can prove useful in settings where data overfitting can be avoided, and provide an important means for biological discovery. Mechanistically based signaling and metabolic models that apply a priori knowledge of biochemical processes derived from experiments can also be reconstructed where data are available, and can provide insight and predictive ability regarding the behavior of these systems. At longer length scales, continuum and agent-based models of the tumor microenvironment and other tissue-level interactions enable modeling of cancer cell populations and tumor progression. Even though cancer has been among the most-studied human diseases using systems approaches, significant challenges remain before the enormous potential of in silico cancer biology can be fully realized.
Collapse
Affiliation(s)
- Lucas B. Edelman
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - James A. Eddy
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - Nathan D. Price
- Department of Chemical and Biomolecular Engineering, Institute for Genomic Biology, Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign
| |
Collapse
|
378
|
|
379
|
Attwood TK, Kell DB, McDermott P, Marsh J, Pettifer SR, Thorne D. Calling International Rescue: knowledge lost in literature and data landslide! Biochem J 2009; 424:317-33. [PMID: 19929850 PMCID: PMC2805925 DOI: 10.1042/bj20091474] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Accepted: 09/29/2009] [Indexed: 11/17/2022]
Abstract
We live in interesting times. Portents of impending catastrophe pervade the literature, calling us to action in the face of unmanageable volumes of scientific data. But it isn't so much data generation per se, but the systematic burial of the knowledge embodied in those data that poses the problem: there is so much information available that we simply no longer know what we know, and finding what we want is hard - too hard. The knowledge we seek is often fragmentary and disconnected, spread thinly across thousands of databases and millions of articles in thousands of journals. The intellectual energy required to search this array of data-archives, and the time and money this wastes, has led several researchers to challenge the methods by which we traditionally commit newly acquired facts and knowledge to the scientific record. We present some of these initiatives here - a whirlwind tour of recent projects to transform scholarly publishing paradigms, culminating in Utopia and the Semantic Biochemical Journal experiment. With their promises to provide new ways of interacting with the literature, and new and more powerful tools to access and extract the knowledge sequestered within it, we ask what advances they make and what obstacles to progress still exist? We explore these questions, and, as you read on, we invite you to engage in an experiment with us, a real-time test of a new technology to rescue data from the dormant pages of published documents. We ask you, please, to read the instructions carefully. The time has come: you may turn over your papers...
Collapse
Key Words
- dynamic document content
- interactive pdf
- linking documents with research data
- manuscript mark-up
- mark-up standards
- semantic publishing
- bj, biochemical journal
- cohse, conceptual open hypermedia services environment
- doi, digital object identifier
- go, gene ontology
- gpcr, g protein-coupled receptor
- html, hypertext mark-up language
- iupac, international union of pure and applied chemistry
- ntd, neglected tropical diseases
- obo, open biomedical ontologies
- pdb, protein data bank
- pdf, portable document format
- plos, public library of science
- pmc, pubmed central
- ptm, post-translational modification
- rsc, royal society of chemistry
- sda, structured digital abstract
- stm, scientific, technical and medical
- ud, utopia documents
- xml, extensible mark-up language
- xmp, extensible metadata platform
Collapse
Affiliation(s)
- Teresa K Attwood
- School of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, UK.
| | | | | | | | | | | |
Collapse
|
380
|
Klipp E. Timing matters. FEBS Lett 2009; 583:4013-8. [DOI: 10.1016/j.febslet.2009.11.065] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2009] [Revised: 11/10/2009] [Accepted: 11/18/2009] [Indexed: 01/25/2023]
|
381
|
Systems Biology: The elements and principles of Life. FEBS Lett 2009; 583:3882-90. [DOI: 10.1016/j.febslet.2009.11.018] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2009] [Accepted: 11/09/2009] [Indexed: 02/01/2023]
|
382
|
Evolution of biomolecular networks: lessons from metabolic and protein interactions. Nat Rev Mol Cell Biol 2009; 10:791-803. [PMID: 19851337 DOI: 10.1038/nrm2787] [Citation(s) in RCA: 144] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Despite only becoming popular at the beginning of this decade, biomolecular networks are now frameworks that facilitate many discoveries in molecular biology. The nodes of these networks are usually proteins (specifically enzymes in metabolic networks), whereas the links (or edges) are their interactions with other molecules. These networks are made up of protein-protein interactions or enzyme-enzyme interactions through shared metabolites in the case of metabolic networks. Evolutionary analysis has revealed that changes in the nodes and links in protein-protein interaction and metabolic networks are subject to different selection pressures owing to distinct topological features. However, many evolutionary constraints can be uncovered only if temporal and spatial aspects are included in the network analysis.
Collapse
|
383
|
Boer VM, Crutchfield CA, Bradley PH, Botstein D, Rabinowitz JD. Growth-limiting intracellular metabolites in yeast growing under diverse nutrient limitations. Mol Biol Cell 2009; 21:198-211. [PMID: 19889834 PMCID: PMC2801714 DOI: 10.1091/mbc.e09-07-0597] [Citation(s) in RCA: 179] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Microbes tailor their growth rate to nutrient availability. Here, we measured, using liquid chromatography-mass spectrometry, >100 intracellular metabolites in steady-state cultures of Saccharomyces cerevisiae growing at five different rates and in each of five different limiting nutrients. In contrast to gene transcripts, where approximately 25% correlated with growth rate irrespective of the nature of the limiting nutrient, metabolite concentrations were highly sensitive to the limiting nutrient's identity. Nitrogen (ammonium) and carbon (glucose) limitation were characterized by low intracellular amino acid and high nucleotide levels, whereas phosphorus (phosphate) limitation resulted in the converse. Low adenylate energy charge was found selectively in phosphorus limitation, suggesting the energy charge may actually measure phosphorus availability. Particularly strong concentration responses occurred in metabolites closely linked to the limiting nutrient, e.g., glutamine in nitrogen limitation, ATP in phosphorus limitation, and pyruvate in carbon limitation. A simple but physically realistic model involving the availability of these metabolites was adequate to account for cellular growth rate. The complete data can be accessed at the interactive website http://growthrate.princeton.edu/metabolome.
Collapse
Affiliation(s)
- Viktor M Boer
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | | | | | | | | |
Collapse
|
384
|
Oberhardt MA, Palsson BØ, Papin JA. Applications of genome-scale metabolic reconstructions. Mol Syst Biol 2009; 5:320. [PMID: 19888215 PMCID: PMC2795471 DOI: 10.1038/msb.2009.77] [Citation(s) in RCA: 577] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2009] [Accepted: 09/22/2009] [Indexed: 12/12/2022] Open
Abstract
The availability and utility of genome-scale metabolic reconstructions have exploded since the first genome-scale reconstruction was published a decade ago. Reconstructions have now been built for a wide variety of organisms, and have been used toward five major ends: (1) contextualization of high-throughput data, (2) guidance of metabolic engineering, (3) directing hypothesis-driven discovery, (4) interrogation of multi-species relationships, and (5) network property discovery. In this review, we examine the many uses and future directions of genome-scale metabolic reconstructions, and we highlight trends and opportunities in the field that will make the greatest impact on many fields of biology.
Collapse
Affiliation(s)
- Matthew A Oberhardt
- Department of Biomedical Engineering, University of Virginia, Health System, Charlottesville, VA, USA
| | | | | |
Collapse
|
385
|
Gerlee P, Lizana L, Sneppen K. Pathway identification by network pruning in the metabolic network of Escherichia coli. ACTA ACUST UNITED AC 2009; 25:3282-8. [PMID: 19808881 DOI: 10.1093/bioinformatics/btp575] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION All metabolic networks contain metabolites, such as ATP and NAD, known as currency metabolites, which take part in many reactions. These are often removed in the study of these networks, but no consensus exists on what actually constitutes a currency metabolite, and it is also unclear how these highly connected nodes contribute to the global structure of the network. RESULTS In this article, we analyse how the Escherichia coli metabolic network responds to pruning in the form of sequential removal of metabolites with highest degree. As expected this leads to network fragmentation, but the process by which it occurs suggests modularity and long-range correlations within the network. We find that the pruned networks contain longer paths than the random expectation, and that the paths that survive the pruning also exhibit a lower cost (number of involved metabolites) compared with random paths in the full metabolic network. Finally we confirm that paths detected by pruning overlap with known metabolic pathways. We conclude that pruning reveals functional pathways in metabolic networks, where currency metabolites may be seen as ingredients in a well-balanced soup in which main metabolic production lines are immersed. CONTACT gerlee@nbi.dk SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- P Gerlee
- Niels Bohr Institute, Blegdamsvej 17, 2100, Copenhagen, Denmark.
| | | | | |
Collapse
|
386
|
Christian N, May P, Kempa S, Handorf T, Ebenhöh O. An integrative approach towards completing genome-scale metabolic networks. MOLECULAR BIOSYSTEMS 2009; 5:1889-903. [PMID: 19763335 DOI: 10.1039/b915913b] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Genome-scale metabolic networks which have been automatically derived through sequence comparison techniques are necessarily incomplete. We propose a strategy that incorporates genomic sequence data and metabolite profiles into modeling approaches to arrive at improved gene annotations and more complete genome-scale metabolic networks. The core of our strategy is an algorithm that computes minimal sets of reactions by which a draft network has to be extended in order to be consistent with experimental observations. A particular strength of our approach is that alternative possibilities are suggested and thus experimentally testable hypotheses are produced. We carefully evaluate our strategy on the well-studied metabolic network of Escherichia coli, demonstrating how the predictions can be improved by incorporating sequence data. Subsequently, we apply our method to the recently sequenced green alga Chlamydomonas reinhardtii. We suggest specific genes in the genome of Chlamydomonas which are the strongest candidates for coding the responsible enzymes.
Collapse
Affiliation(s)
- Nils Christian
- Max-Planck-Institute for Molecular Plant Physiology, Potsdam-Golm, Germany
| | | | | | | | | |
Collapse
|
387
|
Tan KC, Ipcho SVS, Trengove RD, Oliver RP, Solomon PS. Assessing the impact of transcriptomics, proteomics and metabolomics on fungal phytopathology. MOLECULAR PLANT PATHOLOGY 2009; 10:703-15. [PMID: 19694958 PMCID: PMC6640398 DOI: 10.1111/j.1364-3703.2009.00565.x] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
SUMMARY Peer-reviewed literature is today littered with exciting new tools and techniques that are being used in all areas of biology and medicine. Transcriptomics, proteomics and, more recently, metabolomics are three of these techniques that have impacted on fungal plant pathology. Used individually, each of these techniques can generate a plethora of data that could occupy a laboratory for years. When used in combination, they have the potential to comprehensively dissect a system at the transcriptional and translational level. Transcriptomics, or quantitative gene expression profiling, is arguably the most familiar to researchers in the field of fungal plant pathology. Microarrays have been the primary technique for the last decade, but others are now emerging. Proteomics has also been exploited by the fungal phytopathogen community, but perhaps not to its potential. A lack of genome sequence information has frustrated proteomics researchers and has largely contributed to this technique not fulfilling its potential. The coming of the genome sequencing era has partially alleviated this problem. Metabolomics is the most recent of these techniques to emerge and is concerned with the non-targeted profiling of all metabolites in a given system. Metabolomics studies on fungal plant pathogens are only just beginning to appear, although its potential to dissect many facets of the pathogen and disease will see its popularity increase quickly. This review assesses the impact of transcriptomics, proteomics and metabolomics on fungal plant pathology over the last decade and discusses their futures. Each of the techniques is described briefly with further reading recommended. Key examples highlighting the application of these technologies to fungal plant pathogens are also reviewed.
Collapse
Affiliation(s)
- Kar-Chun Tan
- Australian Centre for Necrotrophic Fungal Pathogens, SABC, Faculty of Health Sciences, Murdoch University, Murdoch 6150, Australia
| | | | | | | | | |
Collapse
|
388
|
de Groot MJL, van Berlo RJP, van Winden WA, Verheijen PJT, Reinders MJT, de Ridder D. Metabolite and reaction inference based on enzyme specificities. ACTA ACUST UNITED AC 2009; 25:2975-82. [PMID: 19696044 PMCID: PMC2773254 DOI: 10.1093/bioinformatics/btp507] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Motivation: Many enzymes are not absolutely specific, or even promiscuous: they can catalyze transformations of more compounds than the traditional ones as listed in, e.g. KEGG. This information is currently only available in databases, such as the BRENDA enzyme activity database. In this article, we propose to model enzyme aspecificity by predicting whether an input compound is likely to be transformed by a certain enzyme. Such a predictor has many applications, for example, to complete reconstructed metabolic networks, to aid in metabolic engineering or to help identify unknown peaks in mass spectra. Results: We have developed a system for metabolite and reaction inference based on enzyme specificities (MaRIboES). It employs structural and stereochemistry similarity measures and molecular fingerprints to generalize enzymatic reactions based on data available in BRENDA. Leave-one-out cross-validation shows that 80% of known reactions are predicted well. Application to the yeast glycolytic and pentose phosphate pathways predicts a large number of known and new reactions, often leading to the formation of novel compounds, as well as a number of interesting bypasses and cross-links. Availability: Matlab and C++ code is freely available at https://gforge.nbic.nl/projects/mariboes/ Contact:d.deridder@tudelft.nl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- M J L de Groot
- The Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, Mekelweg 4, 2628 CD Delft, The Netherlands
| | | | | | | | | | | |
Collapse
|
389
|
Pathway databases and tools for their exploitation: benefits, current limitations and challenges. Mol Syst Biol 2009; 5:290. [PMID: 19638971 PMCID: PMC2724977 DOI: 10.1038/msb.2009.47] [Citation(s) in RCA: 130] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2009] [Accepted: 06/15/2009] [Indexed: 12/28/2022] Open
Abstract
In past years, comprehensive representations of cell signalling pathways have been developed by manual curation from literature, which requires huge effort and would benefit from information stored in databases and from automatic retrieval and integration methods. Once a reconstruction of the network of interactions is achieved, analysis of its structural features and its dynamic behaviour can take place. Mathematical modelling techniques are used to simulate the complex behaviour of cell signalling networks, which ultimately sheds light on the mechanisms leading to complex diseases or helps in the identification of drug targets. A variety of databases containing information on cell signalling pathways have been developed in conjunction with methodologies to access and analyse the data. In principle, the scenario is prepared to make the most of this information for the analysis of the dynamics of signalling pathways. However, are the knowledge repositories of signalling pathways ready to realize the systems biology promise? In this article we aim to initiate this discussion and to provide some insights on this issue.
Collapse
|
390
|
YAN SK, ZHAO J, DOU SS, JIANG P, LIU RH, ZHANG WD. Methodology of Modernization Research in Traditional Chinese Medicine Based on Systems Biology and Network Biology. Chin J Nat Med 2009. [DOI: 10.3724/sp.j.1009.2009.00249] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
391
|
Draper J, Enot DP, Parker D, Beckmann M, Snowdon S, Lin W, Zubair H. Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'. BMC Bioinformatics 2009; 10:227. [PMID: 19622150 PMCID: PMC2721842 DOI: 10.1186/1471-2105-10-227] [Citation(s) in RCA: 127] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2009] [Accepted: 07/21/2009] [Indexed: 01/21/2023] Open
Abstract
Background Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of < 5 ppm (parts per million) thus providing potentially a direct method for signal putative annotation using databases containing metabolite mass information. Most database interfaces support only simple queries with the default assumption that molecules either gain or lose a single proton when ionised. In reality the annotation process is confounded by the fact that many ionisation products will be not only molecular isotopes but also salt/solvent adducts and neutral loss fragments of original metabolites. This report describes an annotation strategy that will allow searching based on all potential ionisation products predicted to form during electrospray ionisation (ESI). Results Metabolite 'structures' harvested from publicly accessible databases were converted into a common format to generate a comprehensive archive in MZedDB. 'Rules' were derived from chemical information that allowed MZedDB to generate a list of adducts and neutral loss fragments putatively able to form for each structure and calculate, on the fly, the exact molecular weight of every potential ionisation product to provide targets for annotation searches based on accurate mass. We demonstrate that data matrices representing populations of ionisation products generated from different biological matrices contain a large proportion (sometimes > 50%) of molecular isotopes, salt adducts and neutral loss fragments. Correlation analysis of ESI-MS data features confirmed the predicted relationships of m/z signals. An integrated isotope enumerator in MZedDB allowed verification of exact isotopic pattern distributions to corroborate experimental data. Conclusion We conclude that although ultra-high accurate mass instruments provide major insight into the chemical diversity of biological extracts, the facile annotation of a large proportion of signals is not possible by simple, automated query of current databases using computed molecular formulae. Parameterising MZedDB to take into account predicted ionisation behaviour and the biological source of any sample improves greatly both the frequency and accuracy of potential annotation 'hits' in ESI-MS data.
Collapse
Affiliation(s)
- John Draper
- Institute of Biological Environmental and Rural Sciences, Aberystwyth University, Aberystwyth SY23 3DA, UK.
| | | | | | | | | | | | | |
Collapse
|
392
|
Williamson T, Schwartz JM, Kell DB, Stateva L. Deterministic mathematical models of the cAMP pathway in Saccharomyces cerevisiae. BMC SYSTEMS BIOLOGY 2009; 3:70. [PMID: 19607691 PMCID: PMC2719611 DOI: 10.1186/1752-0509-3-70] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2009] [Accepted: 07/16/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND Cyclic adenosine monophosphate (cAMP) has a key signaling role in all eukaryotic organisms. In Saccharomyces cerevisiae, it is the second messenger in the Ras/PKA pathway which regulates nutrient sensing, stress responses, growth, cell cycle progression, morphogenesis, and cell wall biosynthesis. A stochastic model of the pathway has been reported. RESULTS We have created deterministic mathematical models of the PKA module of the pathway, as well as the complete cAMP pathway. First, a simplified conceptual model was created which reproduced the dynamics of changes in cAMP levels in response to glucose addition in wild-type as well as cAMP phosphodiesterase deletion mutants. This model was used to investigate the role of the regulatory Krh proteins that had not been included previously. The Krh-containing conceptual model reproduced very well the experimental evidence supporting the role of Krh as a direct inhibitor of PKA. These results were used to develop the Complete cAMP Model. Upon simulation it illustrated several important features of the yeast cAMP pathway: Pde1p is more important than is Pde2p for controlling the cAMP levels following glucose pulses; the proportion of active PKA is not directly proportional to the cAMP level, allowing PKA to exert negative feedback; negative feedback mechanisms include activating Pde1p and deactivating Ras2 via phosphorylation of Cdc25. The Complete cAMP model is easier to simulate, and although significantly simpler than the existing stochastic one, it recreates cAMP levels and patterns of changes in cAMP levels observed experimentally in vivo in response to glucose addition in wild-type as well as representative mutant strains such as pde1Delta, pde2Delta, cyr1Delta, and others. The complete model is made available in SBML format. CONCLUSION We suggest that the lower number of reactions and parameters makes these models suitable for integrating them with models of metabolism or of the cell cycle in S. cerevisiae. Similar models could be also useful for studies in the human pathogen Candida albicans as well as other less well-characterized fungal species.
Collapse
Affiliation(s)
- Thomas Williamson
- Faculty of Life Sciences, The University of Manchester, Manchester, M13 9PT, UK.
| | | | | | | |
Collapse
|
393
|
Petranovic D, Vemuri GN. Impact of yeast systems biology on industrial biotechnology. J Biotechnol 2009; 144:204-11. [PMID: 19616047 DOI: 10.1016/j.jbiotec.2009.07.005] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2009] [Revised: 06/29/2009] [Accepted: 07/08/2009] [Indexed: 11/16/2022]
Abstract
Systems biology is yet an emerging discipline that aims to quantitatively describe and predict the functioning of a biological system. This nascent discipline relies on the recent advances in the analytical technology (such as DNA microarrays, mass spectromety, etc.) to quantify cellular characteristics (such as gene expression, protein and metabolite abundance, etc.) and computational methods to integrate information from these measurements. The model eukaryote, Saccharomyces cerevisiae, has played a pivotal role in the development of many of these analytical and computational methods and consequently is the biological system of choice for testing new hypotheses. The knowledge gained from such studies in S. cerevisiae is proving to be extremely useful in designing metabolism that is targeted to specific industrial applications. As a result, the portfolio of products that are being produced using this yeast is expanding rapidly. We review the recent developments in yeast systems biology and how they relate to industrial biotechnology.
Collapse
Affiliation(s)
- Dina Petranovic
- Systems Biology, Department of Chemical and Biological Engineering, Chalmers University of Technology, Kemivägen 10, Göteborg 412 96, Sweden.
| | | |
Collapse
|
394
|
|
395
|
Abstract
SUMMARY The Systems Biology Markup Language (SBML) is an established community XML format for the markup of biochemical models. With the introduction of SBML level 2 version 3, specific model entities, such as species or reactions, can now be annotated using ontological terms. These annotations, which are encoded using the resource description framework (RDF), provide the facility to specify definite terms to individual components, allowing software to unambiguously identify such components and thus link the models to existing data resources. libSBML is an application programming interface library for the manipulation of SBML files. While libSBML provides the facilities for reading and writing such annotations from and to models, it is beyond the scope of libSBML to provide interpretation of these terms. The libAnnotationSBML library introduced here acts as a layer on top of libSBML linking SBML annotations to the web services that describe these ontological terms. Two applications that use this library are described: SbmlSynonymExtractor finds name synonyms of SBML model entities and SbmlReactionBalancer checks SBML files to determine whether specifed reactions are elementally balanced.
Collapse
Affiliation(s)
- Neil Swainston
- Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, University of Manchester, Manchester M1 7DN, UK.
| | | |
Collapse
|
396
|
Abstract
Stable isotope, and in particular (13)C-based flux analysis, is the exclusive approach to experimentally quantify the integrated responses of metabolic networks. Here we describe a protocol that is based on growing microbes on (13)C-labeled glucose and subsequent gas chromatography mass spectrometric detection of (13)C-patterns in protein-bound amino acids. Relying on publicly available software packages, we then describe two complementary mathematical approaches to estimate either local ratios of converging fluxes or absolute fluxes through different pathways. As amino acids in cell protein are abundant and stable, this protocol requires a minimum of equipment and analytical expertise. Most other flux methods are variants of the principles presented here. A true alternative is the analytically more demanding dynamic flux analysis that relies on (13)C-pattern in free intracellular metabolites. The presented protocols take 5-10 d, have been used extensively in the past decade and are exemplified here for the central metabolism of Escherichia coli.
Collapse
Affiliation(s)
- Nicola Zamboni
- Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | | | | | |
Collapse
|
397
|
King RD, Rowland J, Oliver SG, Young M, Aubrey W, Byrne E, Liakata M, Markham M, Pir P, Soldatova LN, Sparkes A, Whelan KE, Clare A. The automation of science. Science 2009; 324:85-9. [PMID: 19342587 DOI: 10.1126/science.1165620] [Citation(s) in RCA: 219] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The basis of science is the hypothetico-deductive method and the recording of experiments in sufficient detail to enable reproducibility. We report the development of Robot Scientist "Adam," which advances the automation of both. Adam has autonomously generated functional genomics hypotheses about the yeast Saccharomyces cerevisiae and experimentally tested these hypotheses by using laboratory automation. We have confirmed Adam's conclusions through manual experiments. To describe Adam's research, we have developed an ontology and logical language. The resulting formalization involves over 10,000 different research units in a nested treelike structure, 10 levels deep, that relates the 6.6 million biomass measurements to their logical description. This formalization describes how a machine contributed to scientific knowledge.
Collapse
Affiliation(s)
- Ross D King
- Department of Computer Science, Aberystwyth University, SY23 3DB, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
398
|
Brown M, Dunn WB, Dobson P, Patel Y, Winder CL, Francis-McIntyre S, Begley P, Carroll K, Broadhurst D, Tseng A, Swainston N, Spasic I, Goodacre R, Kell DB. Mass spectrometry tools and metabolite-specific databases for molecular identification in metabolomics. Analyst 2009; 134:1322-32. [PMID: 19562197 DOI: 10.1039/b901179j] [Citation(s) in RCA: 219] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The chemical identification of mass spectrometric signals in metabolomic applications is important to provide conversion of analytical data to biological knowledge about metabolic pathways. The complexity of electrospray mass spectrometric data acquired from a range of samples (serum, urine, yeast intracellular extracts, yeast metabolic footprints, placental tissue metabolic footprints) has been investigated and has defined the frequency of different ion types routinely detected. Although some ion types were expected (protonated and deprotonated peaks, isotope peaks, multiply charged peaks) others were not expected (sodium formate adduct ions). In parallel, the Manchester Metabolomics Database (MMD) has been constructed with data from genome scale metabolic reconstructions, HMDB, KEGG, Lipid Maps, BioCyc and DrugBank to provide knowledge on 42,687 endogenous and exogenous metabolite species. The combination of accurate mass data for a large collection of metabolites, theoretical isotope abundance data and knowledge of the different ion types detected provided a greater number of electrospray mass spectrometric signals which were putatively identified and with greater confidence in the samples studied. To provide definitive identification metabolite-specific mass spectral libraries for UPLC-MS and GC-MS have been constructed for 1,065 commercially available authentic standards. The MMD data are available at http://dbkgroup.org/MMD/.
Collapse
Affiliation(s)
- M Brown
- Bioanalytical Sciences Group, School of Chemistry, Manchester Interdisciplinary Biocentre, University of Manchester, UK M1 7DN.
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
399
|
Current awareness on yeast. Yeast 2009. [DOI: 10.1002/yea.1619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
|
400
|
Spasic I, Simeonidis E, Messiha HL, Paton NW, Kell DB. KiPar, a tool for systematic information retrieval regarding parameters for kinetic modelling of yeast metabolic pathways. ACTA ACUST UNITED AC 2009; 25:1404-11. [PMID: 19336445 DOI: 10.1093/bioinformatics/btp175] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Most experimental evidence on kinetic parameters is buried in the literature, whose manual searching is complex, time consuming and partial. These shortcomings become particularly acute in systems biology, where these parameters need to be integrated into detailed, genome-scale, metabolic models. These problems are addressed by KiPar, a dedicated information retrieval system designed to facilitate access to the literature relevant for kinetic modelling of a given metabolic pathway in yeast. Searching for kinetic data in the context of an individual pathway offers modularity as a way of tackling the complexity of developing a full metabolic model. It is also suitable for large-scale mining, since multiple reactions and their kinetic parameters can be specified in a single search request, rather than one reaction at a time, which is unsuitable given the size of genome-scale models. RESULTS We developed an integrative approach, combining public data and software resources for the rapid development of large-scale text mining tools targeting complex biological information. The user supplies input in the form of identifiers used in relevant data resources to refer to the concepts of interest, e.g. EC numbers, GO and SBO identifiers. By doing so, the user is freed from providing any other knowledge or terminology concerned with these concepts and their relations, since they are retrieved from these and cross-referenced resources automatically. The terminology acquired is used to index the literature by mapping concepts to their synonyms, and then to textual documents mentioning them. The indexing results and the previously acquired knowledge about relations between concepts are used to formulate complex search queries aiming at documents relevant to the user's information needs. The conceptual approach is demonstrated in the implementation of KiPar. Evaluation reveals that KiPar performs better than a Boolean search. The precision achieved for abstracts (60%) and full-text articles (48%) is considerably better than the baseline precision (44% and 24%, respectively). The baseline recall is improved by 36% for abstracts and by 100% for full text. It appears that full-text articles are a much richer source of information on kinetic data than are their abstracts. Finally, the combined results for abstracts and full text compared with the curated literature provide high values for relative recall (88%) and novelty ratio (92%), suggesting that the system is able to retrieve a high proportion of new documents. AVAILABILITY Source code and documentation are available at: (http://www.mcisb.org/resources/kipar/).
Collapse
Affiliation(s)
- Irena Spasic
- Manchester Centre for Integrative Systems Biology, The University of Manchester, Manchester, UK.
| | | | | | | | | |
Collapse
|