1051
|
González-Domenech CM, Belda E, Patiño-Navarrete R, Moya A, Peretó J, Latorre A. Metabolic stasis in an ancient symbiosis: genome-scale metabolic networks from two Blattabacterium cuenoti strains, primary endosymbionts of cockroaches. BMC Microbiol 2012; 12 Suppl 1:S5. [PMID: 22376077 PMCID: PMC3287516 DOI: 10.1186/1471-2180-12-s1-s5] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Cockroaches are terrestrial insects that strikingly eliminate waste nitrogen as ammonia instead of uric acid. Blattabacterium cuenoti (Mercier 1906) strains Bge and Pam are the obligate primary endosymbionts of the cockroaches Blattella germanica and Periplaneta americana, respectively. The genomes of both bacterial endosymbionts have recently been sequenced, making possible a genome-scale constraint-based reconstruction of their metabolic networks. The mathematical expression of a metabolic network and the subsequent quantitative studies of phenotypic features by Flux Balance Analysis (FBA) represent an efficient functional approach to these uncultivable bacteria. RESULTS We report the metabolic models of Blattabacterium strains Bge (iCG238) and Pam (iCG230), comprising 296 and 289 biochemical reactions, associated with 238 and 230 genes, and 364 and 358 metabolites, respectively. Both models reflect both the striking similarities and the singularities of these microorganisms. FBA was used to analyze the properties, potential and limits of the models, assuming some environmental constraints such as aerobic conditions and the net production of ammonia from these bacterial systems, as has been experimentally observed. In addition, in silico simulations with the iCG238 model have enabled a set of carbon and nitrogen sources to be defined, which would also support a viable phenotype in terms of biomass production in the strain Pam, which lacks the first three steps of the tricarboxylic acid cycle. FBA reveals a metabolic condition that renders these enzymatic steps dispensable, thus offering a possible evolutionary explanation for their elimination. We also confirm, by computational simulations, the fragility of the metabolic networks and their host dependence. CONCLUSIONS The minimized Blattabacterium metabolic networks are surprisingly similar in strains Bge and Pam, after 140 million years of evolution of these endosymbionts in separate cockroach lineages. FBA performed on the reconstructed networks from the two bacteria helps to refine the functional analysis of the genomes enabling us to postulate how slightly different host metabolic contexts drove their parallel evolution.
Collapse
Affiliation(s)
- Carmen Maria González-Domenech
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
- Faculty of Pharmacy, University of Granada. Campus of Cartuja, E-18071. Granada, Spain
| | - Eugeni Belda
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
| | - Rafael Patiño-Navarrete
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
- Departament de Genètica, Universitat de València, Spain
- Centre for Public Health Research (CSISP), E-46020. València, Spain
| | - Juli Peretó
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
- Departament de Bioquímica i Biologia Molecular, Universitat de València, Spain
| | - Amparo Latorre
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, P.O. Box 22085, E-46071, València, Spain
- Departament de Genètica, Universitat de València, Spain
- Centre for Public Health Research (CSISP), E-46020. València, Spain
| |
Collapse
|
1052
|
Kell DB. Scientific discovery as a combinatorial optimisation problem: how best to navigate the landscape of possible experiments? Bioessays 2012; 34:236-44. [PMID: 22252984 PMCID: PMC3321226 DOI: 10.1002/bies.201100144] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A considerable number of areas of bioscience, including gene and drug discovery, metabolic engineering for the biotechnological improvement of organisms, and the processes of natural and directed evolution, are best viewed in terms of a ‘landscape’ representing a large search space of possible solutions or experiments populated by a considerably smaller number of actual solutions that then emerge. This is what makes these problems ‘hard’, but as such these are to be seen as combinatorial optimisation problems that are best attacked by heuristic methods known from that field. Such landscapes, which may also represent or include multiple objectives, are effectively modelled in silico, with modern active learning algorithms such as those based on Darwinian evolution providing guidance, using existing knowledge, as to what is the ‘best’ experiment to do next. An awareness, and the application, of these methods can thereby enhance the scientific discovery process considerably. This analysis fits comfortably with an emerging epistemology that sees scientific reasoning, the search for solutions, and scientific discovery as Bayesian processes.
Collapse
Affiliation(s)
- Douglas B Kell
- School of Chemistry and Manchester Interdisciplinary Biocentre, The University of Manchester, Manchester, Lancs, UK.
| |
Collapse
|
1053
|
Latendresse M, Krummenacker M, Trupp M, Karp PD. Construction and completion of flux balance models from pathway databases. ACTA ACUST UNITED AC 2012; 28:388-96. [PMID: 22262672 PMCID: PMC3268246 DOI: 10.1093/bioinformatics/btr681] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Flux balance analysis (FBA) is a well-known technique for genome-scale modeling of metabolic flux. Typically, an FBA formulation requires the accurate specification of four sets: biochemical reactions, biomass metabolites, nutrients and secreted metabolites. The development of FBA models can be time consuming and tedious because of the difficulty in assembling completely accurate descriptions of these sets, and in identifying errors in the composition of these sets. For example, the presence of a single non-producible metabolite in the biomass will make the entire model infeasible. Other difficulties in FBA modeling are that model distributions, and predicted fluxes, can be cryptic and difficult to understand. RESULTS We present a multiple gap-filling method to accelerate the development of FBA models using a new tool, called MetaFlux, based on mixed integer linear programming (MILP). The method suggests corrections to the sets of reactions, biomass metabolites, nutrients and secretions. The method generates FBA models directly from Pathway/Genome Databases. Thus, FBA models developed in this framework are easily queried and visualized using the Pathway Tools software. Predicted fluxes are more easily comprehended by visualizing them on diagrams of individual metabolic pathways or of metabolic maps. MetaFlux can also remove redundant high-flux loops, solve FBA models once they are generated and model the effects of gene knockouts. MetaFlux has been validated through construction of FBA models for Escherichia coli and Homo sapiens. AVAILABILITY Pathway Tools with MetaFlux is freely available to academic users, and for a fee to commercial users. Download from: biocyc.org/download.shtml. CONTACT mario.latendresse@sri.com SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mario Latendresse
- Bioinformatics Research Group/Artificial Intelligence Center, SRI International, Menlo Park, CA 94025, USA.
| | | | | | | |
Collapse
|
1054
|
Trinh CT, Thompson RA. Elementary mode analysis: a useful metabolic pathway analysis tool for reprograming microbial metabolic pathways. Subcell Biochem 2012; 64:21-42. [PMID: 23080244 DOI: 10.1007/978-94-007-5055-5_2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Elementary mode analysis is a useful metabolic pathway analysis tool to characterize cellular metabolism. It can identify all feasible metabolic pathways known as elementary modes that are inherent to a metabolic network. Each elementary mode contains a minimal and unique set of enzymatic reactions that can support cellular functions at steady state. Knowledge of all these pathway options enables systematic characterization of cellular phenotypes, analysis of metabolic network properties (e.g. structure, regulation, robustness, and fragility), phenotypic behavior discovery, and rational strain design for metabolic engineering application. This chapter focuses on the application of elementary mode analysis to reprogram microbial metabolic pathways for rational strain design and the metabolic pathway evolution of designed strains.
Collapse
Affiliation(s)
- Cong T Trinh
- Department of Chemical and Biomolecular Engineering, University of Tennessee, Knoxville, TN, USA,
| | | |
Collapse
|
1055
|
Gerdtzen ZP. Modeling metabolic networks for mammalian cell systems: general considerations, modeling strategies, and available tools. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2012; 127:71-108. [PMID: 21984615 DOI: 10.1007/10_2011_120] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
Over the past decades, the availability of large amounts of information regarding cellular processes and reaction rates, along with increasing knowledge about the complex mechanisms involved in these processes, has changed the way we approach the understanding of cellular processes. We can no longer rely only on our intuition for interpreting experimental data and evaluating new hypotheses, as the information to analyze is becoming increasingly complex. The paradigm for the analysis of cellular systems has shifted from a focus on individual processes to comprehensive global mathematical descriptions that consider the interactions of metabolic, genomic, and signaling networks. Analysis and simulations are used to test our knowledge by refuting or validating new hypotheses regarding a complex system, which can result in predictive capabilities that lead to better experimental design. Different types of models can be used for this purpose, depending on the type and amount of information available for the specific system. Stoichiometric models are based on the metabolic structure of the system and allow explorations of steady state distributions in the network. Detailed kinetic models provide a description of the dynamics of the system, they involve a large number of reactions with varied kinetic characteristics and require a large number of parameters. Models based on statistical information provide a description of the system without information regarding structure and interactions of the networks involved. The development of detailed models for mammalian cell metabolism has only recently started to grow more strongly, due to the intrinsic complexities of mammalian systems, and the limited availability of experimental information and adequate modeling tools. In this work we review the strategies, tools, current advances, and recent models of mammalian cells, focusing mainly on metabolism, but discussing the methodology applied to other types of networks as well.
Collapse
Affiliation(s)
- Ziomara P Gerdtzen
- Department of Chemical Engineering and Biotechnology, Millennium Institute for Cell Dynamics and Biotechnology: a Centre for Systems Biology, University of Chile, Beauchef 850, Santiago, Chile,
| |
Collapse
|
1056
|
Systems Metabolic Engineering: The Creation of Microbial Cell Factories by Rational Metabolic Design and Evolution. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2012; 131:1-23. [DOI: 10.1007/10_2012_137] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
1057
|
Levy R, Borenstein E. Reverse Ecology: from systems to environments and back. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 751:329-45. [PMID: 22821465 DOI: 10.1007/978-1-4614-3567-9_15] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The structure of complex biological systems reflects not only their function but also the environments in which they evolved and are adapted to. Reverse Ecology-an emerging new frontier in Evolutionary Systems Biology-aims to extract this information and to obtain novel insights into an organism's ecology. The Reverse Ecology framework facilitates the translation of high-throughput genomic data into large-scale ecological data, and has the potential to transform ecology into a high-throughput field. In this chapter, we describe some of the pioneering work in Reverse Ecology, demonstrating how system-level analysis of complex biological networks can be used to predict the natural habitats of poorly characterized microbial species, their interactions with other species, and universal patterns governing the adaptation of organisms to their environments. We further present several studies that applied Reverse Ecology to elucidate various aspects of microbial ecology, and lay out exciting future directions and potential future applications in biotechnology, biomedicine, and ecological engineering.
Collapse
Affiliation(s)
- Roie Levy
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | | |
Collapse
|
1058
|
Cattle genomics and its implications for future nutritional strategies for dairy cattle. Animal 2011; 7 Suppl 1:172-83. [PMID: 23031138 DOI: 10.1017/s1751731111002588] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The recently sequenced cattle (Bos taurus) genome unraveled the unique genomic features of the species and provided the molecular basis for applying a systemic approach to systematically link genomic information to metabolic traits. Comparative analysis has identified a variety of evolutionary adaptive features in the cattle genome, such as an expansion of the gene families related to the rumen function, large number of chromosomal rearrangements affecting regulation of genes for lactation, and chromosomal rearrangements that are associated with segmental duplications and copy number variations. Metabolic reconstruction of the cattle genome has revealed that core metabolic pathways are highly conserved among mammals although five metabolic genes are deleted or highly diverged and seven metabolic genes are present in duplicate in the cattle genome compared to their human counter parts. The evolutionary loss and gain of metabolic genes in the cattle genome may reflect metabolic adaptations of cattle. Metabolic reconstruction also provides a platform for better understanding of metabolic regulation in cattle and ruminants. A substantial body of transcriptomics data from dairy and beef cattle under different nutritional management and across different stages of growth and lactation are already available and will aid in linking the genome with metabolism and nutritional physiology of cattle. Application of cattle genomics has great potential for future development of nutritional strategies to improve efficiency and sustainability of beef and milk production. One of the biggest challenges is to integrate genomic and phenotypic data and interpret them in a biological and practical platform. Systems biology, a holistic and systemic approach, will be very useful in overcoming this challenge.
Collapse
|
1059
|
Reconstruction of Arabidopsis metabolic network models accounting for subcellular compartmentalization and tissue-specificity. Proc Natl Acad Sci U S A 2011; 109:339-44. [PMID: 22184215 DOI: 10.1073/pnas.1100358109] [Citation(s) in RCA: 189] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Plant metabolic engineering is commonly used in the production of functional foods and quality trait improvement. However, to date, computational model-based approaches have only been scarcely used in this important endeavor, in marked contrast to their prominent success in microbial metabolic engineering. In this study we present a computational pipeline for the reconstruction of fully compartmentalized tissue-specific models of Arabidopsis thaliana on a genome scale. This reconstruction involves automatic extraction of known biochemical reactions in Arabidopsis for both primary and secondary metabolism, automatic gap-filling, and the implementation of methods for determining subcellular localization and tissue assignment of enzymes. The reconstructed tissue models are amenable for constraint-based modeling analysis, and significantly extend upon previous model reconstructions. A set of computational validations (i.e., cross-validation tests, simulations of known metabolic functionalities) and experimental validations (comparison with experimental metabolomics datasets under various compartments and tissues) strongly testify to the predictive ability of the models. The utility of the derived models was demonstrated in the prediction of measured fluxes in metabolically engineered seed strains and the design of genetic manipulations that are expected to increase vitamin E content, a significant nutrient for human health. Overall, the reconstructed tissue models are expected to lay down the foundations for computational-based rational design of plant metabolic engineering. The reconstructed compartmentalized Arabidopsis tissue models are MIRIAM-compliant and are available upon request.
Collapse
|
1060
|
Monitoring metabolites consumption and secretion in cultured cells using ultra-performance liquid chromatography quadrupole-time of flight mass spectrometry (UPLC-Q-ToF-MS). Anal Bioanal Chem 2011; 402:1183-98. [PMID: 22159369 DOI: 10.1007/s00216-011-5556-4] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Revised: 10/30/2011] [Accepted: 11/02/2011] [Indexed: 10/14/2022]
Abstract
Here we present an ultra-performance liquid chromatography-mass spectrometry (UPLC-MS) method for extracellular measurements of known and unexpected metabolites in parallel. The method was developed by testing 86 metabolites, including amino acids, organic acids, sugars, purines, pyrimidines, vitamins, and nucleosides, that can be resolved by combining chromatographic and m/z dimensions. Subsequently, a targeted quantitative method was developed for 80 metabolites. The presented method combines a UPLC approach using hydrophilic interaction liquid chromatography (HILIC) and MS detection achieved by a hybrid quadrupole-time of flight (Q-ToF) mass spectrometer. The optimal setup was achieved by evaluating reproducibility and repeatability of the analytical platforms using pooled quality control samples to minimize the drift in instrumental performance over time. Then, the method was validated by analyzing extracellular metabolites from acute lymphoblastic leukemia cell lines (MOLT-4 and CCRF-CEM) treated with direct (A-769662) and indirect (AICAR) AMP activated kinase (AMPK) activators, monitoring uptake and secretion of the targeted compound over time. This analysis pointed towards a perturbed purine and pyrimidine catabolism upon AICAR treatment. Our data suggest that the method presented can be used for qualitative and quantitative analysis of extracellular metabolites and it is suitable for routine applications such as in vitro drug screening.
Collapse
|
1061
|
Genome-scale metabolic reconstruction and hypothesis testing in the methanogenic archaeon Methanosarcina acetivorans C2A. J Bacteriol 2011; 194:855-65. [PMID: 22139506 DOI: 10.1128/jb.06040-11] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Methanosarcina acetivorans strain C2A is a marine methanogenic archaeon notable for its substrate utilization, genetic tractability, and novel energy conservation mechanisms. To help probe the phenotypic implications of this organism's unique metabolism, we have constructed and manually curated a genome-scale metabolic model of M. acetivorans, iMB745, which accounts for 745 of the 4,540 predicted protein-coding genes (16%) in the M. acetivorans genome. The reconstruction effort has identified key knowledge gaps and differences in peripheral and central metabolism between methanogenic species. Using flux balance analysis, the model quantitatively predicts wild-type phenotypes and is 96% accurate in knockout lethality predictions compared to currently available experimental data. The model was used to probe the mechanisms and energetics of by-product formation and growth on carbon monoxide, as well as the nature of the reaction catalyzed by the soluble heterodisulfide reductase HdrABC in M. acetivorans. The genome-scale model provides quantitative and qualitative hypotheses that can be used to help iteratively guide additional experiments to further the state of knowledge about methanogenesis.
Collapse
|
1062
|
Kim HU, Sohn SB, Lee SY. Metabolic network modeling and simulation for drug targeting and discovery. Biotechnol J 2011; 7:330-42. [DOI: 10.1002/biot.201100159] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2011] [Revised: 09/09/2011] [Accepted: 10/08/2011] [Indexed: 11/08/2022]
|
1063
|
Peyraud R, Schneider K, Kiefer P, Massou S, Vorholt JA, Portais JC. Genome-scale reconstruction and system level investigation of the metabolic network of Methylobacterium extorquens AM1. BMC SYSTEMS BIOLOGY 2011; 5:189. [PMID: 22074569 PMCID: PMC3227643 DOI: 10.1186/1752-0509-5-189] [Citation(s) in RCA: 93] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/06/2011] [Accepted: 11/10/2011] [Indexed: 01/21/2023]
Abstract
Background Methylotrophic microorganisms are playing a key role in biogeochemical processes - especially the global carbon cycle - and have gained interest for biotechnological purposes. Significant progress was made in the recent years in the biochemistry, genetics, genomics, and physiology of methylotrophic bacteria, showing that methylotrophy is much more widespread and versatile than initially assumed. Despite such progress, system-level description of the methylotrophic metabolism is currently lacking, and much remains to understand regarding the network-scale organization and properties of methylotrophy, and how the methylotrophic capacity emerges from this organization, especially in facultative organisms. Results In this work, we report on the integrated, system-level investigation of the metabolic network of the facultative methylotroph Methylobacterium extorquens AM1, a valuable model of methylotrophic bacteria. The genome-scale metabolic network of the bacterium was reconstructed and contains 1139 reactions and 977 metabolites. The sub-network operating upon methylotrophic growth was identified from both in silico and experimental investigations, and 13C-fluxomics was applied to measure the distribution of metabolic fluxes under such conditions. The core metabolism has a highly unusual topology, in which the unique enzymes that catalyse the key steps of C1 assimilation are tightly connected by several, large metabolic cycles (serine cycle, ethylmalonyl-CoA pathway, TCA cycle, anaplerotic processes). The entire set of reactions must operate as a unique process to achieve C1 assimilation, but was shown to be structurally fragile based on network analysis. This observation suggests that in nature a strong pressure of selection must exist to maintain the methylotrophic capability. Nevertheless, substantial substrate cycling could be measured within C2/C3/C4 inter-conversions, indicating that the metabolic network is highly versatile around a flexible backbone of central reactions that allows rapid switching to multi-carbon sources. Conclusions This work emphasizes that the metabolism of M. extorquens AM1 is adapted to its lifestyle not only in terms of enzymatic equipment, but also in terms of network-level structure and regulation. It suggests that the metabolism of the bacterium has evolved both structurally and functionally to an efficient but transitory utilization of methanol. Besides, this work provides a basis for metabolic engineering to convert methanol into value-added products.
Collapse
Affiliation(s)
- Rémi Peyraud
- Institute of Microbiology, ETH Zürich, 8093 Zürich, Switzerland
| | | | | | | | | | | |
Collapse
|
1064
|
Kim TY, Sohn SB, Kim YB, Kim WJ, Lee SY. Recent advances in reconstruction and applications of genome-scale metabolic models. Curr Opin Biotechnol 2011; 23:617-23. [PMID: 22054827 DOI: 10.1016/j.copbio.2011.10.007] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2011] [Accepted: 10/17/2011] [Indexed: 11/19/2022]
Abstract
In the last decade, reconstruction and applications of genome-scale metabolic models have greatly influenced the field of systems biology by providing a platform on which high-throughput computational analysis of metabolic networks can be performed. The last two years have seen an increase in volume of more than 33% in the number of published genome-scale metabolic models, signifying a high demand for these metabolic models in studying specific organisms. The diversity in modeling different types of cells, from photosynthetic microorganisms to human cell types, also demonstrates their growing influence in biology. Here we review the recent advances and current state of genome-scale metabolic models, the methods employed towards ensuring high quality models, their biotechnological applications, and the progress towards the automated reconstruction of genome-scale metabolic models.
Collapse
Affiliation(s)
- Tae Yong Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 305-701, Republic of Korea
| | | | | | | | | |
Collapse
|
1065
|
Abstract
Understanding complex biological systems requires extensive support from software tools. Such tools are needed at each step of a systems biology computational workflow, which typically consists of data handling, network inference, deep curation, dynamical simulation and model analysis. In addition, there are now efforts to develop integrated software platforms, so that tools that are used at different stages of the workflow and by different researchers can easily be used together. This Review describes the types of software tools that are required at different stages of systems biology research and the current options that are available for systems biology researchers. We also discuss the challenges and prospects for modelling the effects of genetic changes on physiology and the concept of an integrated platform.
Collapse
|
1066
|
The evolution of metabolic networks of E. coli. BMC SYSTEMS BIOLOGY 2011; 5:182. [PMID: 22044664 PMCID: PMC3229490 DOI: 10.1186/1752-0509-5-182] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2011] [Accepted: 11/01/2011] [Indexed: 11/19/2022]
Abstract
Background Despite the availability of numerous complete genome sequences from E. coli strains, published genome-scale metabolic models exist only for two commensal E. coli strains. These models have proven useful for many applications, such as engineering strains for desired product formation, and we sought to explore how constructing and evaluating additional metabolic models for E. coli strains could enhance these efforts. Results We used the genomic information from 16 E. coli strains to generate an E. coli pangenome metabolic network by evaluating their collective 76,990 ORFs. Each of these ORFs was assigned to one of 17,647 ortholog groups including ORFs associated with reactions in the most recent metabolic model for E. coli K-12. For orthologous groups that contain an ORF already represented in the MG1655 model, the gene to protein to reaction associations represented in this model could then be easily propagated to other E. coli strain models. All remaining orthologous groups were evaluated to see if new metabolic reactions could be added to generate a pangenome-scale metabolic model (iEco1712_pan). The pangenome model included reactions from a metabolic model update for E. coli K-12 MG1655 (iEco1339_MG1655) and enabled development of five additional strain-specific genome-scale metabolic models. These additional models include a second K-12 strain (iEco1335_W3110) and four pathogenic strains (two enterohemorrhagic E. coli O157:H7 and two uropathogens). When compared to the E. coli K-12 models, the metabolic models for the enterohemorrhagic (iEco1344_EDL933 and iEco1345_Sakai) and uropathogenic strains (iEco1288_CFT073 and iEco1301_UTI89) contained numerous lineage-specific gene and reaction differences. All six E. coli models were evaluated by comparing model predictions to carbon source utilization measurements under aerobic and anaerobic conditions, and to batch growth profiles in minimal media with 0.2% (w/v) glucose. An ancestral genome-scale metabolic model based on conserved ortholog groups in all 16 E. coli genomes was also constructed, reflecting the conserved ancestral core of E. coli metabolism (iEco1053_core). Comparative analysis of all six strain-specific E. coli models revealed that some of the pathogenic E. coli strains possess reactions in their metabolic networks enabling higher biomass yields on glucose. Finally the lineage-specific metabolic traits were compared to the ancestral core model predictions to derive new insight into the evolution of metabolism within this species. Conclusion Our findings demonstrate that a pangenome-scale metabolic model can be used to rapidly construct additional E. coli strain-specific models, and that quantitative models of different strains of E. coli can accurately predict strain-specific phenotypes. Such pangenome and strain-specific models can be further used to engineer metabolic phenotypes of interest, such as designing new industrial E. coli strains.
Collapse
|
1067
|
Bordbar A, Feist AM, Usaite-Black R, Woodcock J, Palsson BO, Famili I. A multi-tissue type genome-scale metabolic network for analysis of whole-body systems physiology. BMC SYSTEMS BIOLOGY 2011; 5:180. [PMID: 22041191 PMCID: PMC3219569 DOI: 10.1186/1752-0509-5-180] [Citation(s) in RCA: 140] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/28/2011] [Accepted: 10/31/2011] [Indexed: 02/06/2023]
Abstract
Background Genome-scale metabolic reconstructions provide a biologically meaningful mechanistic basis for the genotype-phenotype relationship. The global human metabolic network, termed Recon 1, has recently been reconstructed allowing the systems analysis of human metabolic physiology and pathology. Utilizing high-throughput data, Recon 1 has recently been tailored to different cells and tissues, including the liver, kidney, brain, and alveolar macrophage. These models have shown utility in the study of systems medicine. However, no integrated analysis between human tissues has been done. Results To describe tissue-specific functions, Recon 1 was tailored to describe metabolism in three human cells: adipocytes, hepatocytes, and myocytes. These cell-specific networks were manually curated and validated based on known cellular metabolic functions. To study intercellular interactions, a novel multi-tissue type modeling approach was developed to integrate the metabolic functions for the three cell types, and subsequently used to simulate known integrated metabolic cycles. In addition, the multi-tissue model was used to study diabetes: a pathology with systemic properties. High-throughput data was integrated with the network to determine differential metabolic activity between obese and type II obese gastric bypass patients in a whole-body context. Conclusion The multi-tissue type modeling approach presented provides a platform to study integrated metabolic states. As more cell and tissue-specific models are released, it is critical to develop a framework in which to study their interdependencies.
Collapse
Affiliation(s)
- Aarash Bordbar
- GT Life Sciences, 10520 Wateridge Circle, San Diego, CA 92121, USA
| | | | | | | | | | | |
Collapse
|
1068
|
Abstract
Although the metabolic networks of the three domains of life consist of different constituents and metabolic pathways, they exhibit the same scale-free organization. This phenomenon has been hypothetically explained by preferential attachment principle that the new-recruited metabolites attach preferentially to those that are already well connected. However, since metabolites are usually small molecules and metabolic processes are basically chemical reactions, we speculate that the metabolic network organization may have a chemical basis. In this paper, chemoinformatic analyses on metabolic networks of Kyoto Encyclopedia of Genes and Genomes (KEGG), Escherichia coli and Saccharomyces cerevisiae were performed. It was found that there exist qualitative and quantitative correlations between network topology and chemical properties of metabolites. The metabolites with larger degrees of connectivity (hubs) are of relatively stronger polarity. This suggests that metabolic networks are chemically organized to a certain extent, which was further elucidated in terms of high concentrations required by metabolic hubs to drive a variety of reactions. This finding not only provides a chemical explanation to the preferential attachment principle for metabolic network expansion, but also has important implications for metabolic network design and metabolite concentration prediction. The metabolic networks of the three domains of life exhibit the same scale-free organization, which has been hypothetically explained in terms of preferential attachment principle. Here we reveal that the scale-free organization of metabolic networks may have a chemical basis. Through a chemoinformatic analysis on metabolic networks of Kyoto Encyclopedia of Genes and Genomes (KEGG), Escherichia coli and Saccharomyces cerevisiae, it was found that the metabolites with higher degrees of connectivity (hubs) are of relatively stronger polarity. The reason underlying this phenomenon is that to drive a variety of reactions, metabolic hubs have to be highly concentrated. Since the intracellular environments are hydrophilic, metabolic hubs have to be strong-polar to reach high concentrations. This finding has direct implications for metabolic network design and provides a chemical explanation to the preferential attachment principle, which has been validated by numerical simulations of metabolic network expansion. In addition, the correlations between metabolite concentration, metabolic network topology and metabolite chemical properties also suggest that we can use chemical and topological properties of metabolites to predict their intracellular concentrations. A support vector regression model has been successfully established to predict the metabolite concentrations for Escherichia coli.
Collapse
|
1069
|
Charusanti P, Chauhan S, McAteer K, Lerman JA, Hyduke DR, Motin VL, Ansong C, Adkins JN, Palsson BO. An experimentally-supported genome-scale metabolic network reconstruction for Yersinia pestis CO92. BMC SYSTEMS BIOLOGY 2011; 5:163. [PMID: 21995956 PMCID: PMC3220653 DOI: 10.1186/1752-0509-5-163] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2011] [Accepted: 10/13/2011] [Indexed: 11/20/2022]
Abstract
Background Yersinia pestis is a gram-negative bacterium that causes plague, a disease linked historically to the Black Death in Europe during the Middle Ages and to several outbreaks during the modern era. Metabolism in Y. pestis displays remarkable flexibility and robustness, allowing the bacterium to proliferate in both warm-blooded mammalian hosts and cold-blooded insect vectors such as fleas. Results Here we report a genome-scale reconstruction and mathematical model of metabolism for Y. pestis CO92 and supporting experimental growth and metabolite measurements. The model contains 815 genes, 678 proteins, 963 unique metabolites and 1678 reactions, accurately simulates growth on a range of carbon sources both qualitatively and quantitatively, and identifies gaps in several key biosynthetic pathways and suggests how those gaps might be filled. Furthermore, our model presents hypotheses to explain certain known nutritional requirements characteristic of this strain. Conclusions Y. pestis continues to be a dangerous threat to human health during modern times. The Y. pestis genome-scale metabolic reconstruction presented here, which has been benchmarked against experimental data and correctly reproduces known phenotypes, provides an in silico platform with which to investigate the metabolism of this important human pathogen.
Collapse
Affiliation(s)
- Pep Charusanti
- Department of Bioengineering, University of California, San Diego, La Jolla, California, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
1070
|
Orth JD, Conrad TM, Na J, Lerman JA, Nam H, Feist AM, Palsson BØ. A comprehensive genome-scale reconstruction of Escherichia coli metabolism--2011. Mol Syst Biol 2011; 7:535. [PMID: 21988831 PMCID: PMC3261703 DOI: 10.1038/msb.2011.65] [Citation(s) in RCA: 692] [Impact Index Per Article: 53.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Accepted: 08/17/2011] [Indexed: 12/16/2022] Open
Abstract
The initial genome-scale reconstruction of the metabolic network of Escherichia coli K-12 MG1655 was assembled in 2000. It has been updated and periodically released since then based on new and curated genomic and biochemical knowledge. An update has now been built, named iJO1366, which accounts for 1366 genes, 2251 metabolic reactions, and 1136 unique metabolites. iJO1366 was (1) updated in part using a new experimental screen of 1075 gene knockout strains, illuminating cases where alternative pathways and isozymes are yet to be discovered, (2) compared with its predecessor and to experimental data sets to confirm that it continues to make accurate phenotypic predictions of growth on different substrates and for gene knockout strains, and (3) mapped to the genomes of all available sequenced E. coli strains, including pathogens, leading to the identification of hundreds of unannotated genes in these organisms. Like its predecessors, the iJO1366 reconstruction is expected to be widely deployed for studying the systems biology of E. coli and for metabolic engineering applications.
Collapse
Affiliation(s)
- Jeffrey D Orth
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Tom M Conrad
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Jessica Na
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Joshua A Lerman
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, USA
| | - Hojung Nam
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Adam M Feist
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Bernhard Ø Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| |
Collapse
|
1071
|
Fleming RMT, Maes CM, Saunders MA, Ye Y, Palsson BØ. A variational principle for computing nonequilibrium fluxes and potentials in genome-scale biochemical networks. J Theor Biol 2011; 292:71-7. [PMID: 21983269 DOI: 10.1016/j.jtbi.2011.09.029] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2011] [Revised: 08/23/2011] [Accepted: 09/26/2011] [Indexed: 01/22/2023]
Abstract
We derive a convex optimization problem on a steady-state nonequilibrium network of biochemical reactions, with the property that energy conservation and the second law of thermodynamics both hold at the problem solution. This suggests a new variational principle for biochemical networks that can be implemented in a computationally tractable manner. We derive the Lagrange dual of the optimization problem and use strong duality to demonstrate that a biochemical analogue of Tellegen's theorem holds at optimality. Each optimal flux is dependent on a free parameter that we relate to an elementary kinetic parameter when mass action kinetics is assumed.
Collapse
Affiliation(s)
- R M T Fleming
- Center for Systems Biology, University of Iceland, Sturlugata 8, Reykjavik 101, Iceland.
| | | | | | | | | |
Collapse
|
1072
|
Connecting genotype to phenotype in the era of high-throughput sequencing. Biochim Biophys Acta Gen Subj 2011; 1810:967-77. [PMID: 21421023 DOI: 10.1016/j.bbagen.2011.03.010] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2010] [Revised: 02/17/2011] [Accepted: 03/13/2011] [Indexed: 12/25/2022]
|
1073
|
Rolfsson O, Palsson BØ, Thiele I. The human metabolic reconstruction Recon 1 directs hypotheses of novel human metabolic functions. BMC SYSTEMS BIOLOGY 2011; 5:155. [PMID: 21962087 PMCID: PMC3224382 DOI: 10.1186/1752-0509-5-155] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 10/01/2011] [Indexed: 11/29/2022]
Abstract
Background Metabolic network reconstructions formalize our knowledge of metabolism. Gaps in these networks pinpoint regions of metabolism where biological components and functions are "missing." At the same time, a major challenge in the post genomic era involves characterisation of missing biological components to complete genome annotation. Results We used the human metabolic network reconstruction RECON 1 and established constraint-based modelling tools to uncover novel functions associated with human metabolism. Flux variability analysis identified 175 gaps in RECON 1 in the form of blocked reactions. These gaps were unevenly distributed within metabolic pathways but primarily found in the cytosol and often caused by compounds whose metabolic fate, rather than production, is unknown. Using a published algorithm, we computed gap-filling solutions comprised of non-organism specific metabolic reactions capable of bridging the identified gaps. These candidate solutions were found to be dependent upon the reaction environment of the blocked reaction. Importantly, we showed that automatically generated solutions could produce biologically realistic hypotheses of novel human metabolic reactions such as of the fate of iduronic acid following glycan degradation and of N-acetylglutamate in amino acid metabolism. Conclusions The results demonstrate how metabolic models can be utilised to direct hypotheses of novel metabolic functions in human metabolism; a process that we find is heavily reliant upon manual curation and biochemical insight. The effectiveness of a systems approach for novel biochemical pathway discovery in mammals is demonstrated and steps required to tailor future gap filling algorithms to mammalian metabolic networks are proposed.
Collapse
Affiliation(s)
- Ottar Rolfsson
- Center for Systems Biology, University of Iceland, Sturlugata 8, 101 Reykjavik, Iceland
| | | | | |
Collapse
|
1074
|
Alam MT, Takano E, Breitling R. Prioritizing orphan proteins for further study using phylogenomics and gene expression profiles in Streptomyces coelicolor. BMC Res Notes 2011; 4:325. [PMID: 21899768 PMCID: PMC3224560 DOI: 10.1186/1756-0500-4-325] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2011] [Accepted: 09/07/2011] [Indexed: 11/10/2022] Open
Abstract
Background Streptomyces coelicolor, a model organism of antibiotic producing bacteria, has one of the largest genomes of the bacterial kingdom, including 7825 predicted protein coding genes. A large number of these genes, nearly 34%, are functionally orphan (hypothetical proteins with unknown function). However, in gene expression time course data, many of these functionally orphan genes show interesting expression patterns. Results In this paper, we analyzed all functionally orphan genes of Streptomyces coelicolor and identified a list of "high priority" orphans by combining gene expression analysis and additional phylogenetic information (i.e. the level of evolutionary conservation of each protein). Conclusions The prioritized orphan genes are promising candidates to be examined experimentally in the lab for further characterization of their function.
Collapse
Affiliation(s)
- Mohammad Tauqeer Alam
- Institute of Molecular, Cell and Systems Biology College of Medical, Veterinary and Life Sciences, Joseph Black Building B3,10, University of Glasgow, G12 8QQ, Glasgow, UK.
| | | | | |
Collapse
|
1075
|
Teusink B, Bachmann H, Molenaar D. Systems biology of lactic acid bacteria: a critical review. Microb Cell Fact 2011; 10 Suppl 1:S11. [PMID: 21995498 PMCID: PMC3231918 DOI: 10.1186/1475-2859-10-s1-s11] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Understanding the properties of a system as emerging from the interaction of well described parts is the most important goal of Systems Biology. Although in the practice of Lactic Acid Bacteria (LAB) physiology we most often think of the parts as the proteins and metabolites, a wider interpretation of what a part is can be useful. For example, different strains or species can be the parts of a community, or we could study only the chemical reactions as the parts of metabolism (and forgetting about the enzymes that catalyze them), as is done in flux balance analysis. As long as we have some understanding of the properties of these parts, we can investigate whether their interaction leads to novel or unanticipated behaviour of the system that they constitute. There has been a tendency in the Systems Biology community to think that the collection and integration of data should continue ad infinitum, or that we will otherwise not be able to understand the systems that we study in their details. However, it may sometimes be useful to take a step back and consider whether the knowledge that we already have may not explain the system behaviour that we find so intriguing. Reasoning about systems can be difficult, and may require the application of mathematical techniques. The reward is sometimes the realization of unexpected conclusions, or in the worst case, that we still do not know enough details of the parts, or of the interactions between them. We will discuss a number of cases, with a focus on LAB-related work, where a typical systems approach has brought new knowledge or perspective, often counterintuitive, and clashing with conclusions from simpler approaches. Also novel types of testable hypotheses may be generated by the systems approach, which we will illustrate. Finally we will give an outlook on the fields of research where the systems approach may point the way for the near future.
Collapse
Affiliation(s)
- Bas Teusink
- Systems Bioinformatics/NISB, Faculty of Earth and Life Sciences, VU University Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, The Netherlands.
| | | | | |
Collapse
|
1076
|
Jang YS, Park JM, Choi S, Choi YJ, Seung DY, Cho JH, Lee SY. Engineering of microorganisms for the production of biofuels and perspectives based on systems metabolic engineering approaches. Biotechnol Adv 2011; 30:989-1000. [PMID: 21889585 DOI: 10.1016/j.biotechadv.2011.08.015] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2011] [Revised: 08/06/2011] [Accepted: 08/17/2011] [Indexed: 12/30/2022]
Abstract
The increasing oil price and environmental concerns caused by the use of fossil fuel have renewed our interest in utilizing biomass as a sustainable resource for the production of biofuel. It is however essential to develop high performance microbes that are capable of producing biofuels with very high efficiency in order to compete with the fossil fuel. Recently, the strategies for developing microbial strains by systems metabolic engineering, which can be considered as metabolic engineering integrated with systems biology and synthetic biology, have been developed. Systems metabolic engineering allows successful development of microbes that are capable of producing several different biofuels including bioethanol, biobutanol, alkane, and biodiesel, and even hydrogen. In this review, the approaches employed to develop efficient biofuel producers by metabolic engineering and systems metabolic engineering approaches are reviewed with relevant example cases. It is expected that systems metabolic engineering will be employed as an essential strategy for the development of microbial strains for industrial applications.
Collapse
Affiliation(s)
- Yu-Sin Jang
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Program), BioProcess Engineering Research Center, Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Daejeon, Republic of Korea
| | | | | | | | | | | | | |
Collapse
|
1077
|
Milne CB, Eddy JA, Raju R, Ardekani S, Kim PJ, Senger RS, Jin YS, Blaschek HP, Price ND. Metabolic network reconstruction and genome-scale model of butanol-producing strain Clostridium beijerinckii NCIMB 8052. BMC SYSTEMS BIOLOGY 2011; 5:130. [PMID: 21846360 PMCID: PMC3212993 DOI: 10.1186/1752-0509-5-130] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2011] [Accepted: 08/16/2011] [Indexed: 01/29/2023]
Abstract
BACKGROUND Solventogenic clostridia offer a sustainable alternative to petroleum-based production of butanol--an important chemical feedstock and potential fuel additive or replacement. C. beijerinckii is an attractive microorganism for strain design to improve butanol production because it (i) naturally produces the highest recorded butanol concentrations as a byproduct of fermentation; and (ii) can co-ferment pentose and hexose sugars (the primary products from lignocellulosic hydrolysis). Interrogating C. beijerinckii metabolism from a systems viewpoint using constraint-based modeling allows for simulation of the global effect of genetic modifications. RESULTS We present the first genome-scale metabolic model (iCM925) for C. beijerinckii, containing 925 genes, 938 reactions, and 881 metabolites. To build the model we employed a semi-automated procedure that integrated genome annotation information from KEGG, BioCyc, and The SEED, and utilized computational algorithms with manual curation to improve model completeness. Interestingly, we found only a 34% overlap in reactions collected from the three databases--highlighting the importance of evaluating the predictive accuracy of the resulting genome-scale model. To validate iCM925, we conducted fermentation experiments using the NCIMB 8052 strain, and evaluated the ability of the model to simulate measured substrate uptake and product production rates. Experimentally observed fermentation profiles were found to lie within the solution space of the model; however, under an optimal growth objective, additional constraints were needed to reproduce the observed profiles--suggesting the existence of selective pressures other than optimal growth. Notably, a significantly enriched fraction of actively utilized reactions in simulations--constrained to reflect experimental rates--originated from the set of reactions that overlapped between all three databases (P = 3.52 × 10-9, Fisher's exact test). Inhibition of the hydrogenase reaction was found to have a strong effect on butanol formation--as experimentally observed. CONCLUSIONS Microbial production of butanol by C. beijerinckii offers a promising, sustainable, method for generation of this important chemical and potential biofuel. iCM925 is a predictive model that can accurately reproduce physiological behavior and provide insight into the underlying mechanisms of microbial butanol production. As such, the model will be instrumental in efforts to better understand, and metabolically engineer, this microorganism for improved butanol production.
Collapse
Affiliation(s)
- Caroline B Milne
- Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
1078
|
Osterlund T, Nookaew I, Nielsen J. Fifteen years of large scale metabolic modeling of yeast: developments and impacts. Biotechnol Adv 2011; 30:979-88. [PMID: 21846501 DOI: 10.1016/j.biotechadv.2011.07.021] [Citation(s) in RCA: 88] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2011] [Accepted: 07/26/2011] [Indexed: 10/17/2022]
Abstract
Since the first large-scale reconstruction of the Saccharomyces cerevisiae metabolic network 15 years ago the development of yeast metabolic models has progressed rapidly, resulting in no less than nine different yeast genome-scale metabolic models. Here we review the historical development of large-scale mathematical modeling of yeast metabolism and the growing scope and impact of applications of these models in four different areas: as guide for metabolic engineering and strain improvement, as a tool for biological interpretation and discovery, applications of novel computational framework and for evolutionary studies.
Collapse
Affiliation(s)
- Tobias Osterlund
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | | | | |
Collapse
|
1079
|
Schellenberger J, Que R, Fleming RMT, Thiele I, Orth JD, Feist AM, Zielinski DC, Bordbar A, Lewis NE, Rahmanian S, Kang J, Hyduke DR, Palsson BØ. Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat Protoc 2011; 6:1290-307. [PMID: 21886097 DOI: 10.1038/nprot.2011.308] [Citation(s) in RCA: 980] [Impact Index Per Article: 75.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Over the past decade, a growing community of researchers has emerged around the use of constraint-based reconstruction and analysis (COBRA) methods to simulate, analyze and predict a variety of metabolic phenotypes using genome-scale models. The COBRA Toolbox, a MATLAB package for implementing COBRA methods, was presented earlier. Here we present a substantial update of this in silico toolbox. Version 2.0 of the COBRA Toolbox expands the scope of computations by including in silico analysis methods developed since its original release. New functions include (i) network gap filling, (ii) (13)C analysis, (iii) metabolic engineering, (iv) omics-guided analysis and (v) visualization. As with the first version, the COBRA Toolbox reads and writes systems biology markup language-formatted models. In version 2.0, we improved performance, usability and the level of documentation. A suite of test scripts can now be used to learn the core functionality of the toolbox and validate results. This toolbox lowers the barrier of entry to use powerful COBRA methods.
Collapse
Affiliation(s)
- Jan Schellenberger
- Bioinformatics Program, University of California San Diego, La Jolla, California, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1080
|
Metabolic network reconstruction of Chlamydomonas offers insight into light-driven algal metabolism. Mol Syst Biol 2011; 7:518. [PMID: 21811229 PMCID: PMC3202792 DOI: 10.1038/msb.2011.52] [Citation(s) in RCA: 235] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2011] [Accepted: 06/18/2011] [Indexed: 12/18/2022] Open
Abstract
A comprehensive genome-scale metabolic network of Chlamydomonas reinhardtii, including a detailed account of light-driven metabolism, is reconstructed and validated. The model provides a new resource for research of C. reinhardtii metabolism and in algal biotechnology. The genome-scale metabolic network of Chlamydomonas reinhardtii (iRC1080) was reconstructed, accounting for >32% of the estimated metabolic genes encoded in the genome, and including extensive details of lipid metabolic pathways. This is the first metabolic network to explicitly account for stoichiometry and wavelengths of metabolic photon usage, providing a new resource for research of C. reinhardtii metabolism and developments in algal biotechnology. Metabolic functional annotation and the largest transcript verification of a metabolic network to date was performed, at least partially verifying >90% of the transcripts accounted for in iRC1080. Analysis of the network supports hypotheses concerning the evolution of latent lipid pathways in C. reinhardtii, including very long-chain polyunsaturated fatty acid and ceramide synthesis pathways. A novel approach for modeling light-driven metabolism was developed that accounts for both light source intensity and spectral quality of emitted light. The constructs resulting from this approach, termed prism reactions, were shown to significantly improve the accuracy of model predictions, and their use was demonstrated for evaluation of light source efficiency and design.
Algae have garnered significant interest in recent years, especially for their potential application in biofuel production. The hallmark, model eukaryotic microalgae Chlamydomonas reinhardtii has been widely used to study photosynthesis, cell motility and phototaxis, cell wall biogenesis, and other fundamental cellular processes (Harris, 2001). Characterizing algal metabolism is key to engineering production strains and understanding photobiological phenomena. Based on extensive literature on C. reinhardtii metabolism, its genome sequence (Merchant et al, 2007), and gene functional annotation, we have reconstructed and experimentally validated the genome-scale metabolic network for this alga, iRC1080, the first network to account for detailed photon absorption permitting growth simulations under different light sources. iRC1080 accounts for 1080 genes, associated with 2190 reactions and 1068 unique metabolites and encompasses 83 subsystems distributed across 10 cellular compartments (Figure 1A). Its >32% coverage of estimated metabolic genes is a tremendous expansion over previous algal reconstructions (Boyle and Morgan, 2009; Manichaikul et al, 2009). The lipid metabolic pathways of iRC1080 are considerably expanded relative to existing networks, and chemical properties of all metabolites in these pathways are accounted for explicitly, providing sufficient detail to completely specify all individual molecular species: backbone molecule and stereochemical numbering of acyl-chain positions; acyl-chain length; and number, position, and cis–trans stereoisomerism of carbon–carbon double bonds. Such detail in lipid metabolism will be critical for model-driven metabolic engineering efforts. We experimentally verified transcripts accounted for in the network under permissive growth conditions, detecting >90% of tested transcript models (Figure 1B) and providing validating evidence for the contents of iRC1080. We also analyzed the extent of transcript verification by specific metabolic subsystems. Some subsystems stood out as more poorly verified, including chloroplast and mitochondrial transport systems and sphingolipid metabolism, all of which exhibited <80% of transcripts detected, reflecting incomplete characterization of compartmental transporters and supporting a hypothesis of latent pathway evolution for ceramide synthesis in C. reinhardtii. Additional lines of evidence from the reconstruction effort similarly support this hypothesis including lack of ceramide synthetase and other annotation gaps downstream in sphingolipid metabolism. A similar hypothesis of latent pathway evolution was established for very long-chain fatty acids (VLCFAs) and their polyunsaturated analogs (VLCPUFAs) (Figure 1C), owing to the absence of this class of lipids in previous experimental measurements, lack of a candidate VLCFA elongase in the functional annotation, and additional downstream annotation gaps in arachidonic acid metabolism. The network provides a detailed account of metabolic photon absorption by light-driven reactions, including photosystems I and II, light-dependent protochlorophyllide oxidoreductase, provitamin D3 photoconversion to vitamin D3, and rhodopsin photoisomerase; this network accounting permits the precise modeling of light-dependent metabolism. iRC1080 accounts for effective light spectral ranges through analysis of biochemical activity spectra (Figure 3A), either reaction activity or absorbance at varying light wavelengths. Defining effective spectral ranges associated with each photon-utilizing reaction enabled our network to model growth under different light sources via stoichiometric representation of the spectral composition of emitted light, termed prism reactions. Coefficients for different photon wavelengths in a prism reaction correspond to the ratios of photon flux in the defined effective spectral ranges to the total emitted photon flux from a given light source (Figure 3B). This approach distinguishes the amount of emitted photons that drive different metabolic reactions. We created prism reactions for most light sources that have been used in published studies for algal and plant growth including solar light, various light bulbs, and LEDs. We also included regulatory effects, resulting from lighting conditions insofar as published studies enabled. Light and dark conditions have been shown to affect metabolic enzyme activity in C. reinhardtii on multiple levels: transcriptional regulation, chloroplast RNA degradation, translational regulation, and thioredoxin-mediated enzyme regulation. Through application of our light model and prism reactions, we were able to closely recapitulate experimental growth measurements under solar, incandescent, and red LED lights. Through unbiased sampling, we were able to establish the tremendous statistical significance of the accuracy of growth predictions achievable through implementation of prism reactions. Finally, application of the photosynthetic model was demonstrated prospectively to evaluate light utilization efficiency under different light sources. The results suggest that, of the existing light sources, red LEDs provide the greatest efficiency, about three times as efficient as sunlight. Extending this analysis, the model was applied to design a maximally efficient LED spectrum for algal growth. The result was a 677-nm peak LED spectrum with a total incident photon flux of 360 μE/m2/s, suggesting that for the simple objective of maximizing growth efficiency, LED technology has already reached an effective theoretical optimum. In summary, the C. reinhardtii metabolic network iRC1080 that we have reconstructed offers insight into the basic biology of this species and may be employed prospectively for genetic engineering design and light source design relevant to algal biotechnology. iRC1080 was used to analyze lipid metabolism and generate novel hypotheses about the evolution of latent pathways. The predictive capacity of metabolic models developed from iRC1080 was demonstrated in simulating mutant phenotypes and in evaluation of light source efficiency. Our network provides a broad knowledgebase of the biochemistry and genomics underlying global metabolism of a photoautotroph, and our modeling approach for light-driven metabolism exemplifies how integration of largely unvisited data types, such as physicochemical environmental parameters, can expand the diversity of applications of metabolic networks. Metabolic network reconstruction encompasses existing knowledge about an organism's metabolism and genome annotation, providing a platform for omics data analysis and phenotype prediction. The model alga Chlamydomonas reinhardtii is employed to study diverse biological processes from photosynthesis to phototaxis. Recent heightened interest in this species results from an international movement to develop algal biofuels. Integrating biological and optical data, we reconstructed a genome-scale metabolic network for this alga and devised a novel light-modeling approach that enables quantitative growth prediction for a given light source, resolving wavelength and photon flux. We experimentally verified transcripts accounted for in the network and physiologically validated model function through simulation and generation of new experimental growth data, providing high confidence in network contents and predictive applications. The network offers insight into algal metabolism and potential for genetic engineering and efficient light source design, a pioneering resource for studying light-driven metabolism and quantitative systems biology.
Collapse
|
1081
|
iRsp1095: a genome-scale reconstruction of the Rhodobacter sphaeroides metabolic network. BMC SYSTEMS BIOLOGY 2011; 5:116. [PMID: 21777427 PMCID: PMC3152904 DOI: 10.1186/1752-0509-5-116] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/07/2011] [Accepted: 07/21/2011] [Indexed: 11/10/2022]
Abstract
BACKGROUND Rhodobacter sphaeroides is one of the best studied purple non-sulfur photosynthetic bacteria and serves as an excellent model for the study of photosynthesis and the metabolic capabilities of this and related facultative organisms. The ability of R. sphaeroides to produce hydrogen (H₂), polyhydroxybutyrate (PHB) or other hydrocarbons, as well as its ability to utilize atmospheric carbon dioxide (CO₂) as a carbon source under defined conditions, make it an excellent candidate for use in a wide variety of biotechnological applications. A genome-level understanding of its metabolic capabilities should help realize this biotechnological potential. RESULTS Here we present a genome-scale metabolic network model for R. sphaeroides strain 2.4.1, designated iRsp1095, consisting of 1,095 genes, 796 metabolites and 1158 reactions, including R. sphaeroides-specific biomass reactions developed in this study. Constraint-based analysis showed that iRsp1095 agreed well with experimental observations when modeling growth under respiratory and phototrophic conditions. Genes essential for phototrophic growth were predicted by single gene deletion analysis. During pathway-level analyses of R. sphaeroides metabolism, an alternative route for CO₂ assimilation was identified. Evaluation of photoheterotrophic H2 production using iRsp1095 indicated that maximal yield would be obtained from growing cells, with this predicted maximum ~50% higher than that observed experimentally from wild type cells. Competing pathways that might prevent the achievement of this theoretical maximum were identified to guide future genetic studies. CONCLUSIONS iRsp1095 provides a robust framework for future metabolic engineering efforts to optimize the solar- and nutrient-powered production of biofuels and other valuable products by R. sphaeroides and closely related organisms.
Collapse
|
1082
|
Bordbar A, Jamshidi N, Palsson BO. iAB-RBC-283: A proteomically derived knowledge-base of erythrocyte metabolism that can be used to simulate its physiological and patho-physiological states. BMC SYSTEMS BIOLOGY 2011; 5:110. [PMID: 21749716 PMCID: PMC3158119 DOI: 10.1186/1752-0509-5-110] [Citation(s) in RCA: 74] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2011] [Accepted: 07/12/2011] [Indexed: 02/06/2023]
Abstract
BACKGROUND The development of high-throughput technologies capable of whole cell measurements of genes, proteins, and metabolites has led to the emergence of systems biology. Integrated analysis of the resulting omic data sets has proved to be hard to achieve. Metabolic network reconstructions enable complex relationships amongst molecular components to be represented formally in a biologically relevant manner while respecting physical constraints. In silico models derived from such reconstructions can then be queried or interrogated through mathematical simulations. Proteomic profiling studies of the mature human erythrocyte have shown more proteins present related to metabolic function than previously thought; however the significance and the causal consequences of these findings have not been explored. RESULTS Erythrocyte proteomic data was used to reconstruct the most expansive description of erythrocyte metabolism to date, following extensive manual curation, assessment of the literature, and functional testing. The reconstruction contains 281 enzymes representing functions from glycolysis to cofactor and amino acid metabolism. Such a comprehensive view of erythrocyte metabolism implicates the erythrocyte as a potential biomarker for different diseases as well as a 'cell-based' drug-screening tool. The analysis shows that 94 erythrocyte enzymes are implicated in morbid single nucleotide polymorphisms, representing 142 pathologies. In addition, over 230 FDA-approved and experimental pharmaceuticals have enzymatic targets in the erythrocyte. CONCLUSION The advancement of proteomic technologies and increased generation of high-throughput proteomic data have created the need for a means to analyze these data in a coherent manner. Network reconstructions provide a systematic means to integrate and analyze proteomic data in a biologically meaning manner. Analysis of the red cell proteome has revealed an unexpected level of complexity in the functional capabilities of human erythrocyte metabolism.
Collapse
Affiliation(s)
- Aarash Bordbar
- Department of Bioengineering, University of California San Diego, La Jolla, 92093-0412, USA
| | | | | |
Collapse
|
1083
|
Saha R, Suthers PF, Maranas CD. Zea mays iRS1563: a comprehensive genome-scale metabolic reconstruction of maize metabolism. PLoS One 2011; 6:e21784. [PMID: 21755001 PMCID: PMC3131064 DOI: 10.1371/journal.pone.0021784] [Citation(s) in RCA: 136] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2011] [Accepted: 06/09/2011] [Indexed: 11/18/2022] Open
Abstract
The scope and breadth of genome-scale metabolic reconstructions have continued to expand over the last decade. Herein, we introduce a genome-scale model for a plant with direct applications to food and bioenergy production (i.e., maize). Maize annotation is still underway, which introduces significant challenges in the association of metabolic functions to genes. The developed model is designed to meet rigorous standards on gene-protein-reaction (GPR) associations, elementally and charged balanced reactions and a biomass reaction abstracting the relative contribution of all biomass constituents. The metabolic network contains 1,563 genes and 1,825 metabolites involved in 1,985 reactions from primary and secondary maize metabolism. For approximately 42% of the reactions direct literature evidence for the participation of the reaction in maize was found. As many as 445 reactions and 369 metabolites are unique to the maize model compared to the AraGEM model for A. thaliana. 674 metabolites and 893 reactions are present in Zea mays iRS1563 that are not accounted for in maize C4GEM. All reactions are elementally and charged balanced and localized into six different compartments (i.e., cytoplasm, mitochondrion, plastid, peroxisome, vacuole and extracellular). GPR associations are also established based on the functional annotation information and homology prediction accounting for monofunctional, multifunctional and multimeric proteins, isozymes and protein complexes. We describe results from performing flux balance analysis under different physiological conditions, (i.e., photosynthesis, photorespiration and respiration) of a C4 plant and also explore model predictions against experimental observations for two naturally occurring mutants (i.e., bm1 and bm3). The developed model corresponds to the largest and more complete to-date effort at cataloguing metabolism for a plant species.
Collapse
Affiliation(s)
- Rajib Saha
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Patrick F. Suthers
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Costas D. Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
1084
|
Faust K, Croes D, van Helden J. Prediction of metabolic pathways from genome-scale metabolic networks. Biosystems 2011; 105:109-21. [PMID: 21645586 DOI: 10.1016/j.biosystems.2011.05.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2010] [Revised: 03/23/2011] [Accepted: 05/05/2011] [Indexed: 01/06/2023]
Abstract
The analysis of a variety of data sets (transcriptome arrays, phylogenetic profiles, etc.) yields groups of functionally related genes. In order to determine their biological function, associated gene groups are often projected onto known pathways or tested for enrichment of known functions. However, these approaches are not flexible enough to deal with variations or novel pathways. During the last decade, we developed and refined an approach that predicts metabolic pathways from a global metabolic network encompassing all known reactions and their substrates/products, by extracting a subgraph connecting at best a set of seed nodes (compounds, reactions, enzymes or enzyme-coding genes). In this review, we summarize this work, while discussing the problems and pitfalls but also the advantages and applications of network-based metabolic pathway prediction.
Collapse
Affiliation(s)
- Karoline Faust
- Research Group of Bioinformatics and (Eco-)Systems Biology (BSB), VIB - Vrije Universiteit Brussel, Pleinlaan, Belgium
| | | | | |
Collapse
|
1085
|
Serrano MÁ, Sagués F. Network-based scoring system for genome-scale metabolic reconstructions. BMC SYSTEMS BIOLOGY 2011; 5:76. [PMID: 21595941 PMCID: PMC3113238 DOI: 10.1186/1752-0509-5-76] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/03/2011] [Accepted: 05/19/2011] [Indexed: 11/17/2022]
Abstract
Background Network reconstructions at the cell level are a major development in Systems Biology. However, we are far from fully exploiting its potentialities. Often, the incremental complexity of the pursued systems overrides experimental capabilities, or increasingly sophisticated protocols are underutilized to merely refine confidence levels of already established interactions. For metabolic networks, the currently employed confidence scoring system rates reactions discretely according to nested categories of experimental evidence or model-based likelihood. Results Here, we propose a complementary network-based scoring system that exploits the statistical regularities of a metabolic network as a bipartite graph. As an illustration, we apply it to the metabolism of Escherichia coli. The model is adjusted to the observations to derive connection probabilities between individual metabolite-reaction pairs and, after validation, to assess the reliability of each reaction in probabilistic terms. This network-based scoring system uncovers very specific reactions that could be functionally or evolutionary important, identifies prominent experimental targets, and enables further confirmation of modeling results. Conclusions We foresee a wide range of potential applications at different sub-cellular or supra-cellular levels of biological interactions given the natural bipartivity of many biological networks.
Collapse
Affiliation(s)
- M Ángeles Serrano
- Departament de Química Física, Universitat de Barcelona, Martí i Franquès 1, Barcelona, 08028, Spain.
| | | |
Collapse
|
1086
|
Thorleifsson SG, Thiele I. rBioNet: A COBRA toolbox extension for reconstructing high-quality biochemical networks. Bioinformatics 2011; 27:2009-10. [PMID: 21596791 DOI: 10.1093/bioinformatics/btr308] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Genome-scale metabolic networks are widely used in systems biology. However, to date no freely available tool exists that ensures quality control during the reconstruction process. RESULTS Here, we present a COBRA toolbox extension, rBioNet, enabling the construction of publication-level biochemical networks while enforcing necessary quality control measures. rBioNet has an intuitive user interface facilitating the reconstruction process for novices and experts. AVAILABILITY The rBioNet is freely available from http://opencobra.sourceforge.net.
Collapse
Affiliation(s)
- Stefan Gretar Thorleifsson
- Center for Systems Biology, Mechanical Engineering and Computer Science, University of Iceland, Reykjavik, Iceland
| | | |
Collapse
|
1087
|
Kaleta C, de Figueiredo LF, Heiland I, Klamt S, Schuster S. Special issue: integration of OMICs datasets into metabolic pathway analysis. Biosystems 2011; 105:107-8. [PMID: 21619911 DOI: 10.1016/j.biosystems.2011.05.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Christoph Kaleta
- Department of Bioinformatics, School of Biology and Pharmaceutics, Friedrich Schiller University Jena, Germany.
| | | | | | | | | |
Collapse
|
1088
|
Kim HU, Kim SY, Jeong H, Kim TY, Kim JJ, Choy HE, Yi KY, Rhee JH, Lee SY. Integrative genome-scale metabolic analysis of Vibrio vulnificus for drug targeting and discovery. Mol Syst Biol 2011; 7:460. [PMID: 21245845 PMCID: PMC3049409 DOI: 10.1038/msb.2010.115] [Citation(s) in RCA: 130] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2010] [Accepted: 12/06/2010] [Indexed: 01/01/2023] Open
Abstract
Chromosome 1 of Vibrio vulnificus tends to contain larger portion of essential or housekeeping genes on the basis of the genomic analysis and gene knockout experiments performed in this study, while its chromosome 2 seems to have originated and evolved from a plasmid. The genome-scale metabolic network model of V. vulnificus was reconstructed based on databases and literature, and was used to identify 193 essential metabolites. Five essential metabolites finally selected after the filtering process are 2-amino-4-hydroxy-6-hydroxymethyl-7,8-dihydropteridine (AHHMP), D-glutamate (DGLU), 2,3-dihydrodipicolinate (DHDP), 1-deoxy-D-xylulose 5-phosphate (DX5P), and 4-aminobenzoate (PABA), which were predicted to be essential in V. vulnificus, absent in human, and are consumed by multiple reactions. Chemical analogs of the five essential metabolites were screened and a hit compound showing the minimal inhibitory concentration (MIC) of 2 μg/ml and the minimal bactericidal concentration (MBC) of 4 μg/ml against V. vulnificus was identified.
Discovering new antimicrobial targets and consequently new antimicrobials is important as drug resistance of pathogenic microorganisms is becoming an increasingly serious problem in human healthcare management (Fischbach and Walsh, 2009). There clearly exists a gap between genomic studies and drug discovery as the accumulation of knowledge on pathogens at genome level has not successfully transformed into the development of effective drugs (Mills, 2006; Payne et al, 2007). In this study, we dissected the genome of a microbial pathogen in detail, and subsequently developed a systems biological strategy of employing genome-scale metabolic modeling and simulation together with metabolite essentiality analysis for effective drug targeting and discovery. This strategy was used for identifying new drug targets in an opportunistic pathogen Vibrio vulnificus CMCP6 as a model. V. vulnificus is a Gram-negative halophilic bacterium that is found in estuarine waters, brackish ponds, or coastal areas, and its Biotype 1 is an opportunistic human pathogen that can attack immune-compromised patients, and causes primary septicemia, necrotized wound infections, and gastroenteritis. We previously found that many metabolic genes were specifically induced in vivo, suggesting that specific metabolic pathways are essential for in vivo survival and virulence of this pathogen (Kim et al, 2003; Lee et al, 2007). These results motivated us to carry out systems biological analysis of the genome and the metabolic network for new drug target discovery. V. vulnificus CMCP6 has two chromosomes. We first re-sequenced genomic regions assembled in low quality and low depth, and subsequently re-annotated the whole genome of V. vulnificus. Horizontal gene transfer was suspected to be responsible for the diversification of each chromosome of V. vulnificus, and the presence of metabolic genes was more biased to chromosome 1 than chromosome 2. Further studies on V. vulnificus genome revealed that chromosome 2 is more prone to diversification for better adaptation to the environment than its chromosome 1, while chromosome 1 tends to expand their genetic repertoire while maintaining the core genes at a constant level. Next, a genome-scale metabolic network VvuMBEL943 was reconstructed based on literature, databases and experiments for systematic studies on the metabolism of this pathogen and prediction of drug targets. The VvuMBEL943 model is composed of 943 reactions and 765 metabolites, and covers 673 genes. The model was validated by comparing its simulated cell growth phenotype obtained by constraints-based flux analysis with the V. vulnificus-specific experimental data previously reported in the literature. In this study, constraints-based flux analysis is an optimization-based simulation method that calculates intracellular fluxes under the specific genetic and environmental condition (Kim et al, 2008). As a result, 17 growth phenotypes were correctly predicted out of 18 cases, which demonstrate the validity of VvuMBEL943. The main objective of constructing VvuMBEL943 in this study is to predict potential drug targets by system-wide analysis of the metabolic network for the effective treatment of V. vulnificus. To achieve this goal, a set of drug target candidates was predicted by taking a metabolite-centric approach. Metabolite essentiality analysis is a concept recently introduced for the study of cellular robustness to complement conventional reaction or gene-centric approach (Kim et al, 2007b). Metabolite essentiality analysis observes changes in flux distribution by removing each metabolite from the in silico metabolic network. Hence, metabolite essentiality predicts essential metabolites whose absence causes cell death. By selecting essential metabolites, it is possible to directly screen only their structural analogs, which substantially reduces the number of chemical compounds to screen from the chemical compound library. As a result of implementing this approach, 193 metabolites were initially identified to be essential to the cell. These essential metabolites were then further filtered based on the predetermined criteria, mainly organism specificity and multiple connectivity associated with each metabolite, in order to reduce the number of initial target candidates towards identifying the most effective ones. Five essential metabolites finally selected are 2-amino-4-hydroxy-6-hydroxymethyl-7,8-dihydropteridine (AHHMP), D-glutamate (DGLU), 2,3-dihydrodipicolinate (DHDP), 1-deoxy-D-xylulose 5-phosphate (DX5P), and 4-aminobenzoate (PABA). Enzymes that consume these essential metabolites were experimentally verified to be essential, which indeed demonstrates the essentiality of these five metabolites. On the basis of the structural information of these five essential metabolites, whole-cell screening assay was performed using their analogs for possible antibacterial discovery. We screened 352 chemical analogs of the essential metabolites selected from the chemical compound library, and found a hit compound 24837, which shows the minimal inhibitory concentration (MIC) of 2 μg/ml and minimal bactericidal concentration (MBC) of 4 μg/ml, showing good antibacterial activity without further structural modification. Although this study demonstrates a proof-of-concept, the approaches and their rationale taken here should serve as a general strategy for discovering novel antibiotics and drugs based on systems-level analysis of metabolic networks. Although the genomes of many microbial pathogens have been studied to help identify effective drug targets and novel drugs, such efforts have not yet reached full fruition. In this study, we report a systems biological approach that efficiently utilizes genomic information for drug targeting and discovery, and apply this approach to the opportunistic pathogen Vibrio vulnificus CMCP6. First, we partially re-sequenced and fully re-annotated the V. vulnificus CMCP6 genome, and accordingly reconstructed its genome-scale metabolic network, VvuMBEL943. The validated network model was employed to systematically predict drug targets using the concept of metabolite essentiality, along with additional filtering criteria. Target genes encoding enzymes that interact with the five essential metabolites finally selected were experimentally validated. These five essential metabolites are critical to the survival of the cell, and hence were used to guide the cost-effective selection of chemical analogs, which were then screened for antimicrobial activity in a whole-cell assay. This approach is expected to help fill the existing gap between genomics and drug discovery.
Collapse
Affiliation(s)
- Hyun Uk Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 program), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
| | | | | | | | | | | | | | | | | |
Collapse
|
1089
|
Connecting the virtual world of computers to the real world of medicinal chemistry. Future Med Chem 2011; 3:399-403. [PMID: 21452976 DOI: 10.4155/fmc.11.16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Drug discovery involves the simultaneous optimization of chemical and biological properties, usually in a single small molecule, which modulates one of nature's most complex systems: the balance between human health and disease. The increased use of computer-aided methods is having a significant impact on all aspects of the drug-discovery and development process and with improved methods and ever faster computers, computer-aided molecular design will be ever more central to the discovery process.
Collapse
|
1090
|
Oberhardt MA, Puchałka J, Martins dos Santos VAP, Papin JA. Reconciliation of genome-scale metabolic reconstructions for comparative systems analysis. PLoS Comput Biol 2011; 7:e1001116. [PMID: 21483480 PMCID: PMC3068926 DOI: 10.1371/journal.pcbi.1001116] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2010] [Accepted: 02/28/2011] [Indexed: 11/18/2022] Open
Abstract
In the past decade, over 50 genome-scale metabolic reconstructions have been built for a variety of single- and multi- cellular organisms. These reconstructions have enabled a host of computational methods to be leveraged for systems-analysis of metabolism, leading to greater understanding of observed phenotypes. These methods have been sparsely applied to comparisons between multiple organisms, however, due mainly to the existence of differences between reconstructions that are inherited from the respective reconstruction processes of the organisms to be compared. To circumvent this obstacle, we developed a novel process, termed metabolic network reconciliation, whereby non-biological differences are removed from genome-scale reconstructions while keeping the reconstructions as true as possible to the underlying biological data on which they are based. This process was applied to two organisms of great importance to disease and biotechnological applications, Pseudomonas aeruginosa and Pseudomonas putida, respectively. The result is a pair of revised genome-scale reconstructions for these organisms that can be analyzed at a systems level with confidence that differences are indicative of true biological differences (to the degree that is currently known), rather than artifacts of the reconstruction process. The reconstructions were re-validated with various experimental data after reconciliation. With the reconciled and validated reconstructions, we performed a genome-wide comparison of metabolic flexibility between P. aeruginosa and P. putida that generated significant new insight into the underlying biology of these important organisms. Through this work, we provide a novel methodology for reconciling models, present new genome-scale reconstructions of P. aeruginosa and P. putida that can be directly compared at a network level, and perform a network-wide comparison of the two species. These reconstructions provide fresh insights into the metabolic similarities and differences between these important Pseudomonads, and pave the way towards full comparative analysis of genome-scale metabolic reconstructions of multiple species.
Collapse
Affiliation(s)
- Matthew A. Oberhardt
- Department of Biomedical Engineering, University of Virginia,
Charlottesville, Virginia, United States of America
| | - Jacek Puchałka
- Helmholtz Center for Infection Research (HZI), Braunschweig,
Germany
| | - Vítor A. P. Martins dos Santos
- Department of Biomedical Engineering, University of Virginia,
Charlottesville, Virginia, United States of America
- Helmholtz Center for Infection Research (HZI), Braunschweig,
Germany
- * E-mail: (VAPMdS); (JAP)
| | - Jason A. Papin
- Department of Biomedical Engineering, University of Virginia,
Charlottesville, Virginia, United States of America
- * E-mail: (VAPMdS); (JAP)
| |
Collapse
|
1091
|
Silva-Rocha R, Tamames J, dos Santos VM, de Lorenzo V. The logicome of environmental bacteria: merging catabolic and regulatory events with Boolean formalisms. Environ Microbiol 2011; 13:2389-402. [PMID: 21410625 DOI: 10.1111/j.1462-2920.2011.02455.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The regulatory and metabolic networks that rule biodegradation of pollutants by environmental bacteria are wired to the rest of the cellular physiology through both transcriptional factors and intermediary signal molecules. In this review, we examine some formalisms for describing catalytic/regulatory circuits of this sort and advocate the adoption of Boolean logic for combining transcriptional and enzymatic occurrences in the same biological system. As an example, we show how known regulatory and metabolic actions that bring about biodegradation of m-xylene by Pseudomonas putida mt-2 can be represented as clusters of binary operations and then reconstructed as a digital network. Despite the many simplifications, Boolean tools still capture the gross behaviour of the system even in the absence of kinetic constants determined experimentally. On this basis, we argue that still with a limited volume of data binary formalisms allow us to penetrate the raison d'être of extant regulatory and metabolic architectures.
Collapse
Affiliation(s)
- Rafael Silva-Rocha
- Systems Biology Program, Centro Nacional de Biotecnología CSIC, Cantoblanco-Madrid, 28049, Spain
| | | | | | | |
Collapse
|
1092
|
Pilalis E, Chatziioannou A, Thomasset B, Kolisis F. An in silico compartmentalized metabolic model of Brassica napus enables the systemic study of regulatory aspects of plant central metabolism. Biotechnol Bioeng 2011; 108:1673-82. [PMID: 21337341 DOI: 10.1002/bit.23107] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2010] [Revised: 01/31/2011] [Accepted: 02/08/2011] [Indexed: 12/22/2022]
Abstract
Biochemical network reconstructions represent valuable tools for the computational metabolic modeling of organisms that present a great biotechnological interest. An in silico multi-compartmental model of the central metabolism of the plant Brassica napus (Rapeseed) was constructed, aiming to investigate the metabolic properties of the Brassicaceae family. This family comprises many plants with major importance for the energy and nutrition sector, including the model plant Arabidopsis thaliana. The model utilized as objective function to be subsequently optimized, the biomass production of rapeseed developing embryos, which are characterized by a very high, oil content, up to 60% of biomass weight. In order to study global network properties of seed metabolism, various methods were employed, like Flux Balance Analysis, Principal Component Analysis of the flux space and reaction deletion studies, which simulate the effect of gene knock-out experiments. The model successfully simulated seed growth during the stage of oil accumulation and provided insight, regarding certain aspects of network plasticity, with the emphasis given in lipid biosynthesis regulation.
Collapse
Affiliation(s)
- Eleftherios Pilalis
- Institute of Biological Research and Biotechnology, National Hellenic Research Foundation, 48 Vas. Constantinou Ave, GR-11635, Athens, Greece
| | | | | | | |
Collapse
|
1093
|
Karlsson FH, Nookaew I, Petranovic D, Nielsen J. Prospects for systems biology and modeling of the gut microbiome. Trends Biotechnol 2011; 29:251-8. [PMID: 21392838 DOI: 10.1016/j.tibtech.2011.01.009] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Revised: 01/25/2011] [Accepted: 01/26/2011] [Indexed: 02/07/2023]
Abstract
Abundant microorganisms that inhabit the human intestine are implicated in health and disease. The gut microbiome has been studied with metagenomic tools, and over 3 million genes have been discovered, constituting a 'parts list' of this ecosystem; further understanding requires studies of the interacting parts. Mouse models have provided a glimpse into the microbiota and host interactions at metabolic and immunologic levels; however, to provide more insight, there is a need to generate mathematical models that can reveal genotype-phenotype relationships and provide scaffolds for integrated analyses. To this end, we propose the use of genome-scale metabolic models that have successfully been used in studying interactions between human hosts and microbes, as well as microbes in isolation and in communities.
Collapse
Affiliation(s)
- Fredrik H Karlsson
- Department of Chemical and Biological Engineering, Chalmers University of Technology, Kemivägen 10, SE-412 96 Göteborg, Sweden
| | | | | | | |
Collapse
|
1094
|
Hung SS, Parkinson J. Post-genomics resources and tools for studying apicomplexan metabolism. Trends Parasitol 2011; 27:131-40. [DOI: 10.1016/j.pt.2010.11.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2010] [Revised: 11/03/2010] [Accepted: 11/10/2010] [Indexed: 11/26/2022]
|
1095
|
Satish Kumar V, Ferry JG, Maranas CD. Metabolic reconstruction of the archaeon methanogen Methanosarcina Acetivorans. BMC SYSTEMS BIOLOGY 2011; 5:28. [PMID: 21324125 PMCID: PMC3048526 DOI: 10.1186/1752-0509-5-28] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2010] [Accepted: 02/15/2011] [Indexed: 11/10/2022]
Abstract
BACKGROUND Methanogens are ancient organisms that are key players in the carbon cycle accounting for about one billion tones of biological methane produced annually. Methanosarcina acetivorans, with a genome size of ~5.7 mb, is the largest sequenced archaeon methanogen and unique amongst the methanogens in its biochemical characteristics. By following a systematic workflow we reconstruct a genome-scale metabolic model for M. acetivorans. This process relies on previously developed computational tools developed in our group to correct growth prediction inconsistencies with in vivo data sets and rectify topological inconsistencies in the model. RESULTS The generated model iVS941 accounts for 941 genes, 705 reactions and 708 metabolites. The model achieves 93.3% prediction agreement with in vivo growth data across different substrates and multiple gene deletions. The model also correctly recapitulates metabolic pathway usage patterns of M. acetivorans such as the indispensability of flux through methanogenesis for growth on acetate and methanol and the unique biochemical characteristics under growth on carbon monoxide. CONCLUSIONS Based on the size of the genome-scale metabolic reconstruction and extent of validated predictions this model represents the most comprehensive up-to-date effort to catalogue methanogenic metabolism. The reconstructed model is available in spreadsheet and SBML formats to enable dissemination.
Collapse
Affiliation(s)
- Vinay Satish Kumar
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
1096
|
Schellenberger J, Lewis NE, Palsson BØ. Elimination of thermodynamically infeasible loops in steady-state metabolic models. Biophys J 2011; 100:544-553. [PMID: 21281568 PMCID: PMC3030201 DOI: 10.1016/j.bpj.2010.12.3707] [Citation(s) in RCA: 135] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2010] [Revised: 11/16/2010] [Accepted: 12/10/2010] [Indexed: 01/03/2023] Open
Abstract
The constraint-based reconstruction and analysis (COBRA) framework has been widely used to study steady-state flux solutions in genome-scale metabolic networks. One shortcoming of current COBRA methods is the possible violation of the loop law in the computed steady-state flux solutions. The loop law is analogous to Kirchhoff's second law for electric circuits, and states that at steady state there can be no net flux around a closed network cycle. Although the consequences of the loop law have been known for years, it has been computationally difficult to work with. Therefore, the resulting loop-law constraints have been overlooked. Here, we present a general mixed integer programming approach called loopless COBRA (ll-COBRA), which can be used to eliminate all steady-state flux solutions that are incompatible with the loop law. We apply this approach to improve flux predictions on three common COBRA methods: flux balance analysis, flux variability analysis, and Monte Carlo sampling of the flux space. Moreover, we demonstrate that the imposition of loop-law constraints with ll-COBRA improves the consistency of simulation results with experimental data. This method provides an additional constraint for many COBRA methods, enabling the acquisition of more realistic simulation results.
Collapse
Affiliation(s)
- Jan Schellenberger
- Bioinformatics and Systems Biology Program, University of California, San Diego, California
| | - Nathan E Lewis
- Bioengineering Department, University of California, San Diego, California
| | - Bernhard Ø Palsson
- Bioengineering Department, University of California, San Diego, California.
| |
Collapse
|
1097
|
Munro SA, Zinder SH, Walker LP. ORIGINAL RESEARCH: Comparative constraint-based model development for thermophilic hydrogen production. Ind Biotechnol (New Rochelle N Y) 2011. [DOI: 10.1089/ind.2011.7.063] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Affiliation(s)
- Sarah A. Munro
- Department of Biological and Environmental Engineering, Cornell University, Ithaca, New York USA 14853
| | - Stephen H. Zinder
- Department of Microbiology, Cornell University, Ithaca, New York USA 14853
| | - Larry P. Walker
- Department of Biological and Environmental Engineering, Cornell University, Ithaca, New York USA 14853
| |
Collapse
|
1098
|
Ates Ö, Oner ET, Arga KY. Genome-scale reconstruction of metabolic network for a halophilic extremophile, Chromohalobacter salexigens DSM 3043. BMC SYSTEMS BIOLOGY 2011; 5:12. [PMID: 21251315 PMCID: PMC3034673 DOI: 10.1186/1752-0509-5-12] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Accepted: 01/21/2011] [Indexed: 12/16/2022]
Abstract
BACKGROUND Chromohalobacter salexigens (formerly Halomonas elongata DSM 3043) is a halophilic extremophile with a very broad salinity range and is used as a model organism to elucidate prokaryotic osmoadaptation due to its strong euryhaline phenotype. RESULTS C. salexigens DSM 3043's metabolism was reconstructed based on genomic, biochemical and physiological information via a non-automated but iterative process. This manually-curated reconstruction accounts for 584 genes, 1386 reactions, and 1411 metabolites. By using flux balance analysis, the model was extensively validated against literature data on the C. salexigens phenotypic features, the transport and use of different substrates for growth as well as against experimental observations on the uptake and accumulation of industrially important organic osmolytes, ectoine, betaine, and its precursor choline, which play important roles in the adaptive response to osmotic stress. CONCLUSIONS This work presents the first comprehensive genome-scale metabolic model of a halophilic bacterium. Being a useful guide for identification and filling of knowledge gaps, the reconstructed metabolic network iOA584 will accelerate the research on halophilic bacteria towards application of systems biology approaches and design of metabolic engineering strategies.
Collapse
Affiliation(s)
- Özlem Ates
- Department of Bioengineering, Marmara University, 34722, Istanbul, Turkey
| | - Ebru Toksoy Oner
- Department of Bioengineering, Marmara University, 34722, Istanbul, Turkey
| | - Kazim Y Arga
- Department of Bioengineering, Marmara University, 34722, Istanbul, Turkey
| |
Collapse
|
1099
|
Thiele I, Hyduke DR, Steeb B, Fankam G, Allen DK, Bazzani S, Charusanti P, Chen FC, Fleming RMT, Hsiung CA, De Keersmaecker SCJ, Liao YC, Marchal K, Mo ML, Özdemir E, Raghunathan A, Reed JL, Shin SI, Sigurbjörnsdóttir S, Steinmann J, Sudarsan S, Swainston N, Thijs IM, Zengler K, Palsson BO, Adkins JN, Bumann D. A community effort towards a knowledge-base and mathematical model of the human pathogen Salmonella Typhimurium LT2. BMC SYSTEMS BIOLOGY 2011; 5:8. [PMID: 21244678 PMCID: PMC3032673 DOI: 10.1186/1752-0509-5-8] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/26/2010] [Accepted: 01/18/2011] [Indexed: 01/05/2023]
Abstract
Background Metabolic reconstructions (MRs) are common denominators in systems biology and represent biochemical, genetic, and genomic (BiGG) knowledge-bases for target organisms by capturing currently available information in a consistent, structured manner. Salmonella enterica subspecies I serovar Typhimurium is a human pathogen, causes various diseases and its increasing antibiotic resistance poses a public health problem. Results Here, we describe a community-driven effort, in which more than 20 experts in S. Typhimurium biology and systems biology collaborated to reconcile and expand the S. Typhimurium BiGG knowledge-base. The consensus MR was obtained starting from two independently developed MRs for S. Typhimurium. Key results of this reconstruction jamboree include i) development and implementation of a community-based workflow for MR annotation and reconciliation; ii) incorporation of thermodynamic information; and iii) use of the consensus MR to identify potential multi-target drug therapy approaches. Conclusion Taken together, with the growing number of parallel MRs a structured, community-driven approach will be necessary to maximize quality while increasing adoption of MRs in experimental design and interpretation.
Collapse
Affiliation(s)
- Ines Thiele
- Center for Systems Biology, University of Iceland, Reykjavik, Iceland
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
1100
|
|