51
|
Zhuang K, Bakshi BR, Herrgård MJ. Multi-scale modeling for sustainable chemical production. Biotechnol J 2013; 8:973-84. [PMID: 23520143 DOI: 10.1002/biot.201200272] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2012] [Revised: 01/18/2013] [Accepted: 02/11/2013] [Indexed: 11/10/2022]
Abstract
With recent advances in metabolic engineering, it is now technically possible to produce a wide portfolio of existing petrochemical products from biomass feedstock. In recent years, a number of modeling approaches have been developed to support the engineering and decision-making processes associated with the development and implementation of a sustainable biochemical industry. The temporal and spatial scales of modeling approaches for sustainable chemical production vary greatly, ranging from metabolic models that aid the design of fermentative microbial strains to material and monetary flow models that explore the ecological impacts of all economic activities. Research efforts that attempt to connect the models at different scales have been limited. Here, we review a number of existing modeling approaches and their applications at the scales of metabolism, bioreactor, overall process, chemical industry, economy, and ecosystem. In addition, we propose a multi-scale approach for integrating the existing models into a cohesive framework. The major benefit of this proposed framework is that the design and decision-making at each scale can be informed, guided, and constrained by simulations and predictions at every other scale. In addition, the development of this multi-scale framework would promote cohesive collaborations across multiple traditionally disconnected modeling disciplines to achieve sustainable chemical production.
Collapse
|
52
|
Mo ML, Palsson BO, Herrgård MJ. Connecting extracellular metabolomic measurements to intracellular flux states in yeast. BMC SYSTEMS BIOLOGY 2009; 3:37. [PMID: 19321003 PMCID: PMC2679711 DOI: 10.1186/1752-0509-3-37] [Citation(s) in RCA: 322] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2008] [Accepted: 03/25/2009] [Indexed: 11/17/2022]
Abstract
Background Metabolomics has emerged as a powerful tool in the quantitative identification of physiological and disease-induced biological states. Extracellular metabolome or metabolic profiling data, in particular, can provide an insightful view of intracellular physiological states in a noninvasive manner. Results We used an updated genome-scale metabolic network model of Saccharomyces cerevisiae, iMM904, to investigate how changes in the extracellular metabolome can be used to study systemic changes in intracellular metabolic states. The iMM904 metabolic network was reconstructed based on an existing genome-scale network, iND750, and includes 904 genes and 1,412 reactions. The network model was first validated by comparing 2,888 in silico single-gene deletion strain growth phenotype predictions to published experimental data. Extracellular metabolome data measured in response to environmental and genetic perturbations of ammonium assimilation pathways was then integrated with the iMM904 network in the form of relative overflow secretion constraints and a flux sampling approach was used to characterize candidate flux distributions allowed by these constraints. Predicted intracellular flux changes were consistent with published measurements on intracellular metabolite levels and fluxes. Patterns of predicted intracellular flux changes could also be used to correctly identify the regions of the metabolic network that were perturbed. Conclusion Our results indicate that integrating quantitative extracellular metabolomic profiles in a constraint-based framework enables inferring changes in intracellular metabolic flux states. Similar methods could potentially be applied towards analyzing biofluid metabolome variations related to human physiological and disease states.
Collapse
|
53
|
Feist AM, Herrgård MJ, Thiele I, Reed JL, Palsson BØ. Reconstruction of biochemical networks in microorganisms. Nat Rev Microbiol 2009; 7:129-43. [PMID: 19116616 PMCID: PMC3119670 DOI: 10.1038/nrmicro1949] [Citation(s) in RCA: 578] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Systems analysis of metabolic and growth functions in microbial organisms is rapidly developing and maturing. Such studies are enabled by reconstruction, at the genomic scale, of the biochemical reaction networks that underlie cellular processes. The network reconstruction process is organism specific and is based on an annotated genome sequence, high-throughput network-wide data sets and bibliomic data on the detailed properties of individual network components. Here we describe the process that is currently used to achieve comprehensive network reconstructions and discuss how these reconstructions are curated and validated. This review should aid the growing number of researchers who are carrying out reconstructions for particular target organisms.
Collapse
|
54
|
Shlomi T, Cabili MN, Herrgård MJ, Palsson BØ, Ruppin E. Network-based prediction of human tissue-specific metabolism. Nat Biotechnol 2008; 26:1003-10. [PMID: 18711341 DOI: 10.1038/nbt.1487] [Citation(s) in RCA: 446] [Impact Index Per Article: 27.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Direct in vivo investigation of mammalian metabolism is complicated by the distinct metabolic functions of different tissues. We present a computational method that successfully describes the tissue specificity of human metabolism on a large scale. By integrating tissue-specific gene- and protein-expression data with an existing comprehensive reconstruction of the global human metabolic network, we predict tissue-specific metabolic activity in ten human tissues. This reveals a central role for post-transcriptional regulation in shaping tissue-specific metabolic activity profiles. The predicted tissue specificity of genes responsible for metabolic diseases and tissue-specific differences in metabolite exchange with biofluids extend markedly beyond tissue-specific differences manifest in enzyme-expression data, and are validated by large-scale mining of tissue-specificity data. Our results establish a computational basis for the genome-wide study of normal and abnormal human metabolism in a tissue-specific manner.
Collapse
|
55
|
Cho BK, Charusanti P, Herrgård MJ, Palsson BO. Microbial regulatory and metabolic networks. Curr Opin Biotechnol 2007; 18:360-4. [PMID: 17719767 DOI: 10.1016/j.copbio.2007.07.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2007] [Accepted: 07/12/2007] [Indexed: 11/18/2022]
Abstract
Reconstruction of transcriptional regulatory and metabolic networks is the foundation of large-scale microbial systems and synthetic biology. An enormous amount of information including the annotated genomic sequences and the genomic locations of DNA-binding regulatory proteins can be used to define metabolic and regulatory networks in cells. In particular, advances in experimental methods to map regulatory networks in microbial cells have allowed reliable data-driven reconstruction of these networks. Recent work on metabolic engineering and experimental evolution of microbes highlights the key role of global regulatory networks in controlling specific metabolic processes and the need to consider the integrated function of multiple types of networks for both scientific and engineering purposes.
Collapse
|
56
|
Herrgård MJ, Fong SS, Palsson BØ. Identification of genome-scale metabolic network models using experimentally measured flux profiles. PLoS Comput Biol 2006; 2:e72. [PMID: 16839195 PMCID: PMC1487183 DOI: 10.1371/journal.pcbi.0020072] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2005] [Accepted: 05/10/2006] [Indexed: 11/20/2022] Open
Abstract
Genome-scale metabolic network models can be reconstructed for well-characterized organisms using genomic annotation and literature information. However, there are many instances in which model predictions of metabolic fluxes are not entirely consistent with experimental data, indicating that the reactions in the model do not match the active reactions in the in vivo system. We introduce a method for determining the active reactions in a genome-scale metabolic network based on a limited number of experimentally measured fluxes. This method, called optimal metabolic network identification (OMNI), allows efficient identification of the set of reactions that results in the best agreement between in silico predicted and experimentally measured flux distributions. We applied the method to intracellular flux data for evolved Escherichia coli mutant strains with lower than predicted growth rates in order to identify reactions that act as flux bottlenecks in these strains. The expression of the genes corresponding to these bottleneck reactions was often found to be downregulated in the evolved strains relative to the wild-type strain. We also demonstrate the ability of the OMNI method to diagnose problems in E. coli strains engineered for metabolite overproduction that have not reached their predicted production potential. The OMNI method applied to flux data for evolved strains can be used to provide insights into mechanisms that limit the ability of microbial strains to evolve towards their predicted optimal growth phenotypes. When applied to industrial production strains, the OMNI method can also be used to suggest metabolic engineering strategies to improve byproduct secretion. In addition to these applications, the method should prove to be useful in general for reconstructing metabolic networks of ill-characterized microbial organisms based on limited amounts of experimental data. One of the major uses of in silico models in biology is to identify discrepancies between model predictions and experimental data and use these discrepancies to drive discovery of novel biological mechanisms. However, models only allow for identification of the discrepancies; they do not necessarily provide any assistance in discovering what are the missing or incorrect functionalities in the model that cause these discrepancies. Herrgård et al. describe a new in silico method, optimal metabolic network identification, or OMNI, that performs this discovery process in an efficient and systematic manner for genome-scale metabolic networks. Given a preliminary metabolic network model and experimentally determined metabolic flux data, OMNI finds the changes that need to be made to the model so that its predictions match the experimental data as well as possible. Herrgård et al. apply the method to identify metabolic bottlenecks in experimentally evolved Escherichia coli strains and to diagnose problems in strains designed through metabolic engineering strategies to overproduce specific desirable byproducts. The OMNI method can also be adapted to number of other settings, including identification of novel biochemical pathways in ill-characterized organisms based on limited amounts of experimental data.
Collapse
|
57
|
Herrgård MJ, Lee BS, Portnoy V, Palsson BØ. Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. Genome Res 2006. [PMID: 16606697 DOI: 10.1101/gr.4083206.predict] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
We describe the use of model-driven analysis of multiple data types relevant to transcriptional regulation of metabolism to discover novel regulatory mechanisms in Saccharomyces cerevisiae. We have reconstructed the nutrient-controlled transcriptional regulatory network controlling metabolism in S. cerevisiae consisting of 55 transcription factors regulating 750 metabolic genes, based on information in the primary literature. This reconstructed regulatory network coupled with an existing genome-scale metabolic network model allows in silico prediction of growth phenotypes of regulatory gene deletions as well as gene expression profiles. We compared model predictions of gene expression changes in response to genetic and environmental perturbations to experimental data to identify potential novel targets for transcription factors. We then identified regulatory cascades connecting transcription factors to the potential targets through a systematic model expansion strategy using published genome-wide chromatin immunoprecipitation and binding-site-motif data sets. Finally, we show the ability of an integrated metabolic and regulatory network model to predict growth phenotypes of transcription factor knockout strains. These studies illustrate the potential of model-driven data integration to systematically discover novel components and interactions in regulatory and metabolic networks in eukaryotic cells.
Collapse
|
58
|
Herrgård MJ, Lee BS, Portnoy V, Palsson BØ. Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae. Genome Res 2006; 16:627-35. [PMID: 16606697 PMCID: PMC1457053 DOI: 10.1101/gr.4083206] [Citation(s) in RCA: 127] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
We describe the use of model-driven analysis of multiple data types relevant to transcriptional regulation of metabolism to discover novel regulatory mechanisms in Saccharomyces cerevisiae. We have reconstructed the nutrient-controlled transcriptional regulatory network controlling metabolism in S. cerevisiae consisting of 55 transcription factors regulating 750 metabolic genes, based on information in the primary literature. This reconstructed regulatory network coupled with an existing genome-scale metabolic network model allows in silico prediction of growth phenotypes of regulatory gene deletions as well as gene expression profiles. We compared model predictions of gene expression changes in response to genetic and environmental perturbations to experimental data to identify potential novel targets for transcription factors. We then identified regulatory cascades connecting transcription factors to the potential targets through a systematic model expansion strategy using published genome-wide chromatin immunoprecipitation and binding-site-motif data sets. Finally, we show the ability of an integrated metabolic and regulatory network model to predict growth phenotypes of transcription factor knockout strains. These studies illustrate the potential of model-driven data integration to systematically discover novel components and interactions in regulatory and metabolic networks in eukaryotic cells.
Collapse
|
59
|
Abstract
An analysis of an integrated network of over 150,000 functional and physical interactions in yeast suggests that the network can be hierarchically decomposed into themes and thematic maps. This decomposition can be used to explore the organizational principles of integrated biological networks within cells.
Collapse
|
60
|
Herrgård MJ, Covert MW, Palsson BØ. Reconstruction of microbial transcriptional regulatory networks. Curr Opin Biotechnol 2004; 15:70-7. [PMID: 15102470 DOI: 10.1016/j.copbio.2003.11.002] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Although metabolic networks can be readily reconstructed through comparative genomics, the reconstruction of regulatory networks has been hindered by the relatively low level of evolutionary conservation of their molecular components. Recent developments in experimental techniques have allowed the generation of vast amounts of data related to regulatory networks. This data together with literature-derived knowledge has opened the way for genome-scale reconstruction of transcriptional regulatory networks. Large-scale regulatory network reconstructions can be converted to in silico models that allow systematic analysis of network behavior in response to changes in environmental conditions. These models can further be combined with genome-scale metabolic models to build integrated models of cellular function including both metabolism and its regulation.
Collapse
|
61
|
Herrgård MJ, Palsson BØ. Flagellar biosynthesis in silico: building quantitative models of regulatory networks. Cell 2004; 117:689-90. [PMID: 15186769 DOI: 10.1016/j.cell.2004.05.020] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
In this issue of Cell, describe the construction of an in silico model for the regulatory network responsible for the control of flagellar biosynthesis in E. coli based on quantitative gene expression data. They show how the model can be used as a quantitative blueprint to design genetic modifications with predictable influence on the dynamics of the system. The work by provides a general approach for building detailed models of transcriptional regulatory networks.
Collapse
|
62
|
Duarte NC, Herrgård MJ, Palsson BØ. Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Res 2004; 14:1298-309. [PMID: 15197165 PMCID: PMC442145 DOI: 10.1101/gr.2250904] [Citation(s) in RCA: 435] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
A fully compartmentalized genome-scale metabolic model of Saccharomyces cerevisiae that accounts for 750 genes and their associated transcripts, proteins, and reactions has been reconstructed and validated. All of the 1149 reactions included in this in silico model are both elementally and charge balanced and have been assigned to one of eight cellular locations (extracellular space, cytosol, mitochondrion, peroxisome, nucleus, endoplasmic reticulum, Golgi apparatus, or vacuole). When in silico predictions of 4154 growth phenotypes were compared to two published large-scale gene deletion studies, an 83% agreement was found between iND750's predictions and the experimental studies. Analysis of the failure modes showed that false predictions were primarily caused by iND750's limited inclusion of cellular processes outside of metabolism. This study systematically identified inconsistencies in our knowledge of yeast metabolism that require specific further experimental investigation.
Collapse
|
63
|
Allen TE, Herrgård MJ, Liu M, Qiu Y, Glasner JD, Blattner FR, Palsson BØ. Genome-scale analysis of the uses of the Escherichia coli genome: model-driven analysis of heterogeneous data sets. J Bacteriol 2003; 185:6392-9. [PMID: 14563874 PMCID: PMC219383 DOI: 10.1128/jb.185.21.6392-6399.2003] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The recent availability of heterogeneous high-throughput data types has increased the need for scalable in silico methods with which to integrate data related to the processes of regulation, protein synthesis, and metabolism. A sequence-based framework for modeling transcription and translation in prokaryotes has been established and has been extended to study the expression state of the entire Escherichia coli genome. The resulting in silico analysis of the expression state highlighted three facets of gene expression in E. coli: (i) the metabolic resources required for genome expression and protein synthesis were found to be relatively invariant under the conditions tested; (ii) effective promoter strengths were estimated at the genome scale by using global mRNA abundance and half-life data, revealing genes subject to regulation under the experimental conditions tested; and (iii) large-scale genome location-dependent expression patterns with approximately 600-kb periodicity were detected in the E. coli genome based on the 49 expression data sets analyzed. These results support the notion that a structured model-driven analysis of expression data yields additional information that can be subjected to commonly used statistical analyses. The integration of heterogeneous genome-scale data (i.e., sequence, expression data, and mRNA half-life data) is readily achieved in the context of an in silico model.
Collapse
|
64
|
Herrgård MJ, Covert MW, Palsson BØ. Reconciling gene expression data with known genome-scale regulatory network structures. Genome Res 2003; 13:2423-34. [PMID: 14559784 PMCID: PMC403761 DOI: 10.1101/gr.1330003] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The availability of genome-scale gene expression data sets has initiated the development of methods that use this data to infer transcriptional regulatory networks. Alternatively, such regulatory network structures can be reconstructed based on annotated genome information, well-curated databases, and primary research literature. As a first step toward reconciling the two approaches, we examine the consistency between known genome-wide regulatory network structures and extensive gene expression data collections in Escherichia coli and Saccharomyces cerevisiae. By decomposing the regulatory network into a set of basic network elements, we can compute the local consistency of each instance of a particular type of network element. We find that the consistency of network elements is influenced by both structural features of the network such as the number of regulators acting on a target gene and by the functional classes of the genes involved in a particular element. Taken together, the approach presented allows us to define regulatory network subcomponents with a high degree of consistency between the network structure and gene expression data. The results suggest that targeted gene expression profiling data can be used to refine and expand particular subcomponents of known regulatory networks that are sufficiently decoupled from the rest of the network.
Collapse
|