1
|
Laurent H, Hughes MDG, Walko M, Brockwell DJ, Mahmoudi N, Youngs TGA, Headen TF, Dougan L. Visualization of Self-Assembly and Hydration of a β-Hairpin through Integrated Small and Wide-Angle Neutron Scattering. Biomacromolecules 2023; 24:4869-4879. [PMID: 37874935 PMCID: PMC10646990 DOI: 10.1021/acs.biomac.3c00583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 10/03/2023] [Indexed: 10/26/2023]
Abstract
Fundamental understanding of the structure and assembly of nanoscale building blocks is crucial for the development of novel biomaterials with defined architectures and function. However, accessing self-consistent structural information across multiple length scales is challenging. This limits opportunities to exploit atomic scale interactions to achieve emergent macroscale properties. In this work we present an integrative small- and wide-angle neutron scattering approach coupled with computational modeling to reveal the multiscale structure of hierarchically self-assembled β hairpins in aqueous solution across 4 orders of magnitude in length scale from 0.1 Å to 300 nm. Our results demonstrate the power of this self-consistent cross-length scale approach and allows us to model both the large-scale self-assembly and small-scale hairpin hydration of the model β hairpin CLN025. Using this combination of techniques, we map the hydrophobic/hydrophilic character of this model self-assembled biomolecular surface with atomic resolution. These results have important implications for the multiscale investigation of aqueous peptides and proteins, for the prediction of ligand binding and molecular associations for drug design, and for understanding the self-assembly of peptides and proteins for functional biomaterials.
Collapse
Affiliation(s)
- Harrison Laurent
- School
of Physics and Astronomy, University of
Leeds, Leeds, United Kingdom, LS2
9JT
| | - Matt D. G. Hughes
- School
of Physics and Astronomy, University of
Leeds, Leeds, United Kingdom, LS2
9JT
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom LS2
9JT
| | - Martin Walko
- School
of Chemistry, University of Leeds, Leeds, United
Kingdom, LS2 9JT
| | - David J. Brockwell
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom LS2
9JT
| | - Najet Mahmoudi
- ISIS
Neutron and Muon Source, Rutherford Appleton
Laboratory, Harwell Oxford, Didcot, United Kingdom, OX11 0QX
| | - Tristan G. A. Youngs
- ISIS
Neutron and Muon Source, Rutherford Appleton
Laboratory, Harwell Oxford, Didcot, United Kingdom, OX11 0QX
| | - Thomas F. Headen
- ISIS
Neutron and Muon Source, Rutherford Appleton
Laboratory, Harwell Oxford, Didcot, United Kingdom, OX11 0QX
| | - Lorna Dougan
- School
of Physics and Astronomy, University of
Leeds, Leeds, United Kingdom, LS2
9JT
- Astbury
Centre for Structural Molecular Biology, University of Leeds, Leeds, United Kingdom LS2
9JT
| |
Collapse
|
2
|
Suthers PF, Dinh HV, Fatma Z, Shen Y, Chan SHJ, Rabinowitz JD, Zhao H, Maranas CD. Genome-scale metabolic reconstruction of the non-model yeast Issatchenkia orientalis SD108 and its application to organic acids production. Metab Eng Commun 2020; 11:e00148. [PMID: 33134082 PMCID: PMC7586132 DOI: 10.1016/j.mec.2020.e00148] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Revised: 09/08/2020] [Accepted: 10/05/2020] [Indexed: 12/18/2022] Open
Abstract
Many platform chemicals can be produced from renewable biomass by microorganisms, with organic acids making up a large fraction. Intolerance to the resulting low pH growth conditions, however, remains a challenge for the industrial production of organic acids by microorganisms. Issatchenkia orientalis SD108 is a promising host for industrial production because it is tolerant to acidic conditions as low as pH 2.0. With the goal to systematically assess the metabolic capabilities of this non-model yeast, we developed a genome-scale metabolic model for I. orientalis SD108 spanning 850 genes, 1826 reactions, and 1702 metabolites. In order to improve the model’s quantitative predictions, organism-specific macromolecular composition and ATP maintenance requirements were determined experimentally and implemented. We examined its network topology, including essential genes and flux coupling analysis and drew comparisons with the Yeast 8.3 model for Saccharomyces cerevisiae. We explored the carbon substrate utilization and examined the organism’s production potential for the industrially-relevant succinic acid, making use of the OptKnock framework to identify gene knockouts which couple production of the targeted chemical to biomass production. The genome-scale metabolic model iIsor850 is a data-supported curated model which can inform genetic interventions for overproduction. Genome-scale metabolic model iIsor850 describes metabolism of I. orientalis SD108. Customized biomass reaction highlights differences with S. cerevisiae. Chemostat data elucidate growth-associated ATP maintenance. Substrate utilization and CRISPR/Cas9 gene knockout phenotypes validate model. Model pinpoints candidate gene deletions coupling succinic acid production to growth.
Collapse
Affiliation(s)
- Patrick F Suthers
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Hoang V Dinh
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Zia Fatma
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Yihui Shen
- Department of Chemistry, Princeton University, Princeton, NJ, USA.,Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Siu Hung Joshua Chan
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Joshua D Rabinowitz
- Department of Chemistry, Princeton University, Princeton, NJ, USA.,Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Huimin Zhao
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA.,Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champagne, Urbana, IL, USA
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
3
|
Stanway RR, Bushell E, Chiappino-Pepe A, Roques M, Sanderson T, Franke-Fayard B, Caldelari R, Golomingi M, Nyonda M, Pandey V, Schwach F, Chevalley S, Ramesar J, Metcalf T, Herd C, Burda PC, Rayner JC, Soldati-Favre D, Janse CJ, Hatzimanikatis V, Billker O, Heussler VT. Genome-Scale Identification of Essential Metabolic Processes for Targeting the Plasmodium Liver Stage. Cell 2020; 179:1112-1128.e26. [PMID: 31730853 PMCID: PMC6904910 DOI: 10.1016/j.cell.2019.10.030] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Revised: 07/23/2019] [Accepted: 10/23/2019] [Indexed: 12/11/2022]
Abstract
Plasmodium gene functions in mosquito and liver stages remain poorly characterized due to limitations in the throughput of phenotyping at these stages. To fill this gap, we followed more than 1,300 barcoded P. berghei mutants through the life cycle. We discover 461 genes required for efficient parasite transmission to mosquitoes through the liver stage and back into the bloodstream of mice. We analyze the screen in the context of genomic, transcriptomic, and metabolomic data by building a thermodynamic model of P. berghei liver-stage metabolism, which shows a major reprogramming of parasite metabolism to achieve rapid growth in the liver. We identify seven metabolic subsystems that become essential at the liver stages compared with asexual blood stages: type II fatty acid synthesis and elongation (FAE), tricarboxylic acid, amino sugar, heme, lipoate, and shikimate metabolism. Selected predictions from the model are individually validated in single mutants to provide future targets for drug development. 1,342 barcoded P. berghei knockout (KO) mutants analyzed for stage-specific phenotypes Life-stage-specific metabolic models reveal reprogramming of cellular function High agreement between blood/liver stage metabolic models and genetic screening data Essential metabolic pathways for parasite development and mechanistic origin revealed
Collapse
Affiliation(s)
- Rebecca R Stanway
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | - Ellen Bushell
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden
| | - Anush Chiappino-Pepe
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland
| | - Magali Roques
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | - Theo Sanderson
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Blandine Franke-Fayard
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Reto Caldelari
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | | | - Mary Nyonda
- Department of Microbiology & Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva 1211, Switzerland
| | - Vikash Pandey
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden
| | - Frank Schwach
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Séverine Chevalley
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Jai Ramesar
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Tom Metcalf
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Colin Herd
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Paul-Christian Burda
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland; Bernhard Nocht Institute for Tropical Medicine, Hamburg 20359, Germany
| | - Julian C Rayner
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2, 0XY, UK
| | - Dominique Soldati-Favre
- Department of Microbiology & Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva 1211, Switzerland
| | - Chris J Janse
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland
| | - Oliver Billker
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden.
| | - Volker T Heussler
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland.
| |
Collapse
|
4
|
Kim J, Tremaine M, Grass JA, Purdy HM, Landick R, Kiley PJ, Reed JL. Systems Metabolic Engineering of Escherichia coli Improves Coconversion of Lignocellulose-Derived Sugars. Biotechnol J 2019; 14:e1800441. [PMID: 31297978 DOI: 10.1002/biot.201800441] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Revised: 07/08/2019] [Indexed: 11/10/2022]
Abstract
Currently, microbial conversion of lignocellulose-derived glucose and xylose to biofuels is hindered by the fact that most microbes (including Escherichia coli [E. coli], Saccharomyces cerevisiae, and Zymomonas mobilis) preferentially consume glucose first and consume xylose slowly after glucose is depleted in lignocellulosic hydrolysates. In this study, E. coli strains are developed that simultaneously utilize glucose and xylose in lignocellulosic biomass hydrolysate using genome-scale models and adaptive laboratory evolution. E. coli strains are designed and constructed that coutilize glucose and xylose and adaptively evolve them to improve glucose and xylose utilization. Whole-genome resequencing of the evolved strains find relevant mutations in metabolic and regulatory genes and the mutations' involvement in sugar coutilization is investigated. The developed strains show significantly improved coconversion of sugars in lignocellulosic biomass hydrolysates and provide a promising platform for producing next-generation biofuels.
Collapse
Affiliation(s)
- Joonhoon Kim
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA.,Department of Chemical and Biological Engineering, University of Wisconsin-Madison, 1415 Engineering Dr, Madison, WI, 53711, USA
| | - Mary Tremaine
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA
| | - Jeffrey A Grass
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA
| | - Hugh M Purdy
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, 1415 Engineering Dr, Madison, WI, 53711, USA
| | - Robert Landick
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA.,Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, 53711, USA.,Department of Bacteriology, University of Wisconsin-Madison, Madison, WI, 53711, USA
| | - Patricia J Kiley
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA.,Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, 53711, USA
| | - Jennifer L Reed
- US Department of Energy Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, Madison, WI, 53711, USA.,Department of Chemical and Biological Engineering, University of Wisconsin-Madison, 1415 Engineering Dr, Madison, WI, 53711, USA
| |
Collapse
|
5
|
Xu N, Ye C, Liu L. Genome-scale biological models for industrial microbial systems. Appl Microbiol Biotechnol 2018; 102:3439-3451. [PMID: 29497793 DOI: 10.1007/s00253-018-8803-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2017] [Revised: 01/19/2018] [Accepted: 01/21/2018] [Indexed: 01/08/2023]
Abstract
The primary aims and challenges associated with microbial fermentation include achieving faster cell growth, higher productivity, and more robust production processes. Genome-scale biological models, predicting the formation of an interaction among genetic materials, enzymes, and metabolites, constitute a systematic and comprehensive platform to analyze and optimize the microbial growth and production of biological products. Genome-scale biological models can help optimize microbial growth-associated traits by simulating biomass formation, predicting growth rates, and identifying the requirements for cell growth. With regard to microbial product biosynthesis, genome-scale biological models can be used to design product biosynthetic pathways, accelerate production efficiency, and reduce metabolic side effects, leading to improved production performance. The present review discusses the development of microbial genome-scale biological models since their emergence and emphasizes their pertinent application in improving industrial microbial fermentation of biological products.
Collapse
Affiliation(s)
- Nan Xu
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China.,College of Bioscience and Biotechnology, Yangzhou University, Yangzhou, Jiangsu, 225009, China.,The Laboratory of Food Microbial-Manufacturing Engineering, Jiangnan University, Wuxi, 214122, China
| | - Chao Ye
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China.,Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China.,The Laboratory of Food Microbial-Manufacturing Engineering, Jiangnan University, Wuxi, 214122, China
| | - Liming Liu
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China. .,Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China. .,The Laboratory of Food Microbial-Manufacturing Engineering, Jiangnan University, Wuxi, 214122, China.
| |
Collapse
|
6
|
van der Stel AX, van de Lest CHA, Huynh S, Parker CT, van Putten JPM, Wösten MMSM. Catabolite repression in Campylobacter jejuni correlates with intracellular succinate levels. Environ Microbiol 2018; 20:1374-1388. [PMID: 29318721 DOI: 10.1111/1462-2920.14042] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 01/06/2018] [Indexed: 12/28/2022]
Abstract
Bacteria have evolved different mechanisms to catabolize carbon sources from nutrient mixtures. They first consume their preferred carbon source, before others are used. Regulatory mechanisms adapt the metabolism accordingly to maximize growth and to outcompete other organisms. The human pathogen Campylobacter jejuni is an asaccharolytic Gram-negative bacterium that catabolizes amino acids and organic acids for growth. It prefers serine and aspartate as carbon sources, however it lacks all regulators known to be involved in regulating carbon source utilization in other organisms. In which manner C. jejuni adapts its metabolism towards the presence or absence of preferred carbon sources is unknown. In this study, we show with transcriptomic analysis and enzyme assays how C. jejuni adapts its metabolism in response to its preferred carbon sources. In the presence of serine as well as lactate and pyruvate C. jejuni inhibits the utilization of other carbon sources, by repressing the expression of a number of central metabolic enzymes. The regulatory proteins RacR, Cj1000 and CsrA play a role in the regulation of these metabolic enzymes. This metabolism dependent transcriptional repression correlates with an accumulation of intracellular succinate. Hence, we propose a demand-based catabolite repression mechanism in C. jejuni, depended on intracellular succinate levels.
Collapse
Affiliation(s)
| | - Chris H A van de Lest
- Department of Biochemistry and Cell Biology, Utrecht University, Utrecht, The Netherlands
| | - Steven Huynh
- Agricultural Research Service, U.S. Department of Agriculture, Produce Safety and Microbiology Research Unit, Albany, CA, USA
| | - Craig T Parker
- Agricultural Research Service, U.S. Department of Agriculture, Produce Safety and Microbiology Research Unit, Albany, CA, USA
| | - Jos P M van Putten
- Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, The Netherlands
| | - Marc M S M Wösten
- Department of Infectious Diseases and Immunology, Utrecht University, Utrecht, The Netherlands
| |
Collapse
|
7
|
Chen X, Gao C, Guo L, Hu G, Luo Q, Liu J, Nielsen J, Chen J, Liu L. DCEO Biotechnology: Tools To Design, Construct, Evaluate, and Optimize the Metabolic Pathway for Biosynthesis of Chemicals. Chem Rev 2017; 118:4-72. [DOI: 10.1021/acs.chemrev.6b00804] [Citation(s) in RCA: 109] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Xiulai Chen
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Cong Gao
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Liang Guo
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Guipeng Hu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Qiuling Luo
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jia Liu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jens Nielsen
- Department
of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg SE-412 96, Sweden
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK2800 Lyngby, Denmark
| | - Jian Chen
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Liming Liu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Department
of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg SE-412 96, Sweden
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
8
|
Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016; 44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open
Abstract
Bacteria and Archaea display a variety of phenotypic traits and can adapt to diverse ecological niches. However, systematic annotation of prokaryotic phenotypes is lacking. We have therefore developed ProTraits, a resource containing ∼545 000 novel phenotype inferences, spanning 424 traits assigned to 3046 bacterial and archaeal species. These annotations were assigned by a computational pipeline that associates microbes with phenotypes by text-mining the scientific literature and the broader World Wide Web, while also being able to define novel concepts from unstructured text. Moreover, the ProTraits pipeline assigns phenotypes by drawing extensively on comparative genomics, capturing patterns in gene repertoires, codon usage biases, proteome composition and co-occurrence in metagenomes. Notably, we find that gene synteny is highly predictive of many phenotypes, and highlight examples of gene neighborhoods associated with spore-forming ability. A global analysis of trait interrelatedness outlined clusters in the microbial phenotype network, suggesting common genetic underpinnings. Our extended set of phenotype annotations allows detection of 57 088 high confidence gene-trait links, which recover many known associations involving sporulation, flagella, catalase activity, aerobicity, photosynthesis and other traits. Over 99% of the commonly occurring gene families are involved in genetic interactions conditional on at least one phenotype, suggesting that epistasis has a major role in shaping microbial gene content.
Collapse
Affiliation(s)
- Maria Brbić
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Matija Piškorec
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Vedrana Vidulin
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Anita Kriško
- Mediterranean Institute of Life Sciences, 21000 Split, Croatia
| | - Tomislav Šmuc
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Fran Supek
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia .,EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain.,Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain
| |
Collapse
|
9
|
Watson E, Yilmaz LS, Walhout AJM. Understanding Metabolic Regulation at a Systems Level: Metabolite Sensing, Mathematical Predictions, and Model Organisms. Annu Rev Genet 2016; 49:553-75. [PMID: 26631516 DOI: 10.1146/annurev-genet-112414-055257] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Metabolic networks are extensively regulated to facilitate tissue-specific metabolic programs and robustly maintain homeostasis in response to dietary changes. Homeostatic metabolic regulation is achieved through metabolite sensing coupled to feedback regulation of metabolic enzyme activity or expression. With a wealth of transcriptomic, proteomic, and metabolomic data available for different cell types across various conditions, we are challenged with understanding global metabolic network regulation and the resulting metabolic outputs. Stoichiometric metabolic network modeling integrated with "omics" data has addressed this challenge by generating nonintuitive, testable hypotheses about metabolic flux rewiring. Model organism studies have also yielded novel insight into metabolic networks. This review covers three topics: the feedback loops inherent in metabolic regulatory networks, metabolic network modeling, and interspecies studies utilizing Caenorhabditis elegans and various bacterial diets that have revealed novel metabolic paradigms.
Collapse
Affiliation(s)
- Emma Watson
- Program in Systems Biology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605; , ,
| | - L Safak Yilmaz
- Program in Systems Biology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605; , ,
| | - Albertha J M Walhout
- Program in Systems Biology, Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts 01605; , ,
| |
Collapse
|
10
|
Tervo CJ, Reed JL. MapMaker and PathTracer for tracking carbon in genome-scale metabolic models. Biotechnol J 2016; 11:648-61. [PMID: 26771089 DOI: 10.1002/biot.201500267] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2015] [Revised: 11/11/2015] [Accepted: 01/07/2016] [Indexed: 11/09/2022]
Abstract
Constraint-based reconstruction and analysis (COBRA) modeling results can be difficult to interpret given the large numbers of reactions in genome-scale models. While paths in metabolic networks can be found, existing methods are not easily combined with constraint-based approaches. To address this limitation, two tools (MapMaker and PathTracer) were developed to find paths (including cycles) between metabolites, where each step transfers carbon from reactant to product. MapMaker predicts carbon transfer maps (CTMs) between metabolites using only information on molecular formulae and reaction stoichiometry, effectively determining which reactants and products share carbon atoms. MapMaker correctly assigned CTMs for over 97% of the 2,251 reactions in an Escherichia coli metabolic model (iJO1366). Using CTMs as inputs, PathTracer finds paths between two metabolites. PathTracer was applied to iJO1366 to investigate the importance of using CTMs and COBRA constraints when enumerating paths, to find active and high flux paths in flux balance analysis (FBA) solutions, to identify paths for putrescine utilization, and to elucidate a potential CO2 fixation pathway in E. coli. These results illustrate how MapMaker and PathTracer can be used in combination with constraint-based models to identify feasible, active, and high flux paths between metabolites.
Collapse
Affiliation(s)
- Christopher J Tervo
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Jennifer L Reed
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, USA.
| |
Collapse
|
11
|
PSAMM: A Portable System for the Analysis of Metabolic Models. PLoS Comput Biol 2016; 12:e1004732. [PMID: 26828591 PMCID: PMC4734835 DOI: 10.1371/journal.pcbi.1004732] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2015] [Accepted: 01/05/2016] [Indexed: 11/19/2022] Open
Abstract
The genome-scale models of metabolic networks have been broadly applied in phenotype prediction, evolutionary reconstruction, community functional analysis, and metabolic engineering. Despite the development of tools that support individual steps along the modeling procedure, it is still difficult to associate mathematical simulation results with the annotation and biological interpretation of metabolic models. In order to solve this problem, here we developed a Portable System for the Analysis of Metabolic Models (PSAMM), a new open-source software package that supports the integration of heterogeneous metadata in model annotations and provides a user-friendly interface for the analysis of metabolic models. PSAMM is independent of paid software environments like MATLAB, and all its dependencies are freely available for academic users. Compared to existing tools, PSAMM significantly reduced the running time of constraint-based analysis and enabled flexible settings of simulation parameters using simple one-line commands. The integration of heterogeneous, model-specific annotation information in PSAMM is achieved with a novel format of YAML-based model representation, which has several advantages, such as providing a modular organization of model components and simulation settings, enabling model version tracking, and permitting the integration of multiple simulation problems. PSAMM also includes a number of quality checking procedures to examine stoichiometric balance and to identify blocked reactions. Applying PSAMM to 57 models collected from current literature, we demonstrated how the software can be used for managing and simulating metabolic models. We identified a number of common inconsistencies in existing models and constructed an updated model repository to document the resolution of these inconsistencies. The broad application of genome-scale metabolic modeling has made it a useful technique for tackling fundamental questions in biological research and engineering. Today over 100 models have been constructed for organisms that carry out a diverse array of metabolic activities spanning all three kingdoms of life. These models, however, have been curated independently following different conventions. The maintenance of model consistency has been challenging due to the lack of consensus in model representation and the absence of integrated modeling software for associating mathematical simulations with the annotation and biological interpretation of metabolic models. To solve this problem, we developed a new software package, PSAMM, and a new model format that incorporates heterogeneous, model-specific annotation information into modular representations of model definitions and simulation settings. PSAMM provides significant advances in standardizing the workflow of model annotation and consistency checking. Compared to existing tools, PSAMM supports more flexible configurations and is more efficient in running constraint-based simulations. All functions of PSAMM are freely available for academic users and can be downloaded from a public Git repository (https://zhanglab.github.io/psamm/) under the GNU General Public License.
Collapse
|
12
|
O'Brien EJ, Monk JM, Palsson BO. Using Genome-scale Models to Predict Biological Capabilities. Cell 2015; 161:971-987. [PMID: 26000478 DOI: 10.1016/j.cell.2015.05.019] [Citation(s) in RCA: 449] [Impact Index Per Article: 44.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Indexed: 11/29/2022]
Abstract
Constraint-based reconstruction and analysis (COBRA) methods at the genome scale have been under development since the first whole-genome sequences appeared in the mid-1990s. A few years ago, this approach began to demonstrate the ability to predict a range of cellular functions, including cellular growth capabilities on various substrates and the effect of gene knockouts at the genome scale. Thus, much interest has developed in understanding and applying these methods to areas such as metabolic engineering, antibiotic design, and organismal and enzyme evolution. This Primer will get you started.
Collapse
Affiliation(s)
- Edward J O'Brien
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Jonathan M Monk
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Department of NanoEngineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby 2800, Denmark.
| |
Collapse
|
13
|
Imam S, Schäuble S, Brooks AN, Baliga NS, Price ND. Data-driven integration of genome-scale regulatory and metabolic network models. Front Microbiol 2015; 6:409. [PMID: 25999934 PMCID: PMC4419725 DOI: 10.3389/fmicb.2015.00409] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Accepted: 04/20/2015] [Indexed: 12/21/2022] Open
Abstract
Microbes are diverse and extremely versatile organisms that play vital roles in all ecological niches. Understanding and harnessing microbial systems will be key to the sustainability of our planet. One approach to improving our knowledge of microbial processes is through data-driven and mechanism-informed computational modeling. Individual models of biological networks (such as metabolism, transcription, and signaling) have played pivotal roles in driving microbial research through the years. These networks, however, are highly interconnected and function in concert-a fact that has led to the development of a variety of approaches aimed at simulating the integrated functions of two or more network types. Though the task of integrating these different models is fraught with new challenges, the large amounts of high-throughput data sets being generated, and algorithms being developed, means that the time is at hand for concerted efforts to build integrated regulatory-metabolic networks in a data-driven fashion. In this perspective, we review current approaches for constructing integrated regulatory-metabolic models and outline new strategies for future development of these network models for any microbial system.
Collapse
Affiliation(s)
- Saheed Imam
- Institute for Systems Biology Seattle, WA, USA
| | - Sascha Schäuble
- Institute for Systems Biology Seattle, WA, USA ; Jena University Language and Information Engineering Lab, Friedrich-Schiller-University Jena Jena, Germany
| | | | - Nitin S Baliga
- Institute for Systems Biology Seattle, WA, USA ; Departments of Biology and Microbiology, University of Washington Seattle, WA, USA ; Molecular and Cellular Biology Program, University of Washington Seattle, WA, USA ; Lawrence Berkeley National Lab Berkeley, CA, USA
| | | |
Collapse
|
14
|
Qi Q, Li J, Cheng J. Reconstruction of metabolic pathways by combining probabilistic graphical model-based and knowledge-based methods. BMC Proc 2014; 8:S5. [PMID: 25374614 PMCID: PMC4202177 DOI: 10.1186/1753-6561-8-s6-s5] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Automatic reconstruction of metabolic pathways for an organism from genomics and transcriptomics data has been a challenging and important problem in bioinformatics. Traditionally, known reference pathways can be mapped into an organism-specific ones based on its genome annotation and protein homology. However, this simple knowledge-based mapping method might produce incomplete pathways and generally cannot predict unknown new relations and reactions. In contrast, ab initio metabolic network construction methods can predict novel reactions and interactions, but its accuracy tends to be low leading to a lot of false positives. Here we combine existing pathway knowledge and a new ab initio Bayesian probabilistic graphical model together in a novel fashion to improve automatic reconstruction of metabolic networks. Specifically, we built a knowledge database containing known, individual gene / protein interactions and metabolic reactions extracted from existing reference pathways. Known reactions and interactions were then used as constraints for Bayesian network learning methods to predict metabolic pathways. Using individual reactions and interactions extracted from different pathways of many organisms to guide pathway construction is new and improves both the coverage and accuracy of metabolic pathway construction. We applied this probabilistic knowledge-based approach to construct the metabolic networks from yeast gene expression data and compared its results with 62 known metabolic networks in the KEGG database. The experiment showed that the method improved the coverage of metabolic network construction over the traditional reference pathway mapping method and was more accurate than pure ab initio methods.
Collapse
Affiliation(s)
- Qi Qi
- Department of Computer Science, University of Missouri, Columbia, MO 65201, USA
| | - Jilong Li
- Department of Computer Science, University of Missouri, Columbia, MO 65201, USA
| | - Jianlin Cheng
- Department of Computer Science, University of Missouri, Columbia, MO 65201, USA ; Informatics Institute, University of Missouri, Columbia, MO 65201, USA
| |
Collapse
|
15
|
Schmidt R, Waschina S, Boettger-Schmidt D, Kost C, Kaleta C. Computing autocatalytic sets to unravel inconsistencies in metabolic network reconstructions. ACTA ACUST UNITED AC 2014; 31:373-81. [PMID: 25286919 DOI: 10.1093/bioinformatics/btu658] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
MOTIVATION Genome-scale metabolic network reconstructions have been established as a powerful tool for the prediction of cellular phenotypes and metabolic capabilities of organisms. In recent years, the number of network reconstructions has been constantly increasing, mostly because of the availability of novel (semi-)automated procedures, which enabled the reconstruction of metabolic models based on individual genomes and their annotation. The resulting models are widely used in numerous applications. However, the accuracy and predictive power of network reconstructions are commonly limited by inherent inconsistencies and gaps. RESULTS Here we present a novel method to validate metabolic network reconstructions based on the concept of autocatalytic sets. Autocatalytic sets correspond to collections of metabolites that, besides enzymes and a growth medium, are required to produce all biomass components in a metabolic model. These autocatalytic sets are well-conserved across all domains of life, and their identification in specific genome-scale reconstructions allows us to draw conclusions about potential inconsistencies in these models. The method is capable of detecting inconsistencies, which are neglected by other gap-finding methods. We tested our method on the Model SEED, which is the largest repository for automatically generated genome-scale network reconstructions. In this way, we were able to identify a significant number of missing pathways in several of these reconstructions. Hence, the method we report represents a powerful tool to identify inconsistencies in large-scale metabolic networks. AVAILABILITY AND IMPLEMENTATION The method is available as source code on http://users.minet.uni-jena.de/∼m3kach/ASBIG/ASBIG.zip. CONTACT christoph.kaleta@uni-jena.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ralf Schmidt
- Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
| | - Silvio Waschina
- Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
| | - Daniela Boettger-Schmidt
- Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
| | - Christian Kost
- Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
| | - Christoph Kaleta
- Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark Research Group Theoretical Systems Biology, Faculty of Biology and Pharmacy, Friedrich Schiller University Jena, 07743 Jena, Department of Bioorganic Chemistry, Experimental Ecology and Evolution Research Group, Max Planck Institute for Chemical Ecology, 07745 Jena, Department of Biomolecular Chemistry, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute, 07745 Jena and Department of Computational Biology, Institute for Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense M, Denmark
| |
Collapse
|
16
|
Affiliation(s)
- Jonathan Monk
- Department of Bioengineering, University of California, San Diego, CA 92093-0412, USA
| | - Bernhard O Palsson
- Department of Bioengineering, University of California, San Diego, CA 92093-0412, USA.
| |
Collapse
|
17
|
Weaver DS, Keseler IM, Mackie A, Paulsen IT, Karp PD. A genome-scale metabolic flux model of Escherichia coli K-12 derived from the EcoCyc database. BMC SYSTEMS BIOLOGY 2014; 8:79. [PMID: 24974895 PMCID: PMC4086706 DOI: 10.1186/1752-0509-8-79] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2014] [Accepted: 06/19/2014] [Indexed: 12/14/2022]
Abstract
BACKGROUND Constraint-based models of Escherichia coli metabolic flux have played a key role in computational studies of cellular metabolism at the genome scale. We sought to develop a next-generation constraint-based E. coli model that achieved improved phenotypic prediction accuracy while being frequently updated and easy to use. We also sought to compare model predictions with experimental data to highlight open questions in E. coli biology. RESULTS We present EcoCyc-18.0-GEM, a genome-scale model of the E. coli K-12 MG1655 metabolic network. The model is automatically generated from the current state of EcoCyc using the MetaFlux software, enabling the release of multiple model updates per year. EcoCyc-18.0-GEM encompasses 1445 genes, 2286 unique metabolic reactions, and 1453 unique metabolites. We demonstrate a three-part validation of the model that breaks new ground in breadth and accuracy: (i) Comparison of simulated growth in aerobic and anaerobic glucose culture with experimental results from chemostat culture and simulation results from the E. coli modeling literature. (ii) Essentiality prediction for the 1445 genes represented in the model, in which EcoCyc-18.0-GEM achieves an improved accuracy of 95.2% in predicting the growth phenotype of experimental gene knockouts. (iii) Nutrient utilization predictions under 431 different media conditions, for which the model achieves an overall accuracy of 80.7%. The model's derivation from EcoCyc enables query and visualization via the EcoCyc website, facilitating model reuse and validation by inspection. We present an extensive investigation of disagreements between EcoCyc-18.0-GEM predictions and experimental data to highlight areas of interest to E. coli modelers and experimentalists, including 70 incorrect predictions of gene essentiality on glucose, 80 incorrect predictions of gene essentiality on glycerol, and 83 incorrect predictions of nutrient utilization. CONCLUSION Significant advantages can be derived from the combination of model organism databases and flux balance modeling represented by MetaFlux. Interpretation of the EcoCyc database as a flux balance model results in a highly accurate metabolic model and provides a rigorous consistency check for information stored in the database.
Collapse
Affiliation(s)
- Daniel S Weaver
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave., 94025 Menlo Park, CA, USA
| | - Ingrid M Keseler
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave., 94025 Menlo Park, CA, USA
| | - Amanda Mackie
- Department of Chemistry and Biomolecular Science, Macquarie University, Balaclava Rd, North Ryde NSW 2109, Australia
| | - Ian T Paulsen
- Department of Chemistry and Biomolecular Science, Macquarie University, Balaclava Rd, North Ryde NSW 2109, Australia
| | - Peter D Karp
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave., 94025 Menlo Park, CA, USA
| |
Collapse
|
18
|
Liu G, Marras A, Nielsen J. The future of genome-scale modeling of yeast through integration of a transcriptional regulatory network. QUANTITATIVE BIOLOGY 2014. [DOI: 10.1007/s40484-014-0027-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
19
|
Kim J, Reed JL. Refining metabolic models and accounting for regulatory effects. Curr Opin Biotechnol 2014; 29:34-8. [PMID: 24632483 DOI: 10.1016/j.copbio.2014.02.009] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Revised: 01/23/2014] [Accepted: 02/13/2014] [Indexed: 11/28/2022]
Abstract
Advances in genome-scale metabolic modeling allow us to investigate and engineer metabolism at a systems level. Metabolic network reconstructions have been made for many organisms and computational approaches have been developed to convert these reconstructions into predictive models. However, due to incomplete knowledge these reconstructions often have missing or extraneous components and interactions, which can be identified by reconciling model predictions with experimental data. Recent studies have provided methods to further improve metabolic model predictions by incorporating transcriptional regulatory interactions and high-throughput omics data to yield context-specific metabolic models. Here we discuss recent approaches for resolving model-data discrepancies and building context-specific metabolic models. Once developed highly accurate metabolic models can be used in a variety of biotechnology applications.
Collapse
Affiliation(s)
- Joonhoon Kim
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, United States; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, United States
| | - Jennifer L Reed
- DOE Great Lakes Bioenergy Research Center, University of Wisconsin-Madison, United States; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, United States.
| |
Collapse
|
20
|
Dash S, Mueller TJ, Venkataramanan KP, Papoutsakis ET, Maranas CD. Capturing the response of Clostridium acetobutylicum to chemical stressors using a regulated genome-scale metabolic model. BIOTECHNOLOGY FOR BIOFUELS 2014; 7:144. [PMID: 25379054 PMCID: PMC4207355 DOI: 10.1186/s13068-014-0144-4] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2014] [Accepted: 09/22/2014] [Indexed: 05/20/2023]
Abstract
BACKGROUND Clostridia are anaerobic Gram-positive Firmicutes containing broad and flexible systems for substrate utilization, which have been used successfully to produce a range of industrial compounds. In particular, Clostridium acetobutylicum has been used to produce butanol on an industrial scale through acetone-butanol-ethanol (ABE) fermentation. A genome-scale metabolic (GSM) model is a powerful tool for understanding the metabolic capacities of an organism and developing metabolic engineering strategies for strain development. The integration of stress-related specific transcriptomics information with the GSM model provides opportunities for elucidating the focal points of regulation. RESULTS We describe here the construction and validation of a GSM model for C. acetobutylicum ATCC 824, iCac802. iCac802 spans 802 genes and includes 1,137 metabolites and 1,462 reactions, along with gene-protein-reaction associations. Both (13)C-MFA and gene deletion data in the ABE fermentation pathway were used to test the predicted flux ranges allowed by the model. We also describe the CoreReg method, introduced in this paper, to integrate transcriptomic data and identify core sets of reactions that, when their flux was selectively restricted, reproduced flux and biomass-formation ranges seen under all regulatory constraints. CoreReg was used in response to butanol and butyrate stress to tighten bounds for 50 reactions within the iCac802 model. These bounds affected the flux of tens of reactions in core metabolism. The model, incorporating the regulatory restrictions from CoreReg under chemical stress, exhibited an approximate 70% reduction in biomass yield for most stress conditions. CONCLUSIONS The regulation placed on the model for the two stresses using CoreReg identified differences in the respective responses, including distinct core sets and the restriction of biomass production similar to experimental observations. Given the core sets predicted by the CoreReg method, remedial actions can be taken to counteract the effect of stress on metabolism. For less well-known systems, plausible regulatory loops can be suggested around the affected metabolic reactions, and the hypotheses can be tested experimentally.
Collapse
Affiliation(s)
- Satyakam Dash
- />Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania USA
| | - Thomas J Mueller
- />Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania USA
| | - Keerthi P Venkataramanan
- />Delaware Biotechnology Institute, 15 Innovation Way, Newark, 19711 Delaware USA
- />Department of Chemical Engineering, University of Delaware, Newark, Delaware USA
| | - Eleftherios T Papoutsakis
- />Delaware Biotechnology Institute, 15 Innovation Way, Newark, 19711 Delaware USA
- />Department of Chemical Engineering, University of Delaware, Newark, Delaware USA
| | - Costas D Maranas
- />Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania USA
| |
Collapse
|
21
|
Mackie AM, Hassan KA, Paulsen IT, Tetu SG. Biolog Phenotype Microarrays for phenotypic characterization of microbial cells. Methods Mol Biol 2014; 1096:123-30. [PMID: 24515365 DOI: 10.1007/978-1-62703-712-9_10] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Biolog Phenotype MicroArrays for microorganisms provide a high-throughput method for the global analysis of microbial growth phenotypes. Using a colorimetric reaction that is indicative of respiration, these microplate assays measure the response of an individual strain or microbial community to a large and diverse range of nutrients and chemicals. Phenotype MicroArrays have been used to study gene function and to improve genome annotation in single microorganisms and for physiological profiling of bacterial communities. The microplate system can be used to obtain a comprehensive overview of metabolic capability, or it can be tailored, through the use of subsets of plates, to address specific research needs.
Collapse
Affiliation(s)
- Amanda M Mackie
- Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, NSW, Australia
| | | | | | | |
Collapse
|
22
|
Tervo CJ, Reed JL. BioMog: a computational framework for the de novo generation or modification of essential biomass components. PLoS One 2013; 8:e81322. [PMID: 24339916 PMCID: PMC3855262 DOI: 10.1371/journal.pone.0081322] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2013] [Accepted: 10/11/2013] [Indexed: 12/20/2022] Open
Abstract
The success of genome-scale metabolic modeling is contingent on a model's ability to accurately predict growth and metabolic behaviors. To date, little focus has been directed towards developing systematic methods of proposing, modifying and interrogating an organism's biomass requirements that are used in constraint-based models. To address this gap, the biomass modification and generation (BioMog) framework was created and used to generate lists of biomass components de novo, as well as to modify predefined biomass component lists, for models of Escherichia coli (iJO1366) and of Shewanella oneidensis (iSO783) from high-throughput growth phenotype and fitness datasets. BioMog's de novo biomass component lists included, either implicitly or explicitly, up to seventy percent of the components included in the predefined biomass equations, and the resulting de novo biomass equations outperformed the predefined biomass equations at qualitatively predicting mutant growth phenotypes by up to five percent. Additionally, the BioMog procedure can quantify how many experiments support or refute a particular metabolite's essentiality to a cell, and it facilitates the determination of inconsistent experiments and inaccurate reaction and/or gene to reaction associations. To further interrogate metabolite essentiality, the BioMog framework includes an experiment generation algorithm that allows for the design of experiments to test whether a metabolite is essential. Using BioMog, we correct experimental results relating to the essentiality of thyA gene in E. coli, as well as perform knockout experiments supporting the essentiality of protoheme. With these capabilities, BioMog can be a valuable resource for analyzing growth phenotyping data and component of a model developer's toolbox.
Collapse
Affiliation(s)
- Christopher J. Tervo
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Jennifer L. Reed
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- * E-mail:
| |
Collapse
|
23
|
Chandrasekaran S, Price ND. Metabolic constraint-based refinement of transcriptional regulatory networks. PLoS Comput Biol 2013; 9:e1003370. [PMID: 24348226 PMCID: PMC3857774 DOI: 10.1371/journal.pcbi.1003370] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2013] [Accepted: 10/16/2013] [Indexed: 01/01/2023] Open
Abstract
There is a strong need for computational frameworks that integrate different biological processes and data-types to unravel cellular regulation. Current efforts to reconstruct transcriptional regulatory networks (TRNs) focus primarily on proximal data such as gene co-expression and transcription factor (TF) binding. While such approaches enable rapid reconstruction of TRNs, the overwhelming combinatorics of possible networks limits identification of mechanistic regulatory interactions. Utilizing growth phenotypes and systems-level constraints to inform regulatory network reconstruction is an unmet challenge. We present our approach Gene Expression and Metabolism Integrated for Network Inference (GEMINI) that links a compendium of candidate regulatory interactions with the metabolic network to predict their systems-level effect on growth phenotypes. We then compare predictions with experimental phenotype data to select phenotype-consistent regulatory interactions. GEMINI makes use of the observation that only a small fraction of regulatory network states are compatible with a viable metabolic network, and outputs a regulatory network that is simultaneously consistent with the input genome-scale metabolic network model, gene expression data, and TF knockout phenotypes. GEMINI preferentially recalls gold-standard interactions (p-value = 10−172), significantly better than using gene expression alone. We applied GEMINI to create an integrated metabolic-regulatory network model for Saccharomyces cerevisiae involving 25,000 regulatory interactions controlling 1597 metabolic reactions. The model quantitatively predicts TF knockout phenotypes in new conditions (p-value = 10−14) and revealed potential condition-specific regulatory mechanisms. Our results suggest that a metabolic constraint-based approach can be successfully used to help reconstruct TRNs from high-throughput data, and highlights the potential of using a biochemically-detailed mechanistic framework to integrate and reconcile inconsistencies across different data-types. The algorithm and associated data are available at https://sourceforge.net/projects/gemini-data/ Cellular networks, such as metabolic and transcriptional regulatory networks (TRNs), do not operate independently but work together in unison to determine cellular phenotypes. Further, the phenotype and architecture of one network constrains the topology of other networks. Hence, it is critical to study network components and interactions in the context of the entire cell. Typically, efforts to reconstruct TRNs focus only on immediately proximal data such as gene co-expression and transcription factor (TF)-binding. Herein, we take a different strategy by linking candidate TRNs with the metabolic network to predict systems-level responses such as growth phenotypes of TF knockout strains, and compare predictions with experimental phenotype data to select amongst the candidate TRNs. Our approach goes beyond traditional data integration approaches for network inference and refinement by using a predictive network model (metabolism) to refine another network model (regulation) – thus providing an alternative avenue to this area of research. Understanding how the networks function together in a cell will pave the way for synthetic biology and has a wide-range of applications in biotechnology, drug discovery and diagnostics. Further we demonstrate how metabolic models can integrate and reconcile inconsistencies across different data-types.
Collapse
Affiliation(s)
- Sriram Chandrasekaran
- Institute for Systems Biology, Seattle, Washington, United States of America
- Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign, Illinois, United States of America
| | - Nathan D. Price
- Institute for Systems Biology, Seattle, Washington, United States of America
- Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign, Illinois, United States of America
- * E-mail:
| |
Collapse
|
24
|
Hamilton JJ, Reed JL. Software platforms to facilitate reconstructing genome-scale metabolic networks. Environ Microbiol 2013; 16:49-59. [PMID: 24148076 DOI: 10.1111/1462-2920.12312] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2013] [Accepted: 10/12/2013] [Indexed: 12/24/2022]
Abstract
System-level analyses of microbial metabolism are facilitated by genome-scale reconstructions of microbial biochemical networks. A reconstruction provides a structured representation of the biochemical transformations occurring within an organism, as well as the genes necessary to carry out these transformations, as determined by the annotated genome sequence and experimental data. Network reconstructions also serve as platforms for constraint-based computational techniques, which facilitate biological studies in a variety of applications, including evaluation of network properties, metabolic engineering and drug discovery. Bottom-up metabolic network reconstructions have been developed for dozens of organisms, but until recently, the pace of reconstruction has failed to keep up with advances in genome sequencing. To address this problem, a number of software platforms have been developed to automate parts of the reconstruction process, thereby alleviating much of the manual effort previously required. Here, we review four such platforms in the context of established guidelines for network reconstruction. While many steps of the reconstruction process have been successfully automated, some manual evaluation of the results is still required to ensure a high-quality reconstruction. Widespread adoption of these platforms by the scientific community is underway and will be further enabled by exchangeable formats across platforms.
Collapse
Affiliation(s)
- Joshua J Hamilton
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | | |
Collapse
|
25
|
Imam S, Noguera DR, Donohue TJ. Global insights into energetic and metabolic networks in Rhodobacter sphaeroides. BMC SYSTEMS BIOLOGY 2013; 7:89. [PMID: 24034347 PMCID: PMC3849096 DOI: 10.1186/1752-0509-7-89] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Accepted: 09/10/2013] [Indexed: 11/29/2022]
Abstract
Background Improving our understanding of processes at the core of cellular lifestyles can be aided by combining information from genetic analyses, high-throughput experiments and computational predictions. Results We combined data and predictions derived from phenotypic, physiological, genetic and computational analyses to dissect the metabolic and energetic networks of the facultative photosynthetic bacterium Rhodobacter sphaeroides. We focused our analysis on pathways crucial to the production and recycling of pyridine nucleotides during aerobic respiratory and anaerobic photosynthetic growth in the presence of an organic electron donor. In particular, we assessed the requirement for NADH/NADPH transhydrogenase enzyme, PntAB during respiratory and photosynthetic growth. Using high-throughput phenotype microarrays (PMs), we found that PntAB is essential for photosynthetic growth in the presence of many organic electron donors, particularly those predicted to require its activity to produce NADPH. Utilizing the genome-scale metabolic model iRsp1095, we predicted alternative routes of NADPH synthesis and used gene expression analyses to show that transcripts from a subset of the corresponding genes were conditionally increased in a ΔpntAB mutant. We then used a combination of metabolic flux predictions and mutational analysis to identify flux redistribution patterns utilized in the ΔpntAB mutant to compensate for the loss of this enzyme. Data generated from metabolic and phenotypic analyses of wild type and mutant cells were used to develop iRsp1140, an expanded genome-scale metabolic reconstruction for R. sphaeroides with improved ability to analyze and predict pathways associated with photosynthesis and other metabolic processes. Conclusions These analyses increased our understanding of key aspects of the photosynthetic lifestyle, highlighting the added importance of NADPH production under these conditions. It also led to a significant improvement in the predictive capabilities of a metabolic model for the different energetic lifestyles of a facultative organism.
Collapse
Affiliation(s)
- Saheed Imam
- Department of Bacteriology, University of Wisconsin, Madison, Suite 5166, Wisconsin Energy Institute, 1552 University Avenue, Madison, WI 53726-4084, USA.
| | | | | |
Collapse
|
26
|
Walpole J, Papin JA, Peirce SM. Multiscale computational models of complex biological systems. Annu Rev Biomed Eng 2013; 15:137-54. [PMID: 23642247 DOI: 10.1146/annurev-bioeng-071811-150104] [Citation(s) in RCA: 135] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Integration of data across spatial, temporal, and functional scales is a primary focus of biomedical engineering efforts. The advent of powerful computing platforms, coupled with quantitative data from high-throughput experimental methodologies, has allowed multiscale modeling to expand as a means to more comprehensively investigate biological phenomena in experimentally relevant ways. This review aims to highlight recently published multiscale models of biological systems, using their successes to propose the best practices for future model development. We demonstrate that coupling continuous and discrete systems best captures biological information across spatial scales by selecting modeling techniques that are suited to the task. Further, we suggest how to leverage these multiscale models to gain insight into biological systems using quantitative biomedical engineering methods to analyze data in nonintuitive ways. These topics are discussed with a focus on the future of the field, current challenges encountered, and opportunities yet to be realized.
Collapse
Affiliation(s)
- Joseph Walpole
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | | | | |
Collapse
|
27
|
Abstract
Phenotypic microarray (PM) is a standardized, high-throughput technology for profiling phenotypes of microorganisms, which allows for characterization on around 2,000 different media conditions. The data generated using PM can be incorporated into genome-scale metabolic models to improve their predictive capability. In addition, a comparison of phenotypic profiles of wild-type and gene knockout mutants can give essential information about gene functions of unknown genes. In this chapter, we present a protocol to refine preconstructed metabolic models using the PM data. Both manual refinement and algorithmic approaches for integrating the PM data into metabolic models have been discussed.
Collapse
|
28
|
McCloskey D, Palsson BØ, Feist AM. Basic and applied uses of genome-scale metabolic network reconstructions of Escherichia coli. Mol Syst Biol 2013; 9:661. [PMID: 23632383 PMCID: PMC3658273 DOI: 10.1038/msb.2013.18] [Citation(s) in RCA: 234] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2012] [Accepted: 03/11/2013] [Indexed: 02/07/2023] Open
Abstract
The genome-scale model (GEM) of metabolism in the bacterium Escherichia coli K-12 has been in development for over a decade and is now in wide use. GEM-enabled studies of E. coli have been primarily focused on six applications: (1) metabolic engineering, (2) model-driven discovery, (3) prediction of cellular phenotypes, (4) analysis of biological network properties, (5) studies of evolutionary processes, and (6) models of interspecies interactions. In this review, we provide an overview of these applications along with a critical assessment of their successes and limitations, and a perspective on likely future developments in the field. Taken together, the studies performed over the past decade have established a genome-scale mechanistic understanding of genotype-phenotype relationships in E. coli metabolism that forms the basis for similar efforts for other microbial species. Future challenges include the expansion of GEMs by integrating additional cellular processes beyond metabolism, the identification of key constraints based on emerging data types, and the development of computational methods able to handle such large-scale network models with sufficient accuracy.
Collapse
Affiliation(s)
- Douglas McCloskey
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
| | - Bernhard Ø Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Adam M Feist
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, USA
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
29
|
Systematic applications of metabolomics in metabolic engineering. Metabolites 2012; 2:1090-122. [PMID: 24957776 PMCID: PMC3901235 DOI: 10.3390/metabo2041090] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Revised: 11/29/2012] [Accepted: 12/10/2012] [Indexed: 02/05/2023] Open
Abstract
The goals of metabolic engineering are well-served by the biological information provided by metabolomics: information on how the cell is currently using its biochemical resources is perhaps one of the best ways to inform strategies to engineer a cell to produce a target compound. Using the analysis of extracellular or intracellular levels of the target compound (or a few closely related molecules) to drive metabolic engineering is quite common. However, there is surprisingly little systematic use of metabolomics datasets, which simultaneously measure hundreds of metabolites rather than just a few, for that same purpose. Here, we review the most common systematic approaches to integrating metabolite data with metabolic engineering, with emphasis on existing efforts to use whole-metabolome datasets. We then review some of the most common approaches for computational modeling of cell-wide metabolism, including constraint-based models, and discuss current computational approaches that explicitly use metabolomics data. We conclude with discussion of the broader potential of computational approaches that systematically use metabolomics data to drive metabolic engineering.
Collapse
|
30
|
Tervo CJ, Reed JL. FOCAL: an experimental design tool for systematizing metabolic discoveries and model development. Genome Biol 2012; 13:R116. [PMID: 23236964 PMCID: PMC4056367 DOI: 10.1186/gb-2012-13-12-r116] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2012] [Accepted: 12/13/2012] [Indexed: 01/05/2023] Open
Abstract
Current computational tools can generate and improve genome-scale models based on existing data; however, for many organisms, the data needed to test and refine such models are not available. To facilitate model development, we created the forced coupling algorithm, FOCAL, to identify genetic and environmental conditions such that a reaction becomes essential for an experimentally measurable phenotype. This reaction's conditional essentiality can then be tested experimentally to evaluate whether network connections occur or to create strains with desirable phenotypes. FOCAL allows network connections to be queried, which improves our understanding of metabolism and accuracy of developed models.
Collapse
|
31
|
|
32
|
Zomorrodi AR, Suthers PF, Ranganathan S, Maranas CD. Mathematical optimization applications in metabolic networks. Metab Eng 2012; 14:672-86. [PMID: 23026121 DOI: 10.1016/j.ymben.2012.09.005] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2012] [Revised: 08/31/2012] [Accepted: 09/14/2012] [Indexed: 11/30/2022]
Abstract
Genome-scale metabolic models are increasingly becoming available for a variety of microorganisms. This has spurred the development of a wide array of computational tools, and in particular, mathematical optimization approaches, to assist in fundamental metabolic network analyses and redesign efforts. This review highlights a number of optimization-based frameworks developed towards addressing challenges in the analysis and engineering of metabolic networks. In particular, three major types of studies are covered here including exploring model predictions, correction and improvement of models of metabolism, and redesign of metabolic networks for the targeted overproduction of a desired compound. Overall, the methods reviewed in this paper highlight the diversity of queries, breadth of questions and complexity of redesigns that are amenable to mathematical optimization strategies.
Collapse
Affiliation(s)
- Ali R Zomorrodi
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | | | | | | |
Collapse
|
33
|
Lewis NE, Nagarajan H, Palsson BO. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat Rev Microbiol 2012; 10:291-305. [PMID: 22367118 DOI: 10.1038/nrmicro2737] [Citation(s) in RCA: 556] [Impact Index Per Article: 42.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Reconstructed microbial metabolic networks facilitate a mechanistic description of the genotype-phenotype relationship through the deployment of constraint-based reconstruction and analysis (COBRA) methods. As reconstructed networks leverage genomic data for insight and phenotype prediction, the development of COBRA methods has accelerated following the advent of whole-genome sequencing. Here, we describe a phylogeny of COBRA methods that has rapidly evolved from the few early methods, such as flux balance analysis and elementary flux mode analysis, into a repertoire of more than 100 methods. These methods have enabled genome-scale analysis of microbial metabolism for numerous basic and applied uses, including antibiotic discovery, metabolic engineering and modelling of microbial community behaviour.
Collapse
Affiliation(s)
- Nathan E Lewis
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093-0412, USA
| | | | | |
Collapse
|
34
|
Metabolic network reconstruction: advances in in silico interpretation of analytical information. Curr Opin Biotechnol 2012; 23:77-82. [DOI: 10.1016/j.copbio.2011.10.015] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2011] [Revised: 10/31/2011] [Accepted: 10/31/2011] [Indexed: 11/22/2022]
|
35
|
Ashworth J, Wurtmann EJ, Baliga NS. Reverse engineering systems models of regulation: discovery, prediction and mechanisms. Curr Opin Biotechnol 2011; 23:598-603. [PMID: 22209016 DOI: 10.1016/j.copbio.2011.12.005] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Accepted: 12/08/2011] [Indexed: 10/14/2022]
Abstract
Biological systems can now be understood in comprehensive and quantitative detail using systems biology approaches. Putative genome-scale models can be built rapidly based upon biological inventories and strategic system-wide molecular measurements. Current models combine statistical associations, causative abstractions, and known molecular mechanisms to explain and predict quantitative and complex phenotypes. This top-down 'reverse engineering' approach generates useful organism-scale models despite noise and incompleteness in data and knowledge. Here we review and discuss the reverse engineering of biological systems using top-down data-driven approaches, in order to improve discovery, hypothesis generation, and the inference of biological properties.
Collapse
|
36
|
Kim TY, Sohn SB, Kim YB, Kim WJ, Lee SY. Recent advances in reconstruction and applications of genome-scale metabolic models. Curr Opin Biotechnol 2011; 23:617-23. [PMID: 22054827 DOI: 10.1016/j.copbio.2011.10.007] [Citation(s) in RCA: 128] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2011] [Accepted: 10/17/2011] [Indexed: 11/19/2022]
Abstract
In the last decade, reconstruction and applications of genome-scale metabolic models have greatly influenced the field of systems biology by providing a platform on which high-throughput computational analysis of metabolic networks can be performed. The last two years have seen an increase in volume of more than 33% in the number of published genome-scale metabolic models, signifying a high demand for these metabolic models in studying specific organisms. The diversity in modeling different types of cells, from photosynthetic microorganisms to human cell types, also demonstrates their growing influence in biology. Here we review the recent advances and current state of genome-scale metabolic models, the methods employed towards ensuring high quality models, their biotechnological applications, and the progress towards the automated reconstruction of genome-scale metabolic models.
Collapse
Affiliation(s)
- Tae Yong Kim
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 program), Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 305-701, Republic of Korea
| | | | | | | | | |
Collapse
|
37
|
The evolution of metabolic networks of E. coli. BMC SYSTEMS BIOLOGY 2011; 5:182. [PMID: 22044664 PMCID: PMC3229490 DOI: 10.1186/1752-0509-5-182] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2011] [Accepted: 11/01/2011] [Indexed: 11/19/2022]
Abstract
Background Despite the availability of numerous complete genome sequences from E. coli strains, published genome-scale metabolic models exist only for two commensal E. coli strains. These models have proven useful for many applications, such as engineering strains for desired product formation, and we sought to explore how constructing and evaluating additional metabolic models for E. coli strains could enhance these efforts. Results We used the genomic information from 16 E. coli strains to generate an E. coli pangenome metabolic network by evaluating their collective 76,990 ORFs. Each of these ORFs was assigned to one of 17,647 ortholog groups including ORFs associated with reactions in the most recent metabolic model for E. coli K-12. For orthologous groups that contain an ORF already represented in the MG1655 model, the gene to protein to reaction associations represented in this model could then be easily propagated to other E. coli strain models. All remaining orthologous groups were evaluated to see if new metabolic reactions could be added to generate a pangenome-scale metabolic model (iEco1712_pan). The pangenome model included reactions from a metabolic model update for E. coli K-12 MG1655 (iEco1339_MG1655) and enabled development of five additional strain-specific genome-scale metabolic models. These additional models include a second K-12 strain (iEco1335_W3110) and four pathogenic strains (two enterohemorrhagic E. coli O157:H7 and two uropathogens). When compared to the E. coli K-12 models, the metabolic models for the enterohemorrhagic (iEco1344_EDL933 and iEco1345_Sakai) and uropathogenic strains (iEco1288_CFT073 and iEco1301_UTI89) contained numerous lineage-specific gene and reaction differences. All six E. coli models were evaluated by comparing model predictions to carbon source utilization measurements under aerobic and anaerobic conditions, and to batch growth profiles in minimal media with 0.2% (w/v) glucose. An ancestral genome-scale metabolic model based on conserved ortholog groups in all 16 E. coli genomes was also constructed, reflecting the conserved ancestral core of E. coli metabolism (iEco1053_core). Comparative analysis of all six strain-specific E. coli models revealed that some of the pathogenic E. coli strains possess reactions in their metabolic networks enabling higher biomass yields on glucose. Finally the lineage-specific metabolic traits were compared to the ancestral core model predictions to derive new insight into the evolution of metabolism within this species. Conclusion Our findings demonstrate that a pangenome-scale metabolic model can be used to rapidly construct additional E. coli strain-specific models, and that quantitative models of different strains of E. coli can accurately predict strain-specific phenotypes. Such pangenome and strain-specific models can be further used to engineer metabolic phenotypes of interest, such as designing new industrial E. coli strains.
Collapse
|
38
|
Heux S, Philippe B, Portais JC. High-throughput workflow for monitoring and mining bioprocess data and its application to inferring the physiological response of Escherichia coli to perturbations. Appl Environ Microbiol 2011; 77:7040-9. [PMID: 21841033 PMCID: PMC3187081 DOI: 10.1128/aem.05838-11] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2011] [Accepted: 08/03/2011] [Indexed: 11/20/2022] Open
Abstract
Miniaturization and high-throughput screening are currently the focus of emerging research areas such as systems biology and systems biotechnology. A fluorescence-based screening assay for the online monitoring of oxygen and pH and a numerical method to mine the resulting online process data are described. The assay employs commercial phosphorescent oxygen- and pH-sensitive probes in standard 48- or 96-well plates on a plate reader equipped with a shaker. In addition to dual parametric analysis of both pH and oxygen in a single well, the assay allows monitoring of growth, as measured by absorbance. Validation of the assay is presented and compared with commercially available plates equipped with optical sensors for oxygen and pH. By using model-free fitting to the readily available online measurements, the length and rate of each phase such as the duration of lag and transition phase or acidification, growth, and oxygen consumption rates are automatically detected. In total, nine physiological descriptors, which can be used for further statistical and comparison analysis, are extracted from the pH, oxygen partial pressure (pO(2)), and optical density (OD) profiles. The combination of a simple mix-and-measure procedure with an automatic data mining method allows high sample throughput and good reproducibility while providing a physiological state identification and characterization of test cells. As a proof of concept, the utility of the workflow in assessing the physiological response of Escherichia coli to environmental and genetic perturbations is demonstrated.
Collapse
Affiliation(s)
- Stéphanie Heux
- Université de Toulouse, INSA, UPS, INP, LISBP, 135 Avenue de Rangueil, F-31077 Toulouse, France
| | | | | |
Collapse
|
39
|
Jensen PA, Lutz KA, Papin JA. TIGER: Toolbox for integrating genome-scale metabolic models, expression data, and transcriptional regulatory networks. BMC SYSTEMS BIOLOGY 2011; 5:147. [PMID: 21943338 PMCID: PMC3224351 DOI: 10.1186/1752-0509-5-147] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2011] [Accepted: 09/23/2011] [Indexed: 11/10/2022]
Abstract
Background Several methods have been developed for analyzing genome-scale models of metabolism and transcriptional regulation. Many of these methods, such as Flux Balance Analysis, use constrained optimization to predict relationships between metabolic flux and the genes that encode and regulate enzyme activity. Recently, mixed integer programming has been used to encode these gene-protein-reaction (GPR) relationships into a single optimization problem, but these techniques are often of limited generality and lack a tool for automating the conversion of rules to a coupled regulatory/metabolic model. Results We present TIGER, a Toolbox for Integrating Genome-scale Metabolism, Expression, and Regulation. TIGER converts a series of generalized, Boolean or multilevel rules into a set of mixed integer inequalities. The package also includes implementations of existing algorithms to integrate high-throughput expression data with genome-scale models of metabolism and transcriptional regulation. We demonstrate how TIGER automates the coupling of a genome-scale metabolic model with GPR logic and models of transcriptional regulation, thereby serving as a platform for algorithm development and large-scale metabolic analysis. Additionally, we demonstrate how TIGER's algorithms can be used to identify inconsistencies and improve existing models of transcriptional regulation with examples from the reconstructed transcriptional regulatory network of Saccharomyces cerevisiae. Conclusion The TIGER package provides a consistent platform for algorithm development and extending existing genome-scale metabolic models with regulatory networks and high-throughput data.
Collapse
Affiliation(s)
- Paul A Jensen
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA 22908, USA
| | | | | |
Collapse
|
40
|
McDermott JE, Yoon H, Nakayasu ES, Metz TO, Hyduke DR, Kidwai AS, Palsson BO, Adkins JN, Heffron F. Technologies and approaches to elucidate and model the virulence program of salmonella. Front Microbiol 2011; 2:121. [PMID: 21687430 PMCID: PMC3108385 DOI: 10.3389/fmicb.2011.00121] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2011] [Accepted: 05/15/2011] [Indexed: 11/13/2022] Open
Abstract
Salmonella is a primary cause of enteric diseases in a variety of animals. During its evolution into a pathogenic bacterium, Salmonella acquired an elaborate regulatory network that responds to multiple environmental stimuli within host animals and integrates them resulting in fine regulation of the virulence program. The coordinated action by this regulatory network involves numerous virulence regulators, necessitating genome-wide profiling analysis to assess and combine efforts from multiple regulons. In this review we discuss recent high-throughput analytic approaches used to understand the regulatory network of Salmonella that controls virulence processes. Application of high-throughput analyses have generated large amounts of data and necessitated the development of computational approaches for data integration. Therefore, we also cover computer-aided network analyses to infer regulatory networks, and demonstrate how genome-scale data can be used to construct regulatory and metabolic systems models of Salmonella pathogenesis. Genes that are coordinately controlled by multiple virulence regulators under infectious conditions are more likely to be important for pathogenesis. Thus, reconstructing the global regulatory network during infection or, at the very least, under conditions that mimic the host cellular environment not only provides a bird's eye view of Salmonella survival strategy in response to hostile host environments but also serves as an efficient means to identify novel virulence factors that are essential for Salmonella to accomplish systemic infection in the host.
Collapse
Affiliation(s)
- Jason E. McDermott
- Computational Biology and Bioinformatics Group, Pacific Northwest National LaboratoryRichland, WA, USA
| | - Hyunjin Yoon
- Department of Molecular Microbiology and Immunology, Oregon Health and Sciences UniversityPortland, OR, USA
| | - Ernesto S. Nakayasu
- Biological Separations and Mass Spectroscopy Group, Pacific Northwest National LaboratoryRichland WA, USA
| | - Thomas O. Metz
- Biological Separations and Mass Spectroscopy Group, Pacific Northwest National LaboratoryRichland WA, USA
| | - Daniel R. Hyduke
- Systems Biology, University of California San DiegoSan Diego, CA, USA
| | - Afshan S. Kidwai
- Department of Molecular Microbiology and Immunology, Oregon Health and Sciences UniversityPortland, OR, USA
| | | | - Joshua N. Adkins
- Biological Separations and Mass Spectroscopy Group, Pacific Northwest National LaboratoryRichland WA, USA
| | - Fred Heffron
- Department of Molecular Microbiology and Immunology, Oregon Health and Sciences UniversityPortland, OR, USA
| |
Collapse
|
41
|
Gerosa L, Sauer U. Regulation and control of metabolic fluxes in microbes. Curr Opin Biotechnol 2011; 22:566-75. [PMID: 21600757 DOI: 10.1016/j.copbio.2011.04.016] [Citation(s) in RCA: 109] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2011] [Accepted: 04/20/2011] [Indexed: 01/09/2023]
Abstract
After about ten years of research renaissance in metabolism, the present challenge is to understand how metabolic fluxes are controlled by a complex interplay of overlapping regulatory mechanisms. Reconstruction of various regulatory network topologies is steaming, illustrating that we underestimated the broad importance of post-translational modifications such as enzyme phosphorylation or acetylation for microbial metabolism. With the growing topological knowledge, the functional relevance of these regulatory events becomes an even more pressing need. A major knowledge gap resides in the regulatory network of protein-metabolite interactions, simply because we lacked pertinent methods for systematic analyses - but a start has now been made. Perhaps most dramatic was the conceptual shift in our perception of metabolism from an engine of cellular operation to a generator of input and feedback signals for regulatory circuits that govern many important decisions on cell proliferation, differentiation, death, and naturally metabolism.
Collapse
Affiliation(s)
- Luca Gerosa
- Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | | |
Collapse
|