1
|
Tarzi C, Zampieri G, Sullivan N, Angione C. Emerging methods for genome-scale metabolic modeling of microbial communities. Trends Endocrinol Metab 2024; 35:533-548. [PMID: 38575441 DOI: 10.1016/j.tem.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 02/28/2024] [Accepted: 02/29/2024] [Indexed: 04/06/2024]
Abstract
Genome-scale metabolic models (GEMs) are consolidating as platforms for studying mixed microbial populations, by combining biological data and knowledge with mathematical rigor. However, deploying these models to answer research questions can be challenging due to the increasing number of available computational tools, the lack of universal standards, and their inherent limitations. Here, we present a comprehensive overview of foundational concepts for building and evaluating genome-scale models of microbial communities. We then compare tools in terms of requirements, capabilities, and applications. Next, we highlight the current pitfalls and open challenges to consider when adopting existing tools and developing new ones. Our compendium can be relevant for the expanding community of modelers, both at the entry and experienced levels.
Collapse
Affiliation(s)
- Chaimaa Tarzi
- School of Computing, Engineering and Digital Technologies, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK
| | - Guido Zampieri
- Department of Biology, University of Padova, Padova, 35122, Veneto, Italy
| | - Neil Sullivan
- Complement Genomics Ltd, Station Rd, Lanchester, Durham, DH7 0EX, County Durham, UK
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK; Centre for Digital Innovation, Teesside University, Southfield Rd, Middlesbrough, TS1 3BX, North Yorkshire, UK; National Horizons Centre, Teesside University, 38 John Dixon Ln, Darlington, DL1 1HG, North Yorkshire, UK.
| |
Collapse
|
2
|
Gong Z, Chen J, Jiao X, Gong H, Pan D, Liu L, Zhang Y, Tan T. Genome-scale metabolic network models for industrial microorganisms metabolic engineering: Current advances and future prospects. Biotechnol Adv 2024; 72:108319. [PMID: 38280495 DOI: 10.1016/j.biotechadv.2024.108319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Revised: 01/04/2024] [Accepted: 01/18/2024] [Indexed: 01/29/2024]
Abstract
The construction of high-performance microbial cell factories (MCFs) is the centerpiece of biomanufacturing. However, the complex metabolic regulatory network of microorganisms poses great challenges for the efficient design and construction of MCFs. The genome-scale metabolic network models (GSMs) can systematically simulate the metabolic regulation process of microorganisms in silico, providing effective guidance for the rapid design and construction of MCFs. In this review, we summarized the development status of 16 important industrial microbial GSMs, and further outline the technologies or methods that continuously promote high-quality GSMs construction from five aspects: I) Databases and modeling tools facilitate GSMs reconstruction; II) evolving gap-filling technologies; III) constraint-based model reconstruction; IV) advances in algorithms; and V) developed visualization tools. In addition, we also summarized the applications of GSMs in guiding metabolic engineering from four aspects: I) exploring and explaining metabolic features; II) predicting the effects of genetic perturbations on metabolism; III) predicting the optimal phenotype; IV) guiding cell factories construction in practical experiment. Finally, we discussed the development of GSMs, aiming to provide a reference for efficiently reconstructing GSMs and guiding metabolic engineering.
Collapse
Affiliation(s)
- Zhijin Gong
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Jiayao Chen
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Xinyu Jiao
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Hao Gong
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; College of Mathematics and Physics, Beijing University of Chemical Technology, Beijing 100029, China
| | - Danzi Pan
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; College of Mathematics and Physics, Beijing University of Chemical Technology, Beijing 100029, China
| | - Lingli Liu
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; College of Mathematics and Physics, Beijing University of Chemical Technology, Beijing 100029, China
| | - Yang Zhang
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
| | - Tianwei Tan
- National Energy R&D Center for Biorefinery, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China; Beijing Key Laboratory of Bioprocess, College of Life Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China.
| |
Collapse
|
3
|
Brunner JD, Chia N. Metabolic model-based ecological modeling for probiotic design. eLife 2024; 13:e83690. [PMID: 38380900 PMCID: PMC10942782 DOI: 10.7554/elife.83690] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2022] [Accepted: 02/19/2024] [Indexed: 02/22/2024] Open
Abstract
The microbial community composition in the human gut has a profound effect on human health. This observation has lead to extensive use of microbiome therapies, including over-the-counter 'probiotic' treatments intended to alter the composition of the microbiome. Despite so much promise and commercial interest, the factors that contribute to the success or failure of microbiome-targeted treatments remain unclear. We investigate the biotic interactions that lead to successful engraftment of a novel bacterial strain introduced to the microbiome as in probiotic treatments. We use pairwise genome-scale metabolic modeling with a generalized resource allocation constraint to build a network of interactions between taxa that appear in an experimental engraftment study. We create induced sub-graphs using the taxa present in individual samples and assess the likelihood of invader engraftment based on network structure. To do so, we use a generalized Lotka-Volterra model, which we show has strong ability to predict if a particular invader or probiotic will successfully engraft into an individual's microbiome. Furthermore, we show that the mechanistic nature of the model is useful for revealing which microbe-microbe interactions potentially drive engraftment.
Collapse
Affiliation(s)
- James D Brunner
- Biosciences Division, Los Alamos National LaboratoryLos AlamosUnited States
- Center for Nonlinear Studies, Los Alamos National LaboratoryLos AlamosUnited States
| | - Nicholas Chia
- Data Science and Learning, Argonne National LaboratoryLemontUnited States
| |
Collapse
|
4
|
Carter EL, Constantinidou C, Alam MT. Applications of genome-scale metabolic models to investigate microbial metabolic adaptations in response to genetic or environmental perturbations. Brief Bioinform 2023; 25:bbad439. [PMID: 38048080 PMCID: PMC10694557 DOI: 10.1093/bib/bbad439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/21/2023] [Accepted: 11/08/2023] [Indexed: 12/05/2023] Open
Abstract
Environmental perturbations are encountered by microorganisms regularly and will require metabolic adaptations to ensure an organism can survive in the newly presenting conditions. In order to study the mechanisms of metabolic adaptation in such conditions, various experimental and computational approaches have been used. Genome-scale metabolic models (GEMs) are one of the most powerful approaches to study metabolism, providing a platform to study the systems level adaptations of an organism to different environments which could otherwise be infeasible experimentally. In this review, we are describing the application of GEMs in understanding how microbes reprogram their metabolic system as a result of environmental variation. In particular, we provide the details of metabolic model reconstruction approaches, various algorithms and tools for model simulation, consequences of genetic perturbations, integration of '-omics' datasets for creating context-specific models and their application in studying metabolic adaptation due to the change in environmental conditions.
Collapse
Affiliation(s)
- Elena Lucy Carter
- Warwick Medical School, University of Warwick, Coventry, CV4 7HL, UK
| | | | | |
Collapse
|
5
|
Kim M, Sung J, Chia N. Resource-allocation constraint governs structure and function of microbial communities in metabolic modeling. Metab Eng 2022; 70:12-22. [PMID: 34990848 DOI: 10.1016/j.ymben.2021.12.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 12/01/2021] [Accepted: 12/29/2021] [Indexed: 10/19/2022]
Abstract
Predictive modeling tools for assessing microbial communities are important for realizing transformative capabilities of microbiomes in agriculture, ecology, and medicine. Constraint-based community-scale metabolic modeling is unique in its potential for making mechanistic predictions regarding both the structure and function of microbial communities. However, accessing this potential requires an understanding of key physicochemical constraints, which are typically considered on a per-species basis. What is needed is a means of incorporating global constraints relevant to microbial ecology into community models. Resource-allocation constraint, which describes how limited resources should be distributed to different cellular processes, sets limits on the efficiency of metabolic and ecological processes. In this study, we investigate the implications of resource-allocation constraints in community-scale metabolic modeling through a simple mechanism-agnostic implementation of resource-allocation constraints directly at the flux level. By systematically performing single-, two-, and multi-species growth simulations, we show that resource-allocation constraints are indispensable for predicting the structure and function of microbial communities. Our findings call for a scalable workflow for implementing a mechanistic version of resource-allocation constraints to ultimately harness the full potential of community-scale metabolic modeling tools.
Collapse
Affiliation(s)
- Minsuk Kim
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA; Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, 55905, USA
| | - Jaeyun Sung
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA; Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, 55905, USA; Division of Rheumatology, Department of Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Nicholas Chia
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA; Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, 55905, USA.
| |
Collapse
|
6
|
Giannari D, Ho CH, Mahadevan R. A gap-filling algorithm for prediction of metabolic interactions in microbial communities. PLoS Comput Biol 2021; 17:e1009060. [PMID: 34723959 PMCID: PMC8584699 DOI: 10.1371/journal.pcbi.1009060] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 11/11/2021] [Accepted: 10/05/2021] [Indexed: 11/24/2022] Open
Abstract
The study of microbial communities and their interactions has attracted the interest of the scientific community, because of their potential for applications in biotechnology, ecology and medicine. The complexity of interspecies interactions, which are key for the macroscopic behavior of microbial communities, cannot be studied easily experimentally. For this reason, the modeling of microbial communities has begun to leverage the knowledge of established constraint-based methods, which have long been used for studying and analyzing the microbial metabolism of individual species based on genome-scale metabolic reconstructions of microorganisms. A main problem of genome-scale metabolic reconstructions is that they usually contain metabolic gaps due to genome misannotations and unknown enzyme functions. This problem is traditionally solved by using gap-filling algorithms that add biochemical reactions from external databases to the metabolic reconstruction, in order to restore model growth. However, gap-filling algorithms could evolve by taking into account metabolic interactions among species that coexist in microbial communities. In this work, a gap-filling method that resolves metabolic gaps at the community level was developed. The efficacy of the algorithm was tested by analyzing its ability to resolve metabolic gaps on a synthetic community of auxotrophic Escherichia coli strains. Subsequently, the algorithm was applied to resolve metabolic gaps and predict metabolic interactions in a community of Bifidobacterium adolescentis and Faecalibacterium prausnitzii, two species present in the human gut microbiota, and in an experimentally studied community of Dehalobacter and Bacteroidales species of the ACT-3 community. The community gap-filling method can facilitate the improvement of metabolic models and the identification of metabolic interactions that are difficult to identify experimentally in microbial communities.
Collapse
Affiliation(s)
- Dafni Giannari
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
| | | | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- The Institute of Biomaterials & Biomedical Engineering, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
7
|
Zimmermann J, Kaleta C, Waschina S. gapseq: informed prediction of bacterial metabolic pathways and reconstruction of accurate metabolic models. Genome Biol 2021; 22:81. [PMID: 33691770 PMCID: PMC7949252 DOI: 10.1186/s13059-021-02295-1] [Citation(s) in RCA: 97] [Impact Index Per Article: 32.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 02/10/2021] [Indexed: 12/21/2022] Open
Abstract
Genome-scale metabolic models of microorganisms are powerful frameworks to predict phenotypes from an organism's genotype. While manual reconstructions are laborious, automated reconstructions often fail to recapitulate known metabolic processes. Here we present gapseq ( https://github.com/jotech/gapseq ), a new tool to predict metabolic pathways and automatically reconstruct microbial metabolic models using a curated reaction database and a novel gap-filling algorithm. On the basis of scientific literature and experimental data for 14,931 bacterial phenotypes, we demonstrate that gapseq outperforms state-of-the-art tools in predicting enzyme activity, carbon source utilisation, fermentation products, and metabolic interactions within microbial communities.
Collapse
Affiliation(s)
- Johannes Zimmermann
- Christian-Albrechts-University Kiel, Institute of Experimental Medicine, Research Group Medical Systems Biology, Michaelis-Str. 5, Kiel, 24105 Germany
| | - Christoph Kaleta
- Christian-Albrechts-University Kiel, Institute of Experimental Medicine, Research Group Medical Systems Biology, Michaelis-Str. 5, Kiel, 24105 Germany
| | - Silvio Waschina
- Christian-Albrechts-University Kiel, Institute of Experimental Medicine, Research Group Medical Systems Biology, Michaelis-Str. 5, Kiel, 24105 Germany
- Christian-Albrechts-University Kiel, Institute of Human Nutrition and Food Science, Nutriinformatics, Heinrich-Hecht-Platz 10, Kiel, 24118 Germany
| |
Collapse
|
8
|
Bernstein DB, Sulheim S, Almaas E, Segrè D. Addressing uncertainty in genome-scale metabolic model reconstruction and analysis. Genome Biol 2021; 22:64. [PMID: 33602294 PMCID: PMC7890832 DOI: 10.1186/s13059-021-02289-z] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 02/04/2021] [Indexed: 02/07/2023] Open
Abstract
The reconstruction and analysis of genome-scale metabolic models constitutes a powerful systems biology approach, with applications ranging from basic understanding of genotype-phenotype mapping to solving biomedical and environmental problems. However, the biological insight obtained from these models is limited by multiple heterogeneous sources of uncertainty, which are often difficult to quantify. Here we review the major sources of uncertainty and survey existing approaches developed for representing and addressing them. A unified formal characterization of these uncertainties through probabilistic approaches and ensemble modeling will facilitate convergence towards consistent reconstruction pipelines, improved data integration algorithms, and more accurate assessment of predictive capacity.
Collapse
Affiliation(s)
- David B Bernstein
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA
| | - Snorre Sulheim
- Bioinformatics Program, Boston University, Boston, MA, USA
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- Department of Biotechnology and Nanomedicine, SINTEF Industry, Trondheim, Norway
| | - Eivind Almaas
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- K.G. Jebsen Center for Genetic Epidemiology, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Daniel Segrè
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA.
- Bioinformatics Program, Boston University, Boston, MA, USA.
- Department of Biology and Department of Physics, Boston University, Boston, MA, USA.
| |
Collapse
|
9
|
Patumcharoenpol P, Nakphaichit M, Panagiotou G, Senavonge A, Suratannon N, Vongsangnak W. MetGEMs Toolbox: Metagenome-scale models as integrative toolbox for uncovering metabolic functions and routes of human gut microbiome. PLoS Comput Biol 2021; 17:e1008487. [PMID: 33406089 PMCID: PMC7787440 DOI: 10.1371/journal.pcbi.1008487] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 11/03/2020] [Indexed: 12/20/2022] Open
Abstract
Investigating metabolic functional capability of a human gut microbiome enables the quantification of microbiome changes, which can cause a phenotypic change of host physiology and disease. One possible way to estimate the functional capability of a microbial community is through inferring metagenomic content from 16S rRNA gene sequences. Genome-scale models (GEMs) can be used as scaffold for functional estimation analysis at a systematic level, however up to date, there is no integrative toolbox based on GEMs for uncovering metabolic functions. Here, we developed the MetGEMs (metagenome-scale models) toolbox, an open-source application for inferring metabolic functions from 16S rRNA gene sequences to facilitate the study of the human gut microbiome by the wider scientific community. The developed toolbox was validated using shotgun metagenomic data and shown to be superior in predicting functional composition in human clinical samples compared to existing state-of-the-art tools. Therefore, the MetGEMs toolbox was subsequently applied for annotating putative enzyme functions and metabolic routes related in human disease using atopic dermatitis as a case study.
Collapse
Grants
- Kasetsart University Research and Development Institute (KURDI) at Kasetsart University
- Department of Zoology, Faculty of Science, Kasetsart University
- Omics Center for Agriculture, Bioresources, Food, and Health, Kasetsart University (OmiKU)
- Interdisciplinary Graduate Program in Bioscience, Faculty of Science, Kasetsart University
- International Affairs Division (IAD), Kasetsart University
- National Science and Technology Development Agency
- Ratchadapisek Research Funds
- Chulalongkorn University
- Deutsche Forschungsgemeinschaft (DFG) CRC/Transregio 124 “Pathogenic fungi and their human host: Networks of interaction”, subprojects B5 and INF
Collapse
Affiliation(s)
- Preecha Patumcharoenpol
- Interdisciplinary Graduate Program in Bioscience, Faculty of Science, Kasetsart University, Bangkok, Thailand
| | - Massalin Nakphaichit
- Department of Biotechnology, Faculty of Agro-Industry, Kasetsart University, Bangkok, Thailand
| | - Gianni Panagiotou
- Systems Biology & Bioinformatics Group, School of Biological Sciences, The University of Hong Kong, Hong Kong S.A.R., China
- Department of Medicine and State Key Laboratory of Pharmaceutical Biotechnology, The University of Hong Kong, Hong Kong S.A.R., China
- Systems Biology & Bioinformatics Unit, Leibniz Institute for Natural Product Research and Infection Biology–Hans Knöll Institute, Jena, Germany
| | - Anchalee Senavonge
- Pediatric Allergy & Clinical Immunology Research Unit, Division of Allergy and Immunology, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Narissara Suratannon
- Pediatric Allergy & Clinical Immunology Research Unit, Division of Allergy and Immunology, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, Thailand
| | - Wanwipa Vongsangnak
- Department of Zoology, Faculty of Science, Kasetsart University, Bangkok, Thailand
- Omics Center for Agriculture, Bioresources, Food, and Health, Kasetsart University (OmiKU), Bangkok, Thailand
| |
Collapse
|
10
|
Hanna EM, Zhang X, Eide M, Fallahi S, Furmanek T, Yadetie F, Zielinski DC, Goksøyr A, Jonassen I. ReCodLiver0.9: Overcoming Challenges in Genome-Scale Metabolic Reconstruction of a Non-model Species. Front Mol Biosci 2020; 7:591406. [PMID: 33324679 PMCID: PMC7726423 DOI: 10.3389/fmolb.2020.591406] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 10/22/2020] [Indexed: 12/13/2022] Open
Abstract
The availability of genome sequences, annotations, and knowledge of the biochemistry underlying metabolic transformations has led to the generation of metabolic network reconstructions for a wide range of organisms in bacteria, archaea, and eukaryotes. When modeled using mathematical representations, a reconstruction can simulate underlying genotype-phenotype relationships. Accordingly, genome-scale metabolic models (GEMs) can be used to predict the response of organisms to genetic and environmental variations. A bottom-up reconstruction procedure typically starts by generating a draft model from existing annotation data on a target organism. For model species, this part of the process can be straightforward, due to the abundant organism-specific biochemical data. However, the process becomes complicated for non-model less-annotated species. In this paper, we present a draft liver reconstruction, ReCodLiver0.9, of Atlantic cod (Gadus morhua), a non-model teleost fish, as a practicable guide for cases with comparably few resources. Although the reconstruction is considered a draft version, we show that it already has utility in elucidating metabolic response mechanisms to environmental toxicants by mapping gene expression data of exposure experiments to the resulting model.
Collapse
Affiliation(s)
- Eileen Marie Hanna
- Department of Computer Science and Mathematics, Lebanese American University, Byblos, Lebanon
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Xiaokang Zhang
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Marta Eide
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Shirin Fallahi
- Department of Mathematics, University of Bergen, Bergen, Norway
| | | | - Fekadu Yadetie
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Daniel Craig Zielinski
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, United States
| | - Anders Goksøyr
- Department of Biological Sciences, University of Bergen, Bergen, Norway
| | - Inge Jonassen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| |
Collapse
|
11
|
Systematically gap-filling the genome-scale metabolic model of CHO cells. Biotechnol Lett 2020; 43:73-87. [PMID: 33040240 DOI: 10.1007/s10529-020-03021-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 10/03/2020] [Indexed: 10/23/2022]
Abstract
OBJECTIVE Chinese hamster ovary (CHO) cells are the leading cell factories for producing recombinant proteins in the biopharmaceutical industry. In this regard, constraint-based metabolic models are useful platforms to perform computational analysis of cell metabolism. These models need to be regularly updated in order to include the latest biochemical data of the cells, and to increase their predictive power. Here, we provide an update to iCHO1766, the metabolic model of CHO cells. RESULTS We expanded the existing model of Chinese hamster metabolism with the help of four gap-filling approaches, leading to the addition of 773 new reactions and 335 new genes. We incorporated these into an updated genome-scale metabolic network model of CHO cells, named iCHO2101. In this updated model, the number of reactions and pathways capable of carrying flux is substantially increased. CONCLUSIONS The present CHO model is an important step towards more complete metabolic models of CHO cells.
Collapse
|
12
|
López-Agudelo VA, Mendum TA, Laing E, Wu H, Baena A, Barrera LF, Beste DJV, Rios-Estepa R. A systematic evaluation of Mycobacterium tuberculosis Genome-Scale Metabolic Networks. PLoS Comput Biol 2020; 16:e1007533. [PMID: 32542021 PMCID: PMC7316355 DOI: 10.1371/journal.pcbi.1007533] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 06/25/2020] [Accepted: 05/08/2020] [Indexed: 01/06/2023] Open
Abstract
Metabolism underpins the pathogenic strategy of the causative agent of TB, Mycobacterium tuberculosis (Mtb), and therefore metabolic pathways have recently re-emerged as attractive drug targets. A powerful approach to study Mtb metabolism as a whole, rather than just individual enzymatic components, is to use a systems biology framework, such as a Genome-Scale Metabolic Network (GSMN) that allows the dynamic interactions of all the components of metabolism to be interrogated together. Several GSMNs networks have been constructed for Mtb and used to study the complex relationship between the Mtb genotype and its phenotype. However, the utility of this approach is hampered by the existence of multiple models, each with varying properties and performances. Here we systematically evaluate eight recently published metabolic models of Mtb-H37Rv to facilitate model choice. The best performing models, sMtb2018 and iEK1011, were refined and improved for use in future studies by the TB research community.
Collapse
Affiliation(s)
- Víctor A. López-Agudelo
- Grupo de Bioprocesos, Departamento de Ingeniería Química, Universidad de Antioquia UdeA, Medellín, Colombia
- Grupo de Inmunología Celular e Inmunogenética (GICIG), Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
| | - Tom A. Mendum
- Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, United Kingdom
| | - Emma Laing
- Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, United Kingdom
| | - HuiHai Wu
- Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, United Kingdom
| | - Andres Baena
- Grupo de Inmunología Celular e Inmunogenética (GICIG), Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
- Departamento de Microbiología y Parasitología, Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
| | - Luis F. Barrera
- Grupo de Inmunología Celular e Inmunogenética (GICIG), Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
- Instituto de Investigaciones Médicas, Facultad de Medicina, Universidad de Antioquia UdeA, Medellín, Colombia
| | - Dany J. V. Beste
- Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, United Kingdom
| | - Rigoberto Rios-Estepa
- Grupo de Bioprocesos, Departamento de Ingeniería Química, Universidad de Antioquia UdeA, Medellín, Colombia
| |
Collapse
|
13
|
Stanway RR, Bushell E, Chiappino-Pepe A, Roques M, Sanderson T, Franke-Fayard B, Caldelari R, Golomingi M, Nyonda M, Pandey V, Schwach F, Chevalley S, Ramesar J, Metcalf T, Herd C, Burda PC, Rayner JC, Soldati-Favre D, Janse CJ, Hatzimanikatis V, Billker O, Heussler VT. Genome-Scale Identification of Essential Metabolic Processes for Targeting the Plasmodium Liver Stage. Cell 2020; 179:1112-1128.e26. [PMID: 31730853 PMCID: PMC6904910 DOI: 10.1016/j.cell.2019.10.030] [Citation(s) in RCA: 73] [Impact Index Per Article: 18.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2019] [Revised: 07/23/2019] [Accepted: 10/23/2019] [Indexed: 12/11/2022]
Abstract
Plasmodium gene functions in mosquito and liver stages remain poorly characterized due to limitations in the throughput of phenotyping at these stages. To fill this gap, we followed more than 1,300 barcoded P. berghei mutants through the life cycle. We discover 461 genes required for efficient parasite transmission to mosquitoes through the liver stage and back into the bloodstream of mice. We analyze the screen in the context of genomic, transcriptomic, and metabolomic data by building a thermodynamic model of P. berghei liver-stage metabolism, which shows a major reprogramming of parasite metabolism to achieve rapid growth in the liver. We identify seven metabolic subsystems that become essential at the liver stages compared with asexual blood stages: type II fatty acid synthesis and elongation (FAE), tricarboxylic acid, amino sugar, heme, lipoate, and shikimate metabolism. Selected predictions from the model are individually validated in single mutants to provide future targets for drug development. 1,342 barcoded P. berghei knockout (KO) mutants analyzed for stage-specific phenotypes Life-stage-specific metabolic models reveal reprogramming of cellular function High agreement between blood/liver stage metabolic models and genetic screening data Essential metabolic pathways for parasite development and mechanistic origin revealed
Collapse
Affiliation(s)
- Rebecca R Stanway
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | - Ellen Bushell
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden
| | - Anush Chiappino-Pepe
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland
| | - Magali Roques
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | - Theo Sanderson
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Blandine Franke-Fayard
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Reto Caldelari
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland
| | | | - Mary Nyonda
- Department of Microbiology & Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva 1211, Switzerland
| | - Vikash Pandey
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden
| | - Frank Schwach
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Séverine Chevalley
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Jai Ramesar
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Tom Metcalf
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Colin Herd
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Paul-Christian Burda
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland; Bernhard Nocht Institute for Tropical Medicine, Hamburg 20359, Germany
| | - Julian C Rayner
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Cambridge Institute for Medical Research, University of Cambridge, Hills Road, Cambridge CB2, 0XY, UK
| | - Dominique Soldati-Favre
- Department of Microbiology & Molecular Medicine, Faculty of Medicine, University of Geneva, Geneva 1211, Switzerland
| | - Chris J Janse
- Leiden Malaria Research Group, Parasitology, Center of Infectious Diseases, Leiden University Medical Center (LUMC), Leiden 2333ZA, the Netherlands
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne 1015, Switzerland
| | - Oliver Billker
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Molecular Infection Medicine Sweden (MIMS), Department of Molecular Biology, Umeå University, Umeå 901 87, Sweden.
| | - Volker T Heussler
- Institute of Cell Biology, University of Bern, Bern 3012, Switzerland.
| |
Collapse
|
14
|
Holzheu P, Kummer U. Computational systems biology of cellular processes in Arabidopsis thaliana: an overview. Cell Mol Life Sci 2020; 77:433-440. [PMID: 31768604 PMCID: PMC11105087 DOI: 10.1007/s00018-019-03379-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Revised: 11/11/2019] [Accepted: 11/12/2019] [Indexed: 02/06/2023]
Abstract
Systems biology strives for gaining an understanding of biological phenomena by studying the interactions of different parts of a system and integrating the knowledge obtained into the current view of the underlying processes. This is achieved by a tight combination of quantitative experimentation and computational modeling. While there is already a large quantity of systems biology studies describing human, animal and especially microbial cell biological systems, plant biology has been lagging behind for many years. However, in the case of the model plant Arabidopsis thaliana, the steadily increasing amount of information on the levels of its genome, proteome and on a variety of its metabolic and signalling pathways is progressively enabling more researchers to construct models for cellular processes for the plant, which in turn encourages more experimental data to be generated, showing also for plant sciences how fruitful systems biology research can be. In this review, we provide an overview over some of these recent studies which use different systems biological approaches to get a better understanding of the cell biology of A. thaliana. The approaches used in these are genome-scale metabolic modeling, as well as kinetic modeling of metabolic and signalling pathways. Furthermore, we selected several cases to exemplify how the modeling approaches have led to significant advances or new perspectives in the field.
Collapse
Affiliation(s)
- Pascal Holzheu
- INF 267 (Bioquant), Heidelberg University, 69120, Heidelberg, Germany
| | - Ursula Kummer
- INF 267 (Bioquant), Heidelberg University, 69120, Heidelberg, Germany.
| |
Collapse
|
15
|
Medlock GL, Papin JA. Guiding the Refinement of Biochemical Knowledgebases with Ensembles of Metabolic Networks and Machine Learning. Cell Syst 2020; 10:109-119.e3. [PMID: 31926940 PMCID: PMC6975163 DOI: 10.1016/j.cels.2019.11.006] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2019] [Revised: 08/27/2019] [Accepted: 11/14/2019] [Indexed: 11/16/2022]
Abstract
Mechanistic models explicitly represent hypothesized biological knowledge. As such, they offer more generalizability than data-driven models. However, identifying model curation efforts that improve performance for mechanistic models is nontrivial. Here, we develop a solution to this problem for genome-scale metabolic models. We generate an ensemble of models, each equally consistent with experimental data, then perform simulations with them. We apply machine learning to the simulation output to identify model structure variation that maximally influences simulations. These variants are high-priority candidates for curation through removal, addition, or reannotation in the model. We apply this approach, automated metabolic model ensemble-driven elimination of uncertainty with statistical learning (AMMEDEUS), to 29 bacterial species to improve gene essentiality predictions. We explore targets for individual species and compile pan-species targets to improve the database used during model construction. AMMEDEUS is an automated and performance-driven recommendation system that complements intuition during curation of biochemical knowledgebases.
Collapse
Affiliation(s)
- Gregory L Medlock
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Jason A Papin
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA; Department of Medicine, Division of Infectious Diseases & International Health, University of Virginia, Charlottesville, VA, USA; Department of Biochemistry & Molecular Genetics, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
16
|
Gilbert J, Pearcy N, Norman R, Millat T, Winzer K, King J, Hodgman C, Minton N, Twycross J. Gsmodutils: a python based framework for test-driven genome scale metabolic model development. Bioinformatics 2019; 35:3397-3403. [PMID: 30759197 PMCID: PMC6748746 DOI: 10.1093/bioinformatics/btz088] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 01/29/2019] [Accepted: 02/12/2019] [Indexed: 12/13/2022] Open
Abstract
MOTIVATION Genome scale metabolic models (GSMMs) are increasingly important for systems biology and metabolic engineering research as they are capable of simulating complex steady-state behaviour. Constraints based models of this form can include thousands of reactions and metabolites, with many crucial pathways that only become activated in specific simulation settings. However, despite their widespread use, power and the availability of tools to aid with the construction and analysis of large scale models, little methodology is suggested for their continued management. For example, when genome annotations are updated or new understanding regarding behaviour is discovered, models often need to be altered to reflect this. This is quickly becoming an issue for industrial systems and synthetic biotechnology applications, which require good quality reusable models integral to the design, build, test and learn cycle. RESULTS As part of an ongoing effort to improve genome scale metabolic analysis, we have developed a test-driven development methodology for the continuous integration of validation data from different sources. Contributing to the open source technology based around COBRApy, we have developed the gsmodutils modelling framework placing an emphasis on test-driven design of models through defined test cases. Crucially, different conditions are configurable allowing users to examine how different designs or curation impact a wide range of system behaviours, minimizing error between model versions. AVAILABILITY AND IMPLEMENTATION The software framework described within this paper is open source and freely available from http://github.com/SBRCNottingham/gsmodutils. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- James Gilbert
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
| | - Nicole Pearcy
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
| | - Rupert Norman
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
- School of Biosciences, University of Nottingham, Sutton Bonington, Loughborough, UK
| | - Thomas Millat
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
| | - Klaus Winzer
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
| | - John King
- School of Mathematical Sciences, University of Nottingham, Nottingham, UK
| | - Charlie Hodgman
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
- School of Biosciences, University of Nottingham, Sutton Bonington, Loughborough, UK
| | - Nigel Minton
- Synthetic Biology Research Centre, University of Nottingham, Nottingham, UK
| | - Jamie Twycross
- School of Computer Science, University of Nottingham, Nottingham, UK
| |
Collapse
|
17
|
Kumar M, Ji B, Zengler K, Nielsen J. Modelling approaches for studying the microbiome. Nat Microbiol 2019; 4:1253-1267. [PMID: 31337891 DOI: 10.1038/s41564-019-0491-9] [Citation(s) in RCA: 81] [Impact Index Per Article: 16.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2017] [Accepted: 05/21/2019] [Indexed: 02/08/2023]
Abstract
Advances in metagenome sequencing of the human microbiome have provided a plethora of new insights and revealed a close association of this complex ecosystem with a range of human diseases. However, there is little knowledge about how the different members of the microbial community interact with each other and with the host, and we lack basic mechanistic understanding of these interactions related to health and disease. Mathematical modelling has been demonstrated to be highly advantageous for gaining insights into the dynamics and interactions of complex systems and in recent years, several modelling approaches have been proposed to enhance our understanding of the microbiome. Here, we review the latest developments and current approaches, and highlight how different modelling strategies have been applied to unravel the highly dynamic nature of the human microbiome. Furthermore, we discuss present limitations of different modelling strategies and provide a perspective of how modelling can advance understanding and offer new treatment routes to impact human health.
Collapse
Affiliation(s)
- Manish Kumar
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.,Department of Pediatrics, University of California, San Diego, CA, USA
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden
| | - Karsten Zengler
- Department of Pediatrics, University of California, San Diego, CA, USA.,Department of Bioengineering, University of California, San Diego, CA, USA.,Center for Microbiome Innovation, University of California, San Diego, CA, USA
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden. .,Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark.
| |
Collapse
|
18
|
Griesemer M, Kimbrel JA, Zhou CE, Navid A, D'haeseleer P. Combining multiple functional annotation tools increases coverage of metabolic annotation. BMC Genomics 2018; 19:948. [PMID: 30567498 PMCID: PMC6299973 DOI: 10.1186/s12864-018-5221-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 11/05/2018] [Indexed: 12/15/2022] Open
Abstract
Background Genome-scale metabolic modeling is a cornerstone of systems biology analysis of microbial organisms and communities, yet these genome-scale modeling efforts are invariably based on incomplete functional annotations. Annotated genomes typically contain 30–50% of genes without functional annotation, severely limiting our knowledge of the “parts lists” that the organisms have at their disposal. These incomplete annotations may be sufficient to derive a model of a core set of well-studied metabolic pathways that support growth in pure culture. However, pathways important for growth on unusual metabolites exchanged in complex microbial communities are often less understood, resulting in missing functional annotations in newly sequenced genomes. Results Here, we present results on a comprehensive reannotation of 27 bacterial reference genomes, focusing on enzymes with EC numbers annotated by KEGG, RAST, EFICAz, and the BRENDA enzyme database, and on membrane transport annotations by TransportDB, KEGG and RAST. Our analysis shows that annotation using multiple tools can result in a drastically larger metabolic network reconstruction, adding on average 40% more EC numbers, 3–8 times more substrate-specific transporters, and 37% more metabolic genes. These results are even more pronounced for bacterial species that are phylogenetically distant from well-studied model organisms such as E. coli. Conclusions Metabolic annotations are often incomplete and inconsistent. Combining multiple functional annotation tools can greatly improve genome coverage and metabolic network size, especially for non-model organisms and non-core pathways. Electronic supplementary material The online version of this article (10.1186/s12864-018-5221-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marc Griesemer
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA
| | - Jeffrey A Kimbrel
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA
| | - Carol E Zhou
- Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA
| | - Ali Navid
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA
| | - Patrik D'haeseleer
- Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA. .,Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA, 94551, USA.
| |
Collapse
|
19
|
Hale VL, Jeraldo P, Chen J, Mundy M, Yao J, Priya S, Keeney G, Lyke K, Ridlon J, White BA, French AJ, Thibodeau SN, Diener C, Resendis-Antonio O, Gransee J, Dutta T, Petterson XM, Sung J, Blekhman R, Boardman L, Larson D, Nelson H, Chia N. Distinct microbes, metabolites, and ecologies define the microbiome in deficient and proficient mismatch repair colorectal cancers. Genome Med 2018; 10:78. [PMID: 30376889 PMCID: PMC6208080 DOI: 10.1186/s13073-018-0586-6] [Citation(s) in RCA: 90] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2018] [Accepted: 10/08/2018] [Indexed: 01/14/2023] Open
Abstract
Background Links between colorectal cancer (CRC) and the gut microbiome have been established, but the specific microbial species and their role in carcinogenesis remain an active area of inquiry. Our understanding would be enhanced by better accounting for tumor subtype, microbial community interactions, metabolism, and ecology. Methods We collected paired colon tumor and normal-adjacent tissue and mucosa samples from 83 individuals who underwent partial or total colectomies for CRC. Mismatch repair (MMR) status was determined in each tumor sample and classified as either deficient MMR (dMMR) or proficient MMR (pMMR) tumor subtypes. Samples underwent 16S rRNA gene sequencing and a subset of samples from 50 individuals were submitted for targeted metabolomic analysis to quantify amino acids and short-chain fatty acids. A PERMANOVA was used to identify the biological variables that explained variance within the microbial communities. dMMR and pMMR microbial communities were then analyzed separately using a generalized linear mixed effects model that accounted for MMR status, sample location, intra-subject variability, and read depth. Genome-scale metabolic models were then used to generate microbial interaction networks for dMMR and pMMR microbial communities. We assessed global network properties as well as the metabolic influence of each microbe within the dMMR and pMMR networks. Results We demonstrate distinct roles for microbes in dMMR and pMMR CRC. Bacteroides fragilis and sulfidogenic Fusobacterium nucleatum were significantly enriched in dMMR CRC, but not pMMR CRC. These findings were further supported by metabolic modeling and metabolomics indicating suppression of B. fragilis in pMMR CRC and increased production of amino acid proxies for hydrogen sulfide in dMMR CRC. Conclusions Integrating tumor biology and microbial ecology highlighted distinct microbial, metabolic, and ecological properties unique to dMMR and pMMR CRC. This approach could critically improve our ability to define, predict, prevent, and treat colorectal cancers. Electronic supplementary material The online version of this article (10.1186/s13073-018-0586-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Vanessa L Hale
- Department of Veterinary Preventive Medicine, The Ohio State University College of Veterinary Medicine, Columbus, OH, USA.,Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA.,Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Patricio Jeraldo
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA.,Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Jun Chen
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA.,Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Michael Mundy
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA
| | - Janet Yao
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA
| | - Sambhawa Priya
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, USA
| | - Gary Keeney
- Division of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Kelly Lyke
- Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Jason Ridlon
- Carl R. Woese Institute for Genomic Biology, Department of Animal Sciences, Division of Nutritional Sciences, University of Illinois, Urbana-Champaign, IL, USA
| | - Bryan A White
- Carl R. Woese Institute for Genomic Biology, Department of Animal Sciences, Division of Nutritional Sciences, University of Illinois, Urbana-Champaign, IL, USA
| | - Amy J French
- Division of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Stephen N Thibodeau
- Division of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Christian Diener
- Human Systems Biology Laboratory, National Institute of Genomic Medicine, Mexico City, Mexico
| | - Osbaldo Resendis-Antonio
- Human Systems Biology Laboratory, National Institute of Genomic Medicine, Mexico City, Mexico.,Coordinación de la Investigación Científica, Red de Apoyo a la Investigación, UNAM, Mexico City, Mexico
| | - Jaime Gransee
- Mayo Clinic Metabolomics Core Laboratory, Mayo Clinic, Rochester, MN, USA
| | - Tumpa Dutta
- Mayo Clinic Metabolomics Core Laboratory, Mayo Clinic, Rochester, MN, USA
| | - Xuan-Mai Petterson
- Mayo Clinic Metabolomics Core Laboratory, Mayo Clinic, Rochester, MN, USA
| | - Jaeyun Sung
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA.,Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Ran Blekhman
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, MN, USA
| | - Lisa Boardman
- Division of Gastroenterology and Hepatology, Mayo Clinic, Rochester, MN, USA
| | - David Larson
- Division of Colon and Rectal Surgery, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Heidi Nelson
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA.,Division of Colon and Rectal Surgery, Department of Surgery, Mayo Clinic, Rochester, MN, USA
| | - Nicholas Chia
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA. .,Division of Surgical Research, Department of Surgery, Mayo Clinic, Rochester, MN, USA.
| |
Collapse
|
20
|
Medlock GL, Carey MA, McDuffie DG, Mundy MB, Giallourou N, Swann JR, Kolling GL, Papin JA. Inferring Metabolic Mechanisms of Interaction within a Defined Gut Microbiota. Cell Syst 2018; 7:245-257.e7. [PMID: 30195437 PMCID: PMC6166237 DOI: 10.1016/j.cels.2018.08.003] [Citation(s) in RCA: 66] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2018] [Revised: 06/15/2018] [Accepted: 08/03/2018] [Indexed: 12/20/2022]
Abstract
The diversity and number of species present within microbial communities create the potential for a multitude of interspecies metabolic interactions. Here, we develop, apply, and experimentally test a framework for inferring metabolic mechanisms associated with interspecies interactions. We perform pairwise growth and metabolome profiling of co-cultures of strains from a model mouse microbiota. We then apply our framework to dissect emergent metabolic behaviors that occur in co-culture. Based on one of the inferences from this framework, we identify and interrogate an amino acid cross-feeding interaction and validate that the proposed interaction leads to a growth benefit in vitro. Our results reveal the type and extent of emergent metabolic behavior in microbial communities composed of gut microbes. We focus on growth-modulating interactions, but the framework can be applied to interspecies interactions that modulate any phenotype of interest within microbial communities.
Collapse
Affiliation(s)
- Gregory L Medlock
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Maureen A Carey
- Department of Microbiology, Immunology, and Cancer Biology, University of Virginia, Charlottesville, VA, USA
| | - Dennis G McDuffie
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Michael B Mundy
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, USA
| | - Natasa Giallourou
- Department of Surgery and Cancer, Division of Integrative Systems Medicine and Digestive Diseases, Faculty of Medicine, Imperial College London, South Kensington, London, UK
| | - Jonathan R Swann
- Department of Surgery and Cancer, Division of Integrative Systems Medicine and Digestive Diseases, Faculty of Medicine, Imperial College London, South Kensington, London, UK
| | - Glynis L Kolling
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Jason A Papin
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA; Department of Medicine, Division of Infectious Diseases & International Health, University of Virginia, Charlottesville, VA, USA; Department of Biochemistry & Molecular Genetics, University of Virginia, Charlottesville, VA, USA.
| |
Collapse
|
21
|
Machado D, Andrejev S, Tramontano M, Patil KR. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities. Nucleic Acids Res 2018; 46:7542-7553. [PMID: 30192979 PMCID: PMC6125623 DOI: 10.1093/nar/gky537] [Citation(s) in RCA: 311] [Impact Index Per Article: 51.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 05/17/2018] [Accepted: 05/29/2018] [Indexed: 12/26/2022] Open
Abstract
Genome-scale metabolic models are instrumental in uncovering operating principles of cellular metabolism, for model-guided re-engineering, and unraveling cross-feeding in microbial communities. Yet, the application of genome-scale models, especially to microbial communities, is lagging behind the availability of sequenced genomes. This is largely due to the time-consuming steps of manual curation required to obtain good quality models. Here, we present an automated tool, CarveMe, for reconstruction of species and community level metabolic models. We introduce the concept of a universal model, which is manually curated and simulation ready. Starting with this universal model and annotated genome sequences, CarveMe uses a top-down approach to build single-species and community models in a fast and scalable manner. We show that CarveMe models perform closely to manually curated models in reproducing experimental phenotypes (substrate utilization and gene essentiality). Additionally, we build a collection of 74 models for human gut bacteria and test their ability to reproduce growth on a set of experimentally defined media. Finally, we create a database of 5587 bacterial models and demonstrate its potential for fast generation of microbial community models. Overall, CarveMe provides an open-source and user-friendly tool towards broadening the use of metabolic modeling in studying microbial species and communities.
Collapse
Affiliation(s)
- Daniel Machado
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Sergej Andrejev
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Melanie Tramontano
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Kiran Raosaheb Patil
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| |
Collapse
|
22
|
Kumar M, Ji B, Babaei P, Das P, Lappa D, Ramakrishnan G, Fox TE, Haque R, Petri WA, Bäckhed F, Nielsen J. Gut microbiota dysbiosis is associated with malnutrition and reduced plasma amino acid levels: Lessons from genome-scale metabolic modeling. Metab Eng 2018; 49:128-142. [PMID: 30075203 PMCID: PMC6871511 DOI: 10.1016/j.ymben.2018.07.018] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2018] [Revised: 07/30/2018] [Accepted: 07/30/2018] [Indexed: 12/31/2022]
Abstract
Malnutrition is a severe non-communicable disease, which is prevalent in children from low-income countries. Recently, a number of metagenomics studies have illustrated associations between the altered gut microbiota and child malnutrition. However, these studies did not examine metabolic functions and interactions between individual species in the gut microbiota during health and malnutrition. Here, we applied genome-scale metabolic modeling to model the gut microbial species, which were selected from healthy and malnourished children from three countries. Our analysis showed reduced metabolite production capabilities in children from two low-income countries compared with a high-income country. Additionally, the models were also used to predict the community-level metabolic potentials of gut microbes and the patterns of pairwise interactions among species. Hereby we found that due to bacterial interactions there may be reduced production of certain amino acids in malnourished children compared with healthy children from the same communities. To gain insight into alterations in the metabolism of malnourished (stunted) children, we also performed targeted plasma metabolic profiling in the first 2 years of life of 25 healthy and 25 stunted children. Plasma metabolic profiling further revealed that stunted children had reduced plasma levels of essential amino acids compared to healthy controls. Our analyses provide a framework for future efforts towards further characterization of gut microbial metabolic capabilities and their contribution to malnutrition.
Collapse
Affiliation(s)
- Manish Kumar
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden
| | - Boyang Ji
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden
| | - Parizad Babaei
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden
| | - Promi Das
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden
| | - Dimitra Lappa
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden
| | - Girija Ramakrishnan
- Department of Medicine/Division of Infectious Diseases, and University of Virginia, Charlottesville, VA, USA
| | - Todd E Fox
- Department of Pharmacology, University of Virginia, Charlottesville, VA, USA
| | - Rashidul Haque
- International Centre for Diarrheal Disease Research, Dhaka, Bangladesh
| | - William A Petri
- Department of Medicine/Division of Infectious Diseases, and University of Virginia, Charlottesville, VA, USA
| | - Fredrik Bäckhed
- The Wallenberg Laboratory, Department of Molecular and Clinical Medicine, University of Gothenburg, 41345 Gothenburg, Sweden; Novo Nordisk Foundation Center for Basic Metabolic Research, Section for Metabolic Receptology and Enteroendocrinology, Faculty of Health Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Jens Nielsen
- Department of Biology and Biological Engineering, Chalmers University of Technology, SE41128 Gothenburg, Sweden; Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK2800 Lyngby, Denmark.
| |
Collapse
|
23
|
Karp PD, Weaver D, Latendresse M. How accurate is automated gap filling of metabolic models? BMC SYSTEMS BIOLOGY 2018; 12:73. [PMID: 29914471 PMCID: PMC6006690 DOI: 10.1186/s12918-018-0593-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 05/31/2018] [Indexed: 12/20/2022]
Abstract
Background Reaction gap filling is a computational technique for proposing the addition of reactions to genome-scale metabolic models to permit those models to run correctly. Gap filling completes what are otherwise incomplete models that lack fully connected metabolic networks. The models are incomplete because they are derived from annotated genomes in which not all enzymes have been identified. Here we compare the results of applying an automated likelihood-based gap filler within the Pathway Tools software with the results of manually gap filling the same metabolic model. Both gap-filling exercises were applied to the same genome-derived qualitative metabolic reconstruction for Bifidobacterium longum subsp. longum JCM 1217, and to the same modeling conditions — anaerobic growth under four nutrients producing 53 biomass metabolites. Results The solution computed by the gap-filling program GenDev contained 12 reactions, but closer examination showed that solution was not minimal; two of the twelve reactions can be removed to yield a set of ten reactions that enable model growth. The manually curated solution contained 13 reactions, eight of which were shared with the 12-reaction computed solution. Thus, GenDev achieved recall of 61.5% and precision of 66.6%. These results suggest that although computational gap fillers are populating metabolic models with significant numbers of correct reactions, automatically gap-filled metabolic models also contain significant numbers of incorrect reactions. Conclusions Our conclusion is that manual curation of gap-filler results is needed to obtain high-accuracy models. Many of the differences between the manual and automatic solutions resulted from using expert biological knowledge to direct the choice of reactions within the curated solution, such as reactions specific to the anaerobic lifestyle of B. longum. Electronic supplementary material The online version of this article (10.1186/s12918-018-0593-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Peter D Karp
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave, Menlo Park, 94025, USA.
| | - Daniel Weaver
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave, Menlo Park, 94025, USA
| | - Mario Latendresse
- Bioinformatics Research Group, SRI International, 333 Ravenswood Ave, Menlo Park, 94025, USA
| |
Collapse
|
24
|
Mundy M, Mendes-Soares H, Chia N. Mackinac: a bridge between ModelSEED and COBRApy to generate and analyze genome-scale metabolic models. Bioinformatics 2018; 33:2416-2418. [PMID: 28379466 PMCID: PMC5860119 DOI: 10.1093/bioinformatics/btx185] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2017] [Accepted: 03/28/2017] [Indexed: 12/18/2022] Open
Abstract
Summary Reconstructing and analyzing a large number of genome-scale metabolic models is a fundamental part of the integrated study of microbial communities; however, two of the most widely used frameworks for building and analyzing models use different metabolic network representations. Here we describe Mackinac, a Python package that combines ModelSEED’s ability to automatically reconstruct metabolic models with COBRApy’s advanced analysis capabilities to bridge the differences between the two frameworks and facilitate the study of the metabolic potential of microorganisms. Availability and Implementation This package works with Python 2.7, 3.4, and 3.5 on MacOS, Linux and Windows. The source code is available from https://github.com/mmundy42/mackinac.
Collapse
Affiliation(s)
| | | | - Nicholas Chia
- Center for Individualized Medicine.,Department of Surgery.,Department of Physiology and Biomedical Engineering, Mayo Clinic, Rochester, MN, USA
| |
Collapse
|
25
|
Advances in analytical tools for high throughput strain engineering. Curr Opin Biotechnol 2018; 54:33-40. [PMID: 29448095 DOI: 10.1016/j.copbio.2018.01.027] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Revised: 01/24/2018] [Accepted: 01/28/2018] [Indexed: 01/09/2023]
Abstract
The emergence of inexpensive, base-perfect genome editing is revolutionising biology. Modern industrial biotechnology exploits the advances in genome editing in combination with automation, analytics and data integration to build high-throughput automated strain engineering pipelines also known as biofoundries. Biofoundries replace the slow and inconsistent artisanal processes used to build microbial cell factories with an automated design-build-test cycle, considerably reducing the time needed to deliver commercially viable strains. Testing and hence learning remains relatively shallow, but recent advances in analytical chemistry promise to increase the depth of characterization possible. Analytics combined with models of cellular physiology in automated systems biology pipelines should enable deeper learning and hence a steeper pitch of the learning cycle. This review explores the progress, advances and remaining bottlenecks of analytical tools for high throughput strain engineering.
Collapse
|
26
|
Advances in gap-filling genome-scale metabolic models and model-driven experiments lead to novel metabolic discoveries. Curr Opin Biotechnol 2017; 51:103-108. [PMID: 29278837 DOI: 10.1016/j.copbio.2017.12.012] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2017] [Revised: 12/08/2017] [Accepted: 12/08/2017] [Indexed: 12/18/2022]
Abstract
With rapid improvements in next-generation sequencing technologies, our knowledge about metabolism of many organisms is rapidly increasing. However, gaps in metabolic networks exist due to incomplete knowledge (e.g., missing reactions, unknown pathways, unannotated and misannotated genes, promiscuous enzymes, and underground metabolic pathways). In this review, we discuss recent advances in gap-filling algorithms based on genome-scale metabolic models and the importance of both high-throughput experiments and detailed biochemical characterization, which work in concert with in silico methods, to allow a more accurate and comprehensive understanding of metabolism.
Collapse
|
27
|
Krumholz EW, Libourel IGL. Thermodynamic Constraints Improve Metabolic Networks. Biophys J 2017; 113:679-689. [PMID: 28793222 DOI: 10.1016/j.bpj.2017.06.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Revised: 05/24/2017] [Accepted: 06/02/2017] [Indexed: 10/19/2022] Open
Abstract
In pursuit of establishing a realistic metabolic phenotypic space, the reversibility of reactions is thermodynamically constrained in modern metabolic networks. The reversibility constraints follow from heuristic thermodynamic poise approximations that take anticipated cellular metabolite concentration ranges into account. Because constraints reduce the feasible space, draft metabolic network reconstructions may need more extensive reconciliation, and a larger number of genes may become essential. Notwithstanding ubiquitous application, the effect of reversibility constraints on the predictive capabilities of metabolic networks has not been investigated in detail. Instead, work has focused on the implementation and validation of the thermodynamic poise calculation itself. With the advance of fast linear programming-based network reconciliation, the effects of reversibility constraints on network reconciliation and gene essentiality predictions have become feasible and are the subject of this study. Networks with thermodynamically informed reversibility constraints outperformed gene essentiality predictions compared to networks that were constrained with randomly shuffled constraints. Unconstrained networks predicted gene essentiality as accurately as thermodynamically constrained networks, but predicted substantially fewer essential genes. Networks that were reconciled with sequence similarity data and strongly enforced reversibility constraints outperformed all other networks. We conclude that metabolic network analysis confirmed the validity of the thermodynamic constraints, and that thermodynamic poise information is actionable during network reconciliation.
Collapse
Affiliation(s)
- Elias W Krumholz
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota
| | - Igor G L Libourel
- Department of Plant and Microbial Biology, University of Minnesota, Saint Paul, Minnesota; Biotechnology Institute, University of Minnesota, Saint Paul, Minnesota; Genome Craft, Saint Paul, Minnesota.
| |
Collapse
|
28
|
Abstract
The increasing prevalence of infections involving intracellular apicomplexan parasites such as Plasmodium, Toxoplasma, and Cryptosporidium (the causative agents of malaria, toxoplasmosis, and cryptosporidiosis, respectively) represent a significant global healthcare burden. Despite their significance, few treatments are available; a situation that is likely to deteriorate with the emergence of new resistant strains of parasites. To lay the foundation for programs of drug discovery and vaccine development, genome sequences for many of these organisms have been generated, together with large-scale expression and proteomic datasets. Comparative analyses of these datasets are beginning to identify the molecular innovations supporting both conserved processes mediating fundamental roles in parasite survival and persistence, as well as lineage-specific adaptations associated with divergent life-cycle strategies. The challenge is how best to exploit these data to derive insights into parasite virulence and identify those genes representing the most amenable targets. In this review, we outline genomic datasets currently available for apicomplexans and discuss biological insights that have emerged as a consequence of their analysis. Of particular interest are systems-based resources, focusing on areas of metabolism and host invasion that are opening up opportunities for discovering new therapeutic targets.
Collapse
Affiliation(s)
| | - John Parkinson
- a Program in Molecular Structure and Function , Hospital for Sick Children , Toronto , Ontario , Canada
- b Departments of Biochemistry, Molecular Genetics and Computer Science , University of Toronto , Toronto , Ontario , Canada
| |
Collapse
|
29
|
Chen PW, Theisen MK, Liao JC. Metabolic systems modeling for cell factories improvement. Curr Opin Biotechnol 2017; 46:114-119. [PMID: 28388485 DOI: 10.1016/j.copbio.2017.02.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2016] [Revised: 02/10/2017] [Accepted: 02/13/2017] [Indexed: 12/23/2022]
Abstract
Techniques for modeling microbial bioproduction systems have evolved over many decades. Here, we survey recent literature and focus on modeling approaches for improving bioproduction. These techniques from systems biology are based on different methodologies, starting from stoichiometry only to various stoichiometry with kinetics approaches that address different issues in metabolic systems. Techniques to overcome unknown kinetic parameters using random sampling have emerged to address meaningful questions. Among those questions, pathway robustness seems to be an important issue for metabolic engineering. We also discuss the increasing significance of databases in biology and their potential impact for biotechnology.
Collapse
Affiliation(s)
- Po-Wei Chen
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - Matthew K Theisen
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, CA 90095, United States
| | - James C Liao
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, CA 90095, United States; Academia Sinica, Taipei 11529, Taiwan.
| |
Collapse
|
30
|
Mendes-Soares H, Chia N. Community metabolic modeling approaches to understanding the gut microbiome: Bridging biochemistry and ecology. Free Radic Biol Med 2017; 105:102-109. [PMID: 27989793 PMCID: PMC5401773 DOI: 10.1016/j.freeradbiomed.2016.12.017] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/23/2016] [Revised: 11/27/2016] [Accepted: 12/12/2016] [Indexed: 12/27/2022]
Abstract
Interest in the human microbiome is at an all time high. The number of human microbiome studies is growing exponentially, as are reported associations between microbial communities and disease. However, we have not been able to translate the ever-growing amount of microbiome sequence data into better health. To do this, we need a practical means of transforming a disease-associated microbiome into a health-associated microbiome. This will require a framework that can be used to generate predictions about community dynamics within the microbiome under different conditions, predictions that can be tested and validated. In this review, using the gut microbiome to illustrate, we describe two classes of model that are currently being used to generate predictions about microbial community dynamics: ecological models and metabolic models. We outline the strengths and weaknesses of each approach and discuss the insights into the gut microbiome that have emerged from modeling thus far. We then argue that the two approaches can be combined to yield a community metabolic model, which will supply the framework needed to move from high-throughput omics data to testable predictions about how prebiotic, probiotic, and nutritional interventions affect the microbiome. We are confident that with a suitable model, researchers and clinicians will be able to harness the stream of sequence data and begin designing strategies to make targeted alterations to the microbiome and improve health.
Collapse
Affiliation(s)
- Helena Mendes-Soares
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA; Department of Surgery, Mayo Clinic, Rochester, MN 55905, USA
| | - Nicholas Chia
- Microbiome Program, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA; Department of Surgery, Mayo Clinic, Rochester, MN 55905, USA; Department of Bioengineering and Physiology, College of Medicine, Mayo Clinic, Rochester, MN 55905, USA.
| |
Collapse
|
31
|
Biggs MB, Papin JA. Managing uncertainty in metabolic network structure and improving predictions using EnsembleFBA. PLoS Comput Biol 2017; 13:e1005413. [PMID: 28263984 PMCID: PMC5358886 DOI: 10.1371/journal.pcbi.1005413] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2016] [Revised: 03/20/2017] [Accepted: 02/15/2017] [Indexed: 11/19/2022] Open
Abstract
Genome-scale metabolic network reconstructions (GENREs) are repositories of knowledge about the metabolic processes that occur in an organism. GENREs have been used to discover and interpret metabolic functions, and to engineer novel network structures. A major barrier preventing more widespread use of GENREs, particularly to study non-model organisms, is the extensive time required to produce a high-quality GENRE. Many automated approaches have been developed which reduce this time requirement, but automatically-reconstructed draft GENREs still require curation before useful predictions can be made. We present a novel approach to the analysis of GENREs which improves the predictive capabilities of draft GENREs by representing many alternative network structures, all equally consistent with available data, and generating predictions from this ensemble. This ensemble approach is compatible with many reconstruction methods. We refer to this new approach as Ensemble Flux Balance Analysis (EnsembleFBA). We validate EnsembleFBA by predicting growth and gene essentiality in the model organism Pseudomonas aeruginosa UCBPP-PA14. We demonstrate how EnsembleFBA can be included in a systems biology workflow by predicting essential genes in six Streptococcus species and mapping the essential genes to small molecule ligands from DrugBank. We found that some metabolic subsystems contributed disproportionately to the set of predicted essential reactions in a way that was unique to each Streptococcus species, leading to species-specific outcomes from small molecule interactions. Through our analyses of P. aeruginosa and six Streptococci, we show that ensembles increase the quality of predictions without drastically increasing reconstruction time, thus making GENRE approaches more practical for applications which require predictions for many non-model organisms. All of our functions and accompanying example code are available in an open online repository. Metabolism is the driving force behind all biological activity. Genome-scale metabolic network reconstructions (GENREs) are representations of metabolic systems that can be analyzed mathematically to make predictions about how a system will behave, as well as to design systems with new properties. GENREs have traditionally been reconstructed manually, which can require extensive time and effort. Recent software solutions automate the process (drastically reducing the required effort) but the resulting GENREs are of lower quality and produce less reliable predictions than the manually-curated versions. We present a novel method (“EnsembleFBA”) which accounts for uncertainties involved in automated reconstruction by pooling many different draft GENREs together into an ensemble. We tested EnsembleFBA by predicting the growth and essential genes of the common pathogen Pseudomonas aeruginosa. We found that when predicting growth or essential genes, ensembles of GENREs achieved much better precision or captured many more essential genes than any of the individual GENREs within the ensemble. By improving the predictions that can be made with automatically-generated GENREs, this approach enables the modeling of biochemical systems which would otherwise be infeasible.
Collapse
Affiliation(s)
- Matthew B. Biggs
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, United States of America
| | - Jason A. Papin
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, United States of America
- * E-mail:
| |
Collapse
|
32
|
Hosseini Z, Marashi SA. Discovering missing reactions of metabolic networks by using gene co-expression data. Sci Rep 2017; 7:41774. [PMID: 28150713 PMCID: PMC5288723 DOI: 10.1038/srep41774] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Accepted: 12/30/2016] [Indexed: 11/23/2022] Open
Abstract
Flux coupling analysis is a computational method which is able to explain co-expression of metabolic genes by analyzing the topological structure of a metabolic network. It has been suggested that if genes in two seemingly fully-coupled reactions are not highly co-expressed, then these two reactions are not fully coupled in reality, and hence, there is a gap or missing reaction in the network. Here, we present GAUGE as a novel approach for gap filling of metabolic networks, which is a two-step algorithm based on a mixed integer linear programming formulation. In GAUGE, the discrepancies between experimental co-expression data and predicted flux coupling relations is minimized by adding a minimum number of reactions to the network. We show that GAUGE is able to predict missing reactions of E. coli metabolism that are not detectable by other popular gap filling approaches. We propose that our algorithm may be used as a complementary strategy for the gap filling problem of metabolic networks. Since GAUGE relies only on gene expression data, it can be potentially useful for exploring missing reactions in the metabolism of non-model organisms, which are often poorly characterized, cannot grow in the laboratory, and lack genetic tools for generating knockouts.
Collapse
Affiliation(s)
- Zhaleh Hosseini
- Department of Biotechnology, College of science, University of Tehran, Tehran, Iran
| | - Sayed-Amir Marashi
- Department of Biotechnology, College of science, University of Tehran, Tehran, Iran
| |
Collapse
|
33
|
Liu L, Zhang Z, Sheng T, Chen M. DEF: an automated dead-end filling approach based on quasi-endosymbiosis. Bioinformatics 2017; 33:405-413. [PMID: 28171511 DOI: 10.1093/bioinformatics/btw604] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2015] [Revised: 06/27/2016] [Accepted: 09/16/2016] [Indexed: 11/15/2022] Open
Abstract
Motivation Gap filling for the reconstruction of metabolic networks is to restore the connectivity of metabolites via finding high-confidence reactions that could be missed in target organism. Current methods for gap filling either fall into the network topology or have limited capability in finding missing reactions that are indirectly related to dead-end metabolites but of biological importance to the target model. Results We present an automated dead-end filling (DEF) approach, which is derived from the wisdom of endosymbiosis theory, to fill gaps by finding the most efficient dead-end utilization paths in a constructed quasi-endosymbiosis model. The recalls of reactions and dead ends of DEF reach around 73% and 86%, respectively. This method is capable of finding indirectly dead-end-related reactions with biological importance for the target organism and is applicable to any given metabolic model. In the E. coli iJR904 model, for instance, about 42% of the dead-end metabolites were fixed by our proposed method. Availabilty and Implementaion DEF is publicly available at http://bis.zju.edu.cn/DEF/. Contact mchen@zju.edu.cn Supplimentary Information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lili Liu
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Zijun Zhang
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China.,Department of Bioinformatics, University of California, Los Angeles, Los Angeles, CA, USA
| | - Taotao Sheng
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
| | - Ming Chen
- Department of Bioinformatics, College of Life Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
34
|
Prigent S, Frioux C, Dittami SM, Thiele S, Larhlimi A, Collet G, Gutknecht F, Got J, Eveillard D, Bourdon J, Plewniak F, Tonon T, Siegel A. Meneco, a Topology-Based Gap-Filling Tool Applicable to Degraded Genome-Wide Metabolic Networks. PLoS Comput Biol 2017; 13:e1005276. [PMID: 28129330 PMCID: PMC5302834 DOI: 10.1371/journal.pcbi.1005276] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2015] [Revised: 02/10/2017] [Accepted: 11/30/2016] [Indexed: 11/18/2022] Open
Abstract
Increasing amounts of sequence data are becoming available for a wide range of non-model organisms. Investigating and modelling the metabolic behaviour of those organisms is highly relevant to understand their biology and ecology. As sequences are often incomplete and poorly annotated, draft networks of their metabolism largely suffer from incompleteness. Appropriate gap-filling methods to identify and add missing reactions are therefore required to address this issue. However, current tools rely on phenotypic or taxonomic information, or are very sensitive to the stoichiometric balance of metabolic reactions, especially concerning the co-factors. This type of information is often not available or at least prone to errors for newly-explored organisms. Here we introduce Meneco, a tool dedicated to the topological gap-filling of genome-scale draft metabolic networks. Meneco reformulates gap-filling as a qualitative combinatorial optimization problem, omitting constraints raised by the stoichiometry of a metabolic network considered in other methods, and solves this problem using Answer Set Programming. Run on several artificial test sets gathering 10,800 degraded Escherichia coli networks Meneco was able to efficiently identify essential reactions missing in networks at high degradation rates, outperforming the stoichiometry-based tools in scalability. To demonstrate the utility of Meneco we applied it to two case studies. Its application to recent metabolic networks reconstructed for the brown algal model Ectocarpus siliculosus and an associated bacterium Candidatus Phaeomarinobacter ectocarpi revealed several candidate metabolic pathways for algal-bacterial interactions. Then Meneco was used to reconstruct, from transcriptomic and metabolomic data, the first metabolic network for the microalga Euglena mutabilis. These two case studies show that Meneco is a versatile tool to complete draft genome-scale metabolic networks produced from heterogeneous data, and to suggest relevant reactions that explain the metabolic capacity of a biological system.
Collapse
Affiliation(s)
- Sylvain Prigent
- Institute for Research in IT and Random Systems - IRISA, Université de Rennes 1, Rennes, France
- Department of Biology and Biological Engineering, Chalmers University of Technology, Göteborg, Sweden
- Irisa, CNRS, Rennes, France
- Dyliss, Inria, Rennes, France
- * E-mail: (AS); (SP)
| | - Clémence Frioux
- Institute for Research in IT and Random Systems - IRISA, Université de Rennes 1, Rennes, France
- Irisa, CNRS, Rennes, France
- Dyliss, Inria, Rennes, France
| | - Simon M. Dittami
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, UMR 8227, Integrative Biology of Marine Models, Station Biologique de Roscoff, Roscoff, France
| | | | - Abdelhalim Larhlimi
- Computer Science Laboratory of Nantes Atlantique - LINA UMR6241, Université de Nantes, Nantes, France
| | - Guillaume Collet
- Institute for Research in IT and Random Systems - IRISA, Université de Rennes 1, Rennes, France
- Irisa, CNRS, Rennes, France
- Dyliss, Inria, Rennes, France
| | - Fabien Gutknecht
- Molecular Genetics, Genomics and Microbiology - GMGM, Université de Strasbourg, Strasbourg, France
| | - Jeanne Got
- Institute for Research in IT and Random Systems - IRISA, Université de Rennes 1, Rennes, France
- Irisa, CNRS, Rennes, France
- Dyliss, Inria, Rennes, France
| | - Damien Eveillard
- Computer Science Laboratory of Nantes Atlantique - LINA UMR6241, Université de Nantes, Nantes, France
| | - Jérémie Bourdon
- Computer Science Laboratory of Nantes Atlantique - LINA UMR6241, Université de Nantes, Nantes, France
| | - Frédéric Plewniak
- Molecular Genetics, Genomics and Microbiology - GMGM, Université de Strasbourg, Strasbourg, France
- GMGM, CNRS, Strasbourg, France
| | - Thierry Tonon
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, UMR 8227, Integrative Biology of Marine Models, Station Biologique de Roscoff, Roscoff, France
| | - Anne Siegel
- Institute for Research in IT and Random Systems - IRISA, Université de Rennes 1, Rennes, France
- Irisa, CNRS, Rennes, France
- Dyliss, Inria, Rennes, France
- * E-mail: (AS); (SP)
| |
Collapse
|
35
|
Faria JP, Davis JJ, Edirisinghe JN, Taylor RC, Weisenhorn P, Olson RD, Stevens RL, Rocha M, Rocha I, Best AA, DeJongh M, Tintle NL, Parrello B, Overbeek R, Henry CS. Computing and Applying Atomic Regulons to Understand Gene Expression and Regulation. Front Microbiol 2016; 7:1819. [PMID: 27933038 PMCID: PMC5121216 DOI: 10.3389/fmicb.2016.01819] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 10/28/2016] [Indexed: 01/13/2023] Open
Abstract
Understanding gene function and regulation is essential for the interpretation, prediction, and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets, Atomic Regulons (ARs), represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here, we describe an approach for inferring ARs that leverages large-scale expression data sets, gene context, and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: Hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB, showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness (CLR) analysis, finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs, it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms, we computed ARs for Shewanella oneidensis, Pseudomonas aeruginosa, Thermus thermophilus, and Staphylococcus aureus, each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance, but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms, comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain.
Collapse
Affiliation(s)
- José P Faria
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA; Centre of Biological Engineering, University of Minho, Campus de GualtarBraga, Portugal; Mathematics and Computer Science Division, Argonne National LaboratoryArgonne, IL, USA
| | - James J Davis
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA
| | - Janaka N Edirisinghe
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA
| | - Ronald C Taylor
- Computational Biology and Bioinformatics Group, Pacific Northwest National Laboratory (U.S. Dept. of Energy) Richland, WA, USA
| | - Pamela Weisenhorn
- Mathematics and Computer Science Division, Argonne National Laboratory Argonne, IL, USA
| | - Robert D Olson
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA
| | - Rick L Stevens
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA; Department of Computer Science, Ryerson Physical Laboratory, University of ChicagoChicago, IL, USA
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Campus de Gualtar Braga, Portugal
| | - Isabel Rocha
- Centre of Biological Engineering, University of Minho, Campus de Gualtar Braga, Portugal
| | - Aaron A Best
- Biology Department, Hope College Holland, MI, USA
| | | | - Nathan L Tintle
- Department of Mathematics, Statistics and Computer Science, Dordt College Sioux Center, IA, USA
| | - Bruce Parrello
- Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA; Fellowship for Interpretation of GenomesBurr Ridge, IL, USA
| | - Ross Overbeek
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne, IL, USA; Fellowship for Interpretation of GenomesBurr Ridge, IL, USA
| | - Christopher S Henry
- Computation Institute, University of ChicagoChicago, IL, USA; Mathematics and Computer Science Division, Argonne National LaboratoryArgonne, IL, USA
| |
Collapse
|
36
|
Exploring Hydrogenotrophic Methanogenesis: a Genome Scale Metabolic Reconstruction of Methanococcus maripaludis. J Bacteriol 2016; 198:3379-3390. [PMID: 27736793 PMCID: PMC5116941 DOI: 10.1128/jb.00571-16] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Accepted: 09/22/2016] [Indexed: 02/03/2023] Open
Abstract
Hydrogenotrophic methanogenesis occurs in multiple environments, ranging from the intestinal tracts of animals to anaerobic sediments and hot springs. Energy conservation in hydrogenotrophic methanogens was long a mystery; only within the last decade was it reported that net energy conservation for growth depends on electron bifurcation. In this work, we focus on Methanococcus maripaludis, a well-studied hydrogenotrophic marine methanogen. To better understand hydrogenotrophic methanogenesis and compare it with methylotrophic methanogenesis that utilizes oxidative phosphorylation rather than electron bifurcation, we have built iMR539, a genome scale metabolic reconstruction that accounts for 539 of the 1,722 protein-coding genes of M. maripaludis strain S2. Our reconstructed metabolic network uses recent literature to not only represent the central electron bifurcation reaction but also incorporate vital biosynthesis and assimilation pathways, including unique cofactor and coenzyme syntheses. We show that our model accurately predicts experimental growth and gene knockout data, with 93% accuracy and a Matthews correlation coefficient of 0.78. Furthermore, we use our metabolic network reconstruction to probe the implications of electron bifurcation by showing its essentiality, as well as investigating the infeasibility of aceticlastic methanogenesis in the network. Additionally, we demonstrate a method of applying thermodynamic constraints to a metabolic model to quickly estimate overall free-energy changes between what comes in and out of the cell. Finally, we describe a novel reconstruction-specific computational toolbox we created to improve usability. Together, our results provide a computational network for exploring hydrogenotrophic methanogenesis and confirm the importance of electron bifurcation in this process. IMPORTANCE Understanding and applying hydrogenotrophic methanogenesis is a promising avenue for developing new bioenergy technologies around methane gas. Although a significant portion of biological methane is generated through this environmentally ubiquitous pathway, existing methanogen models portray the more traditional energy conservation mechanisms that are found in other methanogens. We have constructed a genome scale metabolic network of Methanococcus maripaludis that explicitly accounts for all major reactions involved in hydrogenotrophic methanogenesis. Our reconstruction demonstrates the importance of electron bifurcation in central metabolism, providing both a window into hydrogenotrophic methanogenesis and a hypothesis-generating platform to fuel metabolic engineering efforts.
Collapse
|
37
|
Sung J, Hale V, Merkel AC, Kim PJ, Chia N. Metabolic modeling with Big Data and the gut microbiome. Appl Transl Genom 2016; 10:10-5. [PMID: 27668170 PMCID: PMC5025471 DOI: 10.1016/j.atg.2016.02.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Revised: 01/19/2016] [Accepted: 02/02/2016] [Indexed: 12/20/2022]
Abstract
The recent advances in high-throughput omics technologies have enabled researchers to explore the intricacies of the human microbiome. On the clinical front, the gut microbial community has been the focus of many biomarker-discovery studies. While the recent deluge of high-throughput data in microbiome research has been vastly informative and groundbreaking, we have yet to capture the full potential of omics-based approaches. Realizing the promise of multi-omics data will require integration of disparate omics data, as well as a biologically relevant, mechanistic framework - or metabolic model - on which to overlay these data. Also, a new paradigm for metabolic model evaluation is necessary. Herein, we outline the need for multi-omics data integration, as well as the accompanying challenges. Furthermore, we present a framework for characterizing the ecology of the gut microbiome based on metabolic network modeling.
Collapse
Affiliation(s)
- Jaeyun Sung
- Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk 37673, Republic of Korea
| | - Vanessa Hale
- Center for Individualized Medicine, Microbiome Program, Mayo Clinic, Rochester, MN 55905, USA
- Department of Surgery, Mayo Clinic, Rochester, MN 55905, USA
| | - Annette C. Merkel
- Center for Individualized Medicine, Microbiome Program, Mayo Clinic, Rochester, MN 55905, USA
| | - Pan-Jun Kim
- Asia Pacific Center for Theoretical Physics, Pohang, Gyeongbuk 37673, Republic of Korea
- Department of Physics, Pohang University of Science and Technology, Pohang, Gyeongbuk 37673, Republic of Korea
| | - Nicholas Chia
- Center for Individualized Medicine, Microbiome Program, Mayo Clinic, Rochester, MN 55905, USA
- Department of Surgery, Mayo Clinic, Rochester, MN 55905, USA
- Department of Biomedical Engineering, Mayo College, Rochester, MN 55905, USA
| |
Collapse
|
38
|
Cuevas DA, Edirisinghe J, Henry CS, Overbeek R, O’Connell TG, Edwards RA. From DNA to FBA: How to Build Your Own Genome-Scale Metabolic Model. Front Microbiol 2016; 7:907. [PMID: 27379044 PMCID: PMC4911401 DOI: 10.3389/fmicb.2016.00907] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2016] [Accepted: 05/27/2016] [Indexed: 11/19/2022] Open
Abstract
Microbiological studies are increasingly relying on in silico methods to perform exploration and rapid analysis of genomic data, and functional genomics studies are supplemented by the new perspectives that genome-scale metabolic models offer. A mathematical model consisting of a microbe's entire metabolic map can be rapidly determined from whole-genome sequencing and annotating the genomic material encoded in its DNA. Flux-balance analysis (FBA), a linear programming technique that uses metabolic models to predict the phenotypic responses imposed by environmental elements and factors, is the leading method to simulate and manipulate cellular growth in silico. However, the process of creating an accurate model to use in FBA consists of a series of steps involving a multitude of connections between bioinformatics databases, enzyme resources, and metabolic pathways. We present the methodology and procedure to obtain a metabolic model using PyFBA, an extensible Python-based open-source software package aimed to provide a platform where functional annotations are used to build metabolic models (http://linsalrob.github.io/PyFBA). Backed by the Model SEED biochemistry database, PyFBA contains methods to reconstruct a microbe's metabolic map, run FBA upon different media conditions, and gap-fill its metabolism. The extensibility of PyFBA facilitates novel techniques in creating accurate genome-scale metabolic models.
Collapse
Affiliation(s)
- Daniel A. Cuevas
- Computational Science Research Center, San Diego State University, San DiegoCA, USA
| | - Janaka Edirisinghe
- Mathematics and Computer Science Division, Argonne National Laboratory, ArgonneIL, USA
| | - Chris S. Henry
- Mathematics and Computer Science Division, Argonne National Laboratory, ArgonneIL, USA
| | - Ross Overbeek
- Fellowship for Interpretation of Genomes, Burr RidgeIL, USA
| | - Taylor G. O’Connell
- Biological and Medical Informatics Research Center, San Diego State University, San DiegoCA, USA
| | - Robert A. Edwards
- Computational Science Research Center, San Diego State University, San DiegoCA, USA
- Biological and Medical Informatics Research Center, San Diego State University, San DiegoCA, USA
- Department of Computer Science, San Diego State University, San DiegoCA, USA
- Department of Biology, San Diego State University, San DiegoCA, USA
| |
Collapse
|
39
|
Davis JJ, Gerdes S, Olsen GJ, Olson R, Pusch GD, Shukla M, Vonstein V, Wattam AR, Yoo H. PATtyFams: Protein Families for the Microbial Genomes in the PATRIC Database. Front Microbiol 2016; 7:118. [PMID: 26903996 PMCID: PMC4744870 DOI: 10.3389/fmicb.2016.00118] [Citation(s) in RCA: 118] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 01/22/2016] [Indexed: 01/12/2023] Open
Abstract
The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation, and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org) in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology) to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL). This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.
Collapse
Affiliation(s)
- James J Davis
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA
| | - Svetlana Gerdes
- Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA; Fellowship for Interpretation of GenomesBurr Ridge, IL, USA
| | - Gary J Olsen
- Department of Microbiology and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign Urbana, IL, USA
| | - Robert Olson
- Computation Institute, University of ChicagoChicago, IL, USA; Mathematics and Computer Science Division, Argonne National LaboratoryArgonne, IL, USA
| | - Gordon D Pusch
- Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA; Fellowship for Interpretation of GenomesBurr Ridge, IL, USA
| | - Maulik Shukla
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA
| | - Veronika Vonstein
- Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA; Fellowship for Interpretation of GenomesBurr Ridge, IL, USA
| | - Alice R Wattam
- Virginia Bioinformatics Institute, Virginia Tech University Blacksburg, VA, USA
| | - Hyunseung Yoo
- Computation Institute, University of ChicagoChicago, IL, USA; Computing, Environment and Life Sciences, Argonne National LaboratoryArgonne IL, USA
| |
Collapse
|
40
|
Heavner BD, Price ND. Comparative Analysis of Yeast Metabolic Network Models Highlights Progress, Opportunities for Metabolic Reconstruction. PLoS Comput Biol 2015; 11:e1004530. [PMID: 26566239 PMCID: PMC4643975 DOI: 10.1371/journal.pcbi.1004530] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 08/28/2015] [Indexed: 11/18/2022] Open
Abstract
We have compared 12 genome-scale models of the Saccharomyces cerevisiae metabolic network published since 2003 to evaluate progress in reconstruction of the yeast metabolic network. We compared the genomic coverage, overlap of annotated metabolites, predictive ability for single gene essentiality with a selection of model parameters, and biomass production predictions in simulated nutrient-limited conditions. We have also compared pairwise gene knockout essentiality predictions for 10 of these models. We found that varying approaches to model scope and annotation reflected the involvement of multiple research groups in model development; that single-gene essentiality predictions were affected by simulated medium, objective function, and the reference list of essential genes; and that predictive ability for single-gene essentiality did not correlate well with predictive ability for our reference list of synthetic lethal gene interactions (R = 0.159). We conclude that the reconstruction of the yeast metabolic network is indeed gradually improving through the iterative process of model development, and there remains great opportunity for advancing our understanding of biology through continued efforts to reconstruct the full biochemical reaction network that constitutes yeast metabolism. Additionally, we suggest that there is opportunity for refining the process of deriving a metabolic model from a metabolic network reconstruction to facilitate mechanistic investigation and discovery. This comparative study lays the groundwork for developing improved tools and formalized methods to quantitatively assess metabolic network reconstructions independently of any particular model application, which will facilitate ongoing efforts to advance our understanding of the relationship between genotype and cellular phenotype.
Collapse
Affiliation(s)
- Benjamin D. Heavner
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Nathan D. Price
- Institute for Systems Biology, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
41
|
Zomorrodi AR, Segrè D. Synthetic Ecology of Microbes: Mathematical Models and Applications. J Mol Biol 2015; 428:837-61. [PMID: 26522937 DOI: 10.1016/j.jmb.2015.10.019] [Citation(s) in RCA: 125] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Revised: 10/17/2015] [Accepted: 10/21/2015] [Indexed: 12/29/2022]
Abstract
As the indispensable role of natural microbial communities in many aspects of life on Earth is uncovered, the bottom-up engineering of synthetic microbial consortia with novel functions is becoming an attractive alternative to engineering single-species systems. Here, we summarize recent work on synthetic microbial communities with a particular emphasis on open challenges and opportunities in environmental sustainability and human health. We next provide a critical overview of mathematical approaches, ranging from phenomenological to mechanistic, to decipher the principles that govern the function, dynamics and evolution of microbial ecosystems. Finally, we present our outlook on key aspects of microbial ecosystems and synthetic ecology that require further developments, including the need for more efficient computational algorithms, a better integration of empirical methods and model-driven analysis, the importance of improving gene function annotation, and the value of a standardized library of well-characterized organisms to be used as building blocks of synthetic communities.
Collapse
Affiliation(s)
| | - Daniel Segrè
- Bioinformatics Program, Boston University, Boston, MA; Department of Biology, Boston University, Boston, MA; Department of Biomedical Engineering, Boston University, Boston, MA.
| |
Collapse
|
42
|
Biggs MB, Medlock GL, Kolling GL, Papin JA. Metabolic network modeling of microbial communities. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2015; 7:317-34. [PMID: 26109480 DOI: 10.1002/wsbm.1308] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Received: 04/01/2015] [Revised: 05/07/2015] [Accepted: 05/13/2015] [Indexed: 12/15/2022]
Abstract
Genome-scale metabolic network reconstructions and constraint-based analyses are powerful methods that have the potential to make functional predictions about microbial communities. Genome-scale metabolic networks are used to characterize the metabolic functions of microbial communities via several techniques including species compartmentalization, separating species-level and community-level objectives, dynamic analysis, the 'enzyme-soup' approach, multiscale modeling, and others. There are many challenges in the field, including a need for tools that accurately assign high-level omics signals to individual community members, the need for improved automated network reconstruction methods, and novel algorithms for integrating omics data and engineering communities. As technologies and modeling frameworks improve, we expect that there will be corresponding advances in the fields of ecology, health science, and microbial community engineering.
Collapse
Affiliation(s)
- Matthew B Biggs
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Gregory L Medlock
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | - Glynis L Kolling
- Department of Medicine, Infectious Diseases, University of Virginia, Charlottesville, VA, USA
| | - Jason A Papin
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
43
|
Alkema W, Boekhorst J, Wels M, van Hijum SAFT. Microbial bioinformatics for food safety and production. Brief Bioinform 2015; 17:283-92. [PMID: 26082168 PMCID: PMC4793891 DOI: 10.1093/bib/bbv034] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Indexed: 12/14/2022] Open
Abstract
In the production of fermented foods, microbes play an important role. Optimization of fermentation processes or starter culture production traditionally was a trial-and-error approach inspired by expert knowledge of the fermentation process. Current developments in high-throughput 'omics' technologies allow developing more rational approaches to improve fermentation processes both from the food functionality as well as from the food safety perspective. Here, the authors thematically review typical bioinformatics techniques and approaches to improve various aspects of the microbial production of fermented food products and food safety.
Collapse
|
44
|
Krumholz EW, Libourel IGL. Sequence-based Network Completion Reveals the Integrality of Missing Reactions in Metabolic Networks. J Biol Chem 2015; 290:19197-207. [PMID: 26041773 DOI: 10.1074/jbc.m114.634121] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Indexed: 11/06/2022] Open
Abstract
Genome-scale metabolic models are central in connecting genotypes to metabolic phenotypes. However, even for well studied organisms, such as Escherichia coli, draft networks do not contain a complete biochemical network. Missing reactions are referred to as gaps. These gaps need to be filled to enable functional analysis, and gap-filling choices influence model predictions. To investigate whether functional networks existed where all gap-filling reactions were supported by sequence similarity to annotated enzymes, four draft networks were supplemented with all reactions from the Model SEED database for which minimal sequence similarity was found in their genomes. Quadratic programming revealed that the number of reactions that could partake in a gap-filling solution was vast: 3,270 in the case of E. coli, where 72% of the metabolites in the draft network could connect a gap-filling solution. Nonetheless, no network could be completed without the inclusion of orphaned enzymes, suggesting that parts of the biochemistry integral to biomass precursor formation are uncharacterized. However, many gap-filling reactions were well determined, and the resulting networks showed improved prediction of gene essentiality compared with networks generated through canonical gap filling. In addition, gene essentiality predictions that were sensitive to poorly determined gap-filling reactions were of poor quality, suggesting that damage to the network structure resulting from the inclusion of erroneous gap-filling reactions may be predictable.
Collapse
Affiliation(s)
| | - Igor G L Libourel
- From the Department of Plant Biology and the Biotechnology Institute, University of Minnesota, Saint Paul, Minnesota 55108
| |
Collapse
|
45
|
Imam S, Schäuble S, Brooks AN, Baliga NS, Price ND. Data-driven integration of genome-scale regulatory and metabolic network models. Front Microbiol 2015; 6:409. [PMID: 25999934 PMCID: PMC4419725 DOI: 10.3389/fmicb.2015.00409] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Accepted: 04/20/2015] [Indexed: 12/21/2022] Open
Abstract
Microbes are diverse and extremely versatile organisms that play vital roles in all ecological niches. Understanding and harnessing microbial systems will be key to the sustainability of our planet. One approach to improving our knowledge of microbial processes is through data-driven and mechanism-informed computational modeling. Individual models of biological networks (such as metabolism, transcription, and signaling) have played pivotal roles in driving microbial research through the years. These networks, however, are highly interconnected and function in concert-a fact that has led to the development of a variety of approaches aimed at simulating the integrated functions of two or more network types. Though the task of integrating these different models is fraught with new challenges, the large amounts of high-throughput data sets being generated, and algorithms being developed, means that the time is at hand for concerted efforts to build integrated regulatory-metabolic networks in a data-driven fashion. In this perspective, we review current approaches for constructing integrated regulatory-metabolic models and outline new strategies for future development of these network models for any microbial system.
Collapse
Affiliation(s)
- Saheed Imam
- Institute for Systems Biology Seattle, WA, USA
| | - Sascha Schäuble
- Institute for Systems Biology Seattle, WA, USA ; Jena University Language and Information Engineering Lab, Friedrich-Schiller-University Jena Jena, Germany
| | | | - Nitin S Baliga
- Institute for Systems Biology Seattle, WA, USA ; Departments of Biology and Microbiology, University of Washington Seattle, WA, USA ; Molecular and Cellular Biology Program, University of Washington Seattle, WA, USA ; Lawrence Berkeley National Lab Berkeley, CA, USA
| | | |
Collapse
|
46
|
Genome-scale modeling for metabolic engineering. J Ind Microbiol Biotechnol 2015; 42:327-38. [PMID: 25578304 DOI: 10.1007/s10295-014-1576-3] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2014] [Accepted: 12/20/2014] [Indexed: 01/04/2023]
Abstract
We focus on the application of constraint-based methodologies and, more specifically, flux balance analysis in the field of metabolic engineering, and enumerate recent developments and successes of the field. We also review computational frameworks that have been developed with the express purpose of automatically selecting optimal gene deletions for achieving improved production of a chemical of interest. The application of flux balance analysis methods in rational metabolic engineering requires a metabolic network reconstruction and a corresponding in silico metabolic model for the microorganism in question. For this reason, we additionally present a brief overview of automated reconstruction techniques. Finally, we emphasize the importance of integrating metabolic networks with regulatory information-an area which we expect will become increasingly important for metabolic engineering-and present recent developments in the field of metabolic and regulatory integration.
Collapse
|
47
|
Transparency in metabolic network reconstruction enables scalable biological discovery. Curr Opin Biotechnol 2015; 34:105-9. [PMID: 25562137 DOI: 10.1016/j.copbio.2014.12.010] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 12/11/2014] [Accepted: 12/12/2014] [Indexed: 12/19/2022]
Abstract
Reconstructing metabolic pathways has long been a focus of active research. Now, draft models can be generated from genomic annotation and used to simulate metabolic fluxes of mass and energy at the whole-cell scale. This approach has led to an explosion in the number of functional metabolic network models. However, more models have not led to expanded coverage of metabolic reactions known to occur in the biosphere. Thus, there exists opportunity to reconsider the process of reconstruction and model derivation to better support the less-scalable investigative processes of biocuration and experimentation. Realizing this opportunity to improve our knowledge of metabolism requires developing new tools that make reconstructions more useful by highlighting metabolic network knowledge limitations to guide future research.
Collapse
|