1
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
2
|
Ramoneda J, Jensen TBN, Price MN, Casamayor EO, Fierer N. Taxonomic and environmental distribution of bacterial amino acid auxotrophies. Nat Commun 2023; 14:7608. [PMID: 37993466 PMCID: PMC10665431 DOI: 10.1038/s41467-023-43435-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 11/08/2023] [Indexed: 11/24/2023] Open
Abstract
Many microorganisms are auxotrophic-unable to synthesize the compounds they require for growth. With this work, we quantify the prevalence of amino acid auxotrophies across a broad diversity of bacteria and habitats. We predicted the amino acid biosynthetic capabilities of 26,277 unique bacterial genomes spanning 12 phyla using a metabolic pathway model validated with empirical data. Amino acid auxotrophy is widespread across bacterial phyla, but we conservatively estimate that the majority of taxa (78.4%) are able to synthesize all amino acids. Our estimates indicate that amino acid auxotrophies are more prevalent among obligate intracellular parasites and in free-living taxa with genomic attributes characteristic of 'streamlined' life history strategies. We predicted the amino acid biosynthetic capabilities of bacterial communities found in 12 unique habitats to investigate environmental associations with auxotrophy, using data compiled from 3813 samples spanning major aquatic, terrestrial, and engineered environments. Auxotrophic taxa were more abundant in host-associated environments (including the human oral cavity and gut) and in fermented food products, with auxotrophic taxa being relatively rare in soil and aquatic systems. Overall, this work contributes to a more complete understanding of amino acid auxotrophy across the bacterial tree of life and the ecological contexts in which auxotrophy can be a successful strategy.
Collapse
Affiliation(s)
- Josep Ramoneda
- Cooperative Institute for Research in Environmental Sciences (CIRES), University of Colorado, Boulder, CO, USA.
| | - Thomas B N Jensen
- Cooperative Institute for Research in Environmental Sciences (CIRES), University of Colorado, Boulder, CO, USA
- Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark
| | - Morgan N Price
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Emilio O Casamayor
- Spanish Research Council (CSIC), Center for Advanced Studies of Blanes (CEAB), Blanes, Spain
| | - Noah Fierer
- Cooperative Institute for Research in Environmental Sciences (CIRES), University of Colorado, Boulder, CO, USA.
- Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, CO, USA.
| |
Collapse
|
3
|
Sanchez-Alonso P, Cobos-Justo E, Avalos-Rangel MA, López-Reyes L, Paniagua-Contreras GL, Vaca-Paniagua F, Anastacio-Marcelino E, López-Ochoa AJ, Pérez Marquez VM, Negrete-Abascal E, Vázquez-Cruz C. A Maverick-like cluster in the genome of a pathogenic, moderately virulent strain of Gallibacterium anatis, ESV200, a transient biofilm producer. Front Microbiol 2023; 14:1084766. [PMID: 36778889 PMCID: PMC9909271 DOI: 10.3389/fmicb.2023.1084766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Accepted: 01/06/2023] [Indexed: 01/28/2023] Open
Abstract
Introduction Gallibacterium anatis causes gallibacteriosis in birds. These bacteria produce biofilms and secrete several fimbrial appendages as tools to cause disease in animals. G. anatis strains contain up to three types of fimbriae. Complete genome sequencing is the strategy currently used to determine variations in the gene content of G. anatis, although today only the completely circularized genome of G. anatis UMN179 is available. Methods The appearance of growth of various strains of G. anatis in liquid culture medium was studied. Biofilm production and how the amount of biofilm was affected by DNase, Proteinase K, and Pronase E enzymes were analyzed. Fimbrial gene expression was performed by protein analysis and qRT-PCR. In an avian model, the pathogenesis generated by the strains G. anatis ESV200 and 12656-12 was investigated. Using bioinformatic tools, the complete genome of G. anatis ESV200 was comparatively studied to search for virulence factors that would help explain the pathogenic behavior of this strain. Results and Discussion G. anatis ESV200 strain differs from the 12656-12 strain because it produces a biofilm at 20%. G. anatis ESV200 strain express fimbrial genes and produces biofilm but with a different structure than that observed for strain 12656-12. ESV200 and 12656-12 strains are pathogenic for chickens, although the latter is the most virulent. Here, we show that the complete genome of the ESV200 strain is similar to that of the UNM179 strain. However, these strains have evolved with many structural rearrangements; the most striking chromosomal arrangement is a Maverick-like element present in the ESV200 strain.
Collapse
Affiliation(s)
- Patricia Sanchez-Alonso
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico,*Correspondence: Patricia Sanchez-Alonso,
| | - Elena Cobos-Justo
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Miguel Angel Avalos-Rangel
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Lucía López-Reyes
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Gloria Luz Paniagua-Contreras
- Carrera de Biología, Facultad de Estudios Superiores de Iztacala, UNAM, Los Reyes Iztacala, Estado de, México, Mexico
| | - Felipe Vaca-Paniagua
- Carrera de Biología, Facultad de Estudios Superiores de Iztacala, UNAM, Los Reyes Iztacala, Estado de, México, Mexico,Subdirección de Investigación Basica, Instituto Nacional de Cancerología, CDMX, México
| | - Estela Anastacio-Marcelino
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Ana Jaqueline López-Ochoa
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico
| | - Victor M. Pérez Marquez
- Diagnóstico y Patobiología Aviar, Biotecnología Veterinaria S.A.-Biovetsa, BIOVETSA, Tehuacán, Mexico
| | - Erasmo Negrete-Abascal
- Carrera de Biología, Facultad de Estudios Superiores de Iztacala, UNAM, Los Reyes Iztacala, Estado de, México, Mexico
| | - Candelario Vázquez-Cruz
- Centro de Investigaciones en Ciencias Microbiológicas, Instituto de Ciencias, Benemérita Universidad Autónoma de Puebla, Puebla, Mexico,Candelario Vázquez-Cruz,
| |
Collapse
|
4
|
Mostolizadeh R, Glöckler M, Dräger A. Towards the human nasal microbiome: Simulating D. pigrum and S. aureus. Front Cell Infect Microbiol 2022; 12:925215. [PMID: 36605126 PMCID: PMC9810029 DOI: 10.3389/fcimb.2022.925215] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 08/15/2022] [Indexed: 01/12/2023] Open
Abstract
The human nose harbors various microbes that decisively influence the wellbeing and health of their host. Among the most threatening pathogens in this habitat is Staphylococcus aureus. Multiple epidemiological studies identify Dolosigranulum pigrum as a likely beneficial bacterium based on its positive association with health, including negative associations with S. aureus. Carefully curated GEMs are available for both bacterial species that reliably simulate their growth behavior in isolation. To unravel the mutual effects among bacteria, building community models for simulating co-culture growth is necessary. However, modeling microbial communities remains challenging. This article illustrates how applying the NCMW fosters our understanding of two microbes' joint growth conditions in the nasal habitat and their intricate interplay from a metabolic modeling perspective. The resulting community model combines the latest available curated GEMs of D. pigrum and S. aureus. This uses case illustrates how to incorporate genuine GEM of participating microorganisms and creates a basic community model mimicking the human nasal environment. Our analysis supports the role of negative microbe-microbe interactions involving D. pigrum examined experimentally in the lab. By this, we identify and characterize metabolic exchange factors involved in a specific interaction between D. pigrum and S. aureus as an in silico candidate factor for a deep insight into the associated species. This method may serve as a blueprint for developing more complex microbial interaction models. Its direct application suggests new ways to prevent disease-causing infections by inhibiting the growth of pathogens such as S. aureus through microbe-microbe interactions.
Collapse
Affiliation(s)
- Reihaneh Mostolizadeh
- Computational Systems Biology of Infections and Antimicrobial-Resistant Pathogens, Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Tübingen, Germany,Department of Computer Science, University of Tübingen, Tübingen, Germany,German Center for Infection Research (DZIF), Partner site, Tübingen, Germany,Cluster of Excellence ‘Controlling Microbes to Fight Infections’, University of Tübingen, Tübingen, Germany,*Correspondence: Reihaneh Mostolizadeh,
| | - Manuel Glöckler
- Department of Computer Science, University of Tübingen, Tübingen, Germany
| | - Andreas Dräger
- Computational Systems Biology of Infections and Antimicrobial-Resistant Pathogens, Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Tübingen, Germany,Department of Computer Science, University of Tübingen, Tübingen, Germany,German Center for Infection Research (DZIF), Partner site, Tübingen, Germany,Cluster of Excellence ‘Controlling Microbes to Fight Infections’, University of Tübingen, Tübingen, Germany
| |
Collapse
|
5
|
Integrative modeling of the cell. Acta Biochim Biophys Sin (Shanghai) 2022; 54:1213-1221. [PMID: 36017893 PMCID: PMC9909318 DOI: 10.3724/abbs.2022115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
A whole-cell model represents certain aspects of the cell structure and/or function. Due to the high complexity of the cell, an integrative modeling approach is often taken to utilize all available information including experimental data, prior knowledge and prior models. In this review, we summarize an emerging workflow of whole-cell modeling into five steps: (i) gather information; (ii) represent the modeled system into modules; (iii) translate input information into scoring function; (iv) sample the whole-cell model; (v) validate and interpret the model. In particular, we propose the integrative modeling of the cell by combining available (whole-cell) models to maximize the accuracy, precision, and completeness. In addition, we list quantitative predictions of various aspects of cell biology from existing whole-cell models. Moreover, we discuss the remaining challenges and future directions, and highlight the opportunity to establish an integrative spatiotemporal multi-scale whole-cell model based on a community approach.
Collapse
|
6
|
Buchner BA, Zanghellini J. EFMlrs: a Python package for elementary flux mode enumeration via lexicographic reverse search. BMC Bioinformatics 2021; 22:547. [PMID: 34758748 PMCID: PMC8579665 DOI: 10.1186/s12859-021-04417-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 09/27/2021] [Indexed: 12/02/2022] Open
Abstract
Background Elementary flux mode (EFM) analysis is a well-established, yet computationally challenging approach to characterize metabolic networks. Standard algorithms require huge amounts of memory and lack scalability which limits their application to single servers and consequently limits a comprehensive analysis to medium-scale networks. Recently, Avis et al. developed mplrs—a parallel version of the lexicographic reverse search (lrs) algorithm, which, in principle, enables an EFM analysis on high-performance computing environments (Avis and Jordan. mplrs: a scalable parallel vertex/facet enumeration code. arXiv:1511.06487, 2017). Here we test its applicability for EFM enumeration. Results We developed EFMlrs, a Python package that gives users access to the enumeration capabilities of mplrs. EFMlrs uses COBRApy to process metabolic models from sbml files, performs loss-free compressions of the stoichiometric matrix, and generates suitable inputs for mplrs as well as efmtool, providing support not only for our proposed new method for EFM enumeration but also for already established tools. By leveraging COBRApy, EFMlrs also allows the application of additional reaction boundaries and seamlessly integrates into existing workflows. Conclusion We show that due to mplrs’s properties, the algorithm is perfectly suited for high-performance computing (HPC) and thus offers new possibilities for the unbiased analysis of substantially larger metabolic models via EFM analyses. EFMlrs is an open-source program that comes together with a designated workflow and can be easily installed via pip. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04417-9.
Collapse
Affiliation(s)
- Bianca A Buchner
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria.,Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - Jürgen Zanghellini
- Department of Analytical Chemistry, University of Vienna, Vienna, Austria.
| |
Collapse
|
7
|
Kordium VA. Defining life and evolution: Essay on the origin, expansion, and evolution of living matter. Biosystems 2021; 209:104500. [PMID: 34352326 DOI: 10.1016/j.biosystems.2021.104500] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Accepted: 07/30/2021] [Indexed: 11/17/2022]
Abstract
This essay aims to define the origin, expansion, and evolution of living matter. The first formations, identified as remains, fossils, traces etc. of life are almost as old as the Earth itself. During four billion years, life on the Earth has continuously existed and been implemented in the range of conditions, ensuring the liquid state of water. During the entire period of life existence, its evolution was proceeding with the tendency of multidirectionality, after each catastrophe tending to the diversity and vastness of distribution, and all the currently living species, regardless of their complexity, have the same evolutionary age. The property of reproductive surplus (multiplication) is inherent in all the living matter. The reproduction of all the living matter is implemented via the "development" - a process of continuous occurrence of something new that did not exist in the previous moment in the reproduced individual at each specific moment of time with the tendency towards the reproduction of a "copy". In its fundamental basis, Life is based on a programme, its material support is implemented and exists not in the field of causative-consecutive events, but in the field of programmed-causative-consecutive events. This predetermines the "biology laws", the behaviour of the material constituent of Life at each time period, and the future of the material constituent of life.
Collapse
Affiliation(s)
- Vitaly A Kordium
- Institute of Molecular Biology and Genetics of the National Academy of Sciences of Ukraine, Kyiv, 03143, Ukraine.
| |
Collapse
|
8
|
Lachance J, Matteau D, Brodeur J, Lloyd CJ, Mih N, King ZA, Knight TF, Feist AM, Monk JM, Palsson BO, Jacques P, Rodrigue S. Genome-scale metabolic modeling reveals key features of a minimal gene set. Mol Syst Biol 2021; 17:e10099. [PMID: 34288418 PMCID: PMC8290834 DOI: 10.15252/msb.202010099] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 06/18/2021] [Accepted: 06/22/2021] [Indexed: 12/19/2022] Open
Abstract
Mesoplasma florum, a fast-growing near-minimal organism, is a compelling model to explore rational genome designs. Using sequence and structural homology, the set of metabolic functions its genome encodes was identified, allowing the reconstruction of a metabolic network representing ˜ 30% of its protein-coding genes. Growth medium simplification enabled substrate uptake and product secretion rate quantification which, along with experimental biomass composition, were integrated as species-specific constraints to produce the functional iJL208 genome-scale model (GEM) of metabolism. Genome-wide expression and essentiality datasets as well as growth data on various carbohydrates were used to validate and refine iJL208. Discrepancies between model predictions and observations were mechanistically explained using protein structures and network analysis. iJL208 was also used to propose an in silico reduced genome. Comparing this prediction to the minimal cell JCVI-syn3.0 and its parent JCVI-syn1.0 revealed key features of a minimal gene set. iJL208 is a stepping-stone toward model-driven whole-genome engineering.
Collapse
Affiliation(s)
| | - Dominick Matteau
- Département de BiologieUniversité de SherbrookeSherbrookeQCCanada
| | - Joëlle Brodeur
- Département de BiologieUniversité de SherbrookeSherbrookeQCCanada
| | - Colton J Lloyd
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Nathan Mih
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Zachary A King
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | | | - Adam M Feist
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
- Department of PediatricsUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Jonathan M Monk
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
| | - Bernhard O Palsson
- Department of BioengineeringUniversity of CaliforniaSan Diego, La JollaCAUSA
- Department of PediatricsUniversity of CaliforniaSan Diego, La JollaCAUSA
- Bioinformatics and Systems Biology ProgramUniversity of CaliforniaSan Diego, La JollaCAUSA
- Novo Nordisk Foundation Center for BiosustainabilityTechnical University of DenmarkLyngbyDenmark
| | | | | |
Collapse
|
9
|
Panikov NS. True and Illusory Benefits of Modeling: Comment on "Genome-Scale Metabolic Network Reconstruction and In Silico Analysis of Hexanoic Acid Producing Megasphaera elsdenii. Microorganisms 2020, 8, 539". Microorganisms 2020; 8:microorganisms8111742. [PMID: 33172047 PMCID: PMC7694653 DOI: 10.3390/microorganisms8111742] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 11/03/2020] [Indexed: 11/21/2022] Open
|
10
|
Suthers PF, Maranas CD. Challenges of cultivated meat production and applications of genome‐scale metabolic modeling. AIChE J 2020. [DOI: 10.1002/aic.16235] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Affiliation(s)
- Patrick F. Suthers
- Department of Chemical EngineeringThe Pennsylvania State University University Park Pennsylvania USA
| | - Costas D. Maranas
- Department of Chemical EngineeringThe Pennsylvania State University University Park Pennsylvania USA
| |
Collapse
|
11
|
Yang M, Rajeeve K, Rudel T, Dandekar T. Comprehensive Flux Modeling of Chlamydia trachomatis Proteome and qRT-PCR Data Indicate Biphasic Metabolic Differences Between Elementary Bodies and Reticulate Bodies During Infection. Front Microbiol 2019; 10:2350. [PMID: 31681215 PMCID: PMC6803457 DOI: 10.3389/fmicb.2019.02350] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Accepted: 09/26/2019] [Indexed: 11/13/2022] Open
Abstract
Metabolic adaptation to the host cell is important for obligate intracellular pathogens such as Chlamydia trachomatis (Ct). Here we infer the flux differences for Ct from proteome and qRT-PCR data by comprehensive pathway modeling. We compare the comparatively inert infectious elementary body (EB) and the active replicative reticulate body (RB) systematically using a genome-scale metabolic model with 321 metabolites and 277 reactions. This did yield 84 extreme pathways based on a published proteomics dataset at three different time points of infection. Validation of predictions was done by quantitative RT-PCR of enzyme mRNA expression at three time points. Ct’s major active pathways are glycolysis, gluconeogenesis, glycerol-phospholipid (GPL) biosynthesis (support from host acetyl-CoA) and pentose phosphate pathway (PPP), while its incomplete TCA and fatty acid biosynthesis are less active. The modeled metabolic pathways are much more active in RB than in EB. Our in silico model suggests that EB and RB utilize folate to generate NAD(P)H using independent pathways. The only low metabolic flux inferred for EB involves mainly carbohydrate metabolism. RB utilizes energy -rich compounds to generate ATP in nucleic acid metabolism. Validation data for the modeling include proteomics experiments (model basis) as well as qRT-PCR confirmation of selected metabolic enzyme mRNA expression differences. The metabolic modeling is made fully available here. Its detailed insights and models on Ct metabolic adaptations during infection are a useful modeling basis for future studies.
Collapse
Affiliation(s)
- Manli Yang
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
| | - Karthika Rajeeve
- Department of Microbiology, Biocenter, University of Würzburg, Würzburg, Germany.,Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Thomas Rudel
- Department of Microbiology, Biocenter, University of Würzburg, Würzburg, Germany
| | - Thomas Dandekar
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany.,European Molecular Biology Laboratory, Computational Biology and Structures Program, Heidelberg, Germany
| |
Collapse
|
12
|
De Carvalho NS, Palú G, Witkin SS. Mycoplasma genitalium, a stealth female reproductive tract. Eur J Clin Microbiol Infect Dis 2019; 39:229-234. [PMID: 31522281 DOI: 10.1007/s10096-019-03707-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Accepted: 09/06/2019] [Indexed: 10/26/2022]
Abstract
Mycoplasma genitalium was first isolated from the urethral swabs of two symptomatic men with urethritis in 1980. It is a sexually transmitted bacterium associated with a number of urogenital conditions in women like cervicitis, endometritis, pelvic inflammatory disease, infertility, and susceptibility to human immunodeficiency virus (HIV). However, M. genitalium may also act like a stealth pathogen at female reproductive tract, giving no symptoms. Its prevalence varies between different groups, with the average being 0.5-10% in the general population and 20-40% in women with sexually transmitted infections. The recommended treatment of this infection is azithromycin as a single 1-g dose. However, in recent years, macrolide resistance has increased which is significantly lowering the cure rate, being less than 50% in some studies. New treatment regimens need to be investigated due to increasing drug resistance. The discussion and suggestion of an algorithm for management of this infection is the highlight of this paper.
Collapse
Affiliation(s)
- Newton Sergio De Carvalho
- Obstetrics and Gynecology Department of Federal University of Paraná (DTG/UFPR), Gynecology and Obstetrics Infections Sector from Clinical Hospital - UFPR, Rua Saldanha Marinho 1422 - 801, Curitiba, Paraná, CEP 80430-160, Brazil.
| | - Gabriele Palú
- Obstetrics and Gynecology Department of Federal University of Paraná (DTG/UFPR), Gynecology and Obstetrics Infections Sector from Clinical Hospital - UFPR, Rua Saldanha Marinho 1422 - 801, Curitiba, Paraná, CEP 80430-160, Brazil
| | - Steven S Witkin
- Division of Immunology and Infectious Diseases, Department of Obstetrics and Gynecology, Weill Cornell Medicine, New York, NY, USA
| |
Collapse
|
13
|
Röhl A, Bockmayr A. Finding MEMo: minimum sets of elementary flux modes. J Math Biol 2019; 79:1749-1777. [PMID: 31388689 DOI: 10.1007/s00285-019-01409-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2018] [Revised: 07/15/2019] [Indexed: 10/26/2022]
Abstract
Metabolic network reconstructions are widely used in computational systems biology for in silico studies of cellular metabolism. A common approach to analyse these models are elementary flux modes (EFMs), which correspond to minimal functional units in the network. Already for medium-sized networks, it is often impossible to compute the set of all EFMs, due to their huge number. From a practical point of view, this might also not be necessary because a subset of EFMs may already be sufficient to answer relevant biological questions. In this article, we study MEMos or minimum sets of EFMs that can generate all possible steady-state behaviours of a metabolic network. The number of EFMs in a MEMo may be by several orders of magnitude smaller than the total number of EFMs. Using MEMos, we can compute generating sets of EFMs in metabolic networks where the whole set of EFMs is too large to be enumerated.
Collapse
Affiliation(s)
- Annika Röhl
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195, Berlin, Germany.
| | - Alexander Bockmayr
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195, Berlin, Germany
| |
Collapse
|
14
|
Metagenome level metabolic network reconstruction analysis reveals the microbiome in the Bogotá River is functionally close to the microbiome in produced water. Ecol Modell 2019. [DOI: 10.1016/j.ecolmodel.2019.02.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
15
|
Breuer M, Earnest TM, Merryman C, Wise KS, Sun L, Lynott MR, Hutchison CA, Smith HO, Lapek JD, Gonzalez DJ, de Crécy-Lagard V, Haas D, Hanson AD, Labhsetwar P, Glass JI, Luthey-Schulten Z. Essential metabolism for a minimal cell. eLife 2019; 8:36842. [PMID: 30657448 PMCID: PMC6609329 DOI: 10.7554/elife.36842] [Citation(s) in RCA: 89] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2018] [Accepted: 01/17/2019] [Indexed: 11/29/2022] Open
Abstract
JCVI-syn3A, a robust minimal cell with a 543 kbp genome and 493 genes, provides a versatile platform to study the basics of life. Using the vast amount of experimental information available on its precursor, Mycoplasma mycoides capri, we assembled a near-complete metabolic network with 98% of enzymatic reactions supported by annotation or experiment. The model agrees well with genome-scale in vivo transposon mutagenesis experiments, showing a Matthews correlation coefficient of 0.59. The genes in the reconstruction have a high in vivo essentiality or quasi-essentiality of 92% (68% essential), compared to 79% in silico essentiality. This coherent model of the minimal metabolism in JCVI-syn3A at the same time also points toward specific open questions regarding the minimal genome of JCVI-syn3A, which still contains many genes of generic or completely unclear function. In particular, the model, its comparison to in vivo essentiality and proteomics data yield specific hypotheses on gene functions and metabolic capabilities; and provide suggestions for several further gene removals. In this way, the model and its accompanying data guide future investigations of the minimal cell. Finally, the identification of 30 essential genes with unclear function will motivate the search for new biological mechanisms beyond metabolism. One way that researchers can test whether they understand a biological system is to see if they can accurately recreate it as a computer model. The more they learn about living things, the more the researchers can improve their models and the closer the models become to simulating the original. In this approach, it is best to start by trying to model a simple system. Biologists have previously succeeded in creating ‘minimal bacterial cells’. These synthetic cells contain fewer genes than almost all other living things and they are believed to be among the simplest possible forms of life that can grow on their own. The minimal cells can produce all the chemicals that they need to survive – in other words, they have a metabolism. Accurately recreating one of these cells in a computer is a key first step towards simulating a complete living system. Breuer et al. have developed a computer model to simulate the network of the biochemical reactions going on inside a minimal cell with just 493 genes. By altering the parameters of their model and comparing the results to experimental data, Breuer et al. explored the accuracy of their model. Overall, the model reproduces experimental results, but it is not yet perfect. The differences between the model and the experiments suggest new questions and tests that could advance our understanding of biology. In particular, Breuer et al. identified 30 genes that are essential for life in these cells but that currently have no known purpose. Continuing to develop and expand models like these to reproduce more complex living systems provides a tool to test current knowledge of biology. These models may become so advanced that they could predict how living things will respond to changing situations. This would allow scientists to test ideas sooner and make much faster progress in understanding life on Earth. Ultimately, these models could one day help to accelerate medical and industrial processes to save lives and enhance productivity.
Collapse
Affiliation(s)
- Marian Breuer
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, United States
| | - Tyler M Earnest
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, United States
| | | | - Kim S Wise
- J Craig Venter Institute, La Jolla, United States
| | - Lijie Sun
- J Craig Venter Institute, La Jolla, United States
| | | | | | | | - John D Lapek
- Department of Pharmacology and School of Pharmacy, University of California at San Diego, La Jolla, United States
| | - David J Gonzalez
- Department of Pharmacology and School of Pharmacy, University of California at San Diego, La Jolla, United States
| | - Valérie de Crécy-Lagard
- Department of Microbiology and Cell Science, University of Florida, Gainesville, United States
| | - Drago Haas
- Department of Microbiology and Cell Science, University of Florida, Gainesville, United States
| | - Andrew D Hanson
- Horticultural Sciences Department, University of Florida, Gainesville, United States
| | - Piyush Labhsetwar
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, United States
| | - John I Glass
- J Craig Venter Institute, La Jolla, United States
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, United States
| |
Collapse
|
16
|
Gerstl MP, Müller S, Regensburger G, Zanghellini J. Flux tope analysis: studying the coordination of reaction directions in metabolic networks. Bioinformatics 2019; 35:266-273. [PMID: 30649351 PMCID: PMC6330010 DOI: 10.1093/bioinformatics/bty550] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 05/30/2018] [Accepted: 06/29/2018] [Indexed: 01/06/2023] Open
Abstract
Motivation Elementary flux mode (EFM) analysis allows an unbiased description of metabolic networks in terms of minimal pathways (involving a minimal set of reactions). To date, the enumeration of EFMs is impracticable in genome-scale metabolic models. In a complementary approach, we introduce the concept of a flux tope (FT), involving a maximal set of reactions (with fixed directions), which allows one to study the coordination of reaction directions in metabolic networks and opens a new way for EFM enumeration. Results A FT is a (nontrivial) subset of the flux cone specified by fixing the directions of all reversible reactions. In a consistent metabolic network (without unused reactions), every FT contains a 'maximal pathway', carrying flux in all reactions. This decomposition of the flux cone into FTs allows the enumeration of EFMs (of individual FTs) without increasing the problem dimension by reaction splitting. To develop a mathematical framework for FT analysis, we build on the concepts of sign vectors and hyperplane arrangements. Thereby, we observe that FT analysis can be applied also to flux optimization problems involving additional (inhomogeneous) linear constraints. For the enumeration of FTs, we adapt the reverse search algorithm and provide an efficient implementation. We demonstrate that (biomass-optimal) FTs can be enumerated in genome-scale metabolic models of B.cuenoti and E.coli, and we use FTs to enumerate EFMs in models of M.genitalium and B.cuenoti. Availability and implementation The source code is freely available at https://github.com/mpgerstl/FTA. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Matthias P Gerstl
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria, EU
- Austrian Centre of Industrial Biotechnology, Vienna, Austria, EU
| | - Stefan Müller
- Faculty of Mathematics, University of Vienna, Vienna, Austria, EU
| | - Georg Regensburger
- Institute for Algebra, Johannes Kepler University Linz, Linz, Austria, EU
| | - Jürgen Zanghellini
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria, EU
- Austrian Centre of Industrial Biotechnology, Vienna, Austria, EU
- Austrian Biotech University of Applied Sciences, Tulln, Austria, EU
| |
Collapse
|
17
|
Machado D, Andrejev S, Tramontano M, Patil KR. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities. Nucleic Acids Res 2018; 46:7542-7553. [PMID: 30192979 PMCID: PMC6125623 DOI: 10.1093/nar/gky537] [Citation(s) in RCA: 325] [Impact Index Per Article: 54.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Revised: 05/17/2018] [Accepted: 05/29/2018] [Indexed: 12/26/2022] Open
Abstract
Genome-scale metabolic models are instrumental in uncovering operating principles of cellular metabolism, for model-guided re-engineering, and unraveling cross-feeding in microbial communities. Yet, the application of genome-scale models, especially to microbial communities, is lagging behind the availability of sequenced genomes. This is largely due to the time-consuming steps of manual curation required to obtain good quality models. Here, we present an automated tool, CarveMe, for reconstruction of species and community level metabolic models. We introduce the concept of a universal model, which is manually curated and simulation ready. Starting with this universal model and annotated genome sequences, CarveMe uses a top-down approach to build single-species and community models in a fast and scalable manner. We show that CarveMe models perform closely to manually curated models in reproducing experimental phenotypes (substrate utilization and gene essentiality). Additionally, we build a collection of 74 models for human gut bacteria and test their ability to reproduce growth on a set of experimentally defined media. Finally, we create a database of 5587 bacterial models and demonstrate its potential for fast generation of microbial community models. Overall, CarveMe provides an open-source and user-friendly tool towards broadening the use of metabolic modeling in studying microbial species and communities.
Collapse
Affiliation(s)
- Daniel Machado
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Sergej Andrejev
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Melanie Tramontano
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| | - Kiran Raosaheb Patil
- European Molecular Biology Laboratory (EMBL), Meyerhofstrasse 1, 69117 Heidelberg, Germany
| |
Collapse
|
18
|
Abdel-Haleem AM, Hefzi H, Mineta K, Gao X, Gojobori T, Palsson BO, Lewis NE, Jamshidi N. Functional interrogation of Plasmodium genus metabolism identifies species- and stage-specific differences in nutrient essentiality and drug targeting. PLoS Comput Biol 2018; 14:e1005895. [PMID: 29300748 PMCID: PMC5771636 DOI: 10.1371/journal.pcbi.1005895] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 01/17/2018] [Accepted: 11/24/2017] [Indexed: 12/17/2022] Open
Abstract
Several antimalarial drugs exist, but differences between life cycle stages among malaria species pose challenges for developing more effective therapies. To understand the diversity among stages and species, we reconstructed genome-scale metabolic models (GeMMs) of metabolism for five life cycle stages and five species of Plasmodium spanning the blood, transmission, and mosquito stages. The stage-specific models of Plasmodium falciparum uncovered stage-dependent changes in central carbon metabolism and predicted potential targets that could affect several life cycle stages. The species-specific models further highlight differences between experimental animal models and the human-infecting species. Comparisons between human- and rodent-infecting species revealed differences in thiamine (vitamin B1), choline, and pantothenate (vitamin B5) metabolism. Thus, we show that genome-scale analysis of multiple stages and species of Plasmodium can prioritize potential drug targets that could be both anti-malarials and transmission blocking agents, in addition to guiding translation from non-human experimental disease models. Malaria kills nearly one-half million people a year and over 1 billion people are at risk of becoming infected by the parasite. Plasmodial infections are difficult to treat for a myriad of reasons, but the ability of the organism to remain latent in hosts and the complex life cycles greatly contributed to the difficulty in treat malaria. Genome-scale metabolic models (GeMMs) enable hierarchical integration of disparate data types into a framework amenable to computational simulations enabling deeper mechanistic insights from high-throughput data measurements. In this study, GeMMs of multiple Plasmodium species are used to study metabolic similarities and differences across the Plasmodium genus. In silico gene-knock out simulations across species and stages uncovered functional metabolic differences between human- and rodent-infecting species as well as across the parasite’s life-cycle stages. These findings may help identify drug regimens that are more effective in targeting human-infecting species across multiple stages of the organism.
Collapse
Affiliation(s)
- Alyaa M. Abdel-Haleem
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Centre (CBRC), Thuwal, Saudi Arabia
- King Abdullah University of Science and Technology (KAUST), Biological and Environmental Sciences and Engineering (BESE) division, Thuwal, Saudi Arabia
| | - Hooman Hefzi
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, United States of America
- Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego School of Medicine, La Jolla, CA, United States of America
| | - Katsuhiko Mineta
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Centre (CBRC), Thuwal, Saudi Arabia
| | - Xin Gao
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Centre (CBRC), Thuwal, Saudi Arabia
| | - Takashi Gojobori
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Centre (CBRC), Thuwal, Saudi Arabia
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, United States of America
- Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego School of Medicine, La Jolla, CA, United States of America
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, United States of America
| | - Nathan E. Lewis
- Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego School of Medicine, La Jolla, CA, United States of America
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, United States of America
| | - Neema Jamshidi
- Institute of Engineering in Medicine, University of California, San Diego, La Jolla, CA, United States of America
- Department of Radiological Sciences, University of California, Los Angeles, CA, United States of America
- * E-mail: ,
| |
Collapse
|
19
|
The MONGOOSE Rational Arithmetic Toolbox. Methods Mol Biol 2017. [PMID: 29222749 DOI: 10.1007/978-1-4939-7528-0_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
The modeling of metabolic networks has seen a rapid expansion following the complete sequencing of thousands of genomes. The constraint-based modeling framework has emerged as one of the most popular approaches to reconstructing and analyzing genome-scale metabolic models. Its main assumption is that of a quasi-steady-state, requiring that the production of each internal metabolite be balanced by its consumption. However, due to the multiscale nature of the models, the large number of reactions and metabolites, and the use of floating-point arithmetic for the stoichiometric coefficients, ensuring that this assumption holds can be challenging.The MONGOOSE toolbox addresses this problem by using rational arithmetic, thus ensuring that models are analyzed in a reproducible manner and consistently with modeling assumptions. In this chapter we present a protocol for the complete analysis of a metabolic network model using the MONGOOSE toolbox, via its newly developed GUI, and describe how it can be used as a model-checking platform both during and after the model construction process.
Collapse
|
20
|
Kamminga T, Slagman SJ, Bijlsma JJE, Martins Dos Santos VAP, Suarez-Diez M, Schaap PJ. Metabolic modeling of energy balances in Mycoplasma hyopneumoniae shows that pyruvate addition increases growth rate. Biotechnol Bioeng 2017; 114:2339-2347. [PMID: 28600895 PMCID: PMC6084303 DOI: 10.1002/bit.26347] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 05/13/2017] [Accepted: 06/07/2017] [Indexed: 11/08/2022]
Abstract
Mycoplasma hyopneumoniae is cultured on large-scale to produce antigen for inactivated whole-cell vaccines against respiratory disease in pigs. However, the fastidious nutrient requirements of this minimal bacterium and the low growth rate make it challenging to reach sufficient biomass yield for antigen production. In this study, we sequenced the genome of M. hyopneumoniae strain 11 and constructed a high quality constraint-based genome-scale metabolic model of 284 chemical reactions and 298 metabolites. We validated the model with time-series data of duplicate fermentation cultures to aim for an integrated model describing the dynamic profiles measured in fermentations. The model predicted that 84% of cellular energy in a standard M. hyopneumoniae cultivation was used for non-growth associated maintenance and only 16% of cellular energy was used for growth and growth associated maintenance. Following a cycle of model-driven experimentation in dedicated fermentation experiments, we were able to increase the fraction of cellular energy used for growth through pyruvate addition to the medium. This increase in turn led to an increase in growth rate and a 2.3 times increase in the total biomass concentration reached after 3-4 days of fermentation, enhancing the productivity of the overall process. The model presented provides a solid basis to understand and further improve M. hyopneumoniae fermentation processes. Biotechnol. Bioeng. 2017;114: 2339-2347. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Tjerko Kamminga
- Laboratory of Systems and Synthetic Biology, Department of Agrotechnology and Food Sciences, Wageningen University and Research, Stippeneng 4, 6708, Wageningen, The Netherlands.,Bioprocess Technology and Support, MSD Animal Health, Boxmeer, The Netherlands
| | - Simen-Jan Slagman
- Bioprocess Technology and Support, MSD Animal Health, Boxmeer, The Netherlands
| | - Jetta J E Bijlsma
- Discovery and Technology, MSD Animal Health, Boxmeer, The Netherlands
| | - Vitor A P Martins Dos Santos
- Laboratory of Systems and Synthetic Biology, Department of Agrotechnology and Food Sciences, Wageningen University and Research, Stippeneng 4, 6708, Wageningen, The Netherlands
| | - Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Department of Agrotechnology and Food Sciences, Wageningen University and Research, Stippeneng 4, 6708, Wageningen, The Netherlands
| | - Peter J Schaap
- Laboratory of Systems and Synthetic Biology, Department of Agrotechnology and Food Sciences, Wageningen University and Research, Stippeneng 4, 6708, Wageningen, The Netherlands
| |
Collapse
|
21
|
Loira N, Mendoza S, Paz Cortés M, Rojas N, Travisany D, Genova AD, Gajardo N, Ehrenfeld N, Maass A. Reconstruction of the microalga Nannochloropsis salina genome-scale metabolic model with applications to lipid production. BMC SYSTEMS BIOLOGY 2017; 11:66. [PMID: 28676050 PMCID: PMC5496344 DOI: 10.1186/s12918-017-0441-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 06/09/2017] [Indexed: 11/10/2022]
Abstract
Background Nannochloropsis salina (= Eustigmatophyceae) is a marine microalga which has become a biotechnological target because of its high capacity to produce polyunsaturated fatty acids and triacylglycerols. It has been used as a source of biofuel, pigments and food supplements, like Omega 3. Only some Nannochloropsis species have been sequenced, but none of them benefit from a genome-scale metabolic model (GSMM), able to predict its metabolic capabilities. Results We present iNS934, the first GSMM for N. salina, including 2345 reactions, 934 genes and an exhaustive description of lipid and nitrogen metabolism. iNS934 has a 90% of accuracy when making simple growth/no-growth predictions and has a 15% error rate in predicting growth rates in different experimental conditions. Moreover, iNS934 allowed us to propose 82 different knockout strategies for strain optimization of triacylglycerols. Conclusions iNS934 provides a powerful tool for metabolic improvement, allowing predictions and simulations of N. salina metabolism under different media and genetic conditions. It also provides a systemic view of N. salina metabolism, potentially guiding research and providing context to -omics data. Electronic supplementary material The online version of this article (doi:10.1186/s12918-017-0441-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Nicolás Loira
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile. .,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile.
| | - Sebastian Mendoza
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile.,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile
| | - María Paz Cortés
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile.,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile.,Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Santiago, Chile
| | - Natalia Rojas
- Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile
| | - Dante Travisany
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile.,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile
| | - Alex Di Genova
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile.,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile
| | - Natalia Gajardo
- Centro de Investigación Austral Biotech, Universidad Santo Tomás, Avenida Ejercito 146, Santiago, Chile
| | - Nicole Ehrenfeld
- Centro de Investigación Austral Biotech, Universidad Santo Tomás, Avenida Ejercito 146, Santiago, Chile
| | - Alejandro Maass
- Mathomics, Center for Mathematical Modeling, Universidad de Chile, Beauchef 851, 7th Floor, Santiago, Chile.,Center for Genome Regulation (Fondap 15090007), Universidad de Chile, Blanco Encalada 2085, Santiago, Chile
| |
Collapse
|
22
|
Chen X, Gao C, Guo L, Hu G, Luo Q, Liu J, Nielsen J, Chen J, Liu L. DCEO Biotechnology: Tools To Design, Construct, Evaluate, and Optimize the Metabolic Pathway for Biosynthesis of Chemicals. Chem Rev 2017; 118:4-72. [DOI: 10.1021/acs.chemrev.6b00804] [Citation(s) in RCA: 109] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Xiulai Chen
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Cong Gao
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Liang Guo
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Guipeng Hu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Qiuling Luo
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jia Liu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Jens Nielsen
- Department
of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg SE-412 96, Sweden
- Novo
Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK2800 Lyngby, Denmark
| | - Jian Chen
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| | - Liming Liu
- State
Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi 214122, China
- Department
of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg SE-412 96, Sweden
- Key
Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
23
|
Genome-Scale Metabolic Modeling of Archaea Lends Insight into Diversity of Metabolic Function. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2017; 2017:9763848. [PMID: 28133437 PMCID: PMC5241448 DOI: 10.1155/2017/9763848] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Revised: 10/17/2016] [Accepted: 11/01/2016] [Indexed: 02/07/2023]
Abstract
Decades of biochemical, bioinformatic, and sequencing data are currently being systematically compiled into genome-scale metabolic reconstructions (GEMs). Such reconstructions are knowledge-bases useful for engineering, modeling, and comparative analysis. Here we review the fifteen GEMs of archaeal species that have been constructed to date. They represent primarily members of the Euryarchaeota with three-quarters comprising representative of methanogens. Unlike other reviews on GEMs, we specially focus on archaea. We briefly review the GEM construction process and the genealogy of the archaeal models. The major insights gained during the construction of these models are then reviewed with specific focus on novel metabolic pathway predictions and growth characteristics. Metabolic pathway usage is discussed in the context of the composition of each organism's biomass and their specific energy and growth requirements. We show how the metabolic models can be used to study the evolution of metabolism in archaea. Conservation of particular metabolic pathways can be studied by comparing reactions using the genes associated with their enzymes. This demonstrates the utility of GEMs to evolutionary studies, far beyond their original purpose of metabolic modeling; however, much needs to be done before archaeal models are as extensively complete as those for bacteria.
Collapse
|
24
|
Cole LW. The Evolution of Per-cell Organelle Number. Front Cell Dev Biol 2016; 4:85. [PMID: 27588285 PMCID: PMC4988970 DOI: 10.3389/fcell.2016.00085] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 08/04/2016] [Indexed: 11/13/2022] Open
Abstract
Organelles with their own distinct genomes, such as plastids and mitochondria, are found in most eukaryotic cells. As these organelles and their host cells have evolved, the partitioning of metabolic processes and the encoding of interacting gene products have created an obligate codependence. This relationship has played a role in shaping the number of organelles in cells through evolution. Factors such as stochastic evolutionary forces acting on genes involved in organelle biogenesis, organelle-nuclear gene interactions, and physical limitations may, to varying degrees, dictate the selective constraint that per-cell organelle number is under. In particular, coordination between nuclear and organellar gene expression may be important in maintaining gene product stoichiometry, which may have a significant role in constraining the evolution of this trait.
Collapse
Affiliation(s)
- Logan W Cole
- Department of Biology, Indiana University Bloomington, IN, USA
| |
Collapse
|
25
|
Hartleb D, Jarre F, Lercher MJ. Improved Metabolic Models for E. coli and Mycoplasma genitalium from GlobalFit, an Algorithm That Simultaneously Matches Growth and Non-Growth Data Sets. PLoS Comput Biol 2016; 12:e1005036. [PMID: 27482704 PMCID: PMC4970803 DOI: 10.1371/journal.pcbi.1005036] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2015] [Accepted: 06/27/2016] [Indexed: 01/02/2023] Open
Abstract
Constraint-based metabolic modeling methods such as Flux Balance Analysis (FBA) are routinely used to predict the effects of genetic changes and to design strains with desired metabolic properties. The major bottleneck in modeling genome-scale metabolic systems is the establishment and manual curation of reliable stoichiometric models. Initial reconstructions are typically refined through comparisons to experimental growth data from gene knockouts or nutrient environments. Existing methods iteratively correct one erroneous model prediction at a time, resulting in accumulating network changes that are often not globally optimal. We present GlobalFit, a bi-level optimization method that finds a globally optimal network, by identifying the minimal set of network changes needed to correctly predict all experimentally observed growth and non-growth cases simultaneously. When applied to the genome-scale metabolic model of Mycoplasma genitalium, GlobalFit decreases unexplained gene knockout phenotypes by 79%, increasing accuracy from 87.3% (according to the current state-of-the-art) to 97.3%. While currently available computers do not allow a global optimization of the much larger metabolic network of E. coli, the main strengths of GlobalFit are already played out when considering only one growth and one non-growth case simultaneously. Application of a corresponding strategy halves the number of unexplained cases for the already highly curated E. coli model, increasing accuracy from 90.8% to 95.4%. Mathematical models that aim to describe the complete metabolism of a cell help us understand cellular metabolic capabilities and evolution, and aid the biotechnological design of microbial strains with desired properties. Draft models are frequently improved through adjustments that increase the agreement of growth/non-growth predictions with observations from gene knockout experiments. Automated methods for this task typically correct one erroneous prediction after the other. We present GlobalFit, a novel method that can consider all experiments and all possible changes simultaneously to identify model modifications that are globally optimal (i.e., that correct the largest possible number of wrong predictions while introducing sets of changes that are most compatible with existing knowledge). This becomes computationally very hard when considering large metabolic models; however, a reduced application of GlobalFit that only looks at small subsets of experiments simultaneously works very well in practice. Allowing only changes that are conservative (e.g., introducing new reactions only if supported by significant genomic evidence), GlobalFit halves the number of wrong growth/non-growth predictions for the state-of-the-art metabolic models of E. coli and Mycoplasma genitalium, increasing prediction accuracy to 95.4% and 93.0%, respectively. By additionally allowing less conservative changes, we are able to improve accuracy further to 97.3% for the M. genitalium model.
Collapse
Affiliation(s)
- Daniel Hartleb
- Institute for Computer Science and Cluster of Excellence on Plant Sciences, Heinrich Heine University, Düsseldorf, Germany
| | - Florian Jarre
- Institute for Mathematics, Heinrich Heine University, Düsseldorf, Germany
| | - Martin J. Lercher
- Institute for Computer Science and Cluster of Excellence on Plant Sciences, Heinrich Heine University, Düsseldorf, Germany
- * E-mail:
| |
Collapse
|
26
|
Chen WH, van Noort V, Lluch-Senar M, Hennrich ML, Wodke JAH, Yus E, Alibés A, Roma G, Mende DR, Pesavento C, Typas A, Gavin AC, Serrano L, Bork P. Integration of multi-omics data of a genome-reduced bacterium: Prevalence of post-transcriptional regulation and its correlation with protein abundances. Nucleic Acids Res 2016; 44:1192-202. [PMID: 26773059 PMCID: PMC4756857 DOI: 10.1093/nar/gkw004] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Accepted: 01/03/2016] [Indexed: 01/16/2023] Open
Abstract
We developed a comprehensive resource for the genome-reduced bacterium Mycoplasma pneumoniae comprising 1748 consistently generated ‘-omics’ data sets, and used it to quantify the power of antisense non-coding RNAs (ncRNAs), lysine acetylation, and protein phosphorylation in predicting protein abundance (11%, 24% and 8%, respectively). These factors taken together are four times more predictive of the proteome abundance than of mRNA abundance. In bacteria, post-translational modifications (PTMs) and ncRNA transcription were both found to increase with decreasing genomic GC-content and genome size. Thus, the evolutionary forces constraining genome size and GC-content modify the relative contributions of the different regulatory layers to proteome homeostasis, and impact more genomic and genetic features than previously appreciated. Indeed, these scaling principles will enable us to develop more informed approaches when engineering minimal synthetic genomes.
Collapse
Affiliation(s)
- Wei-Hua Chen
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Vera van Noort
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Maria Lluch-Senar
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Marco L Hennrich
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Judith A H Wodke
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain Theoretical Biophysics, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany
| | - Eva Yus
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Andreu Alibés
- Bioinformatics Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Guglielmo Roma
- Bioinformatics Unit, Centre for Genomic Regulation (CRG), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Daniel R Mende
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Christina Pesavento
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Athanasios Typas
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Anne-Claude Gavin
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany
| | - Luis Serrano
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, 08010 Barcelona, Spain
| | - Peer Bork
- European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117 Heidelberg, Germany Max-Delbrück-Centre (MDC) for Molecular Medicine, Robert-Rössle-Str. 10, 13092 Berlin, Germany
| |
Collapse
|
27
|
Ponce-de-Leon M, Calle-Espinosa J, Peretó J, Montero F. Consistency Analysis of Genome-Scale Models of Bacterial Metabolism: A Metamodel Approach. PLoS One 2015; 10:e0143626. [PMID: 26629901 PMCID: PMC4668087 DOI: 10.1371/journal.pone.0143626] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Accepted: 11/06/2015] [Indexed: 01/10/2023] Open
Abstract
Genome-scale metabolic models usually contain inconsistencies that manifest as blocked reactions and gap metabolites. With the purpose to detect recurrent inconsistencies in metabolic models, a large-scale analysis was performed using a previously published dataset of 130 genome-scale models. The results showed that a large number of reactions (~22%) are blocked in all the models where they are present. To unravel the nature of such inconsistencies a metamodel was construed by joining the 130 models in a single network. This metamodel was manually curated using the unconnected modules approach, and then, it was used as a reference network to perform a gap-filling on each individual genome-scale model. Finally, a set of 36 models that had not been considered during the construction of the metamodel was used, as a proof of concept, to extend the metamodel with new biochemical information, and to assess its impact on gap-filling results. The analysis performed on the metamodel allowed to conclude: 1) the recurrent inconsistencies found in the models were already present in the metabolic database used during the reconstructions process; 2) the presence of inconsistencies in a metabolic database can be propagated to the reconstructed models; 3) there are reactions not manifested as blocked which are active as a consequence of some classes of artifacts, and; 4) the results of an automatic gap-filling are highly dependent on the consistency and completeness of the metamodel or metabolic database used as the reference network. In conclusion the consistency analysis should be applied to metabolic databases in order to detect and fill gaps as well as to detect and remove artifacts and redundant information.
Collapse
Affiliation(s)
- Miguel Ponce-de-Leon
- Departamento de Bioquímica y Biología Molecular I, Facultad de Ciencias Químicas, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid 28045, Spain
- * E-mail:
| | - Jorge Calle-Espinosa
- Departamento de Bioquímica y Biología Molecular I, Facultad de Ciencias Químicas, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid 28045, Spain
| | - Juli Peretó
- Departament de Bioquímica i Biologia Molecular and Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, C/José Beltrán 2, Paterna 46980, Spain
| | - Francisco Montero
- Departamento de Bioquímica y Biología Molecular I, Facultad de Ciencias Químicas, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid 28045, Spain
| |
Collapse
|
28
|
Kazakiewicz D, Karr JR, Langner KM, Plewczynski D. A combined systems and structural modeling approach repositions antibiotics for Mycoplasma genitalium. Comput Biol Chem 2015; 59 Pt B:91-7. [DOI: 10.1016/j.compbiolchem.2015.07.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2015] [Revised: 05/05/2015] [Accepted: 07/24/2015] [Indexed: 12/13/2022]
|
29
|
Bauer E, Laczny CC, Magnusdottir S, Wilmes P, Thiele I. Phenotypic differentiation of gastrointestinal microbes is reflected in their encoded metabolic repertoires. MICROBIOME 2015; 3:55. [PMID: 26617277 PMCID: PMC4663747 DOI: 10.1186/s40168-015-0121-6] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 09/30/2015] [Indexed: 05/27/2023]
Abstract
BACKGROUND The human gastrointestinal tract harbors a diverse microbial community, in which metabolic phenotypes play important roles for the human host. Recent developments in meta-omics attempt to unravel metabolic roles of microbes by linking genotypic and phenotypic characteristics. This connection, however, still remains poorly understood with respect to its evolutionary and ecological context. RESULTS We generated automatically refined draft genome-scale metabolic models of 301 representative intestinal microbes in silico. We applied a combination of unsupervised machine-learning and systems biology techniques to study individual and global differences in genomic content and inferred metabolic capabilities. Based on the global metabolic differences, we found that energy metabolism and membrane synthesis play important roles in delineating different taxonomic groups. Furthermore, we found an exponential relationship between phylogeny and the reaction composition, meaning that closely related microbes of the same genus can exhibit pronounced differences with respect to their metabolic capabilities while at the family level only marginal metabolic differences can be observed. This finding was further substantiated by the metabolic divergence within different genera. In particular, we could distinguish three sub-type clusters based on membrane and energy metabolism within the Lactobacilli as well as two clusters within the Bifidobacteria and Bacteroides. CONCLUSIONS We demonstrate that phenotypic differentiation within closely related species could be explained by their metabolic repertoire rather than their phylogenetic relationships. These results have important implications in our understanding of the ecological and evolutionary complexity of the human gastrointestinal microbiome.
Collapse
Affiliation(s)
- Eugen Bauer
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Cedric Christian Laczny
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Stefania Magnusdottir
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Ines Thiele
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
30
|
Ganter M, Kaltenbach HM, Stelling J. Predicting network functions with nested patterns. Nat Commun 2015; 5:3006. [PMID: 24398547 DOI: 10.1038/ncomms4006] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2013] [Accepted: 11/24/2013] [Indexed: 12/20/2022] Open
Abstract
Identifying suitable patterns in complex biological interaction networks helps understanding network functions and allows for predictions at the pattern level: by recognizing a known pattern, one can assign its previously established function. However, current approaches fail for previously unseen patterns, when patterns overlap and when they are embedded into a new network context. Here we show how to conceptually extend pattern-based approaches. We define metabolite patterns in metabolic networks that formalize co-occurrences of metabolites. Our probabilistic framework decodes the implicit information in the networks' metabolite patterns to predict metabolic functions. We demonstrate the predictive power by identifying 'indicator patterns', for instance, for enzyme classification, by predicting directions of novel reactions and of known reactions in new network contexts, and by ranking candidate network extensions for gap filling. Beyond their use in improving genome annotations and metabolic network models, we expect that the concepts transfer to other network types.
Collapse
Affiliation(s)
- Mathias Ganter
- 1] Department of Biosystems Science & Engineering and Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland [2]
| | - Hans-Michael Kaltenbach
- 1] Department of Biosystems Science & Engineering and Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland [2]
| | - Jörg Stelling
- Department of Biosystems Science & Engineering and Swiss Institute of Bioinformatics, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland
| |
Collapse
|
31
|
Hollinshead WD, Henson WR, Abernathy M, Moon TS, Tang YJ. Rapid metabolic analysis of
Rhodococcus opacus
PD630 via parallel
13
C‐metabolite fingerprinting. Biotechnol Bioeng 2015; 113:91-100. [DOI: 10.1002/bit.25702] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2015] [Revised: 07/04/2015] [Accepted: 07/07/2015] [Indexed: 12/17/2022]
Affiliation(s)
- Whitney D. Hollinshead
- Department of Energy, Environmental and Chemical EngineeringWashington University in St. LouisSt. LouisMissouri63130
| | - William R. Henson
- Department of Energy, Environmental and Chemical EngineeringWashington University in St. LouisSt. LouisMissouri63130
| | - Mary Abernathy
- Department of Energy, Environmental and Chemical EngineeringWashington University in St. LouisSt. LouisMissouri63130
| | - Tae Seok Moon
- Department of Energy, Environmental and Chemical EngineeringWashington University in St. LouisSt. LouisMissouri63130
| | - Yinjie J. Tang
- Department of Energy, Environmental and Chemical EngineeringWashington University in St. LouisSt. LouisMissouri63130
| |
Collapse
|
32
|
Calderón-Santiago M, Priego-Capote F, de Castro MDL. Enhancing detection coverage in untargeted metabolomics analysis by solid-phase extraction on-line coupled to LC-MS/MS. Electrophoresis 2015; 36:2179-2187. [DOI: 10.1002/elps.201500060] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2015] [Revised: 06/03/2015] [Accepted: 06/05/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Mónica Calderón-Santiago
- Department of Analytical Chemistry, Annex Marie Curie Building, Campus of Rabanales; University of Córdoba; Córdoba Spain
- Maimónides Institute of Biomedical Research (IMIBIC), Reina Sofía University Hospital; University of Córdoba; Córdoba Spain
| | - Feliciano Priego-Capote
- Department of Analytical Chemistry, Annex Marie Curie Building, Campus of Rabanales; University of Córdoba; Córdoba Spain
- Maimónides Institute of Biomedical Research (IMIBIC), Reina Sofía University Hospital; University of Córdoba; Córdoba Spain
| | - María D. Luque de Castro
- Department of Analytical Chemistry, Annex Marie Curie Building, Campus of Rabanales; University of Córdoba; Córdoba Spain
- Maimónides Institute of Biomedical Research (IMIBIC), Reina Sofía University Hospital; University of Córdoba; Córdoba Spain
| |
Collapse
|
33
|
Abstract
The concept of the minimal cell has fascinated scientists for a long time, from both fundamental and applied points of view. This broad concept encompasses extreme reductions of genomes, the last universal common ancestor (LUCA), the creation of semiartificial cells, and the design of protocells and chassis cells. Here we review these different areas of research and identify common and complementary aspects of each one. We focus on systems biology, a discipline that is greatly facilitating the classical top-down and bottom-up approaches toward minimal cells. In addition, we also review the so-called middle-out approach and its contributions to the field with mathematical and computational models. Owing to the advances in genomics technologies, much of the work in this area has been centered on minimal genomes, or rather minimal gene sets, required to sustain life. Nevertheless, a fundamental expansion has been taking place in the last few years wherein the minimal gene set is viewed as a backbone of a more complex system. Complementing genomics, progress is being made in understanding the system-wide properties at the levels of the transcriptome, proteome, and metabolome. Network modeling approaches are enabling the integration of these different omics data sets toward an understanding of the complex molecular pathways connecting genotype to phenotype. We review key concepts central to the mapping and modeling of this complexity, which is at the heart of research on minimal cells. Finally, we discuss the distinction between minimizing the number of cellular components and minimizing cellular complexity, toward an improved understanding and utilization of minimal and simpler cells.
Collapse
|
34
|
Feig M, Harada R, Mori T, Yu I, Takahashi K, Sugita Y. Complete atomistic model of a bacterial cytoplasm for integrating physics, biochemistry, and systems biology. J Mol Graph Model 2015; 58:1-9. [PMID: 25765281 DOI: 10.1016/j.jmgm.2015.02.004] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2014] [Revised: 02/18/2015] [Accepted: 02/22/2015] [Indexed: 01/10/2023]
Abstract
A model for the cytoplasm of Mycoplasma genitalium is presented that integrates data from a variety of sources into a physically and biochemically consistent model. Based on gene annotations, core genes expected to be present in the cytoplasm were determined and a metabolic reaction network was reconstructed. The set of cytoplasmic genes and metabolites from the predicted reactions were assembled into a comprehensive atomistic model consisting of proteins with predicted structures, RNA, protein/RNA complexes, metabolites, ions, and solvent. The resulting model bridges between atomistic and cellular scales, between physical and biochemical aspects, and between structural and systems views of cellular systems and is meant as a starting point for a variety of simulation studies.
Collapse
Affiliation(s)
- Michael Feig
- Department of Biochemistry & Molecular Biology, Michigan State University, East Lansing, MI 48824, United States; Department of Chemistry, Michigan State University, East Lansing, MI 48824, United States; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan.
| | - Ryuhei Harada
- Advanced Institute for Computational Science, RIKEN, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan
| | - Takaharu Mori
- Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Isseki Yu
- Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| | - Koichi Takahashi
- Quantitative Biology Center, RIKEN, Laboratory for Biochemical Simulation, Suita, Osaka 565-0874, Japan; Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan
| | - Yuji Sugita
- Advanced Institute for Computational Science, RIKEN, 7-1-26 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Quantitative Biology Center, RIKEN, International Medical Device Alliance (IMDA) 6F, 1-6-5 Minatojima-minamimachi, Chuo-ku, Kobe, Hyogo 650-0047, Japan; Theoretical Molecular Science Laboratory and iTHES, RIKEN, 2-1 Hirosawa, Wako-shi, Saitama 351-0198, Japan
| |
Collapse
|
35
|
Cazzaniga P, Damiani C, Besozzi D, Colombo R, Nobile MS, Gaglio D, Pescini D, Molinari S, Mauri G, Alberghina L, Vanoni M. Computational strategies for a system-level understanding of metabolism. Metabolites 2014; 4:1034-87. [PMID: 25427076 PMCID: PMC4279158 DOI: 10.3390/metabo4041034] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 11/05/2014] [Accepted: 11/12/2014] [Indexed: 12/20/2022] Open
Abstract
Cell metabolism is the biochemical machinery that provides energy and building blocks to sustain life. Understanding its fine regulation is of pivotal relevance in several fields, from metabolic engineering applications to the treatment of metabolic disorders and cancer. Sophisticated computational approaches are needed to unravel the complexity of metabolism. To this aim, a plethora of methods have been developed, yet it is generally hard to identify which computational strategy is most suited for the investigation of a specific aspect of metabolism. This review provides an up-to-date description of the computational methods available for the analysis of metabolic pathways, discussing their main advantages and drawbacks. In particular, attention is devoted to the identification of the appropriate scale and level of accuracy in the reconstruction of metabolic networks, and to the inference of model structure and parameters, especially when dealing with a shortage of experimental measurements. The choice of the proper computational methods to derive in silico data is then addressed, including topological analyses, constraint-based modeling and simulation of the system dynamics. A description of some computational approaches to gain new biological knowledge or to formulate hypotheses is finally provided.
Collapse
Affiliation(s)
- Paolo Cazzaniga
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Chiara Damiani
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Besozzi
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Riccardo Colombo
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco S Nobile
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Daniela Gaglio
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Dario Pescini
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Sara Molinari
- Dipartimento di Biotecnologie e Bioscienze, Università degli Studi di Milano-Bicocca, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Giancarlo Mauri
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Lilia Alberghina
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| | - Marco Vanoni
- SYSBIO Centre of Systems Biology, Piazza della Scienza 2, 20126 Milano, Italy.
| |
Collapse
|
36
|
Chindelevitch L, Trigg J, Regev A, Berger B. An exact arithmetic toolbox for a consistent and reproducible structural analysis of metabolic network models. Nat Commun 2014; 5:4893. [PMID: 25291352 PMCID: PMC4205847 DOI: 10.1038/ncomms5893] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Accepted: 08/04/2014] [Indexed: 12/03/2022] Open
Abstract
Constraint-based models are currently the only methodology that allows the study of metabolism at the whole-genome scale. Flux balance analysis is commonly used to analyse constraint-based models. Curiously, the results of this analysis vary with the software being run, a situation that we show can be remedied by using exact rather than floating-point arithmetic. Here we introduce MONGOOSE, a toolbox for analysing the structure of constraint-based metabolic models in exact arithmetic. We apply MONGOOSE to the analysis of 98 existing metabolic network models and find that the biomass reaction is surprisingly blocked (unable to sustain non-zero flux) in nearly half of them. We propose a principled approach for unblocking these reactions and extend it to the problems of identifying essential and synthetic lethal reactions and minimal media. Our structural insights enable a systematic study of constraint-based metabolic models, yielding a deeper understanding of their possibilities and limitations. Current tools to analyse constraint-based models of metabolic networks have limited accuracy due to their use of floating-point arithmetic. Here the authors present MONGOOSE, a new computational tool that analyses such models in exact arithmetic, providing improved accuracy and reproducibility.
Collapse
Affiliation(s)
- Leonid Chindelevitch
- 1] Mathematics Department, Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, Massachusetts 02139, USA [2] Broad Institute, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA
| | - Jason Trigg
- Mathematics Department, Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, Massachusetts 02139, USA
| | - Aviv Regev
- 1] Broad Institute, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA [2] Howard Hughes Medical Institute, 4000 Jones Bridge Road, Chevy Chase, MD 20815, USA [3] Department of Biology, MIT, Cambridge, Massachusetts 02139, USA
| | - Bonnie Berger
- 1] Mathematics Department, Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, Massachusetts 02139, USA [2] Broad Institute, 7 Cambridge Center, Cambridge, Massachusetts 02142, USA
| |
Collapse
|
37
|
Simons M, Saha R, Guillard L, Clément G, Armengaud P, Cañas R, Maranas CD, Lea PJ, Hirel B. Nitrogen-use efficiency in maize (Zea mays L.): from 'omics' studies to metabolic modelling. JOURNAL OF EXPERIMENTAL BOTANY 2014; 65:5657-71. [PMID: 24863438 DOI: 10.1093/jxb/eru227] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
In this review, we will present the latest developments in systems biology with particular emphasis on improving nitrogen-use efficiency (NUE) in crops such as maize and demonstrating the application of metabolic models. The review highlights the importance of improving NUE in crops and provides an overview of the transcriptome, proteome, and metabolome datasets available, focusing on a comprehensive understanding of nitrogen regulation. 'Omics' data are hard to interpret in the absence of metabolic flux information within genome-scale models. These models, when integrated with 'omics' data, can serve as a basis for generating predictions that focus and guide further experimental studies. By simulating different nitrogen (N) conditions at a pseudo-steady state, the reactions affecting NUE and additional gene regulations can be determined. Such models thus provide a framework for improving our understanding of the metabolic processes underlying the more efficient use of N-based fertilizers.
Collapse
Affiliation(s)
- Margaret Simons
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Rajib Saha
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Lenaïg Guillard
- Adaptation des Plantes à leur Environnement, Unité Mixte de Recherche 1318, INRA-Agro-ParisTech, Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique (INRA), Centre de Versailles-Grignon, RD 10, 78026 Versailles cedex, France
| | - Gilles Clément
- Plateau Technique Spécifique de Chimie du Végétal, Institut National de la Recherche Agronomique (INRA), Centre de Versailles-Grignon, Unité Mixte de Recherche 1318, INRA-Agro-ParisTech, Route de St Cyr, F-78026 Versailles Cedex, France
| | - Patrick Armengaud
- Adaptation des Plantes à leur Environnement, Unité Mixte de Recherche 1318, INRA-Agro-ParisTech, Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique (INRA), Centre de Versailles-Grignon, RD 10, 78026 Versailles cedex, France
| | - Rafael Cañas
- Adaptation des Plantes à leur Environnement, Unité Mixte de Recherche 1318, INRA-Agro-ParisTech, Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique (INRA), Centre de Versailles-Grignon, RD 10, 78026 Versailles cedex, France
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Peter J Lea
- Lancaster Environment Centre, Lancaster University, Lancaster LA1 4YQ, UK
| | - Bertrand Hirel
- Adaptation des Plantes à leur Environnement, Unité Mixte de Recherche 1318, INRA-Agro-ParisTech, Institut Jean-Pierre Bourgin, Institut National de la Recherche Agronomique (INRA), Centre de Versailles-Grignon, RD 10, 78026 Versailles cedex, France
| |
Collapse
|
38
|
Recent advances in the reconstruction of metabolic models and integration of omics data. Curr Opin Biotechnol 2014; 29:39-45. [DOI: 10.1016/j.copbio.2014.02.011] [Citation(s) in RCA: 102] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2014] [Revised: 02/04/2014] [Accepted: 02/13/2014] [Indexed: 11/22/2022]
|
39
|
Zarecki R, Oberhardt MA, Reshef L, Gophna U, Ruppin E. A novel nutritional predictor links microbial fastidiousness with lowered ubiquity, growth rate, and cooperativeness. PLoS Comput Biol 2014; 10:e1003726. [PMID: 25033033 PMCID: PMC4102436 DOI: 10.1371/journal.pcbi.1003726] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2014] [Accepted: 06/02/2014] [Indexed: 02/01/2023] Open
Abstract
Understanding microbial nutritional requirements is a key challenge in microbiology. Here we leverage the recent availability of thousands of automatically generated genome-scale metabolic models to develop a predictor of microbial minimal medium requirements, which we apply to thousands of species to study the relationship between their nutritional requirements and their ecological and genomic traits. We first show that nutritional requirements are more similar among species that co-habit many ecological niches. We then reveal three fundamental characteristics of microbial fastidiousness (i.e., complex and specific nutritional requirements): (1) more fastidious microorganisms tend to be more ecologically limited; (2) fastidiousness is positively associated with smaller genomes and smaller metabolic networks; and (3) more fastidious species grow more slowly and have less ability to cooperate with other species than more metabolically versatile organisms. These associations reflect the adaptation of fastidious microorganisms to unique niches with few cohabitating species. They also explain how non-fastidious species inhabit many ecological niches with high abundance rates. Taken together, these results advance our understanding microbial nutrition on a large scale, by presenting new nutrition-related associations that govern the distribution of microorganisms in nature. Understanding microbial nutrition is critical for understanding microbial life, and thus has a major influence in many areas of biology. In recent years, the traditional methods of studying microbial nutrition, which rely on culturing bacteria and assessing their nutritional needs through extensive experiments, have been augmented by the development of genome-scale metabolic models, which enable in-depth analysis and prediction of nutrition for a few well-studied organisms. Recently, a pipeline was developed for generating genome-scale metabolic models automatically (the SEED). Here, we leverage models built from this pipeline in order to develop a novel predictor of microbial minimal medium requirements, which we then apply broadly for thousands of microbes across the tree of life. We first show that nutritional requirements are more similar among microorganisms that co-habit many ecological niches. We then use our medium predictions to examine the fastidiousness of organisms (i.e., their need for complex/specific media), and suggest an explanation for certain observed features of microbial abundance patterns. This study is one of the first to leverage genome-scale models on a large (>1000 species) scale, and sets the potential for a new host of strategies for understanding microbial nutrition and ecology in the future.
Collapse
Affiliation(s)
- Raphy Zarecki
- School of Computer Sciences & Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Matthew A. Oberhardt
- School of Computer Sciences & Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- Department of Molecular Microbiology and Biotechnology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Leah Reshef
- Department of Molecular Microbiology and Biotechnology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Uri Gophna
- Department of Molecular Microbiology and Biotechnology, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Eytan Ruppin
- School of Computer Sciences & Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
- * E-mail:
| |
Collapse
|
40
|
Lu W, Tamura T, Song J, Akutsu T. Integer programming-based method for designing synthetic metabolic networks by Minimum Reaction Insertion in a Boolean model. PLoS One 2014; 9:e92637. [PMID: 24651476 PMCID: PMC3961429 DOI: 10.1371/journal.pone.0092637] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 02/25/2014] [Indexed: 01/10/2023] Open
Abstract
In this paper, we consider the Minimum Reaction Insertion (MRI) problem for finding the minimum number of additional reactions from a reference metabolic network to a host metabolic network so that a target compound becomes producible in the revised host metabolic network in a Boolean model. Although a similar problem for larger networks is solvable in a flux balance analysis (FBA)-based model, the solution of the FBA-based model tends to include more reactions than that of the Boolean model. However, solving MRI using the Boolean model is computationally more expensive than using the FBA-based model since the Boolean model needs more integer variables. Therefore, in this study, to solve MRI for larger networks in the Boolean model, we have developed an efficient Integer Programming formalization method in which the number of integer variables is reduced by the notion of feedback vertex set and minimal valid assignment. As a result of computer experiments conducted using the data of metabolic networks of E. coli and reference networks downloaded from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, we have found that the developed method can appropriately solve MRI in the Boolean model and is applicable to large scale-networks for which an exhaustive search does not work. We have also compared the developed method with the existing connectivity-based methods and FBA-based methods, and show the difference between the solutions of our method and the existing methods. A theoretical analysis of MRI is also conducted, and the NP-completeness of MRI is proved in the Boolean model. Our developed software is available at "http://sunflower.kuicr.kyoto-u.ac.jp/~rogi/minRect/minRect.html."
Collapse
Affiliation(s)
- Wei Lu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| | - Takeyuki Tamura
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| | - Jiangning Song
- Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
- National Engineering Laboratory for Industrial Enzymes, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Kyoto, Japan
| |
Collapse
|
41
|
Characterization of the operon encoding the Holliday junction helicase RuvAB from Mycoplasma genitalium and its role in mgpB and mgpC gene variation. J Bacteriol 2014; 196:1608-18. [PMID: 24532771 DOI: 10.1128/jb.01385-13] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Mycoplasma genitalium is an emerging sexually transmitted pathogen associated with reproductive tract disease in men and women, and it can persist for months to years despite the development of a robust antibody response. Mechanisms that may contribute to persistence in vivo include phase and antigenic variation of the MgpB and MgpC adhesins. These processes occur by segmental recombination between discrete variable regions within mgpB and mgpC and multiple archived donor sequences termed MgPa repeats (MgPars). The molecular factors governing mgpB and mgpC variation are poorly understood and obscured by the paucity of recombination genes conserved in the M. genitalium genome. Recently, we demonstrated the requirement for RecA using a quantitative PCR (qPCR) assay developed to measure recombination between the mgpB and mgpC genes and MgPars. Here, we expand these studies by examining the roles of M. genitalium ruvA and ruvB homologs. Deletion of ruvA and ruvB impaired the ability to generate mgpB and mgpC phase and sequence variants, and these deficiencies could be complemented with wild-type copies, including the ruvA gene from Mycoplasma pneumoniae. In contrast, ruvA and ruvB deletions did not affect the sensitivity to UV irradiation, reinforcing our previous findings that the recombinational repair pathway plays a minor role in M. genitalium. Reverse transcription-PCR (RT-PCR) and primer extension analyses also revealed a complex transcriptional organization of the RuvAB system of M. genitalium, which is cotranscribed with two novel open reading frames (ORFs) (termed ORF1 and ORF2 herein) conserved only in M. pneumoniae. These findings suggest that these novel ORFs may play a role in recombination in these two closely related bacteria.
Collapse
|
42
|
Pitkänen E, Jouhten P, Hou J, Syed MF, Blomberg P, Kludas J, Oja M, Holm L, Penttilä M, Rousu J, Arvas M. Comparative genome-scale reconstruction of gapless metabolic networks for present and ancestral species. PLoS Comput Biol 2014; 10:e1003465. [PMID: 24516375 PMCID: PMC3916221 DOI: 10.1371/journal.pcbi.1003465] [Citation(s) in RCA: 64] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2013] [Accepted: 12/18/2013] [Indexed: 12/12/2022] Open
Abstract
We introduce a novel computational approach, CoReCo, for comparative metabolic reconstruction and provide genome-scale metabolic network models for 49 important fungal species. Leveraging on the exponential growth in sequenced genome availability, our method reconstructs genome-scale gapless metabolic networks simultaneously for a large number of species by integrating sequence data in a probabilistic framework. High reconstruction accuracy is demonstrated by comparisons to the well-curated Saccharomyces cerevisiae consensus model and large-scale knock-out experiments. Our comparative approach is particularly useful in scenarios where the quality of available sequence data is lacking, and when reconstructing evolutionary distant species. Moreover, the reconstructed networks are fully carbon mapped, allowing their use in 13C flux analysis. We demonstrate the functionality and usability of the reconstructed fungal models with computational steady-state biomass production experiment, as these fungi include some of the most important production organisms in industrial biotechnology. In contrast to many existing reconstruction techniques, only minimal manual effort is required before the reconstructed models are usable in flux balance experiments. CoReCo is available at http://esaskar.github.io/CoReCo/. Advances in next-generation sequencing technologies are revolutionizing molecular biology. Sequencing-enabled cost-effective characterization of microbial genomes is a particularly exciting development in metabolic engineering. There, considerable effort has been put to reconstructing genome-scale metabolic networks that describe the collection of hundreds to thousands of biochemical reactions available for a microbial cell. These network models are instrumental in understanding microbial metabolism and guiding metabolic engineering efforts to improve biochemical yields. We have developed a novel computational method, CoReCo, which bridges the growing gap between the availability of sequenced genomes and respective reconstructed metabolic networks. The method reconstructs genome-scale metabolic networks simultaneously for related microbial species. It utilizes the available sequencing data from these species to correct for incomplete and missing data. We used the method to reconstruct metabolic networks for a set of 49 fungal species providing the method protein sequence data and a phylogenetic tree describing the evolutionary relationships between the species. We demonstrate the applicability of the method by comparing a metabolic reconstruction of Saccharomyces cerevisiae to the manually curated, high-quality consensus network. We also provide an easy-to-use implementation of the method, usable both in single computer and distributed computing environments.
Collapse
Affiliation(s)
- Esa Pitkänen
- Department of Computer Science, University of Helsinki, Helsinki, Finland
- Department of Medical Genetics, Genome-Scale Biology Research Program, University of Helsinki, Helsinki, Finland
- * E-mail:
| | - Paula Jouhten
- VTT Technical Research Centre of Finland, Espoo, Finland
| | - Jian Hou
- Department of Computer Science, University of Helsinki, Helsinki, Finland
- Department of Information and Computer Science, Aalto University, Espoo, Finland
| | | | - Peter Blomberg
- VTT Technical Research Centre of Finland, Espoo, Finland
| | - Jana Kludas
- Department of Information and Computer Science, Aalto University, Espoo, Finland
| | - Merja Oja
- VTT Technical Research Centre of Finland, Espoo, Finland
| | - Liisa Holm
- Institute of Biotechnology & Department of Biosciences, University of Helsinki, Helsinki, Finland
| | - Merja Penttilä
- VTT Technical Research Centre of Finland, Espoo, Finland
| | - Juho Rousu
- Department of Information and Computer Science, Aalto University, Espoo, Finland
| | - Mikko Arvas
- VTT Technical Research Centre of Finland, Espoo, Finland
| |
Collapse
|
43
|
Investigating host-pathogen behavior and their interaction using genome-scale metabolic network models. Methods Mol Biol 2014; 1184:523-62. [PMID: 25048144 DOI: 10.1007/978-1-4939-1115-8_29] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Genome Scale Metabolic Modeling methods represent one way to compute whole cell function starting from the genome sequence of an organism and contribute towards understanding and predicting the genotype-phenotype relationship. About 80 models spanning all the kingdoms of life from archaea to eukaryotes have been built till date and used to interrogate cell phenotype under varying conditions. These models have been used to not only understand the flux distribution in evolutionary conserved pathways like glycolysis and the Krebs cycle but also in applications ranging from value added product formation in Escherichia coli to predicting inborn errors of Homo sapiens metabolism. This chapter describes a protocol that delineates the process of genome scale metabolic modeling for analysing host-pathogen behavior and interaction using flux balance analysis (FBA). The steps discussed in the process include (1) reconstruction of a metabolic network from the genome sequence, (2) its representation in a precise mathematical framework, (3) its translation to a model, and (4) the analysis using linear algebra and optimization. The methods for biological interpretations of computed cell phenotypes in the context of individual host and pathogen models and their integration are also discussed.
Collapse
|
44
|
Liberal R, Pinney JW. Simple topological properties predict functional misannotations in a metabolic network. Bioinformatics 2013; 29:i154-61. [PMID: 23812979 PMCID: PMC3694667 DOI: 10.1093/bioinformatics/btt236] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Motivation: Misannotation in sequence databases is an important obstacle for automated tools for gene function annotation, which rely extensively on comparison with sequences with known function. To improve current annotations and prevent future propagation of errors, sequence-independent tools are, therefore, needed to assist in the identification of misannotated gene products. In the case of enzymatic functions, each functional assignment implies the existence of a reaction within the organism’s metabolic network; a first approximation to a genome-scale metabolic model can be obtained directly from an automated genome annotation. Any obvious problems in the network, such as dead end or disconnected reactions, can, therefore, be strong indications of misannotation. Results: We demonstrate that a machine-learning approach using only network topological features can successfully predict the validity of enzyme annotations. The predictions are tested at three different levels. A random forest using topological features of the metabolic network and trained on curated sets of correct and incorrect enzyme assignments was found to have an accuracy of up to 86% in 5-fold cross-validation experiments. Further cross-validation against unseen enzyme superfamilies indicates that this classifier can successfully extrapolate beyond the classes of enzyme present in the training data. The random forest model was applied to several automated genome annotations, achieving an accuracy of in most cases when validated against recent genome-scale metabolic models. We also observe that when applied to draft metabolic networks for multiple species, a clear negative correlation is observed between predicted annotation quality and phylogenetic distance to the major model organism for biochemistry (Escherichia coli for prokaryotes and Homo sapiens for eukaryotes). Contact:j.pinney@imperial.ac.uk Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Rodrigo Liberal
- Department of Life Sciences and Centre for Integrative Systems Biology and Bioinformatics, Imperial College London, London SW7 2AZ, UK
| | | |
Collapse
|
45
|
Ghosh S, Baloni P, Mukherjee S, Anand P, Chandra N. A multi-level multi-scale approach to study essential genes in Mycobacterium tuberculosis. BMC SYSTEMS BIOLOGY 2013; 7:132. [PMID: 24308365 PMCID: PMC4234997 DOI: 10.1186/1752-0509-7-132] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2013] [Accepted: 11/20/2013] [Indexed: 11/10/2022]
Abstract
Background The set of indispensable genes that are required by an organism to grow and sustain life are termed as essential genes. There is a strong interest in identification of the set of essential genes, particularly in pathogens, not only for a better understanding of the pathogen biology, but also for identifying drug targets and the minimal gene set for the organism. Essentiality is inherently a systems property and requires consideration of the system as a whole for their identification. The available experimental approaches capture some aspects but each method comes with its own limitations. Moreover, they do not explain the basis for essentiality in most cases. A powerful prediction method to recognize this gene pool including rationalization of the known essential genes in a given organism would be very useful. Here we describe a multi-level multi-scale approach to identify the essential gene pool in a deadly pathogen, Mycobacterium tuberculosis. Results The multi-level workflow analyses the bacterial cell by studying (a) genome-wide gene expression profiles to identify the set of genes which show consistent and significant levels of expression in multiple samples of the same condition, (b) indispensability for growth by using gene expression integrated flux balance analysis of a genome-scale metabolic model, (c) importance for maintaining the integrity and flow in a protein-protein interaction network and (d) evolutionary conservation in a set of genomes of the same ecological niche. In the gene pool identified, the functional basis for essentiality has been addressed by studying residue level conservation and the sub-structure at the ligand binding pockets, from which essential amino acid residues in that pocket have also been identified. 283 genes were identified as essential genes with high-confidence. An agreement of about 73.5% is observed with that obtained from the experimental transposon mutagenesis technique. A large proportion of the identified genes belong to the class of intermediary metabolism and respiration. Conclusions The multi-scale, multi-level approach described can be generally applied to other pathogens as well. The essential gene pool identified form a basis for designing experiments to probe their finer functional roles and also serve as a ready shortlist for identifying drug targets.
Collapse
Affiliation(s)
| | | | | | | | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, India.
| |
Collapse
|
46
|
Liebeke M, Lalk M. Staphylococcus aureus metabolic response to changing environmental conditions - a metabolomics perspective. Int J Med Microbiol 2013; 304:222-9. [PMID: 24439195 DOI: 10.1016/j.ijmm.2013.11.017] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2013] [Revised: 10/30/2013] [Accepted: 11/25/2013] [Indexed: 01/16/2023] Open
Abstract
Microorganisms preserve their metabolic function against a wide range of external perturbations including biotic or abiotic factors by utilizing cellular adaptations to maintain cell homeostasis. Functional genomics aims to detect such adaptive alterations on the level of transcriptome, proteome and metabolome to understand system wide changes and to identify interactions between the different levels of biochemical organization. Microbial metabolomics measures metabolites, the direct biochemical response to the environment, and is pivotal to the understanding of the variability and dynamics of bacterial cell metabolism. Metabolomics can measure many different types of compounds including primary metabolites, secondary metabolites, second messengers, quorum sensing compounds and others, which all contribute to the complex bacterial response to an environmental change. Recent data confirmed that many metabolic processes in pathogenic bacteria are linked to virulence and invasive capabilities. Deciphering bacterial metabolism in response to specific environmental conditions and in specific genetic backgrounds will help map the complex network between the metabolome and the other "-omes". Here, we will review a selection of case studies for the pathogenic Gram-positive bacterium Staphylococcus aureus and summarize the current state of metabolomics literature covering staphylococci metabolism under different physiological states.
Collapse
Affiliation(s)
- Manuel Liebeke
- Biomolecular Medicine, Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, London SW7 2AZ, UK.
| | - Michael Lalk
- Institute of Biochemistry, Ernst-Moritz-Arndt-University of Greifswald, 17487 Greifswald, Germany
| |
Collapse
|
47
|
Marashi SA, Kouhestani H, Mahdavi M. Studying the relationship between robustness against mutations in metabolic networks and lifestyle of organisms. ScientificWorldJournal 2013; 2013:615697. [PMID: 24348175 PMCID: PMC3848384 DOI: 10.1155/2013/615697] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2013] [Accepted: 09/26/2013] [Indexed: 01/09/2023] Open
Abstract
Robustness is the key feature of biological networks that enables living organisms to keep their homeostatic state and to survive against external and internal perturbations. Variations in environmental conditions or nutrients and intracellular changes such as genetic mutations have the potential to change stability and efficiency of an organism. Structural robustness helps biological systems to choose alternative routes of adaptation to varying conditions. In this study, in order to estimate the structural robustness in metabolic networks we presented a novel flux balance-based approach inspired by bond percolation theory. Fourteen in silico metabolic models were studied in this work in order to examine the possible relationship between the lifestyle of organisms and their metabolic robustness. The results of this study confirm that in organisms which are highly adapted to their environment robustness to mutations may decrease compared to other organisms.
Collapse
Affiliation(s)
- Sayed-Amir Marashi
- Department of Biotechnology, College of Science, University of Tehran, Tehran 1417614411, Iran
- School of Computer Science, Institute for Research in Fundamental Sciences (IPM), P.O. Box 19395-5746, Tehran, Iran
| | - Hawa Kouhestani
- Department of Biology, Faculty of Natural Science, University of Tabriz, Tabriz 5166616471, Iran
| | - Majid Mahdavi
- Department of Biology, Faculty of Natural Science, University of Tabriz, Tabriz 5166616471, Iran
| |
Collapse
|
48
|
Abstract
To test the promise of whole-cell modeling to facilitate scientific inquiry, we compared growth rates simulated in a whole-cell model with experimental measurements for all viable single-gene disruption Mycoplasma genitalium strains. Discrepancies between simulations and experiments led to predictions about kinetic parameters of specific enzymes that we subsequently validated. These findings represent, to our knowledge, the first application of whole-cell modeling to accelerate biological discovery.
Collapse
|
49
|
Ponce-de-León M, Montero F, Peretó J. Solving gap metabolites and blocked reactions in genome-scale models: application to the metabolic network of Blattabacterium cuenoti. BMC SYSTEMS BIOLOGY 2013; 7:114. [PMID: 24176055 PMCID: PMC3819652 DOI: 10.1186/1752-0509-7-114] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Accepted: 10/23/2013] [Indexed: 11/20/2022]
Abstract
Background Metabolic reconstruction is the computational-based process that aims to elucidate the network of metabolites interconnected through reactions catalyzed by activities assigned to one or more genes. Reconstructed models may contain inconsistencies that appear as gap metabolites and blocked reactions. Although automatic methods for solving this problem have been previously developed, there are many situations where manual curation is still needed. Results We introduce a general definition of gap metabolite that allows its detection in a straightforward manner. Moreover, a method for the detection of Unconnected Modules, defined as isolated sets of blocked reactions connected through gap metabolites, is proposed. The method has been successfully applied to the curation of iCG238, the genome-scale metabolic model for the bacterium Blattabacterium cuenoti, obligate endosymbiont of cockroaches. Conclusion We found the proposed approach to be a valuable tool for the curation of genome-scale metabolic models. The outcome of its application to the genome-scale model B. cuenoti iCG238 is a more accurate model version named as B. cuenoti iMP240.
Collapse
Affiliation(s)
| | - Francisco Montero
- Departamento de Bioquímica y Biología Molecular I, Facultad de Ciencias Químicas, Universidad Complutense de Madrid, Ciudad Universitaria, Madrid 28045, Spain.
| | | |
Collapse
|
50
|
Dissecting the energy metabolism in Mycoplasma pneumoniae through genome-scale metabolic modeling. Mol Syst Biol 2013; 9:653. [PMID: 23549481 PMCID: PMC3658275 DOI: 10.1038/msb.2013.6] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2012] [Accepted: 02/20/2013] [Indexed: 12/15/2022] Open
Abstract
A new genome-scale metabolic reconstruction of M. pneumonia is used in combination with external metabolite measurement and protein abundance measurements to quantitatively explore the energy metabolism of this genome-reduce human pathogen. ![]()
We established a detailed biomass composition for M. pneumoniae, thus allowing for growth simulations. Using our metabolic model, we corrected the metabolic network topology and the functional annotation of key metabolic enzymes. M. pneumoniae, unlike other laboratory-grown bacteria, uses a high fraction of energy (up to 89%) for cellular maintenance and not for growth. Simulating different growth conditions as well as single and double mutant phenotypes, we analyzed pathway connectivity and the impact of gene deletions on the growth performance of M. pneumoniae, highlighting the limited adaptive capabilities of this minimal model organism.
Mycoplasma pneumoniae, a threatening pathogen with a minimal genome, is a model organism for bacterial systems biology for which substantial experimental information is available. With the goal of understanding the complex interactions underlying its metabolism, we analyzed and characterized the metabolic network of M. pneumoniae in great detail, integrating data from different omics analyses under a range of conditions into a constraint-based model backbone. Iterating model predictions, hypothesis generation, experimental testing, and model refinement, we accurately curated the network and quantitatively explored the energy metabolism. In contrast to other bacteria, M. pneumoniae uses most of its energy for maintenance tasks instead of growth. We show that in highly linear networks the prediction of flux distributions for different growth times allows analysis of time-dependent changes, albeit using a static model. By performing an in silico knock-out study as well as analyzing flux distributions in single and double mutant phenotypes, we demonstrated that the model accurately represents the metabolism of M. pneumoniae. The experimentally validated model provides a solid basis for understanding its metabolic regulatory mechanisms.
Collapse
|