1
|
Liebal UW, Phan ANT, Sudhakar M, Raman K, Blank LM. Machine Learning Applications for Mass Spectrometry-Based Metabolomics. Metabolites 2020; 10:E243. [PMID: 32545768 PMCID: PMC7345470 DOI: 10.3390/metabo10060243] [Citation(s) in RCA: 133] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 06/09/2020] [Accepted: 06/11/2020] [Indexed: 12/20/2022] Open
Abstract
The metabolome of an organism depends on environmental factors and intracellular regulation and provides information about the physiological conditions. Metabolomics helps to understand disease progression in clinical settings or estimate metabolite overproduction for metabolic engineering. The most popular analytical metabolomics platform is mass spectrometry (MS). However, MS metabolome data analysis is complicated, since metabolites interact nonlinearly, and the data structures themselves are complex. Machine learning methods have become immensely popular for statistical analysis due to the inherent nonlinear data representation and the ability to process large and heterogeneous data rapidly. In this review, we address recent developments in using machine learning for processing MS spectra and show how machine learning generates new biological insights. In particular, supervised machine learning has great potential in metabolomics research because of the ability to supply quantitative predictions. We review here commonly used tools, such as random forest, support vector machines, artificial neural networks, and genetic algorithms. During processing steps, the supervised machine learning methods help peak picking, normalization, and missing data imputation. For knowledge-driven analysis, machine learning contributes to biomarker detection, classification and regression, biochemical pathway identification, and carbon flux determination. Of important relevance is the combination of different omics data to identify the contributions of the various regulatory levels. Our overview of the recent publications also highlights that data quality determines analysis quality, but also adds to the challenge of choosing the right model for the data. Machine learning methods applied to MS-based metabolomics ease data analysis and can support clinical decisions, guide metabolic engineering, and stimulate fundamental biological discoveries.
Collapse
Affiliation(s)
- Ulf W. Liebal
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| | - An N. T. Phan
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| | - Malvika Sudhakar
- Department of Biotechnology, Bhupat and Juoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai 600 036, India; (M.S.); (K.R.)
- Initiative for Biological Systems Engineering, IIT Madras, Chennai 600 036, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai 600 036, India
| | - Karthik Raman
- Department of Biotechnology, Bhupat and Juoti Mehta School of Biosciences, Indian Institute of Technology (IIT) Madras, Chennai 600 036, India; (M.S.); (K.R.)
- Initiative for Biological Systems Engineering, IIT Madras, Chennai 600 036, India
- Robert Bosch Centre for Data Science and Artificial Intelligence (RBCDSAI), IIT Madras, Chennai 600 036, India
| | - Lars M. Blank
- Institute of Applied Microbiology, Aachen Biology and Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany;
| |
Collapse
|
2
|
Haustenne L, Bastin G, Hols P, Fontaine L. Modeling of the ComRS Signaling Pathway Reveals the Limiting Factors Controlling Competence in Streptococcus thermophilus. Front Microbiol 2015; 6:1413. [PMID: 26733960 PMCID: PMC4686606 DOI: 10.3389/fmicb.2015.01413] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Accepted: 11/27/2015] [Indexed: 12/25/2022] Open
Abstract
In streptococci, entry in competence is dictated by ComX abundance. In Streptococcus thermophilus, production of ComX is transient and tightly regulated during growth: it is positively regulated by the cell-cell communication system ComRS during the activation phase and negatively regulated during the shut-off phase by unidentified late competence gene(s). Interestingly, most S. thermophilus strains are not or weakly transformable in permissive growth conditions (i.e., chemically defined medium, CDM), suggesting that some players of the ComRS regulatory pathway are limiting. Here, we combined mathematical modeling and experimental approaches to identify the components of the ComRS system which are critical for both dynamics and amplitude of ComX production in S. thermophilus. We built a deterministic, population-scaled model of the time-course regulation of specific ComX production in CDM growth conditions. Strains LMD-9 and LMG18311 were respectively selected as representative of highly and weakly transformable strains. Results from in silico simulations and in vivo luciferase activities show that ComR concentration is the main limiting factor for the level of comX expression and controls the kinetics of spontaneous competence induction in strain LMD-9. In addition, the model predicts that the poor transformability of strain LMG18311 results from a 10-fold lower comR expression level compared to strain LMD-9. In agreement, comR overexpression in both strains was shown to induce higher competence levels with deregulated kinetics patterns during growth. In conclusion, we propose that the level of ComR production is one important factor that could explain competence heterogeneity among S. thermophilus strains.
Collapse
Affiliation(s)
- Laurie Haustenne
- Biochimie, Biophysique et Génétique des Microorganismes, Institut des Sciences de la Vie, Université catholique de Louvain Louvain-la-Neuve, Belgium
| | - Georges Bastin
- Center for Systems Engineering and Applied Mechanics, ICTEAM, Université catholique de Louvain Louvain-la-Neuve, Belgium
| | - Pascal Hols
- Biochimie, Biophysique et Génétique des Microorganismes, Institut des Sciences de la Vie, Université catholique de Louvain Louvain-la-Neuve, Belgium
| | - Laetitia Fontaine
- Biochimie, Biophysique et Génétique des Microorganismes, Institut des Sciences de la Vie, Université catholique de Louvain Louvain-la-Neuve, Belgium
| |
Collapse
|
3
|
Lyon P. The cognitive cell: bacterial behavior reconsidered. Front Microbiol 2015; 6:264. [PMID: 25926819 PMCID: PMC4396460 DOI: 10.3389/fmicb.2015.00264] [Citation(s) in RCA: 123] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2014] [Accepted: 03/16/2015] [Indexed: 01/28/2023] Open
Abstract
Research on how bacteria adapt to changing environments underlies the contemporary biological understanding of signal transduction (ST), and ST provides the foundation of the information-processing approach that is the hallmark of the ‘cognitive revolution,’ which began in the mid-20th century. Yet cognitive scientists largely remain oblivious to research into microbial behavior that might provide insights into problems in their own domains, while microbiologists seem equally unaware of the potential importance of their work to understanding cognitive capacities in multicellular organisms, including vertebrates. Evidence in bacteria for capacities encompassed by the concept of cognition is reviewed. Parallels exist not only at the heuristic level of functional analogue, but also at the level of molecular mechanism, evolution and ecology, which is where fruitful cross-fertilization among disciplines might be found.
Collapse
Affiliation(s)
- Pamela Lyon
- Southgate Institute for Health, Society and Equity, School of Medicine, Flinders University Adelaide, SA, Australia
| |
Collapse
|
4
|
Eijlander RT, de Jong A, Krawczyk AO, Holsappel S, Kuipers OP. SporeWeb: an interactive journey through the complete sporulation cycle of Bacillus subtilis. Nucleic Acids Res 2013; 42:D685-91. [PMID: 24170806 PMCID: PMC3964945 DOI: 10.1093/nar/gkt1007] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Bacterial spores are a continuous problem for both food-based and health-related industries. Decades of scientific research dedicated towards understanding molecular and gene regulatory aspects of sporulation, spore germination and spore properties have resulted in a wealth of data and information. To facilitate obtaining a complete overview as well as new insights concerning this complex and tightly regulated process, we have developed a database-driven knowledge platform called SporeWeb (http://sporeweb.molgenrug.nl) that focuses on gene regulatory networks during sporulation in the Gram-positive bacterium Bacillus subtilis. Dynamic features allow the user to navigate through all stages of sporulation with review-like descriptions, schematic overviews on transcriptional regulation and detailed information on all regulators and the genes under their control. The Web site supports data acquisition on sporulation genes and their expression, regulon network interactions and direct links to other knowledge platforms or relevant literature. The information found on SporeWeb (including figures and tables) can and will be updated as new information becomes available in the literature. In this way, SporeWeb offers a novel, convenient and timely reference, an information source and a data acquisition tool that will aid in the general understanding of the dynamics of the complete sporulation cycle.
Collapse
Affiliation(s)
- Robyn T Eijlander
- Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands and Department of Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, 9747 AG Groningen, The Netherlands
| | | | | | | | | |
Collapse
|
5
|
Liebal UW, Millat T, Marles-Wright J, Lewis RJ, Wolkenhauer O. Simulations of stressosome activation emphasize allosteric interactions between RsbR and RsbT. BMC SYSTEMS BIOLOGY 2013; 7:3. [PMID: 23320651 PMCID: PMC3556497 DOI: 10.1186/1752-0509-7-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/23/2012] [Accepted: 01/07/2013] [Indexed: 11/10/2022]
Abstract
BACKGROUND The stressosome is a bacterial signalling complex that responds to environmental changes by initiating a protein partner switching cascade, which leads to the release of the alternative sigma factor, σB. Stress perception increases the phosphorylation of the stressosome sensor protein, RsbR, and the scaffold protein, RsbS, by the protein kinase, RsbT. Subsequent dissociation of RsbT from the stressosome activates the σB cascade. However, the sequence of physical events that occur in the stressosome during signal transduction is insufficiently understood. RESULTS Here, we use computational modelling to correlate the structure of the stressosome with the efficiency of the phosphorylation reactions that occur upon activation by stress. In our model, the phosphorylation of any stressosome protein is dependent upon its nearest neighbours and their phosphorylation status. We compare different hypotheses about stressosome activation and find that only the model representing the allosteric activation of the kinase RsbT, by phosphorylated RsbR, qualitatively reproduces the experimental data. CONCLUSIONS Our simulations and the associated analysis of published data support the following hypotheses: (i) a simple Boolean model is capable of reproducing stressosome dynamics, (ii) different stressors induce identical stressosome activation patterns, and we also confirm that (i) phosphorylated RsbR activates RsbT, and (ii) the main purpose of RsbX is to dephosphorylate RsbS-P.
Collapse
Affiliation(s)
- Ulf W Liebal
- Department of Systems Biology & Bioinformatics, Institute of Computer Science, University of Rostock, 18051, Rostock, Germany
| | - Thomas Millat
- Department of Systems Biology & Bioinformatics, Institute of Computer Science, University of Rostock, 18051, Rostock, Germany
| | - Jon Marles-Wright
- Institute for Cell and Molecular Biosciences, Faculty of Medical Sciences, Newcastle University, Newcastle-upon-Tyne, NE2 4HH, UK
- Institute of Structural and Molecular Biology, School of Biological Sciences, Edinburgh University, Edinburgh, EH9 3JR, UK
| | - Richard J Lewis
- Institute for Cell and Molecular Biosciences, Faculty of Medical Sciences, Newcastle University, Newcastle-upon-Tyne, NE2 4HH, UK
| | - Olaf Wolkenhauer
- Department of Systems Biology & Bioinformatics, Institute of Computer Science, University of Rostock, 18051, Rostock, Germany
- Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch, 7600, South Africa
| |
Collapse
|
6
|
Positive feedback and temperature mediated molecular switch controls differential gene regulation in Bordetella pertussis. Biosystems 2012; 110:107-18. [PMID: 22960292 DOI: 10.1016/j.biosystems.2012.08.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2012] [Revised: 07/19/2012] [Accepted: 08/16/2012] [Indexed: 11/24/2022]
Abstract
Based on the phosphorelay kinetics operative within BvgAS two component system we propose a mathematical framework for signal transduction and gene regulation of phenotypic phases in Bordetella pertussis. The proposed model identifies a novel mechanism of transcriptional interference between two promoters present in the bvg locus. To understand the system behavior under elevated temperature, the developed model has been studied in two different ways. First, a quasi-steady state analysis has been carried out for the two component system, comprising of sensor BvgS and response regulator BvgA. The quasi-steady state analysis reveals temperature induced sharp molecular switch, leading to amplification in the output of BvgA. Accumulation of a large pool of BvgA thus results into differential regulation of the downstream genes, including the gene encoding toxin. Numerical integration of the full network kinetics is then carried out to explore time dependent behavior of different system components, that qualitatively capture the essential features of experimental results performed in vivo. Furthermore, the developed model has been utilized to study mutants that are impaired in their ability to phosphorylate the transcription factor, BvgA, of the signaling network.
Collapse
|
7
|
Davidson FA, Seon-Yi C, Stanley-Wall NR. Selective heterogeneity in exoprotease production by Bacillus subtilis. PLoS One 2012; 7:e38574. [PMID: 22745669 PMCID: PMC3380070 DOI: 10.1371/journal.pone.0038574] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Accepted: 05/07/2012] [Indexed: 11/18/2022] Open
Abstract
Bacteria have elaborate signalling mechanisms to ensure a behavioural response that is most likely to enhance survival in a changing environment. It is becoming increasingly apparent that as part of this response, bacteria are capable of cell differentiation and can generate multiple, mutually exclusive co-existing cell states. These cell states are often associated with multicellular processes that bring benefit to the community as a whole but which may be, paradoxically, disadvantageous to an individual subpopulation. How this process of cell differentiation is controlled is intriguing and remains a largely open question. In this paper, we consider an important aspect of cell differentiation that is known to occur in the gram-positive bacterium Bacillus subtilis: we investigate the role of two master regulators DegU and Spo0A in the control of extra-cellular protease production. Recent work in this area focussed the on role of DegU in this process and suggested that transient effects in protein production were the drivers of cell-response heterogeneity. Here, using a combination of mathematical modelling, analysis and stochastic simulations, we provide a complementary analysis of this regulatory system that investigates the roles of both DegU and Spo0A in extra-cellular protease production. In doing so, we present a mechanism for bimodality, or system heterogeneity, without the need for a bistable switch in the underlying regulatory network. Moreover, our analysis leads us to conclude that this heterogeneity is in fact a persistent, stable feature. Our results suggest that system response is divided into three zones: low and high signal levels induce a unimodal or undifferentiated response from the cell population with all cells OFF and ON, respectively for exoprotease production. However, for intermediate levels of signal, a heterogeneous response is predicted with a spread of activity levels, representing typical "bet-hedging" behaviour.
Collapse
|
8
|
Tindall MJ, Gaffney EA, Maini PK, Armitage JP. Theoretical insights into bacterial chemotaxis. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2012; 4:247-59. [PMID: 22411503 DOI: 10.1002/wsbm.1168] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Research into understanding bacterial chemotactic systems has become a paradigm for Systems Biology. Experimental and theoretical researchers have worked hand-in-hand for over 40 years to understand the intricate behavior driving bacterial species, in particular how such small creatures, usually not more than 5 µm in length, detect and respond to small changes in their extracellular environment. In this review we highlight the importance that theoretical modeling has played in providing new insight and understanding into bacterial chemotaxis. We begin with an overview of the bacterial chemotaxis sensory response, before reviewing the role of theoretical modeling in understanding elements of the system on the single cell scale and features underpinning multiscale extensions to population models.
Collapse
Affiliation(s)
- Marcus J Tindall
- School of Biological Sciences, University of Reading, Whiteknights, Reading, UK.
| | | | | | | |
Collapse
|
9
|
Liebal UW, Sappa PK, Millat T, Steil L, Homuth G, Völker U, Wolkenhauer O. Proteolysis of beta-galactosidase following SigmaB activation in Bacillus subtilis. MOLECULAR BIOSYSTEMS 2012; 8:1806-14. [DOI: 10.1039/c2mb25031d] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
10
|
Tiwari A, Ray JCJ, Narula J, Igoshin OA. Bistable responses in bacterial genetic networks: designs and dynamical consequences. Math Biosci 2011; 231:76-89. [PMID: 21385588 DOI: 10.1016/j.mbs.2011.03.004] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2010] [Revised: 02/23/2011] [Accepted: 03/01/2011] [Indexed: 10/18/2022]
Abstract
A key property of living cells is their ability to react to stimuli with specific biochemical responses. These responses can be understood through the dynamics of underlying biochemical and genetic networks. Evolutionary design principles have been well studied in networks that display graded responses, with a continuous relationship between input signal and system output. Alternatively, biochemical networks can exhibit bistable responses so that over a range of signals the network possesses two stable steady states. In this review, we discuss several conceptual examples illustrating network designs that can result in a bistable response of the biochemical network. Next, we examine manifestations of these designs in bacterial master-regulatory genetic circuits. In particular, we discuss mechanisms and dynamic consequences of bistability in three circuits: two-component systems, sigma-factor networks, and a multistep phosphorelay. Analyzing these examples allows us to expand our knowledge of evolutionary design principles networks with bistable responses.
Collapse
Affiliation(s)
- Abhinav Tiwari
- Department of Bioengineering, Rice University, Houston, TX 77005, USA
| | | | | | | |
Collapse
|