1
|
Koblitz J, Halama P, Spring S, Thiel V, Baschien C, Hahnke R, Pester M, Overmann J, Reimer L. MediaDive: the expert-curated cultivation media database. Nucleic Acids Res 2022; 51:D1531-D1538. [PMID: 36134710 PMCID: PMC9825534 DOI: 10.1093/nar/gkac803] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 08/29/2022] [Accepted: 09/09/2022] [Indexed: 01/30/2023] Open
Abstract
We present MediaDive (https://mediadive.dsmz.de), a comprehensive and expert-curated cultivation media database, which comprises recipes, instructions and molecular compositions of >3200 standardized cultivation media for >40 000 microbial strains from all domains of life. MediaDive is designed to enable broad range applications from every-day-use in research and diagnostic laboratories to knowledge-driven support of new media design and artificial intelligence-driven data mining. It offers a number of intuitive search functions and comparison tools, for example to identify media for related taxonomic groups and to integrate strain-specific modifications. Besides classical PDF archiving and printing, the state-of-the-art website allows paperless use of media recipes on mobile devices for convenient wet-lab use. In addition, data can be retrieved using a RESTful web service for large-scale data analyses. An internal editor interface ensures continuous extension and curation of media by cultivation experts from the Leibniz Institute DSMZ, which is interlinked with the growing microbial collections at DSMZ. External user engagement is covered by a dedicated media builder tool. The standardized and programmatically accessible data will foster new approaches for the design of cultivation media to target the vast majority of uncultured microorganisms.
Collapse
Affiliation(s)
- Julia Koblitz
- To whom correspondence should be addressed. Tel: +49 531 2616 313; Fax: +49 531 2616 418;
| | - Philipp Halama
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Stefan Spring
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Vera Thiel
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Christiane Baschien
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Richard L Hahnke
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany
| | - Michael Pester
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, Braunschweig, Germany,Technical University of Braunschweig, Institute for Microbiology, Braunschweig, Germany
| | | | | |
Collapse
|
2
|
Apiyo D, Mouton JM, Louw C, Sampson SL, Louw TM. Dynamic mathematical model development and validation of in vitro Mycobacterium smegmatis growth under nutrient- and pH-stress. J Theor Biol 2022; 532:110921. [PMID: 34582827 DOI: 10.1016/j.jtbi.2021.110921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 08/27/2021] [Accepted: 09/21/2021] [Indexed: 10/20/2022]
Abstract
Mycobacterium tuberculosis can exist within a host for lengthy periods, tolerating even antibiotic challenge. This non-heritable, antibiotic tolerant "persister" state, is thought to underlie latent Tuberculosis (TB) infection and a deeper understanding thereof could inform treatment strategies. In addition to experimental studies, mathematical and computational modelling approaches are widely employed to study persistence from both an in vivo and in vitro perspective. However, specialized models (partial differential equations, agent-based, multiscale, etc.) rely on several difficult to determine parameters. In this study, a dynamic mathematical model was developed to predict the response of Mycobacterium smegmatis (a model organism for M. tuberculosis) grown in batch culture and subjected to a range of in vitro environmental stresses. Lag phase dynamics, pH variations and internal nitrogen storage were mechanistically modelled. Experimental results were used to train model parameters using global optimization, with extensive subsequent model validation to ensure extensibility to more complex modelling frameworks. This included an identifiability analysis which indicated that seven of the thirteen model parameters were uniquely identifiable. Non-identifiable parameters were critically evaluated. Model predictions compared to validation data (based on experimental results not used during training) were accurate with less than 16% maximum absolute percentage error, indicating that the model is accurate even when extrapolating to new experimental conditions. The bulk growth model can be extended to spatially heterogeneous simulations such as an agent-based model to simulate in vitro granuloma models or, eventually, in vivo conditions, where distributed environmental conditions are difficult to measure.
Collapse
Affiliation(s)
- D Apiyo
- Department of Process Engineering, Faculty of Engineering, Stellenbosch University, Stellenbosch, South Africa
| | - J M Mouton
- Department of Science and Innovation/National Research Foundation (DSI/NRF) Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - C Louw
- Department of Process Engineering, Faculty of Engineering, Stellenbosch University, Stellenbosch, South Africa
| | - S L Sampson
- Department of Science and Innovation/National Research Foundation (DSI/NRF) Centre of Excellence for Biomedical Tuberculosis Research, South African Medical Research Council Centre for Tuberculosis Research, Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - T M Louw
- Department of Process Engineering, Faculty of Engineering, Stellenbosch University, Stellenbosch, South Africa.
| |
Collapse
|
3
|
Functional prediction of environmental variables using metabolic networks. Sci Rep 2021; 11:12192. [PMID: 34108539 PMCID: PMC8190111 DOI: 10.1038/s41598-021-91486-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 05/05/2021] [Indexed: 11/23/2022] Open
Abstract
In this manuscript, we propose a novel approach to assess relationships between environment and metabolic networks. We used a comprehensive dataset of more than 5000 prokaryotic species from which we derived the metabolic networks. We compute the scope from the reconstructed graphs, which is the set of all metabolites and reactions that can potentially be synthesized when provided with external metabolites. We show using machine learning techniques that the scope is an excellent predictor of taxonomic and environmental variables, namely growth temperature, oxygen tolerance, and habitat. In the literature, metabolites and pathways are rarely used to discriminate species. We make use of the scope underlying structure—metabolites and pathways—to construct the predictive models, giving additional information on the important metabolic pathways needed to discriminate the species, which is often absent in other metabolic network properties. For example, in the particular case of growth temperature, glutathione biosynthesis pathways are specific to species growing in cold environments, whereas tungsten metabolism is specific to species in warm environments, as was hinted in current literature. From a machine learning perspective, the scope is able to reduce the dimension of our data, and can thus be considered as an interpretable graph embedding.
Collapse
|
4
|
Bernstein DB, Sulheim S, Almaas E, Segrè D. Addressing uncertainty in genome-scale metabolic model reconstruction and analysis. Genome Biol 2021; 22:64. [PMID: 33602294 PMCID: PMC7890832 DOI: 10.1186/s13059-021-02289-z] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Accepted: 02/04/2021] [Indexed: 02/07/2023] Open
Abstract
The reconstruction and analysis of genome-scale metabolic models constitutes a powerful systems biology approach, with applications ranging from basic understanding of genotype-phenotype mapping to solving biomedical and environmental problems. However, the biological insight obtained from these models is limited by multiple heterogeneous sources of uncertainty, which are often difficult to quantify. Here we review the major sources of uncertainty and survey existing approaches developed for representing and addressing them. A unified formal characterization of these uncertainties through probabilistic approaches and ensemble modeling will facilitate convergence towards consistent reconstruction pipelines, improved data integration algorithms, and more accurate assessment of predictive capacity.
Collapse
Affiliation(s)
- David B Bernstein
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA
| | - Snorre Sulheim
- Bioinformatics Program, Boston University, Boston, MA, USA
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- Department of Biotechnology and Nanomedicine, SINTEF Industry, Trondheim, Norway
| | - Eivind Almaas
- Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
- K.G. Jebsen Center for Genetic Epidemiology, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Daniel Segrè
- Department of Biomedical Engineering and Biological Design Center, Boston University, Boston, MA, USA.
- Bioinformatics Program, Boston University, Boston, MA, USA.
- Department of Biology and Department of Physics, Boston University, Boston, MA, USA.
| |
Collapse
|
5
|
Wang D, Greenwood P, Klein MS. A protein-free chemically defined medium for the cultivation of various micro-organisms with food safety significance. J Appl Microbiol 2021; 131:844-854. [PMID: 33449387 DOI: 10.1111/jam.15005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2020] [Revised: 01/04/2021] [Accepted: 01/12/2021] [Indexed: 11/30/2022]
Abstract
AIMS To develop a broadly applicable medium free of proteins with well-defined and reproducible chemical composition for the cultivation of various micro-organisms with food safety significance. METHODS AND RESULTS The defined medium was designed as a buffered minimal salt medium supplemented with amino acids, vitamins, trace metals and other nutrients. Various strains commonly used for food safety research were selected to test the new defined medium. We investigated single growth factors needed by different strains and the growth performance of each strain cultivated in the defined medium. Results showed that the tested strains initially grew slower in the defined medium compared to tryptic soy broth, but after an overnight incubation cultures from the defined medium reached adequately high cell densities. CONCLUSIONS The newly designed defined medium can be widely applied in food safety studies that require media with well-defined chemical constituents. SIGNIFICANCE AND IMPACT OF THE STUDY Defined media are important in studies of microbial metabolites and physiological properties. A defined medium capable of cultivating different strains simultaneously is needed in the food safety area. The new defined medium has broader applications in comparing different strains directly and provides more reproducible results.
Collapse
Affiliation(s)
- D Wang
- Department of Food Science and Technology, The Ohio State University, Columbus, OH, USA
| | - P Greenwood
- Department of Food Science and Technology, The Ohio State University, Columbus, OH, USA
| | - M S Klein
- Department of Food Science and Technology, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
6
|
Abstract
A synthesis of phenotypic and quantitative genomic traits is provided for bacteria and archaea, in the form of a scripted, reproducible workflow that standardizes and merges 26 sources. The resulting unified dataset covers 14 phenotypic traits, 5 quantitative genomic traits, and 4 environmental characteristics for approximately 170,000 strain-level and 15,000 species-aggregated records. It spans all habitats including soils, marine and fresh waters and sediments, host-associated and thermal. Trait data can find use in clarifying major dimensions of ecological strategy variation across species. They can also be used in conjunction with species and abundance sampling to characterize trait mixtures in communities and responses of traits along environmental gradients.
Collapse
|
7
|
Pelicaen R, Gonze D, Teusink B, De Vuyst L, Weckx S. Genome-Scale Metabolic Reconstruction of Acetobacter pasteurianus 386B, a Candidate Functional Starter Culture for Cocoa Bean Fermentation. Front Microbiol 2019; 10:2801. [PMID: 31921009 PMCID: PMC6915089 DOI: 10.3389/fmicb.2019.02801] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 11/18/2019] [Indexed: 01/17/2023] Open
Abstract
Acetobacter pasteurianus 386B is a candidate functional starter culture for the cocoa bean fermentation process. To allow in silico simulations of its related metabolism in response to different environmental conditions, a genome-scale metabolic model for A. pasteurianus 386B was reconstructed. This is the first genome-scale metabolic model reconstruction for a member of the genus Acetobacter. The metabolic network reconstruction process was based on extensive genome re-annotation and comparative genomics analyses. The information content related to the functional annotation of metabolic enzymes and transporters was placed in a metabolic context by exploring and curating a Pathway/Genome Database of A. pasteurianus 386B using the Pathway Tools software. Metabolic reactions and curated gene-protein-reaction associations were bundled into a genome-scale metabolic model of A. pasteurianus 386B, named iAp386B454, containing 454 genes, 322 reactions, and 296 metabolites embedded in two cellular compartments. The reconstructed model was validated by performing growth experiments in a defined medium, which revealed that lactic acid as the sole carbon source could sustain growth of this strain. Further, the reconstruction of the A. pasteurianus 386B genome-scale metabolic model revealed knowledge gaps concerning the metabolism of this strain, especially related to the biosynthesis of its cell envelope and the presence or absence of metabolite transporters.
Collapse
Affiliation(s)
- Rudy Pelicaen
- Research Group of Industrial Microbiology and Food Biotechnology (IMDO), Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel (VUB), Brussels, Belgium
- (IB) - Interuniversity Institute of Bioinformatics in Brussels (ULB-VUB), Brussels, Belgium
| | - Didier Gonze
- (IB) - Interuniversity Institute of Bioinformatics in Brussels (ULB-VUB), Brussels, Belgium
- Unité de Chronobiologie Théorique, Service de Chimie Physique, Faculté des Sciences, Université Libre de Bruxelles (ULB), Brussels, Belgium
| | - Bas Teusink
- Systems Bioinformatics, Vrije Universiteit Amsterdam, Amsterdam, Netherlands
| | - Luc De Vuyst
- Research Group of Industrial Microbiology and Food Biotechnology (IMDO), Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel (VUB), Brussels, Belgium
| | - Stefan Weckx
- Research Group of Industrial Microbiology and Food Biotechnology (IMDO), Faculty of Sciences and Bioengineering Sciences, Vrije Universiteit Brussel (VUB), Brussels, Belgium
- (IB) - Interuniversity Institute of Bioinformatics in Brussels (ULB-VUB), Brussels, Belgium
| |
Collapse
|
8
|
Ochs AR, Mehrabi M, Becker D, Asad MN, Zhao J, Zaragoza MV, Grosberg A. Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering. J Vis Exp 2019. [PMID: 31814616 DOI: 10.3791/60038] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Science relies on increasingly complex data sets for progress, but common data management methods such as spreadsheet programs are inadequate for the growing scale and complexity of this information. While database management systems have the potential to rectify these issues, they are not commonly utilized outside of business and informatics fields. Yet, many research labs already generate "medium sized", low velocity, multi-dimensional data that could greatly benefit from implementing similar systems. In this article, we provide a conceptual overview explaining how databases function and the advantages they provide in tissue engineering applications. Structural fibroblast data from individuals with a lamin A/C mutation was used to illustrate examples within a specific experimental context. Examples include visualizing multidimensional data, linking tables in a relational database structure, mapping a semi-automated data pipeline to convert raw data into structured formats, and explaining the underlying syntax of a query. Outcomes from analyzing the data were used to create plots of various arrangements and significance was demonstrated in cell organization in aligned environments between the positive control of Hutchinson-Gilford progeria, a well-known laminopathy, and all other experimental groups. In comparison to spreadsheets, database methods were enormously time efficient, simple to use once set up, allowed for immediate access of original file locations, and increased data rigor. In response to the National Institutes of Health (NIH) emphasis on experimental rigor, it is likely that many scientific fields will eventually adopt databases as common practice due to their strong capability to effectively organize complex data.
Collapse
Affiliation(s)
- Alexander R Ochs
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine
| | - Mehrsa Mehrabi
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine
| | - Danielle Becker
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine
| | - Mira N Asad
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine
| | - Jing Zhao
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine
| | - Michael V Zaragoza
- Pediatrics-Genetics & Genomics Division-School of Medicine, University of California, Irvine; Biological Chemistry-School of Medicine, University of California, Irvine
| | - Anna Grosberg
- Department of Biomedical Engineering, University of California, Irvine; The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine; Department of Chemical and Biomolecular Engineering, University of California, Irvine; Center for Complex Biological Systems, University of California, Irvine; The NSF-Simons Center for Multiscale Cell Fate Research (CMCF), University of California, Irvine;
| |
Collapse
|
9
|
Niu G, Li W. Next-Generation Drug Discovery to Combat Antimicrobial Resistance. Trends Biochem Sci 2019; 44:961-972. [PMID: 31256981 DOI: 10.1016/j.tibs.2019.05.005] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Revised: 05/30/2019] [Accepted: 05/31/2019] [Indexed: 12/16/2022]
Abstract
The widespread emergence of antibiotic-resistant pathogens poses a severe threat to public health. This problem becomes even worse with a coincident decline in the supply of new antibiotics. Conventional bioactivity-guided natural product discovery has failed to meet the urgent need for new antibiotics, largely due to limited resources and high rediscovery rates. Recent advances in cultivation techniques, analytical technologies, and genomics-based approaches have greatly expanded our access to previously underexploited microbial sources. These strategies will enable us to access new reservoirs of microorganisms and unleash their chemical potentials, thus opening new opportunities for the discovery of next-generation drugs to address the growing concerns of antimicrobial resistance.
Collapse
Affiliation(s)
- Guoqing Niu
- Biotechnology Research Center, Southwest University, Chongqing 400715, China; Chongqing Key Laboratory of Plant Resource Conservation and Germplasm Innovation, Southwest University, Chongqing 400715, China; State Cultivation Base of Crop Stress Biology for Southern Mountainous Land, Academy of Agricultural Sciences, Southwest University, Chongqing 400715, China.
| | - Wenli Li
- Key Laboratory of Marine Drugs, Ministry of Education of China, School of Medicine and Pharmacy, Ocean University of China, Qingdao 266003, China; Laboratory for Marine Drugs and Bioproducts, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266237, China.
| |
Collapse
|
10
|
Li G, Rabe KS, Nielsen J, Engqvist MKM. Machine Learning Applied to Predicting Microorganism Growth Temperatures and Enzyme Catalytic Optima. ACS Synth Biol 2019; 8:1411-1420. [PMID: 31117361 DOI: 10.1021/acssynbio.9b00099] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Enzymes that catalyze chemical reactions at high temperatures are used for industrial biocatalysis, applications in molecular biology, and as highly evolvable starting points for protein engineering. The optimal growth temperature (OGT) of organisms is commonly used to estimate the stability of enzymes encoded in their genomes, but the number of experimentally determined OGT values are limited, particularly for thermophilic organisms. Here, we report on the development of a machine learning model that can accurately predict OGT for bacteria, archaea, and microbial eukaryotes directly from their proteome-wide 2-mer amino acid composition. The trained model is made freely available for reuse. In a subsequent step we use OGT data in combination with amino acid composition of individual enzymes to develop a second machine learning model-for prediction of enzyme catalytic temperature optima ( Topt). The resulting model generates enzyme Topt estimates that are far superior to using OGT alone. Finally, we predict Topt for 6.5 million enzymes, covering 4447 enzyme classes, and make the resulting data set available to researchers. This work enables simple and rapid identification of enzymes that are potentially functional at extreme temperatures.
Collapse
Affiliation(s)
- Gang Li
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
| | - Kersten S Rabe
- Institute for Biological Interfaces 1 (IBG 1) , Karlsruhe Institute of Technology (KIT) , Group for Molecular Evolution, 76131 Karlsruhe , Germany
| | - Jens Nielsen
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
- Novo Nordisk Foundation Center for Biosustainability , Technical University of Denmark , DK-2800 Kgs. Lyngby , Denmark
| | - Martin K M Engqvist
- Department of Biology and Biological Engineering , Chalmers University of Technology , SE-412 96 Gothenburg , Sweden
| |
Collapse
|
11
|
Engqvist MKM. Correlating enzyme annotations with a large set of microbial growth temperatures reveals metabolic adaptations to growth at diverse temperatures. BMC Microbiol 2018; 18:177. [PMID: 30400856 PMCID: PMC6219164 DOI: 10.1186/s12866-018-1320-7] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2018] [Accepted: 10/16/2018] [Indexed: 12/15/2022] Open
Abstract
Background The ambient temperature of all habitats is a key physical property that shapes the biology of microbes inhabiting them. The optimal growth temperature (OGT) of a microbe, is therefore a key piece of data needed to understand evolutionary adaptations manifested in their genome sequence. Unfortunately there is no growth temperature database or easily downloadable dataset encompassing the majority of cultured microorganisms. We are thus limited in interpreting genomic data to identify temperature adaptations in microbes. Results In this work I significantly contribute to closing this gap by mining data from major culture collection centres to obtain growth temperature data for a nonredundant set of 21,498 microbes. The dataset (10.5281/zenodo.1175608) contains mainly bacteria and archaea and spans psychrophiles, mesophiles, thermophiles and hyperthermophiles. Using this data a full 43% of all protein entries in the UniProt database can be annotated with the growth temperature of the species from which they originate. I validate the dataset by showing a Pearson correlation of up to 0.89 between growth temperature and mean enzyme optima, a physiological property directly influenced by the growth temperature. Using the temperature dataset I correlate the genomic occurance of enzyme functional annotations with growth temperature. I identify 319 enzyme functions that either increase or decrease in occurrence with temperature. Eight metabolic pathways were statistically enriched for these enzyme functions. Furthermore, I establish a correlation between 33 domains of unknown function (DUFs) with growth temperature in microbes, four of which (DUF438, DUF1524, DUF1957 and DUF3458_C) were significant in both archaea and bacteria. Conclusions The growth temperature dataset enables large-scale correlation analysis with enzyme function- and domain-level annotations. Growth-temperature dependent changes in their occurrence highlight potential evolutionary adaptations. A few of the identified changes are previously known, such as the preference for menaquinone biosynthesis through the futalosine pathway in bacteria growing at high temperatures. Others represent important starting points for future studies, such as DUFs where their occurrence change with temperature. The growth temperature dataset should become a valuable community resource and will find additional, important, uses in correlating genomic, transcriptomic, proteomic, metabolomic, phenotypic or taxonomic properties with temperature in future studies. Electronic supplementary material The online version of this article (10.1186/s12866-018-1320-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Martin K M Engqvist
- Department of Biology and Biological Engineering, Chalmers University of Technology, Gothenburg, Sweden.
| |
Collapse
|
12
|
Tierrafría VH, Mejía-Almonte C, Camacho-Zaragoza JM, Salgado H, Alquicira K, Ishida C, Gama-Castro S, Collado-Vides J. MCO: towards an ontology and unified vocabulary for a framework-based annotation of microbial growth conditions. Bioinformatics 2018; 35:856-864. [PMID: 30137210 PMCID: PMC7963087 DOI: 10.1093/bioinformatics/bty689] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Revised: 06/22/2018] [Accepted: 08/16/2018] [Indexed: 01/31/2023] Open
Abstract
MOTIVATION A major component in increasing our understanding of the biology of an organism is the mapping of its genotypic potential into its phenotypic expression profiles. This mapping is executed by the machinery of gene regulation, which is essentially studied by changes in growth conditions. Although many efforts have been made to systematize the annotation of experimental conditions in microbiology, the available annotations are not based on a consistent and controlled vocabulary, making difficult the identification of biologically meaningful comparisons of knowledge derived from different experiments or laboratories. RESULTS We curated terms related to experimental conditions that affect gene expression in Escherichia coli K-12. Since this is the best-studied microorganism, the collected terms are the seed for the Microbial Conditions Ontology (MCO), a controlled and structured vocabulary that can be expanded to annotate microbial conditions in general. Moreover, we developed an annotation framework to describe experimental conditions, providing the foundation to identify regulatory networks that operate under particular conditions. AVAILABILITY AND IMPLEMENTATION As far as we know, MCO is the first ontology for growth conditions of any bacterial organism, and it is available at http://regulondb.ccg.unam.mx and https://github.com/microbial-conditions-ontology. Furthermore, we will disseminate MCO throughout the Open Biological and Biomedical Ontology (OBO) Foundry in order to set a standard for the annotation of gene expression data. This will enable comparison of data from diverse data sources. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - J M Camacho-Zaragoza
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - H Salgado
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - K Alquicira
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | - C Ishida
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
| | | | | |
Collapse
|
13
|
Ye C, Xu N, Dong C, Ye Y, Zou X, Chen X, Guo F, Liu L. IMGMD: A platform for the integration and standardisation of In silico Microbial Genome-scale Metabolic Models. Sci Rep 2017; 7:727. [PMID: 28389638 PMCID: PMC5429687 DOI: 10.1038/s41598-017-00820-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Accepted: 03/16/2017] [Indexed: 11/12/2022] Open
Abstract
Genome-scale metabolic models (GSMMs) constitute a platform that combines genome sequences and detailed biochemical information to quantify microbial physiology at the system level. To improve the unity, integrity, correctness, and format of data in published GSMMs, a consensus IMGMD database was built in the LAMP (Linux + Apache + MySQL + PHP) system by integrating and standardizing 328 GSMMs constructed for 139 microorganisms. The IMGMD database can help microbial researchers download manually curated GSMMs, rapidly reconstruct standard GSMMs, design pathways, and identify metabolic targets for strategies on strain improvement. Moreover, the IMGMD database facilitates the integration of wet-lab and in silico data to gain an additional insight into microbial physiology. The IMGMD database is freely available, without any registration requirements, at http://imgmd.jiangnan.edu.cn/database.
Collapse
Affiliation(s)
- Chao Ye
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
| | - Nan Xu
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
| | - Chuan Dong
- Key Laboratory for Neuroinformation of Ministry of Education, University of Electronic Science and Technology of China, No. 4, 2nd Section, North Jianshe Road, Chengdu, Sichuan, 610054, China
| | - Yuannong Ye
- School of Biology and Engineering, Guizhou Medical University, Dongqing Road, Huaxi District, Guiyang, Guizhou, 550025, China
- School of Big Health, Guizhou Medical University, Dongqing Road, Huaxi District, Guiyang, Guizhou, 550025, China
| | - Xuan Zou
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
| | - Xiulai Chen
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China
| | - Fengbiao Guo
- Key Laboratory for Neuroinformation of Ministry of Education, University of Electronic Science and Technology of China, No. 4, 2nd Section, North Jianshe Road, Chengdu, Sichuan, 610054, China
| | - Liming Liu
- State Key Laboratory of Food Science and Technology, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China.
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, Jiangnan University, 1800 Lihu Road, Wuxi, Jiangsu, 214122, China.
| |
Collapse
|
14
|
Harnessing the landscape of microbial culture media to predict new organism-media pairings. Nat Commun 2015; 6:8493. [PMID: 26460590 PMCID: PMC4633754 DOI: 10.1038/ncomms9493] [Citation(s) in RCA: 95] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2015] [Accepted: 08/27/2015] [Indexed: 01/01/2023] Open
Abstract
Culturing microorganisms is a critical step in understanding and utilizing microbial life. Here we map the landscape of existing culture media by extracting natural-language media recipes into a Known Media Database (KOMODO), which includes >18,000 strain-media combinations, >3300 media variants and compound concentrations (the entire collection of the Leibniz Institute DSMZ repository). Using KOMODO, we show that although media are usually tuned for individual strains using biologically common salts, trace metals and vitamins/cofactors are the most differentiating components between defined media of strains within a genus. We leverage KOMODO to predict new organism-media pairings using a transitivity property (74% growth in new in vitro experiments) and a phylogeny-based collaborative filtering tool (83% growth in new in vitro experiments and stronger growth on predicted well-scored versus poorly scored media). These resources are integrated into a web-based platform that predicts media given an organism's 16S rDNA sequence, facilitating future cultivation efforts.
Collapse
|
15
|
Imam S, Schäuble S, Brooks AN, Baliga NS, Price ND. Data-driven integration of genome-scale regulatory and metabolic network models. Front Microbiol 2015; 6:409. [PMID: 25999934 PMCID: PMC4419725 DOI: 10.3389/fmicb.2015.00409] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Accepted: 04/20/2015] [Indexed: 12/21/2022] Open
Abstract
Microbes are diverse and extremely versatile organisms that play vital roles in all ecological niches. Understanding and harnessing microbial systems will be key to the sustainability of our planet. One approach to improving our knowledge of microbial processes is through data-driven and mechanism-informed computational modeling. Individual models of biological networks (such as metabolism, transcription, and signaling) have played pivotal roles in driving microbial research through the years. These networks, however, are highly interconnected and function in concert-a fact that has led to the development of a variety of approaches aimed at simulating the integrated functions of two or more network types. Though the task of integrating these different models is fraught with new challenges, the large amounts of high-throughput data sets being generated, and algorithms being developed, means that the time is at hand for concerted efforts to build integrated regulatory-metabolic networks in a data-driven fashion. In this perspective, we review current approaches for constructing integrated regulatory-metabolic models and outline new strategies for future development of these network models for any microbial system.
Collapse
Affiliation(s)
- Saheed Imam
- Institute for Systems Biology Seattle, WA, USA
| | - Sascha Schäuble
- Institute for Systems Biology Seattle, WA, USA ; Jena University Language and Information Engineering Lab, Friedrich-Schiller-University Jena Jena, Germany
| | | | - Nitin S Baliga
- Institute for Systems Biology Seattle, WA, USA ; Departments of Biology and Microbiology, University of Washington Seattle, WA, USA ; Molecular and Cellular Biology Program, University of Washington Seattle, WA, USA ; Lawrence Berkeley National Lab Berkeley, CA, USA
| | | |
Collapse
|
16
|
Transparency in metabolic network reconstruction enables scalable biological discovery. Curr Opin Biotechnol 2015; 34:105-9. [PMID: 25562137 DOI: 10.1016/j.copbio.2014.12.010] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 12/11/2014] [Accepted: 12/12/2014] [Indexed: 12/19/2022]
Abstract
Reconstructing metabolic pathways has long been a focus of active research. Now, draft models can be generated from genomic annotation and used to simulate metabolic fluxes of mass and energy at the whole-cell scale. This approach has led to an explosion in the number of functional metabolic network models. However, more models have not led to expanded coverage of metabolic reactions known to occur in the biosphere. Thus, there exists opportunity to reconsider the process of reconstruction and model derivation to better support the less-scalable investigative processes of biocuration and experimentation. Realizing this opportunity to improve our knowledge of metabolism requires developing new tools that make reconstructions more useful by highlighting metabolic network knowledge limitations to guide future research.
Collapse
|