1
|
Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023; 47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open
Abstract
When selecting microbial strains for the production of fermented foods, various microbial phenotypes need to be taken into account to achieve target product characteristics, such as biosafety, flavor, texture, and health-promoting effects. Through continuous advances in sequencing technologies, microbial whole-genome sequences of increasing quality can now be obtained both cheaper and faster, which increases the relevance of genome-based characterization of microbial phenotypes. Prediction of microbial phenotypes from genome sequences makes it possible to quickly screen large strain collections in silico to identify candidates with desirable traits. Several microbial phenotypes relevant to the production of fermented foods can be predicted using knowledge-based approaches, leveraging our existing understanding of the genetic and molecular mechanisms underlying those phenotypes. In the absence of this knowledge, data-driven approaches can be applied to estimate genotype-phenotype relationships based on large experimental datasets. Here, we review computational methods that implement knowledge- and data-driven approaches for phenotype prediction, as well as methods that combine elements from both approaches. Furthermore, we provide examples of how these methods have been applied in industrial biotechnology, with special focus on the fermented food industry.
Collapse
Affiliation(s)
- Signe T Karlsen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Martin H Rau
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Benjamín J Sánchez
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Kristian Jensen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Ahmad A Zeidan
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| |
Collapse
|
2
|
Characterization and comparative transcriptome analyses of Salmonella enterica Enteritidis strains possessing different chlorine tolerance profiles. Lebensm Wiss Technol 2022. [DOI: 10.1016/j.lwt.2022.113945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
3
|
Krysenko S, Wohlleben W. Polyamine and Ethanolamine Metabolism in Bacteria as an Important Component of Nitrogen Assimilation for Survival and Pathogenicity. Med Sci (Basel) 2022; 10:40. [PMID: 35997332 PMCID: PMC9397018 DOI: 10.3390/medsci10030040] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 07/23/2022] [Accepted: 07/25/2022] [Indexed: 11/16/2022] Open
Abstract
Nitrogen is an essential element required for bacterial growth. It serves as a building block for the biosynthesis of macromolecules and provides precursors for secondary metabolites. Bacteria have developed the ability to use various nitrogen sources and possess two enzyme systems for nitrogen assimilation involving glutamine synthetase/glutamate synthase and glutamate dehydrogenase. Microorganisms living in habitats with changeable availability of nutrients have developed strategies to survive under nitrogen limitation. One adaptation is the ability to acquire nitrogen from alternative sources including the polyamines putrescine, cadaverine, spermidine and spermine, as well as the monoamine ethanolamine. Bacterial polyamine and monoamine metabolism is not only important under low nitrogen availability, but it is also required to survive under high concentrations of these compounds. Such conditions can occur in diverse habitats such as soil, plant tissues and human cells. Strategies of pathogenic and non-pathogenic bacteria to survive in the presence of poly- and monoamines offer the possibility to combat pathogens by using their capability to metabolize polyamines as an antibiotic drug target. This work aims to summarize the knowledge on poly- and monoamine metabolism in bacteria and its role in nitrogen metabolism.
Collapse
Affiliation(s)
- Sergii Krysenko
- Interfaculty Institute of Microbiology and Infection Medicine Tübingen (IMIT), Department of Microbiology and Biotechnology, University of Tübingen, Auf der Morgenstelle 28, 72076 Tübingen, Germany;
- Cluster of Excellence ‘Controlling Microbes to Fight Infections’, University of Tübingen, 72076 Tübingen, Germany
| | - Wolfgang Wohlleben
- Interfaculty Institute of Microbiology and Infection Medicine Tübingen (IMIT), Department of Microbiology and Biotechnology, University of Tübingen, Auf der Morgenstelle 28, 72076 Tübingen, Germany;
- Cluster of Excellence ‘Controlling Microbes to Fight Infections’, University of Tübingen, 72076 Tübingen, Germany
| |
Collapse
|
4
|
Khan I, Bai Y, Zha L, Ullah N, Ullah H, Shah SRH, Sun H, Zhang C. Mechanism of the Gut Microbiota Colonization Resistance and Enteric Pathogen Infection. Front Cell Infect Microbiol 2022; 11:716299. [PMID: 35004340 PMCID: PMC8733563 DOI: 10.3389/fcimb.2021.716299] [Citation(s) in RCA: 75] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 11/26/2021] [Indexed: 12/26/2022] Open
Abstract
The mammalian gut microbial community, known as the gut microbiota, comprises trillions of bacteria, which co-evolved with the host and has an important role in a variety of host functions that include nutrient acquisition, metabolism, and immunity development, and more importantly, it plays a critical role in the protection of the host from enteric infections associated with exogenous pathogens or indigenous pathobiont outgrowth that may result from healthy gut microbial community disruption. Microbiota evolves complex mechanisms to restrain pathogen growth, which included nutrient competition, competitive metabolic interactions, niche exclusion, and induction of host immune response, which are collectively termed colonization resistance. On the other hand, pathogens have also developed counterstrategies to expand their population and enhance their virulence to cope with the gut microbiota colonization resistance and cause infection. This review summarizes the available literature on the complex relationship occurring between the intestinal microbiota and enteric pathogens, describing how the gut microbiota can mediate colonization resistance against bacterial enteric infections and how bacterial enteropathogens can overcome this resistance as well as how the understanding of this complex interaction can inform future therapies against infectious diseases.
Collapse
Affiliation(s)
- Israr Khan
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Functional Genomics and Molecular Diagnosis, Lanzhou University, Lanzhou, China.,Cuiying Biomedical Research Centre, Lanzhou University Second Hospital, Lanzhou, China
| | - Yanrui Bai
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Functional Genomics and Molecular Diagnosis, Lanzhou University, Lanzhou, China.,Cuiying Biomedical Research Centre, Lanzhou University Second Hospital, Lanzhou, China
| | - Lajia Zha
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Functional Genomics and Molecular Diagnosis, Lanzhou University, Lanzhou, China.,Cuiying Biomedical Research Centre, Lanzhou University Second Hospital, Lanzhou, China
| | - Naeem Ullah
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China
| | - Habib Ullah
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China.,Cuiying Biomedical Research Centre, Lanzhou University Second Hospital, Lanzhou, China
| | - Syed Rafiq Hussain Shah
- Department of Microecology, School of Basic Medical Sciences, Dalian Medical University, Dalian, China
| | - Hui Sun
- Cuiying Biomedical Research Centre, Lanzhou University Second Hospital, Lanzhou, China
| | - Chunjiang Zhang
- School of Life Sciences, Lanzhou University, Lanzhou, China.,Key Laboratory of Cell Activities and Stress Adaptations, Ministry of Education, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Biomonitoring and Bioremediation for Environmental Pollution, Lanzhou University, Lanzhou, China.,Gansu Key Laboratory of Functional Genomics and Molecular Diagnosis, Lanzhou University, Lanzhou, China
| |
Collapse
|
5
|
Pourreza Shahri M, Kahanda I. Deep semi-supervised learning ensemble framework for classifying co-mentions of human proteins and phenotypes. BMC Bioinformatics 2021; 22:500. [PMID: 34656098 PMCID: PMC8520253 DOI: 10.1186/s12859-021-04421-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Accepted: 10/04/2021] [Indexed: 11/13/2022] Open
Abstract
Background Identifying human protein-phenotype relationships has attracted researchers in bioinformatics and biomedical natural language processing due to its importance in uncovering rare and complex diseases. Since experimental validation of protein-phenotype associations is prohibitive, automated tools capable of accurately extracting these associations from the biomedical text are in high demand. However, while the manual annotation of protein-phenotype co-mentions required for training such models is highly resource-consuming, extracting millions of unlabeled co-mentions is straightforward. Results In this study, we propose a novel deep semi-supervised ensemble framework that combines deep neural networks, semi-supervised, and ensemble learning for classifying human protein-phenotype co-mentions with the help of unlabeled data. This framework allows the ability to incorporate an extensive collection of unlabeled sentence-level co-mentions of human proteins and phenotypes with a small labeled dataset to enhance overall performance. We develop PPPredSS, a prototype of our proposed semi-supervised framework that combines sophisticated language models, convolutional networks, and recurrent networks. Our experimental results demonstrate that the proposed approach provides a new state-of-the-art performance in classifying human protein-phenotype co-mentions by outperforming other supervised and semi-supervised counterparts. Furthermore, we highlight the utility of PPPredSS in powering a curation assistant system through case studies involving a group of biologists. Conclusions This article presents a novel approach for human protein-phenotype co-mention classification based on deep, semi-supervised, and ensemble learning. The insights and findings from this work have implications for biomedical researchers, biocurators, and the text mining community working on biomedical relationship extraction.
Collapse
Affiliation(s)
| | - Indika Kahanda
- School of Computing, University of North Florida, Jacksonville, USA.
| |
Collapse
|
6
|
Dank A, Zeng Z, Boeren S, Notebaart RA, Smid EJ, Abee T. Bacterial Microcompartment-Dependent 1,2-Propanediol Utilization of Propionibacterium freudenreichii. Front Microbiol 2021; 12:679827. [PMID: 34054787 PMCID: PMC8149966 DOI: 10.3389/fmicb.2021.679827] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 04/16/2021] [Indexed: 12/28/2022] Open
Abstract
Bacterial microcompartments (BMCs) are proteinaceous prokaryotic organelles that enable the utilization of substrates such as 1,2-propanediol and ethanolamine. BMCs are mostly linked to the survival of particular pathogenic bacteria by providing a growth advantage through utilization of 1,2-propanediol and ethanolamine which are abundantly present in the human gut. Although a 1,2-propanediol utilization cluster was found in the probiotic bacterium Propionibacterium freudenreichii, BMC-mediated metabolism of 1,2-propanediol has not been demonstrated experimentally in P. freudenreichii. In this study we show that P. freudenreichii DSM 20271 metabolizes 1,2-propanediol in anaerobic conditions to propionate and 1-propanol. Furthermore, 1,2-propanediol induced the formation of BMCs, which were visualized by transmission electron microscopy and resembled BMCs found in other bacteria. Proteomic analysis of 1,2-propanediol grown cells compared to L-lactate grown cells showed significant upregulation of proteins involved in propanediol-utilization (pdu-cluster), DNA repair mechanisms and BMC shell proteins while proteins involved in oxidative phosphorylation were down-regulated. 1,2-Propanediol utilizing cells actively produced vitamin B12 (cobalamin) in similar amounts as cells growing on L-lactate. The ability to metabolize 1,2-propanediol may have implications for human gut colonization and modulation, and can potentially aid in delivering propionate and vitamin B12in situ.
Collapse
Affiliation(s)
- Alexander Dank
- Food Microbiology, Wageningen University and Research, Wageningen, Netherlands
| | - Zhe Zeng
- Food Microbiology, Wageningen University and Research, Wageningen, Netherlands
| | - Sjef Boeren
- Laboratory of Biochemistry, Wageningen University and Research, Wageningen, Netherlands
| | - Richard A Notebaart
- Food Microbiology, Wageningen University and Research, Wageningen, Netherlands
| | - Eddy J Smid
- Food Microbiology, Wageningen University and Research, Wageningen, Netherlands
| | - Tjakko Abee
- Food Microbiology, Wageningen University and Research, Wageningen, Netherlands
| |
Collapse
|
7
|
Cui H, Zuo S, Liu Z, Liu H, Wang J, You T, Zheng Z, Zhou Y, Qian X, Yao H, Xie L, Liu T, Sham PC, Yu Y, Li MJ. The support of genetic evidence for cardiovascular risk induced by antineoplastic drugs. SCIENCE ADVANCES 2020; 6:eabb8543. [PMID: 33055159 PMCID: PMC7556838 DOI: 10.1126/sciadv.abb8543] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 08/28/2020] [Indexed: 05/04/2023]
Abstract
Cardiovascular dysfunction is one of the most common complications of long-term cancer treatment. Growing evidence has shown that antineoplastic drugs can increase cardiovascular risk during cancer therapy, seriously affecting patient survival. However, little is known about the genetic factors associated with the cardiovascular risk of antineoplastic drugs. We established a compendium of genetic evidence that supports cardiovascular risk induced by antineoplastic drugs. Most of this genetic evidence is attributed to causal alleles altering the expression of cardiovascular disease genes. We found that antineoplastic drugs predicted to induce cardiovascular risk are significantly enriched in drugs associated with cardiovascular adverse reactions, including many first-line cancer treatments. Functional experiments validated that retinoid X receptor agonists can reduce triglyceride lipolysis, thus modulating cardiovascular risk. Our results establish a link between the causal allele of cardiovascular disease genes and the direction of pharmacological modulation, which could facilitate cancer drug discovery and clinical trial design.
Collapse
Affiliation(s)
- Hui Cui
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
- Key Laboratory of Food Safety Research, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute for Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Shengkai Zuo
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zipeng Liu
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Huanhuan Liu
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Jianhua Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Tianyi You
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zhanye Zheng
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xinyi Qian
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Hongcheng Yao
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lu Xie
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, Shanghai, China
| | - Tong Liu
- Tianjin Key Laboratory of Ionic-Molecular Function of Cardiovascular Disease, Department of Cardiology, Tianjin Institute of Cardiology, Second Hospital of Tianjin Medical University, Tianjin, China
| | - Pak Chung Sham
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Ying Yu
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
- Key Laboratory of Food Safety Research, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute for Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| |
Collapse
|
8
|
Ravcheev DA, Moussu L, Smajic S, Thiele I. Comparative Genomic Analysis Reveals Novel Microcompartment-Associated Metabolic Pathways in the Human Gut Microbiome. Front Genet 2019; 10:636. [PMID: 31333721 PMCID: PMC6620236 DOI: 10.3389/fgene.2019.00636] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 06/18/2019] [Indexed: 12/16/2022] Open
Abstract
Bacterial microcompartments are self-assembling subcellular structures surrounded by a semipermeable protein shell and found only in bacteria, but not archaea or eukaryotes. The general functions of the bacterial microcompartments are to concentrate enzymes, metabolites, and cofactors for multistep pathways; maintain the cofactor ratio; protect the cell from toxic metabolic intermediates; and protect the encapsulated pathway from unwanted side reactions. The bacterial microcompartments were suggested to play a significant role in organisms of the human gut microbiome, especially for various pathogens. Here, we used a comparative genomics approach to analyze the bacterial microcompartments in 646 individual genomes of organisms commonly found in the human gut microbiome. The bacterial microcompartments were found in 150 (23.2%) analyzed genomes. These microcompartments include previously known ones for the utilization of ethanolamine, 1,2-propanediol, choline, and fucose/rhamnose. Moreover, we reconstructed two novel pathways associated with the bacterial microcompartments. These pathways are catabolic pathways for the utilization of 1-amino-2-propanol/1-amino-2-propanone and xanthine. Remarkably, the xanthine utilization pathway does not demonstrate similarity to previously known microcompartment-associated pathways. Thus, we describe a novel type of bacterial microcompartment.
Collapse
Affiliation(s)
- Dmitry A Ravcheev
- School of Medicine, National University of Ireland, Galway, University Road, Galway, Ireland.,Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Lubin Moussu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Semra Smajic
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Ines Thiele
- School of Medicine, National University of Ireland, Galway, University Road, Galway, Ireland.,Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.,Discipline of Microbiology, School of Natural Sciences, National University of Ireland, Galway, University Road, Galway, Ireland
| |
Collapse
|
9
|
Saqi M, Lysenko A, Guo YK, Tsunoda T, Auffray C. Navigating the disease landscape: knowledge representations for contextualizing molecular signatures. Brief Bioinform 2019; 20:609-623. [PMID: 29684165 PMCID: PMC6556902 DOI: 10.1093/bib/bby025] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 02/05/2018] [Indexed: 12/14/2022] Open
Abstract
Large amounts of data emerging from experiments in molecular medicine are leading to the identification of molecular signatures associated with disease subtypes. The contextualization of these patterns is important for obtaining mechanistic insight into the aberrant processes associated with a disease, and this typically involves the integration of multiple heterogeneous types of data. In this review, we discuss knowledge representations that can be useful to explore the biological context of molecular signatures, in particular three main approaches, namely, pathway mapping approaches, molecular network centric approaches and approaches that represent biological statements as knowledge graphs. We discuss the utility of each of these paradigms, illustrate how they can be leveraged with selected practical examples and identify ongoing challenges for this field of research.
Collapse
Affiliation(s)
- Mansoor Saqi
- Mansoor Saqi Data Science Institute, Imperial College London, UK
| | - Artem Lysenko
- Artem Lysenko Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yi-Ke Guo
- Yi-Ke Guo Data Science Institute, Imperial College London, UK
| | - Tatsuhiko Tsunoda
- Tatsuhiko Tsunoda Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan CREST, JST, Tokyo, Japan Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Charles Auffray
- Charles Auffray European Institute for Systems Biology and Medicine, Lyon, France
| |
Collapse
|
10
|
Metabolic adaptation of adherent-invasive Escherichia coli to exposure to bile salts. Sci Rep 2019; 9:2175. [PMID: 30778122 PMCID: PMC6379400 DOI: 10.1038/s41598-019-38628-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 12/13/2018] [Indexed: 12/12/2022] Open
Abstract
The adherent-invasive Escherichia coli (AIEC), which colonize the ileal mucosa of Crohn’s disease patients, adhere to intestinal epithelial cells, invade them and exacerbate intestinal inflammation. The high nutrient competition between the commensal microbiota and AIEC pathobiont requires the latter to occupy their own metabolic niches to survive and proliferate within the gut. In this study, a global RNA sequencing of AIEC strain LF82 has been used to observe the impact of bile salts on the expression of metabolic genes. The results showed a global up-regulation of genes involved in degradation and a down-regulation of those implicated in biosynthesis. The main up-regulated degradation pathways were ethanolamine, 1,2-propanediol and citrate utilization, as well as the methyl-citrate pathway. Our study reveals that ethanolamine utilization bestows a competitive advantage of AIEC strains that are metabolically capable of its degradation in the presence of bile salts. We observed that bile salts activated secondary metabolism pathways that communicate to provide an energy benefit to AIEC. Bile salts may be used by AIEC as an environmental signal to promote their colonization.
Collapse
|
11
|
Kafkas Ş, Hoehndorf R. Ontology based text mining of gene-phenotype associations: application to candidate gene prediction. Database (Oxford) 2019; 2019:baz019. [PMID: 30809638 PMCID: PMC6391585 DOI: 10.1093/database/baz019] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 01/09/2019] [Accepted: 01/26/2019] [Indexed: 01/07/2023]
Abstract
Gene-phenotype associations play an important role in understanding the disease mechanisms which is a requirement for treatment development. A portion of gene-phenotype associations are observed mainly experimentally and made publicly available through several standard resources such as MGI. However, there is still a vast amount of gene-phenotype associations buried in the biomedical literature. Given the large amount of literature data, we need automated text mining tools to alleviate the burden in manual curation of gene-phenotype associations and to develop comprehensive resources. In this study, we present an ontology-based approach in combination with statistical methods to text mine gene-phenotype associations from the literature. Our method achieved AUC values of 0.90 and 0.75 in recovering known gene-phenotype associations from HPO and MGI respectively. We posit that candidate genes and their relevant diseases should be expressed with similar phenotypes in publications. Thus, we demonstrate the utility of our approach by predicting disease candidate genes based on the semantic similarities of phenotypes associated with genes and diseases. To the best of our knowledge, this is the first study using an ontology based approach to extract gene-phenotype associations from the literature. We evaluated our disease candidate prediction model on the gene-disease associations from MGI. Our model achieved AUC values of 0.90 and 0.87 on OMIM (human) and MGI (mouse) datasets of gene-disease associations respectively. Our manual analysis on the text mined data revealed that our method can accurately extract gene-phenotype associations which are not currently covered by the existing public gene-phenotype resources. Overall, results indicate that our method can precisely extract known as well as new gene-phenotype associations from literature. All the data and methods are available at https://github.com/bio-ontology-research-group/genepheno.
Collapse
Affiliation(s)
- Şenay Kafkas
- Computer, Electrical and Mathematical Sciences & Engineering Division, Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Kingdom of Saudi Arabia
| | - Robert Hoehndorf
- Computer, Electrical and Mathematical Sciences & Engineering Division, Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Kingdom of Saudi Arabia
| |
Collapse
|
12
|
Ethanolamine Influences Human Commensal Escherichia coli Growth, Gene Expression, and Competition with Enterohemorrhagic E. coli O157:H7. mBio 2018; 9:mBio.01429-18. [PMID: 30279284 PMCID: PMC6168858 DOI: 10.1128/mbio.01429-18] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The microbiota protects the host from invading pathogens by limiting access to nutrients. In turn, bacterial pathogens selectively exploit metabolites not readily used by the microbiota to establish infection. Ethanolamine has been linked to pathogenesis of diverse pathogens by serving as a noncompetitive metabolite that enhances pathogen growth as well as a signal that modulates virulence. Although ethanolamine is abundant in the gastrointestinal tract, the prevailing idea is that commensal bacteria do not utilize EA, and thus, EA utilization has been particularly associated with pathogenesis. Here, we provide evidence that two human commensal Escherichia coli isolates readily utilize ethanolamine to enhance growth, modulate gene expression, and outgrow the pathogen enterohemorrhagic E. coli. These data indicate a more complex role for ethanolamine in host-microbiota-pathogen interactions. A core principle of bacterial pathogenesis is that pathogens preferentially utilize metabolites that commensal bacteria do not in order to sidestep nutritional competition. The metabolite ethanolamine (EA) is well recognized to play a central role in host adaptation for diverse pathogens. EA promotes growth and influences virulence during host infection. Although genes encoding EA utilization have been identified in diverse bacteria (nonpathogenic and pathogenic), a prevailing idea is that commensal bacteria do not utilize EA to enhance growth, and thus, EA is a noncompetitive metabolite for pathogens. Here, we show that EA augments growth of two human commensal strains of Escherichia coli. Significantly, these commensal strains grow more rapidly than, and even outcompete, the pathogen enterohemorrhagic E. coli O157:H7 specifically when EA is provided as the sole nitrogen source. Moreover, EA-dependent signaling is similarly conserved in the human commensal E. coli strain HS and influences expression of adhesins. These findings suggest a more extensive role for EA utilization in bacterial physiology and host-microbiota-pathogen interactions than previously appreciated.
Collapse
|
13
|
Yu K, Lung PY, Zhao T, Zhao P, Tseng YY, Zhang J. Automatic extraction of protein-protein interactions using grammatical relationship graph. BMC Med Inform Decis Mak 2018; 18:42. [PMID: 30066644 PMCID: PMC6069288 DOI: 10.1186/s12911-018-0628-4] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Background Relationships between bio-entities (genes, proteins, diseases, etc.) constitute a significant part of our knowledge. Most of this information is documented as unstructured text in different forms, such as books, articles and on-line pages. Automatic extraction of such information and storing it in structured form could help researchers more easily access such information and also make it possible to incorporate it in advanced integrative analysis. In this study, we developed a novel approach to extract bio-entity relationships information using Nature Language Processing (NLP) and a graph-theoretic algorithm. Methods Our method, called GRGT (Grammatical Relationship Graph for Triplets), not only extracts the pairs of terms that have certain relationships, but also extracts the type of relationship (the word describing the relationships). In addition, the directionality of the relationship can also be extracted. Our method is based on the assumption that a triplet exists for a pair of interactions. A triplet is defined as two terms (entities) and an interaction word describing the relationship of the two terms in a sentence. We first use a sentence parsing tool to obtain the sentence structure represented as a dependency graph where words are nodes and edges are typed dependencies. The shortest paths among the pairs of words in the triplet are then extracted, which form the basis for our information extraction method. Flexible pattern matching scheme was then used to match a triplet graph with unknown relationship to those triplet graphs with labels (True or False) in the database. Results We applied the method on three benchmark datasets to extract the protein-protein-interactions (PPIs), and obtained better precision than the top performing methods in literature. Conclusions We have developed a method to extract the protein-protein interactions from biomedical literature. PPIs extracted by our method have higher precision among other methods, suggesting that our method can be used to effectively extract PPIs and deposit them into databases. Beyond extracting PPIs, our method could be easily extended to extracting relationship information between other bio-entities.
Collapse
Affiliation(s)
- Kaixian Yu
- Department of Statistics, Florida State University, Tallahassee, FL, 32306, USA. .,Department of Biostatistics, University of Texas MD Anderson Cancer Center, Houston, TX, 77054, USA.
| | - Pei-Yau Lung
- Department of Statistics, Florida State University, Tallahassee, FL, 32306, USA
| | - Tingting Zhao
- Department of Geography, Florida State University, Tallahassee, FL, 32306, USA
| | - Peixiang Zhao
- Department of Computer Science, Florida State University, Tallahassee, FL, 32306, USA
| | - Yan-Yuan Tseng
- Center for Molecular Medicine and Genetics, School of Medicine, Wayne State University, Detroit, MI, 48201, USA
| | - Jinfeng Zhang
- Department of Statistics, Florida State University, Tallahassee, FL, 32306, USA.
| |
Collapse
|
14
|
Khordad M, Mercer RE. Identifying genotype-phenotype relationships in biomedical text. J Biomed Semantics 2017; 8:57. [PMID: 29212530 PMCID: PMC5719522 DOI: 10.1186/s13326-017-0163-8] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 10/28/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND One important type of information contained in biomedical research literature is the newly discovered relationships between phenotypes and genotypes. Because of the large quantity of literature, a reliable automatic system to identify this information for future curation is essential. Such a system provides important and up to date data for database construction and updating, and even text summarization. In this paper we present a machine learning method to identify these genotype-phenotype relationships. No large human-annotated corpus of genotype-phenotype relationships currently exists. So, a semi-automatic approach has been used to annotate a small labelled training set and a self-training method is proposed to annotate more sentences and enlarge the training set. RESULTS The resulting machine-learned model was evaluated using a separate test set annotated by an expert. The results show that using only the small training set in a supervised learning method achieves good results (precision: 76.47, recall: 77.61, F-measure: 77.03) which are improved by applying a self-training method (precision: 77.70, recall: 77.84, F-measure: 77.77). CONCLUSIONS Relationships between genotypes and phenotypes is biomedical information pivotal to the understanding of a patient's situation. Our proposed method is the first attempt to make a specialized system to identify genotype-phenotype relationships in biomedical literature. We achieve good results using a small training set. To improve the results other linguistic contexts need to be explored and an appropriately enlarged training set is required.
Collapse
Affiliation(s)
- Maryam Khordad
- Department of Computer Science, University of Western Ontario, 1151 Richmond Street, London, N6A 5B7 Canada
| | - Robert E. Mercer
- Department of Computer Science, University of Western Ontario, 1151 Richmond Street, London, N6A 5B7 Canada
| |
Collapse
|
15
|
Characterizing Gene and Protein Crosstalks in Subjects at Risk of Developing Alzheimer’s Disease: A New Computational Approach. Processes (Basel) 2017. [DOI: 10.3390/pr5030047] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
|
16
|
The Agr-Like Quorum Sensing System Is Required for Pathogenesis of Necrotic Enteritis Caused by Clostridium perfringens in Poultry. Infect Immun 2017; 85:IAI.00975-16. [PMID: 28373356 DOI: 10.1128/iai.00975-16] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2016] [Accepted: 03/24/2017] [Indexed: 12/13/2022] Open
Abstract
Clostridium perfringens encodes at least two different quorum sensing (QS) systems, the Agr-like and LuxS, and recent studies have highlighted their importance in the regulation of toxin production and virulence. The role of QS in the pathogenesis of necrotic enteritis (NE) in poultry and the regulation of NetB, the key toxin involved, has not yet been investigated. We have generated isogenic agrB-null and complemented strains from parent strain CP1 and demonstrated that the virulence of the agrB-null mutant was strongly attenuated in a chicken NE model system and restored by complementation. The production of NetB, a key NE-associated toxin, was dramatically reduced in the agrB mutant at both the transcriptional and protein levels, though not in a luxS mutant. Transwell assays confirmed that the Agr-like QS system controls NetB production through a diffusible signal. Global gene expression analysis of the agrB mutant identified additional genes modulated by Agr-like QS, including operons related to phospholipid metabolism and adherence, which may also play a role in NE pathogenesis. This study provides the first evidence that the Agr-like QS system is critical for NE pathogenesis and identifies a number of Agr-regulated genes, most notably netB, that are potentially involved in mediating its effects. The Agr-like QS system thus may serve as a target for developing novel interventions to prevent NE in chickens.
Collapse
|
17
|
Faber F, Thiennimitr P, Spiga L, Byndloss MX, Litvak Y, Lawhon S, Andrews-Polymenis HL, Winter SE, Bäumler AJ. Respiration of Microbiota-Derived 1,2-propanediol Drives Salmonella Expansion during Colitis. PLoS Pathog 2017; 13:e1006129. [PMID: 28056091 PMCID: PMC5215881 DOI: 10.1371/journal.ppat.1006129] [Citation(s) in RCA: 122] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Accepted: 12/14/2016] [Indexed: 12/16/2022] Open
Abstract
Intestinal inflammation caused by Salmonella enterica serovar Typhimurium increases the availability of electron acceptors that fuel a respiratory growth of the pathogen in the intestinal lumen. Here we show that one of the carbon sources driving this respiratory expansion in the mouse model is 1,2-propanediol, a microbial fermentation product. 1,2-propanediol utilization required intestinal inflammation induced by virulence factors of the pathogen. S. Typhimurium used both aerobic and anaerobic respiration to consume 1,2-propanediol and expand in the murine large intestine. 1,2-propanediol-utilization did not confer a benefit in germ-free mice, but the pdu genes conferred a fitness advantage upon S. Typhimurium in mice mono-associated with Bacteroides fragilis or Bacteroides thetaiotaomicron. Collectively, our data suggest that intestinal inflammation enables S. Typhimurium to sidestep nutritional competition by respiring a microbiota-derived fermentation product. Salmonella enterica serovar Typhimurium induces intestinal inflammation to induce the generation of host-derived respiratory electron acceptors, thereby driving a respiratory pathogen expansion, which aids infectious transmission by the fecal oral route. However, the identity of nutrients serving as electron donors to enable S. Typhimurium to edge out competing microbes in the competitive environment of the gut are just beginning to be worked out. Here we demonstrate that aerobic and anaerobic respiratory pathways cooperate to promote growth of Salmonella on the microbial fermentation product 1,2-propanediol. We propose that pathogen-induced intestinal inflammation enables Salmonella to sidestep nutritional competition with the largely anaerobic microbiota by respiring a microbe-derived metabolite that cannot be consumed by fermentation.
Collapse
Affiliation(s)
- Franziska Faber
- Department of Medial Microbiology and Immunology, School of Medicine, University of California Davis, Davis, CA, United States of America
| | - Parameth Thiennimitr
- Department of Microbiology, Faculty of Medicine, Chiang Mai University, Chiang Mai, Thailand
| | - Luisella Spiga
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX, United States of America
| | - Mariana X. Byndloss
- Department of Medial Microbiology and Immunology, School of Medicine, University of California Davis, Davis, CA, United States of America
| | - Yael Litvak
- Department of Medial Microbiology and Immunology, School of Medicine, University of California Davis, Davis, CA, United States of America
| | - Sara Lawhon
- Department of Veterinary Pathobiology, College of Veterinary Medicine, Texas A&M University, College Station, TX, United States of America
| | - Helene L. Andrews-Polymenis
- Department of Microbial Pathogenesis and Immunology, College of Medicine, Texas A&M University Health Science Center, Bryan, TX, United States of America
| | - Sebastian E. Winter
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, TX, United States of America
| | - Andreas J. Bäumler
- Department of Medial Microbiology and Immunology, School of Medicine, University of California Davis, Davis, CA, United States of America
- * E-mail:
| |
Collapse
|
18
|
Abstract
Microorganisms play a primary role in regulating biogeochemical cycles and are a valuable source of enzymes that have biotechnological applications, such as carbohydrate-active enzymes (CAZymes). However, the inability to culture the majority of microorganisms that exist in natural ecosystems using common culture-dependent techniques restricts access to potentially novel cellulolytic bacteria and beneficial enzymes. The development of molecular-based culture-independent methods such as metagenomics enables researchers to study microbial communities directly from environmental samples, and presents a platform from which enzymes of interest can be sourced. We outline key methodological stages that are required as well as describe specific protocols that are currently used for metagenomic projects dedicated to CAZyme discovery.
Collapse
Affiliation(s)
- Benoit J Kunath
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, 5003, 1432, Ås, Norway
| | - Andreas Bremges
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, 38124, Braunschweig, Germany
- German Center for Infection Research (DZIF), 38124, Braunschweig, Germany
| | - Aaron Weimann
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, 38124, Braunschweig, Germany
| | - Alice C McHardy
- Computational Biology of Infection Research, Helmholtz Centre for Infection Research, 38124, Braunschweig, Germany
| | - Phillip B Pope
- Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, 5003, 1432, Ås, Norway.
| |
Collapse
|
19
|
Rolhion N, Chassaing B. When pathogenic bacteria meet the intestinal microbiota. Philos Trans R Soc Lond B Biol Sci 2016; 371:20150504. [PMID: 27672153 PMCID: PMC5052746 DOI: 10.1098/rstb.2015.0504] [Citation(s) in RCA: 81] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2016] [Indexed: 12/25/2022] Open
Abstract
The intestinal microbiota is a large and diverse microbial community that inhabits the intestinal tract, containing about 100 trillion bacteria from 500-1000 distinct species that, collectively, provide multiple benefits to the host. The gut microbiota contributes to nutrient absorption and maturation of the immune system, and also plays a central role in protection of the host from enteric bacterial infection. On the other hand, many enteric pathogens have developed strategies in order to be able to outcompete the intestinal community, leading to infection and/or chronic diseases. This review will summarize findings describing the complex relationship occurring between the intestinal microbiota and enteric pathogens, as well as how future therapies can ultimately benefit from such discoveries.This article is part of the themed issue 'The new bacteriology'.
Collapse
Affiliation(s)
- Nathalie Rolhion
- Institut Pasteur, Unité des interactions Bactéries-Cellules, 75015 Paris, France Inserm, U604, 75015 Paris, France INRA, Unité sous contrat 2020, 75015 Paris, France
| | - Benoit Chassaing
- Institute for Biomedical Sciences, Center for Inflammation, Immunity, and Infection, Georgia State University, Atlanta, GA 30303, USA
| |
Collapse
|
20
|
Brbić M, Piškorec M, Vidulin V, Kriško A, Šmuc T, Supek F. The landscape of microbial phenotypic traits and associated genes. Nucleic Acids Res 2016; 44:10074-10090. [PMID: 27915291 PMCID: PMC5137458 DOI: 10.1093/nar/gkw964] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2016] [Revised: 09/21/2016] [Accepted: 10/11/2016] [Indexed: 12/31/2022] Open
Abstract
Bacteria and Archaea display a variety of phenotypic traits and can adapt to diverse ecological niches. However, systematic annotation of prokaryotic phenotypes is lacking. We have therefore developed ProTraits, a resource containing ∼545 000 novel phenotype inferences, spanning 424 traits assigned to 3046 bacterial and archaeal species. These annotations were assigned by a computational pipeline that associates microbes with phenotypes by text-mining the scientific literature and the broader World Wide Web, while also being able to define novel concepts from unstructured text. Moreover, the ProTraits pipeline assigns phenotypes by drawing extensively on comparative genomics, capturing patterns in gene repertoires, codon usage biases, proteome composition and co-occurrence in metagenomes. Notably, we find that gene synteny is highly predictive of many phenotypes, and highlight examples of gene neighborhoods associated with spore-forming ability. A global analysis of trait interrelatedness outlined clusters in the microbial phenotype network, suggesting common genetic underpinnings. Our extended set of phenotype annotations allows detection of 57 088 high confidence gene-trait links, which recover many known associations involving sporulation, flagella, catalase activity, aerobicity, photosynthesis and other traits. Over 99% of the commonly occurring gene families are involved in genetic interactions conditional on at least one phenotype, suggesting that epistasis has a major role in shaping microbial gene content.
Collapse
Affiliation(s)
- Maria Brbić
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Matija Piškorec
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Vedrana Vidulin
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Anita Kriško
- Mediterranean Institute of Life Sciences, 21000 Split, Croatia
| | - Tomislav Šmuc
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia
| | - Fran Supek
- Division of Electronics, Ruder Boskovic Institute, 10000 Zagreb, Croatia .,EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain.,Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain
| |
Collapse
|
21
|
Bäumler AJ, Sperandio V. Interactions between the microbiota and pathogenic bacteria in the gut. Nature 2016; 535:85-93. [PMID: 27383983 PMCID: PMC5114849 DOI: 10.1038/nature18849] [Citation(s) in RCA: 809] [Impact Index Per Article: 101.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2015] [Accepted: 04/22/2016] [Indexed: 02/07/2023]
Abstract
The microbiome has an important role in human health. Changes in the microbiota can confer resistance to or promote infection by pathogenic bacteria. Antibiotics have a profound impact on the microbiota that alters the nutritional landscape of the gut and can lead to the expansion of pathogenic populations. Pathogenic bacteria exploit microbiota-derived sources of carbon and nitrogen as nutrients and regulatory signals to promote their own growth and virulence. By eliciting inflammation, these bacteria alter the intestinal environment and use unique systems for respiration and metal acquisition to drive their expansion. Unravelling the interactions between the microbiota, the host and pathogenic bacteria will produce strategies for manipulating the microbiota against infectious diseases.
Collapse
Affiliation(s)
- Andreas J Bäumler
- Department of Medical Microbiology and Immunology, University of California, Davis, School of Medicine, Davis, California 95616, USA
| | - Vanessa Sperandio
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9048, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9038, USA
| |
Collapse
|
22
|
Li Z, Tan Z, Hao S, Jin B, Deng X, Hu G, Liu X, Zhang J, Jin H, Huang M, Kanegaye JT, Tremoulet AH, Burns JC, Wu J, Cohen HJ, Ling XB. Urinary Colorimetric Sensor Array and Algorithm to Distinguish Kawasaki Disease from Other Febrile Illnesses. PLoS One 2016; 11:e0146733. [PMID: 26859297 PMCID: PMC4747548 DOI: 10.1371/journal.pone.0146733] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Accepted: 12/20/2015] [Indexed: 01/12/2023] Open
Abstract
Objectives Kawasaki disease (KD) is an acute pediatric vasculitis of infants and young children with unknown etiology and no specific laboratory-based test to identify. A specific molecular diagnostic test is urgently needed to support the clinical decision of proper medical intervention, preventing subsequent complications of coronary artery aneurysms. We used a simple and low-cost colorimetric sensor array to address the lack of a specific diagnostic test to differentiate KD from febrile control (FC) patients with similar rash/fever illnesses. Study Design Demographic and clinical data were prospectively collected for subjects with KD and FCs under standard protocol. After screening using a genetic algorithm, eleven compounds including metalloporphyrins, pH indicators, redox indicators and solvatochromic dye categories, were selected from our chromatic compound library (n = 190) to construct a colorimetric sensor array for diagnosing KD. Quantitative color difference analysis led to a decision-tree-based KD diagnostic algorithm. Results This KD sensing array allowed the identification of 94% of KD subjects (receiver operating characteristic [ROC] area under the curve [AUC] 0.981) in the training set (33 KD, 33 FC) and 94% of KD subjects (ROC AUC: 0.873) in the testing set (16 KD, 17 FC). Color difference maps reconstructed from the digital images of the sensing compounds demonstrated distinctive patterns differentiating KD from FC patients. Conclusions The colorimetric sensor array, composed of common used chemical compounds, is an easily accessible, low-cost method to realize the discrimination of subjects with KD from other febrile illness.
Collapse
Affiliation(s)
- Zhen Li
- Institution of Microanalytical System, Zhejiang University, Hangzhou, Zhejiang, China
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Zhou Tan
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Shiying Hao
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Bo Jin
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Xiaohong Deng
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Guang Hu
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Xiaodan Liu
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Jie Zhang
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Hua Jin
- Department of Surgery, Stanford University, Stanford, California, United States of America
| | - Min Huang
- Shanghai Children's Hospital, Shanghai Jiao Tong University, Shanghai, China
| | - John T. Kanegaye
- Department of Pediatrics, University of California San Diego, La Jolla, California, United States of America
- Rady Children’s Hospital San Diego, San Diego, California, United States of America
| | - Adriana H. Tremoulet
- Department of Pediatrics, University of California San Diego, La Jolla, California, United States of America
- Rady Children’s Hospital San Diego, San Diego, California, United States of America
| | - Jane C. Burns
- Department of Pediatrics, University of California San Diego, La Jolla, California, United States of America
- Rady Children’s Hospital San Diego, San Diego, California, United States of America
| | - Jianmin Wu
- Institution of Microanalytical System, Zhejiang University, Hangzhou, Zhejiang, China
| | - Harvey J. Cohen
- Department of Pediatrics, Stanford University, Stanford, California, United States of America
| | - Xuefeng B. Ling
- Department of Surgery, Stanford University, Stanford, California, United States of America
- * E-mail:
| | | |
Collapse
|
23
|
Vitamin B12 Uptake by the Gut Commensal Bacteria Bacteroides thetaiotaomicron Limits the Production of Shiga Toxin by Enterohemorrhagic Escherichia coli. Toxins (Basel) 2016; 8:toxins8010014. [PMID: 26742075 PMCID: PMC4728536 DOI: 10.3390/toxins8010014] [Citation(s) in RCA: 38] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Revised: 12/17/2015] [Accepted: 12/29/2015] [Indexed: 01/11/2023] Open
Abstract
Enterohemorrhagic Escherichia coli (EHEC) are foodborne pathogens responsible for the development of bloody diarrhea and renal failure in humans. Many environmental factors have been shown to regulate the production of Shiga toxin 2 (Stx2), the main virulence factor of EHEC. Among them, soluble factors produced by human gut microbiota and in particular, by the predominant species Bacteroides thetaiotaomicron (B. thetaiotaomicron), inhibit Stx2 gene expression. In this study, we investigated the molecular mechanisms underlying the B. thetaiotaomicron-dependent inhibition of Stx2 production by EHEC. We determined that Stx2-regulating molecules are resistant to heat treatment but do not correspond to propionate and acetate, two short-chain fatty acids produced by B. thetaiotaomicron. Moreover, screening of a B. thetaiotaomicron mutant library identified seven mutants that do not inhibit Stx2 synthesis by EHEC. One mutant has impaired production of BtuB, an outer membrane receptor for vitamin B12. Together with restoration of Stx2 level after vitamin B12 supplementation, these data highlight vitamin B12 as a molecule produced by gut microbiota that modulates production of a key virulence factor of EHEC and consequently may affect the outcome of an infection.
Collapse
|
24
|
Abstract
MOTIVATION Discerning genetic contributions to diseases not only enhances our understanding of disease mechanisms, but also leads to translational opportunities for drug discovery. Recent computational approaches incorporate disease phenotypic similarities to improve the prediction power of disease gene discovery. However, most current studies used only one data source of human disease phenotype. We present an innovative and generic strategy for combining multiple different data sources of human disease phenotype and predicting disease-associated genes from integrated phenotypic and genomic data. RESULTS To demonstrate our approach, we explored a new phenotype database from biomedical ontologies and constructed Disease Manifestation Network (DMN). We combined DMN with mimMiner, which was a widely used phenotype database in disease gene prediction studies. Our approach achieved significantly improved performance over a baseline method, which used only one phenotype data source. In the leave-one-out cross-validation and de novo gene prediction analysis, our approach achieved the area under the curves of 90.7% and 90.3%, which are significantly higher than 84.2% (P < e(-4)) and 81.3% (P < e(-12)) for the baseline approach. We further demonstrated that our predicted genes have the translational potential in drug discovery. We used Crohn's disease as an example and ranked the candidate drugs based on the rank of drug targets. Our gene prediction approach prioritized druggable genes that are likely to be associated with Crohn's disease pathogenesis, and our rank of candidate drugs successfully prioritized the Food and Drug Administration-approved drugs for Crohn's disease. We also found literature evidence to support a number of drugs among the top 200 candidates. In summary, we demonstrated that a novel strategy combining unique disease phenotype data with system approaches can lead to rapid drug discovery. AVAILABILITY AND IMPLEMENTATION nlp. CASE edu/public/data/DMN
Collapse
Affiliation(s)
- Yang Chen
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Li Li
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Guo-Qiang Zhang
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| | - Rong Xu
- Department of Electrical Engineering and Computer Science, Department of Epidemiology and Biostatistics and Department of Family Medicine and Community Health, Case Western Reserve University, Cleveland, OH 44106, USA
| |
Collapse
|
25
|
Tang S, Orsi RH, den Bakker HC, Wiedmann M, Boor KJ, Bergholz TM. Transcriptomic Analysis of the Adaptation of Listeria monocytogenes to Growth on Vacuum-Packed Cold Smoked Salmon. Appl Environ Microbiol 2015; 81:6812-24. [PMID: 26209664 PMCID: PMC4561693 DOI: 10.1128/aem.01752-15] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Accepted: 07/16/2015] [Indexed: 01/26/2023] Open
Abstract
The foodborne pathogen Listeria monocytogenes is able to survive and grow in ready-to-eat foods, in which it is likely to experience a number of environmental stresses due to refrigerated storage and the physicochemical properties of the food. Little is known about the specific molecular mechanisms underlying survival and growth of L. monocytogenes under different complex conditions on/in specific food matrices. Transcriptome sequencing (RNA-seq) was used to understand the transcriptional landscape of L. monocytogenes strain H7858 grown on cold smoked salmon (CSS; water phase salt, 4.65%; pH 6.1) relative to that in modified brain heart infusion broth (MBHIB; water phase salt, 4.65%; pH 6.1) at 7°C. Significant differential transcription of 149 genes was observed (false-discovery rate [FDR], <0.05; fold change, ≥2.5), and 88 and 61 genes were up- and downregulated, respectively, in H7858 grown on CSS relative to the genes in H7858 grown in MBHIB. In spite of these differences in transcriptomes under these two conditions, growth parameters for L. monocytogenes were not significantly different between CSS and MBHIB, indicating that the transcriptomic differences reflect how L. monocytogenes is able to facilitate growth under these different conditions. Differential expression analysis and Gene Ontology enrichment analysis indicated that genes encoding proteins involved in cobalamin biosynthesis as well as ethanolamine and 1,2-propanediol utilization have significantly higher transcript levels in H7858 grown on CSS than in that grown in MBHIB. Our data identify specific transcriptional profiles of L. monocytogenes growing on vacuum-packaged CSS, which may provide targets for the development of novel and improved strategies to control L. monocytogenes growth on this ready-to-eat food.
Collapse
Affiliation(s)
- Silin Tang
- Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York, USA
| | - Renato H Orsi
- Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York, USA
| | - Henk C den Bakker
- Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York, USA
| | - Martin Wiedmann
- Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York, USA
| | - Kathryn J Boor
- Department of Food Science, College of Agriculture and Life Sciences, Cornell University, Ithaca, New York, USA
| | - Teresa M Bergholz
- Department of Veterinary and Microbiological Sciences, North Dakota State University, Fargo, North Dakota, USA
| |
Collapse
|
26
|
Oellrich A, Collier N, Groza T, Rebholz-Schuhmann D, Shah N, Bodenreider O, Boland MR, Georgiev I, Liu H, Livingston K, Luna A, Mallon AM, Manda P, Robinson PN, Rustici G, Simon M, Wang L, Winnenburg R, Dumontier M. The digital revolution in phenotyping. Brief Bioinform 2015; 17:819-30. [PMID: 26420780 PMCID: PMC5036847 DOI: 10.1093/bib/bbv083] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Indexed: 12/22/2022] Open
Abstract
Phenotypes have gained increased notoriety in the clinical and biological domain owing to their application in numerous areas such as the discovery of disease genes and drug targets, phylogenetics and pharmacogenomics. Phenotypes, defined as observable characteristics of organisms, can be seen as one of the bridges that lead to a translation of experimental findings into clinical applications and thereby support 'bench to bedside' efforts. However, to build this translational bridge, a common and universal understanding of phenotypes is required that goes beyond domain-specific definitions. To achieve this ambitious goal, a digital revolution is ongoing that enables the encoding of data in computer-readable formats and the data storage in specialized repositories, ready for integration, enabling translational research. While phenome research is an ongoing endeavor, the true potential hidden in the currently available data still needs to be unlocked, offering exciting opportunities for the forthcoming years. Here, we provide insights into the state-of-the-art in digital phenotyping, by means of representing, acquiring and analyzing phenotype data. In addition, we provide visions of this field for future research work that could enable better applications of phenotype data.
Collapse
|
27
|
Cameron EA, Sperandio V. Frenemies: Signaling and Nutritional Integration in Pathogen-Microbiota-Host Interactions. Cell Host Microbe 2015; 18:275-84. [PMID: 26355214 PMCID: PMC4567707 DOI: 10.1016/j.chom.2015.08.007] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
The mammalian gastrointestinal (GI) microbiota is highly adapted to thrive in the GI environment and performs key functions related to host nutrition, physiology, development, immunity, and behavior. Successful host-bacterial associations require chemical signaling and optimal nutrient utilization and exchange. However, this important balance can be severely disrupted by environmental stimuli, with one of the most common insults upon the microbiota being infectious diseases. Although the microbiota acts as a barrier toward enteric pathogens, many enteric pathogens exploit signals and nutrients derived from both the microbiota and host to regulate their virulence programs. Here we review several signaling and nutrient recognition systems employed by GI pathogens to regulate growth and virulence. We discuss how shifts in the microbiota composition change host susceptibility to infection and how dietary changes or manipulation of the microbiota could potentially prevent and/or ameliorate GI infections.
Collapse
Affiliation(s)
- Elizabeth A Cameron
- Departments of Microbiology and Biochemistry, UT Southwestern Medical Center, Dallas, TX 75390-9048, USA
| | - Vanessa Sperandio
- Departments of Microbiology and Biochemistry, UT Southwestern Medical Center, Dallas, TX 75390-9048, USA.
| |
Collapse
|
28
|
Cornish AJ, Filippis I, David A, Sternberg MJE. Exploring the cellular basis of human disease through a large-scale mapping of deleterious genes to cell types. Genome Med 2015; 7:95. [PMID: 26330083 PMCID: PMC4557825 DOI: 10.1186/s13073-015-0212-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2015] [Accepted: 07/31/2015] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Each cell type found within the human body performs a diverse and unique set of functions, the disruption of which can lead to disease. However, there currently exists no systematic mapping between cell types and the diseases they can cause. METHODS In this study, we integrate protein-protein interaction data with high-quality cell-type-specific gene expression data from the FANTOM5 project to build the largest collection of cell-type-specific interactomes created to date. We develop a novel method, called gene set compactness (GSC), that contrasts the relative positions of disease-associated genes across 73 cell-type-specific interactomes to map genes associated with 196 diseases to the cell types they affect. We conduct text-mining of the PubMed database to produce an independent resource of disease-associated cell types, which we use to validate our method. RESULTS The GSC method successfully identifies known disease-cell-type associations, as well as highlighting associations that warrant further study. This includes mast cells and multiple sclerosis, a cell population currently being targeted in a multiple sclerosis phase 2 clinical trial. Furthermore, we build a cell-type-based diseasome using the cell types identified as manifesting each disease, offering insight into diseases linked through etiology. CONCLUSIONS The data set produced in this study represents the first large-scale mapping of diseases to the cell types in which they are manifested and will therefore be useful in the study of disease systems. Overall, we demonstrate that our approach links disease-associated genes to the phenotypes they produce, a key goal within systems medicine.
Collapse
Affiliation(s)
- Alex J Cornish
- Department of Life Sciences, Imperial College London, Exhibition Road, London, SW7 2AZ, UK.
| | - Ioannis Filippis
- Department of Life Sciences, Imperial College London, Exhibition Road, London, SW7 2AZ, UK.
| | - Alessia David
- Department of Life Sciences, Imperial College London, Exhibition Road, London, SW7 2AZ, UK.
| | - Michael J E Sternberg
- Department of Life Sciences, Imperial College London, Exhibition Road, London, SW7 2AZ, UK.
| |
Collapse
|
29
|
Ananiadou S, Thompson P, Nawaz R, McNaught J, Kell DB. Event-based text mining for biology and functional genomics. Brief Funct Genomics 2015; 14:213-30. [PMID: 24907365 PMCID: PMC4499874 DOI: 10.1093/bfgp/elu015] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The assessment of genome function requires a mapping between genome-derived entities and biochemical reactions, and the biomedical literature represents a rich source of information about reactions between biological components. However, the increasingly rapid growth in the volume of literature provides both a challenge and an opportunity for researchers to isolate information about reactions of interest in a timely and efficient manner. In response, recent text mining research in the biology domain has been largely focused on the identification and extraction of 'events', i.e. categorised, structured representations of relationships between biochemical entities, from the literature. Functional genomics analyses necessarily encompass events as so defined. Automatic event extraction systems facilitate the development of sophisticated semantic search applications, allowing researchers to formulate structured queries over extracted events, so as to specify the exact types of reactions to be retrieved. This article provides an overview of recent research into event extraction. We cover annotated corpora on which systems are trained, systems that achieve state-of-the-art performance and details of the community shared tasks that have been instrumental in increasing the quality, coverage and scalability of recent systems. Finally, several concrete applications of event extraction are covered, together with emerging directions of research.
Collapse
|
30
|
Thompson MC, Cascio D, Leibly DJ, Yeates TO. An allosteric model for control of pore opening by substrate binding in the EutL microcompartment shell protein. Protein Sci 2015; 24:956-75. [PMID: 25752492 DOI: 10.1002/pro.2672] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2014] [Revised: 02/23/2015] [Accepted: 03/03/2015] [Indexed: 01/01/2023]
Abstract
The ethanolamine utilization (Eut) microcompartment is a protein-based metabolic organelle that is strongly associated with pathogenesis in bacteria that inhabit the human gut. The exterior shell of this elaborate protein complex is composed from a few thousand copies of BMC-domain shell proteins, which form a semi-permeable diffusion barrier that provides the interior enzymes with substrates and cofactors while simultaneously retaining metabolic intermediates. The ability of this protein shell to regulate passage of substrate and cofactor molecules is critical for microcompartment function, but the details of how this diffusion barrier can allow the passage of large cofactors while still retaining small intermediates remain unclear. Previous work has revealed two conformations of the EutL shell protein, providing substantial evidence for a gated pore that might allow the passage of large cofactors. Here we report structural and biophysical evidence to show that ethanolamine, the substrate of the Eut microcompartment, acts as a negative allosteric regulator of EutL pore opening. Specifically, a series of X-ray crystal structures of EutL from Clostridium perfringens, along with equilibrium binding studies, reveal that ethanolamine binds to EutL at a site that exists in the closed-pore conformation and which is incompatible with opening of the large pore for cofactor transport. The allosteric mechanism we propose is consistent with the cofactor requirements of the Eut microcompartment, leading to a new model for EutL function. Furthermore, our results suggest the possibility of redox modulation of the allosteric mechanism, opening potentially new lines of investigation.
Collapse
Affiliation(s)
- Michael C Thompson
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California, 90095
| | - Duilio Cascio
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, California, 90095
| | - David J Leibly
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California, 90095
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, California, 90095.,UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, California, 90095
| |
Collapse
|
31
|
Inflammation-associated adherent-invasive Escherichia coli are enriched in pathways for use of propanediol and iron and M-cell translocation. Inflamm Bowel Dis 2014; 20:1919-32. [PMID: 25230163 DOI: 10.1097/mib.0000000000000183] [Citation(s) in RCA: 116] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
BACKGROUND Perturbations of the intestinal microbiome, termed dysbiosis, are linked to intestinal inflammation. Isolation of adherent-invasive Escherichia coli (AIEC) from intestines of patients with Crohn's disease (CD), dogs with granulomatous colitis, and mice with acute ileitis suggests these bacteria share pathoadaptive virulence factors that promote inflammation. METHODS To identify genes associated with AIEC, we sequenced the genomes of phylogenetically diverse AIEC strains isolated from people with CD (4), dogs with granulomatous colitis (2), and mice with ileitis (2) and 1 non-AIEC strain from CD ileum and compared them with 38 genome sequences of E. coli and Shigella. We then determined the prevalence of AIEC-associated genes in 49 E. coli strains from patients with CD and controls and correlated genotype with invasion of intestinal epithelial cells, persistence within macrophages, AIEC pathotype, and growth in standardized conditions. RESULTS Genes encoding propanediol utilization (pdu operon) and iron acquisition (yersiniabactin, chu operon) were overrepresented in AIEC relative to nonpathogenic E. coli. PduC (propanediol dehydratase) was enriched in CD-derived AIEC, correlated with increased cellular invasion, and persistence in vitro and was increasingly expressed in fucose-containing media. Growth of AIEC required iron, and the presence of chuA (heme acquisition) correlated with persistence in macrophages. CD-associated AIEC with lpfA 154 (long polar fimbriae) demonstrated increased invasion of epithelial cells and translocation across M cells. CONCLUSIONS Our findings provide novel insights into the genetic basis of the AIEC pathotype, supporting the concept that AIEC are equipped to exploit and promote intestinal inflammation and reveal potential targets for intervention against AIEC and inflammation-associated dysbiosis.
Collapse
|
32
|
Chen Y, Zhang X, Zhang GQ, Xu R. Comparative analysis of a novel disease phenotype network based on clinical manifestations. J Biomed Inform 2014; 53:113-20. [PMID: 25277758 DOI: 10.1016/j.jbi.2014.09.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2014] [Revised: 08/18/2014] [Accepted: 09/21/2014] [Indexed: 12/21/2022]
Abstract
Systems approaches to analyzing disease phenotype networks in combination with protein functional interaction networks have great potential in illuminating disease pathophysiological mechanisms. While many genetic networks are readily available, disease phenotype networks remain largely incomplete. In this study, we built a large-scale Disease Manifestation Network (DMN) from 50,543 highly accurate disease-manifestation semantic relationships in the United Medical Language System (UMLS). Our new phenotype network contains 2305 nodes and 373,527 weighted edges to represent the disease phenotypic similarities. We first compared DMN with the networks representing genetic relationships among diseases, and demonstrated that the phenotype clustering in DMN reflects common disease genetics. Then we compared DMN with a widely-used disease phenotype network in previous gene discovery studies, called mimMiner, which was extracted from the textual descriptions in Online Mendelian Inheritance in Man (OMIM). We demonstrated that DMN contains different knowledge from the existing phenotype data source. Finally, a case study on Marfan syndrome further proved that DMN contains useful information and can provide leads to discover unknown disease causes. Integrating DMN in systems approaches with mimMiner and other data offers the opportunities to predict novel disease genetics. We made DMN publicly available at nlp/case.edu/public/data/DMN.
Collapse
Affiliation(s)
- Yang Chen
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States; Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Xiang Zhang
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Guo-Qiang Zhang
- Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH 44106, United States; Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States
| | - Rong Xu
- Division of Medical Informatics, School of Medicine, Case Western Reserve University, Cleveland, OH 44106, United States.
| |
Collapse
|
33
|
Belizário JE. The humankind genome: from genetic diversity to the origin of human diseases. Genome 2014; 56:705-16. [PMID: 24433206 DOI: 10.1139/gen-2013-0125] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Genome-wide association studies have failed to establish common variant risk for the majority of common human diseases. The underlying reasons for this failure are explained by recent studies of resequencing and comparison of over 1200 human genomes and 10 000 exomes, together with the delineation of DNA methylation patterns (epigenome) and full characterization of coding and noncoding RNAs (transcriptome) being transcribed. These studies have provided the most comprehensive catalogues of functional elements and genetic variants that are now available for global integrative analysis and experimental validation in prospective cohort studies. With these datasets, researchers will have unparalleled opportunities for the alignment, mining, and testing of hypotheses for the roles of specific genetic variants, including copy number variations, single nucleotide polymorphisms, and indels as the cause of specific phenotypes and diseases. Through the use of next-generation sequencing technologies for genotyping and standardized ontological annotation to systematically analyze the effects of genomic variation on humans and model organism phenotypes, we will be able to find candidate genes and new clues for disease's etiology and treatment. This article describes essential concepts in genetics and genomic technologies as well as the emerging computational framework to comprehensively search websites and platforms available for the analysis and interpretation of genomic data.
Collapse
Affiliation(s)
- Jose E Belizário
- Departamento de Farmacologia, Instituto de Ciências Biomédicas da Universidade de São Paulo, Avenida Lineu Prestes, 1524 CEP 05508-900, São Paulo, SP, Brazil
| |
Collapse
|
34
|
Panda A, Ghosh TC. Prevalent structural disorder carries signature of prokaryotic adaptation to oxic atmosphere. Gene 2014; 548:134-41. [PMID: 24999584 DOI: 10.1016/j.gene.2014.07.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Revised: 06/27/2014] [Accepted: 07/03/2014] [Indexed: 12/12/2022]
Abstract
Microbes have adopted efficient mechanisms to contend with environmental changes. The emergence of oxygen was a major event that led to an abrupt change in Earth's atmosphere. To adjust with this shift in environmental condition ancient microbes must have undergone several modifications. Although some proteomic and genomic attributes were proposed to facilitate survival of microorganisms in the presence of oxygen, the process of adaptation still remains elusive. Recent studies have focused that intrinsically disordered proteins play crucial roles in adaptation to a wide range of ecological conditions. Therefore, it is likely that disordered proteins could also play indispensable roles in microbial adaptation to the aerobic environment. To test this hypothesis we measured the disorder content of 679 prokaryotes from four oxygen requirement groups. Our result revealed that aerobic proteomes are endowed with the highest protein disorder followed by facultative microbes. Minimal disorder was observed in anaerobic and microaerophilic microbes with no significant difference in their disorder content. Considering all the potential confounding factors that can modulate protein disorder, here we established that the high protein disorder in aerobic microbe is not a by-product of adaptation to any other selective pressure. On the functional level, we found that the high disorder in aerobic proteomes has been utilized for processes that are important for their aerobic lifestyle. Moreover, aerobic proteomes were found to be enriched with disordered binding sites and to contain transcription factors with high disorder propensity. Based on our results, here we proposed that the high protein disorder is an adaptive opportunity for aerobic microbes to fit with the genomic and functional complexities of the aerobic lifestyle.
Collapse
Affiliation(s)
- Arup Panda
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
| | - Tapash Chandra Ghosh
- Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India.
| |
Collapse
|
35
|
Staib L, Fuchs TM. From food to cell: nutrient exploitation strategies of enteropathogens. MICROBIOLOGY-SGM 2014; 160:1020-1039. [PMID: 24705229 DOI: 10.1099/mic.0.078105-0] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Upon entering the human gastrointestinal tract, foodborne bacterial enteropathogens encounter, among numerous other stress conditions, nutrient competition with the host organism and the commensal microbiota. The main carbon, nitrogen and energy sources exploited by pathogens during proliferation in, and colonization of, the gut have, however, not been identified completely. In recent years, a huge body of literature has provided evidence that most enteropathogens are equipped with a large set of specific metabolic pathways to overcome nutritional limitations in vivo, thus increasing bacterial fitness during infection. These adaptations include the degradation of myo-inositol, ethanolamine cleaved from phospholipids, fucose derived from mucosal glycoconjugates, 1,2-propanediol as the fermentation product of fucose or rhamnose and several other metabolites not accessible for commensal bacteria or present in competition-free microenvironments. Interestingly, the data reviewed here point to common metabolic strategies of enteric pathogens allowing the exploitation of nutrient sources that not only are present in the gut lumen, the mucosa or epithelial cells, but also are abundant in food. An increased knowledge of the metabolic strategies developed by enteropathogens is therefore a key factor to better control foodborne diseases.
Collapse
Affiliation(s)
- Lena Staib
- ZIEL, Abteilung Mikrobiologie, and Lehrstuhl für Mikrobielle Ökologie, Fakultät für Grundlagen der Biowissenschaften, Wissenschaftszentrum Weihenstephan, Technische Universität München, Weihenstephaner Berg 3, 85350 Freising, Germany
| | - Thilo M Fuchs
- ZIEL, Abteilung Mikrobiologie, and Lehrstuhl für Mikrobielle Ökologie, Fakultät für Grundlagen der Biowissenschaften, Wissenschaftszentrum Weihenstephan, Technische Universität München, Weihenstephaner Berg 3, 85350 Freising, Germany
| |
Collapse
|
36
|
Krisko A, Copic T, Gabaldón T, Lehner B, Supek F. Inferring gene function from evolutionary change in signatures of translation efficiency. Genome Biol 2014; 15:R44. [PMID: 24580753 PMCID: PMC4054840 DOI: 10.1186/gb-2014-15-3-r44] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Accepted: 03/03/2014] [Indexed: 11/13/2022] Open
Abstract
Background The genetic code is redundant, meaning that most amino acids can be encoded by more than one codon. Highly expressed genes tend to use optimal codons to increase the accuracy and speed of translation. Thus, codon usage biases provide a signature of the relative expression levels of genes, which can, uniquely, be quantified across the domains of life. Results Here we describe a general statistical framework to exploit this phenomenon and to systematically associate genes with environments and phenotypic traits through changes in codon adaptation. By inferring evolutionary signatures of translation efficiency in 911 bacterial and archaeal genomes while controlling for confounding effects of phylogeny and inter-correlated phenotypes, we linked 187 gene families to 24 diverse phenotypic traits. A series of experiments in Escherichia coli revealed that 13 of 15, 19 of 23, and 3 of 6 gene families with changes in codon adaptation in aerotolerant, thermophilic, or halophilic microbes. Respectively, confer specific resistance to, respectively, hydrogen peroxide, heat, and high salinity. Further, we demonstrate experimentally that changes in codon optimality alone are sufficient to enhance stress resistance. Finally, we present evidence that multiple genes with altered codon optimality in aerobes confer oxidative stress resistance by controlling the levels of iron and NAD(P)H. Conclusions Taken together, these results provide experimental evidence for a widespread connection between changes in translation efficiency and phenotypic adaptation. As the number of sequenced genomes increases, this novel genomic context method for linking genes to phenotypes based on sequence alone will become increasingly useful.
Collapse
|
37
|
Piedra D, Ferrer A, Gea J. Text mining and medicine: usefulness in respiratory diseases. Arch Bronconeumol 2014; 50:113-9. [PMID: 24507559 DOI: 10.1016/j.arbres.2013.04.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2013] [Revised: 04/12/2013] [Accepted: 04/18/2013] [Indexed: 12/24/2022]
Abstract
It is increasingly common to have medical information in electronic format. This includes scientific articles as well as clinical management reviews, and even records from health institutions with patient data. However, traditional instruments, both individual and institutional, are of little use for selecting the most appropriate information in each case, either in the clinical or research field. So-called text or data «mining» enables this huge amount of information to be managed, extracting it from various sources using processing systems (filtration and curation), integrating it and permitting the generation of new knowledge. This review aims to provide an overview of text and data mining, and of the potential usefulness of this bioinformatic technique in the exercise of care in respiratory medicine and in research in the same field.
Collapse
Affiliation(s)
- David Piedra
- Instituto de Investigación del Hospital del Mar (IMIM), Barcelona, España.
| | - Antoni Ferrer
- Instituto de Investigación del Hospital del Mar (IMIM), Barcelona, España; Servicio de Neumología, Hospital del Mar, Barcelona, España; Facultat de Ciències de la Salut i de la Vida, Universitat Pompeu Fabra, Barcelona, España; CIBERES, ISC III, Bunyola, Mallorca, España
| | - Joaquim Gea
- Instituto de Investigación del Hospital del Mar (IMIM), Barcelona, España; Servicio de Neumología, Hospital del Mar, Barcelona, España; Facultat de Ciències de la Salut i de la Vida, Universitat Pompeu Fabra, Barcelona, España; CIBERES, ISC III, Bunyola, Mallorca, España
| |
Collapse
|
38
|
Hoi SCH, Li Z, Wong L, Nguyen H, Li J. Coupling Graphs, Efficient Algorithms and B-Cell Epitope Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2014; 11:7-16. [PMID: 26355502 DOI: 10.1109/tcbb.2013.136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Coupling graphs are newly introduced in this paper to meet many application needs particularly in the field of bioinformatics. A coupling graph is a two-layer graph complex, in which each node from one layer of the graph complex has at least one connection with the nodes in the other layer, and vice versa. The coupling graph model is sufficiently powerful to capture strong and inherent associations between subgraph pairs in complicated applications. The focus of this paper is on mining algorithms of frequent coupling subgraphs and bioinformatics application. Although existing frequent subgraph mining algorithms are competent to identify frequent subgraphs from a graph database, they perform poorly on frequent coupling subgraph mining because they generate many irrelevant subgraphs. We propose a novel graph transformation technique to transform a coupling graph into a generic graph. Based on the transformed coupling graphs, existing graph mining methods are then utilized to discover frequent coupling subgraphs. We prove that the transformation is precise and complete and that the restoration is reversible. Experiments carried out on a database containing 10,511 coupling graphs show that our proposed algorithm reduces the mining time very much in comparison with the existing subgraph mining algorithms. Moreover, we demonstrate the usefulness of frequent coupling subgraphs by applying our algorithm to make accurate predictions of epitopes in antibody-antigen binding.
Collapse
|
39
|
Cummins J, Casey PG, Joyce SA, Gahan CGM. A mariner transposon-based signature-tagged mutagenesis system for the analysis of oral infection by Listeria monocytogenes. PLoS One 2013; 8:e75437. [PMID: 24069416 PMCID: PMC3771922 DOI: 10.1371/journal.pone.0075437] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2013] [Accepted: 08/14/2013] [Indexed: 11/18/2022] Open
Abstract
Listeria monocytogenes is a Gram-positive foodborne pathogen and the causative agent of listerosis a disease that manifests predominately as meningitis in the non-pregnant individual or infection of the fetus and spontaneous abortion in pregnant women. Common-source outbreaks of foodborne listeriosis are associated with significant morbidity and mortality. However, relatively little is known concerning the mechanisms that govern infection via the oral route. In order to aid functional genetic analysis of the gastrointestinal phase of infection we designed a novel signature-tagged mutagenesis (STM) system based upon the invasive L. monocytogenes 4b serotype H7858 strain. To overcome the limitations of gastrointestinal infection by L. monocytogenes in the mouse model we created a H7858 strain that is genetically optimised for oral infection in mice. Furthermore our STM system was based upon a mariner transposon to favour numerous and random transposition events throughout the L. monocytogenes genome. Use of the STM bank to investigate oral infection by L. monocytogenes identified 21 insertion mutants that demonstrated significantly reduced potential for infection in our model. The sites of transposon insertion included lmOh7858_0671 (encoding an internalin homologous to Lmo0610), lmOh7858_0898 (encoding a putative surface-expressed LPXTG protein homologous to Lmo0842), lmOh7858_2579 (encoding the HupDGC hemin transport system) and lmOh7858_0399 (encoding a putative fructose specific phosphotransferase system). We propose that this represents an optimised STM system for functional genetic analysis of foodborne/oral infection by L. monocytogenes.
Collapse
Affiliation(s)
- Joanne Cummins
- Department of Microbiology, University College Cork, Cork, Ireland
| | - Pat G. Casey
- Department of Microbiology, University College Cork, Cork, Ireland
- Alimentary Pharmabiotic Centre, University College Cork, Cork, Ireland
| | - Susan A. Joyce
- Department of Microbiology, University College Cork, Cork, Ireland
- Alimentary Pharmabiotic Centre, University College Cork, Cork, Ireland
| | - Cormac G. M. Gahan
- Department of Microbiology, University College Cork, Cork, Ireland
- Alimentary Pharmabiotic Centre, University College Cork, Cork, Ireland
- School of Pharmacy, University College Cork, Cork, Ireland
- * E-mail:
| |
Collapse
|
40
|
EutR is a direct regulator of genes that contribute to metabolism and virulence in enterohemorrhagic Escherichia coli O157:H7. J Bacteriol 2013; 195:4947-53. [PMID: 23995630 DOI: 10.1128/jb.00937-13] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Ethanolamine (EA) metabolism is a trait associated with enteric pathogens, including enterohemorrhagic Escherichia coli O157:H7 (EHEC). EHEC causes severe bloody diarrhea and hemolytic uremic syndrome. EHEC encodes the ethanolamine utilization (eut) operon that allows EHEC to metabolize EA and gain a competitive advantage when colonizing the gastrointestinal tract. The eut operon encodes the transcriptional regulator EutR. Genetic studies indicated that EutR expression is induced by EA and vitamin B12 and that EutR promotes expression of the eut operon; however, biochemical evidence for these interactions has been lacking. We performed EA-binding assays and electrophoretic mobility shift assays (EMSAs) to elucidate a mechanism for EutR gene regulation. These studies confirmed EutR interaction with EA, as well as direct binding to the eutS promoter. EutR also contributes to expression of the locus of enterocyte effacement (LEE) in an EA-dependent manner. We performed EMSAs to examine EutR activation of the LEE. The results demonstrated that EutR directly binds the regulatory region of the ler promoter. These results present the first mechanistic description of EutR gene regulation and reveal a novel role for EutR in EHEC pathogenesis.
Collapse
|
41
|
Production of bulk chemicals via novel metabolic pathways in microorganisms. Biotechnol Adv 2012; 31:925-35. [PMID: 23280013 DOI: 10.1016/j.biotechadv.2012.12.008] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2012] [Revised: 12/09/2012] [Accepted: 12/23/2012] [Indexed: 02/05/2023]
Abstract
Metabolic engineering has been playing important roles in developing high performance microorganisms capable of producing various chemicals and materials from renewable biomass in a sustainable manner. Synthetic and systems biology are also contributing significantly to the creation of novel pathways and the whole cell-wide optimization of metabolic performance, respectively. In order to expand the spectrum of chemicals that can be produced biotechnologically, it is necessary to broaden the metabolic capacities of microorganisms. Expanding the metabolic pathways for biosynthesizing the target chemicals requires not only the enumeration of a series of known enzymes, but also the identification of biochemical gaps whose corresponding enzymes might not actually exist in nature; this issue is the focus of this paper. First, pathway prediction tools, effectively combining reactions that lead to the production of a target chemical, are analyzed in terms of logics representing chemical information, and designing and ranking the proposed metabolic pathways. Then, several approaches for potentially filling in the gaps of the novel metabolic pathway are suggested along with relevant examples, including the use of promiscuous enzymes that flexibly utilize different substrates, design of novel enzymes for non-natural reactions, and exploration of hypothetical proteins. Finally, strain optimization by systems metabolic engineering in the context of novel metabolic pathways constructed is briefly described. It is hoped that this review paper will provide logical ways of efficiently utilizing 'big' biological data to design and develop novel metabolic pathways for the production of various bulk chemicals that are currently produced from fossil resources.
Collapse
|
42
|
Abstract
Ethanolamine (EA) is a compound prevalent in the gastrointestinal (GI) environment. The ability to catabolize this compound has been associated with important GI pathogens, including enterohemorrhagic Escherichia coli O157:H7 (EHEC). It has been hypothesized that the ability of EHEC to utilize EA as a source of nitrogen provides EHEC with an important competitive advantage in the gut. However, new work by Kendall et al. (mBio 3:e00050-12, 2012) suggests that the role of EA in EHEC pathogenesis may be more fundamental; EA appears to be a signal for EHEC to commence its virulence program. In this commentary, I review the previously described connections of EA to bacterial pathogenesis in the GI tract, highlight the important findings of this new study, and note areas where further research is needed to fully comprehend the connection of EA utilization to bacterial pathogenesis.
Collapse
|
43
|
Ratkovic Z, Golik W, Warnier P. Event extraction of bacteria biotopes: a knowledge-intensive NLP-based approach. BMC Bioinformatics 2012; 13 Suppl 11:S8. [PMID: 22759462 PMCID: PMC3384252 DOI: 10.1186/1471-2105-13-s11-s8] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
Background Bacteria biotopes cover a wide range of diverse habitats including animal and plant hosts, natural, medical and industrial environments. The high volume of publications in the microbiology domain provides a rich source of up-to-date information on bacteria biotopes. This information, as found in scientific articles, is expressed in natural language and is rarely available in a structured format, such as a database. This information is of great importance for fundamental research and microbiology applications (e.g., medicine, agronomy, food, bioenergy). The automatic extraction of this information from texts will provide a great benefit to the field. Methods We present a new method for extracting relationships between bacteria and their locations using the Alvis framework. Recognition of bacteria and their locations was achieved using a pattern-based approach and domain lexical resources. For the detection of environment locations, we propose a new approach that combines lexical information and the syntactic-semantic analysis of corpus terms to overcome the incompleteness of lexical resources. Bacteria location relations extend over sentence borders, and we developed domain-specific rules for dealing with bacteria anaphors. Results We participated in the BioNLP 2011 Bacteria Biotope (BB) task with the Alvis system. Official evaluation results show that it achieves the best performance of participating systems. New developments since then have increased the F-score by 4.1 points. Conclusions We have shown that the combination of semantic analysis and domain-adapted resources is both effective and efficient for event information extraction in the bacteria biotope domain. We plan to adapt the method to deal with a larger set of location types and a large-scale scientific article corpus to enable microbiologists to integrate and use the extracted knowledge in combination with experimental data.
Collapse
Affiliation(s)
- Zorana Ratkovic
- MIG INRA UR1077 Domaine de Vilvert, F-78352 Jouy-en-Josas, France.
| | | | | |
Collapse
|
44
|
Nakjang S, Ndeh DA, Wipat A, Bolam DN, Hirt RP. A novel extracellular metallopeptidase domain shared by animal host-associated mutualistic and pathogenic microbes. PLoS One 2012; 7:e30287. [PMID: 22299034 PMCID: PMC3267712 DOI: 10.1371/journal.pone.0030287] [Citation(s) in RCA: 76] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2011] [Accepted: 12/16/2011] [Indexed: 12/20/2022] Open
Abstract
The mucosal microbiota is recognised as an important factor for our health, with many disease states linked to imbalances in the normal community structure. Hence, there is considerable interest in identifying the molecular basis of human-microbe interactions. In this work we investigated the capacity of microbes to thrive on mucosal surfaces, either as mutualists, commensals or pathogens, using comparative genomics to identify co-occurring molecular traits. We identified a novel domain we named M60-like/PF13402 (new Pfam entry PF13402), which was detected mainly among proteins from animal host mucosa-associated prokaryotic and eukaryotic microbes ranging from mutualists to pathogens. Lateral gene transfers between distantly related microbes explained their shared M60-like/PF13402 domain. The novel domain is characterised by a zinc-metallopeptidase-like motif and is distantly related to known viral enhancin zinc-metallopeptidases. Signal peptides and/or cell surface anchoring features were detected in most microbial M60-like/PF13402 domain-containing proteins, indicating that these proteins target an extracellular substrate. A significant subset of these putative peptidases was further characterised by the presence of associated domains belonging to carbohydrate-binding module family 5/12, 32 and 51 and other glycan-binding domains, suggesting that these novel proteases are targeted to complex glycoproteins such as mucins. An in vitro mucinase assay demonstrated degradation of mammalian mucins by a recombinant form of an M60-like/PF13402-containing protein from the gut mutualist Bacteroides thetaiotaomicron. This study reveals that M60-like domains are peptidases targeting host glycoproteins. These peptidases likely play an important role in successful colonisation of both vertebrate mucosal surfaces and the invertebrate digestive tract by both mutualistic and pathogenic microbes. Moreover, 141 entries across various peptidase families described in the MEROPS database were also identified with carbohydrate-binding modules defining a new functional context for these glycan-binding domains and providing opportunities to engineer proteases targeting specific glycoproteins for both biomedical and industrial applications.
Collapse
Affiliation(s)
- Sirintra Nakjang
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Didier A. Ndeh
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Anil Wipat
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- School of Computing Science, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - David N. Bolam
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Robert P. Hirt
- Institute for Cell and Molecular Biosciences, Newcastle University, Newcastle upon Tyne, United Kingdom
- * E-mail:
| |
Collapse
|
45
|
Bonanata JN, Signorelli S, Coitiño EL. Increasing complexity models for describing the generation of substrate radicals at the active site of ethanolamine ammonia-lyase/B12. COMPUT THEOR CHEM 2011. [DOI: 10.1016/j.comptc.2011.07.029] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
46
|
Andronis C, Sharma A, Virvilis V, Deftereos S, Persidis A. Literature mining, ontologies and information visualization for drug repurposing. Brief Bioinform 2011; 12:357-68. [PMID: 21712342 DOI: 10.1093/bib/bbr005] [Citation(s) in RCA: 163] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The immense growth of MEDLINE coupled with the realization that a vast amount of biomedical knowledge is recorded in free-text format, has led to the appearance of a large number of literature mining techniques aiming to extract biomedical terms and their inter-relations from the scientific literature. Ontologies have been extensively utilized in the biomedical domain either as controlled vocabularies or to provide the framework for mapping relations between concepts in biology and medicine. Literature-based approaches and ontologies have been used in the past for the purpose of hypothesis generation in connection with drug discovery. Here, we review the application of literature mining and ontology modeling and traversal to the area of drug repurposing (DR). In recent years, DR has emerged as a noteworthy alternative to the traditional drug development process, in response to the decreased productivity of the biopharmaceutical industry. Thus, systematic approaches to DR have been developed, involving a variety of in silico, genomic and high-throughput screening technologies. Attempts to integrate literature mining with other types of data arising from the use of these technologies as well as visualization tools assisting in the discovery of novel associations between existing drugs and new indications will also be presented.
Collapse
|
47
|
Bell L, Chowdhary R, Liu JS, Niu X, Zhang J. Integrated bio-entity network: a system for biological knowledge discovery. PLoS One 2011; 6:e21474. [PMID: 21738677 PMCID: PMC3124513 DOI: 10.1371/journal.pone.0021474] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Accepted: 06/01/2011] [Indexed: 01/26/2023] Open
Abstract
A significant part of our biological knowledge is centered on relationships between biological entities (bio-entities) such as proteins, genes, small molecules, pathways, gene ontology (GO) terms and diseases. Accumulated at an increasing speed, the information on bio-entity relationships is archived in different forms at scattered places. Most of such information is buried in scientific literature as unstructured text. Organizing heterogeneous information in a structured form not only facilitates study of biological systems using integrative approaches, but also allows discovery of new knowledge in an automatic and systematic way. In this study, we performed a large scale integration of bio-entity relationship information from both databases containing manually annotated, structured information and automatic information extraction of unstructured text in scientific literature. The relationship information we integrated in this study includes protein–protein interactions, protein/gene regulations, protein–small molecule interactions, protein–GO relationships, protein–pathway relationships, and pathway–disease relationships. The relationship information is organized in a graph data structure, named integrated bio-entity network (IBN), where the vertices are the bio-entities and edges represent their relationships. Under this framework, graph theoretic algorithms can be designed to perform various knowledge discovery tasks. We designed breadth-first search with pruning (BFSP) and most probable path (MPP) algorithms to automatically generate hypotheses—the indirect relationships with high probabilities in the network. We show that IBN can be used to generate plausible hypotheses, which not only help to better understand the complex interactions in biological systems, but also provide guidance for experimental designs.
Collapse
Affiliation(s)
- Lindsey Bell
- Department of Statistics, Florida State University, Tallahassee, Florida, United States of America
| | | | | | | | | |
Collapse
|
48
|
PREDICT: a method for inferring novel drug indications with application to personalized medicine. Mol Syst Biol 2011; 7:496. [PMID: 21654673 PMCID: PMC3159979 DOI: 10.1038/msb.2011.26] [Citation(s) in RCA: 457] [Impact Index Per Article: 35.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2011] [Accepted: 04/12/2011] [Indexed: 12/24/2022] Open
Abstract
The authors present a new method, PREDICT, for the large-scale prediction of drug indications, and demonstrate its use on both approved drugs and novel molecules. They also provide a proof-of-concept for its potential utility in predicting patient-specific medications. We present a novel method for the large-scale prediction of drug indications that can handle both approved drugs and novel molecules. Our method utilizes multiple drug–drug and disease–disease similarity measures for the prediction task, obtaining high specificity and sensitivity rates (AUC=0.9). Our drug repositioning predictions cover 27% of the indications currently tested on clinical trials (P<2 × 10−220). We show comparable performance using a gene expression signature-based disease–disease similarity, laying the computational foundation for predicting patient-specific indications.
Predicting indications for new molecules or finding alternative indications for approved drugs is a laborious and costly process (DiMasi et al, 2003), calling for computational solutions that would minimize production time and development costs (Terstappen and Reggiani, 2001). Here, we present a novel method for predicting drug indications, PREDICT, capable of handling both approved drugs and novel molecules. Our method is based on the assumption that similar drugs are indicated for similar diseases. To score a possible drug–disease association, we compute its similarity to known associations by combining drug–drug and disease–disease similarity computations. This strategy achieves high specificity and sensitivity rates in a cross-validation setting, where part of the known associations are hidden and the method is assessed based on how well it can retrieve them based on the rest of the associations. Assessing its predictions of novel indications for existing drugs, we find that it covers a significant portion (27%, P<2 × 10−220) of drug indications currently tested on clinical trials. Examples of such predictions include: (i) Cabergoline, indicated for Hyperprolactinemia, which is predicted to treat Migrane, a prediction supported by two separate studies (Verhelst et al, 1999; Cavestro et al, 2006) and (ii) Progesterone, which is predicted to treat renal cell cancer, non-papillary (npRCC), supported by the study of Izumi et al (2007). In addition, we provide indication predictions for novel molecules. For example, Cycloleucine is predicted for the treatment of Alzheimer's disease (AD); indeed, Cycloleucine was found to be a potent and selective antagonist of NMDA receptor-mediated responses (Hershkowitz and Rogawski, 1989), a new promising class of chemicals for the treatment of AD (Farlow, 2004). As another example, Hyperforin, St John's wort extract, is predicted to treat hyperthermia. Interestingly, St John's wort extract was found to have anxiolytic effects on stress-induced hyperthermia in mice (Grundmann et al, 2006). We further introduce a disease–disease similarity measure based on disease-specific gene signatures and show that such a measure can be used by our method to accurately predict drug indications. Importantly, this suggests the potential utility of our approach also in a personalized medicine setting, whereby future gene expression signatures from individual patients would replace these disease-specific signatures. Inferring potential drug indications, for either novel or approved drugs, is a key step in drug development. Previous computational methods in this domain have focused on either drug repositioning or matching drug and disease gene expression profiles. Here, we present a novel method for the large-scale prediction of drug indications (PREDICT) that can handle both approved drugs and novel molecules. Our method is based on the observation that similar drugs are indicated for similar diseases, and utilizes multiple drug–drug and disease–disease similarity measures for the prediction task. On cross-validation, it obtains high specificity and sensitivity (AUC=0.9) in predicting drug indications, surpassing existing methods. We validate our predictions by their overlap with drug indications that are currently under clinical trials, and by their agreement with tissue-specific expression information on the drug targets. We further show that disease-specific genetic signatures can be used to accurately predict drug indications for new diseases (AUC=0.92). This lays the computational foundation for future personalized drug treatments, where gene expression signatures from individual patients would replace the disease-specific signatures.
Collapse
|
49
|
Singh AK, Ulanov AV, Li Z, Jayaswal RK, Wilkinson BJ. Metabolomes of the psychrotolerant bacterium Listeria monocytogenes 10403S grown at 37 °C and 8 °C. Int J Food Microbiol 2011; 148:107-14. [PMID: 21645939 DOI: 10.1016/j.ijfoodmicro.2011.05.008] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2010] [Revised: 04/01/2011] [Accepted: 05/10/2011] [Indexed: 11/19/2022]
Abstract
Listeria monocytogenes is a food-borne pathogen with the ability to grow at refrigeration temperatures. Knowledge of the mechanisms involved in low temperature growth is incomplete and here we report the results of a metabolomics investigation of this. The small molecule contents of L. monocytogenes 10403S grown at 37 °C and 8 °C were compared by gas chromatography/mass spectrometry (GC/MS). Over 500 peaks were detected in both 37 °C and 8 °C-grown cells, and 103 were identified. Of the identified metabolites, the concentrations of 56 metabolites were increased (P<0.05), while the concentrations of 8 metabolites were decreased at low temperature. Metabolites increasing in concentration at 8 °C included amino acids, sugars, organic acids, urea cycle intermediates, polyamines, and different compatible solutes. A principal component analysis (PCA) was used to visualize and compare the matrix containing the data in 6 samples, and this clearly identified the 37 °C and 8 °C metabolomes as different. The results indicated that an increase in solute concentrations in the cytoplasm was associated with low temperature adaptation, which may be a response to chill stress with the effect of lowering the freezing point of intracellular water and decreasing ice crystal formation.
Collapse
Affiliation(s)
- Atul K Singh
- Microbiology Group, School of Biological Sciences, Illinois State University, Normal, Illinois 61790-4120, USA
| | | | | | | | | |
Collapse
|
50
|
Transcription antitermination by a phosphorylated response regulator and cobalamin-dependent termination at a B₁₂ riboswitch contribute to ethanolamine utilization in Enterococcus faecalis. J Bacteriol 2011; 193:2575-86. [PMID: 21441515 DOI: 10.1128/jb.00217-11] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Abstract
The ability of bacteria to utilize ethanolamine (EA) as a carbon and nitrogen source may confer an advantage for survival, colonization, and pathogenicity in the human intestinal tract. Enterococcus faecalis, a Gram-positive human commensal organism, depends on a two-component signaling system (TCS-17) for sensing EA and regulating the expression of the ethanolamine utilization genes. Multiple promoters participate in eut gene expression in the presence of EA as the sole carbon source and cobalamin (CoB12), an essential cofactor in the enzymatic degradation process. By means of in vivo and in vitro approaches, this study characterized the transcriptional activity identified in the eutT-eutG intergenic region of the E. faecalis eut cluster. Two novel promoters in this region were shown to be active in vivo. The distal P2-1 promoter was associated with a B12 riboswitch that terminated transcription in the presence of CoB12. Transcription elongation from the proximal P2-2 promoter was regulated by antitermination mediated by the phosphorylated form of the response regulator of TCS-17 (RR17). 3'-Rapid amplification of cDNA ends (RACE) analyses of the terminated RNA products allowed precise identification of the hairpin loop structures involved in termination/antitermination. The results uncovered the role of the B12 riboswitch and RR17 in eut gene expression, adding to the complexity of this regulatory pathway and extending the knowledge of possible means of transcription regulation in Gram-positive organisms.
Collapse
|